|
Project Information
|
In many applications like telecoms etc, a huge amount of data is being generated in files, Many applications store this data in DB. Many datawarehouse applicatoins & Analytics based applications process this data and generate some aggregated or derived data. However an unseen challenge here is to maintain the source data in DB, which makes a database usually un-manageable. hadoop-etl project creates a framework to interface the unstructured data to aggregated data |