Showing posts with label Hive. Show all posts
Showing posts with label Hive. Show all posts

Wednesday, 29 May 2013

How is Apache Hadoop used Big Data Analytics and Inteligence


Big Data Analytics - Usage of Apache Hadoop / What is the use of Apache Hadoop in an Analytical Project

There are various ways Apache Hadoop is used in a Big Data Project.  +Hortonworks latest report
  • Data Refinery Pattern
  • Data Exploratory Pattern
  • Application Enrichment Pattern
In Data Refinery - Hadoop is used for Cleansing up the data and sending the output (probably as aggregation / refinement) to the Enterprise Data Warehouse which might be +Oracle +SQLServer etc

In Data Exploration pattern - data is analysed in Hadoop environment, using Hive etc. and the results are shared with the Applications. There are many Big Data Visualization Tools that provide good reports. There are some Open Source tools like Pentaho Business Intelligence Studio that offers Big Data Visualization and Reports

In Application Enrichment Pattern the entire data is stored in Apache Hadoop. For example, the Web session is stored in Hadoop and appropriate actions are taken based on the user's web navigation .


These are some broad classification of the use of +Apache Hadoop
Related Posts Plugin for WordPress, Blogger...