Sabtu, 21 September 2013

BigData: Apache Flume, HDFS and HBase

In this post, I will show how to log very large amount of web requests to a BigData storage for traffic analysis. The source code for the project is on github. We will rely on the logging library log4j and the associated Flume NG appender implementation. For storage, we will place the log information into a set of HDFS files or into an HBase table. The HDFS files...

Selasa, 17 September 2013

Apache HBase Certification

I am now a Cloudera Certified Specialist in Apache HBase. Woohoo ...

Jumat, 06 September 2013

BigData GeoEnrichment

What is GeoEnrichment? An example would best describe it. Given a big set of customer location records, I would like each location to be GeoEnriched with the average income of the zip code where that location falls into and with the number of people between the age of 25 and 30 that live in that zip code.Before GeoEnrichment:CustId,Lat,LonAfter GeoEnrichment:CustId,Lat,Lon,AverageIncome,Age25To30Of...
 

Virush-SGB Copyright © 2012 Fast Loading -- Powered by Blogger