Implement Hadoop jobs to extract business value from large and varied data sets
Write, customise and deploy Java MapReduce jobs to summarise data
Develop Hive and Pig queries to simplify data analysis
Test and debug jobs using MRUnit
Monitor task execution and cluster health
Developing efficient parallel algorithms
Analysing unstructured files and developing Java MapReduce jobs
Loading and retrieving data from HDFS and HBase
Extending Hive and Pig with User Defined Functions (UDFs)
Analysing data with Spark RDDs

There is currently no content classified with this term.

Back to Top