Changing The Output File Prefix Of Hadoop MapReduce Job

Your Hadoop job can have multiple reducers, and by default each reducer creates an output file with the prefix part-r-xxxxx. The first reducer creates part-r-00000 and the second reducer creates part-r-00001. What if you don't like the default prefix "part" [...]

By | July 10th, 2016|Hadoop|0 Comments
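
The default naming described in the excerpt above can be sketched in plain Java. This is a minimal illustration, not Hadoop's own code: the class `PartFileNames` and the method `reducerFileName` are hypothetical names, and the sketch assumes only the layout the excerpt describes (a prefix, then `-r-`, then a five-digit, zero-padded reducer index).

```java
import java.util.Locale;

public class PartFileNames {
    // Hypothetical helper (not a Hadoop API): builds a reducer output
    // file name as <prefix>-r-<five-digit reducer index>.
    static String reducerFileName(String prefix, int reducerIndex) {
        // Locale.ROOT keeps the digits ASCII regardless of the system locale.
        return String.format(Locale.ROOT, "%s-r-%05d", prefix, reducerIndex);
    }

    public static void main(String[] args) {
        System.out.println(reducerFileName("part", 0));  // first reducer, default prefix
        System.out.println(reducerFileName("part", 1));  // second reducer, default prefix
        System.out.println(reducerFileName("sales", 0)); // "sales" is a made-up custom prefix
    }
}
```

In a real Hadoop 2.x job, the prefix is usually changed through the `mapreduce.output.basename` job property. That may or may not be the approach the full post takes, so check it against the documentation for your Hadoop version.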

Missing Artifact JDK Tools Jar

Some versions of Maven / Eclipse will give you the error below in your pom.xml: [code listing not preserved] The problem is very easy to solve. Simply add the dependency below to your pom.xml: [code listing not preserved] You might still see an error even after adding the above dependency to your pom.xml. If you [...]

By | June 23rd, 2016|Troubleshooting|0 Comments

Hadoop Mapper and Reducer Output Type Mismatch

Can the Mapper and the Reducer in a MapReduce program have different output key-value pair types? Short answer: absolutely yes. Below are the signatures of a Mapper and a Reducer from the same MapReduce program, and both are valid. [code listing not preserved] We know the above is valid. Yet, [...]

By | June 22nd, 2016|Hadoop|0 Comments
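
The original signatures are not preserved in the excerpt above, so the following is only a sketch of the idea in plain Java. `SketchMapper` and `SketchReducer` are hypothetical stand-ins, not Hadoop's `Mapper` and `Reducer` classes. The sketch assumes the usual constraint that the mapper's output types must match the reducer's input types, while the reducer's output types are free to differ.

```java
import java.util.AbstractMap.SimpleEntry;
import java.util.Arrays;
import java.util.List;
import java.util.Map;

public class OutputTypesSketch {
    // Hypothetical stand-ins for illustration only; NOT the Hadoop API.
    interface SketchMapper<KIN, VIN, KOUT, VOUT> {
        Map.Entry<KOUT, VOUT> map(KIN key, VIN value);
    }

    interface SketchReducer<KIN, VIN, KOUT, VOUT> {
        Map.Entry<KOUT, VOUT> reduce(KIN key, List<VIN> values);
    }

    // Mapper output types: (String, Integer), a word and its length.
    static final SketchMapper<Long, String, String, Integer> WORD_LENGTH =
        (offset, word) -> new SimpleEntry<>(word, word.length());

    // Reducer input types match the mapper output (String, Integer),
    // but the reducer output types are (String, Double): an average.
    static final SketchReducer<String, Integer, String, Double> AVERAGE =
        (key, values) -> {
            double sum = 0;
            for (int v : values) {
                sum += v;
            }
            return new SimpleEntry<>(key, sum / values.size());
        };

    public static void main(String[] args) {
        Map.Entry<String, Integer> mapped = WORD_LENGTH.map(0L, "pig");
        Map.Entry<String, Double> reduced =
            AVERAGE.reduce(mapped.getKey(), Arrays.asList(mapped.getValue(), 5));
        System.out.println(reduced.getKey() + " " + reduced.getValue());
    }
}
```

In an actual Hadoop job, when the map output types differ from the final output types, the driver normally declares them with `Job.setMapOutputKeyClass` and `Job.setMapOutputValueClass`. Whether the full post goes on to cover that is not recoverable from the excerpt.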

Apache Pig Tutorial – Map

The goal of this tutorial is to learn Apache Pig concepts at a fast pace, so don’t expect lengthy posts. All posts will be short and sweet. Most posts will have a (very short) “see it in action” video. In the previous post, we saw two complex types: Tuple and Bag. [...]

By | December 31st, 2015|Apache Pig, Hadoop|0 Comments

Apache Pig Tutorial – Tuple & Bag

The goal of this tutorial is to learn Apache Pig concepts at a fast pace, so don’t expect lengthy posts. All posts will be short and sweet. Most posts will have a (very short) “see it in action” video. So far we have been using simple datatypes in Pig like [...]

By | December 31st, 2015|Apache Pig, Hadoop|0 Comments

Apache Pig Tutorial – Executing Script with Parameters

The goal of this tutorial is to learn Apache Pig concepts at a fast pace, so don’t expect lengthy posts. All posts will be short and sweet. Most posts will have a (very short) “see it in action” video. In the previous post, we saw how to run Pig Latin [...]

By | December 20th, 2015|Apache Pig, Hadoop|0 Comments

Apache Pig Tutorial – Executing as a Script

The goal of this tutorial is to learn Apache Pig concepts at a fast pace, so don’t expect lengthy posts. All posts will be short and sweet. Most posts will have a (very short) “see it in action” video. So far, in a series of lessons, we saw step [...]

By | December 20th, 2015|Apache Pig, Hadoop|0 Comments

Apache Pig Tutorial – Ordering Records

The goal of this tutorial is to learn Apache Pig concepts at a fast pace, so don’t expect lengthy posts. All posts will be short and sweet. Most posts will have a (very short) “see it in action” video. In the previous post we looked at how to group records and [...]

By | December 20th, 2015|Hadoop|0 Comments

Apache Pig Tutorial – Grouping Records

The goal of this tutorial is to learn Apache Pig concepts at a fast pace, so don’t expect lengthy posts. All posts will be short and sweet. Most posts will have a (very short) “see it in action” video. We have been learning a lot of concepts in Apache Pig (look [...]

By | December 19th, 2015|Apache Pig, Hadoop|0 Comments

Apache Pig Tutorial – Filter Records

The goal of this tutorial is to learn Apache Pig concepts at a fast pace, so don’t expect lengthy posts. All posts will be short and sweet. Most posts will have a (very short) “see it in action” video. In our previous posts we saw different variations of loading a dataset in [...]

By | December 18th, 2015|Apache Pig, Hadoop|0 Comments