+91 90691 39140 | +1 253 214 3115 | info@hub4tech.com | hub4tech

Apache Hadoop

Table of Contents

Topic                         Duration
Apache Pig                    ½ day
Apache Hive + Apache Impala   ½ day
HBase                         ½ day
MongoDB                       ½ day
Python                        ½ day
Spark                         2½ days

Pig Overview

  • Created at Yahoo Research Labs
  • Built to avoid low-level MapReduce programming
  • Committers: Yahoo, Hortonworks, LinkedIn, Netflix, IBM, Twitter

Why Pig?

Pig Operators

Command   Use
LOAD      Makes the data accessible to Pig
DUMP      Displays the results on screen
STORE     Saves the results to a file
FOREACH   Processes one record at a time (map)
FILTER    Selects the records that meet a condition
GROUP     Groups records by a key
COUNT     Counts the records for each key
JOIN      Joins two datasets on a key
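
To make the operators above more concrete, here is a minimal sketch in plain Python (not Pig Latin) that imitates what each one does to a small in-memory dataset. The sample records, field layout, and variable names are hypothetical and chosen only for illustration. Real Pig scripts run these steps over files in Hadoop rather than over Python lists.

```python
from collections import defaultdict

# Hypothetical sample data standing in for the files that LOAD would read.
# Each employee record is (name, department key, salary).
employees = [
    ("alice", "eng", 120),
    ("bob", "eng", 90),
    ("carol", "ops", 70),
    ("dave", "ops", 110),
]
# Each department record is (department key, full name).
departments = [("eng", "Engineering"), ("ops", "Operations")]

# FOREACH: transform one record at a time (a map step).
names_upper = [(name.upper(), dept, salary) for name, dept, salary in employees]

# FILTER: keep only the records that meet a condition.
high_paid = [rec for rec in employees if rec[2] >= 100]

# GROUP: collect the records that share a key.
by_dept = defaultdict(list)
for rec in employees:
    by_dept[rec[1]].append(rec)

# COUNT: number of records for each key, computed on the grouped data.
counts = {dept: len(recs) for dept, recs in by_dept.items()}

# JOIN: match records from two datasets on a shared key.
dept_names = dict(departments)
joined = [
    (name, dept, dept_names[dept])
    for name, dept, _salary in employees
    if dept in dept_names
]

# DUMP: display the results on screen.
print(counts)
```

In Pig itself, counting is normally done inside a FOREACH over the output of GROUP, which is why the sketch computes the counts from the grouped data.
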
Copyright ©2015 Hub4Tech.com, All Rights Reserved. Hub4Tech™ is registered trademark of Hub4tech Portal Services Pvt. Ltd.
All trademarks and logos appearing on this website are the property of their respective owners.