Course Batch Starts, Timing, Price & Enroll

Program Duration Batch Starts Time Price # Enroll Book free demo
Weekend
60 Hrs Weekend Morning-Batch USD 600
INR 30000
Enroll Now Book free demo class
Weekend
60 Hrs Weekend Evening-Batch USD 600
INR 30000
Enroll Now Book free demo class
Weekdays
60 Hrs Weekdays Morning-Batch USD 600
INR 30000
Enroll Now Book free demo class
Weekdays
60 Hrs Weekdays Evening-Batch USD 600
INR 20000
Enroll Now Book free demo class

# Cloud lab charges will be extra. Our technical consultant will share actual lab charges with you.

About Course

The targeted audience for this course can be -

Data analysts
Business Intelligence Specialists
Developers
System Architects
Database Administrators
This course is designed to help you to understand the core concepts of Apache Hadoop, Pig, Hive and Impala that you can utilize the BigData buzzword. In this course you will get the end to end exposure on how to access, manipulate, transform, and analyze complex data sets using SQL and familiar scripting languages.
After completing this course you will be able to -

Navigate easily in Apache Hadoop ecosystem
Understand various data analytic tools such as Pig, Hive, and Impala
Work with Apache Hadoop and data ETL (extract, transform, load)
Perform real-time and complex queries on datasets
Work with data transformation using custom scripts
Before attending this course you need to be familiar with the big data concepts and data science terminology.

CURRICULUM

Data Scientist

  • 1.1 The Motivation for Hadoop
  • 1.2 Hadoop Overview
  • 1.3 Data Storage: HDFS
  • 1.4 Distributed Data Processing: YARN, MapReduce and Spark
  • 1.5 Data Processing and Analysis: Pig, Hive and Impala
  • 1.6 Data Integration: Sqoop
  • 1.7 Other Hadoop Data Tools
  • 1.8 Exercise Scenarios Explanation
  • 2.1 What Is Pig?
  • 2.2 Pig’s Features
  • 2.3 Pig Use Cases
  • 2.4 Interacting with Pig
  • 3.1 Pig Latin Syntax
  • 3.2 Loading Data
  • 3.3 Simple Data Types
  • 3.4 Field Definitions
  • 3.5 Data Output
  • 3.6 Viewing the Schema
  • 3.7 Filtering and Sorting Data
  • 3.8 Commonly-Used Functions
  • 4.1 Storage Formats
  • 4.2 Complex/Nested Data Types
  • 4.3 Grouping
  • 4.4 Built-In Functions for Complex Data
  • 4.5 Iterating Grouped Data
  • 5.1 Techniques for Combining Data Sets
  • 5.2 Joining Data Sets in Pig
  • 5.3 Set Operations
  • 5.4 Splitting Data Sets
  • 6.1 Troubleshooting Pig
  • 6.2 Logging
  • 6.3 Using Hadoop’s Web UI
  • 6.4 Data Sampling and Debugging
  • 6.5 Performance Overview
  • 6.6 Understanding the Execution Plan
  • 6.7 Tips for Improving the Performance of Your Pig Jobs
  • 7.1 What Is Hive?
  • 7.2 What Is Impala?
  • 7.3 Schema and Data Storage
  • 7.4 Comparing Hive to Traditional Databases
  • 7.5 Hive Use Cases
  • 8.1 Databases and Tables
  • 8.2 Basic Hive and Impala Query Language Syntax
  • 8.3 Data Types
  • 8.4 Differences Between Hive and Impala Query Syntax
  • 8.5 Using Hue to Execute Queries
  • 8. 6 Using the Impala Shell
  • 9.1 Data Storage
  • 9.2 Creating Databases and Tables
  • 9.3 Loading Data
  • 9.4 Altering Databases and Tables
  • 9.5 Simplifying Queries with Views
  • 9.6 Storing Query Results
  • 10.1 Partitioning Tables
  • 10.2 Choosing a File Format
  • 10.3 Managing Metadata
  • 10.4 Controlling Access to Data
  • 11.1 Joining Datasets
  • 11.2 Common Built-In Functions
  • 11.3 Aggregation and Windowing
  • 12.1 How Impala Executes Queries
  • 12.2 Extending Impala with User-Defined Functions
  • 12.3 Improving Impala Performance
  • 13.1 Complex Values in Hive
  • 13.2 Using Regular Expressions in Hive
  • 13.3 Sentiment Analysis and N-Grams
  • 13.4 Conclusion
  • 14.1 Understanding Query Performance
  • 14.2 Controlling Job Execution Plan
  • 14.3 Bucketing
  • 14.4 Indexing Data
  • 15.1 SerDes
  • 15.2 Data Transformation with Custom Scripts
  • 15.3 User-Defined Functions
  • 15.4 Parameterized Queries

Exam & Certification

Cloudera is organizing a certification path named as CCP Data Scientists. To obtain this certificate you need to pass 3 exams –

DS700 – Descriptive and Inferential Statistics on Big Data
DS701 – Advanced Analytical Techniques on Big Data
DS702 - Machine Learning at Scale

Each exam is a single challenge scenario and you have 8 hours to complete the challenge. Each exam may be taken in any order but to avail a valid CCP Data Scientist certificate you must need to pass all three exams within 365 days of each other.

Select Trainer for Demo


Dr. Kamal Gulati
Certification: CCNA , MCP , Willey Certified Big Data Analyst (WCBDA)
From
Professional Experience
Training Experience

Qualification
Ph. D., M.Sc. (Computer Science), M.C.A.

Skills
AWS, Big Data, CCNA, Database, Hadoop , HTML, MongoDB, MS Office, MySQL , Networking , QlikView, SAAS , SQL, Tableau, VBA Macros, Data Science,

Profile
A dedicated and astute professor with 13+ of exper ience in Teaching, Research, Corporate Trainings, Student Management and Coordination with various Academic & Corporate Institute. - Currently working with Amity University, Noida as Assistant Professor, Grade-III – Information Technology & Management. - Worked as Sr. Lecturer with G.L. Bajaj Institute of Technology and Management, Greater Noida. - Worked as I.T Instructor with Bahrain Training Institute, Kingdom of Bahrain (Gulf). - Result oriented, excellent communication, strong interpersonal skills and the ability to relate to students at all levels across the organization; possess stupendous skills for teaching, mentoring, guiding and controlling the students. - Possess knowledge about the internal administrative tasks that are performed within institutions. Read More...
RATING & REVIEWS
Sasidhar Mukthinuthalapati
It is my privilege to recommend Dr. Gulati who taught me I.T. Project Management (SOF – 501). He is an excellent Professor in the university of CSIT Department. Due to his real world experience, he was able to use business examples to supplement textbooks in class. He is truly an extremely intelligent and dedicated professional in all his endeavors with good communication and collaboration skill. I highly recommend him.
Vishwanath Lingamurthy
It is my privilege to write feedback regarding your Knowledge in the subject, teaching capabilities, interaction with students and enlightening them in subject as a CSIT Visiting Scholar Stratford University. Sir your knowledge in the in IT project management is extremely appreciated by all the student community. I was always amazed by the interest you show to motivate to acquire more knowledge and leverage it in research and teaching. I have not seen any situation that you do not have proper answer whenever I ask any questions, relevant or irrelevant, regarding my subject. I am really impressed by the depth of knowledge you have in IT project management. Sir you have got very good communication skills in explaining the subject in a simple and systematic way, which will make any student comfortable and confident about the subject. Your way of presenting simple and friendly lectures helped me a lot to get up to the speed and excel in my work and studies. Dr. Gulati, you show enormously great patience to make anybody comfortable in the subject. As I said earlier, you have spent so many hours with us to explain various techniques which made me thorough in what I am doing in class. Dr. Gulati you are not only friendly but also is gauging our needs as graduate students. You display a genuine concern for the advancement of his students. As a mentor you freely share your knowledge and experience and helps us to be confident in the chosen field of subject. I am sublimating with great pleasure that you are Master in molding me as dynamic and enthusiastic towards the subject. You are the instrumental to ensure the students get quality education they deserve. It is really an honor to the University to have Dr. Gulati a well-qualified and dedicate CIST Scholar sir. I can state with absolute certainty that you are an exceptional teacher I ever seen in my career. Dr. Gulati sir you are one on the best teachers and mentors I ever had in my life and I strongly believe that you w
Dr. Shakir Ullah Khan
Dr. Kamal Gulati is an enthusiastic and helpful person and a great teacher. He has received a very positive feedback from his students at Stratford University. He is committed to excellence. I am sure will be a valuable asset for any university.
Avishek Chakrabarty
I consider myself extremely privileged to have received my education in the field of Computer Sciences from Dr. Kamal Gulati during my MBA, at Amity University. With excellent knowledge and command over the subject, Prof. Kamal always gave his 100% so that we could receive the best education. He was not just a professor to us, but also a friend who was always there for his students whenever we needed him. I thank you sir, for always guiding us in the best possible way.
billy Mwamba
dr kamal gulaty was one of my best faculty in amity university... he thought me two subjects from whrere i learnt many things... proud to have a such of faculty...speaks well teach very... all the best
Kapil Dhingra
Kamal is an ace professional. He is calm, focused & has an eye for detail. He has the ability to work under tight deadlines and has the willingness to take on challenges and achieve his objectives within the given timelines which is what makes him a perfect professional... Wish you luck!!!
Neha Malhotra
I have had the pleasure of knowing Dr. Kamal since 2009. Ambitious professor. Always maintains very good relation with students.Dr. Kamal is an intelligent, achievement oriented, independent and careful teacher. He is not only a multi-skilled and proactive professional but also an inspiring person.
Rajesh. C. Bhatnagar
He is highly qualified both academically and professionally. He has shown extra ordinary competencies in problem resolutions in our department. He has rich experience of Industry working. For last 7 years he has working as a member of faculty at Amity University, Noida he is well known and recognized for his teaching and administrative skills. He is apt in Multi tasking and has dis-changed his administrative duties with excellence in performance. He is held in high esteem by his colleagues, seniors and students. He has qualities of head and heart which have endeared him to his students and colleagues. He will be an asset for any organization he works for. I wish him all success in life.
Praveen Telu
Very fortunate to study under a faculty like you sir. The inputs, support and encouragement you gave me has changed my student career, Whatever I am now it is all coz of you. Thank you sir and hoping that my next batch students will also get benefited with the indepth knowledge of IT skills you have.
J L KAPUR
Dr. Kamal Gulati is a dedicated, committed & intelligent faculty member who can handle any matter given with full competency. His approach to solve any problem is very innovative and out of box. He is an asset to any organisation on account of his positive approach & problem solving skill.
Dr. MUKUL GUPTA
Kamal is a gem and one among the best teachers I have worked with and is a passionate teacher.
shantanu Chakraborty
He is Hardworking ,Committed and Extremely focused on whatever assignment he undertakes.He had an ability to explore new idea and is very flexible.He easily adopt him selves to different roles and responsibility assigned to him
B R Singh
I have been working with Prof Kamal Gulati for the last 5 years and during this period I obsereved that he is highly dedicated Professional, carrier oriented person. He is good team performer and also a good team manager. I can safely vouch, he is an asset to any organization - education provider or otherwise
Aparna Datt
Kamal has been an excellent person to work with. His in depth knowledge about E-commerce, CRM and other aspects of IT combined with management is commendable. He is a thorough and dedicated professional who is always ready to help and a wonderful human being to work with.
Pranav Gupta
u r the best.....
Inderjit Jain
An energatic young man with commitments,convictions and simple habits. Always availble to help others with a smile. I wish him all the success in life
Suddhendu Biswas
He is exceptionally brilliant and sincere worker.He knows his subject and well manages his profession. I wish him success in all his endeavors.
Priyanka Bhagat
Mr. Kamal Gulati was my IT teacher in MBA (G.L.Bajaj Institute of Technology & Management. along with our syllabus ,he gave us a lots of extra knowledge which is useful in day to day activities.He has a vast knowledge of subject he teaches.Any query he use to solve on same day and always helped us whenever we asked for notes etc.
Pankaj Kumar Pathak
Certification:
From
Professional Experience
Training Experience

Qualification
MCA

Skills
Cloudera, Cloudera Impala, Hadoop , Hbase, OpenStack, Python, R, Red Hat , Redhat Linux , RUBY, Ruby on Rails, SAS, Tableau, UNIX, Data Science,

Profile
Pankaj has around 16 years of experience in IT cor porate along with 4 years of experience in Data Science. He is expertise in complete Data Science (Data Analysis, Data Migration and Data Visualization) also and already has trained more than 100 Hadoop architect. he has worked with prominent IT companies like HCL and CSC. he has handle various national and overseas projects on various technologies like Python, R, Linux, Hadoop , Tableau an Qlikview. I also provide world wide training on open source hot skills such as Apache Hadoop, R, Python, Tableau, Qlikview etc.... Read More...
RATING & REVIEWS
Mukesh kumar
I do not hesitate to give a 5 star for Mr Pankaj. He has enormous knowledge on Analytics, he knows what industries needs and how can a student be trained in that path. Even a beginner can be able to catch up with an experienced one with his way of teaching. His knowledge on Data science concepts was exceptional well.
Gaurav Kumar
Pankaj has excellent knowledge about statistics and modelling and would be the best tutor, if you are looking out for any kind of knowledge on Data science. He is very patient with the students and goes extra mile to make them understand the subject. Hats of to good work. Thank you.
Sunil baliwal
Pankaj is a very knowledgeable and efficient tutor. He knows the subject and also teaches very well. His classes are a must take for some one looking for statistics or modelling or data science knowledge. Highly appreciate his knowledge and effort he takes for making the students understand the subject.
Anurag Sharma
Highly recommended. Pankaj has fantastic understanding about the subject and also the current economic situation. Pankaj teaching skills are fantastic and the best thing is he always happy to help.
Anurag Sharma
Excellent Trainer!!!
Vinay Rastogi
it was a fantastic experience with Mr Pankaj, According to me he is one of the best trainer for the experienced candidates. He has the ability to guide the students for industrial environment.
Farhan
Good trainer .
Sachin Pandey
Pankaj sir is a very calm teacher he clear our each and every doubt calmly.
tribhuwan
I am very much delighted to let you know that I have successfully finished the Hadoop classes with Mr pankaj Sir . I am pleased with the class lectures, lecturer, tech hub team members. Provide good quality lectures for all the classes. Its a good experienced took hadoop course with tech hub institute specially pankaj sir. He has a very good experienced.
Deepanjali Saxena
Class s gd
Priyanshu Purwar
I am taking python classes from Pankaj sir. The methodology of teaching is quite different i.e. more hands-on and less theory, that is good for the students. he is very good in explaining things.
Sakshi Kaul
Good knowledge.
Jatin Mahajan
Good knowledge and able to resolve my issue .... Always motivating for good performance...
prachi,saraph
Certification:
From
Professional Experience
Training Experience

Qualification
Master of Science, Computer Science, Universoty of South Florida

Skills
Big Data, SAS,

Profile
RATING & REVIEWS
Sachin Adnaik
Certification: Statistics
From
Professional Experience
Training Experience

Qualification
Ph.D. (Statistics)

Skills
Business Analysis, Machine Learning, Microsoft Azure, SAS, Statistics, Data Science,

Profile
Have strong knowledge of R, SAS, RapidMiner, Machi ne Learning and Predictive Modeling. Conducted trainings (for Industries and for Training Institutes) on Data Science, Machine Learning, Predictive Modeling, Statistics using R and other tools. Read More...
RATING & REVIEWS
Disclaimer

** The above course information is taken from Cloudera Inc., Apache Software Foundation

* Money Back Guarantee till demo and 1st class of the course.

* All trademarks and logos appearing on this website are the property of their respective owners.


Copyright ©2015 Hub4Tech.com, All Rights Reserved. Hub4Tech™ is registered trademark of Hub4tech Portal Services Pvt. Ltd.
All trademarks and logos appearing on this website are the property of their respective owners.
FOLLOW US