Course Batch Starts, Timing, Price & Enroll

Program    Duration   Batch Starts   Time            Price
Weekend    40 Hrs     Weekend        Morning-Batch   USD 500 / INR 20000
Weekend    40 Hrs     Weekend        Evening-Batch   USD 500 / INR 20000
Weekdays   40 Hrs     Weekdays       Morning-Batch   USD 500 / INR 20000
Weekdays   40 Hrs     Weekdays       Evening-Batch   USD 500 / INR 20000

About Course

This course is designed for professionals who are familiar with programming and Linux commands and are looking for a career path in data analytics. The target audience for this course includes:

Developers
IT Architects
Programmers
Business Intelligence professionals
Recent graduates looking for a career path in Big Data
This course aims to prepare you for the Big Data world by delivering in-depth knowledge of the key concepts required to ingest and process data on a Hadoop cluster. You will gain insight into Hadoop ecosystem tools such as Spark, Hive, Flume, Sqoop, and Impala.
After completing this course, you will be able to:

Understand the core concept of HDFS
Distribute and process data using Hadoop cluster and Apache Spark
Use Sqoop and Flume for data loading
Work with Pig, Hive and YARN to perform data analytics
Understand the data structure in tables using Impala and Hive
There are no prerequisites for joining this course.

CURRICULUM

Big Data and Hadoop Developer

  • 1.1 Problems with Traditional Large-Scale Systems
  • 1.2 Hadoop!
  • 1.3 Data Storage and Ingest
  • 1.4 Data Processing
  • 1.5 Data Analysis and Exploration
  • 1.6 Other Ecosystem Tools
  • 2.1 Distributed Processing on a Cluster
  • 2.2 Storage: HDFS Architecture
  • 2.3 Storage: Using HDFS
  • 2.4 Resource Management: YARN Architecture
  • 2.5 Resource Management: Working with YARN
  • 3.1 Sqoop Overview
  • 3.2 Basic Imports and Exports
  • 3.3 Limiting Results
  • 3.4 Improving Sqoop’s Performance
  • 3.5 Sqoop 2
  • 4.1 Introduction to Impala and Hive
  • 4.2 Why Use Impala and Hive?
  • 4.3 Querying Data With Impala and Hive
  • 4.4 Comparing Hive and Impala to Traditional Databases
  • 5.1 Data Storage Overview
  • 5.2 Creating Databases and Tables
  • 5.3 Loading Data into Tables
  • 5.4 HCatalog
  • 5.5 Impala Metadata Caching
  • 6.1 Selecting a File Format
  • 6.2 Hadoop Tool Support for File Formats
  • 6.3 Avro Schemas
  • 6.4 Using Avro with Impala, Hive, and Sqoop
  • 6.5 Avro Schema Evolution
  • 6.6 Compression
  • 7.1 Partitioning Overview
  • 7.2 Partitioning in Impala and Hive
  • 8.1 What is Apache Flume?
  • 8.2 Basic Flume Architecture
  • 8.3 Flume Sources
  • 8.4 Flume Sinks
  • 8.5 Flume Channels
  • 8.6 Flume Configuration
  • 9.1 What is Apache Spark?
  • 9.2 Using the Spark Shell
  • 9.3 RDDs (Resilient Distributed Datasets)
  • 9.4 Functional Programming in Spark
  • 10.1 Creating RDDs
  • 10.2 Other General RDD Operations
  • 11.1 Spark Applications vs. Spark Shell
  • 11.2 Creating the SparkContext
  • 11.3 Building a Spark Application (Scala and Java)
  • 11.4 Running a Spark Application
  • 11.5 The Spark Application Web UI
  • 11.6 Configuring Spark Properties
  • 11.7 Logging
  • 12.1 Review: Spark on a Cluster
  • 12.2 RDD Partitions
  • 12.3 Partitioning of File-Based RDDs
  • 12.4 HDFS and Data Locality
  • 12.5 Executing Parallel Operations
  • 12.6 Stages and Tasks
  • 13.1 RDD Lineage
  • 13.2 RDD Persistence Overview
  • 13.3 Distributed Persistence
  • 14.1 Common Spark Use Cases
  • 14.2 Iterative Algorithms in Spark
  • 14.3 Graph Processing and Analysis
  • 14.4 Machine Learning
  • 14.5 Example: k-means
  • 15.1 Spark SQL and the SQL Context
  • 15.2 Creating DataFrames
  • 15.3 Transforming and Querying DataFrames
  • 15.4 Saving DataFrames
  • 15.5 DataFrames and RDDs
  • 15.6 Comparing Spark SQL, Impala, and Hive-on-Spark

Exam & Certification

Exam Name: CCA Spark and Hadoop Developer Exam
Exam Code: CCA175
Number of questions: 10 to 12 (performance-based)
Exam duration: 120 Minutes
Passing marks: 70%
Language: English, Japanese

POWER OF CHOOSING YOUR TRAINER - TAKE DEMO FROM 3 TRAINERS AND SELECT


Shrirang Mukund Garge
MBA, Oracle, Scrum & Hortonworks Certified
Subject Expertise
Big Data

Profile
Hortonworks certified developer. Certified Scrum Master. Oracle Certified Associate. Total 15 years of IT experience. Around 4 years of experience in the areas of Data Warehousing and Data Migration using PL/SQL, BI, Informatica, Cognos.
RATING & REVIEWS
Ankita Agrawal
I joined Rave as a fresher and I was totally new to the IT industry. The kind of support I got from him was terrific. Anyone can learn from him the art of "Staying Calm and Cool" irrespective of the situation. One very good thing about him is that he believes in giving timely feedback to his juniors, which helps them learn more and improve their performance. And yes, he is a dependable person. I am sure he will be a pillar for any organization he joins.
Ravi Mittal
I worked with Shrirang for close to 5 years. He is someone you can bank on. Once he gets his arms around a process, he follows it to the minutest detail. He has a very calm demeanor. Great person to know and work with.
Cyril Rayan
Shrirang is an excellent program manager and has a strong team to deliver great results. He has great expertise in software projects and can deliver extremely well.
Anand Jain
Shrirang is a person with a broad vision and a clear mindset. He always aims for the best possible results by getting his team involved in making decisions and putting their best efforts in place. He is confident and full of business knowledge, and is capable of leading both a team and himself.
Pallavi Bakliwal
I had the opportunity of working with Shrirang at Rave for 4 years. He is methodical, organized and very committed towards the job he undertakes. He has always demonstrated a high degree of expertise and his ability to connect with people, inspire and motivate them is incredible. He is always around to help people and he is the best manager one can ever work with.
Ankit Johari
Working with Shrirang was a great phase of my life. He is always supportive and helpful in difficult times. He is one of the good and skillful managers I have worked in...
Sankalp Shekhar
Shrirang is a great colleague and a peer whom you could look up to for counsel. With his acute understanding of Agile methodology, he has been a delight to work alongside as a Scrum Master. His understanding and standards of work quality have always helped bring out the best in the resources.
Aliakbar Badshah
I had the pleasure of working with Shrirang at Rave Infosys. I learned a lot from the start to the end of my career at Rave. He is a good project manager, director and also a good person.
Ashok Patidar
I have worked with Shrirang for more than 1 year. It was really a pleasure working under his guidance at Rave Infosys. It is an honor and a privilege to have worked with an exceptional director. No doubt, he is an outstanding leader, person and friend. I highly recommend working with him.
Sandeep Patle
I can still remember that one-year span. Working under Mr. Shrirang was a great experience for me and my team. Though we had a lot of pressure from the client, Shrirang handled it quite nicely. He is one of the best managers I have seen in the 5 years of my career. I still remember his words: "Family is very important than Office!!!"
Ankita choudhary
I started my career from Rave and I was fortunate to have Shrirang as my manager. He guided and motivated me and all the people working under him and with him whenever required. He is very detail oriented, hardworking and a good team leader. He is a great person and an outstanding director and mentor.
Anil Kumar
Master of Science, Middlesex University; Bachelor of Engineering
Subject Expertise
Big Data
Tarun Sukhani
Ph.D
Subject Expertise
Agile

Profile
I am an IT executive, educator, author, international speaker, security expert, agile coach, coder, and entrepreneur with over 19 years of combined professional experience both in the U.S. and internationally. As a seasoned veteran, my expertise lies in leading teams in the design and delivery of highly scalable, concurrent, and performant enterprise software solutions with budgets of up to $100 million. I am particularly adept at building productive, self-managing agile teams with predictable velocities and delivery timeframes. I am skilled in all phases of the SDLC/ALM, with a solid foundation in Agile (XP, SAFe, Lean, Scrum, Kanban, and Scrumban) and traditional (PMI and PRINCE2) project management frameworks and methodologies.
Academic credentials:
Degrees in Math, Computer Science, Business Administration (Finance, Accounting, and Operations Management), and Information Systems Management
Certified in Scrum (CSM, CSP, PSM I), Data Warehousing/Mining/Science, E-commerce, and Microsoft/Oracle technologies (.NET/Java)
PMI/PRINCE2 project management
CISO (Infosec)
6 Sigma/Lean at GEMS (GE Medical Systems)
ITIL, TOGAF, COBIT, ISO27000, NIST
Machine Learning/Text Mining, Genetic Algorithms, Combinatorial Optimization, Computational Finance, Driver-Based Budgeting
Proficient in Big Data/Data Science: Hadoop, Pig, Hive, HBase, Spark, R/Rattle, Cassandra, YARN, Zookeeper, Mahout
Pursuing Ph.D. at Coventry University
Achievements:
Project Lead on over 50 projects at clients that included Fortune 500 and Global 1000 companies, spanning the healthcare, manufacturing, Internet, media, BFSI, telco, oil & gas, and education verticals.
Corporate trainer and university lecturer, conducting academic and vocational business and IT courses in North/South America, Europe and Asia.
Co-founder of 4 companies
Consultant to 5 Fortune 500 companies
Ernst & Young Winner of 2005 business plan competition
Member of Who of accomplished professionals
Pawan Sharma
MCA
Subject Expertise
Big Data

Profile
- Leading Data Request (Business Analysis) team size of 10.
- Working on JIRA (task management system).
- Working on Big Data (Hadoop (HortonWorks), Hive, Sqoop).
Brajveer Singh
M.Tech
Subject Expertise
Data Structure
Venkita Krishnan
Hortonworks Certified
Subject Expertise
Big Data

Profile
- 25 years of total experience in IT as a consultant / trainer / mentor, with 19 years of assisting staffing organizations in mentoring their candidates in J2EE, Spring, Web Services, Hadoop, YARN, Spark, Storm and other ecosystem components.
- 6 years of experience in training on Hadoop and its ecosystem [Pig, Hive, Sqoop, HBase], and actively involved in mentoring organizations in their Hadoop usage analysis and implementation. Trained participants from JPMC, Microsoft, Wells Fargo, Fidelity, Amdocs, SAS, Bank of America, Vodafone. Have delivered online trainings for Zarantec, Collabera, Edureka and NicheTek on Hadoop, YARN, Spark and Storm.
- Have mentored the Microsoft India IT / Professional Services team on HD Insight and JPMC team members on Hadoop understanding and implementations.
- 8 years of experience in Savvion BPM training for customers like IBM, e-Rewards, AT&T, Bell Canada, Sun Microsystems, ADP, Sandia Labs, Penske, PWC, Advanced Equities, Seagate, PayPal, Micron, DocHarbour, Visa, Anacomp, HBO, Citadel, Morgan Stanley, Reply Group (Italy), RBC Dexia (Canada), Rogers Communication (Canada), Virtusa (Sri Lanka), GVT (Brazil), Kernel (Mexico), Bank of America (Canada), Motorola (Germany).
- Have exposure to different phases of the software development life cycle, including Business Requirements, System Analysis, Documentation, Designing, Development, Issues & CR Management, Unit Testing & Integration Testing and Production Deployment, and hence relate to the same in training.
- Have consistently received participant satisfaction of more than 92% in all trainings.
Specialties: Hadoop, YARN, Spark, Storm, Data Science, Hive, Pig, Sqoop, Java, Spring MVC, Savvion BPM, HD Insight.
Ravi Kumar Mishra
B.Tech
Subject Expertise
Big Data
Mukesh Kumar Shukla
MBA, BTech
Subject Expertise
Big Data

Profile
I am currently working as a consultant and freelance trainer in Big Data technologies. Big Data technologies are my forte; with over 10 years of extensive experience in the IT industry, I have designed and developed large-scale enterprise applications, including building a $30 million Big Data platform for one of the big Indian banks and 3 major Big Data platforms for Indian telecom vendors. I take a keen interest in Hadoop and related technologies such as Apache Storm, Spark, Mahout and R programming. I am also a strong proponent of NoSQL databases such as Elasticsearch, MongoDB, Cassandra, Redis and Riak, and I have been using these technologies for designing and deploying Big Data solutions for the telecom, storage and financial domains. My technical expertise comes to the fore in my training delivery. My ability to powerfully communicate complex technology concepts to audiences of various levels is highly appreciated by my trainees and training managers. Global giants such as Nokia, IBM, Sun, Capgemini, GE, SAP Labs, Accenture, Wipro, TCS, Atos, Honeywell, CTS, Genpact, Intuit, Flipkart, Annik-System and Visual-IQ have benefited from my training deliveries and keep coming back for repeat deliveries. Overall, I have conducted more than 40 corporate batches and 35 online batches for Big Data technologies such as Hadoop, MongoDB, Cassandra, Spark, Storm and data warehousing. Additionally, I am a Certified SpringSource/Pivotal trainer, one of the few expert trainers globally who have measured up to the extremely high standards laid down by SpringSource/Pivotal, the company leading the advancement of the Big Data & Java ecosystem. I am also an active participant in leading technology forums and conferences: I have authored various white papers and articles and presented at multiple conferences.
Isaiah Babu
MCA
Subject Expertise
Hadoop
Munaswamy
MCA
Subject Expertise
Hadoop

Profile
Application Developer, Scrum Master, and Project Lead in a wide variety of business applications. Interested in client/server and relational database design & migration projects.
Disclaimer

** The above course information is taken from Cloudera Inc., Apache Software Foundation

* Money Back Guarantee till demo and 1st class of the course.


Copyright ©2015 Hub4Tech.com, All Rights Reserved. Hub4Tech™ is a registered trademark of Hub4tech Portal Services Pvt. Ltd.
All trademarks and logos appearing on this website are the property of their respective owners.