Course Batch Starts, Timing, Price & Enroll

Program | Duration | Batch Starts | Time | Price
Weekend | 40 Hrs | Weekend | Morning-Batch | USD 500 400 / INR 20000 18500
Weekend | 40 Hrs | Weekend | Evening-Batch | USD 500 400 / INR 20000 18500
Weekdays | 40 Hrs | Weekdays | Morning-Batch | USD 500 400 / INR 20000 18500
Weekdays | 40 Hrs | Weekdays | Evening-Batch | USD 500 400 / INR 20000 18500

About Course

This course is designed for professionals who are familiar with programming and Linux commands and are looking for a career path in data analytics. The target audience includes:

Developers
IT Architects
Programmers
Business Intelligence professionals
Recent graduates looking for a career path in Big Data
This course aims to prepare you for the Big Data world by delivering in-depth knowledge of the key concepts required to ingest and process data on a Hadoop cluster. You will gain insight into Hadoop ecosystem tools such as Spark, Hive, Flume, Sqoop, and Impala.
After completing this course, you will be able to:

Understand the core concept of HDFS
Distribute and process data using Hadoop cluster and Apache Spark
Use Sqoop and Flume for data loading
Work with Pig, Hive and YARN to perform data analytics
Understand the data structure in tables using Impala and Hive
There are no prerequisites for joining this course.

CURRICULUM

Big Data and Hadoop Developer

  • 1.1 Problems with Traditional Large-Scale Systems
  • 1.2 Hadoop!
  • 1.3 Data Storage and Ingest
  • 1.4 Data Processing
  • 1.5 Data Analysis and Exploration
  • 1.6 Other Ecosystem Tools
  • 2.1 Distributed Processing on a Cluster
  • 2.2 Storage: HDFS Architecture
  • 2.3 Storage: Using HDFS
  • 2.4 Resource Management: YARN Architecture
  • 2.5 Resource Management: Working with YARN
  • 3.1 Sqoop Overview
  • 3.2 Basic Imports and Exports
  • 3.3 Limiting Results
  • 3.4 Improving Sqoop’s Performance
  • 3.5 Sqoop 2
  • 4.1 Introduction to Impala and Hive
  • 4.2 Why Use Impala and Hive?
  • 4.3 Querying Data With Impala and Hive
  • 4.4 Comparing Hive and Impala to Traditional Databases
  • 5.1 Data Storage Overview
  • 5.2 Creating Databases and Tables
  • 5.3 Loading Data into Tables
  • 5.4 HCatalog
  • 5.5 Impala Metadata Caching
  • 6.1 Selecting a File Format
  • 6.2 Hadoop Tool Support for File Formats
  • 6.3 Avro Schemas
  • 6.4 Using Avro with Impala, Hive, and Sqoop
  • 6.5 Avro Schema Evolution
  • 6.6 Compression
  • 7.1 Partitioning Overview
  • 7.2 Partitioning in Impala and Hive
  • 8.1 What is Apache Flume?
  • 8.2 Basic Flume Architecture
  • 8.3 Flume Sources
  • 8.4 Flume Sinks
  • 8.5 Flume Channels
  • 8.6 Flume Configuration
  • 9.1 What is Apache Spark?
  • 9.2 Using the Spark Shell
  • 9.3 RDDs (Resilient Distributed Datasets)
  • 9.4 Functional Programming in Spark
  • 10.1 Creating RDDs
  • 10.2 Other General RDD Operations
  • 11.1 Spark Applications vs. Spark Shell
  • 11.2 Creating the SparkContext
  • 11.3 Building a Spark Application (Scala and Java)
  • 11.4 Running a Spark Application
  • 11.5 The Spark Application Web UI
  • 11.6 Configuring Spark Properties
  • 11.7 Logging
  • 12.1 Review: Spark on a Cluster
  • 12.2 RDD Partitions
  • 12.3 Partitioning of File-Based RDDs
  • 12.4 HDFS and Data Locality
  • 12.5 Executing Parallel Operations
  • 12.6 Stages and Tasks
  • 13.1 RDD Lineage
  • 13.2 RDD Persistence Overview
  • 13.3 Distributed Persistence
  • 14.1 Common Spark Use Cases
  • 14.2 Iterative Algorithms in Spark
  • 14.3 Graph Processing and Analysis
  • 14.4 Machine Learning
  • 14.5 Example: k-means
  • 15.1 Spark SQL and the SQL Context
  • 15.2 Creating DataFrames
  • 15.3 Transforming and Querying DataFrames
  • 15.4 Saving DataFrames
  • 15.5 DataFrames and RDDs
  • 15.6 Comparing Spark SQL, Impala, and Hive-on-Spark

Exam & Certification

Exam Name: CCA Spark and Hadoop Developer Exam
Exam Code: CCA175
Number of questions: 10–12 (performance-based)
Exam duration: 120 Minutes
Passing marks: 70%
Language: English, Japanese

POWER OF CHOOSING YOUR TRAINER - TAKE DEMO FROM 3 TRAINERS AND SELECT


Ashutosh Agarwal
Subject Expertise: Hadoop
Profile: Ashutosh Agarwal has about 15 years of experience in enterprise software product development, with a focus on big data technologies and predictive analytics over the last several years. His specific areas of interest include search and discovery, semantics, text analytics, scalability, and performance.

Pankaj Pathak
MCA
Subject Expertise: Cloudera
Profile: I have around 16 years of corporate IT experience along with 4 years of experience in data science. My expertise covers the full data science workflow (data analysis, data migration, and data visualization). I have worked with prominent IT companies such as HCL and CSC, and I have handled national and overseas projects using technologies such as Python, R, Linux, Hadoop, Tableau, and QlikView. I also provide training worldwide on in-demand skills such as Apache Hadoop, R, Python, Tableau, and QlikView.

Shrirang Mukund Garge
MBA; Oracle, Scrum & Hortonworks Certified
Subject Expertise: Big Data
Profile: Hortonworks certified developer. Certified Scrum Master. Oracle Certified Associate. 15 years of total IT experience. Around 4 years of experience in data warehousing and data migration using PL/SQL, BI, Informatica, and Cognos.

Anil Kumar
Master of Science (Middlesex University), Bachelor of Engineering
Subject Expertise: Big Data

Tarun Sukhani
Ph.D
Subject Expertise: Agile
Profile: I am an IT executive, educator, author, international speaker, security expert, agile coach, coder, and entrepreneur with over 19 years of combined professional experience both in the U.S. and internationally. As a seasoned veteran, my expertise lies in leading teams in the design and delivery of highly scalable, concurrent, and performant enterprise software solutions with budgets of up to $100 million. I am particularly adept at building productive, self-managing agile teams with predictable velocities and delivery timeframes. I am skilled in all phases of the SDLC/ALM, with a solid foundation in Agile (XP, SAFe, Lean, Scrum, Kanban, and Scrumban) and traditional (PMI and PRINCE2) project management frameworks and methodologies.
Academic credentials:
  • Degrees in Math, Computer Science, Business Administration (Finance, Accounting, and Operations Management), and Information Systems Management
  • Certified in Scrum (CSM, CSP, PSM I), Data Warehousing/Mining/Science, E-commerce, and Microsoft/Oracle technologies (.NET/Java)
  • PMI/PRINCE2 project management
  • CISO (Infosec)
  • 6 Sigma/Lean at GEMS (GE Medical Systems)
  • ITIL, TOGAF, COBIT, ISO27000, NIST
  • Machine Learning/Text Mining, Genetic Algorithms, Combinatorial Optimization, Computational Finance, Driver-Based Budgeting
  • Proficient in Big Data/Data Science: Hadoop, Pig, Hive, HBase, Spark, R/Rattle, Cassandra, YARN, Zookeeper, Mahout
  • Pursuing Ph.D. at Coventry University
Achievements:
  • Project Lead on over 50 projects at clients that included Fortune 500 and Global 1000 companies, spanning the healthcare, manufacturing, Internet, media, BFSI, telco, oil & gas, and education verticals
  • Corporate trainer and university lecturer, conducting academic and vocational business and IT courses in North/South America, Europe, and Asia
  • Co-founder of 4 companies
  • Consultant to 5 Fortune 500 companies
  • Ernst & Young Winner of 2005 business plan competition
  • Member of Who's Who of accomplished professionals

Pawan Sharma
MCA
Subject Expertise: Big Data
Profile:
  • Leading a Data Request (Business Analysis) team of 10.
  • Working with JIRA (task management system).
  • Working on Big Data (Hadoop (Hortonworks), Hive, Sqoop).

Brajveer Singh
Subject Expertise: Data Structure

Venkita Krishnan
Hortonworks Certified
Subject Expertise: Big Data
Profile:
  • 25 years of total experience in IT as a consultant, trainer, and mentor, with 19 years of assisting staffing organizations in mentoring their candidates in J2EE, Spring, Web Services, Hadoop, YARN, Spark, Storm, and other ecosystem components.
  • 6 years of experience delivering training on Hadoop and its ecosystem (Pig, Hive, Sqoop, HBase), and actively involved in mentoring organizations in their Hadoop usage analysis and implementation. Trained participants from JPMC, Microsoft, Wells Fargo, Fidelity, Amdocs, SAS, Bank of America, and Vodafone. Delivered online training for Zarantec, Collabera, Edureka, and NicheTek on Hadoop, YARN, Spark, and Storm.
  • Mentored the Microsoft India IT / Professional Services team on HDInsight and JPMC team members on Hadoop understanding and implementation.
  • 8 years of experience in Savvion BPM training for customers such as IBM, e-Rewards, AT&T, Bell Canada, Sun Microsystems, ADP, Sandia Labs, Penske, PWC, Advanced Equities, Seagate, PayPal, Micron, DocHarbour, Visa, Anacomp, HBO, Citadel, Morgan Stanley, Reply Group (Italy), RBC Dexia (Canada), Rogers Communication (Canada), Virtusa (Sri Lanka), GVT (Brazil), Kernel (Mexico), Bank of America (Canada), and Motorola (Germany).
  • Exposure to different phases of the software development life cycle, including business requirements, system analysis, documentation, design, development, issue and CR management, unit and integration testing, and production deployment, and able to relate training to each of them.
  • Consistently achieved participant satisfaction of more than 92% in all trainings.
Specialties: Hadoop, YARN, Spark, Storm, Data Science, Hive, Pig, Sqoop, Java, Spring MVC, Savvion BPM, HDInsight.

Ravi Kumar Mishra
B.Tech
Subject Expertise: Big Data

Mukesh Kumar Shukla
MBA, BTech
Subject Expertise: Big Data
Profile: I am currently working as a consultant and freelance trainer in Big Data technologies. Big Data technologies are my forte: with over 10 years of extensive experience in the IT industry, I have designed and developed large-scale enterprise applications, including building a $30 million Big Data platform for one of the big Indian banks and 3 major Big Data platforms for Indian telecom vendors. I take a keen interest in Hadoop and related technologies such as Apache Storm, Spark, Mahout, and R programming. I am also a strong proponent of NoSQL databases such as Elasticsearch, MongoDB, Cassandra, and Redis/Riak, and I have been using these technologies to design and deploy Big Data solutions for the telecom, storage, and financial domains.
My technical expertise comes to the fore in my training delivery. My ability to communicate complex technology concepts powerfully to audiences of various levels is highly appreciated by my trainees and training managers. Global giants such as Nokia, IBM, Sun, Capgemini, GE, SAP Labs, Accenture, Wipro, TCS, Atos, Honeywell, CTS, Genpact, Intuit, Flipkart, Annik-System, and Visual-IQ have benefited from my training deliveries and keep coming back for repeat deliveries. Overall, I have conducted more than 40 corporate batches and 35 online batches on Big Data technologies such as Hadoop, MongoDB, Cassandra, Spark, Storm, and data warehousing.
Additionally, I am a Certified SpringSource/Pivotal trainer, one of the few expert trainers globally who have measured up to the extremely high standards laid down by SpringSource/Pivotal, the company leading the advancement of the Big Data and Java ecosystem. I am also an active participant in leading technology forums and conferences: I have authored various white papers and articles and presented at multiple conferences.

Isaiah Babu
MCA
Subject Expertise: Hadoop

Munaswamy
MCA
Subject Expertise: Hadoop
Profile: Application Developer, Scrum Master, and Project Lead in a wide variety of business applications. Interested in client/server and relational database design & migration projects.

Disclaimer

** The above course information is taken from Cloudera Inc. and the Apache Software Foundation.

* Money-back guarantee valid through the demo and the 1st class of the course.

Copyright © 2015 Hub4Tech.com, All Rights Reserved. Hub4Tech™ is a registered trademark of Hub4tech Portal Services Pvt. Ltd.
All trademarks and logos appearing on this website are the property of their respective owners.