Big Data Hadoop And Spark Developer Certification Training
simplilearn
Key Features:
- 40 hours of instructor-led training
- 24 hours of self-paced video
- 5 real-life industry projects in banking, telecom, insurance, and e-commerce domains
- Hands-on practice with CloudLabs
- Includes training on Yarn, MapReduce, Pig, Hive, Impala, HBase, and Apache Spark
- Aligned to Cloudera CCA175 certification exam
What are the course objectives?
This course will enable you to:
- Understand the different components of Hadoop ecosystem such as Hadoop 2.7, Yarn, MapReduce, Pig, Hive, Impala, HBase, Sqoop, Flume, and Apache Spark
- Understand Hadoop Distributed File System (HDFS) and YARN as well as their architecture, and learn how to work with them for storage and resource management
- Understand MapReduce and its characteristics, and assimilate some advanced MapReduce concepts
- Get an overview of Sqoop and Flume and describe how to ingest data using them
- Create database and tables in Hive and Impala, understand HBase, and use Hive and Impala for partitioning
- Understand different types of file formats, Avro Schema, using Arvo with Hive, and Sqoop and Schema evolution
- Understand Flume, Flume architecture, sources, flume sinks, channels, and flume configurations
- Understand HBase, its architecture, data storage, and working with HBase. You will also understand the difference between HBase and RDBMS
- Gain a working knowledge of Pig and its components
- Do functional programming in Spark
- Understand resilient distribution datasets (RDD) in detail
- Implement and build Spark applications
- Gain an in-depth understanding of parallel processing in Spark and Spark RDD optimization techniques
- Understand the common use-cases of Spark and the various interactive algorithms
- Learn Spark SQL, creating, transforming, and querying Data frames
- Prepare for Cloudera Big Data CCA175 certification
Who should take this course?
Big Data career opportunities are on the rise, and Hadoop is quickly becoming a must-know technology for the following professionals:
- Software Developers and Architects
- Analytics Professionals
- Senior IT professionals
- Testing and Mainframe professionals
- Data Management Professionals
- Business Intelligence Professionals
- Project Managers
- Aspiring Data Scientists
- Graduates looking to build a career in Big Data Analytics
Prerequisite:
- As the knowledge of Java is necessary for this course, we are providing a complimentary access to “Java Essentials for Hadoop” course
- For Spark we use Python and Scala and an Ebook has been provided to help you with the same
- Knowledge of an operating system like Linux is useful for the course