Font size:

Course Overview
This course is designed as an entry point for developers who need to create applications to analyze Big Data stored in Apache Hadoop using Spark. Topics include: An overview of the Hortonworks Data Platform (HDP), including HDFS and YARN Using Spark Core

Course Target Audience
Software engineers that are looking to develop in-memory applications for time sensitive and highly iterative applications in an Enterprise HDP environment.

Course Prerequisites
Students should be familiar with programming principles and have previous experience in software development using either Python or Scala. Previous experience with data streaming, SQL, and HDP is also helpful, but not required.

Course Objectives
Describe Hadoop, HDFS, YARN, and the HDP ecosystem Describe Spark use cases Explore and manipulate data using Zeppelin Explore and manipulate data using a Spark REPL Explain the purpose and function of RDDs Employ functional programming practices Perform

Course Outline
Format 50% Lecture/Discussion 50% Hands-on Labs Hands-On Lab Activities Labs can be performed using either Python or Scala Use common HDFS commands Use a REPL to program in Spark Use Zeppelin to program in Spark Perform RDD transformations and actions Per

The course you have selected has limited or no upcoming scheduled training dates!

Please browse similar courses or request more information for assistance.'s training support team will respond within one business day with relevant offerings.