Data Factory with Pig

When you register for training with TrainUp.com, you are also supporting local education. Find out how.

Course Description

Hadoop is an open source software for affordable supercomputing. It provides the distributed file system and the parallel processing required to run a massive computing cluster. This course explains Pig as a data flow scripting tool for interfacing with Hadoop. You'll learn about the installation and configuration of Pig and explore a demonstration of Pig in action. This learning path can be used as part of the preparation for the Cloudera Certified Administrator for Apache Hadoop (CCA-500) exam.

Learning Objectives

Start the course
Describe Pig and its strengths
Recall the minimal edits needed to be made to the configuration file
Install and configure Pig
Recall the complex data types used by Pig
Recall some of the relational operators used by Pig
Use the Grunt shell with Pig Latin
Set parameters from both a text file and with the command line
Write a Pig script
Use a Pig script to filter data
Use the FOREACH operator with a Pig script
Set parameters and arguments in a Pig script
Write a Pig script to count data
Perform data joins using a Pig script
Group data using a Pig script
Cogroup data with a Pig script
Flatten data using a pig script
Recall the languages that can be used to write user defined functions
Create a user defined function for Pig
Recall the different types of error categories
Use explain in a Pig script
Install Pig, use Pig operators and Pig Latin, and retrieve and group records

Audience

Technical personnel with a background in Linux, SQL, and programming who intend to join a Hadoop Engineering team in roles such as Hadoop developer, data architect, or data engineer or roles related to technical project management, cluster operations, or data analysis

Duration

113 minutes

Course Price

$74.95

This course is also included in the Following Elearning Collection(s):

view more collections

Looking to Enroll a Group?

Instant Access
From Anywhere
Unlimited
Viewing
6-12 Months
To Complete

Join The 50,000+ Companies That Have Purchased Training from TrainUp.com

50K+ Companies Trained
Including 90% Of Fortune 500 Companies Have Purchased Training With TrainUp.com
300K+ Courses & Videos
Live Instructor-Led (Classroom & Virtual), Self-Paced E-learning & Custom OnSite Training Solutions From Leading Training Providers
800+ Expert Instructors
Industry-Leading Subject Matter Experts (SMEs).Tenured &
Award-Winning Instructor Network

Online Course

Data Factory with Pig

Join The 50,000+ Companies That Have Purchased Training from TrainUp.com

Details about this course, including summary information, upcoming dates, and a link to this page, will be sent to the recipient specified below.

Send course information to the email address below:

Need to request approval from a manager at your organization? Add their email address below:

Bring Any training to your facility or train your team members remotely with a customized virtual course. Fill out the form to get started and a solutions rep will be in touch shortly!

Our on-site team responds to every request the same day!