Process Big Data using Apache PIG

Learn to analyze and process big data using Apache Pig
(29 reviews)
181 students

Platform: Udemy
Video: 5h 41m
Language: English
Next start: On Demand


Pig is a high-level platform for creating MapReduce programs used with Hadoop. The language for this platform is called Pig Latin. In this course we will go through the Pig data flow platform and the Pig Latin language it uses. The course covers the following concepts:
• Writing complex MapReduce transformations using a simple scripting language.
• Basics of Big Data, Hadoop and the MapReduce framework.
• The Pig data model and the different types of operators that work on datasets.
• Built-in functions as well as User Defined Functions for performing specific tasks.
• Running Pig scripts, unit testing and compression.
• More advanced topics such as embedding Pig in Java and Pig macros.
All the books and PDFs are included, allowing you to follow along with the author throughout the modules in this course.

You will learn

✓ Overview of Big Data and Hadoop Framework
✓ Anatomy of a MapReduce Framework
✓ Basics of the Apache Pig tool and when to use it or not
✓ Run Pig in different Modes
✓ Use Pig Latin Queries
✓ Different types of Pig operators for analysing the data
✓ Understand the architecture of the Pig tool
✓ Work with the Pig data model
✓ Different kinds of built-in functions
✓ Advanced Pig concepts such as Pig Streaming, Pig scripts and User Defined Functions (UDFs)
✓ Compress the input files, final output files and intermediate output files
✓ Pig unit testing, Pig macros and parameter substitution
✓ How to embed Pig in Java

Requirements


• Basic understanding of Hadoop
• Basic knowledge of a declarative language such as SQL
• Basic knowledge of the Java programming language
• Basic knowledge of Big Data is helpful but not mandatory

This course is for

• Students interested in the Big Data and Hadoop field
• Database Developers and Administrators
• Software developers who want to build their career in the Big Data field
• Data Analysts
• Data Scientists and Researchers
Engraving Intelligence
Insculpt technologies is a leading publisher of development courses that provide in-depth knowledge and high-quality training. Its mission is to give the right direction to people who are looking for a career in the IT/software industry. Insculpt is the best place for learning new technologies and making things easy to understand virtually.