Talend Studio for Big Data helps you develop faster with a drag-and-drop UI and pre-built connectors and components. Because Open Studio for Big Data is fully open source, you can see the code and work with it. Take advantage of Cloud, Hadoop and NoSQL databases. Customize and create components or leverage community components and code to extend your project.
Exam Topics
•Create cluster metadata
•Create HDFS and Hive metadata
•Connect to your cluster to use HDFS, HBase, Hive, Pig, Sqoop, and MapReduce
•Read data from and write it to HDFS (HDFS, HBase)
•Read tables from and write them to HDFS (Hive)
•Process tables stored in HDFS with Hive
•Process data stored in HDFS with Pig
•Process data stored in HDFS with Big Data batch Jobs
•Develop a Big Data batch Job using the Spark framework
•Execute Spark Jobs in YARN client and cluster mode
•Enable Spark history server event logging
•Copy data from a local file to HDFS
•Copy data from MySQL to HDFS
•Create a Hive table and copy data from HDFS to it
•Import tweets to HDFS
•Join, sort, and aggregate data
•Use caches for faster processing
•Query data from a Hive table using Hive QL
•Query data from Spark datasets using Spark SQL
•Connect to a Hadoop cluster from a Talend Job
•Use context variables and metadata
•Read and write files in HDFS or HBase in a Big Data batch or Big Data streaming Job
•Read and write messages in a Kafka topic in real time
•Configure a Big Data batch Job to use the Spark framework
•Configure a Big Data streaming Job to use the Spark streaming framework
•Save logs to Elasticsearch
•Configure a Kibana dashboard
•Ingest a stream of data to a NoSQL database, HBase
Exam Details
•Please note that these questions are meant to test your knowledge of the topics, they are not derived from the actual exam questions.
•Total 65 question are given with time limit 65 Mins