Cloudera Data Analyst Training (CDAT)

Course Description Schedule Course Outline

Course Overview

Apache Hive makes multi-structured data accessible to analysts, database administrators, and others without Java programming expertise. Apache Pig applies the fundamentals of familiar scripting languages to the Hadoop cluster. Cloudera Impala enables real-time interactive analysis of the data stored in Hadoop via a native SQL environment.

Who should attend

  • Data Analysts
  • Application Developers
  • Database Programmers
  • Data Warehouse Administrators
  • System Administrators


  • A basic understanding of Structured Query Language (SQL) and scripting languages used in SQL is helpful but not required
  • A basic understanding of distributed file system concepts such as clustering, MapReduce, BigTable, MetaData storage concepts are helpful but not required.

Course Objectives

Upon successful completion of this course and it's interactive hands-on exercises, you should be able to:

  • Understand the fundamentals of Apache Hadoop, data ETL (extract, transform, load), ingestion, and processing with Hadoop tools
  • Join multiple data sets and analyzing disparate data with Pig
  • Organize data into tables, perform transformations, and simplify complex queries with Hive
  • Perform real-time interactive analyses on massive data sets stored in HDFS or HBase using SQL with Impala
  • Understand how to pick the best tool for a given task in Hadoop to achieve interoperability, and manage recurring workflows
Classroom Training
Modality: G

Duration 3 days

Enroll now

Currently there are no training dates scheduled for this course.  Enquire a date


Accessing our website tells us you are happy to receive all our cookies. However you can change your cookie settings at any time. Find out more.   Got it!