Impala-an Open Source SQL Engine for Hadoop


The ‘Impala-an Open Source SQL Engine for Hadoop’ is an ideal course package for individuals who want to understand the basic concepts of Massively Parallel Processing or MPP SQL query engine that runs on Apache Hadoop. On completing this course, learners will be able to interpret the role of Impala in the Big Data Ecosystem. The course focuses on the basics of Impala. It further provides an overview of the superior performance of Impala, against other popular SQL-on-Hadoop systems.

Price (*ask for discount) 150 USD
Access Period 180 days

Prerequisite list

  • There are no prerequisites for this course.

Audience list

  • Analysts
  • Data scientists
  • Hadoop administrator and developers
  • SQL developers
  • Data warehouse developers
  • Database administrators and developers

What is included

  • 5 hours of self-paced video.
  • Includes 6 high-quality demos covering important topics.
  • Includes 1 Impala simulation exam.
  • Includes 12 chapter-end quizzes and downloadable e-book.
  • Course completion certificate.

Certification Info

  • How To Earn?  Complete 85% of the course. Complete 1 simulation test with a minimum score of 60%.
  • How To Maintain?  N/A

Certification Exam Format

  • No Exam

Retake policy

  • N/A.

Enrollment Policy

  • You should pay the online course fee then the online course access will be granted to you within 1 week after receiving payment.
  • Course fee payment is not refundable.

Frequently Asked Questions

Course Outline

Introduction to Impala
  • Objectives
  • What is Impala
  • Benefits of Impala
  • Exploratory Business Intelligence
  • Impala Installation
  • Demo - Using Cloudera Manager for Impala
  • Starting and Stopping Impala
  • Demo - Starting Impala from Command Line
  • Data Storage
  • Managing Metadata
  • Controlling Access to Data
  • Impala Shell Commands and Interface
  • Demo - Launching Impala Shell and Shell Command
  • Quiz
  • Summary
Querying with Hive and Impala
  • Objectives
  • SQL Language Statements
  • DDL Statements
  • DML Statements
  • CREATE TABLE - Examples
  • Internal and External Tables
  • Loading Data into Impala Table
  • DESCRIBE Statement
  • EXPLAIN Statement
  • SHOW TABLE Statement
  • INSERT Statement
  • INSERT Statement - Examples
  • SELECT Statement
  • Data Type
  • Operators
  • Functions
  • CREATE VIEW in Impala
  • Hive and Impala Query Syntax
  • Demo - Using Impala Shell for DDL and DDML SQL Statements
  • Quiz
  • Summary
Data Storage and File Format
  • Objectives
  • Partitioning Tables
  • SQL Statements for Partitioned Tables
  • File Format and Performance Considerations
  • Choosing File Type and Compression Technique
  • Demo - File Formats and Compression Techniques
  • Quiz
  • Summary
Working with Impala
  • Objectives
  • Impala Architecture
  • Impala Daemon
  • Impala Statestore
  • Impala Catalog Service
  • Query Execution Flow in Impala
  • User - Defined Functions
  • Hive UDFs with Impala
  • Demo - UDF in Impala
  • Improving Impala Performance
  • Quiz
  • Summary