Course Details

Big Data Basics Course

Introduction to Big Data

  • Welcome Notes
  • Big Data – Why and Where
  • Characteristics of Big Data and Dimensions of Scalability
  • Data Science: Getting Value out of Big Data
  • Foundations for Big Data Systems and Programming
  • Traditional Data Processing Technologies – Comparison with existing technologies
3 Hours

Hadoop and HDFS

  • Apache Hadoop Architecture and Ecosystem
  • Hadoop Subprojects
  • Hadoop Distributions
  • Setting up Hadoop
  • Installing Hadoop
  • Configuring Hadoop
  • Starting Hadoop
  • Running Hadoop Clients
  • Browsing Hadoop UI Consoles
  • HDFS Architecture
  • Hadoop 1.0 HDFS Architecture
  • Hadoop 1.0 HDFS Architectural Capabilities – Performance, Scalability, Availability, Installability, Configurability, Operability, Usability, Security
  • Hadoop 2.0 HDFS Architecture
  • HDFS API Overview
  • HDFS File CRUD API
  • HDFS Directory CRUD API
  • Differences between NFS and HDFS
6 Hours

MapReduce

  • MapReduce Architecture
  • Hadoop 1.0 MapReduce Architecture
  • Hadoop 1.0 MapReduce Architectural Capabilities – Performance, Scalability, Availability, Installability, Configurability, Operability, Usability, Programmability
  • Hadoop 2.0 MapReduce Architecture
  • MapReduce Programming Basics
  • MapReduce Programming – Map Phase and Reduce Phase
  • MapReduce API – Key Java Classes
  • Steps to Write a MapReduce Program
  • MapReduce Programming Intermediate
3 Hours

Pig

  • Pig Background
  • Architecture
  • Downloading, Installing and Configuring Pig
  • Running Pig
  • Pig Latin Language Basics
  • Core Relational Operators – DISTINCT, FILTER, SPLIT, ORDER BY, LIMIT, GROUP, FOREACH
  • Built-in Functions
3 Hours

Hive

  • Hive Background
  • Hive Architecture
  • Hive data loading practices
  • Hive performance parameters
  • Hive best practices
  • User-defined functions
  • File types and SerDe properties
  • Downloading, Installing and Configuring Hive
  • Simple Hive Example using Hue & Beehive
  • Loading Data into Hive
  • Hive Query Statements using Hue and Beehive
  • Hive Schema Violations
  • Using Built-in Hive Functions
  • Partitioning & Bucketing of Data using Hive
  • Joining Data
6 Hours

Sqoop

  • Introduction
  • Installation
  • Import single tables
  • Import All Tables / databases
  • Export
  • Sqoop Job
  • Codegen
  • Eval
  • List Databases
  • List Tables
3 Hours

HBase/NoSQL

  • HBase Overview
  • NoSQL Databases (Graph/Document/Key-Value)
  • Data Model
  • Architecture
  • Downloading, Installing and Configuring HBase
  • HBase Shell
  • HBase database operations
6 Hours

Project

  • End-to-end project assignment to reinforce the concepts, architecture, and a real-world big data application (using MapR, Hive, Pig, Sqoop, Tableau, etc.)
10 Hours

Interview

  • Interview Preparation / Mock Test / Certification Tips
3 Hours
