Home  |  About Us  |  Careers |  Resources  |  Client List  |  Sitemap  |  Contact Us

Call: +1 416-333-3717

Toll Free: 1-866-955-4526
E-mail: info@globalerp.ca

Software Quality Assurance Training Business Analysis Training Microsoft BI Training Azure Business Intelligence Training Cyber Security SAP Training DevOps Training PMP Training Informatica Training JAVA Training MICROSOFT .NET Training Big Data & Hadoop Training Scrum Training ORACLE Admin Training

Spark Specialization Training
  DURATION   39hrs
  Course Fee   $$2100+HST
  DELIVERY METHOD   Class room and online training

Intro to Hadoop

  • RDBMS vs Hadoop
  • Ecosystem tour (9 products)
  • Vendor comparison (Cloudera, Hortonworks, MapR, Amazon EMR)
  • Hardware Recommendations

HDFS: File System details

  • Heartbeats
  • Rack awareness

Hands-on on Hadoop machine?

  • Introduction to Hadoop FS and Processing Environment?s UIs
  • How to read and write files
  • Basic Unix commands for Hadoop
  • Hadoop ?FS shell
  • Hadoop releases practical
  • Hadoop daemons practical?

Hive Introduction

  • Meta storage and meta store
  • Hive Data types
  • HQL
  • DDL, DML and sub languages of Hive
  • Internal , external and Temp tables in Hive
  • Differentiation between SQL based Datawarehouse and Hive?
  • Hive releases
  • Why Hive is not best solution for OLTP
  • OLAP in Hive
  • Partitioning
  • Bucketing
  • Hive Architecture
  • Hue Interface for Hive
  • Complex Use cases in Hive
  • Hive Advanced Assignment
  • Real time scenarios of Hive
  • POC on Pig and Hive , With real time data sets and problem statements?

End to End execution flow of Map Reduce job

  • Different tasks in Map Reduce job
  • Introduction to Combiner
  • Introduction to Partitioner
  • POC based on Pig, Hive, HDFS, MR?

Introduction to NOSQL

  • Why NOSQL if SQL is in market since several years
  • Databases in market based on NOSQL
  • OLTP Solutions with different capabilities
  • Which Nosql based solution is capable to handle specific requirements
  • Examples of companies like Google, Facebook, Amazon, and other clients who are using NOSQL based databases

HBase Architecture of column families?

  • Introduction to HBase
  • Introduction to other NOSQL based data models
  • Drawbacks of Hadoop
  • Why Hadoop can?t work for real time processing
  • HBase table and column family structure
  • HBase versioning concept
  • HBase flexible schema
  • HBase Advanced?

Introduction to Zookeeper

  • How Zookeeper helps in Hadoop Ecosystem
  • How to load data from Relational storage in Hadoop
  • Sqoop basics
  • Sqoop practical implementation
  • Sqoop alternative
  • Sqoop connector
  • Quick revision of previous classes to fill the gap in your understanding and correct understandings
  • Nifi


  • YARN vs v1 of hadoop
  • Introduction to YARN
  • Significance of YARN?

Introduction to Hue

  • Real time Hadoop usage
  • Real time cluster introduction
  • Hadoop real time project
  • Real time problems and frequently faced errors with solution?

Introduction to Spark

  • Why Spark demand is increasing in market
  • How can we use Spark with Hadoop Eco System
  • Datasets for practice purpose?
  • Spark use cases with ?real time scenarios

Apache Spark

  • Why Spark demand is increasing in market

  • Introduction to Spark

  • Introduction to scala

  • Spark Shell

  • Spark Context

  • RDD

  • Spark Architecture

  • Transformations

  • Actions

  • Caching

  • Spark SQL

  • Real time project use cases examples based on Spark and Scala

  •   SOFTWARE  

    Mississauga: 1065 Canadian Place, Suite 201, Mississauga ON L4W 0C2
    Scarborough: 2401 Eglinton Ave E, Suite 304, Scarborough(Eglinton & Kennedy) ON M1K 2M5
    Montreal: 279 Rue Sherbrooke O, Suite 209, Montreal, QC H2X 1Y2 PH: +1-514-664-3900

    Phone: +1 416-623-9493 or +1 416-333-3717

    E-mail: training@globalerp.ca