Spark Specialization Training |
|
|
|
|
|
COURSE ID |
|
GES-SP |
|
DURATION |
|
39hrs |
|
Course Fee |
|
$$2100+HST |
|
DELIVERY METHOD |
|
Class room and online training |
|
COURSE OVERVIEW |
|
|
|
AUDIENCE |
|
|
|
PREREQUISITES |
|
|
|
COURSE OBJECTIVES |
|
|
|
COURSE OUTLINE |
|
Intro to Hadoop
- RDBMS vs Hadoop
- Ecosystem tour (9 products)
- Vendor comparison (Cloudera, Hortonworks, MapR, Amazon EMR)
- Hardware Recommendations
HDFS: File System details
- Heartbeats
- Rack awareness
Hands-on on Hadoop machine?
- Introduction to Hadoop FS and Processing Environment?s UIs
- How to read and write files
- Basic Unix commands for Hadoop
- Hadoop ?FS shell
- Hadoop releases practical
- Hadoop daemons practical?
Hive Introduction
- Meta storage and meta store
- Hive Data types
- HQL
- DDL, DML and sub languages of Hive
- Internal , external and Temp tables in Hive
- Differentiation between SQL based Datawarehouse and Hive?
- Hive releases
- Why Hive is not best solution for OLTP
- OLAP in Hive
- Partitioning
- Bucketing
- Hive Architecture
- Hue Interface for Hive
- Complex Use cases in Hive
- Hive Advanced Assignment
- Real time scenarios of Hive
- POC on Pig and Hive , With real time data sets and problem statements?
End to End execution flow of Map Reduce job
- Different tasks in Map Reduce job
- Introduction to Combiner
- Introduction to Partitioner
- POC based on Pig, Hive, HDFS, MR?
Introduction to NOSQL
- Why NOSQL if SQL is in market since several years
- Databases in market based on NOSQL
- OLTP Solutions with different capabilities
- Which Nosql based solution is capable to handle specific requirements
- Examples of companies like Google, Facebook, Amazon, and other clients who are using NOSQL based databases
HBase Architecture of column families?
- Introduction to HBase
- Introduction to other NOSQL based data models
- Drawbacks of Hadoop
- Why Hadoop can?t work for real time processing
- HBase table and column family structure
- HBase versioning concept
- HBase flexible schema
- HBase Advanced?
Introduction to Zookeeper
- How Zookeeper helps in Hadoop Ecosystem
- How to load data from Relational storage in Hadoop
- Sqoop basics
- Sqoop practical implementation
- Sqoop alternative
- Sqoop connector
- Quick revision of previous classes to fill the gap in your understanding and correct understandings
- Nifi
YARN
- YARN vs v1 of hadoop
- Introduction to YARN
- Significance of YARN?
Introduction to Hue
- Real time Hadoop usage
- Real time cluster introduction
- Hadoop real time project
- Real time problems and frequently faced errors with solution?
Introduction to Spark
- Why Spark demand is increasing in market
- How can we use Spark with Hadoop Eco System
- Datasets for practice purpose?
- Spark use cases with ?real time scenarios
Apache Spark
- Why Spark demand is increasing in market
- Introduction to Spark
- Introduction to scala
- Spark Shell
- Spark Context
- RDD
- Spark Architecture
- Transformations
- Actions
- Caching
- Spark SQL
Real time project use cases examples based on Spark and Scala |
|
SOFTWARE |
|
|
|
CONTACT INFORMATIONwe |
|
Mississauga: 1065 Canadian Place, Suite 201, Mississauga ON L4W 0C2 Scarborough: 2401 Eglinton Ave E, Suite 304, Scarborough(Eglinton & Kennedy) ON M1K 2M5 Montreal: 279 Rue Sherbrooke O, Suite 209, Montreal, QC H2X 1Y2 PH: +1-514-664-3900
Phone: +1 416-623-9493 or +1 416-333-3717
E-mail: training@globalerp.ca |
|