Learn Big Data Engineering with Hadoop & Spark
Assalam Aleikom Ummah
Big Bang Data Science Solutions has launched another 8 Week program in Big Data Engineering with Hadoop & Spark for only $799 with 10% early bird registration – promo code is ALMPG10
The batch starts in October 20th and ends in Dec 20th
Link to the program
http://bbds.ma/big-data-engineering-with-hadoop-spark/
Link to register
https://camperregsecure.com/bbds/index.php
Program Schedule
Week 1: Introduction to Big Data and HDFS
Topics Covered
• Introduction to Big Data
• Why Big Data?
• Characteristics of Big Data – 4 Vs
• Applications of Big Data
• Introduction to Hadoop
• HDFS – Hadoop Distributed file system
• Components of HDFS
• HDFS terminology
• HDFS Federation
• HDFS high availability
• Role of zoo keeper
• Replica pipeline and network distance algorithm
• HDFS Read and Write
• Installing Hadoop in Windows/Mac using Cloudera Quickstart VM
Week 2 : Basic Map Reduce
Topics Covered
• Introduction to Map Reduce Framework
• Mapper and Reducer APIs • First Map Reduce program – Word Count
• Map Reduce examples – Inverted Index and Titanic Data Analysis
• Modes of execution
• Job execution in MRV1 VS YARN
• Serialization and Deserialization • Writable Classes
• Input and Output Formats
• Distributed Cache
Week 3 : Advanced Map Reduce
Topics Covered
• Using Partitioner and Combiner classes
• Joins – Map side Join and Reduce side Join
• Counters • Sequence File Format
• Optimizing techniques of MR jobs
• Speculative Execution
• Hadoop Streaming and Pipes
Week 4 : Hive
Topics Covered
• Introduction to Hive
• RDBMS VS Hive
• Hive DDL : Managed Table VS External Table
• Issues with delimiters
• Hive Architecture
• Partioning – Static and Dynamic
• Bucketing
• Dealing JSON data – using JSON SerDe
• Hive UDF
• Creating Views
Week 5 : Pig and No SQL Database - HBase
Topics Covered
• Introduction to Pig
• Why Pig ?
• Motivation by example
• SQL vs Pig
• Modes of Running Pig
• Introduction to Pig Latin
• Pig Latin Data types
• Pig Latin Operators
• Type casting and validation • Process of Pig Latin Processing
• Pig UDF with example
• What is a No SQL Database ?
• Why Hbase ?
• Introduction to Hbase • Hbase high level architecture
• Hbase commands
• Indepth architectural view of Hbase
• Java APIs for Hbase operations
• Bulk Load using Table Mapper and Table Reducer API
• Bulk Load using import TSV tool from a file
Week 6 : Sqoop ,Oozie and Scala
Topics Covered
• Introduction to Sqoop
• Sqoop Architecture
• Sqoop import and Export with Examples
• Introduction to Oozie
• Oozie workflow
• Oozie Action Tags
• Oozie Parametrization
• Programming concepts of Scala
Week 7 : Apache Spark
Topics Covered
• Introduction to Spark
• Why Spark?
• Applications of Spark
• Spark Terminology
• RDD
• Architecture of Spark
• Transformations and Actions
• RDD Hierarchy
• Lazy Execution
• Shared Variables
• RDD persistence
• Spark SQL – Data Frames , Data Sets and SQL
• Realtime streaming with Kafka and Spark Streaming
About the trainer : Suresh Gilakamsettii
. Currently working as a Big Data Engineer with MNC
. Industry Certified Big Data Developer
. 5 years of IT industry experience
. Worked for Google, EY, Nissan, TD Bank etc
. He developed End-to-End solutions for Real-Time Streaming applications using Big Data tools
. Published a paper on "Time Series Classification using kernel weighted k-NN and DTW algorithms" in an International conference
. Training experience of 2 years and worked for various online training institutes like Edureka, Collabera TACT as a freelancer
. Trained more than 1000 associates on Big Data and Analytics
Key Highlights | Big Data Data Science Hadoop Spark |
Time | 8 Weeks |
Training | Online - Paid |
Fees | $799 - Use ALMPG10 to get 10% discount if you register before the start date |
Discussion ( 0 )