Next class starting
January 12, 2019
This course will be delivered through LIVE Instructor Led Online Training. There will be 8 online sessions, each session being of 2 hours.
Each session will have presentation about theory, concepts and technology, followed by Hands-on Lab practice exercises.
Each session will be recorded and the recordings, along with training material, code samples, will be uploaded on Microsoft cloud and shared with students.
16 hours over 4 weekends
Saturday and Sunday each weekend
10:00 AM - 12:00 PM US Eastern time each day
Please check your local date and time for first session
Video Conference Details
Will be sent once you register and payment is received
This course is meant for IT professionals who are Database Administrators, Systems Admins, Developers, Testers, Solutions Architect, Release Engineers, Cloud Professionals and others who want to build a career in big data and hadoop.
Desired but not required - Exposure to, Working proficiency of BI, sql, scripting, how to handle and manage data and databases, using Excel.
Some activities will require some prior programming experience, preferably in Python or Scala.
A basic familiarity with the Linux command line will be very helpful.
You will need access to a PC running 64-bit Windows, MacOS, or Linux with an Internet connection, if you want to participate in the hands-on activities and exercises.
You must have at least 8GB of free RAM on your system; 10GB or more is recommended. If your PC does not meet these requirements, you can still follow along in the course without doing hands-on activities.
A Microsoft cloud Azure account will be provided to every student where they will install hortonworks hadoop on the cloud virtual machines. Students will carry out the hands-on lab exercises with instructor guidance.
1. Big Data Basics
An introduction to Big Data?
Why is Big Data? Why now?
The Three Dimensions of Big Data (Three Vs)
Evolution of Big Data
Big Data versus Traditional RDBMS Databases
Big Data versus Traditional BI and Analytics
Big Data versus Traditional Storage
Key Challenges in Big Data adoption
Benefits of adoption of Big Data
Introduction to Big Data Technology Stack
Apache Hadoop Framework
Introduction to Microsoft HDInsight – Microsoft’s Big Data Service
Creating Azure Storage Account
Creating HDInsight Cluster
Using services on HDInsight Cluster
2. The Big Data Technology Stack
Basics of Hadoop Distributed File System (HDFS)
Basics of Hadoop Distributed Processing (Map Reduce Jobs)
Loading files to Azure storage account
Moving files across HDInsight Cluster
Remote Access to Azure Storage Account and HDInsight Cluster
3. Deep dive into Hadoop Storage System (HDFS) (1 Hour)
Reading files with HDFS
Writing files with HDFS
Accessing Hadoop configuration files using HDInsight Cluster
4. Processing Big Data –MapReduce and YARN
How MapReduce works
Handling Common Errors
Bottlenecks with MapReduce
How YARN (MapReduceV2) works
Difference between MR1 and MR2
Running a simple MapReduce application (word count)
Running a custom MapReduce application (census data)
Running MapReduce via PowerShell
Running a MapReduce application using PowerShell
Monitoring application status
5. Big Data Development Framework
Introduction to HIVE
Introduction to PIG
Loading the data into HIVE
Submitting Pig jobs using HDInsight
Submitting Pig jobs via PowerShell
6. Big Data Integration and Management
Big Data Integration using Polybase
Big Data Management using Ambari
Fetching HDInsight data into SQL
Using Ambari for managing HDInsight cluster
7. Store and query your data with Sqoop, Hive, MySQL,
8. Design real-world systems using the Hadoop ecosystem
9. Learn how your cluster is managed with YARN, Mesos, Zookeeper,
10. Handle streaming data in real time with Kafka, Spark Streaming
1. Class recordings will be made available. 2. Post class support3. Course material available. 4. Software access5. Career advancement and Job placement assistance
100% refund will be provided only if we DO NOT hold the class and/or we reschedule the class and the new dates and timings don't work for you.
If the class is held as per schedule, you don't show up or you register, purchase a training ticket and then change your mind, we will not issue a refund.