9448474282
learn@ismuniv.com

Hadoop Administration

Certified Course on Hadoop Administration Training:

Hadoop Training: The Hadoop Cluster Administration training course is designed to provide knowledge and skills to become a successful Hadoop Architect. It starts with the fundamental concepts of Apache Hadoop and Hadoop Cluster. It covers topics to deploy, configure, manage, monitor, and secure a Hadoop Cluster.

Best Hadoop Training Institute:

The course will also cover HBase Administration. There will be many challenging, practical and focused hands-on exercises for the learners. By the end of this Hadoop Cluster Administration training, you will be prepared to understand and solve real world problems that you may come across while working on Hadoop Cluster.


ades-prospectusCertified course on HADOOP ADMINISTRATION

Download Prospectus


Course outline

  1. Introduction to Big Data
  • What is Big Data ?
  • Big Data Facts
  • The Three V’s of Big Data
  1. Understanding Hadoop
  • What is Hadoop ?
  • Why learn Hadoop ?
  • Relational Databases Vs. Hadoop
  • Motivation for Hadoop
  • 6 Key Hadoop Data Types
  1. The Hadoop Distributed File system (HDFS)
  • What is HDFS ?
  • HDFS components
  • Understanding Block storage
  • The Name Node
  • The Data Nodes
  • Data Node Failures
  • HDFS Commands
  • HDFS File Permissions
  1. The MapReduce Framework
  • Overview of MapReduce
  • Understanding MapReduce
  • The Map Phase
  • The Reduce Phase
  • WordCount in MapReduce
  • Running MapReduce Job
  1. Planning Your Hadoop Cluster
  • Single Node Cluster Configuration
  • Multi-Node Cluster Configuration
  1. Cluster Maintenance
  • Checking HDFS Status
  • Breaking the cluster
  • Copying Data Between Clusters
  • Adding and Removing Cluster Nodes
  • Rebalancing the cluster
  • Name Node Metadata Backup
  • Cluster Upgrading
  1. Installing and Managing Hadoop Ecosystem Projects
  • Sqoop
  • Flume
  • Hive
  • Pig
  • HBase
  • Oozie
  1. Managing and Scheduling Jobs
  • Managing Jobs
  • The FIFO Scheduler
  • The Fair Schedule
  • How to stop and start jobs running on the cluster
  1. Cluster Monitoring, Troubleshooting, and Optimizing
  • General System conditions to Monitor
  • Name Node and Job Tracker Web Uis
  • View and Manage Hadoop’s Log files
  • Ganglia Monitoring Tool
  • Common cluster issues and their resolutions
  • Benchmark your cluster’s performance
  1. Populating HDFS from External Sources
  • How to use Sqoop to import data from RDBMSs to HDFS
  • How to gather logs from multiple systems using Flume
  • Features of Hive, Hbase and Pig
  • How to populate HDFS from external Sources

Eligibility : Basics of Linux

Duration : 45 hours

Course Fee : Please contact us …..


The course Includes:

  1. 80% Instructor lead Training (36 hrs )
  2. 20% Online training ( 09 hrs )
  3. Course Materials
  4. Certificate
  5. Placement Guidance
  6. Recaps & Tests

Value Addition:

  1. 2 e-learning courses
  2. Soft copy of all software used in the course
  3. Access to E-library during course period

Share