Hadoop Admin Online Training PowerPoint PPT Presentation

presentation player overlay
About This Presentation
Transcript and Presenter's Notes

Title: Hadoop Admin Online Training


1
Hadoop Admin Online Training
  • Glory IT
    Technologies

2
Prerequisites
  • Knowledge of Hadoop and Distributed Computing.

3
Module 1 Introduction to Hadoop
  • The amount of data processing in todays life
  • What Hadoop is why it is important?
  • Hadoop comparison with traditional systems
  • Hadoop history
  • Hadoop main components and architecture

4
Module 2 Hadoop Distributed File System (HDFS)
  • HDFS overview and design
  • HDFS architecture
  • HDFS file storage
  • Component failures and recoveries
  • Block placement
  • Balancing the Hadoop cluster

5
Module 3 Planning your Hadoop cluster
  • Planning a Hadoop cluster and its capacity
  • Hadoop software and hardware configuration
  • HDFS Block replication and rack awareness
  • Network topology for Hadoop cluster

6
Module 4 Hadoop Deployment
  • Different Hadoop deployment types
  • Hadoop distribution options
  • Hadoop competitors
  • Hadoop installation procedure
  • Distributed cluster architecture

7
Module 5 Working with HDFS
  • Ways of accessing data in HDFS
  • Common HDFS operations and commands
  • Different HDFS commands
  • Internals of a file read in HDFS
  • Data copying with distcp

8
Module 6 -Mapreduce Abstraction
  • What MapReduce is and why it is popular
  • The Big Picture of the MapReduce
  • MapReduce process and terminology
  • MapReduce components failures and recoveries
  • Working with MapReduce

9
Module 7 Hadoop Cluster Configuration
  • Hadoop configuration overview and important
    configuration file
  • Configuration parameters and values
  • HDFS parameters MapReduce parameters
  • Hadoop environment setup
  • Include and Exclude configuration files

10
Module 8 Hadoop Administration and Maintenance
  • Namenode/Data node directory structures and files
  • File system image and Edit log
  • The Checkpoint Procedure
  • Namenode failure and recovery procedure
  • Safe Mode
  • Metadata and Data backup
  • Potential problems and solutions / what to look
    for
  • Adding and removing nodes

11
Module 9 Hadoop Monitoring and Troubleshooting
  • Best practices of monitoring a Hadoop cluster
  • Using logs and stack traces for monitoring and
    troubleshooting
  • Using open-source tools to monitor Hadoop cluster

12
Module 10 Job Scheduling
  • How to schedule Hadoop Jobs on the same cluster
  • Default Hadoop FIFO Schedule
  • Fair Scheduler and its configuration

13
Module 11 Hadoop Multi Node Cluster Setup and
Running Map Reduce Jobs on Amazon Ec2
  • Hadoop Multi Node Cluster Setup using Amazon ec2
    Creating 4 node cluster setup
  • Running Map Reduce Jobs on Cluster

14
Contact us free Demo
  • We stay with you until you get the results you
    want.
  • If you really interested, please let me know .
  • We will arrange the Demo Session.
  • Feel Free to call us any time
  • Thanks RegardsSrinivasGloryITTechnologiesEmai
    lInfo_at_gloryittechnologies.comPhone91-903281345
    6/91-9160177789Skype ID gloryittechnologies

15
  • THANK YOU
Write a Comment
User Comments (0)
About PowerShow.com