Hadoop Online Training PowerPoint PPT Presentation

presentation player overlay
About This Presentation
Transcript and Presenter's Notes

Title: Hadoop Online Training


1
  • HADOOP

2
  • The following topics will be covered in our
  • HADOOP
  • Online Training

3
Hadoop Administration Training Hadoop Cluster
Administration
  • Hadoop Administration Training Learning
    Objectives In this module, you will understand
    what is Big Data and Apache Hadoop, How Hadoop
    solves the Big Data problems, Hadoop Cluster
    Architecture, Introduction to MapReduce
    framework, Hadoop Data Loading techniques, and
    Role of a Hadoop Cluster Administrator.

4
Topics
  • Introduction to Big Data
  • Use cases where Big Data is used.
  • Introduction to Hadoop framework.
  • HDFS File system.
  • Hadoop Architecture
  • MapReduce Framework
  • A typical Hadoop Cluster
  • Hadoop Cluster Administrator Roles and
    Responsibilities, Current Job Market

5
Hadoop Architecture and Cluster setup
  • Learning Objectives After this module, you will
    understand Multiple Hadoop Server roles such as
    NameNode and DataNode, and MapReduce data
    processing. You will also understand the Hadoop
    2.x Cluster setup and configuration, Setting up
    Hadoop Clients using Hadoop 2.x, and important
    Hadoop configuration files and parameters.

6
Hadoop Administration Training Topics
Hadoop server roles and their usage. Hadoop Installation and Initial Configuration. Understand Namenode and Datanodes Communication channels. Setup a Single Node Cluster. Namenode Metadatas details. Setup a Multi Node Cluster Deploying Hadoop in pseudo-distributed mode Setup Pass phraseless Access. Rack Awareness. Anatomy of Write and Read,. Replication Pipeline, Data Processing. Installing Hadoop Clients. Scalability best practices. Adding/Removing nodes into/from the cluster.
7
Hadoop Cluster Planning and Managing
  • Learning Objectives  In this module, you will
    understand Planning and Managing a Hadoop
    Cluster, Hadoop Cluster Monitoring and
    Troubleshooting, Analyzing logs, and Auditing.
    You will also understand Scheduling and Executing
    MapReduce Jobs, and different Schedulers.

8
Topics
  • Planning the Hadoop Cluster.
  • Cluster Sizing.
  • Hardware and Software considerations.
  • Managing and Scheduling Jobs.
  • Types of schedulers in Hadoop FIFO, FAIR
    SCHEDULER
  • Setup Queues and Pools for Jobs.
  • Configuring the schedulers and run MapReduce
    jobs.
  • Cluster Monitoring and Troubleshooting.

9
Value Ads (as per latest industry standards)
  • Running Hadoop on cloud Connectivity/administrat
    ion to AWS
  • Installation/administration of Cloudera Mgr HDP
    (Free version)
  • Cluster Monitoring and Troubleshooting.

10
(No Transcript)
Write a Comment
User Comments (0)
About PowerShow.com