Hadoop Online Training - PowerPoint PPT Presentation

About This Presentation
Title:

Hadoop Online Training

Description:

Learntek is global online training provider on Big Data Analytics, Hadoop, Machine Learning, Deep Learning, IOT, AI, Cloud Technology, DEVOPS, Digital Marketing and other IT and Management courses. We are dedicated to designing, developing and implementing training programs for students, corporate employees and business professional. – PowerPoint PPT presentation

Number of Views:69
Learn more at: http://www.learntek.org
Slides: 11
Provided by: Learntek

less

Transcript and Presenter's Notes

Title: Hadoop Online Training


1
  • HADOOP

2
  • The following topics will be covered in our
  • HADOOP
  • Online Training

3
Hadoop Administration Training Hadoop Cluster
Administration
  • Hadoop Administration Training Learning
    Objectives In this module, you will understand
    what is Big Data and Apache Hadoop, How Hadoop
    solves the Big Data problems, Hadoop Cluster
    Architecture, Introduction to MapReduce
    framework, Hadoop Data Loading techniques, and
    Role of a Hadoop Cluster Administrator.

4
Topics
  • Introduction to Big Data
  • Use cases where Big Data is used.
  • Introduction to Hadoop framework.
  • HDFS File system.
  • Hadoop Architecture
  • MapReduce Framework
  • A typical Hadoop Cluster
  • Hadoop Cluster Administrator Roles and
    Responsibilities, Current Job Market

5
Hadoop Architecture and Cluster setup
  • Learning Objectives After this module, you will
    understand Multiple Hadoop Server roles such as
    NameNode and DataNode, and MapReduce data
    processing. You will also understand the Hadoop
    2.x Cluster setup and configuration, Setting up
    Hadoop Clients using Hadoop 2.x, and important
    Hadoop configuration files and parameters.

6
Hadoop Administration Training Topics
Hadoop server roles and their usage. Hadoop Installation and Initial Configuration. Understand Namenode and Datanodes Communication channels. Setup a Single Node Cluster. Namenode Metadatas details. Setup a Multi Node Cluster Deploying Hadoop in pseudo-distributed mode Setup Pass phraseless Access. Rack Awareness. Anatomy of Write and Read,. Replication Pipeline, Data Processing. Installing Hadoop Clients. Scalability best practices. Adding/Removing nodes into/from the cluster.
7
Hadoop Cluster Planning and Managing
  • Learning Objectives  In this module, you will
    understand Planning and Managing a Hadoop
    Cluster, Hadoop Cluster Monitoring and
    Troubleshooting, Analyzing logs, and Auditing.
    You will also understand Scheduling and Executing
    MapReduce Jobs, and different Schedulers.

8
Topics
  • Planning the Hadoop Cluster.
  • Cluster Sizing.
  • Hardware and Software considerations.
  • Managing and Scheduling Jobs.
  • Types of schedulers in Hadoop FIFO, FAIR
    SCHEDULER
  • Setup Queues and Pools for Jobs.
  • Configuring the schedulers and run MapReduce
    jobs.
  • Cluster Monitoring and Troubleshooting.

9
Value Ads (as per latest industry standards)
  • Running Hadoop on cloud Connectivity/administrat
    ion to AWS
  • Installation/administration of Cloudera Mgr HDP
    (Free version)
  • Cluster Monitoring and Troubleshooting.

10
(No Transcript)
Write a Comment
User Comments (0)
About PowerShow.com