Title: Hadoop online training from EasylearningGuru
1Welcome to the World of Big Data Hadoop
2Agenda
- What is Big Data?
- Different Kinds of Big Data
- Big Data Global Market
- Hadoop Global job trends
- What is Hadoop?
3What is Big Data?
- Big data is the term for a collection of data
sets so large and complex that it becomes
difficult to process using on-hand database
management tools or traditional data processing
applications.
4Types of Big Data?
Semi-Structured Data
Traditional RDBMSs deal only with structured data.
Big Data needs a technology that handles
semi-structured and unstructured data as well as
structured data.
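To make the three kinds of data concrete, here is a minimal sketch using only the Python standard library. The sample records (the name, the tags, the sentence) are invented for illustration and do not come from the slides.

```python
import csv
import io
import json

# Structured: fixed columns, every row has the same fields (what an RDBMS expects).
structured = "id,name,age\n1,Asha,31\n"
rows = list(csv.DictReader(io.StringIO(structured)))

# Semi-structured: self-describing, but fields may vary or nest from record to record.
semi_structured = '{"id": 1, "name": "Asha", "tags": ["hadoop", "hive"]}'
record = json.loads(semi_structured)

# Unstructured: free text with no schema; any structure has to be extracted.
unstructured = "Asha wrote: loving the Hadoop course so far!"
tokens = unstructured.split()

print(rows[0]["name"], record["tags"], len(tokens))
```

The structured row maps directly onto a table, the JSON record needs a flexible schema, and the free text needs processing before it can be queried at all.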
5The 3Vs of Big Data
6Sources of Data
Mobile Devices (Tracking all the objects all the
time)
Social Media Networks (All of us are generating
data)
Sensor Technology Networks (Measuring all kinds
of data)
Scientific Instruments (Collecting all sorts of
data)
7Where Big Data is used?
8Facebook Scenario
Facebook generates about 70,000 MB of data per
minute on average.
1 hour: 70,000 MB × 60 = 4.2 million MB
1 day: 4.2 million MB × 24 = 100.8 million MB ≈ 98,438 GB
1 week: 98,438 GB × 7 ≈ 689,000 GB ≈ 690 TB
4 weeks: 690 TB × 4 ≈ 2,760 TB ≈ 2.7 PB
52 weeks: 690 TB × 52 ≈ 35,880 TB ≈ 35 PB
And that's a lot of data!
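The scaling on this slide can be checked with a short script. This is a rough sketch: the 70,000 MB per minute rate is the figure quoted on the slide (not independently verified), the helper names are hypothetical, and the unit base (1,024 versus 1,000) is an assumption that shifts the larger totals by a few percent. Each period is computed from its number of minutes, so a year is the weekly volume times 52.

```python
MB_PER_MINUTE = 70_000  # rate quoted on the slide; treated here as an assumption


def volume_mb(minutes: int, rate_mb_per_min: int = MB_PER_MINUTE) -> int:
    """Total data volume in MB produced over the given number of minutes."""
    return minutes * rate_mb_per_min


def to_unit(mb: float, unit: str, base: int = 1024) -> float:
    """Convert MB to GB, TB or PB; pass base=1000 for decimal units."""
    steps = {"MB": 0, "GB": 1, "TB": 2, "PB": 3}[unit]
    return mb / (base ** steps)


MINUTES_PER_DAY = 60 * 24
PERIODS = {
    "1 hour": 60,
    "1 day": MINUTES_PER_DAY,
    "1 week": MINUTES_PER_DAY * 7,
    "4 weeks": MINUTES_PER_DAY * 28,
    "52 weeks": MINUTES_PER_DAY * 364,
}

for label, minutes in PERIODS.items():
    mb = volume_mb(minutes)
    print(f"{label:>8}: {mb:>15,} MB = {to_unit(mb, 'TB'):,.1f} TB")
```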
9Various Bigdata Technologies
10Big Data Global Market
Sources Dice, LinkedIn.
11Hadoop Global Job Trends
More than 17,000 employees with Hadoop skills
across these companies
Top Hadoop Technology Companies
Sources Dice, LinkedIn.
12Hadoop Global Job Trends
Sources Dice, LinkedIn.
13What is Hadoop?
Hadoop was created by Doug Cutting and Mike
Cafarella. Hadoop provides a reliable shared
storage and analysis system. It is designed to
scale up from a single server to thousands of
machines, with a high degree of fault
tolerance.
14Hadoop History
15Hadoop Core Components
- Core Hadoop has two main systems:
- Hadoop Distributed File System (HDFS): a
distributed file system that holds large amounts
of data across multiple nodes in a cluster.
- MapReduce: a distributed programming paradigm
used to analyze the data stored in HDFS.
16Hadoop Distributed File System (HDFS)
- A given file is broken down into blocks
(default 64 MB in Hadoop 1.x, 128 MB from
Hadoop 2.x), then the blocks are replicated
across the cluster (default replication factor 3).
- Optimized for throughput.
- HDFS allows you to put/get/delete files.
- Follows the philosophy: write once, read many
times.
- Block replication provides durability, high
availability and throughput.
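The block and replication figures above translate into simple arithmetic. The sketch below is illustrative only: `hdfs_footprint` is a hypothetical helper, not part of any Hadoop API, and it assumes the slide's defaults of 64 MB blocks and a replication factor of 3. It relies on the fact that the last block of a file only occupies its actual size.

```python
import math


def hdfs_footprint(file_size_mb: float, block_size_mb: int = 64, replication: int = 3) -> dict:
    """Estimate how a file is split into blocks and how much raw storage it uses."""
    blocks = math.ceil(file_size_mb / block_size_mb) if file_size_mb > 0 else 0
    # The final block is not padded: it holds only the leftover bytes.
    last_block_mb = file_size_mb - (blocks - 1) * block_size_mb if blocks else 0
    return {
        "blocks": blocks,
        "last_block_mb": last_block_mb,
        "block_replicas": blocks * replication,      # block copies spread over the cluster
        "raw_storage_mb": file_size_mb * replication,  # disk space consumed cluster-wide
    }


print(hdfs_footprint(200))
```

For a 200 MB file this gives three full 64 MB blocks plus one 8 MB block, each stored three times. Passing `block_size_mb=128` models the newer default.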
17MapReduce Flow
18MapReduce Framework
MapReduce works by breaking the processing into
two phases: the Map phase and the Reduce phase.
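The two phases can be illustrated with the classic word-count example. This is a single-process sketch in plain Python, not Hadoop's actual API: the function names are hypothetical, and the `shuffle` step stands in for the grouping by key that the framework performs between the map and reduce phases.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple


def map_phase(line: str) -> Iterator[Tuple[str, int]]:
    """Map: emit a (word, 1) pair for every word in one input line."""
    for word in line.lower().split():
        yield word, 1


def shuffle(pairs: Iterable[Tuple[str, int]]) -> Dict[str, List[int]]:
    """Group all emitted values by key, as the framework does between phases."""
    grouped: Dict[str, List[int]] = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key: str, values: List[int]) -> Tuple[str, int]:
    """Reduce: combine all values for one key into a single result."""
    return key, sum(values)


def word_count(lines: Iterable[str]) -> Dict[str, int]:
    mapped = (pair for line in lines for pair in map_phase(line))
    return dict(reduce_phase(k, v) for k, v in shuffle(mapped).items())


print(word_count(["big data", "big hadoop"]))
```

In a real cluster the map calls run in parallel on the nodes holding the input blocks, and the reduce calls run in parallel per key group; here everything runs sequentially to keep the flow visible.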
20What we offer
22Syllabus
- Introduction
  - Big Data
  - Hadoop
- Hadoop
  - HDFS
  - MapReduce
- Pig
  - Pig 1
  - Pig 2
- Hive
  - Hive 1
  - Hive 2
- HBase
- ZooKeeper
- Sqoop
- YARN
- Project Class
23Thank you for watching the Live Demo for
Hadoop. You can always contact us. Your
queries are always welcome.
- Phone: +91 124 4763660 (India)
- Email: contact@easylearning.guru
- Skype ID: easylearning.guru
- Website: www.easylearning.guru