7 KEY TECHNOLOGIES SHAPING THE HADOOP ECOSYSTEM - PowerPoint PPT Presentation

About This Presentation
Title:

7 KEY TECHNOLOGIES SHAPING THE HADOOP ECOSYSTEM

Description:

Hadoop Online Training and Hadoop Corporate Training services. We designed our syllabus to match real-world requirements, from beginner level to advanced level.

Slides: 12
Provided by: genesissarah
Tags: hadoop


Transcript and Presenter's Notes

Title: 7 KEY TECHNOLOGIES SHAPING THE HADOOP ECOSYSTEM


1
7 KEY TECHNOLOGIES SHAPING THE HADOOP ECOSYSTEM
2
  • Key Technologies
  • WEB NOTEBOOKS
  • ALGORITHMS FOR MACHINE LEARNING
  • SQL ON HADOOP
  • DATABASES
  • STREAM PROCESSING TECHNOLOGIES
  • MESSAGING PLATFORMS
  • GLOBAL RESOURCE MANAGEMENT

3
  • 1. WEB NOTEBOOKS
  • Web notebooks are a way to write code inside the
    web browser and have it run against a cluster of
    servers. For the most part, web notebooks support
    languages such as Scala and Python, as well as
    more basic languages such as HTML and Markdown,
    which allow the creation of a notebook that can
    be presented more easily. Integration of SQL
    into web notebooks has also become a more
    popular feature, although the capabilities of
    web notebooks vary greatly.
  • Jupyter features a pluggable kernel architecture
    so that support for more languages can be
    integrated into the Jupyter platform. It now
    supports more than 50 languages with an
    easy-to-use interface.

4
  • Perhaps the most popular web notebook currently
    in use is Jupyter, which was initially called
    IPython. Due to the growing need for a simple way
    to write and execute code, Jupyter evolved
    quickly.

5
  • 2. ALGORITHMS FOR MACHINE LEARNING
  • The use of machine-learning algorithms is a hot
    topic, and there are a number of important
    reasons for this. The first is that most people
    can see the potential of using machine-learning
    algorithms to gain insights into the data they
    have. Whether for creating a recommendation
    engine, personalizing a website, identifying
    anomalies, or detecting fraud, interest in this
    area is strong.
  • The best way to gain a better understanding of
    machine-learning algorithms is by reading these
    free books by Ted Dunning and Ellen Friedman,
    which cover these topics in a very concise and
    easy-to-consume way. Practical Machine Learning:
    A New Look at Anomaly Detection and Practical
    Machine Learning: Innovations in Recommendation
    can each be read within a few hours.

6
  • 3. STREAM PROCESSING TECHNOLOGIES
  • It seems these days that everyone wants their
    stream processing framework to be "the" framework
    used. There are so many projects (free and paid)
    in this space that it can make your head spin:
    Apache Flink, Spark Streaming, Apache Apex
    (incubating), Apache Samza, Apache Storm, and
    Akka Streams, as well as StreamSets, and this
    isn't even an exhaustive list.

7
  • 4. MESSAGING PLATFORMS
  • While stream processing engines are hot,
    messaging platforms are probably hotter. They can
    be used to create scalable architectures and are
    taking off like crazy across many organizations.
  • Companies such as LinkedIn have started making
    messaging platforms cool again. The project it
    contributed to the Apache Foundation, Apache
    Kafka, has created a really solid and
    easy-to-use API, and this API has now become
    something of a de facto standard.
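
The decoupling that makes messaging platforms useful for scalable architectures can be sketched with a toy in-memory broker. This is only an illustration under stated assumptions: `ToyBroker`, `publish`, and `poll` are hypothetical names invented here, not Kafka's API. It shows producers appending to a topic while each consumer group tracks its own read position.

```python
from collections import defaultdict


class ToyBroker:
    """In-memory stand-in for a messaging platform (hypothetical, not Kafka's API)."""

    def __init__(self):
        # Each topic is an append-only list of messages.
        self._topics = defaultdict(list)
        # Each (group, topic) pair remembers how far it has read.
        self._offsets = defaultdict(int)

    def publish(self, topic, message):
        """Append a message to a topic and return its offset."""
        log = self._topics[topic]
        log.append(message)
        return len(log) - 1

    def poll(self, group, topic, max_messages=10):
        """Return unread messages for a consumer group and advance its offset."""
        start = self._offsets[(group, topic)]
        batch = self._topics[topic][start:start + max_messages]
        self._offsets[(group, topic)] = start + len(batch)
        return batch


broker = ToyBroker()
for event in ("login", "click", "logout"):
    broker.publish("events", event)

# Two independent consumer groups read the same topic at their own pace.
analytics_first = broker.poll("analytics", "events", max_messages=2)
analytics_rest = broker.poll("analytics", "events")
audit_all = broker.poll("audit", "events")
```

Because the broker keeps the messages and each group keeps only an offset, adding a new consumer does not require any change to the producers.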

8
  • 5. SQL ON HADOOP
  • Apache Hive is the SQL-on-Hadoop technology that
    has been around the longest, and is probably the
    most widely used. The Hive Metastore can be used
    by other technologies, such as Apache Drill. The
    benefit in this case is that Drill can read the
    metadata from Hive and then run the queries
    itself, rather than relying on the Hive MapReduce
    runtime. This approach is significantly faster
    and is one of the preferred ways of using Hive.
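
The split between a shared metadata catalog and a separate query engine can be sketched in a few lines. This is a toy model, not Hive's or Drill's actual interface: `METASTORE`, `STORAGE`, `scan`, and `total_by_region` are hypothetical names, and the "engine" is plain Python that looks up the table definition and then reads the data itself.

```python
import csv
import io

# Hypothetical stand-in for a metastore: table name -> schema and data location.
METASTORE = {
    "orders": {
        "columns": [("order_id", int), ("region", str), ("amount", float)],
        "location": "orders.csv",
    }
}

# Hypothetical stand-in for files held in distributed storage.
STORAGE = {
    "orders.csv": "1,north,20.5\n2,south,7.0\n3,north,12.25\n",
}


def scan(table):
    """Look up the table in the metastore, then read and type the rows directly."""
    meta = METASTORE[table]
    reader = csv.reader(io.StringIO(STORAGE[meta["location"]]))
    for raw in reader:
        yield {name: cast(value) for (name, cast), value in zip(meta["columns"], raw)}


def total_by_region(table):
    """Roughly: SELECT region, SUM(amount) FROM <table> GROUP BY region."""
    totals = {}
    for row in scan(table):
        totals[row["region"]] = totals.get(row["region"], 0.0) + row["amount"]
    return totals


totals = total_by_region("orders")
```

The catalog only describes where the data lives and how it is typed, so any engine that can read it is free to execute the query with its own runtime.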

9
  • 6. DATABASES
  • Databases in the big data space are commonly
    referred to as NoSQL databases. This term is
    flawed, as nonrelational databases are what is
    usually being discussed. Many of the NoSQL
    databases can actually be queried with SQL
    through tools such as Apache Drill. To be clear,
    there is nothing inherently wrong with a
    relational database. It's just that most people
    have used them for storing nonrelational data
    for quite some time, and the newer technologies
    have now greatly simplified the storage and
    access of nonrelational data.

10
  • 7. GLOBAL RESOURCE MANAGEMENT
  • Resource management relates to the ability to
    constrain the resources (CPU and memory) of an
    application. Apache Mesos was created to be a
    general-purpose resource manager for everything
    in the data center, or even across multiple data
    centers. Apache YARN was created to be a Hadoop
    resource manager.
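
Constraining an application's CPU and memory can be sketched as a simple admission check. This is a minimal toy model, not how Mesos or YARN are implemented: `Node`, `ToyResourceManager`, and `allocate` are hypothetical names, and placement uses a naive first-fit rule chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass
class Node:
    """A machine with a fixed CPU and memory budget (hypothetical model)."""
    cpus: float
    memory_mb: int
    used_cpus: float = 0.0
    used_memory_mb: int = 0

    def can_fit(self, cpus, memory_mb):
        """True if the request fits within what is still free on this node."""
        return (self.used_cpus + cpus <= self.cpus
                and self.used_memory_mb + memory_mb <= self.memory_mb)


class ToyResourceManager:
    def __init__(self, nodes):
        self.nodes = nodes
        self.placements = {}  # application name -> node index

    def allocate(self, app, cpus, memory_mb):
        """Place the request on the first node with room, or return None."""
        for index, node in enumerate(self.nodes):
            if node.can_fit(cpus, memory_mb):
                node.used_cpus += cpus
                node.used_memory_mb += memory_mb
                self.placements[app] = index
                return index
        return None


manager = ToyResourceManager([Node(cpus=4, memory_mb=8192),
                              Node(cpus=2, memory_mb=4096)])
first = manager.allocate("etl-job", cpus=3, memory_mb=6144)
second = manager.allocate("web-app", cpus=2, memory_mb=2048)
third = manager.allocate("ml-training", cpus=2, memory_mb=4096)
```

Here the third request is refused because neither node has enough free CPU left, which is the kind of constraint a resource manager enforces on behalf of the whole cluster.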

11
  • Thank you
  • Visit us - http://www.traininginrajajinagar.in/Hadoop-training-in-rajaji-nagar