Title: Managing your Assets with Big Data Tools (1)
1 Managing your Assets with Big Data Tools
- Karthigai Muthu, MachinePulse
2Agenda
- Big Data value proposition
- Big Data Technology Stack
3Hype Cycle for Emerging Technologies
Source Wikipedia
4Sources of data
12 TBs of tweet data every day
? TBs ofdata every day
25 TBs oflog data every day
5What makes Data Big
Characteristics Description Attributes Drivers
Volume The amount of data generated or intensify that must be ingested, analyzed and managed to make decision based on complete data analysis Exabyte (EB) Zettabyte (ZB) Yottabyte (YB) Increase in data sources Higher resolution sensors Scalable infrastructure
Velocity How fast the data is being produced and changed and the speed at which is transformed into insight Batch Near real time Real time and Streams Rapid feedback loop Improved throughput connectivity Competitive advantage Pre-computed information
Variety The degree of diversity of data from sources both inside and outside an organization Degree of structure Complexity M2M/IoT Social Media Genomics Video and Mobile
Veracity The quality and provenance of data Consistency Completeness Ambiguity Integrity Cost Need of traceability and justification
6Big Datas Greatest Power Predictive Analytics
7Whats driving Big Data
8The Evolution of Business Intelligence
Big Data Real Time Single View Graph
Databases
Interactive Business Intelligence In-memory
RDBMS QlikView, Tableau,HANA
Speed
Scale
BI Reporting OLAP Data warehouse Business
Objects, SAS, Informatica, Cognos other SQL
Reporting Tools
Speed
Scale
Big Data Batch Processing Distributed Data
Store Hadoop/Spark HBase/Cassandra/MongoDB
1990s
2000s
2010s
9Solving business problem with big data
10Formulation of big data strategy
11Companies Market share in Big Data
12Big Data Investments
13Priority for big data across industry
14Are you aware the risk of not implementing Big
Data in your company
15Big data changed connected things to Internet of
Everything(IoE)
16How the industry can leverage from big data
17Challenges in implementing the big data
18Returns of Investment(ROI)
19How do companies get MORE from big data
- Merge
- Optimize
- Respond
- Empower
20Are you planning to launch your new product.
21Customer 360
Social Media
Banking Finance
Our Known History
Gaming
Purchase
Entertain
Customer
22Real-Time Analytics/Decision Requirement
Product Recommendations that are Relevant
Compelling
Friend Invitations to join a Game or
Activity that expands business
Influence Behavior
Improving the Marketing Effectiveness of a
Promotion while it is still in Play
Customer
Learning why Customers Switch to competitors
and their offers in time to Counter
Preventing Fraud as it is Occurring
preventing more proactively
23IoTBig Data IoE(Internet-of-Everything)
24Role of Big Data in M2M/IoT
- Big Data is a factor that will, to a large
extent, determine the future growth rate in the
M2M industry - M2M will connect increasingly more nodes that
will provide data from endpoints. - Data will be more granular, more frequent, and
more accurate, with bigger data sets or even live
data streams - Large volume of endpoint connections IPv4
addressing scheme cant accommodate
everything(sensors, smart phones, smart
factories, smart grids, smart vehicles,
controllers, meters ) that it requires IPv6 - IoE Convergence of IoT, Big Data Analytics
,Cloud Computing and other technologies is
collectively called as Internet of Everything
25Challenges of Big Data in M2M/IoT
- Meeting the need for speed
- Data understanding
- Maintaining data quality
- Displaying the meaningful result
26IoT/M2M Applications..
27Big Data Use Cases IoT/M2M
- Personal IoT the scope is a single person, such
as a smartphone equipped with GPS sensor or a
fitness device that measures the heart rate. This
is one of the fastest growing, consumer-oriented
areas of IoT. - Group IoT the scope is a fairly small group of
people, such as a family in a smart house,
co-workers in a van or a group of tourists. This
is one of the most challenging areas and is still
in its early phase. - Community IoT the scope is a large group of
people, potentially thousands and more usually
this is in a public infrastructure context, such
as smart cities or smart roads. This is a young
and potentially promising IoT area. - Industrial IoT the scope can be within an
organization (smart factory) or between
organizations (retailer supply chain). This is
arguably the most established and mature part of
IoT.
28Big Data Use cases IoT/M2M
- Agriculture - sensors can be deployed on farm
machinery in order to provide data about the
equipment, soil temperature, moisture, etc. - Buildings/Smart Homes - Building sensors be used
to help facility managers become more proactive
about ensuring that their buildings operate at
peak efficiency. - Communities Smart cities make use of parking
space availability systems, intelligent traffic
monitoring systems, intelligent highways,
weather-adaptive street lighting, and more. - Healthcare Infant monitors, smart diapers,
pills with ingestible sensors are just some of
the IOT-based devices. - Manufacturing factories with sensors can
improve operations, product quality, and decrease
safety hazards. - Smartphones can control everything from door
locks, thermostats, light bulbs, vacuum cleaners,
and more. - Utilities smart water meters can be used to
reduce water leaks. Smart electric grids can
adjust rates depending on usage. - Wearables Smart watches, fitness trackers and
health monitors may become primary source for
human-related data, and can also be used in
sports, retail, travel and manufacturing.
29Benefits of Big Data Analytics in M2M/IoT
- Device Maintenance
- a. Time for next patch upgrade
- b. Energy management
- c. Inventory management and track replacement
- 2. Proactive Healthcare
- Capture and analyze real time data from medical
monitors to predict potential health problems
before patients manifest clinical signs of
infection. - 3. Monetize Machine Data
- a. Monitor performance, usage and capacity
details to uncover up-sell and cross-sell
opportunities - b. Maximize the lifespan and performance of high
value medical assets
30Benefits of Big Data Analytics cont..
- 4. Optimize Support Operations
- a. Reduce MTTR and support escalations
- b. Preempt failures with proactive support
- c. Troubleshoot with accurate information
- d. Proactive consultation to customers on
approaching expiry dates
31Big Data Analytics Stack
32Lamda Architecture
33(No Transcript)
34Batch vs. Real-Time processing
- Batch processing
- - Gathering of data and processing as a group at
one time. - - Jobs run to completion
- - Data might be out of date
- Real-time processing
- - Processing of data that takes place as the
information is being entered. - - Run for ever
35Storm
- Apache Storm is a free and open source
distributed real-time computation system. - Storm makes it easy to reliably process unbounded
streams of data, doing for real-time processing
what Hadoop did for batch processing
36Storm Is
-
- Stream Processing
- Fast
- Scalable
- Fault Tolerant
- Reliable
37Tuple
38Streams
39Spouts
40Bolts
41Topologies
42Reliable Processing
43Reliable Processing
44Stream Grouping
- Groupings are used to decide to which task in
the - subscribing bolt (group) a tuple is sent.
- Possible Groupings
- - Shuffle
- - Fields
- - All
- - Global
- - None
- - Direct
- - Local or Shuffle
45Storm Cluster View
46Fault Tolerance
47Fault Tolerance
48Fault Tolerance
49Fault Tolerance
50Fault Tolerance
51Parallelism
52Parallelism
53Apache Storm Real-time -Use cases
Segment Prevent Use Cases Optimize Use Cases
Financial Services Securities fraud Operational risks compliance violations Order routing Pricing
Telecom Security breaches Network outages Bandwidth allocation Customer service
Retails Shrinkage Stock outs Offers Pricing
Manufacturing Preventative maintenance Quality assurance Supply chain optimization Reduced plant downtime
Transportation Driver monitoring Predictive maintenance Routes Pricing
Web Application failures Operational Issues Personalized content
54The End