Microsoft Ignite 2015

Presentation transcript:

Microsoft Ignite 2015 4/28/2017 6:07 PM © 2015 Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

Real-Time Analytics at Scale for Internet of Things
Asad Khan, Principal Program Manager
Nishant Thacker, Product Marketing Manager

IoT Scenario - Connected Cars / Devices
[Diagram labels: Cloud gateways; Queue Service (Event Hubs); Get Data; Get Reference Data; Business Logic; Store Raw Data; Store Reporting Data; Document Store; NoSQL Store; Relational Store; Live Dashboard]

Customer use cases
Columns: Scenario; Input; Operators (examples); Side lookup; Output; Programming language
Connected Cars: Event Hubs; window-based aggregation, join stream / split stream; HBase, ML; DocumentDB; C# hybrid, Java
ETL: Event Hubs; partitioning / organize; N/A; WASB; Java
IoT: window-based aggregation; HBase, ML; DocumentDB, HBase
Fraud detection: ServiceBus Queue; filter; ML; HBase; C# hybrid
Social analytics: Twitter; group by / trending topics; real-time dashboard (BI); Trident
Network monitoring: Kafka; split (on success / failure); SQL
Log search: Storage Queue / Event Hub; parsing & index; Elastic Search
Mobile engagement: Event Hubs; count; HBase; SignalR
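Several of the use cases above list window-based aggregation as an operator. As a minimal sketch of the idea (plain Python, not Storm code), the function below groups events into fixed, non-overlapping time windows and averages a value per device. The function name, the event layout, and the sample readings are all hypothetical and chosen only for illustration.

```python
from collections import defaultdict


def tumbling_window_average(events, window_seconds):
    """Average the value per (window start, device) over fixed-size windows.

    `events` is an iterable of (timestamp_seconds, device_id, value) tuples.
    This layout is an assumption made for this sketch.
    """
    # (window_start, device_id) -> [running sum, count]
    sums = defaultdict(lambda: [0.0, 0])
    for timestamp, device_id, value in events:
        # Align the timestamp to the start of its window.
        window_start = (timestamp // window_seconds) * window_seconds
        bucket = sums[(window_start, device_id)]
        bucket[0] += value
        bucket[1] += 1
    return {key: total / count for key, (total, count) in sums.items()}


# Hypothetical readings: (seconds since start, device id, speed in km/h)
readings = [
    (0, "car-1", 50.0),
    (4, "car-1", 70.0),
    (7, "car-2", 30.0),
    (12, "car-1", 90.0),
]
averages = tumbling_window_average(readings, window_seconds=10)
print(averages)
```

A streaming engine would emit each window's result as the window closes instead of returning everything at the end; the batch form here only shows how events map to windows.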

Hadoop Data Platform
Hadoop - Data processing and storage platform

Batch: Hive, Pig, MapReduce
NoSQL: HBase
Stream: Storm
Other: Mahout, Oozie, Spark
Data Storage Layer (HDFS)

Hadoop is an open source, scalable, fault-tolerant platform for storing and processing large amounts of unstructured data, distributed across a large number of machines.

Microsoft Big Data solutions, including HDInsight on Microsoft Azure, are based on a Hadoop distribution called the Hortonworks Data Platform (HDP). It uses the YARN resource manager to implement a runtime platform for a wide range of data query, transformation, and storage tools and applications. The figure shows the high-level architecture of HDP and how it supports the tools and applications.

[Figure: High-level architecture of the Hortonworks Data Platform]

The three most commonly used tools for processing data by executing queries and transformations, in order of popularity, are Hive, Pig, and MapReduce. HCatalog is a feature of Hive that provides, among other features, a way to remove dependencies on literal file paths in order to stabilize and unify solutions that incorporate multiple steps. Mahout is a scalable machine learning library for clustering, classification, and collaborative filtering that you can use to examine data files in order to extract specific types of information. Storm is a real-time data processing application that is designed to handle streaming data. These applications can be used for a wide variety of tasks, and many of them can be easily combined into multi-step workflows by using Oozie.

What is Azure HDInsight
Microsoft’s cloud Big Data offering
100% open source Apache Hadoop
Up and running in minutes with no hardware to deploy
Harness existing .NET and Java skills
Utilize familiar BI tools and application frameworks

What is Hive?
Create, load, and query Hive tables.
Hive SQL includes data definition language, data import/export, and data manipulation language statements.
Create table
Query data using SQL-like statements
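The create, load, and query cycle can be sketched with ordinary SQL. The example below uses Python's built-in sqlite3 module purely as a stand-in so that it runs anywhere; it is not Hive, and Hive-specific syntax (storage format clauses, loading from files in the cluster) is not shown. The table name, columns, and rows are hypothetical.

```python
import sqlite3

# In-memory SQLite database used as a stand-in for a Hive session (assumption).
conn = sqlite3.connect(":memory:")

# Data definition: create a table.
conn.execute(
    "CREATE TABLE sensor_readings ("
    "device_id TEXT, reading_time TEXT, temperature REAL)"
)

# Data import: load a few hypothetical rows.
rows = [
    ("dev-1", "2015-05-04T10:00", 20.0),
    ("dev-1", "2015-05-04T10:05", 22.0),
    ("dev-2", "2015-05-04T10:00", 30.0),
]
conn.executemany("INSERT INTO sensor_readings VALUES (?, ?, ?)", rows)

# Data manipulation: query with an SQL statement.
result = conn.execute(
    "SELECT device_id, AVG(temperature) AS avg_temp "
    "FROM sensor_readings GROUP BY device_id ORDER BY device_id"
).fetchall()
print(result)
conn.close()
```

In Hive the same kind of SELECT would be compiled into jobs that run across the cluster, which is what makes the SQL-like interface useful for large files.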

What is HBase?
Apache HBase is a distributed, low-latency NoSQL database designed to handle large-scale datasets.
NoSQL on top of Hadoop
Large scale
Low latency
Open source
Columnar, schema-free data model
[Diagram labels: Events; Storm; HBase; Hadoop APIs; Batch Analytics; Web Apps; Mobile; Web]
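To make "columnar, schema-free data model" concrete, here is a toy in-memory sketch in plain Python. It is not the HBase client API: the class and method names are hypothetical, and real HBase also stores raw bytes, keeps cell versions, and distributes rows across servers. What the sketch keeps is the shape of the model: cells addressed by a row key plus a "family:qualifier" column name, rows that may each carry different columns, and scans over a sorted row-key range.

```python
class ToyWideColumnTable:
    """Toy stand-in for a wide-column table (hypothetical, single process)."""

    def __init__(self):
        self._rows = {}  # row_key -> {"family:qualifier": value}

    def put(self, row_key, column, value):
        # No schema check: any row may hold any set of columns.
        self._rows.setdefault(row_key, {})[column] = value

    def get(self, row_key, column=None):
        row = self._rows.get(row_key, {})
        if column is None:
            return dict(row)
        return row.get(column)

    def scan(self, start_key, stop_key):
        # Rows come back in row-key order; start is inclusive, stop exclusive.
        for key in sorted(self._rows):
            if start_key <= key < stop_key:
                yield key, dict(self._rows[key])


table = ToyWideColumnTable()
table.put("car-1#0001", "telemetry:speed", 62)
table.put("car-1#0001", "telemetry:fuel", 0.8)
table.put("car-2#0001", "telemetry:speed", 48)
table.put("car-2#0001", "status:door_open", True)  # column only this row has

car1_rows = list(table.scan("car-1", "car-2"))
print(car1_rows)
```

Prefixing the row key with the device id, as in this hypothetical layout, keeps one device's rows adjacent so a range scan can fetch them together.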

What is Storm?
Apache Storm is a distributed, fault-tolerant, open-source, real-time event processing solution for large, fast streams of data.
[Diagram labels: Sentiment; Clickstream; Machine/Sensor; Server Logs; Geo-location; Queuing Service / Direct API; Storm (real-time processing system); Data Store (HBase, SQL); Batch Processing; Real-time dashboard]
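Storm structures a computation as sources of tuples (spouts) feeding processing steps (bolts). The sketch below imitates that shape with Python generators in a single process. It does not use the Storm API and omits distribution, parallelism, and fault tolerance; the function names, message format, and threshold are hypothetical.

```python
def sensor_spout(raw_lines):
    """Source step: emit raw messages one at a time."""
    for line in raw_lines:
        yield line


def parse_bolt(stream):
    """Processing step: turn 'device,reading' text into typed tuples."""
    for line in stream:
        device_id, reading = line.split(",")
        yield device_id, float(reading)


def threshold_bolt(stream, limit):
    """Processing step: pass on only readings above the limit."""
    for device_id, reading in stream:
        if reading > limit:
            yield device_id, reading


# Hypothetical messages as they might arrive from a queue.
raw = ["car-1,88.5", "car-2,40.0", "car-1,95.0"]

# Wire the steps together: spout -> parse -> threshold.
alerts = list(threshold_bolt(parse_bolt(sensor_spout(raw)), limit=80.0))
print(alerts)
```

Because each step only consumes and yields tuples, steps can be rearranged or replaced independently, which is the property a real topology relies on when it spreads them across machines.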

IoT Scenario - Connected Cars / Devices
[Diagram labels: Cloud gateways; Queue Service (Event Hubs); Business Logic (Apache Storm); Get Data; Get Reference Data; Store Raw Data; Store Reporting Data; Document Store (DocumentDB); NoSQL Store (HBase); Relational Store (SQL Azure); Live Dashboard (PowerBI)]
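The steps named in this diagram (Get Data, Get Reference Data, Business Logic, Store Raw Data, Store Reporting Data) can be sketched as one function applied to each incoming event. This is a single-process illustration in plain Python with in-memory dictionaries and lists standing in for the stores; the field names, the speed-limit rule, and the sample data are hypothetical and are not taken from the demo.

```python
# In-memory stand-ins for the stores in the diagram (assumptions).
reference_data = {
    "car-1": {"model": "sedan", "speed_limit": 80.0},
    "car-2": {"model": "truck", "speed_limit": 60.0},
}
raw_store = []        # every event as received
reporting_store = {}  # per-device summary for a dashboard


def process_event(event):
    """Handle one event: store raw, look up reference data, update reporting."""
    raw_store.append(event)                        # Store Raw Data
    ref = reference_data.get(event["device_id"])   # Get Reference Data
    if ref is None:
        return None                                # unknown device: raw only
    # Business Logic: hypothetical rule comparing speed to the device's limit.
    over_limit = event["speed"] > ref["speed_limit"]
    summary = reporting_store.setdefault(          # Store Reporting Data
        event["device_id"],
        {"model": ref["model"], "events": 0, "over_limit": 0},
    )
    summary["events"] += 1
    if over_limit:
        summary["over_limit"] += 1
    return summary


# Get Data: hypothetical events as they might arrive from the queue service.
incoming = [
    {"device_id": "car-1", "speed": 85.0},
    {"device_id": "car-1", "speed": 70.0},
    {"device_id": "car-2", "speed": 65.0},
    {"device_id": "car-9", "speed": 50.0},
]
for item in incoming:
    process_event(item)
print(reporting_store)
```

Keeping the raw events separate from the reporting summaries mirrors the diagram's split between raw and reporting stores: the raw copy can be reprocessed later, while the summary stays small enough to serve a live dashboard.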

Demo Asad Khan

HDInsight – Call to Action

Key sessions at Ignite:
BRK3555 - Real-Time Analytics at Scale for Internet of Things
BRK2550 - Big Data for the SQL Ninja
BRK2576 - Planning your Big Data Architecture on Azure
BRK3556 - Optimizing Hadoop using Microsoft Azure HDInsight
BRK3559 - Build Hybrid Big Data Pipelines with Azure Data Factory and Azure HDInsight

Sign up for the HDInsight free trial: http://azure.com/hdinsight
Sign up for the Azure Data Lake preview: http://azure.com/datalake

Ignite Azure Challenge Sweepstakes
Attend Azure sessions and activities, track your progress online, win raffle tickets for great prizes!
Aka.ms/MyAzureChallenge
Enter this session code online: “XXDD”
(10) Microsoft Surface Pro 3 Core i5 256GB
(30) Xbox One
(55) Microsoft Band
Offers throughout the week
NO PURCHASE NECESSARY. Open only to event attendees. Winners must be present to win. Game ends May 9th, 2015. For Official Rules, see The Cloud Platform Lounge or aka.ms/myazurechallenge
