Harnessing Big Data with Hadoop Dipti Sangani; Madhu Reddy DBI210.

1 Harnessing Big Data with Hadoop Dipti Sangani; Madhu Reddy DBI210

2 BIG HYPE? OR REAL DEAL?

6 May 21, 2012 VC firms pour money into big-data vendors

7 Microsoft’s Big Data Vision and Platform

8 Madhu Reddy, Sr. Program Manager, mareddy@microsoft.com | Dipti Sangani, Sr. Product Planner, diptis@microsoft.com

9 Relational Data

10 New Economics: Open Source Software. Commodity Hardware. Cloud Scale.

11 LIVE DATA FEEDS: How do I optimize my fleet based on weather and traffic patterns? | ADVANCED ANALYTICS: How do I better predict future outcomes? | SOCIAL & WEB ANALYTICS: What’s the social sentiment for my brand or products?

12 CONNECTING WITH THE WORLD’S DATA | IMMERSIVE INSIGHT, WHEREVER YOU ARE | ANY DATA, ANY SIZE, ANYWHERE

13 DATA MANAGEMENT: RELATIONAL | NON-RELATIONAL | STREAMING

14 DATA MANAGEMENT: RELATIONAL | NON-RELATIONAL | STREAMING. DATA ENRICHMENT: DISCOVER AND RECOMMEND | TRANSFORM AND CLEAN | SHARE AND GOVERN

15 DATA MANAGEMENT: RELATIONAL | NON-RELATIONAL | STREAMING. DATA ENRICHMENT: DISCOVER AND RECOMMEND | TRANSFORM AND CLEAN | SHARE AND GOVERN. INSIGHT: SELF-SERVICE | COLLABORATIVE | MOBILE | REAL-TIME

16 DATA MANAGEMENT: RELATIONAL | NON-RELATIONAL | STREAMING. DATA ENRICHMENT: DISCOVER AND RECOMMEND | TRANSFORM AND CLEAN | SHARE AND GOVERN. INSIGHT: SELF-SERVICE | COLLABORATIVE | MOBILE | REAL-TIME

17 Submit changes back to Apache Foundation | ‘Just works’ on Windows Azure and Server | Integration with Visual Studio | Performance, Scale, High Availability | Management, Ease of use | Security, Data Governance | Integration with AD and SC | Integration with SQL Server

18 Use Case: Klout provides a score to measure customer influence. Klout used an SSAS OLAP cube to speed queries and offer custom BI. The OLAP cube loads data from the Hive data warehouse. The new solution analyzes 35 billion rows of data and delivers fast queries in under 10 seconds! (Diagram labels: Hive Data Warehouse, OLAP Cube, Microsoft BI Tools.)
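
The slide above credits the OLAP cube for the query speed-up. As a rough illustration of why pre-aggregation helps, here is a minimal Python sketch that rolls detail rows up into totals once, so later queries become dictionary lookups instead of scans. The row layout, the network names, and the `build_cube` and `query` helpers are hypothetical and are not taken from Klout's solution or from SSAS.

```python
from collections import defaultdict

def build_cube(rows):
    """Roll detail rows up into totals at three levels of granularity.

    Each row is a hypothetical (network, day, interactions) tuple.
    """
    cube = defaultdict(int)
    for network, day, interactions in rows:
        cube[(network, day)] += interactions   # finest grain: network per day
        cube[(network, None)] += interactions  # roll-up across all days
        cube[(None, None)] += interactions     # grand total
    return dict(cube)

def query(cube, network=None, day=None):
    """Answer a query from the pre-computed totals; 0 if the cell is absent."""
    return cube.get((network, day), 0)

if __name__ == "__main__":
    # Made-up sample rows, for illustration only.
    rows = [
        ("twitter", "2012-05-01", 3),
        ("twitter", "2012-05-02", 5),
        ("facebook", "2012-05-01", 2),
    ]
    cube = build_cube(rows)
    print(query(cube, "twitter"))
    print(query(cube))
```

The cost of scanning every detail row is paid once at load time; in the slide's architecture that load would read from the Hive data warehouse rather than from an in-memory list.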

19 DEMOS!

20 Hadoop = MapReduce + HDFS. MapReduce (Job Scheduling/Execution System) | HDFS (Hadoop Distributed File System) | HBase (Column DB) | Pig (Data Flow) | Hive (Warehouse and Data Access) | Oozie (Workflow) | Sqoop | Flume | Avro (Serialization) | Zookeeper (Coordination) | HBase / Cassandra (Columnar NoSQL Databases) | Apache Mahout | Karmasphere (Development Tool) | Traditional BI Tools
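
To make the "Hadoop = MapReduce + HDFS" line more concrete, the following is a minimal, single-process Python sketch of the MapReduce programming model using the classic word-count example. It only imitates the map, shuffle, and reduce stages in memory; it does not use Hadoop or HDFS, and the function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) are illustrative choices, not a Hadoop API.

```python
from collections import defaultdict

def map_phase(line):
    """Mapper: emit a (word, 1) pair for every word in one input line."""
    for word in line.lower().split():
        yield word, 1

def shuffle(pairs):
    """Shuffle: group emitted values by key, as a framework does between map and reduce."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups

def reduce_phase(key, values):
    """Reducer: sum the counts collected for one word."""
    return key, sum(values)

def word_count(lines):
    """Run map, shuffle, and reduce over an iterable of text lines."""
    pairs = (pair for line in lines for pair in map_phase(line))
    return dict(reduce_phase(key, values) for key, values in shuffle(pairs).items())

if __name__ == "__main__":
    sample = ["big data big hype", "big deal"]
    print(word_count(sample))
```

In a real cluster the mappers and reducers run on many machines and read their input from a distributed file system; here everything runs in one process so that the data flow is easy to follow.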

21 Self-Service BI | Data Warehouse & Analytics | Digital Shoebox | ETL & Data Mgmt

22 Demo: Hadoop on Azure

23 Key Features and Benefits. Simplified Programming: integration with .NET and new JavaScript libraries for Hadoop; MapReduce programs in JavaScript. Simplified Deployment of MapReduce jobs: deploy JavaScript Hadoop jobs from a simple web browser on any supported device.

24 Demo: Big Data App Development

25 Key Features: Hive ODBC Driver integrates Hadoop with SQL Server Analysis Services, PowerPivot, and Power View; Hive Add-in for Excel. Benefits: familiar self-service BI tools.

26 Demo: Big Data Analytics with Hive and Excel

27 CALL TO ACTION: Check out http://HadoopOnAzure.com

28 THANK YOU!

30 Connect. Share. Discuss. http://northamerica.msteched.com | Learning: Microsoft Certification & Training Resources, www.microsoft.com/learning | TechNet: Resources for IT Professionals, http://microsoft.com/technet | Resources for Developers: http://microsoft.com/msdn

31 Complete an evaluation on CommNet and enter to win!

32 Scan the Tag to evaluate this session now on myTechEd Mobile
