Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 1.

Slides:



Advertisements
Similar presentations
Copyright © 2012, Oracle and/or its affiliates. All rights reserved. 1 The following is intended to outline our general product direction. It is intended.
Advertisements

1.
Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | Your customer as a segment of one That changes every second! Hein Van Der Merwe Chief.
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 1.
Oracle Labs Graph Analytics Research Hassan Chafi Sr. Research Manager Oracle Labs Graph-TA 2/21/2014.
Jim Ferrentino Elizabeth West
Virtualization of Fixed Network Functions on the Oracle Fabric Krishna Srinivasan Director, Product Management Oracle Networking Savi Venkatachalapathy.
A Java Architecture for the Internet of Things Noel Poore, Architect Pete St. Pierre, Product Manager Java Platform Group, Internet of Things September.
Oracle Cloud Marketplace Neelesh Gurnani Director, Product Development Arif Khan Director, Product Management September 29, 2014 Copyright © 2014, Oracle.
Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | Oracle SQL Developer What’s New in Version 4.1 Jeff Smith
Copyright © 2014, Oracle and/or its affiliates. All rights reserved. | Advanced Metadata Modeling Modeling for the Oracle Business Intelligence Cloud.
Architecting for the Internet of Things
CON Software-Defined Networking in a Hybrid, Open Data Center Krishna Srinivasan Senior Principal Product Strategy Manager Oracle Virtual Networking.
Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | 2 Hidden Gems of APEX David Gale Software Engineer Oracle Application Express November,
Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | Oracle SQL Developer For the DBA Jeff Smith
The Safe Harbor The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated.
HOL9396: Oracle Event Processing 12c
Best Practices for Supporting Oracle Hyperion EPM and Business Intelligence Solutions Mitra Veluri Senior Principal Technical Support Engineer David Valociek.
Best Practices for Upgrading Oracle PeopleSoft Environments
Copyright © 2014, Oracle and/or its affiliates. All rights reserved. | Amit Jha Project Leader, Product Management Oracle EBS Procurement & Contracts.
OEP BOF9272 SOA Event Delivery Network
PARALLEL DBMS VS MAP REDUCE “MapReduce and parallel DBMSs: friends or foes?” Stonebraker, Daniel Abadi, David J Dewitt et al.
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. JD Edwards Summit The Newest JDE Module – Rental Management Joel Sandberg Sales Consultant.
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 1 Preview of Oracle Database 12 c In-Memory Option Thomas Kyte
Getting Started with Oracle Compute Cloud
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. JD Edwards Summit PaaS from an Applications Perspective Charles McGuinness Director,
Hive: A data warehouse on Hadoop Based on Facebook Team’s paperon Facebook Team’s paper 8/18/20151.
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. Insert Information Protection Policy Classification from Slide 12 1.
1Copyright © 2012, Oracle and/or its affiliates. All rights reserved. Insert Information Protection Policy Classification from Slide 8 Reporting from Contract.
Oracle E-Business Suite Order Management: Presenting the HTML and Mobile User Experience Durgaprasad Bodapati Director, Product Management Bhavana Sharma.
MapReduce April 2012 Extract from various presentations: Sudarshan, Chungnam, Teradata Aster, …
HBase A column-centered database 1. Overview An Apache project Influenced by Google’s BigTable Built on Hadoop ▫A distributed file system ▫Supports Map-Reduce.
Copyright © 2014, Oracle and/or its affiliates. All rights reserved. An Auto-Join Network of Things Wong, H. and Wesson, B. Oracle Confidential – Internal/Restricted/Highly.
Hadoop 2 cluster with Oracle Solaris Zones, ZFS and unified archives Orgad Kimchi - Principal Software Engineer September 29, 2014 Oracle Confidential.
Hive Facebook 2009.
Copyright © 2012, Oracle and/or its affiliates. All rights reserved. 1 Quick Tutorial – Part 2 Open Data Web Services for Oracle BPM August, 2013 Forms.
1Copyright © 2012, Oracle and/or its affiliates. All rights reserved. Insert Information Protection Policy Classification from Slide 8 Contract Management.
MapReduce High-Level Languages Spring 2014 WPI, Mohamed Eltabakh 1.
An Introduction to HDInsight June 27 th,
CON Software-Defined Networking in a Hybrid, Open Data Center Krishna Srinivasan Senior Principal Product Strategy Manager Oracle Virtual Networking.
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. JD Edwards Summit Really Cool Demo Stuff – 2015 Edition!!!!! For demo content, please.
“This presentation is for informational purposes only and may not be incorporated into a contract or agreement.”
Copyright © 2002 ProsoftTraining. All rights reserved. JavaServer Pages.
1. When things go wrong: how to find SQL error Sveta Smirnova Principle Technical Support Engineer, Oracle.
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. Insert Information Protection Policy Classification from Slide 12 1.
Workforce Scheduling Release 5.0 for Windows Implementation Overview OWS Development Team.
RESTful Microservices In Java With Jersey Jakub Podlešák Software Engineer Oracle, Application Server Group September 29, 2014 Copyright © 2014, Oracle.
Copyright © 2014, Oracle and/or its affiliates. All rights reserved. | Planning & Budgeting Cloud Service (PBCS) Overview Business Analytics Product Group.
Maximum Application Availability with Oracle Database 12c
Rajesh Bhat Director, PLM Analytics Applications
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. JD Edwards Summit Recruit and Learn Solutions Extend HR into the Cloud Marie Scott.
Oracle SIS and Sakai Integration Linda Feng, Architect Oracle Academic Enterprise Initiative.
Travel and Transportation General Session and Industry Excellence Awards Vijay Anand, Oracle Sundar Swaminathan, Oracle September 30, 2014 Copyright ©
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 1.
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. Insert Information Protection Policy Classification from Slide 12 1.
Copyright © 2015 Oracle and/or its affiliates. All rights reserved. How Can RDF and OWL Coexist with Property Graph Zhe Wu Architect Oracle Spatial and.
Grid Technology CERN IT Department CH-1211 Geneva 23 Switzerland t DBCF GT Our experience with NoSQL and MapReduce technologies Fabio Souto.
CON8473 – Oracle Distribution of OpenStack Ronen Kofman Director of Product Management Oracle OpenStack September, 2014 Copyright © 2014, Oracle and/or.
Microsoft Ignite /28/2017 6:07 PM
Copyright © 2014, Oracle and/or its affiliates. All rights reserved. | Daddy, what's a middle wear? An incredibly oversimplified explanation of what Middleware.
JD Edwards EnterpriseOne In-Memory Sales Advisor
Charles Phillips screen
JD Edwards Support and Oracle Cloud Infrastructure: A Successful Path to Oracle Cloud
Confidential – Oracle Internal/Restricted/Highly Restricted
Big Data - in Performance Engineering
Oracle Database Multilingual Engine
Introduction to Apache
Overview of big data tools
R11 Product Release Update March 2008
Charles Tappert Seidenberg School of CSIS, Pace University
Presentation transcript:

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 1

Oracle In-Database MapReduce: When Hadoop Meets Exadata Kuassi Mensah Director Product Management

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 3 The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment to deliver any material, code, or functionality, and should not be relied upon in making purchasing decisions. The development, release, and timing of any features or functionality described for Oracle’s products remains at the sole discretion of Oracle.

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 4 Agenda Big Data & In-Database MapReduce  SQL Map Reduce  In-Database Container for Hadoop  Oracle’s Big Data Solution

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 5

6 Big Data Concept Any Data RDBMS DataMining (phase II) MapReduce (phase I) MapReduce Convention: Process Data Locally MapReduce Infrastructure

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 7 Big Data In Real Life, Today RDBMS DataMining (phase II) MapReduce (phase I) MapReduce Infrastructure Unstructured Data (HDFS, NoSQL, etc) Structured Data RDBMS

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 8 Problems with Big Data Today  Shipping Data from RDBMS to MapReduce Infrastructure – Too Big to Move – Operational Issues – Data Correctness/Loss – Lack of Enterprise Class Security on MapReduce Infrastructure – Breaking MapReduce Convention – Cost of MapReduce Infrastructure or Storage – Lack of MapReduce Development Skills – Lack of MapReduce Deployment Skills

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 9 Big Data with In-Database MapReduce Hadoop Cluster Unstructured Data (HDFS, NoSQL, etc) Structured Data (RDBMS) RDBMS DataMining MapReduce DataMining In-Database MapReduce

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 10 In-Database MapReduce Trends  Hybrid Platforms: DBMS + MapReduce  Projects/Products/Initiatives – DataStax: Cassandra + Hadoop – Hadapt HadoopDB: Postgress + Hadoop – Greenplum HD – MongoDB MapReduce: JavaScript – Aster Data / TeraData  Limitations – Dependency on a Hadoop infrastructure in addition to DBMS – Source compatibility: Need to rewrite Hadoop jobs in different lang.

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 11 Oracle’s Big Data Strategy MapReduce APIs Across Data Infrastructure Hadoop, R, SQL Weblogs Sales Records RDBMS ( In-Database MapReduce) Big Data Appliance

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 12 Oracle In-Database MapReduce Integration with Oracle Big Data Solution

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 13 Oracle In-Database MapReduce In-Database Container for Hadoop (currently Beta) Feature of Oracle database 12c releases SQL MapReduce ( )

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 14 Agenda  Big Data & In-Database MapReduce SQL Map Reduce  In-Database Container for Hadoop  Oracle’s Big Data Solution

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 15 Collection of Existing and New Features  SQL Analytic functions  User-defined Aggregates functions  Parallel Pipelined Table Functions  SQL Pattern Matching MATCH_RECOGNIZE -- new! SQL MapReduce Declarative MR Analytics

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 16 SQL Pattern Matching SQL Pattern Matching provides expressive syntax and fast execution for pattern matching New SQL construct: MATCH_RECOGNIZE Define patterns using regular expression syntax Find event A (“privilege revoked”) followed by 3 or more occurences of event B (“attempted login”) within 1 minute days Stock price Find 10-day periods where a stock price has “double-bottomed”

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 17 SQL Pattern Matching SELECT user_id, session_id start_time, no_of_events, duration FROM Events MATCH_RECOGNIZE ( PARTITION BY User_ID ORDER BY Time_Stamp MEASURES match_number() session_id, count(*) as no_of_events, first(time_stamp) start_time, last(time_stamp) - first(time_stamp) duration PATTERN (b s*) DEFINE s as (s.Time_Stamp - prev(Time_Stamp) <= 10) ) ORDER BY user_id, session_id; Sessionization

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 18 DEMO

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 19 Agenda  Big Data & In-Database MapReduce  SQL Map Reduce In-Database Container for Hadoop  Oracle’s Big Data Solution

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 20 Vanilla Hadoop Mappers Reducers Materialization of Intermediate data Hadoop Cluster Physical partitions (DataNodes)

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 21 In-Database Container for Hadoop  Apache Hadoop  Task execution: In-Database JVM  Data partitioning & task scheduling: PQ engine  Data storage: Table, external table, object view.  Data type mapping: TableReader, TableWriter Components

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 22 In-Database Container For Hadoop Mappers processes Reducers processes Pipelining Intermediate data Table partitions Parallel DML RDBMS Server

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 23 In-DB Cont. 4 Hadoop vs Vanilla Hadoop Mappers Reducers Materialization vs Pipelining Intermediate data Physical vs Logical data partitions Parallel DML RDBMS Server Hadoop Cluster

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 24  A “Hadoop container” in the RDBMS engine: no Hadoop cluster required.  Data processing in-situ: no need to ship data to a separate infrastructure.  API and Source-compatibility: accept Hadoop Mappers and Reducers as-is  Java interface: invoke Hadoop jobs a-la vanilla Hadoop  SQL interface: Map & Reduce steps in SQL statements In-Database Container for Hadoop Summary

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 25 In-Database Container for Hadoop SQL and Java interfaces SELECT * FROM TABLE (HREDUCE_JP_WORDCOUNT(:ConfKey, CURSOR(SELECT * FROM TABLE (HMAP_JP_WORDCOUNT(:ConfKey, CURSOR(SELECT * from InTable)))))) public class WordCount { public static void main() throws Exception { /* Setup the parameters and run the job */ …… job.init(); job.run(); }

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 26 DEMO

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 27 Pipelining Hadoop Jobs Pipelining Hadoop steps without intermediate materialization select * from table (HREDUCE_JP_JOB2 (:Confkey2,.... (HMAP_JP_JOB2 (:ConfKey2,.... (HREDUCE_JP_JOB1 (:ConfKey1,.... (HMAP_JP_JOB1 (:ConfKey1,...), )))); Through the SQL Interface

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 28 In-Database Container for Hadoop  Reuse Mappers & Reducers (including R-generated)  Dynamic Data Partitioning  Apache Hadoop API 2.00  Custom Writables Hadoop types  Serialized Data Formats  InputFormats: HDFS, HBase, Others  Java interface (Similar to Vanilla Hadoop Driver).  SQL interface: Hadoop Job Steps in SQL queries  Mahout Projected Features

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 29 Develop/Deploy with In-Db Cont. 4 Hadoop Develop Hadoop Mappers & Reducers from scratch Create or Update Hadoop Job Configuration file Reuse existing Mappers & Reducers Load all Java code in RDBMS and create Call Specs Invoke Hadoop job via Java or SQL interfaces. Populate output table with parallel INSERT

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 30 Agenda  Big Data & In-Database MapReduce  SQL Map Reduce  In-Database Container for Hadoop Oracle’s Big Data Solution

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 31 Oracle’s Big Data Solution Oracle Exalytics InfiniBand Oracle Real-Time Decisions Oracle Big Data Appliance Oracle Exadata InfiniBand AcquireOrganize AnalyzeDecide Oracle Endeca Information Discovery

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 32 Oracle In-Database MapReduce Summary  Declarative Analytics (SQL MapReduce)  Programmatic Analytics (Complex Algorithms, Hadoop)  MapReduce Jobs steps in SQL Queries.  Custom extensions (InputFormats)  RDBMS QoS (e.g., Enterprise Class Security)  Developers and DBAs friendly  Seamless integration with Oracle’s Big Data solution

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 33

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 34