JET INFOSYSTEMS 02.10.2016 The main approach to Big Data parallel processing: Oracle way Aleksey Struchenko Database Department Leader.

Slides:



Advertisements
Similar presentations
ScaleDB Transactional Shared Disk storage engine for MySQL
Advertisements

Extreme Performance with Oracle Data Warehousing
1 Copyright © 2012 Oracle and/or its affiliates. All rights reserved. Convergence of HPC, Databases, and Analytics Tirthankar Lahiri Senior Director, Oracle.
Exadata for Oracle DBAs Arup Nanda Longtime DBA and now DMA.
Exadata Distinctives Brown Bag New features for tuning Oracle database applications.
Oracle for Data Warehousing
Oracle Exadata for SAP.
Copyright © 2012, Oracle and/or its affiliates. All rights reserved. 1.
Living with Exadata Presented by: Shaun Dewberry, OS Administrator, RDC Tom de Jongh van Arkel, Database Administrator, RDC Komaran Hansragh, Data Warehouse.
10 REASONS Why it makes a good option for your DB IN-MEMORY DATABASES Presenter #10: Robert Vitolo.
Copyright © 2012, Oracle and/or its affiliates. All rights reserved. 1.
High Performance Analytical Appliance MPP Database Server Platform for high performance Prebuilt appliance with HW & SW included and optimally configured.
Violin Memory Inc. Proprietary 11 Dell Partnering Solution Examples AccountSolutionVMEM $ ValueDell Drag Database as a Service$1.5MComing Soon Splunk DB.
Oracle Enterprise Manager – Cloud Control 12c Simon Keys, The Small Ronnie Martin Lambert, The Large Ronnie.
® IBM India Research Lab © 2006 IBM Corporation Challenges in Building a Strategic Information Integration Infrastructure Mukesh Mohania IBM India Research.
John Sadd Progress Fellow and OpenEdge Evangelist
Data-centric computing with Netezza Architecture DISC reading group September 24, 2007.
Grid Computing Veronique Anxolabehere Senior Director of Product Marketing Mike Margulies Senior Director, Grid Platform Solutions.
PMIT-6102 Advanced Database Systems
Oracle: Database and Data Management Innovations with CERN
Database Services for Physics at CERN with Oracle 10g RAC HEPiX - April 4th 2006, Rome Luca Canali, CERN.
NCR CORPORATION Presented by: Dave Raspberry Cheng Murray-Khoo Eric Braun A Data Warehousing Solutions Provider A Data Warehousing Solutions Provider.
1 Progress Software’s OpenEdge Platform Which database is right for your environment? Simon Epps.
Oracle Challenges Parallelism Limitations Parallelism is the ability for a single query to be run across multiple processors or servers. Large queries.
Chapter 2 Computer Clusters Lecture 2.2 Computer Cluster Architectures.
Data Warehousing 1 Lecture-24 Need for Speed: Parallelism Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics.
Business Intelligence Appliance Powerful pay as you grow BI solutions with Engineered Systems.
Our Process We Threw Out Preconceptions and Left No Stone Unturned We looked at white papers, articles, Gartner and Forrester reports, and marketing collateral.
SESSION CODE: BIE07-INT Eric Kraemer Senior Program Manager Microsoft Corporation.
Criteria for D/W Platform Selection Simple Architecture –Easy to deploy the solution with minimal efforts Scalable (Scale Out - Scale Up) –Ability to handle.
 2009 Calpont Corporation 1 Calpont Open Source Columnar Storage Engine for Scalable MySQL Data Warehousing April 22, 2009 MySQL User Conference Santa.
Page 1 © Copyright 2003, the Yankee Group. All rights reserved. IBM SAN File System: It’s Finally Here. Now What? Jamie Gruener Senior Analyst Enterprise.
1 Top Five Tips and Tricks for DBAs and Storage Admins Deploying Oracle Database 12c Gagan Singh Sr. Database Architect Technology and Manufacturing Group.
CERN - IT Department CH-1211 Genève 23 Switzerland t High Availability Databases based on Oracle 10g RAC on Linux WLCG Tier2 Tutorials, CERN,
Mapping the Data Warehouse to a Multiprocessor Architecture
CS240A: Databases and Knowledge Bases Temporal Databases Carlo Zaniolo Department of Computer Science University of California, Los Angeles.
2 Copyright © Oracle Corporation, Private and Confidential. All rights reserved. What is Corporate Performance Management? CPM encompasses three.
Copyright ©2003 Dell Inc. All rights reserved. Scaling-Out with Oracle® Grid Computing on Dell™ Hardware J. Craig Lowery, Ph.D. Software Architect and.
WHAT EXACTLY IS ORACLE EXALYTICS?. 2 What Exactly Is Exalytics? AGENDA Exalytics At A Glance The Exa Family Do We Need Exalytics? Hardware & Software.
Shared Nothing Architecture Allen Archer. What is Shared Nothing architecture? It is a distributed architecture in which each node is independent and.
Introduction to Exadata X5 and X6 New Features
® IBM Software Group © 2004 IBM Corporation IBM Information Management 소개.
Modern Data Warehousing Symmetric Multi-Processing SQL (SMP) vs Massive Parallel Processing SQL (MPP) Alain Dormehl P-Cubed Session Level : Intermediary.
BIG DATA/ Hadoop Interview Questions.
Peter Idoine Managing Director Oracle New Zealand Limited.
Abstract MarkLogic Database – Only Enterprise NoSQL DB Aashi Rastogi, Sanket V. Patel Department of Computer Science University of Bridgeport, Bridgeport,
Exadata Distinctives 988 Bobby Durrett US Foods. What is Exadata? Complete Oracle database platform Disk storage system Unique to Exadata – intelligent.
IBM Systems and Technology Group © 2008 IBM Corporation Oracle Exadata Storage and the HP Oracle Database Machine Competitive Seller Podcast Mark Wulf.
1 Cloud-Native Data Warehousing Bob Muglia. 2 Scenarios with affinity for cloud Gartner 2016 Predictions: By 2018, six billion connected things will be.
Software architectures and tools for highly distributed applications Voldemaras Žitkus.
Eric Grancher CERN IT department Overview of Database Technologies Computing and Astroparticle Physics 2 nd ASPERA Workshop /1.
…the secret sauce! Diagrams and video from Microsoft white papers and slide decks.
BIG DATA Initiative SMART SubstationBig Data Solution.
Supervisor : Prof . Abbdolahzadeh
Connected Infrastructure
Big Data Enterprise Patterns
Exadata and ZFS Storage at Nielsen
CLUSTER COMPUTING Presented By, Navaneeth.C.Mouly 1AY05IS037
Connected Infrastructure
Exadata for Oracle DBAs
Real-time data delivery may be easier than you think
Mapping the Data Warehouse to a Multiprocessor Architecture
April 30th – Scheduling / parallel
ASM-based storage to scale out the Database Services for Physics
Massively Parallel Processing in Azure Comparing Hadoop and SQL based MPP architectures in the cloud Josh Sivey SQL Saturday #597 | Phoenix.
Designing Business Intelligence Solutions with Microsoft SQL Server
CloudAnt: Database as a Service (DBaaS)
Moving your on-prem data warehouse to cloud. What are your options?
IBM SAP Alliance TRUMPF Production technology specialist reduces database costs by 50 percent, saves 15 percent on license, maintenance and data storage.
Presentation transcript:

JET INFOSYSTEMS The main approach to Big Data parallel processing: Oracle way Aleksey Struchenko Database Department Leader

© 2015 Jet Infosystems 2 Parallel Processing of Data Parallel processing is the main principle of big data computing Clusters with hundreds of nodes compute the Physics data In databases the parallel querying methods for SQL and Non-SQL Databases can be different, this presentation focuses on SQL DB (first of all – Oracle as a leader in databases) In databases area there are two main approaches for clusters in architecture: Shared Nothing and Shared Everything

© 2015 Jet Infosystems 3 Shared Nothing Cluster Every computing node has its own storage and data, the coordinator distributes the query between nodes and aggregates the results Shared Noting Clusters are used for parallel processing in all Non-SQL Databases and most SQL Databases For SQL Databases this approach is often called MPP (Massive Parallel Processing): Teradata, IBM Netezza, EMC Green Plum The main benefit is the absolute scalability, but cluster reconfigurations and multi-table join queries are the real problems for MPP

© 2015 Jet Infosystems 4 Shared Everything Cluster All cluster nodes query the data from shared storage, in any case the memory is shared between nodes with special technology The best-known implementation is RAC (Real Application Clusters) – Oracle Database Enterprise Edition Option No limitation for multi-table join queries and easy cluster reconfiguration, but the scalability of RAC needs special testing

© 2015 Jet Infosystems 5 Oracle Exadata = Exadata Software: Smart Scan (offload) Compression Exadata Storage Server Disks + Flash (or Flash only) InfiniBand Exadata Software: Smart Scan (offload) Compression Exadata Storage Server Disks + Flash (or Flash only) Database Server RAC

© 2015 Jet Infosystems 6 Oracle Engineered Systems

© 2015 Jet Infosystems 7 About Jet Infosystems A full-profile systems integrator, and a leader in IT market of Russia and CIS, more than 1000 employees An Oracle Platinum Partner (15+ years) with OCM specializations The first Oracle Exadata demo-center in Russia (since 2010) The largest service centre for corporate class solutions (including Oracle Database and Oracle Exadata outsourcing) Collaborated with JINR since 1993 (Sun, Cisco, Brocade, Huawei) Questions: (Aleksey