CERN - IT Department CH-1211 Genève 23 Switzerland www.cern.ch/i t Partitioning in COOL Andrea Valassi (CERN IT-DM) R. Basset (CERN IT-DM) Distributed.

Slides:



Advertisements
Similar presentations
BY LECTURER/ AISHA DAWOOD DW Lab # 3 Overview of Extraction, Transformation, and Loading.
Advertisements

CERN - IT Department CH-1211 Genève 23 Switzerland t LCG Persistency Framework CORAL, POOL, COOL – Status and Outlook A. Valassi, R. Basset,
CERN IT Department CH-1211 Genève 23 Switzerland t ES Discussion of COOL - CORAL - POOL priorities for ATLAS Andrea Valassi (IT-ES) For the.
CERN - IT Department CH-1211 Genève 23 Switzerland t Transportable Tablespaces for Scalable Re-Instantiation Eva Dafonte Pérez.
1 Databases in ALICE L.Betev LCG Database Deployment and Persistency Workshop Geneva, October 17, 2005.
CERN - IT Department CH-1211 Genève 23 Switzerland t Relational Databases for the LHC Computing Grid The LCG Distributed Database Deployment.
Information systems and databases Database information systems Read the textbook: Chapter 2: Information systems and databases FOR MORE INFO...
CERN IT Department CH-1211 Genève 23 Switzerland t Recovery Exercise Wrap-up Jacek Wojcieszuk, CERN IT-DM Distributed Database Operations.
CERN IT Department CH-1211 Genève 23 Switzerland t Messaging System for the Grid as a core component of the monitoring infrastructure for.
CERN IT Department CH-1211 Genève 23 Switzerland t Streams new features in 11g Zbigniew Baranowski.
Operating Systems & Infrastructure Services CERN IT Department CH-1211 Geneva 23 Switzerland t OIS CERN Search Updates Eduardo Alvarez November.
1  MyOnlineITCourses.com 1 MyOnlineITCourses.com Oracle Partitioning -- A Primer.
CERN IT Department CH-1211 Genève 23 Switzerland t Integrating Lemon Monitoring and Alarming System with the new CERN Agile Infrastructure.
IST Databases and DBMSs Todd S. Bacastow January 2005.
CERN - IT Department CH-1211 Genève 23 Switzerland t Monitoring the ATLAS Distributed Data Management System Ricardo Rocha (CERN) on behalf.
CERN - IT Department CH-1211 Genève 23 Switzerland t The High Performance Archiver for the LHC Experiments Manuel Gonzalez Berges CERN, Geneva.
IT The Relational DBMS Section 06. Relational Database Theory Physical Database Design.
CERN IT Department CH-1211 Genève 23 Switzerland t SDC Stabilizing SQL execution plans in COOL using Oracle hints Andrea Valassi (IT-SDC)
Database Management System Module 5 DeSiaMorewww.desiamore.com/ifm1.
1 Adapted from Pearson Prentice Hall Adapted form James A. Senn’s Information Technology, 3 rd Edition Chapter 7 Enterprise Databases and Data Warehouses.
CERN - IT Department CH-1211 Genève 23 Switzerland t Tier0 database extensions and multi-core/64 bit studies Maria Girone, CERN IT-PSS LCG.
Alignment Strategy for ATLAS: Detector Description and Database Issues
CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services Job Monitoring for the LHC experiments Irina Sidorova (CERN, JINR) on.
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES PhEDEx Monitoring Nicolò Magini CERN IT-ES-VOS For the PhEDEx.
LHC: ATLAS Experiment meeting “Conditions” data challenge Elizabeth Gallas - Oxford - August 29, 2009 XLDB3.
11-1 Improve response time of interactive programs. Improve batch throughput. To ensure scalability of applications load vs. performance. Reduce system.
3 Copyright © 2005, Oracle. All rights reserved. Partitioning Basics.
CERN IT Department CH-1211 Genève 23 Switzerland t LCG Gridview / LCG SAM use cases Miguel Anjo 8 th July 2008 Database Developers’ Workshop.
CHEP 2006, Mumbai13-Feb-2006 LCG Conditions Database Project COOL Development and Deployment: Status and Plans Andrea Valassi On behalf of the COOL.
Databases E. Leonardi, P. Valente. Conditions DB Conditions=Dynamic parameters non-event time-varying Conditions database (CondDB) General definition:
CERN - IT Department CH-1211 Genève 23 Switzerland t Oracle Metalink for Tier 1 Miguel Anjo Database mini workshop 26.January.2007.
CERN - IT Department CH-1211 Genève 23 Switzerland t DB Development Tools Benthic SQL Developer Application Express WLCG Service Reliability.
CERN IT Department CH-1211 Genève 23 Switzerland t Castor development status Alberto Pace LCG-LHCC Referees Meeting, May 5 th, 2008 DRAFT.
CERN IT Department CH-1211 Genève 23 Switzerland t ES Future plans for CORAL and COOL Andrea Valassi (IT-ES) For the Persistency Framework.
The Persistency Patterns of Time Evolving Conditions for ATLAS and LCG António Amorim CFNUL- FCUL - Universidade de Lisboa A. António, Dinis.
3rd November Richard Hawkings Luminosity, detector status and trigger - conditions database and meta-data issues  How we might apply the conditions.
SQL Server 2005 Implementation and Maintenance Chapter 3: Tables and Views.
CERN - IT Department CH-1211 Genève 23 Switzerland t Oracle Real Application Clusters (RAC) Techniques for implementing & running robust.
CERN - IT Department CH-1211 Genève 23 Switzerland t COOL Conditions Database for the LHC Experiments Development and Deployment Status Andrea.
08-Nov Database TEG workshop, Nov 2011 ATLAS Oracle database applications and plans for use of the Oracle 11g enhancements Gancho Dimitrov.
CERN IT Department CH-1211 Genève 23 Switzerland t DM Database Monitoring Tools Database Developers' Workshop CERN, July 8 th, 2008 Dawid.
Chapter 4 Logical & Physical Database Design
CERN IT Department CH-1211 Genève 23 Switzerland t Streams Service Review Distributed Database Workshop CERN, 27 th November 2009 Eva Dafonte.
CERN IT Department CH-1211 Geneva 23 Switzerland t WLCG Operation Coordination Luca Canali (for IT-DB) Oracle Upgrades.
CERN IT Department CH-1211 Genève 23 Switzerland t COOL performance optimization using Oracle hints Andrea Valassi and Romain Basset (IT-DM)
Andrea Valassi (CERN IT-DB)CHEP 2004 Poster Session (Thursday, 30 September 2004) 1 HARP DATA AND SOFTWARE MIGRATION FROM TO ORACLE Authors: A.Valassi,
CERN IT Department CH-1211 Geneva 23 Switzerland t Eva Dafonte Perez IT-DB Database Replication, Backup and Archiving.
11-Nov Distr. DB Operations workshop - November 2008 The PVSS Oracle DB Archive in ATLAS ( life cycle of the data ) Gancho Dimitrov (LBNL)
11th November Richard Hawkings Richard Hawkings (CERN) ATLAS reconstruction jobs & conditions DB access  Conditions database basic concepts  Types.
CERN IT Department CH-1211 Genève 23 Switzerland t COOL Performance Tests ATLAS Conditions Database example Romain Basset, IT-DM October.
The DCS Databases Peter Chochula. 31/05/2005Peter Chochula 2 Outline PVSS basics (boring topic but useful if one wants to understand the DCS data flow)
1 A Scalable Distributed Data Management System for ATLAS David Cameron CERN CHEP 2006 Mumbai, India.
Data & Storage Services CERN IT Department CH-1211 Genève 23 Switzerland t DSS Data architecture challenges for CERN and the High Energy.
Database Issues Peter Chochula 7 th DCS Workshop, June 16, 2003.
Computing Facilities CERN IT Department CH-1211 Geneva 23 Switzerland t CF CC Monitoring I.Fedorko on behalf of CF/ASI 18/02/2011 Overview.
8 Copyright © 2005, Oracle. All rights reserved. Gathering Statistics.
Conditions Database Status and Plans for 2005 Andrea Valassi (CERN IT-ADC) LCG Applications Area Review 31 March 2005.
CERN IT Department CH-1211 Genève 23 Switzerland t CERN Agile Infrastructure Monitoring Pedro Andrade CERN – IT/GT HEPiX Spring 2012.
Maria del Carmen Barandela Pazos CERN CHEP 2-7 Sep 2007 Victoria LHCb Online Interface to the Conditions Database.
ATLAS The ConditionDB is accessed by the offline reconstruction framework (ATHENA). COOLCOnditions Objects for LHC The interface is provided by COOL (COnditions.
CERN - IT Department CH-1211 Genève 23 Switzerland t Persistency Framework CORAL, POOL, COOL status and plans Andrea Valassi (IT-PSS) On.
Joe Foster 1 Two questions about datasets: –How do you find datasets with the processes, cuts, conditions you need for your analysis? –How do.
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES Author etc Alarm framework requirements Andrea Sciabà Tony Wildish.
CERN IT Department CH-1211 Genève 23 Switzerland t Load testing & benchmarks on Oracle RAC Romain Basset – IT PSS DP.
L1Calo Databases ● Overview ● Trigger Configuration DB ● L1Calo OKS Database ● L1Calo COOL Database ● ACE Murrough Landon 16 June 2008.
DBMS & TPS Barbara Russell MBA 624.
Relational Algebra 461 The slides for this text are organized into chapters. This lecture covers relational algebra, from Chapter 4. The relational calculus.
1 Demand of your DB is changing Presented By: Ashwani Kumar
Data Lifecycle Review and Outlook
Oracle Solutions for Data Archiving
Presentation transcript:

CERN - IT Department CH-1211 Genève 23 Switzerland t Partitioning in COOL Andrea Valassi (CERN IT-DM) R. Basset (CERN IT-DM) Distributed Database Operations Workshop, 11th November 2008 Data Management Group

CERN - IT Department CH-1211 Genève 23 Switzerland t DB Workshop – A. Valassi COOL Status - 2 Outline Motivation Option 1 – homemade partitioning –COOL database ‘federation’ Option 2 – Oracle partitioning –Table and index partitioning –Performance implications and tests Conclusions

CERN - IT Department CH-1211 Genève 23 Switzerland t DB Workshop – A. Valassi COOL Status - 3 Motivation Data management – for instance: –Several smaller, more manageable data chunks –Declare old partitions as read-only –Take old partitions offline –Export individual partitions by file copy –Import partition into existing COOL database Performance is not the argument –Scalability tests with 10 years of Atlas data show good SQL performance for index-based queries –But must ensure performance is not degraded by whichever partitioning solution we choose

CERN - IT Department CH-1211 Genève 23 Switzerland t DB Workshop – A. Valassi COOL Status - 4 Use cases in COOL Single-version data –e.g. temperatures (copied from PVSS to COOL) –Very large samples (Atlas ~200 GB/year) –No write access for old data Measured data is strictly historical data –Infrequent read access to old data? Multi-version data: not a candidate? –e.g. calibration, alignment –Relatively small data samples –Write access for old data Compute new calibrations using new algorithms –Frequent read access to old data

CERN - IT Department CH-1211 Genève 23 Switzerland t DB Workshop – A. Valassi COOL Status - 5 Homemade partitioning COOL database federation –Attach validity range to each COOL ‘database’ –Federation (‘super-database’) includes several databases covering different validity ranges MUST ensure there is no overlap between databases: start- and end-of-validity must fall within same partition This partitioning is not ‘transparent’ –New metadata tables needed in COOL schema Keep track of which data is in which COOL database

CERN - IT Department CH-1211 Genève 23 Switzerland t DB Workshop – A. Valassi COOL Status - 6 Oracle partitioning Transparent partitioning –No new tables needed in COOL schema –Data management using Oracle tools (add/split, offline, read-only, transportable tablespaces…) Table and index partitioning –Partitioning key: start-of-validity (IOV_SINCE) Better choose partition boundaries so that end-of- validity (IOV_UNTIL) also falls within the same partition –Must use local (partitioned) indexes Global indexes hinder partition-wise data management –Technicality: must declare tables as partitioned at creation time (not an ‘alter table’ property)

CERN - IT Department CH-1211 Genève 23 Switzerland t DB Workshop – A. Valassi COOL Status - 7 Oracle partitioning – performance IOV retrieval from a partitioned table –Ideally, Oracle partition pruning ensures that: First, the relevant partition is located Then, the IOV is located within the partition –Do not expect large (any?) benefit over non- partitioned query if indexes were already used Loop over partitions vs. loop over index branches Test with 3 samples of ‘Atlas 10 years’ –Non-partitioned –Partitioned tables, global indexes –Partitioned tables, local indexes –Work done by Romain Basset

CERN - IT Department CH-1211 Genève 23 Switzerland t DB Workshop – A. Valassi COOL Performance - 8 Oracle partitioning – tests Observe slight performance degradation –Caution: work in progress!!! –Loop on partitions less efficient than index scan? –Complex SQL with SELECT MAX() subquery –Differences between 10g and 11g also observed Affected by bug fixed in / ? Query response time is flat within each partition, but increases from one partition to the next

CERN - IT Department CH-1211 Genève 23 Switzerland t DB Workshop – A. Valassi COOL Status - 9 Conclusions Two options are being considered –Custom partitioning (database ‘federation’) No work in progress on this option –Oracle partitioning Performance tests are in progress They are not mutually exclusive –Internally, a database in a COOL federation can be partitioned using Oracle partitioning

CERN - IT Department CH-1211 Genève 23 Switzerland t DB Workshop – A. Valassi COOL Status - 10 Reserve slides

CERN - IT Department CH-1211 Genève 23 Switzerland t DB Workshop – A. Valassi COOL data model Modeling of conditions data objects –System-managed common “metadata” Data items: many tables, each with many channels Interval of validity - “IOV” [since, until] Versioning information - with handling of interval overlaps –User-defined schema for “data payload” Support for fields of simple C++ types Main use case: event reconstruction –Lookup data payload valid at a given event time COOL Status - 11