Updates from Database Services at CERN Andrei Dumitru CERN IT Department / Database Services.

Slides:



Advertisements
Similar presentations
ITEC474 INTRODUCTION.
Advertisements

Cloud Computing: Theirs, Mine and Ours Belinda G. Watkins, VP EIS - Network Computing FedEx Services March 11, 2011.
System Center 2012 R2 Overview
Introduction to DBA.
High Availability Group 08: Võ Đức Vĩnh Nguyễn Quang Vũ
CERN IT Department CH-1211 Geneva 23 Switzerland t Marcin Blaszczyk, IT-DB Atlas standby database tests February.
Evolution of database services
Replication Technologies at WLCG Lorena Lobato Pardavila CERN IT Department – DB Group JINR/CERN Grid and Management Information Systems, Dubna (Russia)
VMware vCenter Server Module 4.
BNL Oracle database services status and future plans Carlos Fernando Gamboa RACF Facility Brookhaven National Laboratory, US Distributed Database Operations.
© Hitachi Data Systems Corporation All rights reserved. 1 1 Det går pænt stærkt! Tony Franck Senior Solution Manager.
© 2009 Oracle Corporation. S : Slash Storage Costs with Oracle Automatic Storage Management Ara Vagharshakian ASM Product Manager – Oracle Product.
CERN IT Department CH-1211 Geneva 23 Switzerland t T0 report WLCG operations Workshop Barcelona, 07/07/2014 Maite Barroso, CERN IT.
High Availability & Oracle RAC 18 Aug 2005 John Sheaffer Platform Solution Specialist
CERN IT Department CH-1211 Genève 23 Switzerland t Next generation of virtual infrastructure with Hyper-V Michal Kwiatek, Juraj Sucik, Rafal.
Scale-out databases for CERN use cases Strata Hadoop World London 6 th of May,2015 Zbigniew Baranowski, CERN IT-DB.
Selling the Database Edition for Oracle on HP-UX November 2000.
13 October 2014 Eric Grancher, head of database services, CERN IT Manuel Martin Marquez, data scientist, CERN openlab.
SANPoint Foundation Suite HA Robert Soderbery Sr. Director, Product Management VERITAS Software Corporation.
Database Services for Physics at CERN with Oracle 10g RAC HEPiX - April 4th 2006, Rome Luca Canali, CERN.

Oracle Application Server 10g (9.0.4) Recommended Topologies Pavana Jain.
Bob Thome, Senior Director of Product Management, Oracle SIMPLIFYING YOUR HIGH AVAILABILITY DATABASE.
Status of WLCG Tier-0 Maite Barroso, CERN-IT With input from T0 service managers Grid Deployment Board 9 April Apr-2014 Maite Barroso Lopez (at)
CERN - IT Department CH-1211 Genève 23 Switzerland t The High Performance Archiver for the LHC Experiments Manuel Gonzalez Berges CERN, Geneva.
CERN - IT Department CH-1211 Genève 23 Switzerland t Tier0 database extensions and multi-core/64 bit studies Maria Girone, CERN IT-PSS LCG.
ORACLE GOLDENGATE AT CERN
Business Intelligence Appliance Powerful pay as you grow BI solutions with Engineered Systems.
CERN IT Department CH-1211 Geneva 23 Switzerland t Daniel Gomez Ruben Gaspar Ignacio Coterillo * Dawid Wojcik *CERN/CSIC funded by Spanish.
CERN Physics Database Services and Plans Maria Girone, CERN-IT
CERN - IT Department CH-1211 Genève 23 Switzerland t Oracle Real Application Clusters (RAC) Techniques for implementing & running robust.
Distributed Data Management Graeme Kerr Oracle in R&D Programme.
Marcin Blaszczyk, Zbigniew Baranowski – CERN Outline Overview & Architecture Use Cases for Our experience with ADG and lessons learned Conclusions.
CERN IT Department CH-1211 Genève 23 Switzerland t Possible Service Upgrade Jacek Wojcieszuk, CERN/IT-DM Distributed Database Operations.
CERN Database Services for the LHC Computing Grid Maria Girone, CERN.
Hadoop IT Services Hadoop Users Forum CERN October 7 th,2015 CERN IT-D*
1 D0 Taking Stock By Anil Kumar CD/LSCS/DBI/DBA June 11, 2007.
CERN - IT Department CH-1211 Genève 23 Switzerland t High Availability Databases based on Oracle 10g RAC on Linux WLCG Tier2 Tutorials, CERN,
Database Competence Centre openlab Major Review Meeting nd February 2012 Maaike Limper Zbigniew Baranowski Luigi Gallerani Mariusz Piorkowski Anton.
CERN IT Department CH-1211 Geneva 23 Switzerland t WLCG Operation Coordination Luca Canali (for IT-DB) Oracle Upgrades.
BNL Oracle database services status and future plans Carlos Fernando Gamboa, John DeStefano, Dantong Yu Grid Group, RACF Facility Brookhaven National Lab,
Maria Girone CERN - IT Tier0 plans and security and backup policy proposals Maria Girone, CERN IT-PSS.
Scalable data access with Impala Zbigniew Baranowski Maciej Grzybek Daniel Lanza Garcia Kacper Surdy.
CNAF Database Service Barbara Martelli CNAF-INFN Elisabetta Vilucchi CNAF-INFN Simone Dalla Fina INFN-Padua.
LHC Logging Cluster Nilo Segura IT/DB. Agenda ● Hardware Components ● Software Components ● Transparent Application Failover ● Service definition.
Enterprise Manager at CERN Andrei Dumitru IT Department / Database Services / openlab.
BNL dCache Status and Plan CHEP07: September 2-7, 2007 Zhenping (Jane) Liu for the BNL RACF Storage Group.
© 2006 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice Database Growth: Problems & Solutions.
DB Questions and Answers open session (comments during session) WLCG Collaboration Workshop, CERN Geneva, 24 of April 2008.
Data Analytics and Hadoop Service in IT-DB Visit of Cloudera - April 19 th, 2016 Luca Canali (CERN) for IT-DB.
Calgary Oracle User Group
Univa Grid Engine Makes Work Management Automatic and Efficient, Accelerates Deployment of Cloud Services with Power of Microsoft Azure MICROSOFT AZURE.
RHEV Platform at LHCb Red Hat at CERN 17-18/1/17
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING CLOUD COMPUTING
IT Services Katarzyna Dziedziniewicz-Wojcik IT-DB.
Database Services Katarzyna Dziedziniewicz-Wojcik On behalf of IT-DB.
Database Services Katarzyna Dziedziniewicz-Wojcik On behalf of IT-DB.
Future Database Challenges
LCG 3D Distributed Deployment of Databases
Evolution of Data(base) Replication Technologies for WLCG
Database Services at CERN Status Update
Database Workshop Report
Couchbase Server is a NoSQL Database with a SQL-Based Query Language
NGS Oracle Service.
Scalable Database Services for Physics: Oracle 10g RAC on Linux
Oracle Storage Performance Studies
Overview Introduction VPS Understanding VPS Architecture
CERN DB Services: Status, Activities, Announcements
Scalable Database Services for Physics: Oracle 10g RAC on Linux
PerformanceBridge Application Suite and Practice 2.0 IT Specifications
Presentation transcript:

Updates from Database Services at CERN Andrei Dumitru CERN IT Department / Database Services

Credit: Mariusz Piorkowski

Databases at CERN ~100 Oracle databases, most of them RAC Mostly NAS storage plus some SAN with ASM More than 500 TB of data files for production DBs in total Example of critical production DBs: LHC logging database ~170 TB, expected growth up to ~70 TB / year But also as DBaaS, as single instances MySQL Open community databases PostgreSQL databases Oracle 11g 4

Accelerator logging 5 ~ 95% data reduction Millions of records/min ~ signals ~ 15 data loading processes ~ 5.5 billion records/day ~ 275 GB/day  100 TB/year throughput ~ 1 million signals ~ 300 data loading processes ~ 4 billion records/day ~ 160 GB/day  52 TB/year stored Credit: C. Roderick

Administrative systems All applications are based on Oracle as database Oracle Weblogic server manages numerous HR and administrative Java-based web applications used by CERN Java EE and Apex Oracle HR is E-Business Suite R12 6

Engineering & equipment data Managing a million components over a lifecycle of 30 years Integrated PLM platform by linking together commercial systems 7 3D CAD CATIA Design data management Manufacturing follow-up Installation follow-up Maintenance management Data publishing Workflow actions Credit: D. Widegren

Experiment systems Online Data-taking operations Rely on SCADA system to store and monitor the detector parameters (temperatures, voltages, …) Up to changes/second stored in Oracle databases Offline Post data-taking analysis, reconstruction and reprocessing, logging of critical operations, … Database replication: Oracle Streams, Oracle Golden Gate, active standby databases 8

New DBs Services QPSR Quench Protection System will store ~150 K rows/second (64GB per redo log) rows/second achieved during catchup tests need to keep data for a few days (~ 50 TB) Doubtful that previous HW could handle that SCADAR Consolidated WinCC/PVSS archive repository Will store ~50-60K rows/second (may increase in the future) the data retention varies depending on the application (from a few days to 5 years) 9

Service Lifecycle 10 HW MigrationSW upgrade Stable services Decomission New systems Installation Every ~3-4 years

Preparation for LHC Run 2 Requirement: changes had to fit LHC schedule New HW installation on critical power Decommission of some old HW Critical power move from current location to new location Keep up with Oracle SW evolution Applications’ evolution - more resources needed Integration with Agile LS1: no stop DB services 11

Service Evolution during LS1 Hardware evolution New DB servers Storage evolution New generation of Storage servers Refresh cycle of OS and OS related Puppet RHEL 6 Database Software evolution Upgrade to newer Oracle version 12

Oracle Real Application Clusters - Overview 13 Operating System Oracle RAC Instance 1 Operating System Oracle RAC Instance 2 Oracle Clusterware Public Network a.k.a Cluster Interconnect Oracle Database on Shared Storage Node 1 Node 2 Oracle Clusterware Storage Network Private Network Example of 2-node DB cluster

Our Deployment Model Database Clusters with RAC Servers Running Red Hat Enterprise Linux Storage NAS (Network-attached Storage) from NetApp High capacity SATA + SSD cache Network 10 Gig Ethernet - for Storage, Interconnect, Users Number of nodes: 2 – 5 14

New Hardware Consolidation of HW 100 Production Servers Dual 8 core XEON e GB/256GB RAM 3x10Gb interfaces Specific network requirements 1. IP1 (cs network) 2. ATLAS Pit 3. Technical network 4. Routed network accessible from outside of CERN, 5. Non-routed network only internal to CERN 15 Credit: Paul Smith

Storage Evolution NetApp FAS3240NetApp FAS8060 NVRAM1.0 GB8.0 GB System memory8GB64GB CPU1 x 64-bit 4-core 2.33 Ghz2 x 64-bit 8-core 2.10 Ghz SSD layer (maximum)512GB8TB Aggregate size180TB400TB OS controllerData ONTAP® 7-mode Data ONTAP® C-mode* 16 scaling up scaling out * Cluster made of 8 controllers. Shared with other services. Credit: Ruben Gaspar

Storage Evolution Centrally managed storage Monitored: Netapp + home made tools Enables consolidation Thin provisioning on file systems Transparent volume move More capacity for growth More SSD => Performance gains for DB service ~2-3 times more of overall performance 17 Credit: Ruben Gaspar

Advantage of the New Hardware More memory & more CPU MEM: RAM 48GB -> 128GB / 256GB DB Cache: 20GB -> 86GB / 197GB Faster storage Storage cache 18 HW Migration

Available Software releases Production databases on Oracle before LHC LS1 All databases were upgraded and migrated to new hardware Oracle 11g – version Terminal patch set of Oracle 11g Extended support ends January 2018 Oracle 12c - versions and First release of 12c and the subsequent patch set Users of will have to upgrade to or higher by 2016 No current Oracle version fits well the entire LHC Run 2. Preparation for LHC Run 2 - software 19

Consolidation Schema based consolidation Many applications share the same RAC cluster Consolidation - per customer and/or functionality Host based consolidation Run different DB services on the same machine Support for different Oracle homes (versions) on the same host RAC clusters Load Balancing and possibility to growth High Availability: cluster survives node failures Maintenance: scheduled rolling interventions 20

Replication Disaster Recovery Data Guard Databases in Wigner Active Data Guard available to users for read-only operations Streams to Golden Gate migration completed! Improved scalability – performance better than Streams ATLAS ONLINE – OFFLINE (condDB) ATLAS OFFLINE – TIER1s RAL IN2P3 TRIUMF LHCb ONLINE-OFFLIN 21

Scalable Databases Goal: open and scalable analytic platform for the data currently stored in traditional databases LHClog / control systems archives / other monitoring and auditing systems Solution - Hadoop cluster shared nothing - scalable open systems - many approaches on storing and processing the data Conclusions Data processing with Hadoop scales-out - no matter what engine you will use Choosing the right data format for storing certain data is a key to deliver high performance All the details in “Evaluation of distributed open source solutions in CERN database use cases” by Kacper Surdy (Tuesday) 22

Database on Demand

Database on Demand Covers a demand from CERN community not addressed by the Oracle service Different RDBMS: MySQL, PostgreSQL and Oracle Follows a DBaaS paradigm Making users database owners - full DBA privileges No access to underlying hardware No DBA support or application support No vendor support (except for Oracle) Foreseen as single instance service It provides tools to manage DBA actions: configuration, start/stop, upgrades, backups & recoveries, instance monitoring 24

Database on Demand Evolution of the amount of MySQL, Oracle and PostgreSQL instances in the DBoD service 25

Database on Demand instances per Database Management System 26

Enhanced Monitoring now available 27

Acknowledgements Work presented here on behalf of the CERN Database Services group In particular key contributions to this presentation from: Marcin Blaszczyk, Ruben Gaspar, Zbigniew Baranowski, Lorena Lobato Pardavila, Luca Canali, Eric Grancher 28