ATLAS 3D Requests LCG3D Kickoff Workshop CERN, Geneva, Switzerland December 13, 2004 Alexandre Vaniachine, Jeff Tseng, Andrea Formica.


LCG3D Kickoff Workshop December 13-15, 2004 Vaniachine, Tseng, Formica

Outline
- ATLAS Database Project and LCG3D
- Migration to Oracle and 2005 data volume
- 3D data inventory efforts
- Inventory: data vs. database applications
- Parade of ATLAS database applications
  - Offline
  - Technical Coordination (detector production DB)
  - Online
- Conclusions

ATLAS Database Project
- All ATLAS database domains are now under the umbrella of the new Database Project
- Launched in May 2004
- Led by Richard Hawkings and Torre Wenaus
- Emphasis on integration of different domains
- Provides representatives to LCG3D

ATLAS and LCG3D
- ATLAS makes use of a distributed infrastructure of relational databases for access to various types of non-event data and event metadata
- Distribution, replication, and synchronization of these databases must be supported according to the needs of the various database service client applications
- The databases are likely to employ more than one database technology:
  - Oracle in larger centers
  - MySQL in smaller settings and for smaller applications
- ATLAS favors common LHC-wide solutions and will work with the LCG3D and the other experiments to define and deploy common solutions

Migration to Oracle
- ATLAS has acquired significant experience in MySQL database services, which have been critical for:
  - 2004 Combined Test Beam effort
  - Current Data Challenge 2 efforts
- Following CERN policy and encouragement to concentrate our database usage around Oracle, we are making increasing use of these services and support

Volume Request for 2005
- 'Pseudo-Cocotime' request made in October 2004
- Combined with CPU, AFS, tapes requests made by G Poulard
- Volumes given as estimated raw data size, not including backup, indexing, mirroring

Usage                        2005 space (GB)
Geometry primary numbers     few
Conditions data              100
TechCoord production data    100
Event meta-data              250
3D project data              ?
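
The quantified part of the request above can be totalled with a short sketch. The dictionary layout and function names are illustrative assumptions; the two entries the slide leaves unquantified ("few GB" and "?") are kept as unknowns rather than guessed.

```python
# Stated 2005 volume requests in GB (raw data size, as listed in the table).
# None marks entries for which the slide gives no number.
requests_gb = {
    "Geometry primary numbers": None,    # slide says "few GB"
    "Conditions data": 100,
    "TechCoord production data": 100,
    "Event meta-data": 250,
    "3D project data": None,             # slide says "?"
}


def quantified_total(requests):
    """Sum only the entries that carry an explicit number."""
    return sum(v for v in requests.values() if v is not None)


def unquantified(requests):
    """Names of the entries that still lack a number."""
    return [name for name, v in requests.items() if v is None]


print("Quantified total (GB):", quantified_total(requests_gb))
print("Still to be estimated:", unquantified(requests_gb))
```

The total is a lower bound on the raw request: it excludes the unquantified entries as well as the backup, indexing and mirroring overheads that the slide explicitly leaves out.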

3D Data Inventory
- ATLAS started collecting data inventory and submitted required templates to 3D
  - A very sound initiative from 3D
- Focus is now shifting from simple data volume estimates to database applications inventory efforts
- ATLAS parades its first database application ready for distribution at this workshop
  - Vakhtang Tsulaia talk: ATLAS Detector Description DB
- Note: not only distributed applications must be counted
- A sample of ATLAS database applications follows next

ATLAS Database Applications
1) Inventory for (broadly defined) offline sw domain
- Tag Collector – software infrastructure support
- AMI – Bookkeeping DB (file-level metadata)
- Production system DB
- Detector Description DB
- ConditionsDB
  - IOV DB
  - Conditions data payload DB
- Reliable File Transfer DB – ATLAS DMS: DQ
- File Catalogs: POOL, RLS,…
- Collections DB (event-level metadata)

Tag Collector DB
- Supported by LPSC Grenoble (Solveig Albrand, Jérôme Fulachier)
- Currently in MySQL
- Critical application for software development infrastructure
- Migration to CERN Oracle is planned
- Not distributed, but a geographically separate replica is required as a failover service
- Application is being merged into the AMI framework
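
The failover requirement above can be illustrated from the client side. This is a minimal sketch under stated assumptions, not the Tag Collector's actual mechanism: `connect_with_failover`, `primary_down` and `replica` are hypothetical names, and an in-memory SQLite database stands in for the replica server.

```python
import sqlite3


def connect_with_failover(connectors):
    """Try each (name, connect) pair in order; return the first that succeeds."""
    errors = []
    for name, connect in connectors:
        try:
            return name, connect()
        except Exception as exc:  # any failure moves on to the next server
            errors.append((name, exc))
    raise ConnectionError(f"all servers failed: {errors}")


def primary_down():
    # Hypothetical stand-in for an unreachable primary server.
    raise OSError("primary unreachable")


def replica():
    # Hypothetical stand-in for the geographically separate replica.
    return sqlite3.connect(":memory:")


name, conn = connect_with_failover([("primary", primary_down), ("replica", replica)])
print("connected to:", name)
```

Ordering the list puts the preferred server first, so the replica is contacted only when the primary fails.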

ATLAS Metadata Interface
- AMI is supported by LPSC Grenoble (Solveig Albrand, Jérôme Fulachier)
- Currently in MySQL
- Bookkeeping metadata database for:
  - DCs – Data Challenges
  - CTB – Combined Test Beam
  - ADA – ATLAS Distributed Analysis
- Migration to Oracle in 2005 (possibly not at CERN)
- Distributed Oracle services are planned
  - CERN replica server could be a failover node

Production DB
- ATLAS Production System database
  - Data processing tasks configuration, jobs submission, bookkeeping
- Supported by CERN (Luc Goossens)
- MySQL prototypes used in DC1
- Designed in Oracle for DC2
  - Hosted on pdb01
- Provided first Oracle operational experience in ATLAS

Production DB book-keeping remarks
- Significant performance problems seen
  - Some addressed with improved and tuned applications
  - Some show need for more servers, dedicated to specific tasks
- What lessons can we learn from this?
  - DB volume is not the whole story, usage patterns place heavy demands on the service
  - IT application consultancy needs to be involved to avoid poor design / inefficient DB usage
  - In the imperfect distributed world, load estimates are going to be very approximate – IT needs to provide appropriate server capacity headroom

Lifecycle Patterns
- Disclaimer: observations by an outsider, not by the prodsys team
- Focus is on general patterns: what we should be ready for in the next ATLAS database application deployments
- ATLAS ProductionDB operational experience:
  - Initially designed for a few writers: the writer count exceeded expectations by a factor of ten
  - Initially not designed for monitoring: used heavily to monitor production progress
  - Best Practices are hard to follow under pressure of DC2
- Should the database application be ready for:
  - ten times more users than was foreseen
  - use in a data access/query pattern never foreseen
- Emerging/familiar pattern? (Already in Murphy's laws?)

Detector Description DB
- DDDB – primary numbers for Detector Description
- Supported by U Pittsburgh (Joe Boudreau, Vakhtang Tsulaia)
- Initially in MySQL (NOVA)
- Redesigned for Oracle
  - Focus on Best Practices from the start
  - Fruitful interactions with IT/DB
- Being deployed for DC2 production use
  - Served data to thousands of reconstruction jobs on worldwide grids
- Replication tools
  - JDBC-based (Julius Hrivnac, see his talk at workshop)
  - RAL-based (Vakhtang Tsulaia), work in progress
- First ATLAS distributed database application ready for 3D testbed
- Expected to produce high (read-only) server load in production

ConditionsDB
- Note: separation of IOV data and payload data (NovaBlob DB)
- Supported by Lisbon group (led by Antonio Amorim) and BNL/ANL team (Hong Ma, Alexandre Vaniachine)
- Both currently in MySQL (with payload also in POOL files)
- Integrated:
  - Web browser: CondDBrowser
  - Replication tools (Sven Schmidt)
- Served well during Combined Test Beam
- This week: first large scale production exercise
- Deployed for Commissioning activities at point 1
- Database applications requirements:
  - Must be replicated within T0: online-offline
  - Have to be distributed beyond T0 (scale to be defined)
- Expecting Oracle implementation in March
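
The separation of IOV (interval of validity) data from payload data noted above can be sketched with two tables. This is an illustration only, assuming a hypothetical schema, folder name and values; it is not the actual ConditionsDB or NovaBlob layout, and SQLite is used purely as a self-contained stand-in.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE payload (id INTEGER PRIMARY KEY, data TEXT);
CREATE TABLE iov (
    folder     TEXT,
    since      INTEGER,   -- start of validity (inclusive)
    until      INTEGER,   -- end of validity (exclusive)
    payload_id INTEGER REFERENCES payload(id)
);
""")

# Hypothetical payloads and the intervals during which each is valid.
conn.executemany("INSERT INTO payload VALUES (?, ?)",
                 [(1, "HV=1500"), (2, "HV=1520")])
conn.executemany("INSERT INTO iov VALUES (?, ?, ?, ?)",
                 [("/demo/hv", 0, 100, 1), ("/demo/hv", 100, 200, 2)])


def lookup(conn, folder, t):
    """Return the payload valid at time t in the given folder, or None."""
    row = conn.execute(
        "SELECT p.data FROM iov AS i JOIN payload AS p ON p.id = i.payload_id "
        "WHERE i.folder = ? AND i.since <= ? AND ? < i.until",
        (folder, t, t),
    ).fetchone()
    return row[0] if row else None


print(lookup(conn, "/demo/hv", 50))
```

Keeping the interval table small and separate means a client resolves the time lookup first and fetches only the one payload it needs, which may live in another table, another database, or a file.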

Data Management System
- ATLAS Data Management System – Don Quixote (Miguel Branco) elegantly integrates a zoo of database applications:
- Mostly File Catalogs: POOL, RLS,…
  - Mixture of MySQL and Oracle
- Reliable File Transfer DB
  - Currently in MySQL (Miguel Branco)
  - Integrated in Globus GT4 RFT
- Not replicated/distributed (to our knowledge) except for failover needs

Collections DB
- This database application is from the POOL project
  - Significant ATLAS contributions (David Malon, Kristo Karr)
- Very data volume intensive
  - Event-level metadata (tag database)
- Currently in MySQL
- Heterogeneous replication tools (Julius Hrivnac)
- DC2 Collections data is ready for 3D testbed

Technical Coordination
- ATLAS equipment management database, used for managing racks, cables
  - Common project with CMS, runs on pdb servers – usage increasing as detector installation / commissioning is proceeding
- Use of MTF and EDMS for handling production data and documentation
  - User perception that database infrastructure is slow (but also slow interfaces)
- Subdetector production data in various technologies
  - Encouraging data migration to CERN Oracle for long term security
- No replication/distribution requests

Online application types
- DAQ Configuration
  - Tdaq current state (RelDB should replace XML files)
- Sub-detector Configuration (Electronics config, Cable mapping)
  - XML files or RelDB tables, versioning needed
- DCS (Temperature, HV, pressure,…)
  - Large use in H8 during 2004 testbeam (Condition DB)
  - Activity foreseen for 2005 during commissioning
- Calib/Align
  - Large amount of data in RelDB (used in Muon system align. also as data source, i.e. for determining the corrections)
  - Sensors data available during commissioning in 2005 (Bfield, Align.,..)
  - Usage of POOL(ROOT) files referenced in relational tables
- Monitoring
  - Histograms important during commissioning
  - ROOT files referenced in a relational DB

Online inventory remarks
- Inventory for 2005 (preliminary)
- Conditions Data: requirements from several subsystems already available (data volume/year)
  - Central DCS: ~4 GB
  - Muons (DCS+Align+BField): <15 GB
  - LAr: ~100 MB
- Configuration Data: not yet well defined
  - TDAQ: ~1 GB
- Monitoring data: mainly histogram files (large data volume)
  - Do they need replication?
- Number of clients and type of access still difficult to estimate
  - Extrapolation of H8 situation not always realistic
- To do: try to collect missing information and needs for replication/distribution in the online community

Conclusions
- ATLAS Oracle-CERN usage is now ramping up fast
- 2005 will see (at least) three very high profile activities:
  - Production DB for offline production: dozens of concurrent writers
  - Geometry DB for offline production: thousands of concurrent readers
  - Online (conditions) DB for commissioning: focussed at point 1
- In all cases, will need significant dedicated resources
  - Database problems, if any, will be very 'visible'
- Other smaller scale activities will continue
  - TAG database may require significant volumes