The POOL Persistency Framework POOL Summary and Plans.

Slides:



Advertisements
Similar presentations
Object Persistency & Data Handling Session C - Summary Object Persistency & Data Handling Session C - Summary Dirk Duellmann.
Advertisements

Data Management TEG Status Dirk Duellmann & Brian Bockelman WLCG GDB, 9. Nov 2011.
Prescriptive Process models
1 User Analysis Workgroup Update  All four experiments gave input by mid December  ALICE by document and links  Very independent.
D. Düllmann - IT/DB LCG - POOL Project1 POOL Release Plan for 2003 Dirk Düllmann LCG Application Area Meeting, 5 th March 2003.
Software Development Overview CPSC 315 – Programming Studio Spring 2009.
SEAL V1 Status 12 February 2003 P. Mato / CERN Shared Environment for Applications at LHC.
SPI Software Process & Infrastructure EGEE France - 11 June 2004 Yannick Patois
EGEE is a project funded by the European Union under contract IST JRA1 Testing Activity: Status and Plans Leanne Guy EGEE Middleware Testing.
©Ian Sommerville 2004Software Engineering, 7th edition. Chapter 4 Slide 1 Software Processes.
David Adams ATLAS ATLAS Distributed Analysis David Adams BNL March 18, 2004 ATLAS Software Workshop Grid session.
DOSAR Workshop, Sao Paulo, Brazil, September 16-17, 2005 LCG Tier 2 and DOSAR Pat Skubic OU.
Workshop Summary (my impressions at least) Dirk Duellmann, CERN IT LCG Database Deployment & Persistency Workshop.
LCG Application Area Internal Review Persistency Framework - Project Overview Dirk Duellmann, CERN IT and
CERN Physics Database Services and Plans Maria Girone, CERN-IT
Chapter 2 Software processes. Topics covered Software process models Process activities Coping with change.
LCG-SPI: SW-Testing LCG AppArea internal review (20/10/03)
Distribution and components. 2 What is the problem? Enterprise computing is Large scale & complex: It supports large scale and complex organisations Spanning.
POOL Status and Plans Dirk Düllmann IT-DB & LCG-POOL Application Area Meeting 10 th March 2004.
20/09/2006LCG AA 2006 Review1 Committee feedback to SPI.
Feedback from the POOL Project User Feedback from the POOL Project Dirk Düllmann, LCG-POOL LCG Application Area Internal Review October 2003.
SEAL Core Libraries and Services CLHEP Workshop 28 January 2003 P. Mato / CERN Shared Environment for Applications at LHC.
SEAL Project Core Libraries and Services 18 December 2002 P. Mato / CERN Shared Environment for Applications at LHC.
CASTOR evolution Presentation to HEPiX 2003, Vancouver 20/10/2003 Jean-Damien Durand, CERN-IT.
The POOL Persistency Framework POOL Project Review Introduction & Overview Dirk Düllmann, IT-DB & LCG-POOL LCG Application Area Internal Review October.
LHCbComputing Manpower requirements. Disclaimer m In the absence of a manpower planning officer, all FTE figures in the following slides are approximate.
6/23/2005 R. GARDNER OSG Baseline Services 1 OSG Baseline Services In my talk I’d like to discuss two questions:  What capabilities are we aiming for.
15 December 2015M. Lamanna “The ARDA project”1 The ARDA Project (meeting with the LCG referees) Massimo Lamanna CERN.
INFSO-RI Enabling Grids for E-sciencE ARDA Experiment Dashboard Ricardo Rocha (ARDA – CERN) on behalf of the Dashboard Team.
Gül Begüm ŞEMİS. Milestones Appointments Due dates Check lists Approval The FOAK Process Phase I2 Project Process.
G.Govi CERN/IT-DB 1 September 26, 2003 POOL Integration, Testing and Release Procedure Integration  Packages structure  External dependencies  Configuration.
D. Duellmann - IT/DB LCG - POOL Project1 The LCG Pool Project and ROOT I/O Dirk Duellmann What is Pool? Component Breakdown Status and Plans.
Oracle for Physics Services and Support Levels Maria Girone, IT-ADC 24 January 2005.
Grid Technology CERN IT Department CH-1211 Geneva 23 Switzerland t DBCF GT Upcoming Features and Roadmap Ricardo Rocha ( on behalf of the.
Data Placement Intro Dirk Duellmann WLCG TEG Workshop Amsterdam 24. Jan 2012.
D. Duellmann - IT/DB LCG - POOL Project1 The LCG Dictionary and POOL Dirk Duellmann.
Marco Cattaneo Core software programme of work Short term tasks (before April 2012) 1.
Summary of persistence discussions with LHCb and LCG/IT POOL team David Malon Argonne National Laboratory Joint ATLAS, LHCb, LCG/IT meeting.
Site Services and Policies Summary Dirk Düllmann, CERN IT More details at
David Adams ATLAS ATLAS Distributed Analysis (ADA) David Adams BNL December 5, 2003 ATLAS software workshop CERN.
CS223: Software Engineering Lecture 18: The XP. Recap Introduction to Agile Methodology Customer centric approach Issues of Agile methodology Where to.
1 Configuration Database David Forrest University of Glasgow RAL :: 31 May 2009.
POOL & ARDA / EGEE POOL Plans for 2004 ARDA / EGEE integration Dirk Düllmann, IT-DB & LCG-POOL LCG workshop, 24 March 2004.
David Adams ATLAS ATLAS Distributed Analysis and proposal for ATLAS-LHCb system David Adams BNL March 22, 2004 ATLAS-LHCb-GANGA Meeting.
D. Duellmann - IT/DB LCG - POOL Project1 Internal Pool Release V0.2 Dirk Duellmann.
LHCC Referees Meeting – 28 June LCG-2 Data Management Planning Ian Bird LHCC Referees Meeting 28 th June 2004.
Follow-up to SFT Review (2009/2010) Priorities and Organization for 2011 and 2012.
D. Duellmann, IT-DB POOL Status1 POOL Persistency Framework - Status after a first year of development Dirk Düllmann, IT-DB.
David Adams ATLAS ADA: ATLAS Distributed Analysis David Adams BNL December 15, 2003 PPDG Collaboration Meeting LBL.
JRA1 Meeting – 09/02/ Software Configuration Management and Integration EGEE is proposed as a project funded by the European Union under contract.
A quick summary and some ideas for the 2005 work plan Dirk Düllmann, CERN IT More details at
Comments on SPI. General remarks Essentially all goals set out in the RTAG report have been achieved. However, the roles defined (Section 9) have not.
POOL Based CMS Framework Bill Tanenbaum US-CMS/Fermilab 04/June/2003.
POOL Historical Notes POOL has been the most advanced and the most used AA project. Currently, excellent teamwork with experiments on new features and.
Databases and DBMSs Todd S. Bacastow January 2005.
EGEE Middleware Activities Overview
(on behalf of the POOL team)
Chapter 18 Maintaining Information Systems
SPI external software build tool and distribution mechanism
3D Application Tests Application test proposals
Distribution and components
POOL persistency framework for LHC
Dirk Düllmann CERN Openlab storage workshop 17th March 2003
First Internal Pool Release 0.1
Script-less Automation: An Approach to Shift-Left.
POOL/RLS Experience Current CMS Data Challenges shows clear problems wrt to the use of RLS Partially due to the normal “learning curve” on all sides in.
Data, Databases, and DBMSs
Simulation Framework Subproject cern
POOL Status & Release Plan for V0.4
Extreme Programming.
Presentation transcript:

The POOL Persistency Framework POOL Summary and Plans

The POOL Persistency FrameworkD.Duellmann2 Storage Manager Plans Mainly performance and consolidation, but…Mainly performance and consolidation, but… Current dictionary loading creates deployment problemsCurrent dictionary loading creates deployment problems –All class dictionaries need to be loaded when ROOT file is opened –ROOT provides functionality to relax this constraint POOL will work with ROOT team to make lazy dictionary loading available for POOL clients Some Object Model support requestedSome Object Model support requested –Embedded pointer to non-polymorph type – POOL should store objects based on the pointer type Recent DataService Review gave some input on enhancements/consolidation requests in that areaRecent DataService Review gave some input on enhancements/consolidation requests in that area

The POOL Persistency FrameworkD.Duellmann3 POOL Performance - first cut.. Has not really been optimised systematicallyHas not really been optimised systematically –But first results look reasonable We won’t be faster than ROOT We won’t create smaller files than ROOT –But we want to control the overhead we put on top of ROOT – comparing to ROOT in areas where root offers similar functionality POOL collection performance show clearly that POOL insulation overhead can be kept minimal –But we provide more functionality and flexibility than ROOT so comparing raw IO speed is sometimes comparing apples with pears

The POOL Persistency FrameworkD.Duellmann4 File Catalog Plans Moving into production environmentMoving into production environment Extension to allow for typical the Meta Data evolution use casesExtension to allow for typical the Meta Data evolution use cases –Eg A new meta data element is introduced during production Composite CatalogsComposite Catalogs –Accessing eg a local writable together with a shared read-only catalog –Eg a job reads some user files in addition to any file from the large experiment production

The POOL Persistency FrameworkD.Duellmann5 POOL Collection Futures First implementation(s) exist and are being picked up byFirst implementation(s) exist and are being picked up by Integration in experiment frameworks just startingIntegration in experiment frameworks just starting –How does one find a collection? –Is there a Collection Catalog (like the File Catalog)? A central one? –Is a File just a special Collection to the end user? Role of collections in a grid environment not clear yet - ARDARole of collections in a grid environment not clear yet - ARDA Collection implementation in POOL is a first stepCollection implementation in POOL is a first step –But the real issue is not the implementation but rather conceptual –Need active experiment involvement in this area

The POOL Persistency FrameworkD.Duellmann6 RDBMS Independence POOL should not be directly be coupled to just one RDBMS implementationPOOL should not be directly be coupled to just one RDBMS implementation MySQL++ is a painMySQL++ is a pain –Need a replacement for many additional reasons Performance constraint on collections implementation Product does not evolve anymore Dependent on internals of the GCC compiler –Hard to port mysql++ based code to icc/ecc Did a short “market survey”Did a short “market survey” Outcome proposal to move to OTL Started evaluation by providing two prototype implementations of MySQL FileCatalog and Collections Prototypes are now part of V1.4 internal releases cyclesPrototypes are now part of V1.4 internal releases cycles

The POOL Persistency FrameworkD.Duellmann7 Infrastructure & Testing Move to QMtestMove to QMtest –To align with other LCG projects Will soon have to add several new platformsWill soon have to add several new platforms –icc ecc and VC for portability check of POOL code and also as additional development platform Fully Automate Data Format RegressionFully Automate Data Format Regression –Highest priority now as experiment data is produced Add traceability between bug reports and release contents and release validation testsAdd traceability between bug reports and release contents and release validation tests –In collaboration with SPI

The POOL Persistency FrameworkD.Duellmann8Summary POOL has delivered a functional persistency framework and has been integrated into frameworks of CMS, ATLAS and soon LHCbPOOL has delivered a functional persistency framework and has been integrated into frameworks of CMS, ATLAS and soon LHCb –Currently used for test productions in CMS –Possibly with more effort than integration teams expected POOL as a development team works well and would profit more from insuring stability than additional manpowerPOOL as a development team works well and would profit more from insuring stability than additional manpower –Some central positions inside POOL are more difficult to back up, but we remained productive even through vacation periods overlapping with experiment integrations POOL operates close to its release planPOOL operates close to its release plan –Following “release early, release often” strategy –Many experiment requirements have been clarified and agreed only during experiment integration phase rather than upfront Many thanks toMany thanks to –all developers working on the project for their commitment –all experiment integration teams for their patience and very constructive feedback!