SAM - Sequential Data Access via Metadata Schema Metadata Functionality Workshop Glasgow University April 26-28,2004.

Slides:



Advertisements
Similar presentations
GridPP July 2003Stefan StonjekSlide 1 SAM middleware components Stefan Stonjek University of Oxford 7 th GridPP Meeting 02 nd July 2003 Oxford.
Advertisements

GTS MetaData Generation data GTS data bases GTS Switch Volume C1 Central Support Office Information Classes white-list Metadata Synchronization.
M. D'Amato, M. Mennea, L.Silvestris INFN-Bari CMS Data Model 9-11 Aprile 2001, Catania I Workshop INFN Grid CMS DATA MODEL M. D’Amato, M. Mennea, L. Silvestris.
CMS Applications Towards Requirements for Data Processing and Analysis on the Open Science Grid Greg Graham FNAL CD/CMS for OSG Deployment 16-Dec-2004.
Batch Production and Monte Carlo + CDB work status Janusz Martyniak, Imperial College London MICE CM37 Analysis, Software and Reconstruction.
DataGrid is a project funded by the European Union 22 September 2003 – n° 1 EDG WP4 Fabric Management: Fabric Monitoring and Fault Tolerance
Reconstruction and Analysis on Demand: A Success Story Christopher D. Jones Cornell University, USA.
GGF Toronto Spitfire A Relational DB Service for the Grid Peter Z. Kunszt European DataGrid Data Management CERN Database Group.
Relational DatabaseData Grid Oracle Sybase DB2 MySQL Others Integrasoft Avaki Others Data Management Tables Query Language Procedures Locking Indexing.
David Adams ATLAS DIAL Distributed Interactive Analysis of Large datasets David Adams BNL June 23, 2003 GAE workshop Caltech.
Oxford Jan 2005 RAL Computing 1 RAL Computing Implementing the computing model: SAM and the Grid Nick West.
F Fermilab Database Experience in Run II Fermilab Run II Database Requirements Online databases are maintained at each experiment and are critical for.
The SAM-Grid Fabric Services Gabriele Garzoglio (for the SAM-Grid team) Computing Division Fermilab.
CLEO’s User Centric Data Access System Christopher D. Jones Cornell University.
L3 Filtering: status and plans D  Computing Review Meeting: 9 th May 2002 Terry Wyatt, on behalf of the L3 Algorithms group. For more details of current.
Workload Management WP Status and next steps Massimo Sgaravatto INFN Padova.
OASIS ebXML Registry Standard Open Forum 2003 on Metadata Registries 10:30 – 11:15 January 20, 2003 Kathryn Breininger The Boeing Company Chair, OASIS.
Operating Systems Advanced OS - E. OS Advanced Evaluating an Operating System.
S. Veseli - SAM Project Status SAMGrid Developments – Part I Siniša Veseli CD/D0CA.
The SAMGrid Data Handling System Outline:  What Is SAMGrid?  Use Cases for SAMGrid in Run II Experiments  Current Operational Load  Stress Testing.
Remote Production and Regional Analysis Centers Iain Bertram 24 May 2002 Draft 1 Lancaster University.
3 Sept 2001F HARRIS CHEP, Beijing 1 Moving the LHCb Monte Carlo production system to the GRID D.Galli,U.Marconi,V.Vagnoni INFN Bologna N Brook Bristol.
5 November 2001GridPP Collaboration Meeting1 CDF and the Grid Requirements and Anti-Requirements CDF-o-Centric View Proposal Conclusion: CDF/D0 Deliverables.
03/27/2003CHEP20031 Remote Operation of a Monte Carlo Production Farm Using Globus Dirk Hufnagel, Teela Pulliam, Thomas Allmendinger, Klaus Honscheid (Ohio.
CDF data production models 1 Data production models for the CDF experiment S. Hou for the CDF data production team.
November 7, 2001Dutch Datagrid SARA 1 DØ Monte Carlo Challenge A HEP Application.
Building a distributed software environment for CDF within the ESLEA framework V. Bartsch, M. Lancaster University College London.
CDF Grid Status Stefan Stonjek 05-Jul th GridPP meeting / Durham.
D0 SAM – status and needs Plagarized from: D0 Experiment SAM Project Fermilab Computing Division.
3rd June 2004 CDF Grid SAM:Metadata and Middleware Components Mòrag Burgon-Lyon University of Glasgow.
QCDGrid Progress James Perry, Andrew Jackson, Stephen Booth, Lorna Smith EPCC, The University Of Edinburgh.
Nick Brook Current status Future Collaboration Plans Future UK plans.
1 st December 2003 JIM for CDF 1 JIM and SAMGrid for CDF Mòrag Burgon-Lyon University of Glasgow.
Databases E. Leonardi, P. Valente. Conditions DB Conditions=Dynamic parameters non-event time-varying Conditions database (CondDB) General definition:
The huge amount of resources available in the Grids, and the necessity to have the most up-to-date experimental software deployed in all the sites within.
Data Grid projects in HENP R. Pordes, Fermilab Many HENP projects are working on the infrastructure for global distributed simulated data production, data.
ORBMeeting July 11, Outline SAM Overview and Station description Resource Management Station Cache Station Prioritized Fair Share Job Control File.
9 February 2000CHEP2000 Paper 3681 CDF Data Handling: Resource Management and Tests E.Buckley-Geer, S.Lammel, F.Ratnikov, T.Watts Hardware and Resources.
Status of the LHCb MC production system Andrei Tsaregorodtsev, CPPM, Marseille DataGRID France workshop, Marseille, 24 September 2002.
Dzero MC production on LCG How to live in two worlds (SAM and LCG)
EGEE is a project funded by the European Union under contract IST HEP Use Cases for Grid Computing J. A. Templon Undecided (NIKHEF) Grid Tutorial,
What is SAM-Grid? Job Handling Data Handling Monitoring and Information.
Data reprocessing for DZero on the SAM-Grid Gabriele Garzoglio for the SAM-Grid Team Fermilab, Computing Division.
INTRODUCTION TO DBS Database: a collection of data describing the activities of one or more related organizations DBMS: software designed to assist in.
DØ Data Handling & Access The DØ Meta-Data Browser Pushpa Bhat Fermilab June 4, 2001.
Lee Lueking 1 The Sequential Access Model for Run II Data Management and Delivery Lee Lueking, Frank Nagy, Heidi Schellman, Igor Terekhov, Julie Trumbo,
Management Information Systems, 4 th Edition 1 Chapter 8 Data and Knowledge Management.
Database Server Concepts and Possibilities Lee Lueking D0 Data Browser Workshop April 8, 2002.
1 Andrea Sciabà CERN Critical Services and Monitoring - CMS Andrea Sciabà WLCG Service Reliability Workshop 26 – 30 November, 2007.
DBS/DLS Data Management and Discovery Lee Lueking 3 December, 2006 Asia and EU-Grid Workshop 1-4 December, 2006.
UTA MC Production Farm & Grid Computing Activities Jae Yu UT Arlington DØRACE Workshop Feb. 12, 2002 UTA DØMC Farm MCFARM Job control and packaging software.
CD FY09 Tactical Plan Status FY09 Tactical Plan Status Report for Neutrino Program (MINOS, MINERvA, General) Margaret Votava April 21, 2009 Tactical plan.
Analysis Tools at D0 PPDG Analysis Grid Computing Project, CS 11 Caltech Meeting Lee Lueking Femilab Computing Division December 19, 2002.
November 1, 2004 ElizabethGallas -- D0 Luminosity Db 1 D0 Luminosity Database: Checklist for Production Elizabeth Gallas Fermilab Computing Division /
OASIS ebXML Registry Standard Open Forum 2003 on Metadata Registries 10:30 – 11:15 January 20, 2003 Kathryn Breininger The Boeing Company Chair, OASIS.
D0 File Replication PPDG SLAC File replication workshop 9/20/00 Vicky White.
Grid Activities in CMS Asad Samar (Caltech) PPDG meeting, Argonne July 13-14, 2000.
Alien and GSI Marian Ivanov. Outlook GSI experience Alien experience Proposals for further improvement.
Simulation Production System Science Advisory Committee Meeting UW-Madison March 1 st -2 nd 2007 Juan Carlos Díaz Vélez.
A Data Handling System for Modern and Future Fermilab Experiments Robert Illingworth Fermilab Scientific Computing Division.
Jianming Qian, UM/DØ Software & Computing Where we are now Where we want to go Overview Director’s Review, June 5, 2002.
FIFE Architecture Figures for V1.2 of document. Servers Desktops and Laptops Desktops and Laptops Off-Site Computing Off-Site Computing Interactive ComputingSoftware.
How to run MC on the D0 farm. Steps On hoeve (user fbsuser) –Get MC request –Create macro –Submit jobs On schuur (user willem) –Store files into SAM –Clear.
Simulation Production System
Moving the LHCb Monte Carlo production system to the GRID
LCG 3D Distributed Deployment of Databases
Technical Capabilities
On Parametric Obligation Policies: Enabling Privacy-aware Information Lifecycle Management in Enterprises IEEE Policy Workshop 2007 Marco Casassa Mont.
Belle II experiment Requirement of data handling system Belle II Metadata service system Data Cache system at Belle II experiment Summary.
Database SQL.
Presentation transcript:

SAM - Sequential Data Access via Metadata Schema Metadata Functionality Workshop Glasgow University April 26-28,2004

Sam Services Experiment Specific Runs Luminosity Streams & Triggers Events (D0) Core Files Processes Cache/Resource Mgt (Stations) Job Handling Job Request (MC) Batch Processing General Support Tables Metadata Relational DB Metadata Query

Metadata Query Language (Dimensions) Is a metadata service. Associates keyword value pairs to their representations on the experiment independent databases. Allows definition of metadata within the query service. The constraints per dimension are needed to discover the relevant physics metadata. Is strongly enhanced by the dynamic parameter definition mechanisms associated with the job request services.

Files Are the heart and soul of Sam, they are the complete file metadata catalog. Also, maintains volume information for enstore (Should move to that specific SE). Files Job Request Luminosity Stream& Triggers Events Batch Processing Processes Stations General Support Runs

Processes Process metadata stores –application, version –status of file processing (requested, delivered, crashed, ok) Files Job Request Luminosity Stream& Triggers Events Batch Processing Processes Stations General Support Runs

Stations Caching –All locations, sizes, for a file on all hardware –State: locked, available –Algorithm (policy) Resource Management –Admins (control resources) –Cache quota by group –Station rules (cache space, project limits) Files Job Request Luminosity Stream& Triggers Events Batch Processing Processes Stations General Support Runs

Production Job Requests (MC & Batch) Processes monte carlo & farm requests. Proposal to unify submission services. File metadata predefined by request metadata. Files Job Request Luminosity Stream& Triggers Events Batch Processing Processes Stations General Support Runs

General Support Authentication –Who am I? –What group am I in? –What can I do? –Grid Subjects Fabric Definition –Nodes –Operating systems –Hardware Files Job Request Luminosity Stream& Triggers Events Batch Processing Processes Stations General Support Runs

Tracks run numbers from online (key to Experiment-specific Online metadata). Maintains association between events and files. Files Job Request Luminosity Stream& Triggers Events Batch Processing Processes Stations General Support Runs

Luminosity Streams & Triggers Two sections in sam dealing with luminosity, and streams & triggers. Files Job Request Luminosity Stream& Triggers Events Batch Processing Processes Stations General Support Runs

Events Tracking of events within a file Metadata for each event Huge volumes of data (D0: 17M evts/wk) Files Job Request Luminosity Stream& Triggers Events Batch Processing Processes Stations General Support Runs