22nd January 2003Tim Adye1 Summary of Bookkeeping discussions at RAL Workshop Tim Adye Rutherford Appleton Laboratory Kanga Phone Meeting 22 nd January.

Slides:



Advertisements
Similar presentations
Object Persistency & Data Handling Session C - Summary Object Persistency & Data Handling Session C - Summary Dirk Duellmann.
Advertisements

© 2008 EBSCO Information Services SUSHI, COUNTER and ERM Systems An Update on Usage Standards Ressources électroniques dans les bibliothèques électroniques.
B A B AR and the GRID Roger Barlow for Fergus Wilson GridPP 13 5 th July 2005, Durham.
11th December 2002Tim Adye1 BaBar UK Grid Work Tim Adye Rutherford Appleton Laboratory BaBar Collaboration Meeting SLAC 11 th December 2002.
12th September 2002Tim Adye1 RAL Tier A Tim Adye Rutherford Appleton Laboratory BaBar Collaboration Meeting Imperial College, London 12 th September 2002.
13 December 2000Tim Adye1 New KanGA Export Scheme Tim Adye Rutherford Appleton Laboratory BaBar Collaboration Meeting Data Distribution Session 13 th December.
CSCE 3110 Data Structures & Algorithm Analysis
KANGA: ROOT Access to BABAR Data for Physics Analysis David Kirkby, UC Irvine for the BABAR Computing Group CHEP ‘03 - Data Management & Persistency 25.
©Brooks/Cole, 2003 Chapter 7 Operating Systems Dr. Barnawi.
The B A B AR G RID demonstrator Tim Adye, Roger Barlow, Alessandra Forti, Andrew McNab, David Smith What is BaBar? The BaBar detector is a High Energy.
Operating Systems (CSCI2413) Lecture 3 Processes phones off (please)
assumes basic arithmetic
Systems Life Cycle A summary of what needs to be done.
Code Management James N. Bellinger University of Wisconsin at Madison 19 January
25 February 2000Tim Adye1 Using an Object Oriented Database to Store BaBar's Terabytes Tim Adye Particle Physics Department Rutherford Appleton Laboratory.
GLAST LAT ProjectDOE/NASA Baseline-Preliminary Design Review, January 8, 2002 K.Young 1 LAT Data Processing Facility Automatically process Level 0 data.
Irina Sourikova Brookhaven National Laboratory for the PHENIX collaboration Migrating PHENIX databases from object to relational model.
1 The Instant Data Warehouse Released 15/01/ Hello and Welcome!! Today I am very pleased to announce the release of the 'Instant Data Warehouse'.
The HDF Group Multi-threading in HDF5: Paths Forward Current implementation - Future directions May 30-31, 2012HDF5 Workshop at PSI 1.
CSE 548 Advanced Computer Network Security Document Search in MobiCloud using Hadoop Framework Sayan Cole Jaya Chakladar Group No: 1.
8th November 2002Tim Adye1 BaBar Grid Tim Adye Particle Physics Department Rutherford Appleton Laboratory PP Grid Team Coseners House 8 th November 2002.
March 6, 2009Tofigh Azemoon1 Real-time Data Access Monitoring in Distributed, Multi Petabyte Systems Tofigh Azemoon Jacek Becla Andrew Hanushevsky Massimiliano.
Data Distribution and Management Tim Adye Rutherford Appleton Laboratory BaBar Computing Review 9 th June 2003.
9 February 2000CHEP2000 Paper 3681 CDF Data Handling: Resource Management and Tests E.Buckley-Geer, S.Lammel, F.Ratnikov, T.Watts Hardware and Resources.
Status of the LHCb MC production system Andrei Tsaregorodtsev, CPPM, Marseille DataGRID France workshop, Marseille, 24 September 2002.
CE Operating Systems Lecture 3 Overview of OS functions and structure.
Author - Title- Date - n° 1 Partner Logo EU DataGrid, Work Package 5 The Storage Element.
19th September 2003Tim Adye1 RAL Tier A Status Tim Adye Rutherford Appleton Laboratory BaBar UK Collaboration Meeting Royal Holloway 19 th September 2003.
Analysis Model Migration Plan Accessing the Mini Converting to the New (dual-read) Micro Producing the Skims Cleaning up the Tag Beta Development.
25th October 2006Tim Adye1 RAL Tier A Tim Adye Rutherford Appleton Laboratory BaBar UK Physics Meeting Queen Mary, University of London 25 th October 2006.
SkimData and Replica Catalogue Alessandra Forti BaBar Collaboration Meeting November 13 th 2002 skimData based replica catalogue RLS (Replica Location.
Analysis trains – Status & experience from operation Mihaela Gheata.
26 September 2000Tim Adye1 Data Distribution Tim Adye Rutherford Appleton Laboratory BaBar Collaboration Meeting 26 th September 2000.
PROOF and ALICE Analysis Facilities Arsen Hayrapetyan Yerevan Physics Institute, CERN.
11th November 2002Tim Adye1 Distributed Analysis in the BaBar Experiment Tim Adye Particle Physics Department Rutherford Appleton Laboratory University.
PERFORMANCE AND ANALYSIS WORKFLOW ISSUES US ATLAS Distributed Facility Workshop November 2012, Santa Cruz.
11th April 2003Tim Adye1 RAL Tier A Status Tim Adye Rutherford Appleton Laboratory BaBar UK Collaboration Meeting Liverpool 11 th April 2003.
(1) Introduction to Continuous Integration Philip Johnson Collaborative Software Development Laboratory Information and Computer Sciences University of.
1 GLOBAL BIOMETRICS Biostatistics Clinical Data Management Epidemiology & Patient Reported Outcomes Statistical Programming and Analysis Strategic Planning,
BaBar and the GRID Tim Adye CLRC PP GRID Team Meeting 3rd May 2000.
ATLAS-specific functionality in Ganga - Requirements for distributed analysis - ATLAS considerations - DIAL submission from Ganga - Graphical interfaces.
Data processing Offline review Feb 2, Productions, tools and results Three basic types of processing RAW MC Trains/AODs I will go through these.
M. Gheata ALICE offline week, October Current train wagons GroupAOD producersWork on ESD input Work on AOD input PWG PWG31 (vertexing)2 (+
15 December 2000Tim Adye1 Data Distribution Tim Adye Rutherford Appleton Laboratory BaBar Collaboration Meeting 15 th December 2000.
D0 File Replication PPDG SLAC File replication workshop 9/20/00 Vicky White.
D.Spiga, L.Servoli, L.Faina INFN & University of Perugia CRAB WorkFlow : CRAB: CMS Remote Analysis Builder A CMS specific tool written in python and developed.
Software. Introduction n A computer can’t do anything without a program of instructions. n A program is a set of instructions a computer carries out.
Joe Foster 1 Two questions about datasets: –How do you find datasets with the processes, cuts, conditions you need for your analysis? –How do.
24/06/20161 Hardware Processor components & ROM. 224/06/2016 Learning Objectives Describe the function and purpose of the control unit, memory unit and.
GDB Meeting CERN 09/11/05 EGEE is a project funded by the European Union under contract IST A new LCG VO for GEANT4 Patricia Méndez Lorenzo.
11th September 2002Tim Adye1 BaBar Experience Tim Adye Rutherford Appleton Laboratory PPNCG Meeting Brighton 11 th September 2002.
Multi-threading and other parallelism options J. Apostolakis Summary of parallel session. Original title was “Technical aspects of proposed multi-threading.
Afternoon session: The archival problem and infrastructure for solutions Prof John R Helliwell Interactive Publications.
29/04/2008ALICE-FAIR Computing Meeting1 Resulting Figures of Performance Tests on I/O Intensive ALICE Analysis Jobs.
ANALYSIS TRAIN ON THE GRID Mihaela Gheata. AOD production train ◦ AOD production will be organized in a ‘train’ of tasks ◦ To maximize efficiency of full.
Evolution of storage and data management
Big Data is a Big Deal!.
BaBar-Grid Status and Prospects
MICE Computing and Software
Managing Diffraction Beam
Spark Presentation.
Informatica PowerCenter Performance Tuning Tips
External Sorting The slides for this text are organized into chapters. This lecture covers Chapter 11. Chapter 1: Introduction to Database Systems Chapter.
Chapter 2: Operating-System Structures
Using an Object Oriented Database to Store BaBar's Terabytes
Kanga Tim Adye Rutherford Appleton Laboratory Computing Plenary
HEC Beam Test Software schematic view T D S MC events ASCII-TDS
Leadership Group 9th November 2013 Feedback Summary.
Chapter 2: Operating-System Structures
Presentation transcript:

22nd January 2003Tim Adye1 Summary of Bookkeeping discussions at RAL Workshop Tim Adye Rutherford Appleton Laboratory Kanga Phone Meeting 22 nd January 2003

Tim Adye2 RAL Workshop Two half-day parallel sessions Monday afternoon: presentations from Adil, Jean-Yves, Andy, Alessandra, Gregory, Alessio, and myself Tuesday afternoon: discussion Joined by the other parallel (event store) at the end See presentations here Computing/Distributed/workshops/Jan2003/ I summarise the Tuesday discussion session Andy took the minutes, so these notes are just my own memory/interpretation Andy should send out notes tomorrow

22nd January 2003Tim Adye3 CMWG2 recommendations Many CMWG2 recommendations. One was that we develop a general framework for dataset management Persuasively presented by Gregory Generic enough to be of interest to other experiments? We should try to work with others (and recruit effort!), but BaBar should lead (due to our shorter timescales) Hopefully this can be built “on top of” SkimTools.

22nd January 2003Tim Adye4 Technical decisions Will start new SkimTools package, borrowing code from the old. Decided to support only Oracle and MySql, but encourage people to maintain ODBC compliance wherever possible. Stick to Perl wherever possible.

22nd January 2003Tim Adye5 Planning Decisions Identified 3.5 FTE ~0.5x7 FTE: Alessandra, Douglas, Jacek, Antonio, Martino, Paul Jackson, Tim Two stage plan (can go in parallel): (Stage 0: immediately-required changes  existing SkimTools) Stage 1: new SkimTools to handle immediate requirements of new model and user requests Come up with use-cases in each area: Alessandra: skimData Tim: Data distribution … Stage 2: CMWG2’s dataset management framework

22nd January 2003Tim Adye6 File size considerations (1) It would be very useful to try to maintain reasonably large file sizes More efficient for analysis job access Simpler for archiving Archiving: mass-store systems (HPSS etc) have problems with too many files: catalogue problems too small files: overhead per GB is larger

22nd January 2003Tim Adye7 File size considerations (2) Figure of merit ~200 MB If many files smaller than this, then we would need to start blobbing files together (eg. with tar) for HPSS This is not trivial to manage Should be able to merge runs for SP and skims Most OPR output files should be > 200MB Teela agreed to make a ballpark estimate to check this Hope to hold off implementing mass-store blobbing until needed System must allow for the possibility of introducing it later