Dr Chris Maynard Application Consultant, EPCC +44 131 650 5077 Tools for ILDG.


Dr Chris Maynard Application Consultant, EPCC Tools for ILDG

The need for tools
A tool is a device that can be used to produce an item or achieve a task, but that is not consumed in the process.
The wrong sort of tool can produce poor results, or fail to scale to larger problems.
Tools for ILDG: Lattice /07 Squaw Valley, CA

Lattice 2009 Beijing, I said …
How do we access our data?
  – In the same way we did a decade ago
  – Secure shell terminal client (ssh) and copy protocol (scp)
Data explosion
  – Data volumes: Tbytes, Pbytes soon
  – Data complexity: many ensembles, many measurements
  – Rise of the mega collaboration
  – Globally distributed {machines, data, people}
We really need some tools!

Tools
Globus Online (Monday)
  – Reliable Data Movement via SaaS, Raj Kettimuthu
Web2py (poster)
  – A new user interface for the Gauge Connection lattice data archive, M. Di Pierro, J. Hetrick, D. Skinner, and S. Cholia
  – plus demo after this talk
LATFOR grid tools, Dirk Pleiter et al.: ildg-get, web client
UKQCD ILDG-browser
JLQCD faceted web client
Metadata capture project
  – EPCC and Tsukuba University
  – T. Amagasa, M.G. Beckett, C.M. Maynard, J. Perry, T. Yoshie

LATFOR tools
ildg-get can access data, metadata, and ILDG services
  – need to know the LFN, or the markovChainURI of the metadata
Metadata web client

JLDG Faceted browsing

UKQCD ILDG-browser
GUI client for the metadata catalogue (MDC)
  – Self-contained Java application, runs on Windows/Mac/Linux
Allows users to:
  – Construct queries to the MDC through a GUI
  – Search metadata
  – Store queries
  – Retrieve metadata
Does not have data access
  – Use the browser to find the Logical File Name (LFN)
  – Get the data with ildg-get

UKQCD ILDG-browser demo

Metadata capture
The tools described so far are for accessing ILDG services
  – they exist and are useful
No tools for metadata capture
  – Ensuring data provenance is difficult
  – Are there degrees of provenance?
QCD production codes are highly optimised
  – run on highly diverse (and bespoke) architectures
Require a lightweight process to ease the pain of post-processing data
Hard work

ETMDC
Edinburgh-Tsukuba Metadata capture project
  – T. Amagasa, M.G. Beckett, C.M. Maynard, J. Perry, T. Yoshie
Explore workflow as a mechanism for metadata capture (MDC)
Edinburgh funded by
  – OMII-UK
  – Software Sustainability Institute
  – Edinburgh Global (UoE)
End product
  – Demonstrator: universal metadata capture tool for ILDG
  – Linux/Unix environment
  – Python, XSLT, make
  – QCD utils
  – some hints from QCD code gen

MDC design criteria
Considered workflow tools
  – Metadata generated and manipulated as part of the data generation process
  – Examples: Kepler, Taverna, Ruby
  – QCD ConfGen, Jim Simone's FNAL group
Complex tools with rich functionality
  – Will they run in a bespoke QCD environment?
Lightweight is the key criterion
  – opted for the simplest solution
  – build the demonstrator out of the most commonly available components
  – used make to manage dependencies, but could upgrade to Kepler
Used two example codes
  – JLQCD, CPS
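To illustrate what "make to manage dependencies" amounts to, here is a minimal Python sketch of the timestamp rule make applies: a target is rebuilt when it is missing or older than any of its sources. The function name and the file roles are hypothetical and are not taken from the demonstrator.

```python
import os

def is_stale(target, sources):
    """Return True if target is missing or older than any source.

    This mirrors make's basic rule; all names here are illustrative only.
    """
    if not os.path.exists(target):
        return True
    target_mtime = os.path.getmtime(target)
    return any(os.path.getmtime(src) > target_mtime for src in sources)
```

A caller would regenerate, say, a configuration XML file only when `is_stale` reports that its log file or CMF has changed since the last run.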

Metadata
ALL QCD codes output meaningful metadata
  – plus input parameter files
  – system size, physical parameters, quark, gluon couplings
  – algorithmic parameters, step size
  – measured quantities, plaquette, checksums etc.
  – state information, user, code version, machine information
  – Gauge configuration file
No scheme for organising this information
  – parse and process this information
Add some minimal mark-up to information already produced
  – some hints for the tool

Hints
Add simple markup to output
  – easy for the user to implement: it's just plain text
  – gives the tool something to work with
Tags for interesting information in plain text files
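The slide's own example of a hint did not survive in this transcript, so the following is only a sketch of the idea. It assumes a made-up tag syntax, `<hint key="NAME">VALUE</hint>`, embedded in otherwise free-form log lines; the real tool's syntax may well differ.

```python
import re

# Hypothetical hint syntax (an assumption, not the demonstrator's format):
#   <hint key="NAME">VALUE</hint>
HINT_RE = re.compile(r'<hint key="([^"]+)">(.*?)</hint>')

def extract_hints(log_text):
    """Return a dict mapping hint keys to their string values."""
    hints = {}
    for line in log_text.splitlines():
        for key, value in HINT_RE.findall(line):
            hints[key] = value.strip()
    return hints

SAMPLE_LOG = """\
starting trajectory 120
<hint key="plaquette">0.587123</hint>
step size set to <hint key="step_size"> 0.01 </hint>
trajectory 120 done
"""

hints = extract_hints(SAMPLE_LOG)
```

Lines without a tag are ignored, so the QCD code's existing output can stay as it is and only the interesting values need wrapping.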

User input
QCDml Ensemble ID [XML]
  – written by a human once per ensemble
Gauge configuration files
Log files with hints
Curator metadata file (CMF)
  – where are the data, log files etc.
MDC demonstrator will do the rest!
  – Two main components
  – Configuration File generator
  – Configuration XML generator

MDC architecture

Example CMF
[Example CMF listing; element names not preserved. Surviving values: ensemble1.xml, config.%04, configILDG.%04, 64]
Annotations on the listing:
  – specify batch processing of to delimit information in log file
  – format string-style pattern to specify file name
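The "format string-style pattern to specify file name" annotation can be illustrated with a short Python sketch. The patterns in the transcript are truncated, so the `%04d` conversion, the configuration range and the function name below are all assumptions made for illustration.

```python
def expand_pattern(pattern, first, last, step=1):
    """Expand a printf-style pattern over configuration numbers first..last."""
    return [pattern % number for number in range(first, last + 1, step)]

# Hypothetical patterns and range; "%04d" zero-pads the number to four digits.
inputs = expand_pattern("config.%04d", 100, 120, 10)
outputs = expand_pattern("configILDG.%04d", 100, 120, 10)

# Pair each input configuration with the name of its ILDG-format output.
pairs = list(zip(inputs, outputs))
```

One pattern plus a range is enough to describe a whole batch of files, which is presumably why the CMF uses patterns rather than explicit file lists.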

Configuration File Generator
Two components:
XSLT transform creates CaPU XML from
  – Ensemble XML ID
  – CMF
Conversion and Packing Utility (CaPU)
  – specific to collaboration, but has a common interface
  – converts data to ILDG format
  – measures plaquette, CRC checksum etc.
  – writes Configuration Information File (CIF) (above + LFN)
UKQCD based on qdp++ utility
  – if qdp++ can read your data, it is easy to modify the CaPU
JLQCD is shell script + data conversion

Configuration XML Generator
Creates the QCDml config ID
Several components (Python)
Extract configuration-specific information
  – from CMF, CIF and log files
Consistency and completeness checker
  – Do I have all the information I need?
  – Do the sources of metadata agree? (calculated plaquette = logfile plaquette)
  – Am I processing the data I think I am? (provenance)
Include collaboration-specific information
  – e.g. VML from CPS
Write the XML
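The consistency and completeness check can be sketched as a comparison between two sources of the same metadata, for example the plaquette measured by the CaPU and the one recorded in the log file. The function name, the dictionary layout and the tolerance are hypothetical; this is not the demonstrator's own checker.

```python
import math

def check_consistency(calculated, logged, rel_tol=1e-6):
    """Compare two metadata sources; return a list of problems found.

    Both arguments map quantity names to floats. The layout and the
    tolerance are assumptions made for this sketch.
    """
    problems = []
    for key in sorted(set(calculated) | set(logged)):
        if key not in calculated:
            problems.append(f"{key}: missing from calculated values")
        elif key not in logged:
            problems.append(f"{key}: missing from log file")
        elif not math.isclose(calculated[key], logged[key], rel_tol=rel_tol):
            problems.append(
                f"{key}: calculated {calculated[key]} != logged {logged[key]}"
            )
    return problems
```

An empty list means the sources agree and nothing is missing. A floating-point comparison needs a tolerance because the two values usually come from different codes and output formats.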

Summary
MDC Demonstrator
  – Uses common Linux/Unix tools and software to build components
  – Can automatically post-process data into QCDml
Others can use or adapt the demonstrator
  – simple modifications to the output of the QCD code
  – simple modifications to the CaPU
Can be downloaded from the ILDG web site

Conclusions
ILDG: we need tools
There are tools out there
  – useful!
More groups are developing tools
If you need help, get in touch
Share experiences
Neolithic → Bronze Age
  – crossover or 1st-order transition?

NERSC gauge connection
