Metadata Working Group Report

Presentation transcript:

1 Metadata Working Group Report
QCDML (QCD Markup Language) for ILDG
Members (fixed in mid-January)
  G. Andronico, INFN, Italy
  P. Coddington, Adelaide, Australia
  R. Edwards, JLab, USA
  C. Maynard, Edinburgh, UK
  D. Pleiter, DESY, Germany
  J. Simone, FNAL, USA
  T. Yoshie, Tsukuba, Japan
  B. Joo (observer), Edinburgh, UK
Mailing list
  – About 80 mails circulated

2 0. Introduction
1. QCDML: Strategy and Standard Configuration Format (T. Yoshie)
2. QCDML: Physics (C. Maynard)
3. QCDML: Machine and Management (D. Pleiter)
My proposal for QCDML: not used in my talk, but may be useful for discussions

3 Strategy
QCDML: XML schema for ILDG
  – write a QCDML document for each configuration
  – store the QCDML documents in one or more databases
  – search for and retrieve configurations
Design QCDML so that developing applications is easy
QCDML defines a minimal set of XML tags
  – tags necessary for exchanging configurations
  – tags that will be searched, i.e. those researchers are usually interested in
      required: physics parameters (beta, mq)
      not included: random number seed

4 Strategy (cont.)
Each collaboration can extend QCDML and use it for its own purposes
Every collaboration is asked to provide values for all relevant QCDML tags

5 Categories of QCDML
Standard configuration format (SCF)
1. Physics and parameters
2. Algorithm and status
3. Code
4. Machine
5. Management
6. Miscellaneous
finalized
4, 5: almost finalized
1: discussions on-going (different opinions)

6 SCF: Strategy
The standard format is an abstract (reference) format for exchanging configurations
  – collaborations submitting configurations to ILDG do not have to convert archived files
  – some groups have already archived many configurations in their own format
  – each format was chosen for convenience
Conversion is done on the user side
  – two methods to convert the format of configurations
      from a given format to the standard one via a C library
      from one format to another using BinX technology (without referring to the standard format)

7 SCF: Format
Definition of gauge configuration
  – i,j=1,2,3: color indices
    mu=1,2,3,4: directions (x,y,z,t)
Employ the NERSC (Gauge Connection) format
  – a sequence of 8-byte double precision real numbers
  – coded in 64-bit IEEE numerical format
  – endianness is not specified
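Because the format leaves the byte order open, a reader has to be told the file's byte order and swap when it differs from the host's. The following is a minimal sketch of that step, not part of any ILDG library: the function names are hypothetical, the flag convention (1 = big endian, 0 = little endian) is borrowed from the C-library example in this talk, and `double` is assumed to be 8 bytes.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Returns 1 if the host stores the least significant byte first. */
static int host_is_little_endian(void)
{
    const uint16_t probe = 1;
    unsigned char first;
    memcpy(&first, &probe, 1);
    return first == 1;
}

/* Reverse the byte order of each double in place. */
static void swap_doubles(double *data, size_t count)
{
    assert(data != NULL || count == 0);
    for (size_t i = 0; i < count; i++) {
        unsigned char b[sizeof(double)];
        memcpy(b, &data[i], sizeof b);
        for (size_t lo = 0, hi = sizeof b - 1; lo < hi; lo++, hi--) {
            unsigned char tmp = b[lo];
            b[lo] = b[hi];
            b[hi] = tmp;
        }
        memcpy(&data[i], b, sizeof b);
    }
}

/* Convert values read from a file with the given byte order
   (assumed flag: 1 = big endian, 0 = little endian) to host order. */
static void to_host_order(double *data, size_t count, int file_is_big_endian)
{
    int host_is_big = !host_is_little_endian();
    if ((file_is_big_endian != 0) != host_is_big)
        swap_doubles(data, count);
}
```

A reader would call `to_host_order(buffer, n, endian)` once after each `fread`; when file and host agree, the call does nothing.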

8 SCF: Format (cont.)
In a C program,
  – the last index runs fastest; indices run from 0
      re=0 (real part)
      re=1 (imaginary part)
Store the first two rows (2x3) of the 3x3 link matrix
  – U11, U12, U13, U21, U22, U23
mu=1,2,3,4
x=0,1,2,...,NX-1
y=0,1,2,...,NY-1
z, t
double  Complex*16
Row-Column  Column-Row
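The index rules on this slide can be turned into an offset calculation. The sketch below assumes the order, slowest to fastest, is t, z, y, x, mu, row, column, re (the usual NERSC convention); the slide itself only states that the last index runs fastest, so treat the site ordering as an assumption. The function name `scf_offset` is hypothetical.

```c
#include <assert.h>
#include <stddef.h>

/* Doubles stored per lattice site:
   4 directions x 2 rows x 3 columns x (re, im). */
enum { DOUBLES_PER_SITE = 4 * 2 * 3 * 2 };

/* Offset, counted in doubles, of one real number in the flat array.
   All indices start from 0; re is 0 for the real part and 1 for the
   imaginary part. Assumed order, slowest to fastest:
   t, z, y, x, mu, row, col, re. */
static size_t scf_offset(size_t nx, size_t ny, size_t nz,
                         size_t x, size_t y, size_t z, size_t t,
                         size_t mu, size_t row, size_t col, size_t re)
{
    assert(x < nx && y < ny && z < nz);
    assert(mu < 4 && row < 2 && col < 3 && re < 2);
    size_t site = ((t * nz + z) * ny + y) * nx + x;
    return (((site * 4 + mu) * 2 + row) * 3 + col) * 2 + re;
}
```

With this layout each site occupies 48 consecutive doubles, so moving one step in x advances the offset by 48.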

9 SCF: C-library
Each collaboration submitting configurations to ILDG prepares a C library that reads its configurations and returns them in the standard format
  – a pointer to the C library is stored in the QCDML document
Read a hyper-cubic region
  – (ix0:ix1)*(iy0:iy1)*(iz0:iz1)*(it0:it1) of the (0:NX-1)*(0:NY-1)*(0:NZ-1)*(0:NT-1) lattice
void ILDG_read_conf(file, NX, ix0, ix1, NY, iy0, iy1, NZ, iz0, iz1, NT, it0, it1, endian, config)

10 SCF: C-library (cont.)
The region (0-3)*(4-7)*(4-7)*(0-15) of the whole lattice (0-7)*(0-7)*(0-7)*(0-15) is read in big-endian format and stored in U[16][4][4][4][4][2][3][2].

    /* ILDG_read_conf is declared by the collaboration's C library. */
    int main(void)
    {
        int NX = 8, NY = 8, NZ = 8, NT = 16;
        int endian = 1;  /* 1 = big endian, 0 = little endian */
        /* assumed index order: [t][z][y][x][mu][row][col][re/im] */
        double U[16][4][4][4][4][2][3][2];

        ILDG_read_conf("test-file", NX, 0, 3, NY, 4, 7, NZ, 4, 7,
                       NT, 0, 15, endian, U);
        return 0;
    }
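To make the region read concrete, here is a hedged sketch of what the inside of such a reader could look like in the simplest case. It is not the ILDG interface: `sketch_read_region` is a hypothetical name, the file is assumed to be headerless and already in the standard layout (t, z, y, x from slowest to fastest, 48 doubles per site), and no byte swapping is done. A collaboration's real library would read its own archive format instead.

```c
#include <assert.h>
#include <stddef.h>
#include <stdio.h>

enum { SITE_DOUBLES = 4 * 2 * 3 * 2 };  /* mu x row x col x (re, im) */

/* Hypothetical reader: copies the region
   (ix0:ix1)*(iy0:iy1)*(iz0:iz1)*(it0:it1) into out, one contiguous
   run along x at a time. Returns 0 on success, -1 on error.
   Offsets are computed in long, so very large files would need
   64-bit file offsets in a real implementation. */
static int sketch_read_region(FILE *fp, long nx, long ny, long nz, long nt,
                              long ix0, long ix1, long iy0, long iy1,
                              long iz0, long iz1, long it0, long it1,
                              double *out)
{
    assert(fp != NULL && out != NULL);
    if (ix0 < 0 || ix0 > ix1 || ix1 >= nx ||
        iy0 < 0 || iy0 > iy1 || iy1 >= ny ||
        iz0 < 0 || iz0 > iz1 || iz1 >= nz ||
        it0 < 0 || it0 > it1 || it1 >= nt)
        return -1;

    size_t run = (size_t)(ix1 - ix0 + 1) * SITE_DOUBLES;
    for (long t = it0; t <= it1; t++)
        for (long z = iz0; z <= iz1; z++)
            for (long y = iy0; y <= iy1; y++) {
                long site = ((t * nz + z) * ny + y) * nx + ix0;
                long pos = site * SITE_DOUBLES * (long)sizeof(double);
                if (fseek(fp, pos, SEEK_SET) != 0)
                    return -1;
                if (fread(out, sizeof(double), run, fp) != run)
                    return -1;
                out += run;
            }
    return 0;
}
```

Reading run by run keeps the extra memory at the size of the requested region, at the cost of one seek per (y, z, t) line.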

11 SCF: C-library (cont.)
In general, the conversion program requires a large amount of memory, 1-2 times the configuration size
  – this memory bottleneck cannot be avoided
We propose the above interface:
  – simple
  – mainly for full QCD configurations
      a 32^3 x Nt lattice, sufficient for the next several years, can be handled by a high-end PC with 2 GB of memory
  – some extension might be necessary in the future
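The memory estimate can be checked with a short calculation. The sketch below assumes the standard format described earlier (4 directions, 2 rows of 3 complex numbers, 8-byte doubles) and takes Nt = 64 purely as an illustrative value, since the slide leaves Nt open; `scf_bytes` is a hypothetical helper name.

```c
#include <assert.h>
#include <stdint.h>

/* Bytes needed to hold one configuration in the standard format. */
static uint64_t scf_bytes(uint64_t nx, uint64_t ny, uint64_t nz, uint64_t nt)
{
    assert(nx > 0 && ny > 0 && nz > 0 && nt > 0);
    const uint64_t doubles_per_site = 4 * 2 * 3 * 2;  /* mu, row, col, re/im */
    return nx * ny * nz * nt * doubles_per_site * sizeof(double);
}
```

For a 32^3 x 64 lattice this gives 805,306,368 bytes (768 MiB), so even two copies, about 1.5 GiB, stay below 2 GB.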

12 SCF: BinX
BinX
  – an XML schema to describe the format of a binary file
      developed by the eDIKT project (a part of OGSA-DAI)
  – software to convert one binary format to another
      will be available in May 2003
  – enables us to convert configurations without referring to the standard format
Each collaboration submitting configurations to ILDG describes its own format in BinX
  – a user may write his/her favorite format in BinX

13 SCF: BinX (Cont.)

14 SCF: BinX (Cont.) Mechanism for describing an array split across several files

15 Distribution
SCF defines the format of the binary configuration only
  – no parameters (size, coupling, ...)
  – no management info (checksums, collaboration name, ...)
  – all of these are described in a QCDML document
Keeping the configuration identifiable
  – encapsulate the configuration and the QCDML document in one file
  – distribute it via ILDG
  – (need opinions and help from the middleware working group)

16 Distribution (cont.)
Candidate: DIME (Direct Internet Message Encapsulation)
  – format is fixed (different from MIME)
      header (fixed bytes)
      length (fixed bytes)
      body of data (QCDML document)
      length (fixed bytes)
      body of data (QCDML-BinX document)
      length (fixed bytes)
      body of data (configuration itself)
      footer (fixed bytes)

17 Distribution (cont.)
Merits
  – files do not have to be unpacked before reading
  – file size is not increased (cf. MIME: factor 3/2 incl.)
Discussions:
  – prepare a tool to extract the QCDML document
  – the C library has to seek within the file to the origin (the first byte) of the binary configuration
  – compatibility with BinX
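The seek mentioned in the discussion points is cheap when every body is preceded by a fixed-size length field. The sketch below illustrates that idea only; it is not the actual DIME record layout. The 8-byte big-endian length field, the omission of header and footer, and the names `pack_record` and `find_record` are all assumptions made for illustration.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

enum { LEN_BYTES = 8 };  /* illustrative fixed-size length field */

/* Append one record (big-endian length, then body) at offset used.
   Returns the new used size, or 0 if the buffer is too small. */
static size_t pack_record(unsigned char *buf, size_t cap, size_t used,
                          const void *body, uint64_t len)
{
    assert(buf != NULL && used <= cap);
    if (cap - used < LEN_BYTES || cap - used - LEN_BYTES < len)
        return 0;
    for (int i = 0; i < LEN_BYTES; i++)
        buf[used + i] = (unsigned char)(len >> (8 * (LEN_BYTES - 1 - i)));
    if (len > 0)
        memcpy(buf + used + LEN_BYTES, body, (size_t)len);
    return used + LEN_BYTES + (size_t)len;
}

/* Locate record number index (0-based) by skipping earlier bodies
   without reading them. Returns 0 and fills body_off and body_len on
   success, -1 if the record does not exist or the data is truncated. */
static int find_record(const unsigned char *buf, size_t used, int index,
                       size_t *body_off, uint64_t *body_len)
{
    size_t pos = 0;
    for (int r = 0; ; r++) {
        if (used - pos < LEN_BYTES)
            return -1;
        uint64_t len = 0;
        for (int i = 0; i < LEN_BYTES; i++)
            len = (len << 8) | buf[pos + i];
        pos += LEN_BYTES;
        if (used - pos < len)
            return -1;
        if (r == index) {
            *body_off = pos;
            *body_len = len;
            return 0;
        }
        pos += (size_t)len;
    }
}
```

With the QCDML document as record 0, the QCDML-BinX document as record 1, and the configuration as record 2, `find_record(buf, used, 2, &off, &len)` yields the first byte of the binary configuration after reading only two length fields.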

18 My opinion for QCDML
my opinion/proposal agreed by working group
Physics
  – actions, physics parameters, lattice size
Simulation
  – algorithm, machine, code, series, trajectory
Management
  – revision, crc, reference, collaboration, project, action
Pointers
  – site, file, C-library

19 Action
A human-readable document for each action
  – XML schema is powerful, but cannot describe the action completely
Three versions
  – UKQCD Schema v0.5
  – a compromise proposal
  – my very simple version
Problems in the UKQCD schema
  – too complicated
      an action consists of operators
      operators consist of couplings and fields
  – action and operator names are XML tags

20 Action (cont.)
My very simple version
  – just lists coupling names and values
A compromise version (sample2.xml)
  – fields for each operator are removed
  – names of actions and operators are given as values
  – the action is divided into gluon and quark sections
      this enables us to include boundary conditions

21 Simulation
Algorithm section:
  – we may have to prepare a human-readable document
  – a simple version is sufficient
Machine
Code
Series
  – several runs may share the same parameter set
  – this tag distinguishes them
Trajectory_or_Sweep

22 Management
Action
Checksums
  – CRC32 or MD5
  – for the binary configuration in its original format
Collaboration name and project name
  – useful tags for searching configurations
Reference
  – some information not suitable for inclusion in QCDML
      auto-correlation time
  – not all references have to be included
Revision
  – to check whether the QCDML document has been changed
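As an illustration of the checksum tag, the sketch below computes a CRC32 over a byte buffer. The slide does not say which CRC32 variant is meant; this assumes the common one used by zlib and Ethernet (reflected polynomial 0xEDB88320, initial value and final XOR of 0xFFFFFFFF). The function name `crc32_bytes` is hypothetical.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Bitwise CRC-32 (assumed variant: reflected polynomial 0xEDB88320,
   initial value 0xFFFFFFFF, final XOR 0xFFFFFFFF). */
static uint32_t crc32_bytes(const void *data, size_t len)
{
    assert(data != NULL || len == 0);
    const unsigned char *p = data;
    uint32_t crc = 0xFFFFFFFFu;
    for (size_t i = 0; i < len; i++) {
        crc ^= p[i];
        for (int bit = 0; bit < 8; bit++)
            crc = (crc >> 1) ^ (0xEDB88320u & (0u - (crc & 1u)));
    }
    return crc ^ 0xFFFFFFFFu;
}
```

Since the checksum is taken over the file in its original format, it depends on that file's byte order and layout; a copy converted to another format will have a different value.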