Database structure for the European Integrated Tokamak Modelling Task Force
F. Imbeaux, on behalf of the Data Coordination Project
www.efda-taskforce-itm.org

Database structure for the European Integrated Tokamak Modelling Task Force F. Imbeaux On behalf of the Data Coordination Project of the Task Force

Goals for the ITM-TF Database
Gather multi-machine experimental data for code benchmarking
  – With detailed information about machine characteristics in a standard format
Storage of simulation results: reproducibility and flexibility
  – Including detailed information about code parameters (reproducible simulation)
  – Suited to all possible kinds of stand-alone or integrated simulation requests
Accessible from several programming languages
  – Access layer available in several languages with the same calls
Can ultimately become a database for predictive ITER simulations

Comparison to the ITPA Profile Database
Multi-machine experimental data: not only profiles of physical quantities, but also characteristics of the various tokamak subsystems (magnetic coils, heating systems, diagnostics).
Storage of simulation results: not only general transport simulations, but suited to all possible kinds of simulations (turbulence, MHD, equilibria, heating & current drive, …). Includes detailed information about code parameters.
Aiming at better consistency of simulations with the original experimental data (less preliminary processing by black-box type codes) and detailed bookkeeping of simulations → higher quality of the benchmarking exercise.

How to do it?
Full description of a tokamak: physics quantities + subsystem characteristics + diagnostic measurements → object-oriented data structure
High degree of organisation: several subtrees corresponding to « Consistent Physical Objects » (avoid flat structures with long lists of parameter names)
Substructures correspond to Consistent Physical Objects:
  – Subsystem (e.g. a heating system or a diagnostic): will contain structured information on the hardware setup and the data measured by / related to this object.
  – Code results (e.g. a given plasma equilibrium, or the various source terms and fast particle distribution function from an RF code): will contain structured information on the code parameters and the physics results.
Programming language flexibility: use of recent software technologies; the database structure is defined using XML schemas
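As an illustration of the « Consistent Physical Object » idea, the sketch below groups hardware setup with measured data for a subsystem, and code parameters with physics results for a code. It is only a minimal sketch: the class names, field names and example values (Subsystem, CodeResult, TokamakEntry, "nbi", "hypothetical_eq_code") are assumptions made for illustration and are not taken from the ITM-TF schemas.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Subsystem:
    """A heating system or diagnostic: hardware setup plus related measured data."""
    name: str
    hardware_setup: Dict[str, float] = field(default_factory=dict)
    measured_data: Dict[str, List[float]] = field(default_factory=dict)


@dataclass
class CodeResult:
    """Output of one code: its input parameters plus the physics results."""
    code_name: str
    code_parameters: Dict[str, float] = field(default_factory=dict)
    physics_results: Dict[str, List[float]] = field(default_factory=dict)


@dataclass
class TokamakEntry:
    """One database entry, organised in subtrees instead of a flat parameter list."""
    subsystems: Dict[str, Subsystem] = field(default_factory=dict)
    code_results: Dict[str, CodeResult] = field(default_factory=dict)


# Hypothetical content, for illustration only.
entry = TokamakEntry()
entry.subsystems["nbi"] = Subsystem("nbi", hardware_setup={"n_injectors": 2.0})
entry.code_results["equilibrium"] = CodeResult(
    "hypothetical_eq_code",
    code_parameters={"n_grid": 65.0},
    physics_results={"psi": [0.0, 0.5, 1.0]},
)
```

Each quantity is then reached through the object it belongs to (for example entry.code_results["equilibrium"].physics_results["psi"]) rather than through a long flat parameter name.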

Use of XML schemas
XML is a generic and standardised language, quite convenient for describing object-oriented structures.
XML files can also contain the actual data, but we do not use this possibility (ASCII format is not convenient for large numerical data).
XML schemas are used to define the data structure (tree hierarchy, type of the objects, …).
User-friendly tools (XML editors) allow fast and easy design of the data structure.
Small translation scripts (“parsers”) translate the schema into other languages (HTML, Fortran, C, …) → automated translation of the structure into any programming language.
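A translation script of this kind could look roughly like the following sketch, which reads a tiny XML schema and emits a Fortran derived-type declaration. The schema content, the type mapping and the function name schema_to_fortran are assumptions for illustration; the actual ITM-TF parsers are not described in the slides, and this sketch only handles one flat level of elements.

```python
import xml.etree.ElementTree as ET

XS = "{http://www.w3.org/2001/XMLSchema}"

# Hypothetical schema fragment, not taken from the ITM-TF data structure.
SCHEMA = """<xs:schema xmlns:xs="http://www.w3.org/2001/XMLSchema">
  <xs:element name="equilibrium">
    <xs:complexType>
      <xs:sequence>
        <xs:element name="ip" type="xs:double"/>
        <xs:element name="npsi" type="xs:integer"/>
        <xs:element name="code_name" type="xs:string"/>
      </xs:sequence>
    </xs:complexType>
  </xs:element>
</xs:schema>"""

# Assumed mapping from schema types to Fortran declarations.
FORTRAN_TYPES = {
    "xs:double": "real(kind=8)",
    "xs:integer": "integer",
    "xs:string": "character(len=132)",
}


def schema_to_fortran(xsd_text):
    """Translate the first top-level element of a flat schema into a Fortran type."""
    root = ET.fromstring(xsd_text)
    top = root.find(XS + "element")
    lines = ["type " + top.get("name")]
    for child in top.iter(XS + "element"):
        if child is top:  # iter() also yields the starting element itself
            continue
        lines.append("  %s :: %s" % (FORTRAN_TYPES[child.get("type")], child.get("name")))
    lines.append("end type " + top.get("name"))
    return "\n".join(lines)


fortran_source = schema_to_fortran(SCHEMA)
```

Because the schema is the single source of the structure, the same walk over its elements could emit C structs or HTML documentation by swapping the type mapping and the output template.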

Data storage
The data is presently stored on an MDS+ server
  – Widely used data access system in the fusion community
  – Interfaces already exist for many languages
  – Convenient for storing multi-dimensional arrays, no problem with large data sizes
  – Not really object oriented (arrays of objects not possible), slow for a large number of data calls
The data storage system may evolve in the future
The XML schemas defining the data structure are used to build the MDS+ model tree (automated script)
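The step from schema to model tree can be sketched as a recursive walk that turns nested schema elements into a list of node paths. This is a sketch under assumptions: the schema fragment is invented, the paths are only written in the style of MDS+ tree paths ("." before structure nodes, ":" before data nodes), and no MDS+ library call is made, since the real script and its naming rules are not given in the slides.

```python
import xml.etree.ElementTree as ET

XS = "{http://www.w3.org/2001/XMLSchema}"

# Hypothetical nested schema fragment, for illustration only.
SCHEMA = """<xs:schema xmlns:xs="http://www.w3.org/2001/XMLSchema">
  <xs:element name="equilibrium">
    <xs:complexType>
      <xs:sequence>
        <xs:element name="ip" type="xs:double"/>
        <xs:element name="profiles">
          <xs:complexType>
            <xs:sequence>
              <xs:element name="psi" type="xs:double"/>
              <xs:element name="q" type="xs:double"/>
            </xs:sequence>
          </xs:complexType>
        </xs:element>
      </xs:sequence>
    </xs:complexType>
  </xs:element>
</xs:schema>"""


def child_elements(element):
    """Return the elements declared directly inside this element's complexType/sequence."""
    return element.findall(XS + "complexType/" + XS + "sequence/" + XS + "element")


def node_paths(element, parent=""):
    """List tree paths for an element and, recursively, for all of its children."""
    name = element.get("name").upper()
    children = child_elements(element)
    if not children:
        return [parent + ":" + name]  # leaf: a node that holds data
    path = parent + "." + name        # branch: a structure node
    paths = [path]
    for child in children:
        paths.extend(node_paths(child, path))
    return paths


root = ET.fromstring(SCHEMA)
paths = node_paths(root.find(XS + "element"))
```

A real script would then create one tree node per path with the storage system's own tools; only the path generation is shown here.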

The Data Structure (a part of it)
[Figure: XML schema as displayed by XMLSpy®]

Referencing system (draft proposal)
Unique data structure for experimental data and all kinds of simulations
Each entry of the database corresponds to a unique, consistent physics dataset
  – Each new simulation or version of the experimental data creates a new entry
  – Do not allow competing codes to write their results in the same entry
  – Do not allow coexistence of different versions of experimental data in the same entry
Use of the MDS+ shot number as a Generalised Pulse Number (GPN) containing information on:
  – The shot number
  – Whether the data is experimental or simulation results
  – The version of the data / reference number of the simulation
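One way to pack these three pieces of information into a single integer is sketched below. The digit layout (last three digits for the version or simulation reference, one digit for the experimental/simulation flag, the remaining digits for the shot number) is purely an assumption for illustration; the slides do not specify the actual GPN encoding.

```python
from typing import NamedTuple


class GPN(NamedTuple):
    shot: int
    is_simulation: bool
    version: int  # data version, or reference number of the simulation


VERSION_SLOTS = 1000  # assumed: versions 0..999 per shot and data kind


def encode_gpn(shot, is_simulation, version):
    """Pack shot, kind flag and version into one integer (hypothetical layout)."""
    if shot < 0 or not 0 <= version < VERSION_SLOTS:
        raise ValueError("shot must be >= 0 and version in 0..%d" % (VERSION_SLOTS - 1))
    return (shot * 10 + int(is_simulation)) * VERSION_SLOTS + version


def decode_gpn(number):
    """Recover shot, kind flag and version from a packed number."""
    head, version = divmod(number, VERSION_SLOTS)
    shot, flag = divmod(head, 10)
    return GPN(shot, bool(flag), version)


gpn = encode_gpn(12345, True, 7)  # simulation number 7 for shot 12345
```

With this layout every new simulation or data version gets its own number, so two codes never write into the same entry.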

Referencing system (draft proposal)
Guarantee data consistency within one entry → each new simulation or version of the experimental data creates a new entry.
Copying all data present in the structure would cost a lot of storage space.
Only data that are modified are explicitly written in the « output » GPN.
The unmodified data can be tracked down using a signal referencing the « input » GPN.
  – This signal would be located at the top of the tree
  – Valid for all subtrees (subtrees of different origin not allowed, since it may violate data consistency) → simple and efficient bookkeeping

Referencing system (draft proposal)
[Diagram: database entries labelled "Exp. Data, Ref : none" and entries labelled "Ref : Simulation #"]
Guarantees data consistency
Referencing system → recursive search, hidden from the user if they do not want to know about it
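The recursive search can be sketched as follows: each entry stores only the data it modified plus a reference to its input entry, and a read falls back along the chain of references until the signal is found. The Entry class, the read_signal function, the entry numbers and the signal names are hypothetical; the slides describe the principle only.

```python
class Entry:
    """One database entry: explicitly written data plus a reference to its input entry."""

    def __init__(self, data, input_gpn=None):
        self.data = data            # only the signals written in this entry
        self.input_gpn = input_gpn  # None corresponds to "Ref : none"


def read_signal(database, gpn, signal):
    """Return a signal from the entry, searching its input entries recursively."""
    entry = database[gpn]
    if signal in entry.data:
        return entry.data[signal]
    if entry.input_gpn is None:
        raise KeyError(signal)
    return read_signal(database, entry.input_gpn, signal)


# Hypothetical chain: experimental data -> simulation 101 -> simulation 102.
database = {
    100: Entry({"ip": 2.0e6, "te": [1.0, 0.5]}),    # experimental data, no reference
    101: Entry({"q": [1.0, 3.0]}, input_gpn=100),   # adds q, reuses ip and te
    102: Entry({"te": [1.2, 0.6]}, input_gpn=101),  # overrides te, reuses ip and q
}

ip_seen_by_102 = read_signal(database, 102, "ip")
te_seen_by_102 = read_signal(database, 102, "te")
```

The caller only names the entry and the signal; whether the value was written in that entry or inherited from an earlier one stays hidden unless the user inspects input_gpn.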

Status of the ITM Database
MDS+ server operational (Frascati, ENEA)
XML philosophy and tools operational
Work has started with IMP1 (equilibrium and linear MHD stability):
  – Some experimental data has been put on the server using a temporary data structure (one ITER equilibrium done, JET and MAST data ongoing)
  – Equilibrium codes coupled to the database structure; first simulations reading the data from the database have been produced
  – First benchmarking exercise to be carried out for the EPS (equilibrium codes benchmarking)

Ongoing work and perspectives
Update the existing database structure in line with the data referencing system
Set up the referencing system tools and provide generic access tools to the users
Gather experimental data from the various machines
Extend the database structure to the other « Integrated Modelling Projects » of the Task Force
Evaluate possible alternatives to MDS+ for data access / storage