Data Format Description Language (DFDL) WG Martin Westhead EPCC, University of Edinburgh

Slides:



Advertisements
Similar presentations
Can I Use It, and If so, How? Christian Lieske SAP AG – MultiLingual Technology Discussion of Consortium Proposal for OLIF2 File Header.
Advertisements

Introduction to the BinX Library eDIKT project team Ted Wen Robert Carroll
Data Format Description Language (DFDL) WG Martin Westhead EPCC, University of Edinburgh Alan Chappell PNNL
Datatypes for OGSA Dr Martin Westhead Principal Consultant, EPCC Telephone: Fax:
E-Science Data Information and Knowledge Transformation BinX – A Tool for Binary File Access eDIKT project team Ted Wen
Enterprise Java and Data Services Designing for Broadly Available Grid Data Access Services.
1 University of Namur, Belgium PReCISE Research Center Using context to improve data semantic mediation in web services composition Michaël Mrissa (spokesman)
A centre of expertise in digital information management UKOLN is supported by: XML and the DCMI Abstract Model DC Architecture WG Meeting,
XML: Extensible Markup Language
Programming Languages and Paradigms
Web Services Seminar: Service Description Languages
E-Science Data Information and Knowledge Transformation The BinX Language.
Mark Evans, Tessella Digital Preservation Boot Camp – PASIG meeting, Washington DC, 22 nd May 2013 PREMIS Practical Strategies For Preservation Metadata.
Mapping Physical Formats to Logical Models to Extract Data and Metadata Tara Talbott IPAW ‘06.
Framework for Model Creation and Generation of Representations DDI Lifecycle Moving Forward.
HDF 1 NCSA HDF XML Activities Robert E. McGrath Mike Folk National Center for Supercomputing Applications.
JXON An Architecture for Schema and Annotation Driven JSON/XML Bidirectional Transformations David A. Lee Senior Principal Software Engineer Slide 1.
Scientific Workflows Scientific workflows describe structured activities arising in scientific problem-solving. Conducting experiments involve complex.
Data Formats: Using Self-describing Data Formats Curt Tilmes NASA Version 1.0 February 2013 Section: Local Data Management Copyright 2013 Curt Tilmes.
EARTH SCIENCE MARKUP LANGUAGE “Define Once Use Anywhere” INFORMATION TECHNOLOGY AND SYSTEMS CENTER UNIVERSITY OF ALABAMA IN HUNTSVILLE.
Why XML ? Problems with HTML HTML design - HTML is intended for presentation of information as Web pages. - HTML contains a fixed set of markup tags. This.
CIS Computer Programming Logic
Database System Concepts and Architecture Lecture # 2 21 June 2012 National University of Computer and Emerging Sciences.
The Semantic Web Service Shuying Wang Outline Semantic Web vision Core technologies XML, RDF, Ontology, Agent… Web services DAML-S.
Software School of Hunan University Database Systems Design Part III Section 5 Design Methodology.
ATLAS Demystified: A Practical Introduction Christophe Laprun, Jonathan Fiscus, John Garofolo, Sylvain Pajot National Institute of Standards and Technology.
1 Introduction Modules  Most computer programs solve much larger problem than the examples in last sessions.  The problem is more manageable and easy.
The netCDF-4 data model and format Russ Rew, UCAR Unidata NetCDF Workshop 25 October 2012.
Serialization. Serialization is the process of converting an object into an intermediate format that can be stored (e.g. in a file or transmitted across.
1 Cisco Unified Application Environment Developers Conference 2008© 2008 Cisco Systems, Inc. All rights reserved.Cisco Public Introduction to Etch Scott.
COP4020 Programming Languages Names, Scopes, and Bindings Prof. Xin Yuan.
Ch. 5 Ch. 51 jcmt CSE 3302 Programming Languages CSE3302 Programming Languages (more notes) Dr. Carter Tiernan.
Using Several Ontologies for Describing Audio-Visual Documents: A Case Study in the Medical Domain Sunday 29 th of May, 2005 Antoine Isaac 1 & Raphaël.
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
CS551 - Lecture 11 1 CS551 Object Oriented Middleware (III) (Chap. 5 of EDO) Yugi Lee STB #555 (816)
PHS / Department of General Practice Royal College of Surgeons in Ireland Coláiste Ríoga na Máinleá in Éirinn Knowledge representation in TRANSFoRm AMIA.
STEP Tutorial: “ Fundamentals of STEP” David Briggs, Boeing January 16, 2001 ® PDES, Inc NASA STEP Workshop step.nasa.gov.
Ch. 5 Ch. 51 jcmt Summer 2003Programming Languages CSE3302 Programming Languages (more notes) Summer 2003 Dr. Carter Tiernan.
PRACTICAL KNOWLEDGE REPRESENTATION FOR THE WEB Frank van Harmelen Dieter Fensel AIFB Kim Kangil Structural Complexity Laboratory.
How to execute Program structure Variables name, keywords, binding, scope, lifetime Data types – type system – primitives, strings, arrays, hashes – pointers/references.
E-Science Data Information and Knowledge Transformation BinX – A Tool for Binary File Access eDIKT project team Ted Wen
1 5/18/2007ã 2007, Spencer Rugaber Acme Architectural interchange language – CMU and ISI Extensible Tool support –AcmeStudio.
Semantic Analysis II Type Checking EECS 483 – Lecture 12 University of Michigan Wednesday, October 18, 2006.
® Using (testing?) the HY_Features model, 95th OGC Technical Committee Boulder, Colorado USA Rob Atkinson 3 June 2015 Copyright © 2015 Open Geospatial.
Post-NASA Review Schema Harmonisation CCSDS Spring Meeting 2014 Peter Mendham, Richard Melvin, Stuart Fowell.
Web services. The SOAP Data Model, Schema Validation, and Introduction to WSDL. February 16, 2006.
2) Database System Concepts and Architecture. Slide 2- 2 Outline Data Models and Their Categories Schemas, Instances, and States Three-Schema Architecture.
 Data Type is a basic classification which identifies different types of data.  Data Types helps in: › Determining the possible values of a variable.
Data Format Description Language (DFDL) WG Martin Westhead EPCC, University of Edinburgh
Apache Avro CMSC 491 Hadoop-Based Distributed Computing Spring 2016 Adam Shook.
Scalar and composite data Programming Language Design and Implementation (4th Edition) by T. Pratt and M. Zelkowitz Prentice Hall, 2001 Section
DFDL WG Session 1 Summary of Status Mike Beckerle Ascential Software.
Jackson, Web Technologies: A Computer Science Perspective, © 2007 Prentice-Hall, Inc. All rights reserved Chapter 9 Web Services: JAX-RPC,
Binary Data Format (BDF) ARCH/VCDE Small Working Group 03/28/2008
DFDL WG Session 3 Mike Beckerle Ascential Software Two note-takers please?
Databases (CS507) CHAPTER 2.
CS 326 Programming Languages, Concepts and Implementation
Web Service Modeling Ontology (WSMO)
A DFDL Proposal based on Commercial Data Processing Requirements
Object-Orientated Programming
Final Review In Text: Chapters 1-3, 5-11,
Final Review In Text: Chapters 1-3, 5-10, 12,
Final Review In Text: Chapters 1-3, 5-10, 12,
Final Review In Text: Chapters 1-3, 5-10,
Enterprise Architect, CNA
Midterm Review In Text: Chapters 1-3, 5-9, 15.
Final Review In Text: Chapters 1-3, 5-10,
Midterm Review In Text: Chapters 1-3, 5-10, 15.
Midterm Review In Text: Chapters 1-3, 5-9, 15.
Midterm Review In Text: Chapters 1-3, 5-11, 15.
Presentation transcript:

Data Format Description Language (DFDL) WG Martin Westhead EPCC, University of Edinburgh

Overview Aims Approach Documents Discussion

Aims A language for describing formats –Transparency of physical representation –Unambiguous (persistent) description of data –Standard implicit XML “view” of data –Generic tools (browsing, conversion) Existing work –BinX –BFD part of the Scientific Annotation Middleware project ( –ESML

Basic Mechanism XML description of structure Description can be annotated –byteOrder=“littleEndian” In DFDL description language extensible DFDL description Data file

Approach Separate out structure and semantics Tried to avoid a data model General structural language –Repetition –Pointers –References to data –New structures can be built (compositionality) Semantics –Hard to express so…we don’t –General labeling –Label semantics define elsewhere (ontologies) –Labels can be added (extensibility)

Structure – arbitrary labels fooSet fooPair foo bunchThings thing bunchThings foo fooPair......

Structure – example labels complex Array complex float byte bit byte float complex......

Structural language Formal semantics –Structured binary sequence –Defines hierarchical structure over underlying sequence of binary values Language for describing hierarchical structure –Repetition Explicit number repeats Termination characters –Data reference Conditionals Data size –Pointers Scope –As general as possible but –Must be concise and implementable

Ontologies Define mappings binary structure language primitive SDL define binary structure Core API: –getAsInt, getAsFloat, getAsByte… –getAsIntArray, getAsFloatArray… For each structure, ontology defines semantics for each method

Ontology 2 short … byte byte byte … … Ontology defines: –New structures –New XML associated with them –Mappings from new structures to language primitives via core API –API extensions –Relationships between structures (RDF/OWL)? Java primitives

Current documents SDL formal definition XML representation of SDL Primitives ontology Basic structures ontology Primer All strawmen known to be incomplete/flawed aim to kick start discussion

Discussion issues Transformations –Low level: encodings, compression, blocking –High level: filtering out formatting Concept of type needed –distinct from binary representation Programming language independence Pointer semantics Expressive power of SDL vs. implementability –Layered standard (?)

Current proposals More transformation oriented view –as opposed to representation oriented Introduction of data model … dfdl:char Java charJava byteC char bindings representations