Metadata for the SKA - Niruj Mohan Ramanujam, NCRA.

Slides:



Advertisements
Similar presentations
Configuration Management
Advertisements

CACORE TOOLS FEATURES. caCORE SDK Features caCORE Workbench Plugin EA/ArgoUML Plug-in development Integrated support of semantic integration in the plugin.
National Radio Astronomy Observatory June 13/14, 2005 EVLA Phase II Proposal Review EVLA Phase II Computing Development Bryan Butler (EVLA System Engineer.
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
Information Retrieval in Practice
SpaceGRID and EGSO Satu Keski-Jaskari Maria Vappula Parallal Computing – Seminar
VB in Context Michael B. Spring Department of Information Science and Telecommunications University of Pittsburgh Pittsburgh, Pa 15260
Overview of Search Engines
System Design/Implementation and Support for Build 2 PDS Management Council Face-to-Face Mountain View, CA Nov 30 - Dec 1, 2011 Sean Hardman.
Introduction to DBMS Purpose of Database Systems View of Data
Upcoming Enhancements to the HST Archive Mark Kyprianou Operations and Engineering Division Data System Branch.
1. 2 Purpose of This Presentation ◆ To explain how spacecraft can be virtualized by using a standard modeling method; ◆ To introduce the basic concept.
Chapter 3: Operating-System Structures System Components Operating System Services System Calls System Programs System Structure Virtual Machines System.
` Tangible Interaction with the R Software Environment Using the Meuse Dataset Rachel Bradford, Landon Rogge, Dr. Brygg Ullmer, Dr. Christopher White `
Ch 5. The Evolution of Analytic Processes
Metadata IN Smart Grid Group Name: REQ
Chapter 2 Architecture of a Search Engine. Search Engine Architecture n A software architecture consists of software components, the interfaces provided.
MWA Data Capture and Archiving Dave Pallot MWA Conference Melbourne Australia 7 th December 2011.
ALMA Software B.E. Glendenning (NRAO). 2 ALMA “High Frequency VLA” in Chile Presently a European/North American Project –Japan is almost certainly joining.
Adaptive Hypermedia Tutorial System Based on AHA Jing Zhai Dublin City University.
Chapter 1 : Introduction §Purpose of Database Systems §View of Data §Data Models §Data Definition Language §Data Manipulation Language §Transaction Management.
Doug Tody E2E Perspective EVLA Advisory Committee Meeting December 14-15, 2004 EVLA Software E2E Perspective.
1 Software Design Overview Reference: Software Engineering, by Ian Sommerville, Ch. 12 & 13.
An Integrated Meteorological Surface Observation System By Wan Mohd. Nazri Wan Daud Malaysian Meteorological Department Due to the increasing and evolving.
©Silberschatz, Korth and Sudarshan1.1Database System Concepts Chapter 1: Introduction Purpose of Database Systems View of Data Data Models Data Definition.
Ocean Observatories Initiative Data Management (DM) Subsystem Overview Michael Meisinger September 29, 2009.
Unit-1 Introduction Prepared by: Prof. Harish I Rathod
Archive Access Tool Review of SSS Readiness for EVLA Shared Risk Observing, June 5, 2009 John Benson Scientist.
INFORMATION SYSTEM-SOFTWARE Topic: OPERATING SYSTEM CONCEPTS.
R MoeserCorrelator f2f Meeting1 MCAF (Metadata Capture and Formatting) Rich Moeser.
A university for the world real R © 2009, Chapter 9 The Runtime Environment Michael Adams.
14 June, 2004 EVLA Overall Design Subsystems II Tom Morgan 1 EVLA Overall Software Design Final Internal Review Subsystems II by Tom Morgan.
Department of Industrial Engineering Sharif University of Technology Session# 10.
Introduction to EVLA Software Bryan Butler. 2006Dec05/06EVLA M&C Transition Software CDR2 EVLA Computing (Terse) History The original EVLA Phase I proposal.
M.P. RupenCorrelator Face-to-Face Meeting 31 Oct 2006 Output Formats Part II Michael P. Rupen.
ICALEPCS 2005 Geneva, Oct. 12 The ALMA Telescope Control SystemA. Farris The ALMA Telescope Control System Allen Farris Ralph Marson Jeff Kern National.
August 2003 At A Glance The IRC is a platform independent, extensible, and adaptive framework that provides robust, interactive, and distributed control.
5 June 2002DOM Main Board Engineering Requirements Review 1 DOM Main Board Software Engineering Requirements Review June 5, 2002 LBNL Chuck McParland.
Peter W. PhillipsATLAS SCT Week, CERN, September/October 2002 Electrical Tests of SCT modules using RODs Peter W Phillips Rutherford Appleton Laboratory.
Towards Unifying Vector and Raster Data Models for Hybrid Spatial Regions Philip Dougherty.
Mountaintop Software for the Dark Energy Camera Jon Thaler 1, T. Abbott 2, I. Karliner 1, T. Qian 1, K. Honscheid 3, W. Merritt 4, L. Buckley-Geer 4 1.
Lifecycle Metadata for Digital Objects November 15, 2004 Preservation Metadata.
GT3 Index Services Lecture for Cluster and Grid Computing, CSCE 490/590 Fall 2004, University of Arkansas, Dr. Amy Apon.
Copyright 2007, Information Builders. Slide 1 iWay Web Services and WebFOCUS Consumption Michael Florkowski Information Builders.
The AstroGrid-D Information Service Stellaris A central grid component to store, manage and transform metadata - and connect to the VO!
September 2003, 7 th EDG Conference, Heidelberg – Roberta Faggian, CERN/IT CERN – European Organization for Nuclear Research The GRACE Project GRid enabled.
Scenario use cases Szymon Mueller PSNC. Agenda 1.General description of experiment use case. 2.Detailed description of use cases: 1.Preparation for observation.
Statistical process model Workshop in Ukraine October 2015 Karin Blix Quality coordinator
M.-E. Bégin¹, S. Da Ronco², G. Diez-Andino Sancho¹, M. Gentilini³, E. Ronchieri ², and M. Selmi² ¹CERN, Switzerland, ² INFN-Padova, Italy, ³INFN-CNAF,
Store and exchange data with colleagues and team Synchronize multiple versions of data Ensure automatic desktop synchronization of large files B2DROP is.
1 Copyright © 2008, Oracle. All rights reserved. Repository Basics.
Metadata V1 By Dick M.A. Schaap – technical coordinator Oostende, June 08.
System Components Operating System Services System Calls.
TIBCO Business Events Online Training. Introduction to TIBCO BE Tibco Business Events is complex event processing software with a powerful engine enables.
Information Retrieval in Practice
Introduction to DBMS Purpose of Database Systems View of Data
Fault Management for the SKA - Niruj Mohan Ramanujam, NCRA
WP18, High-speed data recording Krzysztof Wrona, European XFEL
From LSE-30: Observatory System Spec.
Self Healing and Dynamic Construction Framework:
Computing Architecture
EVLA Overall Software Design
The Re3gistry software and the INSPIRE Registry
Introduction to DBMS Purpose of Database Systems View of Data
Chapter 2: Operating-System Structures
EVLA Advisory Committee Meeting, March 19-20, 2009
Introduction to Operating Systems
SDP Interface Identification
Chapter 2: Operating-System Structures
<Your Team # > Your Team Name Here
Presentation transcript:

Metadata for the SKA - Niruj Mohan Ramanujam, NCRA

Overview Any auxillary data needed to fully interpret and process the science data is termed as metadata. This metadata is generated by multiple domains. Metadata is collected by the M&C system from various components and sub-systems of the SKA (engineering and array configuration data, alarms etc) & external Elements and equipment connected to the M&C (signal transport, weather monitoring equipment etc), based on multiple tables, throughout the course of an observation, is written to a repository along with the science data, with links provided to the rest of the relevant M&C Archive data, to be analysed and interpreted by science data analysis tools

Overview The issues related to metadata include Definition Diversity Generation Storage Drill-down or backtracking Formats

Metadata Definitions Metadata for radio telescopes have traditionally been a collection of keywords (and tables) in ascii in FITS files, and some text in log files.

Metadata Definitions LOFAR and ALMA have enlarged the concept of metadata and this is even more so for the SKA. We expect metadata to be generated through metadata definitions read from Static configuration files, which encode certain default metadata which remain constant through substantial parts of the observation, e.g. array configuration, frequency settings, Dynamic files, the contents and nature of which are changing, e.g. system alarms, RFI and weather monitoring, User-specified files, wherein the user can decide to include certain M&C-accessible data in their metadata SKA needs for metadata will be defined by Software and Computing.

Diversity of Metadata Metadata differ in their specification method, origin, modes of acquisition, timescale for change, storage method etc. Both metadata generation and storage need to be able to handle this diversity, which is dependent on an SKA Science Data Model (ALMA SDM may be generic enough to handle SKA needs). We attempt to provide an non-exhaustive list of the various types of metadata, to help show the complexity involved.

Diversity of Metadata Metadata typeOriginsExamples Array configurationInstallation teamTree from sub array down to receiver Observation configuration Observation Preparation Pointing direction, channel frequencies, receiver id,etc. Observing projectProject id, observer, links to other project data Log fileArchiveLog file Alarms and EventsAlarm Handler, Archive Antennae down, number of dipoles in a station less than optimal etc. Data processingScience Data Archive Flagging, initial calibration, visibility weights etc. System performance M&C Stability of power levels, temperature etc., available disk memory, real-time correction performance etc. RFITime-tagged, frequency tagged information on RFI EnvironmentTemperature, precipitation, wind speed, TEC estimates Applied correctionsPointing corrections, instrument offsets, weights etc. Instrument dataTotal power, frequency stability, clock errors etc. Performance gapsPointing errors, non working dipoles etc.

Metadata Generation These metadata are generated by the M&C system by abstracting from the information acquired by its sub-systems, collected by the M&C system by subscribing to custom applications and interfacing with other domains (e.g. engineering, science, external monitors, data analysis etc), generated and archived by other domains and applications M&C needs to ensure that all relevant M&C data will be stored in the Metadata archive.

Metadata Generation This leads to two models for the generation of Metadata... A central Metadata Engine sits inside the Central M&C and assembles the required metadata by subscribing to whichever service is required (e.g. LOFAR), and Every Regional M&C has its own Metadata engine which collects necessary information locally, and sends it upwards, along with a System M&C Metadata Engine. These have an impact on synchronicity, metadata aggregation, alarm generation and so on.

Storage Metadata will be stored in the Archive in suitable form(s), with drill-down/backtracking, and bundled together with the science data. The nature of storage of different types of Metadata will depend partly on the SDM. Some issues are.. Differing time granularities - separate for efficiency (ALMA) Differing latencies, especially those needed as feedback for other applications Ability to change metadata post-observations (AWE) Ability to recreate metadata from the Archive Ability to track back (drill-down) from the data (AWE) Different delivery times for different metadata

Drill-down Starting from the metadata, the user should be able to drill-down to the Component level M&C information through the Engineering Archive - called drill-down or trackability. Any changes to the data downstream of acquisition will also need to be registered in the metadata, which is therefore dynamic. AstroWise (AWE) has this capability, wherein the metadata are stored in a relational database. Metadata will also need to be linked to the Log Archive.

Metadata Formats The recommended practice for Metadata is to define them through an XML schema, using controlled vocabularies and link them to the binary files containing science data. The structure and format will also be influenced by the science data format. LOFAR uses HDF5 files ALMA uses CASA MS tables AWE uses relational databases The format of the metadata will depend on the SKA Science Data Model and may well change with time initially. This imposes requirements on the flexibility of analysis tools and SDM formats

Metadata Standards There are quite a few Metadata standards which could be useful... ARDA Metadata Grid Application (AMGA) for the LHC AstroGrid-D, RTML, RDF - Grid and VO protocols Open Archives Initiative Protocol for Metadata harvesting ISO IEC an international metadata registry standard

Summary Any auxillary data needed to fully interpret and process the science data is termed as metadata. This metadata is generated by multiple domains. M&C supports science operation by collecting Metadata from various components and sub-systems of the SKA (engineering and array configuration data, alarms etc) & from external Elements and equipment connected to the M&C (signal transport, weather monitoring equipment etc), based on multiple tables, throughout the course of an observation, is written to a repository along with the science data, with links provided to the rest of the relevant M&C Archive data, to be analysed and interpreted by science data analysis tools.

See section 5.5 of M&C Design Concept Descriptions, “04_WP TD-002v0.2_dcd”