VO Query Language GSFC XML Group Ed Shaya Brian Thomas Kirk Borne.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

Registries Work Package 2 Requirements, Science Cases, Use Cases, Test Cases Charter: Focus on science case scenarios, and use cases related specifically.
Interoperability meeting S. Derriere, Strasbourg, 2003 October UCD2... When all you have is a hammer every problem looks like a nail. Registry,
September 13, 2004NVO Summer School1 VO Protocols Overview Tom McGlynn NASA/GSFC T HE US N ATIONAL V IRTUAL O BSERVATORY.
September 13, 2004NVO Summer School1 VO Protocols Overview Tom McGlynn NASA/GSFC T HE US N ATIONAL V IRTUAL O BSERVATORY.
9 September 2005NVO Summer School Aspen Astronomical Dataset Query Language (ADQL) Ray Plante T HE US N ATIONAL V IRTUAL O BSERVATORY.
1 ICS-FORTH & Univ. of Crete SeLene November 15, 2002 A View Definition Language for the Semantic Web Maganaraki Aimilia.
CASDA Virtual Observatory CSIRO ASTRONOMY AND SPACE SCIENCE Arkadi Kosmynin 11 March 2014.
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
XML: Extensible Markup Language
19-20 March 2003 IVOA Registry Workgroup LeSc Astrogrid Registry: Early Designs Elizabeth Auden Astrogrid Registry Workgroup Leader IVOA Registry Workgroup.
Data Intensive Techniques to Boost the Real-time Performance of Global Agricultural Data Infrastructures SEMAGROW U SING A POWDER T RIPLE S TORE FOR BOOSTING.
Solar and STP Physics with AstroGrid 1. Mullard Space Science Laboratory, University College London. 2. School of Physics and Astronomy, University of.
1 Draft of a Matchmaking Service Chuang liu. 2 Matchmaking Service Matchmaking Service is a service to help service providers to advertising their service.
A Web service for Distributed Covariance Computation on Astronomy Catalogs Presented by Haimonti Dutta CMSC 691D.
1 COS 425: Database and Information Management Systems XML and information exchange.
1 Lecture 13: Database Heterogeneity Debriefing Project Phase 2.
McGraw-Hill/Irwin Copyright © 2008, The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin Copyright © 2008 The McGraw-Hill Companies, Inc.
U of R eXtensible Catalog Team MetaCat. Problem Domain.
River Campus Libraries Find Articles A Web Redesign for ENCompass David Lindahl Web Initiatives Manager River Campus Libraries University of Rochester.
River Campus Libraries Find Articles A Web Redesign for ENCompass David Lindahl Web Initiatives Manager River Campus Libraries University of Rochester.
XML –Query Languages, Extracting from Relational Databases ADVANCED DATABASES Khawaja Mohiuddin Assistant Professor Department of Computer Sciences Bahria.
Z39.50, XML & RDF Applications ZIG Tutorial January 2000 Poul Henrik Jørgensen, Danish Bibliographic Centre,
Data Mining Techniques
2003 April 151 Data Centres: Connecting to the Real World Clive Page.
S. Derriere et al., ESSW03 Budapest, 2003 May 20 UCDs - metadata for astronomy Sébastien Derriere François Ochsenbein Thomas Boch CDS, Observatoire astronomique.
Ihr Logo Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Turban, Aronson, and Liang.
Astrogrid Resource Registry Querying the Registry 1.Mullard Space Science Laboratory, University College London, Holmbury St. Mary, Dorking, Surrey RH5.
Astronomical Data Query Language Simple Query Protocol for the Virtual Observatory Naoki Yasuda 1, William O'Mullane 2, Tamas Budavari 2, Vivek Haridas.
COMPUTER MODELS FOR SKY IMAGE ANALYSIS OF THE INASAN ZVENIGOROD OBSERVATORY Sergei Pirogov ( Institute of Astronomy, Russian Academy of Sciences) VIIth.
Characterisation Data Model applied to simulated data Mireille Louys, CDS and LSIIT Strasbourg.
Building Search Portals With SP2013 Search. 2 SharePoint 2013 Search  Introduction  Changes in the Architecture  Result Sources  Query Rules/Result.
F. Genova, VOTECH kickoff meeting, 2004/11/ Interoperability standards F. Genova, M. Allen, T. Boch, F. Bonnarel, S. Derriere, P. Fernique, F. Ochsenbein,
Introducing Reporting Services for SQL Server 2005.
Peer-to-Peer Data Integration Using Distributed Bridges Neal Arthorne B. Eng. Computer Systems (2002) Supervisor: Babak Esfandiari April 12, 2005 Candidate.
1 XML as a preservation strategy Experiences with the DiVA document format Eva Müller, Uwe Klosa Electronic Publishing Centre Uppsala University Library,
Functions and Demo of Astrogrid 1.1 China-VO Haijun Tian.
GCMD/IDN STATUS AND PLANS Stephen Wharton CWIC Meeting February19, 2015.
AstroGrid AstroGrid increases scientific research possibilities by : enabling access to distributed astronomical data and information resources. providing.
AIXM Users’ Conference, March Implementing AIXM in Instrument Flight Procedures Automation Presenter: Iain Hammond MacDonald, Dettwiler &
Introduction to Apache OODT Yang Li Mar 9, What is OODT Object Oriented Data Technology Science data management Archiving Systems that span scientific.
Prototype system of the Japanese Virtual Observatory The Japanese Virtual Observatory (JVO) aims at providing easy access to federated astronomical databases.
Using SAS® Information Map Studio
Information System Development Courses Figure: ISD Course Structure.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
Database Design and Management CPTG /23/2015Chapter 12 of 38 Functions of a Database Store data Store data School: student records, class schedules,
Ihr Logo Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Turban, Aronson, and Liang.
1 CS 502: Computing Methods for Digital Libraries Lecture 19 Interoperability Z39.50.
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
IVOA Interoperalibity JVO Query Language Naoki Yasuda (NAOJ/Japanese VO)
CMU-CS lunch talk, Gerard Lemson1 Computational and statistical problems for the Virtual Observatory With contributions from/thanks to: GAVO.
Progress Report of VOQL WG May 15 (Thu) Masatoshi Ohishi (Japan)
Functional Requirements for Bibliographic Records The Changing Face of Cataloging William E. Moen Texas Center for Digital Knowledge School of Library.
1 Registry Services Overview J. Steven Hughes (Deputy Chair) Principal Computer Scientist NASA/JPL 17 December 2015.
F. Genova, VO as a Data Grid, 2003/06/301 Interoperability of astronomy data bases Françoise Genova, CDS.
OwlViper: Web Ontology Language Visual Programming Environment for Data Reduction (of work) Ed Shaya (PI), Brian Thomas, Peter Teuben, Zhenping Huang (UMD),
12 Oct 2003VO Tutorial, ADASS Strasbourg, Data Access Layer (DAL) Tutorial Doug Tody, National Radio Astronomy Observatory T HE US N ATIONAL V IRTUAL.
A centre of expertise in digital information management UKOLN is supported by: Functional Requirements Eprints Application Profile Working.
June 27-29, DC2 Software Workshop - 1 Tom Stephens GSSC Database Programmer GSSC Data Servers for DC2.
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
February 12, 2002Tom McGlynn ADEC Interoperability Technical Working Group Report.
1 Copyright © 2008, Oracle. All rights reserved. Repository Basics.
IPDA Registry Definitions Project Dan Crichton Pedro Osuna Alain Sarkissian.
Introduction: AstroGrid increases scientific research possibilities by enabling access to distributed astronomical data and information resources. AstroGrid.
DataGrid France 12 Feb – WP9 – n° 1 WP9 Earth Observation Applications.
Data Grids, Digital Libraries and Persistent Archives: An Integrated Approach to Publishing, Sharing and Archiving Data. Written By: R. Moore, A. Rajasekar,
Progress Report of VOQL WG
Session 2: Metadata and Catalogues
Introduction of Week 9 Return assignment 5-2
Google Sky.
Presentation transcript:

VO Query Language GSFC XML Group Ed Shaya Brian Thomas Kirk Borne

May IVO Cambridge VOQL Requirements Provide a means for users to submit general requests for astronomical information from a distributed set of repositories. Allow for the science use cases. Easy to learn and use: – Hide from the user obvious but tedious steps – May require several levels o f language with only the top level being easy. Allow for web form entry. Independent of internal arrangement of data at repositories. Plug-n-play metadata and ontology. Span a distributed set of heterogeneous services. – Each VO query can transform to multiple queries in local dialects. – Workflow of interactions between registries, services, and user. – Integration of multiple responses

May IVO Cambridge More VOQL Requirements Easy to parse and transform into other forms Extensible – Sites can extend query language through local namespaces – VO namespace can add language elements into the future.

May IVO Cambridge XML Query Language Compatible XML and Human-Readable versions Xquery is a superset of Xpath Based on Quilt, XQL, and XML-QL – Quilt is based on Object Query Langauge (OQL) – OQL is based on Structured Query Language (SQL) If,then,else: case switch: basic functions: define new functions FLWR (for, let, where, return) for $i in (1 to 3) let $j := (1 to $i) Results in: $i = 1, $j = 1 $I = 2, $j = (1,2) $I =3, $j = (1,2,3)

May IVO Cambridge XQuery Continued for $s in document('‘bright_stars.xml'')/*/id_main let $b := document('‘photometry.xml'')/*/star[name = $s]/band where count ($b) > 1 return $i for $j in (2 to count($b)) $b[$j]/value - $b[$j-1]/value

May IVO Cambridge

May IVO Cambridge OLAP/XMLA On-line Analytical Processes Reduces bandwidth/time of data out Statistical Package add on to Databases Analysis of DataCubes – Hierarchy of Axis Values Years, Months, Days, Hours, minutes Degrees, minutes, seconds Interior, core, mantle, atmosphere, mesosphere, exosphere

May IVO Cambridge JVO Query Language – Naoki Yasuda Retrieves catalog data and images from multiple data servers via a single user interface Extension of SQL – Catalog.UCD – Box(Point(c1.ra,c1.dec), width1,height1) – XMATCH(c1,c2,!c3,…)< 3 arcsec – Select Catalog: Keyword1 & Keyword2 Select by [[MAX|MIN](PROPERTY) | ALL] [NAME] – Area : [inside|outside] area0 Area1 [overlap|union] area2 | shape SHAPE: box, circle, oval, triangle,point DIFF(x.obs_date, y.obs_date) > 30 days

May IVO Cambridge Data mining Beyond finding data; intense data filtering, conditioning, knowledge synthesis. Grid Services? – Principal Component Analysis – Iterative solutions – Genetic algorithms – Maximum-likelihood functions – Neural nets – Decision trees – Cluster analysis – Regression analysis

May IVO Cambridge Data Objects Dataset – Tables Fields – Units – Class (UCD) – Range – Values – Images Axes Coordinate Maps Data Values – Spectra Wavelength Intensity

May IVO Cambridge ADQL Obtain Data Sets – By bibliographic query Author, date published, title, journal, volume – By description Keywords, abstract, mission name Obtain tables – By title, table #, field names – By Xpath /LocalGroup/[galaxy=“M31”]/region7/v-band – Obtain table data by UCDs or field names – Min/max of range, regular expression Obtain N-cube data – Subset by axis values, – subset by ra,dec, radius or more generally Func(axes1..)

May IVO Cambridge Astronomy Data Query Language (ADQL)

May IVO Cambridge ADQL/Query Schema

May IVO Cambridge Knowledge Based Query Class  Instance  Objects Property (V-band)  Instance  value (-1.4) – Measurement property values are Data – Modifier (aperture)  Instance  value (3 arcsec) Modifier (inequality)  Instance  value (before, not) – Aggregate property – member, region, component Values are bags of objects – SubclassOf property – subclass has restricted property value range or restricted list of properties. Property Space – N-properties form a space. A bit of math is needed to relate values.

May IVO Cambridge Problem Statement Language: Root

May IVO Cambridge PSL Constraint

May IVO Cambridge PSL AstroObject

May IVO Cambridge AC : The Astrographic Catalogue on the Hipparcos System 1275 I/275 The AC Catalogue AC : The Astrographic Catalogue on the Hipparcos System S E Urban T E Corbin G L Wycoff E Hoeg C Fabricius V V Makarov Astron. J AJ U Dataset Schema

May IVO Cambridge <keywords xml:base= parentListURL="adc_keywordList.html"> Positional data Astrographic zones Surveys The AC is a revised version of the 1997 release of the AC 2000 (Cat. ). It was decided that the availability of an improved reference catalogue and the inclusion of photometry from the Tycho-2 catalogue would be sufficient to warrant a complete re- reduction of the data and a new distribution of the catalogue. The AC catalog contains positions of 4,621,751 stars at the average epoch of plate exposures for each star (average 1907). Dataset Continued

May IVO Cambridge Case Study 0: Setting up the Query Return RA, Dec, Vmag for stars with 13<Vmag<15 and 10:12:53.5<RA<13:13:43 and 18:38:00<DE< 18:40:00. PSL: ?vmag \ ?ra ?de

May IVO Cambridge Case Study 0: Mapping Query to Metadata Search for tables with metadata that satisfy: – Object/[class=“star”] –search-> keyword, description – –search-> field/UCD, name – –search-> field/UCD, name – –search-> field/UCD, name – Property/range –search-> field/min and field/max or coverage attributes For all such tables, return: ?vmag, ?ra, ?de Also, return for group with Vmag info.

May IVO Cambridge Problem Statement Language (PSL) Begin Request Constraint Find astronomical objects with the following properties: AND these properties 1. Name: assign to var1 2. Class is "cluster of galaxies | galaxy cluster" 3. Measurement quantities satisfy: a. X-ray brightness > 3.3E7Jy : assign to var2 1. Time interval of measurement: 1998Y-1999Y Using the above variables satisfy, the math formulae: 1. (var2 + var3) < (var1 – log[var4]) OR these constraints [several constraints for which one must be true etc ] Return a table with the following sequence of fields: var1 var2 End Request PSL Pull down AndConstrainties, Andproperties PSL Pull down AndConstrainties, Andproperties Property Name Pull Down Name, Class, etc. Property Name Pull Down Name, Class, etc. MathML Pull down *,-,/,+,sum,avg,, etc MathML Pull down *,-,/,+,sum,avg,, etc

May IVO Cambridge Brian Thomas’ Infrastructure

May IVO Cambridge Tony Linde’s Infrastructure VO activity – User User – Problem Assistant – service to help user state the problem Problem Assistant – Ontology – terms and relationships derived from existing data Ontology Workflow – to retrieve data, merge it, analyze it, reduce it Workflow Registry – lists all services and their high level metadata Registry Job Control – decides which jobs and when Job Control Data Centre – receiver of query for all internal data sources Data Centre Data Source Service – uses translator to restate query Data Source Translator – from data query language to implemented service Translator Languages – Problem Statement Language (PSL) Problem Statement Language (PSL) – Workflow Language (WFL) Workflow Language (WFL) – Astronomical dataset Query Language (ADQL) Astronomical dataset Query Language (ADQL) – Ontology Query Language (OQL) Ontology Query Language (OQL) – Registry Query Language (RQL) Registry Query Language (RQL)

May IVO Cambridge Conclusion Metadata should clearly distinguish between values that are property values and those that are modifiers of properties. Then, a mapping from a natural(ish) scientific knowledge based language (PSL) to a request language for data-center common items (ADQL) is possible. A federated system with a VO-wide vocabulary plus specialized (local) namespaces is best for getting started right away and permitting for evolution.