ALMA Integrated Computing Team Coordination & Planning Meeting #1 Santiago, 17-19 April 2013 Relational APDM & Relational ASDM models effort done in online.

Slides:



Advertisements
Similar presentations
Connecting to Databases. relational databases tables and relations accessed using SQL database -specific functionality –transaction processing commit.
Advertisements

XIr2 Recommended Performance Tuning Andy Erthal BI Practice Manager.
XML DOCUMENTS AND DATABASES
C6 Databases.
1 Projection Indexes in HDF5 Rishi Rakesh Sinha The HDF Group.
Database Management: Getting Data Together Chapter 14.
Physical Database Monitoring and Tuning the Operational System.
U of R eXtensible Catalog Team MetaCat. Problem Domain.
Chapter 7 Managing Data Sources. ASP.NET 2.0, Third Edition2.
Data Persistence and Object-Relational Mapping Slides by James Brucker, used with his permission 1.
Objectives of the Lecture :
Rice KRAD Data Layer JPA Design Eric Westfall July 2013.
CSE446 S OFTWARE Q UALITY M ANAGEMENT Spring 2014 Yazılım ve Uyguluma Geliştirme Yöneticisi Orhan Başar Evren.
Chapter 4: Organizing and Manipulating the Data in Databases
Apache Chemistry face-to-face meeting April 2010.
Database Design for DNN Developers Sebastian Leupold.
CS370 Spring 2007 CS 370 Database Systems Lecture 2 Overview of Database Systems.
Database Systems: Design, Implementation, and Management Eighth Edition Chapter 10 Database Performance Tuning and Query Optimization.
ESO - Garching 23 June – 02 July, 2003 ACS Course Data entities and XML serialization H. Sommer.
Chapter Oracle Server An Oracle Server consists of an Oracle database (stored data, control and log files.) The Server will support SQL to define.
CSCI 6962: Server-side Design and Programming Support Classes and Shopping Carts.
DAY 14: ACCESS CHAPTER 1 Tazin Afrin October 03,
/11/2003 C-JDBC: a High Performance Database Clustering Middleware Nicolas Modrzyk
 DATABASE DATABASE  DATABASE ENVIRONMENT DATABASE ENVIRONMENT  WHY STUDY DATABASE WHY STUDY DATABASE  DBMS & ITS FUNCTIONS DBMS & ITS FUNCTIONS 
HBase A column-centered database 1. Overview An Apache project Influenced by Google’s BigTable Built on Hadoop ▫A distributed file system ▫Supports Map-Reduce.
Miscellaneous Excel Combining Excel and Access. – Importing, exporting and linking Parsing and manipulating data. 1.
ALMA Integrated Computing Team Coordination & Planning Meeting #2 Santiago, January 2014 ASDM relational database Rafael Hiriart / Jorge Avarias.
A Metadata Based Approach For Supporting Subsetting Queries Over Parallel HDF5 Datasets Vignesh Santhanagopalan Graduate Student Department Of CSE.
Physical Database Design & Performance. Optimizing for Query Performance For DBs with high retrieval traffic as compared to maintenance traffic, optimizing.
ALMA Integrated Computing Team Coordination & Planning Meeting #1 Santiago, April 2013 Evaluation of mongoDB for Persistent Storage of Monitoring.
File Processing - Database Overview MVNC1 DATABASE SYSTEMS Overview.
ALMA Integrated Computing Team Coordination & Planning Meeting #2 Santiago, January 2014 Control Group Planning Rafael Hiriart, Control Group Lead.
NoSQL Databases Oracle - Berkeley DB Rasanjalee DM Smriti J CSC 8711 Instructor: Dr. Raj Sunderraman.
NoSQL Databases Oracle - Berkeley DB. Content A brief intro to NoSQL About Berkeley Db About our application.
Object Oriented Analysis and Design 1 Chapter 7 Database Design  UML Specification for Data Modeling  The Relational Data Model and Object Model  Persistence.
Database Design and Management CPTG /23/2015Chapter 12 of 38 Functions of a Database Store data Store data School: student records, class schedules,
C6 Databases. 2 Traditional file environment Data Redundancy and Inconsistency: –Data redundancy: The presence of duplicate data in multiple data files.
Hibernate 3.0. What is Hibernate Hibernate is a free, open source Java package that makes it easy to work with relational databases. Hibernate makes it.
ICDL 2004 Improving Federated Service for Non-cooperating Digital Libraries R. Shi, K. Maly, M. Zubair Department of Computer Science Old Dominion University.
INFORMATION MANAGEMENT Unit 2 SO 4 Explain the advantages of using a database approach compared to using traditional file processing; Advantages including.
Database structure and space Management. Database Structure An ORACLE database has both a physical and logical structure. By separating physical and logical.
Observing Modes from a Software viewpoint Robert Lucas and Philippe Salomé (SSR)
INTRODUCTION lecture1 1. Data base concept Data is a meaningless static value. What does 3421 means? Information is the data you process in a manner that.
Oracle 10g Database Administrator: Implementation and Administration Chapter 5 Basic Storage Concepts and Settings.
Introduction.  Administration  Simple DBMS  CMPT 454 Topics John Edgar2.
JAVA BEANS JSP - Standard Tag Library (JSTL) JAVA Enterprise Edition.
Feb 24-27, 2004ICDL 2004, New Dehli Improving Federated Service for Non-cooperating Digital Libraries R. Shi, K. Maly, M. Zubair Department of Computer.
ALMA Integrated Computing Team Coordination & Planning Meeting #1 Santiago, April 2013 ICT Group planning: Scheduling Jorge Avarias ICT Scheduling.
20 Copyright © 2008, Oracle. All rights reserved. Cache Management.
Andrea Valassi (CERN IT-DB)CHEP 2004 Poster Session (Thursday, 30 September 2004) 1 HARP DATA AND SOFTWARE MIGRATION FROM TO ORACLE Authors: A.Valassi,
Correlator GUI Sonja Vrcic Socorro, April 3, 2006.
Experience with XML Schema Ashok Malhotra Schema Usage  Mapping XML Schema and XML documents controlled by the Schema to object classes and instances.
Scalable data access with Impala Zbigniew Baranowski Maciej Grzybek Daniel Lanza Garcia Kacper Surdy.
Copyright 2007, Information Builders. Slide 1 iWay Web Services and WebFOCUS Consumption Michael Florkowski Information Builders.
Hibernate Thuy, Le Huu. Pentalog VN. Agenda Hibernate Annotations Improving performance – Lazy loading – Fetching Strategies – Dynamic insert, dynamic.
Event Management. EMU Graham Heyes April Overview Background Requirements Solution Status.
CS223: Software Engineering Lecture 19: Unit Testing.
Aggregator Stage : Definition : Aggregator classifies data rows from a single input link into groups and calculates totals or other aggregate functions.
ESO - Garching 08 – 09 March, st ALMA Common Software Workshop XML « Data by Value » Transport.
Apache Solr Dima Ionut Daniel. Contents What is Apache Solr? Architecture Features Core Solr Concepts Configuration Conclusions Bibliography.
CS 440 Database Management Systems Stored procedures & OR mapping 1.
The Database Project a starting work by Arnauld Albert, Cristiano Bozza.
1 Copyright © 2008, Oracle. All rights reserved. Repository Basics.
Jean-Philippe Baud, IT-GD, CERN November 2007
DCS Status and Amanda News
CMS High Level Trigger Configuration Management
Physical Database Design and Performance
Data Model.
Testing a persistence layer
Developing and testing enterprise Java applications
Presentation transcript:

ALMA Integrated Computing Team Coordination & Planning Meeting #1 Santiago, April 2013 Relational APDM & Relational ASDM models effort done in online software Jorge Avarias & Rafael Hiriart

ICT-CPM April 2013 Used by scheduling subsystem to:  Improve performance of the scheduling panel in OMC to show the list of observable Projects/SchedBlocks.  Run the selection step of the DSA based on criteria defined by users (Scheduling DSA Policy)  In Planning Mode Simulator as data source. The original motivation was to reduce the start-up time of the Scheduling Panel in the OMC. Relational APDM

ICT-CPM April 2013 The model is a subset of the APDM, plus some extra fields required by Scheduling. Currently in use as the schema of the SWDB. The SWDB is populated by the Scheduling updater:  In a incremental way if there is already data in the SWDB, otherwise Scheduling updater performs a full import of the APDM projects.  Scheduling Updater validates the APDM projects rejecting every inconsistent entity. Current Status (1/2)

ICT-CPM April 2013 Recently scheduling updater performance has been improved, creating a custom made read-only DAO for the XML store:  The DAO reads the APDM ObsProjects, ObsPrograms and SchedBlocks directly from Oracle using hibernate.  These retrieved XML Documents are converted to a DOM tree in memory, then passed to castor to convert them into Castor's mappings at the moment of the conversion to the scheduling data-model. Current Status (2/2)

ICT-CPM April 2013 To be used by Data Capturer to save observation metadata into a database (most likely Oracle) The main motivation is to solve the memory usage constraint issue found in the past in Data Capturer during long observations with a large number of antennas.  DC uses ~1.5 GB of RAM to store ~40 min. of an observation using 32 antennas.  Main contributor is the Pointing table (saved as binary and pushed into bulk store at the end of the observation) Relational ASDM

ICT-CPM April 2013 Workaround implemented in ALMA-9_1_4-B.  It considers only the Pointing table, which is written to a file in binary format (/tmp directory).  Then the file is pushed to BulkStore incrementally (implemented new method storeDataIncrementally in BulkStore) Long term implementation considers every single ASDM table defined as XML Schema. Current Status (1/4)

ICT-CPM April 2013 Prototype has been implemented. It is based on hibernate  The hibernate mapping is generated using Java reflection to know what are the columns and the keys of every table.  ASDM java row classes cannot be used as mapping classes for hibernate.  New wrappers classes are created for mapping: Defining the no-param constructor. Doing public some setters. Fixing some getters returning Exceptions. DB is auto-generated by Hibernate. Relationships (FK) have not yet defined among tables. Current Status (2/4)

ICT-CPM April 2013 New hibernate simple and complex types classes (to convert table value to POJO and vice versa) were implemented to handle custom ASDM types:  Simple types for angle, temperature, speed, etc.  Complex types for timeInterval and others.  Complex type for arrays (including IDL enumerations): Arrays are saved serialized (current XML format) as CLOB into DB.  Complex types for IDL enumerations. In the table two columns are used: IDL_Type and actual valor Newly defined hibernate types classes are easy because the ASDM data is immutable. Current Status (3/4)

ICT-CPM April 2013 Defined a DAO to read-write data.  Supports concurrent transactions a sessions using the ThreadedSessionFactory coming by default in Hibernate.  Save methods receive Row ASDM class as parameter, then the DAO encapsulates the parameter into the respective RowWrapper class, the the data is saved into DB.  Get methods support Criteria queries and incremental iterators to avoid to put the entire ASDM table in memory. Current Status (4/4)

ICT-CPM April 2013 Unit test implemented as part of the prototype:  Test zero: tests hibernate simple and complex types classes.  Test one: Take a XML ASDM on disk, load it on memory using ASDM Java classes, then every row in every table is saved into DB (HSQLDB file mode). Finally the ASDM is reloaded from DB and every row is compared as XML string with the original XML rows loaded from disk. Integration test used is CONTROL/IntTest  Interferometry with 4 arrays and flagging is tested.  So far no performance problems are detected. Current Status: Testing

ICT-CPM April 2013 Scalability tests, focused in known what is the maximum throughput.  They can be done after the DC scalability test are completed.  There are place for optimizations ;) Well defined DB model.  Model should not be generated automatically by Hibernate.  To define relationships (FK and constraints) among tables. This API is just for Java. Should Python and C++ be supported as well? Pending