ARCS Data Analysis Software An overview of the ARCS software management plan Michael Aivazis California Institute of Technology ARCS Baseline Review March.

Slides:



Advertisements
Similar presentations
Accelerating The Application Lifecycle. DEPLOY DEFINE DESIGN TEST DEVELOP CHANGE MANAGEMENT Application Lifecycle Management #1 in Java Meta, Giga, Gartner.
Advertisements

A Workflow Engine with Multi-Level Parallelism Supports Qifeng Huang and Yan Huang School of Computer Science Cardiff University
INTRODUCTION TO SIMULATION WITH OMNET++ José Daniel García Sánchez ARCOS Group – University Carlos III of Madrid.
HP Quality Center Overview.
May 17, Capabilities Description of a Rapid Prototyping Capability for Earth-Sun System Sciences RPC Project Team Mississippi State University.
Summary Role of Software (1 slide) ARCS Software Architecture (4 slides) SNS -- Caltech Interactions (3 slides)
Building Enterprise Applications Using Visual Studio ®.NET Enterprise Architect.
MIT iCampus iLabs Software Architecture Workshop June , 2006.
Robust Tools for Archiving and Preserving Digital Data Joseph JaJa, Mike Smorul, and Mike McGann Institute for Advanced Computer Studies Department of.
DANSE Central Services Michael Aivazis Caltech NSF Review May 23, 2008.
Presented by IBM developer Works ibm.com/developerworks/ 2006 January – April © 2006 IBM Corporation. Making the most of Creating Eclipse plug-ins.
© , Michael Aivazis DANSE Software Issues Michael Aivazis California Institute of Technology DANSE Software Workshop September 3-8, 2003.
The ARCS Data Analysis Software Michael Aivazis California Institute of Technology.
CASE Tools CIS 376 Bruce R. Maxim UM-Dearborn. Prerequisites to Software Tool Use Collection of useful tools that help in every step of building a product.
The ARCS Data Analysis Software Michael Aivazis California Institute of Technology.
© , Michael Aivazis DANSE Software Architecture Challenges and opportunities for the next generation of data analysis software Michael Aivazis.
An overview of the DANSE software architecture Michael Aivazis Caltech DANSE Kick-Off Meeting Pasadena Aug 15, 2006.
Supplement 02CASE Tools1 Supplement 02 - Case Tools And Franchise Colleges By MANSHA NAWAZ.
Pyre: a distributed component framework Michael Aivazis Caltech DANSE Developers Workshop January 22-23, 2007.
Galin, SQA from theory to implementation © Pearson Education Limited Chapter 13 CASE Tools and their Effect on Software Quality.
(1) Introduction © Sudhakar Yalamanchili, Georgia Institute of Technology, 2006.
The Astro-Wise Pipeline Conceptual Design Review Roeland Rengelink Requirements Development Framework Prototype.
Chapter 2 The process Process, Methods, and Tools
Metadata Creation with the Earth System Modeling Framework Ryan O’Kuinghttons – NESII/CIRES/NOAA Kathy Saint – NESII/CSG July 22, 2014.
Zhonghua Qu and Ovidiu Daescu December 24, 2009 University of Texas at Dallas.
THE GITB TESTING FRAMEWORK Jacques Durand, Fujitsu America | December 1, 2011 GITB |
DCS Overview MCS/DCS Technical Interchange Meeting August, 2000.
Architecture-Based Runtime Software Evolution Peyman Oreizy, Nenad Medvidovic & Richard N. Taylor.
Introduction to MDA (Model Driven Architecture) CYT.
Cluster Reliability Project ISIS Vanderbilt University.
Flexibility and user-friendliness of grid portals: the PROGRESS approach Michal Kosiedowski
EGEE is a project funded by the European Union under contract IST Testing processes Leanne Guy Testing activity manager JRA1 All hands meeting,
Resource Management Working Group SSS Quarterly Meeting November 28, 2001 Dallas, Tx.
AUTOMATION OF WEB-FORM CREATION - KINNERA ANGADI – MS FINAL DEFENSE GUIDANCE BY – DR. DANIEL ANDRESEN.
DANSE Central Services Michael Aivazis Caltech NSF Review May 31, 2007.
CAM-I Scalable Flexible Manufacturing Initiative NGMS Task 6.1.
A performance evaluation approach openModeller: A Framework for species distribution Modelling.
Hyper/J and Concern Manipulation Environment. The need for AOSD tools and development environment AOSD requires a variety of tools Life cycle – support.
Nick Draper 05/11/2008 Mantid Manipulation and Analysis Toolkit for ISIS data.
DataNet – Flexible Metadata Overlay over File Resources Daniel Harężlak 1, Marek Kasztelnik 1, Maciej Pawlik 1, Bartosz Wilk 1, Marian Bubak 1,2 1 ACC.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
SEE-GRID-SCI The SEE-GRID-SCI initiative is co-funded by the European Commission under the FP7 Research Infrastructures contract no.
March 2004 At A Glance NASA’s GSFC GMSEC architecture provides a scalable, extensible ground and flight system approach for future missions. Benefits Simplifies.
Portable Infrastructure for the Metafor Metadata System Charlotte Pascoe 1, Gerry Devine 2 1 NCAS-BADC, 2 NCAS-CMS University of Reading PIMMS provides.
Confidential Continuous Integration Framework (CIF) 5/18/2004.
CASE (Computer-Aided Software Engineering) Tools Software that is used to support software process activities. Provides software process support by:- –
Connections to Other Packages The Cactus Team Albert Einstein Institute
Mantid Stakeholder Review Nick Draper 01/11/2007.
August 2003 At A Glance The IRC is a platform independent, extensible, and adaptive framework that provides robust, interactive, and distributed control.
Overview of the Systems Biology Workbench Michael Hucka, Andrew Finney, Herbert Sauro, Hamid Bolouri ERATO Kitano Systems Biology Project California Institute.
Mike Hildreth DASPOS Update Mike Hildreth representing the DASPOS project 1.
Executive Overview. Software modeling is essential, because it is the map that guides your developers. Additionally: Modeling Software  Visual information.
TRIUMF HLA Development High Level Applications Perform tasks of accelerator and beam control at control- room level, directly interfacing with operators.
March 2004 At A Glance The AutoFDS provides a web- based interface to acquire, generate, and distribute products, using the GMSEC Reference Architecture.
INFSO-RI JRA2 Test Management Tools Eva Takacs (4D SOFT) ETICS 2 Final Review Brussels - 11 May 2010.
Origami: Scientific Distributed Workflow in McIDAS-V Maciek Smuga-Otto, Bruce Flynn (also Bob Knuteson, Ray Garcia) SSEC.
Simulation Production System Science Advisory Committee Meeting UW-Madison March 1 st -2 nd 2007 Juan Carlos Díaz Vélez.
Configuration Control (Aliases: change control, change management )
CASE Tools and their Effect on Software Quality
XML and Distributed Applications By Quddus Chong Presentation for CS551 – Fall 2001.
CIS 375 Bruce R. Maxim UM-Dearborn
Building Enterprise Applications Using Visual Studio®
Deployment of Flows Loretta Auvil
Spark Presentation.
GeoFramework: Example of using frameworks in geophysics
Applied Software Implementation & Testing
The Extensible Tool-chain for Evaluation of Architectural Models
OO Design and Development
Overview of Workflows: Why Use Them?
Overview Activities from additional UP disciplines are needed to bring a system into being Implementation Testing Deployment Configuration and change management.
Presentation transcript:

ARCS Data Analysis Software An overview of the ARCS software management plan Michael Aivazis California Institute of Technology ARCS Baseline Review March 14, 2002

2 Overview Software engineering practices –Version control –Configuration management –Bug tracking –Regression testing Architectural overview –A success story –The key design elements –Design goals Compatibility –Programming languages –File formats –Data analysis and visualization environments

3 Software engineering practices Version control –Provides a record of the evolution of the software –CVS: well supported, open source Configuration management –Uniform, portable build procedure –Automatic, regular builds of the entire software base –config: a system based on make Regression testing –Test cases that Exercise expected behavior Exercise fixes for known bugs Bug tracking –Organize the “to do” list, the feature requests … and the known defects –Gnats: well supported, open source

4 Overview of the Caltech ASCI center Constructing a Virtual Testing Facility (VTF) Large, multi-disciplinary project –Five Caltech research groups and a dozen sub-contractors Application highlights: –Currently: 250 kloc in C, F77, F90 and C++ –Massively parallel: ~10K processors –Large data storage requirements: ~10Tb per run –Relies heavily on existing code, both solvers and infrastructure Python provides the glue –Extensibility is an essential feature Pyre: the application framework –Python scripts: 700 classes, 35,000 lines of code –C++: lines of bindings and infrastructure

5 The scope of Pyre Problem specification Solid modeling Boundary and initial conditions Materials and constitutive models Solvers Simulation driver Simulation instrumentation Integration of parallel/distributed programming infrastructure Real-time visualization Full simulation archiving

6 Flexibility through the use of scripting Scripting enables us to –Organize the large number of parameters –Allow the environment to discover new capabilities without the need for recompilation or relinking The python interpreter –The interpreter modern object oriented language robust, portable, mature, well supported, well documented easily extensible rapid application development –Support for parallel programming (if necessary…) trivial embedding of the interpreter in an MPI compliant manner a python interpreter on each compute node MPI is fully integrated: bindings + OO layer –Experience: no measurable impact on either performance or scalability

7 Integrated facilities The existence of the framework enables the introduction of new facilities without requiring any work on the existing ones Scripted facilities: –Infrastructure: MPI, ACIS, GrACE –Solvers: adlib, tetra; rm3d, rm3dge, arm3d –HDF5 –Sensors and probes In progress: –Existing material models –Material properties Near future –Grid support –Next generation of solvers

8 MATLAB MATLAB is fully accessible from within Python –The reverse is not true (…yet)

9 IRIS Explorer

10 Design goals Integrate analysis modules using scripting –Python Data flow paradigm –Well understood –Easy to implement and document XML used to construct a fully reproducible description of the data analysis pipeline –Tag and archive data –Record the version number of each module File formats: –Reuse, reuse, reuse –Augment, contribute –HDF5 NeXus + XML meta-data (?)

11 Conclusions The software engineering issues are well understood The architectural vision for the analysis software –Is based on an intuitive paradigm –Leverages existing work ARCS is in a good position to have a positive influence on the field –Enough lead time –Well funded –Multiple leveraging –Low risk Near term activities –Define policies –Implement and deploy software management tools –Take inventory of existing analysis and visualization tools –Construct a prototype of analysis