DANSE Central Services Michael Aivazis Caltech NSF Review May 23, 2008.

Slides:



Advertisements
Similar presentations
A Workflow Engine with Multi-Level Parallelism Supports Qifeng Huang and Yan Huang School of Computer Science Cardiff University
Advertisements

Configuration management
Configuration management
DIGIDOC A web based tool to Manage Documents. System Overview DigiDoc is a web-based customizable, integrated solution for Business Process Management.
COM vs. CORBA.
LUNARC, Lund UniversityLSCS 2002 Transparent access to finite element applications using grid and web technology J. Lindemann P.A. Wernberg and G. Sandberg.
Summary Role of Software (1 slide) ARCS Software Architecture (4 slides) SNS -- Caltech Interactions (3 slides)
Technical Architectures
ARCS Data Analysis Software An overview of the ARCS software management plan Michael Aivazis California Institute of Technology ARCS Baseline Review March.
Components and Architecture CS 543 – Data Warehousing.
© , Michael Aivazis DANSE Software Issues Michael Aivazis California Institute of Technology DANSE Software Workshop September 3-8, 2003.
The ARCS Data Analysis Software Michael Aivazis California Institute of Technology.
The ARCS Data Analysis Software Michael Aivazis California Institute of Technology.
© , Michael Aivazis DANSE Software Architecture Challenges and opportunities for the next generation of data analysis software Michael Aivazis.
An overview of the DANSE software architecture Michael Aivazis Caltech DANSE Kick-Off Meeting Pasadena Aug 15, 2006.
Chapter 9: Moving to Design
Pyre: a distributed component framework Michael Aivazis Caltech DANSE Developers Workshop January 22-23, 2007.
Deploying Visual Studio Team System 2008 Team Foundation Server at Microsoft Published: June 2008 Using Visual Studio 2008 to Improve Software Development.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
.NET, and Service Gateways Group members: Andre Tran, Priyanka Gangishetty, Irena Mao, Wileen Chiu.
BMC Software confidential. BMC Performance Manager Will Brown.
Architecture Of ASP.NET. What is ASP?  Server-side scripting technology.  Files containing HTML and scripting code.  Access via HTTP requests.  Scripting.
1 DAN FARRAR SQL ANYWHERE ENGINEERING JUNE 7, 2010 SCHEMA-DRIVEN EXPERIMENT MANAGEMENT DECLARATIVE TESTING WITH “DEXTERITY”
 Cloud computing  Workflow  Workflow lifecycle  Workflow design  Workflow tools : xcp, eucalyptus, open nebula.
Framework for Automated Builds Natalia Ratnikova CHEP’03.
COM vs. CORBA Computer Science at Azusa Pacific University September 19, 2015 Azusa Pacific University, Azusa, CA 91702, Tel: (800) Department.
DCS Overview MCS/DCS Technical Interchange Meeting August, 2000.
COMP 410 & Sky.NET May 2 nd, What is COMP 410? Forming an independent company The customer The planning Learning teamwork.
Cluster Reliability Project ISIS Vanderbilt University.
Flexibility and user-friendliness of grid portals: the PROGRESS approach Michal Kosiedowski
DCE (distributed computing environment) DCE (distributed computing environment)
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Collaborating with iPlant.
Service Computation 2010November 21-26, Lisbon.
Configuration Management (CM)
Through the development of advanced middleware, Grid computing has evolved to a mature technology in which scientists and researchers can leverage to gain.
BLU-ICE and the Distributed Control System Constraints for Software Development Strategies Timothy M. McPhillips Stanford Synchrotron Radiation Laboratory.
Web Services based e-Commerce System Sandy Liu Jodrey School of Computer Science Acadia University July, 2002.
DANSE Central Services Michael Aivazis Caltech NSF Review May 31, 2007.
Contents 1.Introduction, architecture 2.Live demonstration 3.Extensibility.
Invitation to Computer Science 5 th Edition Chapter 6 An Introduction to System Software and Virtual Machine s.
The Future of the iPlant Cyberinfrastructure: Coming Attractions.
11 CORE Architecture Mauro Bruno, Monica Scannapieco, Carlo Vaccari, Giulia Vaste Antonino Virgillito, Diego Zardetto (Istat)
Middleware for FIs Apeego House 4B, Tardeo Rd. Mumbai Tel: Fax:
Tool Integration with Data and Computation Grid GWE - “Grid Wizard Enterprise”
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
9 Systems Analysis and Design in a Changing World, Fourth Edition.
Distribution and components. 2 What is the problem? Enterprise computing is Large scale & complex: It supports large scale and complex organisations Spanning.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
LegendCorp What is System Center Virtual Machine Manager (SCVMM)? SCVMM at a glance Features and Benefits Components / Topology /
11 CORE Architecture Mauro Bruno, Monica Scannapieco, Carlo Vaccari, Giulia Vaste Antonino Virgillito, Diego Zardetto (Istat)
NEES Cyberinfrastructure Center at the San Diego Supercomputer Center, UCSD George E. Brown, Jr. Network for Earthquake Engineering Simulation NEES TeraGrid.
A Software Framework for Distributed Services Michael M. McKerns and Michael A.G. Aivazis California Institute of Technology, Pasadena, CA Introduction.
Near Real-Time Verification At The Forecast Systems Laboratory: An Operational Perspective Michael P. Kay (CIRES/FSL/NOAA) Jennifer L. Mahoney (FSL/NOAA)
CPSC 171 Introduction to Computer Science System Software and Virtual Machines.
Creating SmartArt 1.Create a slide and select Insert > SmartArt. 2.Choose a SmartArt design and type your text. (Choose any format to start. You can change.
August 2003 At A Glance The IRC is a platform independent, extensible, and adaptive framework that provides robust, interactive, and distributed control.
Tool Integration with Data and Computation Grid “Grid Wizard 2”
March 2004 At A Glance ITPS is a flexible and complete trending and plotting solution which provides user access to an entire mission full-resolution spacecraft.
Design and implementation Chapter 7 – Lecture 1. Design and implementation Software design and implementation is the stage in the software engineering.
Tackling I/O Issues 1 David Race 16 March 2010.
INFSO-RI JRA2 Test Management Tools Eva Takacs (4D SOFT) ETICS 2 Final Review Brussels - 11 May 2010.
Simulation Production System Science Advisory Committee Meeting UW-Madison March 1 st -2 nd 2007 Juan Carlos Díaz Vélez.
Amazon Web Services. Amazon Web Services (AWS) - robust, scalable and affordable infrastructure for cloud computing. This session is about:
Towards a High Performance Extensible Grid Architecture Klaus Krauter Muthucumaru Maheswaran {krauter,
VisIt Project Overview
Simulation Production System
CMS High Level Trigger Configuration Management
Platform as a Service.
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 2 Database System Concepts and Architecture.
Introduction to Apache
Presentation transcript:

DANSE Central Services Michael Aivazis Caltech NSF Review May 23, 2008

2 Goal: producing high quality software Correctness Coherence Robustness errors in the analysis codes prevent, handle environmental conditions outages misconfigurations Usability making everybody happy… Aesthetics

3 Mission Central Services provides expertise and support for hardware project servers computational cluster data storage and simulation archives software engineering tools and practices web, wiki, bugs and feature requests configuration and release management software architecture and framework components user interface design services algorithms and data structures

4 Hardware Servers web source control distribution Cluster currently: 7 nodes (x4 Opteron cores with 2Gb of memory per core) future expansion limited by budget Data storage 36 Tb of raw capacity 2.5GB per second sustained read, 1.5GB per second sustained write data storage, simulation archiving

5 Software engineering Source control Robust and portable software build procedure Modern software design and implementation practices documentation of requirements, use cases, design, algorithms agile programming: “test before you implement” Quality control automation: regular (nightly?) builds of the software base unit testing regression testing Tracking of bugs and feature requests Release management Documentation doxygen, epydoc, docbook, wiki, UML

6 Pyre: the software framework Projects Caltech ASC+PSAAP Center (DOE) Computational Infrastructure in Geodynamics (NSF): DANSE (NSF) Portability: languages: C, C++, F77, F90 compilers: all native compilers on supported platforms, gcc, Absoft, PGI platforms: all common Unix variants, OSX, Windows Statistics: 1200 classes, 75,000 lines of Python, 30,000 lines of C++ Largest run: nirvana at LANL, 1764 processors for 24 hrs, generated 1.5 Tb

7 Flexibility through the use of scripting Scripting enables us to Organize the large number of simulation parameters Allow the simulation environment to discover new capabilities without the need for recompilation or relinking The python interpreter The interpreter modern object oriented language robust, portable, mature, well supported, well documented easily extensible rapid application development Support for parallel and distributed programming a python interpreter on each compute node MPI is fully integrated: bindings + OO layer evolving support for both ad-hoc and grid-based solutions No measurable impact on either performance or scalability

8 Persistence The goal is to provide applications with seamless access to large scientific data stores that are becoming available for an increasing number of scientific domains DANSE applications will benefit by getting access to relevant public databases, as well as having a scalable store for archived simulation and analysis meta- data Design is complete; implementation has been through a 2-year beta cycle objects persist in RDBMS tables whose rows are object instances and columns are object attributes all SQL 2003 data types are supported PostgreSQL and MySQL backend support In progress: support for more database back-ends tighter coupling to the underlying database to eliminate the need for intermediate data representations automatic generation of an object model based on a developer specified schema

9 DANSE and distributed computing We will continue to build services and support for creating and monitoring large scale data analyses and simulations from the user’s workstation Infrastructure encapsulation of commonly used communication protocols protocol is negotiated at run-time Superstructure creation and deployment of a distributed computation programmatic interface management and monitoring tools Services global unique identifier generator one-step remote authentication visualization and monitoring

10 Distributed computing support We are continuing the migration of existing support for distributed processing into gsl, a package that completely encapsulates the middleware Provide both user space and grid-enabled solution User space: ssh, scp pyre service factories and component management Web services pyGridWare from Keith Jackson’s group Advanced features: coming soon dynamic discovery for optimized deployment reservation system for computational resources

11 Optimization framework deployment model monitor journal optimizer convergence? population generator job manager queue manager job 1 job 2 job n … job manager queue manager job 1 job 2 job n … machine 1 machine N archiver job manager

12 User interfaces Identified four types: Batch scripts Graphical user interfaces Web-based interfaces Services Support for the development of “science portals” is the most frequently requested feature pyre applications can already be re-hosted as CGI scripts in progress: support for near transparent re-hosting: structured document object model for user configurable parameters, component interface invocation and result delivery “natural” mapping between input parameter types and the widgets that collect them complete separation of the UI specification from the underlying application behavior Underway: reusing this technology to enable applications written for conventional GUI toolkits

13 VNF: a web hosted application

14 Conclusions We well underway hardware fully deployed support for software practices in place Adjusted plan to meet the needs of other subprojects moved up release management and user interfaces postponed some implementation details of distributed computing until year three Challenges learning curve maintaining project coherence