Madagascar Software Package Technology for Reproducible Computational Experiments Sergey Fomel Bureau of Economic Geology Jackson School of Geosciences.

Presentation transcript:

Madagascar Software Package Technology for Reproducible Computational Experiments Sergey Fomel Bureau of Economic Geology Jackson School of Geosciences The University of Texas at Austin Scientific Software Day 2007

Outline
- Past
    - Seismic imaging: MIT, industry, Stanford, …
- Present
    - Test-driven development
    - Reproducible computational experiments
- Future
    - Asking for help
    - Automatic testing, parallel processing, graphics, …

Seismic Imaging Computations
- 45 clusters on the Top 500 list
    - Many missing
    - Europe’s most powerful supercomputer (CGG/IBM)
- Data size
    - Gb (2-D; 1980s)
    - Tb (3-D; 1990s)
    - ?? (4-D, 4-C, multi-azimuth, …; 2000s)

History of Seismic Imaging
- Norbert Wiener (© RLE/MIT)
- Enders Robinson
- Jon Claerbout

SEPlib Software Innovations
- Universal data file format (RSF)
    - Data separated from text headers
    - Conceptual regularly gridded N-dimensional hypercubes
    - Example header: n1=1000 n2=500 n3=100 d1=0.001 d2=0.1 o2=1
- Universal graphics format (Vplot)
- Reproducible research
- 30+ years legacy
    - R. Clayton, J. Claerbout, D. Hale, S. Levin, J. Dellinger, S. Cole, D. Nichols, M. Karrenbach, B. Biondi, R. Clapp, …

If you feel an urge to design a complex binary file format, or a complex binary application protocol, it is generally wise to lie down until the feeling passes.
-- Eric S. Raymond, The Art of Unix Programming
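The header parameters shown on this slide (n1=1000 n2=500 n3=100 d1=0.001 d2=0.1 o2=1) can be read as the description of a regular grid: for each axis i, ni is the number of samples, di the sampling interval, and oi the origin. The following is a minimal Python sketch of that idea, not the Madagascar API. The function names parse_header and axes are hypothetical, and the defaults used for missing keys (d=1, o=0) are an assumption made for illustration.

```python
def parse_header(text):
    """Parse whitespace-separated key=value tokens into a dict of strings."""
    params = {}
    for token in text.split():
        if "=" in token:
            key, value = token.split("=", 1)
            params[key] = value
    return params


def axes(params):
    """Return one (n, d, o) tuple per axis, in order n1, n2, ...

    Assumption for this sketch: a missing d defaults to 1.0
    and a missing o defaults to 0.0.
    """
    result = []
    i = 1
    while f"n{i}" in params:
        n = int(params[f"n{i}"])
        d = float(params.get(f"d{i}", 1.0))
        o = float(params.get(f"o{i}", 0.0))
        result.append((n, d, o))
        i += 1
    return result


# Header text taken from the slide.
header = "n1=1000 n2=500 n3=100 d1=0.001 d2=0.1 o2=1"
grid = axes(parse_header(header))

# The hypercube size is the product of the axis lengths.
total = 1
for n, _, _ in grid:
    total *= n

print(len(grid), "axes,", total, "samples")  # prints "3 axes, 50000000 samples"
```

Because the header is plain text, the geometry of the hypercube can be inspected or edited with ordinary text tools while the binary samples stay untouched.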

“Madagascar” Innovations
- Open source (GPL)
- Open community (SourceForge)
- Open science
    - Command-line modules
    - Data processing scripts
    - Documentation

Technology Transfer
- Monday: Get an idea
- Tuesday: Implement it
- Wednesday: Test it
- Thursday: Communicate it
- Friday: Apply it

Three Levels of Technology
- Implement it (Tuesday)
    - Command-line modules
    - C (C++, Fortran-77, Fortran-90, Matlab, Python)
    - 400 modules
- Test it (Wednesday)
    - Data processing flows
    - Python/SCons
    - 4,000 tests
- Communicate it (Thursday)
    - Papers and reports
    - LaTeX + Python/SCons
    - 60 papers

Reproducible Computations
- Test-driven development
    - Document computational experiments
    - Old experiments become regression tests

Within the world of science, computation is now rightly seen as a third vertex of a triangle complementing experiment and theory. However, as it is now often practiced, one can make a good case that computing is the last refuge of the scientific scoundrel.
-- R. LeVeque
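The idea that old experiments become regression tests can be sketched in a few lines of Python. This is an illustration under stated assumptions, not how Madagascar implements its tests: the experiment function is a hypothetical stand-in for a real computation, and check_against_reference, the JSON reference file, and the tolerance are all choices made for this example.

```python
import json
import math
import os
import tempfile


def experiment(n=8):
    """Hypothetical stand-in for a computational experiment: partial sums of 1/k^2."""
    total, out = 0.0, []
    for k in range(1, n + 1):
        total += 1.0 / (k * k)
        out.append(total)
    return out


def check_against_reference(result, ref_path, tol=1e-9):
    """Record the result as the reference on first run; later runs compare to it."""
    if not os.path.exists(ref_path):
        with open(ref_path, "w") as f:
            json.dump(result, f)
        return True
    with open(ref_path) as f:
        reference = json.load(f)
    return len(reference) == len(result) and all(
        math.isclose(a, b, rel_tol=tol, abs_tol=tol)
        for a, b in zip(reference, result)
    )


workdir = tempfile.mkdtemp()
ref = os.path.join(workdir, "experiment.json")

first_run = check_against_reference(experiment(), ref)   # records the reference
second_run = check_against_reference(experiment(), ref)  # reproduces it

# A changed computation no longer matches the recorded result.
broken = [x + 1e-3 for x in experiment()]
detects_change = not check_against_reference(broken, ref)

print(first_run, second_run, detects_change)
```

The design choice here is that the documented experiment and the test are the same artifact: rerunning the experiment either reproduces the recorded result or signals that something in the code or environment has changed.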

Asking for Help
- Open problems
    - Automatic reproducibility testing
    - Parallelization
    - Interfaces
    - Graphics
    - …
- Open opportunities
    - Summer students
    - Consultants

Information
- Wiki, blog, lists, forums
- RSF/Madagascar School in Austin
    - April 20, 2007
    - Bureau of Economic Geology