EVLA Data Processing PDR Pipeline design Tim Cornwell, NRAO.

Slides:



Advertisements
Similar presentations
National Radio Astronomy Observatory June 13/14, 2005 EVLA Phase II Proposal Review EVLA Phase II Computing Development Bryan Butler (EVLA System Engineer.
Advertisements

OVERVIEW TEAM5 SOFTWARE The TEAM5 software manages personnel and test data for personal ESD grounding devices. Test and personnel data may be viewed/reported.
Automating with Open Source Testing Tools Corey McGarrahan rSmart 01-July-08.
EVLA Data Processing PDR Proposal Handling Honglin Ye, NRAO.
Software for Science Support Systems EVLA Advisory Committee Meeting, March 19-20, 2009 David M. Harland & Bryan Butler.
Bill SahrEVLA M&C Transition System Software CDR December 5-6, EVLA Monitor & Control Transition System Software Overview.
© , Michael Aivazis DANSE Software Issues Michael Aivazis California Institute of Technology DANSE Software Workshop September 3-8, 2003.
Lesson 18: Configuring Application Restriction Policies
Collaboration Suite Business Process Management
Aus-VO: Progress in the Australian Virtual Observatory Tara Murphy Australia Telescope National Facility.
©Ian Sommerville 2000 Software Engineering, 6th edition. Chapter 8 Slide 1 Software Prototyping l Rapid software development to validate requirements l.
Commissioning the NOAO Data Management System Howard H. Lanning, Rob Seaman, Chris Smith (National Optical Astronomy Observatory, Data Products Program)
Hunt for Molecules, Paris, 2005-Sep-20 Software Development for ALMA Robert LUCAS IRAM Grenoble France.
Metadata Creation with the Earth System Modeling Framework Ryan O’Kuinghttons – NESII/CIRES/NOAA Kathy Saint – NESII/CSG July 22, 2014.
Data Management Subsystem: Data Processing, Calibration and Archive Systems for JWST with implications for HST Gretchen Greene & Perry Greenfield.
DCS Overview MCS/DCS Technical Interchange Meeting August, 2000.
GLAST LAT ProjectDOE/NASA Baseline-Preliminary Design Review, January 8, 2002 K.Young 1 LAT Data Processing Facility Automatically process Level 0 data.
EVLA Early Science: Shared Risk Observing EVLA Advisory Committee Meeting, March 19-20, 2009 Claire Chandler Deputy AD for Science, NM Ops.
1 ANASAC Meeting – May 20, 2015 ALMA Pipeline Brian Glendenning (for Jeff Kern)
McMullinEVLA Advisory Committee Meeting December 14-15, 2004 EVLA Data Post-processing: SSG (AIPS++/CASA) Development J. McMullin.
How to Adapt existing Archives to VO: the ISO and XMM-Newton cases Research and Scientific Support Department Science Operations.
Prototype system of the Japanese Virtual Observatory The Japanese Virtual Observatory (JVO) aims at providing easy access to federated astronomical databases.
1 The Software Development Process  Systems analysis  Systems design  Implementation  Testing  Documentation  Evaluation  Maintenance.
JVO JVO Portal Japanese Virtual Observatory (JVO) Prototype 2 Masahiro Tanaka, Yuji Shirasaki, Satoshi Honda, Yoshihiko Mizumoto, Masatoshi Ohishi (NAOJ),
Testing Workflow In the Unified Process and Agile/Scrum processes.
Tim CornwellEVLA System PDR Dec 5-6, EVLA: Data Management EVLA sub-contracts data management to NRAO Data Management group –End-to-end processing.
ALMA Software B.E. Glendenning (NRAO). 2 ALMA “High Frequency VLA” in Chile Presently a European/North American Project –Japan is almost certainly joining.
Accelerating Scientific Exploration Using Workflow Automation Systems Terence Critchlow (LLNL) Ilkay Altintas (SDSC) Scott Klasky(ORNL) Mladen Vouk (NCSU)
Doug Tody E2E Perspective EVLA Advisory Committee Meeting December 14-15, 2004 EVLA Software E2E Perspective.
Overall Data Processing Architecture Review EVLA Monitor and Control Interfaces July , 2002EVLA Data Processing PDR Bill Sahr.
ALMA Pipeline Heuristics Frederic Boone - LERMA Lindsey Davis - NRAO John Lightfoot - UKATC Dirk Muders - MPIfR Christine Wilson - McMaster Friedrich Wyrowski.
Using the NSA Presentation to NOAO Users Committee October 5, 2005.
Designing Experiments T. Venturi Istituto di Radioastronomia, Bologna, INAF ERIS, Manchester, 9 September 2005.
LSST: Preparing for the Data Avalanche through Partitioning, Parallelization, and Provenance Kirk Borne (Perot Systems Corporation / NASA GSFC and George.
VLA Archive Image Pilot Pilot to create images from VLA archive data Loránt Sjouwerman National Radio Astronomy Observatory.
EVLA Software Bryan Butler. 2007May22EVLA SAGE Meeting2 Requirements and Goals of EVLA Software Maximize scientific throughput of the instrument At a.
© 2007 IBM Corporation SOA on your terms and our expertise Software WebSphere Process Server and Portal Integration Overview.
2007Sep06 EAC Butler - Software Overview 1 Software Overview Bryan Butler.
1 KFPA Critical Design Review – Fri., Jan. 30, 2009 KFPA Data Pipeline Bob Garwood- NRAO-CV.
SAGE meeting Socorro, May 22-23, 2007 EVLA Science Operations: the Array Science Center Claire Chandler NRAO/Socorro.
Atacama Large Millimeter/submillimeter Array Expanded Very Large Array Robert C. Byrd Green Bank Telescope Very Long Baseline Array Data Processing Progress.
EVLA Data Processing PDR Scale of processing needs Tim Cornwell, NRAO.
Introduction to EVLA Software Bryan Butler. 2006Dec05/06EVLA M&C Transition Software CDR2 EVLA Computing (Terse) History The original EVLA Phase I proposal.
ADMIT: ALMA Data Mining Toolkit  Developed by University of Maryland, University of Illinois, and NRAO (PI: L. Mundy)  Goal: First-view science data.
Pipeline Basics Jared Crossley NRAO NRAO. What is a data pipeline?  One or more programs that perform a task with reduced user interaction.  May be.
Tim CornwellEVLA Advisory May 10-11, EVLA: Data Management EVLA has sub-contracted data management to NRAO Data Management group End-to-end processing.
EVLA Monitor & Control Software PDR E2E Interfaces: Observation Scheduling Complaints to: Boyd Waters John Benson, Barry Clark, Tim Cornwell, Rich Moeser,
Ray Plante for the DES Collaboration BIRP Meeting August 12, 2004 Tucson Fermilab, U Illinois, U Chicago, LBNL, CTIO/NOAO DES Data Management Ray Plante.
August 2003 At A Glance The IRC is a platform independent, extensible, and adaptive framework that provides robust, interactive, and distributed control.
ESO, 17 April 2007ESAC meeting1 ALMA offline User Test 5 Silvia Leurini, ESO.
EVLA Data Processing PDR E2E Data Archive System John Benson, NRAO July 18, 2002.
1 The Software Development Process ► Systems analysis ► Systems design ► Implementation ► Testing ► Documentation ► Evaluation ► Maintenance.
ADMIT: ALMA Data Mining Toolkit  Developed by University of Maryland, University of Illinois, and NRAO (PI: L. Mundy)  Goal: First-view science data.
Gustaaf van MoorselEVLA Advisory Committee Meeting December 14-15, 2004 EVLA Computing End-to-end (E2e) software.
T EST T OOLS U NIT VI This unit contains the overview of the test tools. Also prerequisites for applying these tools, tools selection and implementation.
Computing Introduction Gareth Hunt EVLA System PDR 2001 December 04.
Fermilab Scientific Computing Division Fermi National Accelerator Laboratory, Batavia, Illinois, USA. Off-the-Shelf Hardware and Software DAQ Performance.
Anita Richards and members of MERLIN, RadioNet and AstroGrid teams. ● 'Old' radio archive access ● Source lists, fixed images or visibility data ● Most.
Data Grids, Digital Libraries and Persistent Archives: An Integrated Approach to Publishing, Sharing and Archiving Data. Written By: R. Moore, A. Rajasekar,
 1- Definition  2- Helpdesk  3- Asset management  4- Analytics  5- Tools.
Bryan Butler EVLA Computing Division Head
NRAO VLA Archive Survey
Lecture 8 Database Implementation
EVLA Archive The EVLA Archive is the E2E Archive
EAC Butler - SSS Software
EVLA Data Processing PDR
Scheduling Toolkit Observation Scheduling Boyd Waters, NRAO
Bryan Butler (for Bill Sahr)
Gustaaf van Moorsel September 9, 2003
EVLA Algorithm Research & Development
Presentation transcript:

EVLA Data Processing PDR Pipeline design Tim Cornwell, NRAO

July , 2002EVLA Data Processing PDRTim Cornwell 2 Background Pipeline processing automates by-rote processing from e.g. AIPS or AIPS++ cookbooks Many different radio astronomy pipelines now being developed –BIMA, ATCA, WSRT, MERLIN, EVN, etc. AIPS++ ideally suited to pipeline development –No accident! –Rich scripting language, tool-based applications ALMA strongly committed to pipeline reduction –NRAO contracted to provide pipeline framework Known processing background important for use of radio data within the National Virtual Observatory

July , 2002EVLA Data Processing PDRTim Cornwell 3 Run Through 1.The user requests pipeline processing of a project AB973 2.An appropriate processing script (AB973.g) is constructed based upon the meta-data in the archive for project AB973, some standard choices, and information about calibrators in the calibrator archive. 3.The resulting processing script is executed in a queue by an executive. 4.During execution, the script requests data from the archive 5.The results are passed through quality assessment to derive numerical measures of the quality of the result 6.Results from the processing (calibrated data, images, calibration tables, processing logs, diagnostic plots, processing script, source catalogs) are archived. 7.The user receives notification of the result of the processing. 8.The user downloads information from the designated web page.

July , 2002EVLA Data Processing PDRTim Cornwell 4 Requirements –Process EVLA observations from original observed data through to reference image or spectrum without human intervention –Adaptable to different calibration and imaging strategies –Provide standard log of processing –Run on non-specialized hardware –Be operable automatically or manually –Each result will have a quality measure attached to it –Required success rate not yet specified but expect ~ 50 – 70% Assume rest of system is designed to support pipeline reduction –e.g. observation scripting, monitor and control, archive

July , 2002EVLA Data Processing PDRTim Cornwell 5 Design Looked at other pipeline systems –ORAC-DR –OPUS –BIMA Pipeline –ACSIS Major issues are: –Determining observation meta-data –Capturing heuristics –Generating processing scripts appropriate to actual observation –Running scripts robustly, allowing for failure, stalling, pre-emption, etc. –Publishing results Since we expect to build on top of AIPS++, none of the above systems were judged worth adopting directly. –Could change with experience!

July , 2002EVLA Data Processing PDRTim Cornwell 6 Pipeline toolkit Prototype is implemented entirely in AIPS++ Use standard AIPS++ tools for filling, editing, calibration, imaging How to capture processing rules? –Use production rules software (make) to capture dependencies and processing rules –Rules for processing (“heuristics”) expressed as glish/AIPS++ script fragments Encapsulated in makefiles Prototype framework exists and works end-to-end –Reads VLA archive tapes (from disk) and produces HTML pages –No human interaction needed –~ 1000 lines of Glish code for entire framework –Working on two examples currently VLA A-configuration, 8 GHz continuum including self-calibration VLA D-configuration, HI synthesis –Expect further significant evolution of framework as experience accumulates

July , 2002EVLA Data Processing PDRTim Cornwell 7 Production rules!

July , 2002EVLA Data Processing PDRTim Cornwell 8 Prototype pipeline software design include “e2epipeline.g” myp := e2epipeline() myp.project(‘AB973’)

July , 2002EVLA Data Processing PDRTim Cornwell 9 e2e/AIPS++ pipeline results

July , 2002EVLA Data Processing PDRTim Cornwell 10 Difficulties No person dedicated to this development – I did about 2 – 3 months work Working from VLA historical archive but EVLA will be designed to support pipelined reduction –Most glaring omission is the intention of the observer –Have tried to work around this limitation Very limited heuristics used in this version –Need much more work with scientific staff in capturing best practices. –Expect to get help from DM Scientific Working Group during next development cycle

July , 2002EVLA Data Processing PDRTim Cornwell 11 In next cycle… Lindsey Davis will work mostly on this development, starting in September Second prototype will have limited goals –Explicitly record observing intent in first generation observing script –Construction of processing scripts determined by observing script –Expand range of test projects –Implement connections to calibrator database, and quality measurement –Improved publication of results, perhaps by VOTables (XML-based format from the NVO) Parallel development of simulator –To enable end-to-end testing (from observing script to simulator to archived results) Evaluate again at end of second development cycle