QuakeSim Project: Portals and Web Services for Geo-Sciences Marlon Pierce Indiana University

Slides:



Advertisements
Similar presentations
LEAD Portal: a TeraGrid Gateway and Application Service Architecture Marcus Christie and Suresh Marru Indiana University LEAD Project (
Advertisements

Developing Grid User Interface Components Portlets, gadgets, etc.
General introduction to Web services and an implementation example
Wrapping Scientific Applications as Web Services Gopi Kandaswamy (RENCI) Marlon Pierce (IU)
Building Problem Solving Environments with Application Web Service Toolkits Choonhan Youn and Marlon Pierce Computer Science, Syracuse University And Community.
Reusable Components for Grid Computing Portals Marlon Pierce Community Grids Lab Indiana University.
E-DECIDER: QuakeSim Tools and Products Marlon Pierce, Co-Investigator Margaret Glasscoe, PI
IU QuakeSim Updates Co-PIs: Marlon Pierce, Geoffrey Fox Students and Staff: Xiaoming Gao, Jun Ji, Jun Wang, Marie Ma, Josh Rosen.
QuakeSim Science Gateway: ACES Update Marlon Pierce, Jun Wang, Yu Ma, Jun Ji, Xiaoming Gao, Geoffrey Fox Indiana University.
E-DECIDER Workshop: QuakeSim Tools and Products Marlon Pierce Indiana University.
Integrating Geographical Information Systems and Grid Applications Marlon Pierce Contributions: Ahmet Sayar, Galip Aydin, Mehmet Aktas, Harshawardhan Gadgil.
Service Oriented Architecture for Geographic Information Systems Supporting Real Time Data Grids Galip Aydin Department Of Computer Science Indiana University.
Building Science Gateways Marlon Pierce Community Grids Laboratory Indiana University.
Indiana University QuakeSim Activities Marlon Pierce, Geoffrey Fox, Xiaoming Gao, Jun Ji, Chao Sun.
1 Earthquake Polar and Sensor Grids Community Grids Laboratory November Geoffrey Fox Community Grids Laboratory, School of informatics Indiana.
Open Grid Computing Environments Marlon Pierce, Suresh Marru, Gregor von Laszewski, Mary Thomas, Maytal Dahan, Gopi Kandaswamy, and Wenjun Wu.
1 Multicore and Cloud Futures CCGSC September Geoffrey Fox Community Grids Laboratory, School of informatics Indiana University
JIM BASNEY 1, STUART MARTIN 2, JP NAVARRO 2, MARLON PIERCE 3, TOM SCAVO 1, LEIF STRAND 4, TOM URAM 2,5, NANCY WILKINS-DIEHR 6, WENJUN WU 2, CHOONHAN YOUN.
Toward an OpenSocial Life Science Gateway Wenjun Wu, Michael E. Papka, Rick Stevens.
Apache Airavata GSOC Knowledge and Expertise Computational Resources Scientific Instruments Algorithms and Models Archived Data and Metadata Advanced.
Integrating Geographical Information Systems and Grid Applications Marlon Pierce Contributions: Yili Gong,
- 1 - Grid Programming Environment (GPE) Ralf Ratering Intel Parallel and Distributed Solutions Division (PDSD)
FALL 2005CSI 4118 – UNIVERSITY OF OTTAWA1 Part 4 Web technologies: HTTP, CGI, PHP,Java applets)
CSCI 6962: Server-side Design and Programming Course Introduction and Overview.
Updating and Improving the INTAMAP web service Madhu Rani 2012 Intern 1.
Application Web Service Toolkit Geoffrey Fox, Marlon Pierce, Ozgur Balsoy Indiana University July
Future Grid Future Grid User Portal Marlon Pierce Indiana University.
QuakeSim: Grid Computing, Web Services, and Portals for Earthquake Science Marlon Pierce Community Grids Lab Indiana University.
Software for Science Gateways: Open Grid Computing Environments Marlon Pierce, Suresh Marru Pervasive Technology Institute Indiana University
High Performance Web Service Architecture for Sensors and Geographic Information Systems Galip Aydin.
IU QuakeSim/E-DECIDER Effort. QuakeSim Accomplishments (1) Deployed, improved many QuakeSim gadgets for standalone integration into QuakeSim.org – Disloc,
J2EE Structure & Definitions Catie Welsh CSE 432
Grids and Portals for VLAB Marlon Pierce Community Grids Lab Indiana University.
GML Data Models and Web Services for GPS and Earthquake Catalogs Marlon Pierce, Galip Aydin Community Grids Lab, Indiana University
23:48:11Service Oriented Cyberinfrastructure Lab, Grid Portals Fugang Wang April 29
COMP3019 Coursework: Introduction to GridSAM Steve Crouch School of Electronics and Computer Science.
QuakeSim Work: Web Services, Portlets, Real Time Data Services Marlon Pierce Contributions: Ahmet Sayar,
GRAM5 - A sustainable, scalable, reliable GRAM service Stuart Martin - UC/ANL.
GEM Portal and SERVOGrid for Earthquake Science PTLIU Laboratory for Community Grids Geoffrey Fox, Marlon Pierce Computer Science, Informatics, Physics.
Application portlets within the PROGRESS HPC Portal Michał Kosiedowski
Using Topic-Based Publish/Subscribe for Managing Real Time GPS Streams Marlon Pierce, Galip Aydin, Zhigang Qi Community Grids Lab Indiana University 1.
WebSphere Portal Technical Conference U.S Creating Rich Internet (AJAX) Applications with WebSphere Portlet Factory.
Wrapping Scientific Applications As Web Services Using The Opal Toolkit Wrapping Scientific Applications As Web Services Using The Opal Toolkit Sriram.
1 Grid Portal for VN-Grid Cu Nguyen Phuong Ha. 2 Outline Some words about portals in principle Overview of OGCE GridPortlets.
GPS Sensor Web Time Series Analysis Using SensorGrid Technology Robert Granat 1, Galip Aydin 2, Zhigang Qi 2, Marlon Pierce 2 1 Science Data Understanding.
Holding slide prior to starting show. A Portlet Interface for Computational Electromagnetics on the Grid Maria Lin and David Walker Cardiff University.
SensorGrid Galip Aydin June SensorGrid A flexible computing environment for coupling real-time data sources to High Performance Geographic Information.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
QuakeSim Project: Portals and Web Services for Geo-Sciences Marlon Pierce Indiana University
QuakeSim Project: Portals and Web Services for Geophysics Marlon Pierce Indiana University
ISERVOGrid Architecture Working Group Brisbane Australia June Geoffrey Fox Community Grids Lab Indiana University
QuakeSim Project: Portals and Web Services for Geo-Sciences Marlon Pierce Indiana University
Integrating Geographical Information Systems and Grid Applications Marlon Pierce Contributions: Ahmet Sayar,
Update on GPS/SCIGN REASoN CAN GPS Data Products for Solid Earth Science (GDPSES) Sponsored by NASA F Webb, Y Bock, D Dong, B Newport, P Jamason, M Scharber,
Some comments on Portals and Grid Computing Environments PTLIU Laboratory for Community Grids Geoffrey Fox, Marlon Pierce Computer Science, Informatics,
Building Grid Portals with OGCE: Big Red Portal and GTLAB Mehmet A. Nacar, Jong Youl Choi, Marlon Pierce, Geoffrey Fox Community Grids Lab Indiana University.
Development of e-Science Application Portal on GAP WeiLong Ueng Academia Sinica Grid Computing
Building Science Gateways Marlon Pierce Community Grids Laboratory Indiana University.
Portals, Services, Interfaces Marlon Pierce Indiana University March 15, 2002.
Partnerships in Innovation: Serving a Networked Nation Grid Technologies: Foundations for Preservation Environments Portals for managing user interactions.
The Gateway Computational Web Portal Marlon Pierce Indiana University March 15, 2002.
PROGRESS: GEW'2003 Using Resources of Multiple Grids with the Grid Service Provider Michał Kosiedowski.
Application Web Service Toolkit Allow users to quickly add new applications GGF5 Edinburgh Geoffrey Fox, Marlon Pierce, Ozgur Balsoy Indiana University.
Holding slide prior to starting show. Lessons Learned from the GECEM Portal David Walker Cardiff University
Interacting Data Services for Distributed Earthquake Modeling Marlon Pierce, Choonhan Youn, and Geoffrey Fox Community Grids Lab Indiana University.
1 Implementing Geographic Information System Grid Services Using Distributed Messaging Systems Marlon Pierce Community Grids Lab Indiana University December.
Integrating Geographical Information Systems and Grid Applications
Integrating Geographical Information Systems and Grid Applications
GeoFEST tutorial What is GeoFEST?
QuakeSim Quarterly Update
Presentation transcript:

QuakeSim Project: Portals and Web Services for Geo-Sciences Marlon Pierce Indiana University

QuakeSim Project Summary Goal is to provide a distributed environment for connecting scientific computing and data resources with Web based user interfaces. QuakeSim’s IT development includes Portals for user interfaces. Web Services for running remote applications and accessing databases Databases for semantic fault models and INSAR(USC) This talk reviews major revisions that we have undertaken since 2006.

Some QuakeSim Applications and Their Data Disloc, Simplex Fault models are used to calculate surface displacements (Disloc) using Okada method. Simplex is the inverse. GeoFEST (JPL/CalTech) Finite element code for detailed modeling of fault stresses, seismic displacements, uses fault models as input. Coupled to mesh generation tools Regularized Dynamic Annealing Hidden Markov Method (RDAHMM) (JPL) Time series analysis code, can be applied to GPS and seismic archives. Identifies signal components (possibly associated with underlying physical causes) with no fixed parameters.

Portlets + Client Stubs DB Service JDBC DB Job Sub/Mon And File Services Operating and Queuing Systems WSDL Browser Interface WSDL Visualization Or Map Service DB WSDL Host 1 (Quaketables) Host 2 (Grid)Host 3 (G Maps) SOAP/HTTP HTTP(S)

Some Design Choices Build portals out of portlets (Java Standard) Reuse capabilities from our Open Grid Computing Environments (OGCE) project, the REASoN GPS Explorer project, and many TeraGrid Science Gateways. Decorate with Google Maps, Yahoo UI gadgets, etc. Use Java Server Faces to build individual component portlets. Build standalone tools, then convert to portlets at the very end. Use simple Web Services for accessing codes and data. Keep It Stateless … Use Condor-G and Globus job and file management services for interacting with high performance computers. TeraGrid Favor Google Maps and Google Earth for their simplicity, interactivity and open APIs. Generate KML and GeoRSS Use Apache Maven based build and compile system, SVN on SourceForge

Daily RDAHMM Updates Daily analysis and event classification of GPS data from REASoN’s GRWS.

Disloc output converted to KML and plotted.

Simplex refines fault models from GPS displacements

TeraGrid Supercomputing Resources (GPIR)

UCSB’s Queue Prediction Service (QBETS) Forecasts time you will wait in the queue on various TG super computers. Inherited from OGCE project.

GeoFEST Finite Element Modeling portlet and plotting tools

Enterprise ApproachWeb 2.0 Approach JSR 168 PortletsGadgets, Widgets Server-side integration and processing AJAX, client-side integration and processing, JavaScript SOAPRSS, Atom, JSON WSDLREST (GET, PUT, DELETE, POST) Portlet ContainersOpen Social Containers (Orkut, LinkedIn, Shindig); Facebook; StartPages User Centric GatewaysSocial Networking Portals Workflow managers (Taverna, Kepler, etc) Mash-ups Grid computing: Globus, condor, etcCloud computing: Amazon WS Suite, Xen Virtualization

What Is a Gadget? Simple gadgets for getting a Grid proxy credential and running remote commands. Both run on my own Web server.

Google Reader and GeoRSS

Google Maps and GeoRSS

Google Earth and KML

More Information QuakeSim Web Site: Portal URL: Portal SourceForge Page: Code SVN:

Acknowledgments QuakeSim work is funded by NASA AIST (A. Donnellan, PI) and ACCESS (Y. Bock, PI) programs. Indiana University developers: Galip Aydin, Xiaoming Gao, Zhigang Qi Robert Granat (JPL), Jay Parker (JPL), Maggi Glasscoe (JPL), John Rundle (UC-Davis), Harout Nazerian (JPL), Rami Al-Ghanmi (USC), Dennis Mcleod (USC), Paul Jamason (Scripps), Ruey-Juin Chang (Scripps), Gerry Simila (CSUN)

Stop Talking Now, Champ

QuakeSim, Version 1Reason to ReviseQuakeSim, Version 2 Application Web Service for wrapping a.out executables. Execution management service built with Apache Ant. Services too coupled to portal; no simple WSDL programming interface; could not be used in workflow engines; not self contained Give each code a proper service interface. Retain Apache Ant core but extend. Keep WSDL message structure simple (Strings, ints, doubles, URLs), wrapped as Java Beans File Management ServiceUnnecessary, too coupled to Apache Axis 1.0 HTTP GET, URLs Context Management Service manages persistent portal sessions using recursive XML structure. Too slow (file system); didn’t scale; XML databases didn’t mature; Object-Relational Mappings (ORM) not efficient Using DB40; all services communicate with easily XML serializable JavaBeans. OGC-compatible map and data services Too complicated; ORM is a big overhead. Google Maps, KML generating services Serial job submissionNSF TeraGrid and Open Science Grid run full time production Grids for HPC. Condor-G/Birdbath based job management extensions to GeoFEST service.

Grid Job Submission Globus provides a universal queuing system interface. PBS, LoadLeveler, Sun Grid Engine, LSF We chose Condor-G as our job management software for submitting jobs to HPC queuing systems. University of Wisconsin Works with Globus, Matlab DCE, Unicore, etc. We co-locate Condor-G with our GeoFEST Web Service. Communication is through Birdbath, Condor’s Web Service interface. So GeoFEST service API is more or less the same, just now Grid enabled. We also plan to release a general version of this service. Condor command line and Birdbath have different names for job description parameters. Big Easter Egg hunt to find this, but now we know.

Portlet Summary RDAHMMSet up and run RDAHMM, query Scripps GRWS GPS Service, maintain persistent user sessions. ST_FilterSimilar to RDAHMM portlet; ST_Filter has much more input. Station MonitorShows GPS stations on a Google Map, displays last 10 minutes of data. Real Time RDAHMMDisplays RDAHMM results of last 10 minutes of GPS data in a Google map. Daily RDAHMMCalculates, updates RDAHMM event classifications with daily updated GPS data from SOPAC’s GRWS service (14 day delay, but uses all the data). GeoFESTCreate input geometries, generate FE meshes, run parallel FEM solvers. Disloc, SimplexCalculate service displacements from fault models.

Security Concerns They’ll see the Big Board!

QuakeSim  Distributed Environment for Modeling Observations

Managing Real Time GPS Data Slides from Galip Aydin

California Real Time Network Network Data Rates Message Format TimeRYOASCIIGML CRTN GPS Site Positions (9 Stations) 1 second1.5KB4.03KB48.7KB 1 hour5.31MB14.18MB171.31MB 1 day127.44MB340.38MB4.01GB 1 month3.8GB9.97GB123.3GB 1 year45.8GB119.67GB1.41TB Entire SCIGN Network (250 stations) 1year1.23TB16.18TB160TB Continuous GPS Stations (CGPS) are depicted as triangles while the Real-Time stations are represented as circles. Image is obtained from SOPAC GPS Explorer at How does one manage all the data generated by the 85 stations? How can you get just the data you want? Note this is fundamentally different from traditional request/response style Web Services.

31 Processing Real-Time GPS Streams ryo2nb Raw Data RYO Ports NB Server ryo2asc ii ascii2gm l ascii2pos Single Station Displaceme nt Filter Station Health Filter RDAHMM Filter Scripps RTD Server Scripps RTD Server ryo2nb Raw Data ryo2asc ii ascii2pos Single Station RDAHMM Filter A Complete Sensor Message Processing Path, including a data analysis application. /SOPAC/GPS/CRTN01/RY O /SOPAC/GPS/CRTN01/AS CII /SOPAC/GPS/CRTN01/PO S /SOPAC/GPS/CRTN01/DSM E GPS Networks

32 Application Integration with Real-Time Filters Station Monitor Filter records real-time positions for 10 minutes and calculates position changes Graph Plotter Application creates visual representation of the positions. RDAHMM Filter records real-time positions for 10 minutes and invokes RDAHMM application which determines state changes in the XYZ signal. Graph Plotter Application creates visual representation of the RDAHMM output.

NB Server RYO To ASCII Converter Simple Filter RYO Publisher 1 RYO Publisher 2 RYO Publisher n 33 2 – Multiple Publishers Test We add more GPS networks by running more publishers. The results show that 1000 publishers can be supported with no performance loss. This is an operating system limit. Topi c 1A Topi c 1B Topi c 2 Topi c n

34 4 – Multiple Brokers Test NaradaBrokering allows creation of Broker networks. We create a two-broker network. Messages published to first broker can be received from the second broker. We take timings on each broker. We connect 750 clients to each broker and run for 24 hours. We chose 750 clients to stay well below the saturation limit. The results show that the performance is very good and similar to single broker test. NB Server 1 NB Server 2 RYO To ASCII Converter Simpl e Filter 1 RYO Publisher Topi c 1A Topi c 1B Simpl e Filter 2 Simple Filter 750 Simple Filter 751 Simple Filter 752 Simple Filter 1500 Topi c 1B NB Serve r 2