Ohio State University Middleware Systems Driven by Sensing Scenarios Gagan Agrawal CSE (Joint Work with Qian Zhu, David Chiu, Ron Li, Keith Bedford ….

Slides:



Advertisements
Similar presentations
Cyberinfrastructure for Coastal Forecasting and Change Analysis
Advertisements

Hydrological information systems Svein Taksdal Head of section, Section for Hydroinformatics Hydrology department Norwegian Water Resources and Energy.
The Big Picture Scientific disciplines have developed a computational branch Models without closed form solutions solved numerically This has lead to.
Data Grid Research Group Dept of Computer Science and Engineering The Ohio State University David Chiu and Gagan Agrawal Cost and Accuracy Sensitive Dynamic.
Content Based Image Clustering and Image Retrieval Using Multiple Instance Learning Using Multiple Instance Learning Xin Chen Advisor: Chengcui Zhang Department.
Business Intelligence Dr. Mahdi Esmaeili 1. Technical Infrastructure Evaluation Hardware Network Middleware Database Management Systems Tools and Standards.
Cloud based Dynamic workflow with QOS for Mass Spectrometry Data Analysis Thesis Defense: Ashish Nagavaram Graduate student Computer Science and Engineering.
Image Query (IQ) Project Update Building queries one question mark at a time March, 2009.
Query Planning for Searching Inter- Dependent Deep-Web Databases Fan Wang 1, Gagan Agrawal 1, Ruoming Jin 2 1 Department of Computer.
Katanosh Morovat.   This concept is a formal approach for identifying the rules that encapsulate the structure, constraint, and control of the operation.
Database Systems: Design, Implementation, and Management Eighth Edition Chapter 10 Database Performance Tuning and Query Optimization.
Universität Stuttgart Universitätsbibliothek Information Retrieval on the Grid? Results and suggestions from Project GRACE Werner Stephan Stuttgart University.
Ohio State University Department of Computer Science and Engineering Automatic Data Virtualization - Supporting XML based abstractions on HDF5 Datasets.
Active Monitoring in GRID environments using Mobile Agent technology Orazio Tomarchio Andrea Calvagna Dipartimento di Ingegneria Informatica e delle Telecomunicazioni.
A Metadata Based Approach For Supporting Subsetting Queries Over Parallel HDF5 Datasets Vignesh Santhanagopalan Graduate Student Department Of CSE.
Performance Issues in Parallelizing Data-Intensive applications on a Multi-core Cluster Vignesh Ravi and Gagan Agrawal
Ohio State University Department of Computer Science and Engineering 1 Cyberinfrastructure for Coastal Forecasting and Change Analysis Gagan Agrawal Hakan.
Introduction to Apache OODT Yang Li Mar 9, What is OODT Object Oriented Data Technology Science data management Archiving Systems that span scientific.
Young Suk Moon Chair: Dr. Hans-Peter Bischof Reader: Dr. Gregor von Laszewski Observer: Dr. Minseok Kwon 1.
A Framework for Elastic Execution of Existing MPI Programs Aarthi Raveendran Tekin Bicer Gagan Agrawal 1.
Xiao Liu CS3 -- Centre for Complex Software Systems and Services Swinburne University of Technology, Australia Key Research Issues in.
1 A Framework for Data-Intensive Computing with Cloud Bursting Tekin Bicer David ChiuGagan Agrawal Department of Compute Science and Engineering The Ohio.
A Framework for Elastic Execution of Existing MPI Programs Aarthi Raveendran Graduate Student Department Of CSE 1.
© DATAMAT S.p.A. – Giuseppe Avellino, Stefano Beco, Barbara Cantalupo, Andrea Cavallini A Semantic Workflow Authoring Tool for Programming Grids.
Euro-Par, A Resource Allocation Approach for Supporting Time-Critical Applications in Grid Environments Qian Zhu and Gagan Agrawal Department of.
Issues in (Financial) High Performance Computing John Darlington Director Imperial College Internet Centre Fast Financial Algorithms and Computing 4th.
Auspice: AUtomatic Service Planning in Cloud/Grid Environments David Chiu Dissertation Defense May 25, 2010 Committee: Prof. Gagan Agrawal, Advisor Prof.
Ohio State University 1 Cyberinfrastructure for Coastal Forecasting and Change Analysis Gagan Agrawal Hakan Ferhatosmanoglu Xutong Niu Ron Li Keith Bedford.
ICDL 2004 Improving Federated Service for Non-cooperating Digital Libraries R. Shi, K. Maly, M. Zubair Department of Computer Science Old Dominion University.
Data-Intensive Computing: From Clouds to GPUs Gagan Agrawal June 1,
Kelvin K. Droegemeier and Yunheng Wang Center for Analysis and Prediction of Storms and School of Meteorology University of Oklahoma 19 th Conference on.
GUIDED BY DR. A. J. AGRAWAL Search Engine By Chetan R. Rathod.
Efficient RDF Storage and Retrieval in Jena2 Written by: Kevin Wilkinson, Craig Sayers, Harumi Kuno, Dave Reynolds Presented by: Umer Fareed 파리드.
Data Grid Research Group Dept. of Computer Science and Engineering The Ohio State University Columbus, Ohio 43210, USA David Chiu & Gagan Agrawal Enabling.
DDM Monitoring David Cameron Pedro Salgado Ricardo Rocha.
GEON2 and OpenEarth Framework (OEF) Bradley Wallet School of Geology and Geophysics, University of Oklahoma
A Fault-Tolerant Environment for Large-Scale Query Processing Mehmet Can Kurt Gagan Agrawal Department of Computer Science and Engineering The Ohio State.
1 Supporting Dynamic Migration in Tightly Coupled Grid Applications Liang Chen Qian Zhu Gagan Agrawal Computer Science & Engineering The Ohio State University.
Data Integration Hanna Zhong Department of Computer Science University of Illinois, Urbana-Champaign 11/12/2009.
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
Ohio State University Department of Computer Science and Engineering An Approach for Automatic Data Virtualization Li Weng, Gagan Agrawal et al.
Elastic Cloud Caches for Accelerating Service-Oriented Computations Gagan Agrawal Ohio State University Columbus, OH David Chiu Washington State University.
David Chiu and Gagan Agrawal Department of Computer Science and Engineering The Ohio State University 1 Supporting Workflows through Data-driven Service.
PDAC-10 Middleware Solutions for Data- Intensive (Scientific) Computing on Clouds Gagan Agrawal Ohio State University (Joint Work with Tekin Bicer, David.
An Overview of Scientific Workflows: Domains & Applications Laboratoire Lorrain de Recherche en Informatique et ses Applications Presented by Khaled Gaaloul.
High-level Interfaces for Scalable Data Mining Ruoming Jin Gagan Agrawal Department of Computer and Information Sciences Ohio State University.
OGCE Workflow and LEAD Overview Suresh Marru, Marlon Pierce September 2009.
Erik Jonsson School of Engineering and Computer Science The University of Texas at Dallas Cyber Security Research on Engineering Solutions Dr. Bhavani.
Data Grid Research Group Dept. of Computer Science and Engineering The Ohio State University Columbus, Ohio 43210, USA David Chiu and Gagan Agrawal Enabling.
Instance Discovery and Schema Matching With Applications to Biological Deep Web Data Integration Tantan Liu, Fan Wang, Gagan Agrawal {liut, wangfa,
Euro-Par, HASTE: An Adaptive Middleware for Supporting Time-Critical Event Handling in Distributed Environments ICAC 2008 Conference June 2 nd,
Vertical Integration Across Biological Scales A New Framework for the Systematic Integration of Models in Systems Biology University College London CoMPLEX.
Developing GRID Applications GRACE Project
GoRelations: an Intuitive Query System for DBPedia Lushan Han and Tim Finin 15 November 2011
September 2003, 7 th EDG Conference, Heidelberg – Roberta Faggian, CERN/IT CERN – European Organization for Nuclear Research The GRACE Project GRid enabled.
1 Supporting a Volume Rendering Application on a Grid-Middleware For Streaming Data Liang Chen Gagan Agrawal Computer Science & Engineering Ohio State.
Servicing Seismic and Oil Reservoir Simulation Data through Grid Data Services Sivaramakrishnan Narayanan, Tahsin Kurc, Umit Catalyurek and Joel Saltz.
A Context Framework for Ambient Intelligence
QianZhu, Liang Chen and Gagan Agrawal
Liang Chen Advisor: Gagan Agrawal Computer Science & Engineering
Supporting Fault-Tolerance in Streaming Grid Applications
Research Challenges of Autonomic Computing
An Adaptive Middleware for Supporting Time-Critical Event Response
GATES: A Grid-Based Middleware for Processing Distributed Data Streams
Resource Allocation for Distributed Streaming Applications
Chaitali Gupta, Madhusudhan Govindaraju
Context-Aware Internet
VIFI : Virtual Information Fabric for Data-Driven Discovery from Distributed Fragmented Repositories PI: Dr. Ashit Talukder Bank of America Endowed Chair.
LCPC02 Wei Du Renato Ferreira Gagan Agrawal
L. Glimcher, R. Jin, G. Agrawal Presented by: Leo Glimcher
Presentation transcript:

Ohio State University Middleware Systems Driven by Sensing Scenarios Gagan Agrawal CSE (Joint Work with Qian Zhu, David Chiu, Ron Li, Keith Bedford …. ) 1

Ohio State University 2 Context: Cyberinfrastructure for Coastal Forecasting and Change Analysis: Gagan Agrawal (PI) Hakan Ferhatosmanoglu Ron Li Keith Bedford

Ohio State University 3 Coastal Forecasting and Change Detection (Lake Erie)

Ohio State University Need for New CS Research Adapting to Time Constraints –Standard Computing Model: Run this program –Our need: Do the best in time X –General middleware solution Querying Low-level data –Existing solutions »Database Systems »Low-level Tools –Our need: High-level Queries on Low-level datasets 4

Ohio State University Other Considerations Work in Context of Grid / Cloud / Cyberinfrastructre –Service-oriented Solutions –Dynamic Resources General Solutions –Not specific to geospatial data or Nowcasting/Forecasting Models 5

Ohio State University HASTE: Autonomic Middleware Adaptive System for Time-critical Events Optimize a Benefit Function Within the Time Constraint –Numerous Performance-Related Parameters Buzz-word Intensive –For Grid and Cloud Environments –Supports Software as a Service (SaaS) –Autonomic »Self-managing »Self-optimizing

Ohio State University 7 Motivation: Great Lakes Forecasting System Regularly Scheduled Nowcasts /Forecasts of the Great Lakes’ physical conditions Joint venture of OSU Civil Engineering Dept. and NOAA/GLERL Meteorological data and consultation provided by the National Weather Service, Cleveland Office Great Lakes Forecasting System Low water due to negative storm surge on eastern end of Lake Erie - Oct. 25, 2001

Ohio State University Specific Scenario A significant event occurs –Accident / Storm Local and State Authorities Need to React Existing Models can Provide Helpful Information –Where to target the search –How will a storm impact the sewage systems Limited time before one needs to act Give me most in 10 / 30 minutes / 6 hours 8

Ohio State University Scenario (Contd). A lot of flexibility in the application –Spatial and Temporal Granularity –How many models to run Find most resources for the computation –Grid/ Cloud / SaaS models are helpful Can’t tell parameter choices for the time constraint Can a Runtime System / Middleware Help ? 9

Ohio State University Specifics of Functionality Application developer specifies a QoS or Benefit function –Capture adaptable parameters Middleware’s goal is to maximize this –Fixed resources and time –Other issues »Resource allocation for this purpose (Grid Computing) »Tradeoff between Budget and Benefit (Cloud Computing) 10

Ohio State University 11 Middleware Design

Ohio State University 12 Autonomic Adaptation Algorithm ICAC 2008 Optimize the Benefit Function Within the Time Constraints by Adapting Service Parameters In the Normal Processing Phase –Multiple processing rounds –For each checkpoint of parameter X in service S Learn the Estimators of the value of X with –execution time –benefit function Update the system model In the Time Critical Event Handling Phase –Adjust X based on the system model –Accelerate the adaptation if violating the time deadline

Ohio State University Control Model VariableDescription x(k)Adjustable service parameters u(k)Increase/Decrease to parameters w(k)Estimated overall response time System Model Definitions ICAC 2008

Ohio State University 14 System Model ICAC 2008 State Equation Performance Measure time constraint benefit adaptation overhead Constraints

Ohio State University 15 Policy Without Learning ICAC 2008 It is simple and straightforward Parameter convergence depends on the learning rate It may incur a large adaptation overhead

Ohio State University 16 Policy with Learning Reinforcement Learning Based Normal Processing Phase – Explore –Q-learning –Discrete and continuous parameters Global Pattern –Correlation between adaptable service parameters if x is continuous otherwise

Ohio State University 17 Experimental Evaluation Goals Demonstrate that parameters converge meet the time constraint Overhead of adaptation is modest Overhead caused by learning is very small.

Ohio State University 18 Image Size ICAC 2008

Ohio State University Overhead of the Adaptation Algorithm 9% 11% 12%

Ohio State University 20 Overhead of the Adaptation Algorithm (Learning Phase) ICAC 2008 Normal Execution (Min) Number of Adapted Parameters (Min) The overhead of the adaptation algorithm for tuning 1,2 and 3 parameters is 2.2%, 3.0% and 4.8%.

Ohio State University HASTE Summary Significant new functionality Combines control models, machine learning, and service-oriented computing Other work on –Resource Allocation –Fault Tolerance –Budget Management (Cloud Computing) 21

Ohio State University 22 Motivation Again: Coastal Forecasting and Change Detection (Lake Erie)

Ohio State University Observations A lot of low-level data –Different modalities, formats –A number of different users / use cases Different Programs (Services) –Computations –Format conversions –Viewing results Choosing right dataset and workflow is hard 23

Ohio State University More Globally Data-intensive sciences Scientific data repositories Web services / Service-oriented software Metadata standards –Within domains / countries 24

Ohio State University Questions Can we provide simple access to low-level information –Not just data, but derived results Very simple interfaces –`Google’ to low-level datasets Other considerations –Time vs. Quality of Service –Cache derived data results 25

Ohio State University 26 Summary of Desiderata US EU AU... High level query... - Keywords - Natural language Don’t just give me the data, but... - Transform it - Manipulate it - Compose it with other processes and data sets And do this with the least amount of work required from me!

Ohio State University 27 System Goals To enable queries over low level data sets, which involves: –identification of relevant data sets –automatic planning for the composition of dependent services (processes) for derivation... while being non-intrusive to existing schemes, i.e., –avoids a standardized format for storing data sets –accommodates heterogeneous metadata

Ohio State University 28 System Overview

Ohio State University 29 In the Semantics Layer Applying Domain Information Domain concepts can be derived from executing a service Domain concepts can also be derived from retrieving an existing data set Service parameters represent different domain concepts

Ohio State University 30 Data Registration Service Indexing Data Sets Handling heterogeneous metadata For instance, just within the geospatial domain, CountryMetadata Standards USCSDGM AU, NZANZLIC EU??? CDN???...

Ohio State University 31 Data Registration Service Handling Heterogenuous Metadata

Ohio State University 32 Supporting High Level Queries Entire system is domain-concept-driven So, we should decompose queries into concepts first

Ohio State University 33 Supporting High Level Queries

Ohio State University 34 Original Query: –“return water level from station=32125 on 10/31/2008” The elements of our query have been parsed against the ontology Supporting High Level Queries

Ohio State University 35 The Planning Layer Service Composition: An Example

Ohio State University 36 The Planning Layer Service Composition: An Example A subset of the ontology (unrolled)

Ohio State University 37 Planning Times

Ohio State University 38 AUSPICE: Summary We came up with acronym only recently –AUtomatic Service Planning and execution In Cloud/Grid Environments Our system... –proposes to unify heterogeneous metadata –extracts certain metadata attributes and indexes low level data sets and services for fast access from distributed repositories –automatically composes these services and data sets to answer user queries

Ohio State University 39 The AUSPICE System AUSPICE: Automatic Service Planning and Execution in Cloud/Grid Environments

Ohio State University Conclusions Interesting CS research can be done driven by (sensing) applications –Apologies to NSF !! Both systems applicable / extendable to other circumstances –Wanna write more proposals ? We had fun !! 40