A Web-enabled Approach for generating data processors

Slides:



Advertisements
Similar presentations
Maines Sustainability Solutions Initiative (SSI) Focuses on research of the coupled dynamics of social- ecological systems (SES) and the translation of.
Advertisements

Earth System Curator Spanning the Gap Between Models and Datasets.
Southwest U.S. Water Resources Thomas Piechota, Ph.D., P.E University of Nevada, Las Vegas Director of Sustainability and Multidisciplinary Research Associate.
Enhance legal retrieval applications with an automatically induced knowledge base Ka Kan Lo.
Losing the Lake: Development and Deployment of an Educational Game Joseph M. Vesco, Katie Gilgen, Anne Paine, Marissa Owens, E. Michael Nussbaum, Gale.
1 Building National Cyberinfrastructure Alan Blatecky Office of Cyberinfrastructure EPSCoR Meeting May 21,
V. Chandrasekar (CSU), Mike Daniels (NCAR), Sara Graves (UAH), Branko Kerkez (Michigan), Frank Vernon (USCD) Integrating Real-time Data into the EarthCube.
The Yellow Group Design Informatics (Regli, Stone, Kusiak, Leifer, Gupta, Chung, Fenves, Law, Kopena)
material assembled from the web pages at
Research Overview Frederick C Harris, Jr. Professor, Computer Science & Engineering University of Nevada, Reno.
Frederick C Harris, Jr. Professor Department of Computer Science and Engineering University of Nevada, Reno.
Dr. Michael P. Bishop Professor and Haynes Chair in Geosciences Department of Geography.
1 Web: Steve Brewer: Web: EGI Science Gateways Initiative.
The ACGT Workflow Editing & Enactment Environment Giorgos Zacharioudakis Institute of Computer Science, Foundation for Research & Technology – Hellas (ICS-FORTH)
Synchronization Transformations for Parallel Computing Pedro Diniz and Martin Rinard Department of Computer Science University of California, Santa Barbara.
2007. Software Engineering Laboratory, School of Computer Science S E Web-Harvest Web-Harvest: Open Source Web Data Extraction tool 이재정 Software Engineering.
Geosciences - Observations (Bob Wilhelmson) The geosciences in NSF’s world consists of atmospheric science, ocean science, and earth science Many of the.
Soil and Water Conservation Modeling: MODELING SUMMIT SUMMARY COMMENTS Dennis Ojima Natural Resource Ecology Laboratory COLORADO STATE UNIVERSITY 31 MARCH.
Clinical Collaboration Platform Overview ST Electronics (Training & Simulation Systems) 8 September 2009 Research Enablers  Consulting  Open Standards.
Google Refine for Data Quality / Integrity. Context BioVeL Data Refinement Workflow Synonym Expansion / Occurrence Retrieval Data Selection Data Quality.
Portable Infrastructure for the Metafor Metadata System Charlotte Pascoe 1, Gerry Devine 2 1 NCAS-BADC, 2 NCAS-CMS University of Reading PIMMS provides.
Breakout # 1 – Data Collecting and Making It Available Data definition “ Any information that [environmental] researchers need to accomplish their tasks”
Outline What are RII awards? What is the application process? What’s the timeline?
GEOSCIENCE NEEDS & CHALLENGES Dogan Seber San Diego Supercomputer Center University of California, San Diego, USA.
Three Critical Matters in Big Data Projects for e- Science Kerk F. Kee, Ph.D. Assistant Professor, Chapman University Orange, California
Earth System Curator and Model Metadata Discovery and Display for CMIP5 Sylvia Murphy and Cecelia Deluca (NOAA/CIRES) Hannah Wilcox (NCAR/CISL) Metafor.
Big Data in the Geosciences, University Corporation for Atmospheric Research (UCAR/NCAR), and the NCAR Wyoming Supercomputing Center (NWSC) Marla Meehl.
1  Trend/gap detection of changing environments Quantity/extremes, characteristics/cause/attribution.
What do we use water for?. The Water Cycle We are learning to: Explain how rain is formed. Judge how important water is to our lives.
A WEB-ENABLED APPROACH FOR GENERATING DATA PROCESSORS University of Nevada Reno Department of Computer Science & Engineering Jigar Patel Sergiu M. Dascalu.
The Semantic Web: Transforming Society Science James Hendler Maryland Information and Network Dynamics Laboratory (MIND) Semantic Web Agents Project (SWAP)
A WEB-ENABLED APPROACH FOR GENERATING DATA PROCESSORS University of Nevada Reno Department of Computer Science & Engineering Jigar Patel Sohei Okamoto.
Open Science (publishing) as-a-Service Paolo Manghi (OpenAIRE infrastructure) Institute of Information Science and Technologies Italian Research Council.
An Open Data Platform in the framework of the EGI-LifeWatch Competence Centre Fernando Aguilar Jesús Marco
A41I-0105 Supporting Decadal and Regional Climate Prediction through NCAR’s EaSM Data Portal Doug Schuster and Steve Worley National Center for Atmospheric.
“ subject centric world ” Collective Intelligence Overview Challenges and Opportunities 121/03/13.
Building a Data Warehouse
Data Management Program Introduction
RDA 9th Plenary Breakout 3, 5 April :00-17:30
Engineering (Richard D. Braatz and Umberto Ravaioli)
Joslynn Lee – Data Science Educator
Themes in Geosciences.
Data Ingestion in ENES and collaboration with RDA
Joseph JaJa, Mike Smorul, and Sangchul Song
Similarities between Grid-enabled Medical and Engineering Applications
improve the efficiency, collaborative potential, and
The 2007 Winter Conference on Business Intelligence
Topic Area 3. Water Management and Planning
Ramesh Baral Team: Marjani Peterson, Andre Guerrero
GFDL Climate Model Status and Plans for Product Generation
Business Rule Based Configuration Management and Software System Implementation Using Decision Tables Olegas Vasilecas, Aidas Smaizys VGTU, Vilnius, Lithuania.
WEB BASED PREDICTIVE DEFUZZIFIER
Rui Wu, Jose Painumkal, Sergiu M. Dascalu, Frederick C. Harris, Jr
Introduction to D4Science
Cyber-Infrastructure for Marine Biodiversity Data
Staying afloat in the sensor data deluge
EOSCpilot All Hands Meeting 8 March 2018 Pisa
LifeWatch Cloud Computing Workshop
Project Information Management Jiwei Ma
An EUDAT-based FAIR Data Approach for Data Interoperability
Workshop on Gap Analysis and Prioritization
Guided Research: Intelligent Contextual Task Support for Mails
What is the percentage of global precipitation falling on the land?
What is the percentage of global precipitation falling on the land?
Metadata Development in the Earth System Curator
Bird of Feather Session
Data(trans)forming Roberto Barcellan European Commission NTTS2019
Chaitali Gupta, Madhusudhan Govindaraju
Brokering as a Core Element of EarthCube’s Cyberinfrastructure
Microsoft Azure Data Catalog
Presentation transcript:

A Web-enabled Approach for generating data processors Jigar Patel Sergiu M. Dascalu Frederick C. Harris, Jr University of Nevada Reno CTS 2013 MAY 2013 University of Nevada Reno Department of Computer Science & Engineering

Outline Introduction Problem Background Proposed Approach Conclusions & Future Work May 2013

Introduction 1 Feb 2012

About the Larger NSF Project NSF EPSCoR funded project Nevada, Idaho, and New Mexico Effects of climate change on their regional environment and ecosystem resources Cyber-infrastructure (CI) Facilitate and support interdisciplinary climate change research, education, policy, decision-making, and outreach Design, develop and make available integrated data repositories and intelligent, user-friendly software solutions May 2013

Problem Background 2 Feb 2012

What is a model? It could have different meaning in different context and research areas Climate change research Software Engineering Definition of Model SE: Description of the software that will be built. E.g ER diagram, class diagram, or activity diagram Science: A model is a mathematical description of a problem/phenomenon. http://goo.gl/5ZCIP http://goo.gl/wjeo8 May 2013

What is model coupling? Any single model cannot explain every system Surface water level Ground water level Precipitation Moisture Temperature Relative humidity Model coupling involves a process to exchange data between models Two way vs. linking May 2013

Significance of model coupling Combines knowledge of multiple domains Eliminates some level of uncertainty from the model in process Water level depends on rain, temperature, moisture, relative humidity of given time and location This can be achieved by coupling an atmospheric model with hydrological model Helps to understand and predict natural phenomenon at a larger scale May 2013

Data related issues in model coupling File formats Apr 2013

Data related issues in model coupling File Formats Orange circle represents a record line in a data set Green container represents file format container May 2013

Data related issues in model coupling Data subsetting and merging Extract only partial data and merge with other data set May 2013

Data related issues in model coupling Data sampling issues Some models run at different scale so data sampling becomes a major challenge Terrain also becomes a big challenge Time scale becomes an important issue as well May 2013

Data related issues in model coupling Data subsetting in complex data sets and file formats May 2013

Proposed Solution 3 Feb 2012

Data Structures Data structures May 2013

Data Structures May 2013

Data Structure Operations May 2013

Data Structure Operation May 2013

Data Processor May 2013

Generic Data Processor May 2013

Data Processor Definition File May 2013

Generic Data Processor Configuration File May 2013

Generic Processor in Action May 2013

Auto Generated Class May 2013

Auto Generated Processor May 2013

Conclusions & Future Work 5 Feb 2012

Conclusions There are many challenges related to data processing Results of the proposed work can also be used to generate data filtering and transformation tools for day to day data processing in other areas of scientific research Collaboration and reusability of generated data processors via web Dynamically generated source code be used as a starting point to further address complex issues May 2013

Future Work Support for additional file formats Ability to create extended workflows Including models and other processes Model coupling with pre-defined set of models Integrate the solution with Nevada Climate Portal Expose the API via RESTful services May 2013

Questions & Comments Feb 2012