SWWG PROJECT OVERVIEW Semantic Technologies for Integrating USGS Data.

Presentation transcript:

SWWG PROJECT OVERVIEW Semantic Technologies for Integrating USGS Data

Past, Present, Future of The Web

What is the Semantic Web?
  - “the idea of having data on the web defined and linked in a way that it can be used by machines not just for display purposes, but for automation, integration and reuse of data across various applications” Tim Berners-Lee (2001)
  - “The main idea of the semantic web is to support a distributed web at the level of data rather than at the level of presentation” (quote from the book Semantic Web for the Working Ontologist)
  - The Semantic Web is the “Web of Data”
  - Moving forward from the “Document Web” to the “Data Web”
  Why is it important?
  - Data can answer questions that documents can’t
  - It is more important to put data on the web than to create a beautiful website that silos the data
  - Open linked data can be used by anyone
  It sounds like a good idea, but why should we do it?
  - Supports dynamic applications
  - Designed for change
  - Changes are made at the data modeling level
  - Reference multiple sources of data without combining them; data sources remain separate
  - No need for the mega-database (data warehouse model)
  - Ability to map distributed data
  - Merging information from multiple sources
  - Relational and semantic technologies work together well

“Generic” Web vs. Semantic Web
  Generic Web = Document Web
  - Links documents to documents
  - Focused on presenting documents to humans
  - Standard used: HTML
  - HTML describes the syntax, not the semantics
  - Example: Wikipedia
  Semantic Web = Data Web
  - Links data to data
  - Focused on providing meaningful data to machines
  - Standard used: RDF
  - RDF represents the semantics of the data to machines
  - Example: DBpedia (linked open data source)
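To make the "links data to data" idea concrete, here is a minimal Python sketch that stores statements as subject-predicate-object triples, the shape RDF uses, and follows links between them. Every `example.org` identifier and property name is a hypothetical placeholder, not a term from DBpedia or from any USGS dataset.

```python
# Each statement is a (subject, predicate, object) triple, the shape RDF uses.
# All example.org identifiers below are hypothetical placeholders.
EX = "http://example.org/"

triples = {
    (EX + "site/42", EX + "locatedOn", EX + "river/susquehanna"),
    (EX + "site/42", EX + "observedSpecies", EX + "species/brook-trout"),
    (EX + "river/susquehanna", EX + "name", "Susquehanna River"),
}

def objects(subject, predicate):
    """Return every object linked from `subject` by `predicate`, sorted."""
    return sorted(o for s, p, o in triples if s == subject and p == predicate)

def follow(subject, *predicates):
    """Follow a chain of predicates starting from one subject."""
    current = [subject]
    for predicate in predicates:
        current = [o for s in current for o in objects(s, predicate)]
    return current

# A program can hop from a sampling site to its river to the river's name
# without parsing any human-oriented document.
river_names = follow(EX + "site/42", EX + "locatedOn", EX + "name")
print(river_names)
```

A real deployment would use an RDF library and a triple store rather than a Python set; the set is only there to show that the links themselves are data.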

SWWG Project Goals
  - Learn Semantic Web technologies
  - Integrate sample data sets using a common ontology
  - Develop a semantic data integration prototype

Methodology
  Semantic Web Methodology & Technology Development Process
  Graphic Credit & Copyright: Dr. Peter Fox, Rensselaer Polytechnic Institute (RPI)

The Use Case
  Goal: Combine data from a variety of sources into a single dataset to support aquatic habitat research of freshwater fish species in the Susquehanna River Basin.
  Data Sets
  - Aquatic Bioassessment Data for the Nation (BioData): BioData provides access to aquatic bioassessment data (biological community and physical habitat data) collected by USGS scientists from stream ecosystems across the Nation. Available online at
  - Mineral Resources Online Spatial Data (Geochemistry): Offers national-scale geochemical analysis of stream sediments and soils in the United States collected and analyzed under the National Uranium Resource Evaluation program. Available online at
  - Multistate Aquatic Resources Information System (MARIS): MARIS serves as an online resource containing over one million population estimate, total catch, total weight, and water quality records for nearly 600 fish species sampled by a growing number of state fish and wildlife agencies. Available online at
  - National Hydrography Dataset (NHD): NHD contains detailed geospatial information about the Nation's surface water including features such as lakes, ponds, streams, rivers, canals, dams, and stream gages. Available online at

Heterogeneous Information Models

Map to O & M Ontology
  Open Geospatial Consortium: Observations & Measurements Ontology
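Mapping heterogeneous information models to one ontology amounts to renaming each source's fields to shared terms. The Python sketch below illustrates that step under stated assumptions: the target names (`featureOfInterest`, `result`, `phenomenonTime`) are O&M observation properties, but the source column names and the choice of which column maps to which property are hypothetical, since the slides do not list the BioData or MARIS schemas.

```python
# Per-source field maps onto shared O&M-style property names.
# Source column names are hypothetical; the slides do not give the schemas.
FIELD_MAPS = {
    "biodata": {
        "SiteNumber": "featureOfInterest",
        "Abundance": "result",
        "CollectionDate": "phenomenonTime",
    },
    "maris": {
        "location_id": "featureOfInterest",
        "total_catch": "result",
        "sample_date": "phenomenonTime",
    },
}

def to_observation(source, record):
    """Rename a source record's fields to the shared property names."""
    mapping = FIELD_MAPS[source]
    observation = {"source": source}
    for field, value in record.items():
        if field in mapping:  # unmapped fields are dropped in this sketch
            observation[mapping[field]] = value
    return observation

biodata_row = {"SiteNumber": "S-01", "Abundance": 12,
               "CollectionDate": "2010-06-01", "Crew": "A"}
maris_row = {"location_id": "M-77", "total_catch": 30,
             "sample_date": "2011-07-15"}

observations = [
    to_observation("biodata", biodata_row),
    to_observation("maris", maris_row),
]
for obs in observations:
    print(obs)
```

After mapping, records from both sources carry the same keys, so downstream code can treat them uniformly while the original datasets stay separate.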

Flexible Data Model
  Resource Description Framework (RDF)
  - Convert relational data to RDF for integration using the O & M Ontology
  - Data will then be stored in triple stores with SPARQL endpoints
  “The Resource Description Framework (RDF) provides a flexible data model that is used to build a conceptual representation of the data with formal semantics and allows disparate data to share formal relationships along points of integration such as spatial, temporal, and taxonomic information” Stephan Zednik, RPI
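The relational-to-RDF conversion step can be sketched as follows. This is a minimal illustration, not the project's converter: it turns one table row into N-Triples lines, with the primary key forming the subject URI and each remaining column becoming a predicate. The `example.org` namespace, the table name, and the column names are hypothetical, and a real conversion would use ontology terms rather than column names as predicates.

```python
EX = "http://example.org/"  # hypothetical namespace for illustration only

def literal(value):
    """Format a value as a plain N-Triples string literal with escaping."""
    escaped = (
        str(value)
        .replace("\\", "\\\\")
        .replace('"', '\\"')
        .replace("\n", "\\n")
        .replace("\r", "\\r")
    )
    return f'"{escaped}"'

def row_to_ntriples(table, key, row):
    """Turn one relational row into N-Triples lines.

    The key column identifies the subject; every other non-null column
    becomes one triple whose predicate is derived from the column name.
    """
    subject = f"<{EX}{table}/{row[key]}>"
    lines = []
    for column, value in row.items():
        if column == key or value is None:
            continue  # NULL columns produce no triple
        lines.append(f"{subject} <{EX}{table}#{column}> {literal(value)} .")
    return lines

row = {"id": 7, "species": "Brook trout", "count": 12, "notes": None}
for line in row_to_ntriples("sample", "id", row):
    print(line)
```

Lines in this form can be bulk-loaded into a triple store, which then exposes them through its SPARQL endpoint.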

High Level Architecture
  Apache Jena Framework
  A configurable way to access RDF data using simple RESTful URLs that are translated into queries to a SPARQL endpoint
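The idea of translating a simple RESTful URL into a SPARQL query can be sketched in a few lines of Python. This is not the Apache Jena API or the project's configuration: the URL layout, the `example.org` namespace, and the rule that the last path segment names a class and each query parameter names a property are all assumptions made for illustration.

```python
import re
from urllib.parse import urlparse, parse_qs

EX = "http://example.org/"  # hypothetical vocabulary namespace
NAME = re.compile(r"[A-Za-z_][A-Za-z0-9_]*")

def checked(name):
    """Reject names that could break out of the <...> IRI in the query."""
    if not NAME.fullmatch(name):
        raise ValueError(f"unsupported name: {name!r}")
    return name

def url_to_sparql(url):
    """Translate /api/<type>?<property>=<value> into a SPARQL SELECT query."""
    parsed = urlparse(url)
    resource_type = checked(parsed.path.strip("/").split("/")[-1])
    patterns = [f"?item a <{EX}{resource_type}> ."]
    for name, values in sorted(parse_qs(parsed.query).items()):
        # Escape backslashes and quotes so the value stays inside the literal.
        value = values[0].replace("\\", "\\\\").replace('"', '\\"')
        patterns.append(f'?item <{EX}{checked(name)}> "{value}" .')
    body = "\n  ".join(patterns)
    return f"SELECT ?item WHERE {{\n  {body}\n}}"

query = url_to_sparql("http://example.org/api/sample?species=Brook%20trout")
print(query)
```

The resulting query string would then be sent to the triple store's SPARQL endpoint, so clients only ever see the simple URL.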

Rapid Prototype Development: Using Scrum
  Scrum is an agile framework for completing complex projects (mostly software development).
  Roles
  - Product Owner: responsible for the business value of the product
  - ScrumMaster: ensures that the team is functional & productive
  - Team: self-organizes to get work done
  Meetings
  - Sprint* Planning: the team meets with the product owner to choose the set of work to be delivered
  - Daily Scrum: the team meets each day to share struggles & progress
  - Sprint Review: the team demonstrates to the product owner what it has completed during the sprint
  - Sprint Retrospective: the team looks at ways to improve the product and the process
  Artifacts
  - Product Backlog: prioritized list of desired project outcomes/features
  - Sprint Backlog: set of work from the product backlog that the team agrees to complete in a sprint, broken out into tasks
  - Burndown chart: at-a-glance look at the work remaining (can be 2 charts: one for the sprint, one for the overall project)
  *A Sprint is a development period typically 2-4 weeks in length.
  Scrum Alliance:
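The arithmetic behind a burndown chart is simple enough to sketch. The Python below computes the work remaining after each day of a sprint and the straight "ideal" line it is usually compared against. The point values and sprint length are made-up numbers for illustration, not figures from the SWWG project.

```python
def burndown(total_points, completed_per_day):
    """Work remaining at the start of the sprint and after each day."""
    remaining = [total_points]
    for done in completed_per_day:
        remaining.append(max(remaining[-1] - done, 0))  # never below zero
    return remaining

def ideal_line(total_points, days):
    """Remaining work if the team burned down at a perfectly even rate."""
    return [total_points - total_points * day / days for day in range(days + 1)]

# Hypothetical 4-day slice of a sprint with 20 points committed.
actual = burndown(20, [3, 5, 0, 4])
ideal = ideal_line(20, 4)
for day, (a, i) in enumerate(zip(actual, ideal)):
    print(f"day {day}: remaining={a} ideal={i:.1f}")
```

Plotting `actual` against `ideal` gives the at-a-glance view: whenever the actual line sits above the ideal line, the sprint is behind its even-pace schedule.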

Next Steps
  - Finalize Functional Requirements & Create Product Backlog
  - Sprint Planning
  - Start the Development (over a 2-3 week “sprint”)
  - Prototype Testing

Questions?