Presentation is loading. Please wait.

Presentation is loading. Please wait.

1 EPA Data Architecture for DRM 2.0 Brand Niemann Enterprise Architecture Team, US EPA, and Co-Chair, SICoP August 7, 2006.

Similar presentations


Presentation on theme: "1 EPA Data Architecture for DRM 2.0 Brand Niemann Enterprise Architecture Team, US EPA, and Co-Chair, SICoP August 7, 2006."— Presentation transcript:

1 1 EPA Data Architecture for DRM 2.0 Brand Niemann Enterprise Architecture Team, US EPA, and Co-Chair, SICoP August 7, 2006

2 2 Request 1. Data Models by EPA Business Line using DRM 2.0 Structure: –Example: Made the Draft 2007 EPA Report on Indicators DRM 2.0 Compliant and integrated it with other indicator reports using an overall data model (actually ontology) for indicators being developed in cooperation with the NAS Key National Indicator Initiative and others (CEQ, etc.). 2. Data collection, sources, and validation: –Example: Relied on OIAA's work on Indicators from 2003 to the present for metadata and broadened that to include many others high-quality sources of indicator data that have metadata since EPA originates very little indicator data itself. Note: These new Data Models are available as DRM 2.0 Templates at the top of the online data assets (XML) and/or as formal ontologies (OWL). See slides 3-5.

3 3 DRM 2.0 Templates

4 4 Ontology for Indicators Indicators TopicsOrganizationsJurisdictions The Economy Society Culture The Environment Cross-Cutting Publicly led Privately led Led by public-private partnership U.S. local/regional level U.S. state level National level outside the United States Supranational level Note that each of these classes can and do have multiple instances underneath them, etc. Schematic of the Ontology

5 5 Semantic Wiki Ontologies E.g. Data Model Classes of interest to Kevin Brett, SRA

6 6 Explanation The "data model" for DRM 2.0 has three parts (description, context, and sharing) and its best practice implementation (which I led for OMB/AIC and now oversee in SICoP/SWIM WG under the Best Practices Committee*) is fundamentally different from what EPA has been doing in the past in that: –the three types of data (structured, unstructured, and semi- structured) are integrated together, along with their metadata; –the data assets and taxonomies/ontologies are made visible and operational; and –the data is actually made available as either exchange packages or query points or both. Note: See Slides 8-12. * http://colab.cim3.net/cgi-bin/wiki.pl?SICoP/SemanticWikisandInformationManagementWG

7 7 Explanation We have many examples of this to show from the past year and are been doing this in the Summer Collaborative Expedition Workshops using new Semantic Wiki Technology for: –Geospatial Data (June 20-22nd); –Information Sharing and Knowledge Management (July 18th); and –Multiple Ontologies in Semantic Data Networks (August 15th). I would strongly encourage attendance at the August 15th Workshop to more fully understand this new approach than is going to be possible in just two hours. –See http://colab.cim3.net/cgi- bin/wiki.pl?ExpeditionWorkshop/OpenCollaboration_Networkin gSemanticInteroperability_2006_08_15

8 8 Data Reference Model 2.0 One of SICoPs Projects Was the DRM: –December 2004, DRM 1.0 – Just structured data (Description) and exchange packages (Sharing). –February 2005, SICoP White Paper 1 (“Data Architecture of the Future”) – All three types of data (Description) and ontologies (Context). –October 2005, SICoP DRM 2.0 Implementation Guide – Metamodel and Semantic Metadata (see slides 9 - 10). –December 2005, DRM 2.0 – Description (3), Context (2), and Sharing (2) (see slide 11). So DRM 2.0 + Semantic Metadata = SICoP Knowledge Reference Model (KRM). DRM 2.0 Implementation Evolves to the SICoP Semantic Wikis and Information Management (SWIM) WG

9 9 Data Reference Model 2.0 Metamodel: Precise definitions of constructs and rules needed for abstraction, generalization, and semantic models. Model: Relationships between the data and its metadata. Metadata: Data about the data. Data: Facts or figures from which conclusions can be inferred. Relationships and associations The purpose of this schematic is to show that we need to describe information model relationships and associations in a way that can be accessed and searched. Source: Professor Andreas Tolk, August 16, 2005

10 10 Data Reference Model 2.0 The point of this graph is that Increasing Metadata (from glossaries to ontologies) is highly correlated with Increasing Search Capability (from discovery to reasoning).

11 11 Data Reference Model 2.0 Source: Expanding E-Government, Improved Service Delivery for the American People Using Information Technology, December 2005, pp. 2-3. http://www.whitehouse.gov/omb/budintegration/expanding_egov_2005.pdf DRM 1.0 SICoP Ontologies All Three

12 12 Data Reference Model 2.0 Summary: –Slide 9: We need to describe information model relationships and associations in a way that can be accessed and searched. –Slide 10: Increasing Metadata (from glossaries to ontologies) is highly correlated with Increasing Search Capability (from discovery to reasoning). –Slide 11: Three things about data (Description, Context, and Sharing) are needed for information sharing.

13 13 More Results Excerpts: Roadmap for EPA Data Architecture for DRM 2.0 and Semantic Networks, July 3, 2006: –Bigger Picture for EPA Data Architecture. http://colab.cim3.net/cgi- bin/wiki.pl?EPADataArchitectureforDRM2 Excerpts: Implementing the Semantic Web - Part 1. Semantic Technology Profile for the Data Reference Model: Use Case 2 - U.S. EPA, August 24, 2005: –Bigger Picture for EPA Enterprise Architecture: See Government Computer News, At your service: Forget buying software; build your next application from existing components, April 24, 2006, See end of article.

14 14 Roadmap for EPA Data Architecture for DRM 2.0 and Semantic Networks Brand Niemann Enterprise Architecture Team, US EPA, and Co-Chair, SICoP July 3, 2006

15 15 Overview Reviewed the earlier “data architecture” work with SRA consultants and concluded that we need more than use of the EPA Strategic Plan and the “EPA data standards – System of Registries”. Decided to focus on making EPA Data DRM 2.0 Compliant in a Semantic Network in support of Network Data Model (Ontologies), Indicators, the “Big Decision”, Model-Driven Architecture, and recent requests for assistance.

16 16 Roadmap for EPA Data Architecture Data TypeBaselineTargetActivity 1. Compliance / Performance Exchange Network Semantic WikiJune 21 ECC Forum 2. IndicatorsReport on the Environment OntologyJune 2 KNII Meeting 3. GeospatialBlueprint 2003 & 2006 Update OntologiesJune 20-22 CEW Workshop 4. Emergency Response Region 4 PilotEvent OntologyRecent SOA Conference 5. AssessmentORD/SABBayesian Web Ontology Recent Tutorial 6. HealthSubstance Registry Semantic WikiRecent EPA Science Forum See notes on next slide.

17 17 Roadmap for EPA Data Architecture General – all of these are developing ontologies individually that can be managed and interrelated collectively in the Semantic Wiki. 1. Compliance/Performance – see Facility Registry System Pilot to improve accuracy in 2003 using the Qsent Telephone Directory Web Service. 2. Indicators – see Sustainable Water Resources Roundtable and Chesapeake Bay 2005 Health and Restoration Assessment Reports (DRM 2.0 - Compliant Semantic Wiki Versions) 3. Geospatial – see Collaborative Expedition Workshop, June 20- 22nd Open Collaboration: Networking Geospatial Information Technology. 4. Emergency Response – see Improving Rapid First Response at the SOA for E-Government Conference, May 24th. 5. Assessment – see Semantic (Bayesian) Web Tutorial, April 28th. 6. Health – see Third National Report on Human Exposure to Environmental Chemicals (2005) (in process) (DRM 2.0 - Compliant and Semantic Wiki Version).

18 18 EPA System of Registries AbbreviationNameTypeComments FRS Facility Registry System Linked to OEI Facility ID Records EIMS Environmental Information Management System Linked to ORD Geospatial and Scientific Models XML eXtensible Markup Language Registry Integrated Exchange Network READ Registry of EPA Applications and Databases Integrated About 900! EDR Environmental Data Registry Integrated Structured Data SRS Substance Registry System Integrated Chemicals and Biota TRS Terminology Reference System Integrated Environmental Terms See next slide for contents and types.

19 19 EPA System of Registries http://iaspub.epa.gov/sor/intro$.startup

20 20 Implementing the Semantic Web Part 1. Semantic Technology Profile for the Data Reference Model: Use Case 2 - U.S. EPA Brand Niemann (US EPA), Chair, Semantic Interoperability Community of Practice (SICoP) Best Practices Committee (BPC), CIO Council August 24, 2005 http://web-services.gov/ and http://colab.cim3.net/cgi-bin/wiki.pl?SICoP

21 21 2. EPA DRM Status 2.1 DRM Water Resources Use Case in Composite Application Platform: –Use Case 3: Demonstrating the maturity of FEA for multi-agency and community-of-interest information sharing and issue analysis. –This use case demonstrates how semantic models (ontology) can provide a framework for data interoperability where similar datasets need to be combined to support an aggregated business context for tactical decision making, analysis, and policy making across multiple communities of interest.

22 22 2. EPA DRM Status 2.1 DRM Water Resources Use Case in Composite Application Platform (continued): –The FEA Business Reference Model provides a framework that facilitates a functional (rather than organizational) view of the federal government’s lines of business (LoBs), independent of the agencies, bureaus and offices that perform them. –This pilot focuses on the specific business function Environmental Monitoring and Forecasting (1-108- 023). The end Use Case is to provide an interactive Composite Report on the Environment (EPA ROE 2006) that combines data about water indicators from a number of agencies into a coherent picture reflecting the status of nation’s waters in the social, economic and environmental context.

23 23 2. EPA DRM Status 2.1 DRM Water Resources Use Case in Composite Application Platform (continued): –Recently started, the pilot will illustrate how indicator ontology can be created for water that uses ontological relationships to relate different indicators based on their properties, use and the type and quality of impact. The ontology can then be used to drive an interactive report that supports seamless navigation across the indicators and their cumulative impact on a particular area of concern. See Use Case 1 Demo: –http://web-services.gov/pilots/DigitalHarbor/terrorismdemo.htm See Use Case 2 Demo: –http://web-services.gov/pilots/DigitalHarbor/CampaignFinance.htm

24 24 2. EPA DRM Status Executable Integration of the FEA Reference Models in Composite Applications Fact Sheet at http://web-services.gov/SICoPPilotFactSheet_Final.pdf

25 25 2. EPA DRM Status 2.1 DRM Water Resources Use Case in Composite Application Platform (continued): –Repurposed all the text and graphics from PDF to Word and PNG and added semantic structure linking (see next slide). –Extract the water data sources URL and documents for data mining by the Digital Harbor PiiE Platform. –Categorized the EPA Applied Analyses & Applications Workshop (AAAW) Into a Composite Application Business Ontology: See Section 2.2 and http://www.aaaw2005.com/

26 26 2. EPA DRM Status See http://web-services.gov/lpBin22/lpext.dll/Folder5/Infobase5/1?fn=main-j.htm&f=templates&2.0

27 27 2. EPA DRM Status 2.2 EPA Applied Analyses & Applications Workshop (AAAW) Composite Application Ontology: –The Composite Applications Business Ontology consists of: Service Oriented Architecture (SOA) Portals Enterprise Integration –The titles and contents of the 64 presentations were analyzed and assigned to three basic categories and 14 sub-categories (see next slide).

28 28 2. EPA DRM Status 2.2 EPA Applied Analyses & Applications Workshop (AAAW) Composite Application Ontology (continued): –The Composite Applications Business Ontology consists of: SOA:.NET (1), Scientific Computing (1), GIS (2), Architecture (19), & Web Services (5). Portals: Agency (3), Health (3), Environmental (4), Research & Science (4), and Other (3). Enterprise Integration: Environmental Justice and Siting (4), Emergency Response (2), Report on the Environment (3), and Analytical and Quality Services (7). Other: Future Trends (3). –Preparing for Composite Applications that (1) reduce redundant IT applications, increase interoperability, and facilitate business improvement (2) and produce data integration with EPA content and interoperability with non-EPA content. (1) Source: Pat Plunkett, Chair, Federal IT Performance Measurement CoP, January 13, 2005. (2) Source: John Sullivan, EPA Chief Architect, March 2, 2005.


Download ppt "1 EPA Data Architecture for DRM 2.0 Brand Niemann Enterprise Architecture Team, US EPA, and Co-Chair, SICoP August 7, 2006."

Similar presentations


Ads by Google