Presentation on theme: "A New Enterprise Data Management Strategy for the U. S"— Presentation transcript:
1 A New Enterprise Data Management Strategy for the U. S A New Enterprise Data Management Strategy for the U.S. Government: Support for the Semantic Web and Semantic TechnologyBrand Niemann,Senior Enterprise Architect, U.S. EPA and Web 2.0 (Wiki)/Web 3.0 (Semantic Web) Community LeaderApril 14, 2008
2 AbstractAt the recent W3C / WSRI "Toward More Transparent Government Workshop on eGovernment and the Web," it was stated that U.S. Government data is not readily accessible to search engines and reuse projects and that major government data projects need an enterprise data architecture for funding. The Federal Semantic Interoperability Community of Practice (SICoP), established by a group of individuals for the purpose of achieving "semantic interoperability" and "semantic data integration" focused on the government sector, is working on both of these problems in the Federal Sitemaps Initiative and by participation in the follow-on W3C "Workshop on RDF Access to Relational Databases."
3 AbstractSICoP is recommending A New Enterprise Data Management Strategy for the U.S. Government based on:The premise of reusing the data and information rather than changing the data systems themselves.Putting the business and technical rules, logic, etc. into the data itself using markup languages.The concepts and standards of the Semantic Web (also called the Data Web or Web 3.0).
4 Abstract The most important tenets of the reuse are: Bringing the data and the metadata back together.Bringing the structured and unstructured data and information back together.Bringing the data and information description and context back together.
5 AbstractSICoP is looking for partners to work with Federal Government data and metadata sources. SICoP would welcome the participation of the Semantic Universe in working with government data sources in the following basic ways:Individual participants working with multiple data sources (e.g., GovTrack.us).Multiple participants working with multiple data sources (e.g., Open Data Community).Individual and multiple participants piloting Semantic Wikis and Semantic Desktops.
6 Overview 1. History: 2007 SemTech 2. Context 3. Enterprise Mashups 5. Vision and Implementation6. Pilot Data Sources7. Questions & AnswersBio
7 1. History: 2007 SemTech Community of Practice Session: U.S. Federal Government (Open to All).Semanticommunity.net: Community Infrastructure (slides 10 & 23).Conference Session (Panel of 3):Spectrum of Reasoning and Applications.A New Enterprise Data Management Strategy for the U.S. Government: Support for the Semantic Web and Semantic Technology (slide 29).Conference Session (Team of 2):Advanced Intelligence Community R&D Meets the Semantic Web!Semantics for Net-Centric Operations (slide 24).Keynote (Panel of 7):Accelerating the Deployment of Semantic Technology.Getting to Web Semantics for Spreadsheets in the U.S. Government (slide 27).
8 2. ContextJanuary 7, 2008, Blog: Speaking the same language, Government Computer News. (Launch of Our Space).February 5, 2008, SICoP Special Conference 4: Building Semantic Interoperability Applications for Information Sharing and Integration, Computer Science Corporation Executive Briefing Center, Falls Church, VA.March 19, 2008, Getting to SOA and Semantic Interoperability for DoD Architectures.March 19, 2008, DoDAF 3.0: A Web 2.0 and SOA Mashup! (March ??, 2009) (a bit of humor)March 29, 2008, Data Architecture Subcommittee: A Knowledgebase with Implications for EPA Data Architecture.April 30-May 1, 2008, 5th SOA for E-Government Conference:Theme: Our Space - Connecting Web 2.0, Enterprise Mashups, SOA, and the Semantic Web for Government 2.0 (Making It Easier to Do!).Like the 2008 SemTech: Building the Next Generation of SOA with Semantics!
9 2. Context Organization Name Conferences Zepheria Eric Miller SICoP C5 Wright State UniversityAmit ShethSICoP SC4DERIStefan DeckerSOA CoP C4BAH (SAIC)Sam Chance and othersStarbourne CommunicationsRex BrooksSOA CoP C4 & C5Semantic ArtsDave McCombSOA CoP C2
11 2. ContextSource: Mills Davis, Four Stages of the Web at
12 2. Context Some basic functionalities of Web 2.0 Wikis: Author like WordEdit/comment on every pageSome level of security for every pageTaggingVersioningWatchlistRSS/XML between applicationsSearchetc.Doing data models (ontologies) in Web 3.0 Semantic Wikis (see next slide).
13 Michael Lang, Revelytix Federal CIO Council’s Service-Oriented Architectures Community of Practice (SOA CoP)Special RecognitionMichael Lang, RevelytixSOA in Semantic Wikis: A Story About CommunicationFor the Best Presentation at the3rd SOA for E-Government Conference, May 1-2, 2007,at The MITRE Corporation, McLean, VA.By SOA CoP Co-Chairs, Greg Lomow, Bearing Point & Brand Niemann, US EPABest Practices andArchitecture & Infrastructure Committeesof the Federal Chief Information Officers CouncilSOA CoPProduced in Collaboration With
14 3. Enterprise MashupsEnterprise Mashups are application hybrids combining content and functions from more than one existing source to create powerful Web applications, integrated Web experiences and expanded customer value networks.Mashups are created by combining internal sources such as enterprise data with external Web resources such as Google Maps.Mashups are created when different application program interfaces (API) are combined or 'mashed' to create an entirely new application.
16 3. Enterprise MashupsMashups are the fastest growing enterprise ecosystem on the Web by far. Mashups are social, role-based, network-centric, complex, distributed and essential to all knowledge-based networks, models and businesses.They are a critical method to enhance customer experience, to drive productivity growth (innovation) and achieve fundamental advancements in knowledge economy competitiveness. Customers, employees, suppliers and other stakeholders gain enormous benefits using enterprise mashups to create new applications and expanding customer value networks.Open Enterprise 2.0 Mashup Summit: Expanding Customer Value Networks
17 3. Enterprise MashupsRecall that George Thomas referred to this.
19 3. Enterprise Mashups PGFSOA Topic (1) AIC (2) DoD (3) NCIOC (4) OMG (5)SOA CoPSICoP (6)AudienceChief Architects, CIOs, etc.Agency & Coalition PartnersOpen Membership & ClientsOpen CommunitiesRationalePerformance ImprovementInformation SharingInteroperabilityStandardsCollaboration to get the Medici Effect (7)SOA & Semantic InteroperabilityTarget ArchitectureSOA, SOA, & SOI (8)DoDAF 2.0Adaptable Computing Infrastructure CoPSOA Consortium
20 3. Enterprise Mashups PGFSOA Topic (1) AIC (2) DoD (3) NCIOC (4) OMG (5)SOA CoPSICoP (6)ImplementationExamplesCoI and EA ActivitiesContinuously increasing levels of interoperabilityRFIs & RFPs (9)PilotsMITRE SOA Lab (10)Center of Excellence?RoadmapMaturity Model & AssessmentSOASCOPE, etc.Government Information DaysConferences
21 3. Enterprise Mashups Footnotes: (1) Practical Guide to Federal Service Oriented Architecture (2) Architecture & Infrastructure Committee of the Federal CIO Council (3) Department of Defense (4) Network Centric Operations Industry Consortium (5) Object Management Group (6) Semantic Interoperability Community of Practice(7) December 10, 2007, Social networking and the Medici Effect, Government Computer News 25th Anniversary Issue.(8) Service Oriented Enterprise (SOE), Service Oriented Architecture (SOA), & Service Oriented Infrastructure (SOI)(9) Requests for Information & Requests for Proposals(10) See Bob Sadler, May 1st, 1-1:50 PM, 5th SOA for E-Government Conference.
22 SemTechTuesday, May 20, 2008, Communities of Practice, 7:15AM - 08:15AM, US Federal Government CoP.Tuesday, May 20, 2008, Conference Sessions, 3:15-4:15 PM, Semantics for Net-Centric Operations.Wednesday, May 21, 2008, Conference Sessions, 8:30AM - 09:30AM, Getting to Web Semantics for Spreadsheets in the U.S. Government, Lee Feigenbaum, Cambridge Semantics Inc., and Brand Niemann, US EPA.Wednesday, 5/21/2008, Conference Sessions, 2:45PM - 3:45PM, A New Enterprise Data Management Strategy for the U.S. Government: Support for the Semantic Web and Semantic Technology, Brand Niemann, US EPA.
23 SemTechTuesday, May 20, 2008, Communities of Practice, 7:15AM - 08:15AM, US Federal Government CoP:Welcome and Introduction to Brief Presentations: Brand Niemann.Semantic Exchange and Automatic and Semi-automatic Semantification of s and Documents: Mills Davis, Project10x.Review of Entity Extraction Market, Steve Hamby, Orbis Technologies, Inc.Cooperative Agreements: Hans Polzer, LMCO (NCIOC) and Stefan Decker, DERI.Demo of Getting to Web Semantics for Spreadsheets in the U.S. Government, Lee Feigenbaum, Cambridge Semantics, Inc.Demo of Enterprise Social Computing: Steve Ardire, Trampoline Systems.
24 SemTechTuesday, May 20, 2008, Conference Sessions, 3:15-4:15 PM, Semantics for Net-Centric Operations:The Net-Centric Operations Industry Consortium (NCOIC) comprises a who's who of defense, intelligence, and aerospace contractors. It's mission is to collaboratively engineer new concepts and approaches that will enable the US, its allies, and industry to transition from an era of essentially stand-alone systems and data silos into a world where organizations, personnel, systems, information, and other assets dynamically interface, interoperate, and communicate in context across the Global Information Grid (GIG). A very tall order. This session features three perspectives on the challenges, progress to date, lessons learned, and the role of semantics in delivering net-centric interoperability.See February 5, 2008: Network Centric Operations Industry Consortium - Semantics, Interoperability, Network Centric Operations Industry Consortium & SICoP: Todd Schneider (Raytheon) (also Hans Polzer, Lockheed Martin, John Yanosy, Rockwell-Collins, and Steve Russell, L-3Com).
25 SemTechTuesday, May 20, 2008, Conference Sessions, 3:15-4:15 PM, Semantics for Net-Centric Operations:Introduction to NCOIC, emphasizing the social aspect of net-centricity and it's role in establishing "shared situational awareness" across the community regarding what it means to be net-centric, what the full spectrum and complexity/diversity of the problem space really is, and how this impacts the NCOIC approach to interoperability (Mills Davis for Steve Russell).The Systems, Capabilities, Operations, Programs, and Enterprises (SCOPE) model for interoperability assessment as a way to describe/measure the diversity of the net-centric ecosystem (see next slide) (Hans Polzer).The semantic approach being taken towards pattern development and characterization, why semantics are needed, and the work that has been done to date (Brand Niemann for John Yanosy.
27 SemTechWednesday, May 21, 2008, Conference Sessions, 8:30AM - 09:30AM, Getting to Web Semantics for Spreadsheets in the U.S. Government, Lee Feigenbaum, Cambridge Semantics Inc., and Brand Niemann, US EPA:Overview – what is this project?The Situation – what's done today?Challenges – what are we trying to improve?The Approach – why do semantics help?The Solution – what does the future look like?
28 4. 2008 SemTech The Solution – what does the future look like?: A live connection between spreadsheets and the Web can eliminate the time usually needed for (re-)publishing cycles.On the Web, the Statistical Abstract can be presented as linked, open data, woven into the emerging Semantic Web.More precise and accurate search via guided query with autocomplete, informed by the semantic repository.
29 5. Vision and Implementation 2005: Federal Enterprise Architecture's Data Reference Model (DRM 2.0):Recognized by the leadership of the CIO Council’s Architecture & Infrastructure Committee for outstanding contributions to DRM 2.0 in “forging the necessary conceptual agreements crucial to advancing information-sharing challenges faced by all agencies”.2006: EPA Data Architecture for DRM 2.0:Selected by the Editors of Federal Computer Week for the 2006 Power Players Special Report, Brand Niemann champions collaboration: EPA’s data architect does the groundwork to facilitate electronic data exchanges, October 20, 20062007: EPA Data Architecture for DRM 3.0 and Web 3.0:Online Book: A New Enterprise Information Architecture and Data Management Strategy for the U.S. EPA and the Federal Government. Partnership with DERI for Book Chapter in “Semantic Interoperability for E-Government” (Springer).2008: Semanticommunity.Net – Community Infrastructure Sandbox:Our Space: Connecting Web 2.0, Enterprise Mashups, SOA, and the Semantic Web for Government 2.0 (Making It Easier to Do!).
30 5. Vision and Implementation Our initial objective is to see if Web 2.0 Wikis can be useful in bringing about collaborationacross the Metadata Data Architecture, and Enterprise Architecture activities.A longer range goal would be to see if this Web 2.0 Wiki could be used as an EnterpriseMetadata Management and Application Development Tool (e.g. data and metadata mashups).Data ArchitectureComponentSpecific ArtifactExampleDRM 2.0 (1)DescriptionMetadata (4)Spreadsheet *ContextTaxonomy/Ontology (5)Web 2.5 Wiki SharingData (4)DRM 3.0 (2)Semantics (3)RDF/SPARQL (6)Middleware SOAServices (7)Web 2.5 Wiki (8)Web 3.0 Wiki (9)Footnotes: See next slide.Note: Web 2.0 does DRM 2.0 and Web 3.0 does DRM 2.0/3.0!
31 5. Vision and Implementation Footnotes:(1) FEA DRM 2.0 and Report to Congress (2005).(2) February 6, 2007, and February 5, 2008.(3) Combines Description and Context from DRM 2.0. See (2).(4) The data and metadata are combined together (see Brand Niemann).(5) Information Architecture (topics and subtopics) and Data Architecture (data tables and data elements) are integrated. See Web 2.0 Wiki Pilot: Information Classifications.(6) This specification defines the syntax and semantics of the SPARQL query language for RDF. SPARQL can be used to express queries across diverse data sources.(7) EPA Data Architecture Enterprise Metadata.(8) Video on data reuse in mashups that will revolutionize EPA data architecture, data management, and data reuse applications!* Note: This also works with relational databases.
33 5. Vision and Implementation Data Architecture Subcommittee RecommendationEPA Data ArchitectureImplementationCommentsWeb 2.0 Wiki for CollaborationPilotingPassword protectedData Quality ProfileImplemented for indicatorsFrom Report on the Environment 2008GovernanceDRM 3.0 SuggestionsImplementingFrom SICoP
34 5. Vision and Implementation Enterprise Data Architecture and Implementation:Federated,Faceted,Semantic Searchof both EPA:Data andMetadataWith:GovernanceDRM 2.0 Compliance (shown on next slide)Discovery and Business IntelligenceNote: All of these have been demonstrated in the slides.
35 5. Vision and Implementation DRM 2.0 Compliance:The Three Requirements for Information Sharing Have Been Satisfied!
37 6. Pilot Data SourcesAutomatic and semi-automatic semantification of s and documents:(1) Work with several vendors of natural language processing software(2) Provide each collaborating vendor with a specific test corpus to analyze(3) Have each vendor process the corpus to extract concepts, relationships, etc. from messages and documents in the corpus(4) Have each vendor construct a semantic graph (or other knowledge model) from the semantic metadata extracted from the source content.(5) Have each vendor enable a visualization, navigation, or query against this representation.Note: We will work with you to conduct this pilot test, writeup the results, and present them. The source file(s) for this test are available.
38 6. Pilot Data Sources Data Element Dictionary Environmental ContaminantsBody BurdensChildhood IllnessesSpecial FeaturesEmerging IssuesAmerica'sChildren&the_Environment2003.xls
39 7. Questions & AnswersContact Information:Brand NiemannPress Coverage:December 10, 2007, Semantic Report, Issue 3, Support for the Semantic Web and Semantic Technology in the U.S. GovernmentFebruary 18, 2008, Government Computer News, Context is everything: CIO Council looks to semantic interoperability to expand data sharing.
40 BIODr. Brand Niemann is a Senior Enterprise Architect in the Office of the Chief Information Officer of the Environmental Protection Agency. His work on EPA and Interagency data architecture to facilitate electronic information sharing was recognized by Federal Computer Week in its 2006 Power Player Series Special Report and recently at the Gartner Spring Enterprise Architecture Summit Conference. He was asked by Federal Computer Week to help organize The 22nd Semi-Annual Spring Government CIO Summit “Government by Wiki: New Tools for Collaboration, Information-Sharing, and Decision-Making,” and he provided a keynote on Government by Web 2.0 (Wiki): A Guided Tour. Brand is a recognized leader in the use of communities of practice (CoP) supported by Web 2.0 (Wiki) technology to develop service systems and serves as Co-chair of both the Federal Service-Oriented Architecture CoP and the Federal Semantic Interoperability CoP. He has received special recognitions and awards from the U.S. EPA, the U.S. Department of the Interior, the U.S. Geological Survey, LOTUS Best Application, ComputerWorld-Smithsonian Innovation Competition, Vice President Gore's Hammer Award, OMB, the Quad Council, the Council for Excellence in Government, the Architecture & Infrastructure Committee, the Best Practices Committee, and the American Council for Technology and its Industry Advisory Council.