Build DoD Vocabularies in the Cloud 3 rd Annual SOA & Semantic Technology Symposium: Interoperable Business Operations Through Shared Understanding Dr.

Slides:



Advertisements
Similar presentations
Introduction Lesson 1 Microsoft Office 2010 and the Internet
Advertisements

Data Science for Business: Semantic Verses Dr. Brand Niemann Director and Senior Data Scientist Semantic Community
Federal SOA for E-Government The Top Ten Things You Need to Know for YouTube October 15, 2011 DRAFT 1
OMB Data Visualization Tool Requirements Analysis: Oracle Dr. Brand Niemann Director and Senior Data Scientist Semantic Community
Virtual SharePoint Summit 2010 hosted by Rackspace Overcoming Collaboration Challenges with SharePoint Chris Samson Leslie Sistla Virtual SharePoint Summit.
Information and Business Work
DoDAF 3.0: A Web 2.0 and SOA Mashup!
Presentation to Data.gov PMO Semantic Web/Linked Data Team Dr. Brand Niemann Director and Senior Data Scientist Semantic Community July 27,
Build Air Force OneSource in the Cloud for the Data.Gov and Open Government Vocabulary Teams UDEF Deployment Workshop Planning Meeting at the Open Group.
Build DoD Vocabularies in the Cloud 3 rd Annual SOA & Semantic Technology Symposium: Interoperable Business Operations Through Shared Understanding Dr.
SICoP 2011: Transforming Government through Innovation with Semantic Technologies Brand L. Niemann Director and Senior Data Scientist, Semantic Community.
Build the Binary Group in the Cloud Brand Niemann Senior Enterprise Architect Binary Group August 5, Updated August 8,
Build Systems of Systems in the Cloud: Tutorial Brand Niemann Director and Senior Data Scientist Semantic Community November 9,
Binary Group at the LandWarNet Conference "Transforming Cyber While at War" Tampa Convention Center August 22-26, 2011 This document contains Binary Group,
Semantic Interoperability Community of Practice (SICoP) Semantic Web Applications for National Security Conference Hyatt Regency Crystal City, Regency.
OMB Data Visualization Tool Requirements Analysis: Logi Analytics Dr. Brand Niemann Director and Senior Data Scientist Semantic Community
OMB Data Visualization Tool Requirements Analysis: Microsoft Dr. Brand Niemann Director and Senior Data Scientist Semantic Community
NLM-Semantic Medline Data Science Data Publication Commons Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community Data.
Big Data and Social Media & Web Analytics Innovation Dr. Brand Niemann Director and Senior Enterprise Architect – Data Scientist Semantic Community
NIST Scientific Data for Data Science United Nations Open Data / Open Government Conference, April 26-28, Abu Dhabi
Semantic Data Discovery: Proof of Concept for DHS
Linked Data Visualizations for Eurostat Linked Data Dr. Brand Niemann Director and Senior Data Scientist Semantic Community
Cloud: SOA, Semantics, & Data Science Welcome and Overview Dr. Brand Niemann Director and Senior Data Scientist Semantic Community
OMB Data Visualization Tool Requirements Analysis: SAP Dr. Brand Niemann Director and Senior Data Scientist Semantic Community
1 Semantic Cloud Computing & Open Linked Data Pattern Brand Niemann Invited Expert to the NCIOC SCOPE and Services WGs September 22, 2009.
1 Gov 2.0 for EPA: Pollution Prevention and Toxics In Support of the June 9-13, 2008 National Dialogue on How to Enhance Access to Environmental Information:
Imagine Everything is Before You: Past, Present, and Future Paper and Demonstration for the 2014 Family History Technology BYU Dr. Brand Niemann.
Information Sharing Begins With Me Dr. Brand Niemann Director and Senior Enterprise Architect – Data Scientist Semantic Community
GIS Data Science for Collaboration Across Communities: GIScience 2.0 and Beyond Dr. Brand Niemann Director and Senior Data Scientist Semantic Community.
1 Briefing for EPA and OEI Communications Coordinators and Press Officers Brand Niemann US EPA Senior Enterprise Architect and Federal CoP Leader January.
Using Data Science as Evidence in Public Policy With Big Data and Elections Dr. Brand Niemann Director and Senior Enterprise Architect – Data Scientist.
EPA Indicators of Our Health and Environment Updated and Improved Dr. Brand Niemann Director and Senior Enterprise Architect – Data Scientist Semantic.
Big Data Symposium: Analytics and Applications for Federal Big Data – Bureau of Justice Statistics Dr. Brand Niemann Director and Senior Enterprise Architect.
The Semantic Community: Building Knowledge-Centric Systems in the Cloud Keynote Presentation for the SEMIC.EU Conference on Rethinking Semantic Interoperability.
Big Data Symposium: Analytics and Applications for Federal Big Data - FEMA Dr. Brand Niemann Director and Senior Enterprise Architect – Data Scientist.
Farm Data Dashboards: USDA and Microsoft Innovation Challenge Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community Data.
Data Science for DataBay DataBay "Reclaim the Bay" Innovation Challenge: August 1-3, 2014, Smithsonian Environmental Research Center, 647 Contees Wharf.
Data Science for DTIC Data Ecosystem Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community
1 Wikify Your Best Content in Support of the OGD and Data.gov/semantic: Information Architecture Tutorial EPA Web Work Group, EPA Wiki and Blog Work Group,
The 2012 EuroStat Regional Yearbook for Semantic Interoperability Dr. Brand Niemann Director and Senior Enterprise Architect – Data Scientist Semantic.
Why Doesn't EPA Have a Self- Contained Statistical Unit?: A Tribute to Doug Engelbart Dr. Brand Niemann Director and Senior Data Scientist Semantic Community.
1 Services and Cloud Computing Work Groups: Status Report Brand Niemann US EPA December 3, 2009.
Open DATA METI: All Content As Big Data Dr. Brand Niemann Director and Senior Enterprise Architect – Data Scientist Semantic Community
OEI’s Services Portfolio December 13, 2007 Draft / Working Concepts.
Research on US Federal Government Handling of Data Dr. Brand Niemann Director and Senior Enterprise Architect – Data Scientist Semantic Community
Binary Group Knows What It Knows Because of It’s Information Attitude Brand Niemann Senior Enterprise Architect and Data Scientist August 26,
1 A Target Data Architecture for the US EPA: Implementing DRM 3.0 and Data.gov Brand Niemann Senior Enterprise Architect, US EPA April 21, 2009 PARS 2009.
Data Science for the NOAA Chief Data Officer Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community
1 Services and Cloud Computing Work Groups: Status Update Brand Niemann US EPA December 18, 2009.
Data Science for HealthCare.gov Dr. Brand Niemann Director and Senior Data Scientist Semantic Community
Data Science for Semantics Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community Data Science Data Science for Semantics.
Department of Commerce App Challenge: Big Data Dashboards Dr. Brand Niemann Director and Senior Enterprise Architect – Data Scientist Semantic Community.
Data Science for DoI BSEE Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community Data Science Data Science for DoI BSEE.
SICoP 2011: Transforming Government through Innovation with Semantic Technologies Semantic Tech and Business Conference, November 29 – December 1, 2011.
NGA Demo Participant Collaboration Dr. Brand Niemann Director and Senior Enterprise Architect – Data Scientist Semantic Community
Cross Information Sharing and Integration for the Intelligence Community: 13 th SOA for eGovernment Conference Dr. Brand Niemann Director and Senior Enterprise.
NIEM 3.0 Data Analytics App Dr. Brand Niemann Director and Senior Data Scientist Semantic Community AOL Government Blogger.
1 Promoting Careers in Knowledge Management: My Experiences Brand Niemann Library of Congress June 3, 2010.
1 Improved Access to EPA and Interagency Information: Before and After with Web 2.0 – Part 4 Interagency and Non-government (in process) Brand Niemann.
Using Open Data to Create Value for Citizens. Data.gov Provides instant access to ~400,000 datasets in easy to use formats Contributions from UN, World.
Connecting People With Information Transforming the Way the DoD Manages Data M. David Allen OASD(NII)/DoD CIO May 23, 2006 “The.
Project Management May 30th, Team Members Name Project Role Gint of Communications Sai
System Development & Operations NSF DataNet site visit to MIT February 8, /8/20101NSF Site Visit to MIT DataSpace DataSpace.
Driving Innovation with Open Data Chris Musialek in place for Jeanne Holm Data.gov February 9, 2012.
Data Science for the National Big Data R&D Initiative Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community
Semantic Enhancements for DoD Information Sharing, Enterprise Architecture, and Standards Dr. Brand Niemann Director and Senior Enterprise Architect –
Federal Communities of Practice: IBM Contributions
Brand Niemann, US EPA and
Spotfire 5 Users Guide Dashboard
Title: Build EPA Apps in the Cloud
Presentation transcript:

Build DoD Vocabularies in the Cloud 3 rd Annual SOA & Semantic Technology Symposium: Interoperable Business Operations Through Shared Understanding Dr. Brand Niemann, Director and Senior Data Scientist, Semantic Community July 13 th Competency Track - 11:55am-12:30pm July 13-14, 2011 Waterford, Springfield, Virginia 1

Semantic Community So far in 2011, Semantic Community has built Knowledge-Centric Systems in the Cloud for: – Data Science and Journalism: Data.gov and Federal Computer Week, Ongoing Since January Government Information Group/FOSE Institute’s KM 2011 Conference, May 4, 2011, and Geospatial Summit, September 13, AOL Government “Show Me The Data” Due to Launch July 11, – The Open Group’s TOGAF and UDEF: The Open Group San Diego Conference, February 7, The Open Group London Conference, May 11, – Semantic Interoperability: Keynote at SEMIC.EU Annual Conference, May 18, Conference Presentation at SemTech 2011, June 7, Federal Data Architecture Subcommittee, June 9, – “Big Health Data”: One of the Top Submissions for HealthyPeople.gov Challenge, March 14, Finalist in the Health Data Initiative Forum, June 9, – DoD: RFI for Data Analysis and Collaboration Tool to Support the DoD OIG, June 28, rd Annual DoD SOA and Semantic Technology Symposium, July 13, This presentation will show examples from simple (e.g. Air Force One Source) to complex (DOD Office of the Inspector General) DoD Vocabularies. 2

Take-Home Message Competency: Creating Competency for Shared Understanding and Interoperable Business Operations. – This track focuses on the development of knowledge and skills for SOA & Semantic projects, the handling of organizational change management, and the governance needed for and associated with such projects and initiatives. Semantic Community Knowledge-Centric Systems: – We take the data (and metadata) directly to information modeling and mashup tools where we then can apply stronger semantic analytics tools. We keep the data (structured and unstructured) and metadata (ontology) together in the knowledgebase in cloud computing tools. – We use effective standards-based approaches for real-world case studies. This presentation could also be in the other two tracks! 3

Abstract Several DoD vocabularies have been harvested into the cloud computing tools used by the author to produce data science products. Those are Air Force OneSource and the DoD Common Vocabulary with two vocabularies, one for the HR community and one for UCORE-SL. The purpose of the Semantic Community’s data science products are to show when/where it is practical to insert semantic technologies in support of cross-domain process and analysis, and the value/ease of using other more mature technologies for certain tasks. The practical boundaries we have found supporting data fusion and analysis for information sharing, and when in the process to maximize the value from applying semantic technologies, are discussed. 4 Note: Credit due to Robert Damashek for suggesting this topic to me.

Overview 1. Introductions 2. Background 3. Semantic Community Apps 4. DoD Common Vocabulary 5. Data Analysis and Collaboration Tool to Support the DoD OIG 6. Questions and Answers 7. Supplemental Slides – Recreating Other People’s App the Semantic Community Way! 5

2. Background My Experience with “Handling of organizational change management, and the governance needed for and associated with such projects and initiatives”: – I tried to change EPA from the inside ( ). – I served a detail to the Department of Interior where I was able to start a new organization ( ). – I tried to change the Federal Government in my Federal CIO Council ( ) Roles. – I also tried to change EPA from the outside at the same time. – I am now enjoying being free to do what I think is best to support the Semantic Web/Linked Open Data and Semantic Technologies, but in an easier and simpler way! 6

2. Background Federal Semantic Interoperability Community of Practice (SICoP) :SICoP – Five Annual Conferences and Four Special Conferences. Federal SOA Community of Practices (SOA CoP) 2006-Present:SOA CoP – Eleven Semi-Annual Conferences. 12 th October 11 th. Only Special Recognition for Outstanding Contributions to Both SICoP and SOA CoP:Special Recognition – Arun Majumdar, Cutter Consortium/VivoMind Intelligence for Operationalizing SOA-Lessons Learned (Take Home Message: Multi- Level Model-Driven Architecture & First Order Logic). Now from the pilots at these conference come powerful new semantic analytics tools like VivoMind's Textrium and PrologIKS and Semantic Insights Research Assistant (SIRA) that can be used to mine content to produce data science products that support data journalism! 7

2. Background ProgramChampionCoP LeaderStandards eForms for eGovMark Forman, OMBRick Rogers, Fenestra Technologies Fenestra Technologies eGrants XML Schema and Web Services Federal SOA CoPRoy Maybury, DoDCory Casanave, Model Driven Solutions Model Driven Solutions Web Services and Open Group MDA and SoAML Federal Semantic Interoperability CoP David Wennergren, Navy CIO Rick Morris, US Army, and Mills Davis, Project10XProject10X W3C Semantic Web in Semantic Technologies Cloud Computing Desktop for OGD & Data.gov/semantic Vivek Kundra, Federal CIO Brand Niemann, US EPA and Semantic CommunitySemantic Community Web Oriented Architecure (MindTouch) Gov 2.0 Platform for Data Science ProductsGov 2.0 Platform for Data Science Products and 5 Stars of LOD5 Stars of LOD Aneesh Chopra, Federal CTO Tim Berners-Lee, W3C Director Brand Niemann, US EPA and Semantic CommunitySemantic Community Open and Quality Data Visualizations (Spotfire) 8 My Experience with “development of knowledge and skills for SOA & Semantic projects”.

3. Semantic Community Apps GeneralWeb SiteBest Content - Centralized Best Content - Distributed US Federal Government (1) Community Sandbox (2) Annual Statistical Abstract (3) and EPA Report on the Environment (4) FedStats.net (5) TOGAF (6)EA Principals, Inc. (7) Training Materials (8) Ecosystem of Frameworks (9) SEMIC.EU (10)Web Site (11)EuroStats (12) and European Environment State and Outlook (13) Global Data Catalog and Data Services (14) Key: See next slide for Key. 9 Source: Some Best Practice Examples of Semantic Interoperability Interfaces* *The term "interoperable interface" comes from the recent Report to the President and Congress "Designing a Digital Future: Federally Funded Research and Development in Networking and Information Technology", Executive Office of the President and the President's Council of Advisors on Science and Technology, December 2010 (see excerpts in the wiki). excerpts in the wiki

4. DoD Common Vocabulary The mission of the Enterprise Information Web (EIW) project is to create an extensible analytical capability built on top of a federation of information systems across the Department of Defense and provide information visibility and access: – Archives: All wikis and vocabularies relevant to the HR EIW project. – Business Process Area: Semantic models for the HRM Domain. – CHRIS Reference Ontology: ?. – Retirements and Separations: DIMHRS Ontology. – HR Analytics: Queries the HR Domain Ontology. – HR Domain Ontology: Central Knowledgebase for Concepts and Terminology within the DoD HR Domain. – Knowledge Center: EIW Training Materials – ODSE Sample Database: Multiple Vocabularies. – Ontology Repository: An important contribution in the overall goal of data integration across the HR domain Sample Content Included in Next Section

5. Data Analysis and Collaboration Tool to Support the DoD OIG The mission of the Department of Defense, Office of the Inspector General (DOD OIG) is to promote integrity, accountability, and improvement of Department of Defense personnel, programs, and operations to support the Department’s mission and serve the public interest. Each goal of the DOD OIG requires personnel to perform analysis using structured and unstructured data, both government and non-government sources, and in a wide variety of file formats. Personnel and data sources are spread throughout the globe, requiring teams to acquire data in a remote access storage system for use. Personnel access analysis tools remotely using laptops running Windows XP (SP3) with dual core processors, 3GB RAM, and 50GB memory. The DOD OIG has recognized a need to improve the efficiency and effectiveness of how data is ingested, shared and analyzed across the organization. As well as the need to explore advanced analysis capabilities to better assist personnel in identifying fraud, waste, and abuse in the Department Note: Bolding is mine.

5. Data Analysis and Collaboration Tool to Support the DoD OIG Semantic Community Workflow: – 5.1 Information Architecture of Public Web Pages in Spreadsheets as Linked Open Data. – 5.2 Public Reports (Web and PDF) in Wiki as Linked Open Data. – 5.3 Desktop and Network Databases in Wiki and Spreadsheets in Linked Open Data Format. – 5.4 Spreadsheets in Spotfire as Linked Open Data. – 5.5 Spreadsheets in Semantic Insights Research Assistant for Semantic Search, Report Writing, and Ontology Development. 12

5. Data Analysis and Collaboration Tool to Support the DoD OIG Information Architecture of Public Web Pages in Spreadsheets as Linked Open Data. Tabs (12): Cover Page Press Room Publications 2011 DoD IG Appendices A, F, & I Report to Congress Statistical Highlights Table 3.1 & Figures 3.1 & 3.2

5. Data Analysis and Collaboration Tool to Support the DoD OIG MindTouch makes the world's most respected social knowledge base. They power purpose-built help 2.0 communities that connect companies with their customers. Millions use their software every day. Many of the world's most respected brands rely on MindTouch including NASA, SAIC, Booz Allen, Microsoft, Cisco, Washington Post, Viacom, the New York Times, AXA, Timberland and HCA. Innovative companies like RightScale, ExactTarget and Mozilla have standardized on MindTouch for their documentation strategy. The open source.NET Web Oriented Architecture Framework (WOAF) is redefining how enterprise software is built. MindTouch is a recognized expert in both open source and Enterprise 2.0 technologies. The MindTouch Productivity Tools bridge Microsoft office and your desktop for all Windows applications. Have your users continue to work with the applications they're familiar with, instead of forcing them to learn a new tool with our document management solution. With the MindTouch Desktop Suite, you'll save time and money by not having to train users on a new system. 14

5. Data Analysis and Collaboration Tool to Support the DoD OIG Public Reports (Web and PDF) in Wiki as Linked Open Data.

5. Data Analysis and Collaboration Tool to Support the DoD OIG Desktop and Network Databases in Wiki and Spreadsheets in Linked Open Data Format.

5. Data Analysis and Collaboration Tool to Support the DoD OIG 17 PC Desktop Spotfire Spreadsheets in Spotfire as Linked Open Data. 5.4 Spreadsheets in Spotfire as Linked Open Data.

5. Data Analysis and Collaboration Tool to Support the DoD OIG 18 SIRA can be used to find similarity between current and past events that are expressed or hinted at in text. SIRA can be used to find relationships of people, places, things and activities that may be expressed or hinted at in text.

6. Questions and Answers Sound Byte: Bring the data and the metadata back together and do the data science first to accomplish a business need and lay a solid foundation for integration and application of semantic technologies. Questions about the steps I followed? Questions about the results I produced? See Supplemental Slides for the Data Science Approach to Semantic Web/Technology Pilots. 19

7. Supplemental Slides 7.1 Semantic Technology Training: Building Knowledge-Centric Systems – KM 2011 – SemTech W3C Government Linked Data Working Group – Clinical Quality Linked Data on Health.data.gov – Build Clinical Quality Linked Data on Health.data.gov in the Cloud – Hospital Compare Downloadable Database Example of "5 Star Government Data“ 7.3 Library of Congress Project Recollection and Digital Preservation Initiative 7.4 Elsevier/Tetherless World Health and Life Sciences Hackathon (27-28 June 2011) – Build TWC in the Cloud – Build NCI CLASS in the Cloud – Build the NYC Data Mine Health in the Cloud – Build SciVerse Apps in the Cloud (IN PROCESS) 7.5 Be Informed (IN PROCESS) 20