Presentation is loading. Please wait.

Presentation is loading. Please wait.

Build DoD Vocabularies in the Cloud 3 rd Annual SOA & Semantic Technology Symposium: Interoperable Business Operations Through Shared Understanding Dr.

Similar presentations


Presentation on theme: "Build DoD Vocabularies in the Cloud 3 rd Annual SOA & Semantic Technology Symposium: Interoperable Business Operations Through Shared Understanding Dr."— Presentation transcript:

1 Build DoD Vocabularies in the Cloud 3 rd Annual SOA & Semantic Technology Symposium: Interoperable Business Operations Through Shared Understanding Dr. Brand Niemann, Director and Senior Data Scientist, Semantic Community July 13 th Competency Track - 11:55am-12:30pm July 13-14, 2011 Waterford, Springfield, Virginia 1

2 Semantic Community So far in 2011, Semantic Community has built Knowledge-Centric Systems in the Cloud for: – Data Science and Journalism: Data.gov and Federal Computer Week, Ongoing Since January 2011. 1105 Government Information Group/FOSE Institute’s KM 2011 Conference, May 4, 2011, and Geospatial Summit, September 13, 2011. AOL Government “Show Me The Data” Due to Launch July 11, 2011. – The Open Group’s TOGAF and UDEF: The Open Group San Diego Conference, February 7, 2011. The Open Group London Conference, May 11, 2011. – Semantic Interoperability: Keynote at SEMIC.EU Annual Conference, May 18, 2011. Conference Presentation at SemTech 2011, June 7, 2011. Federal Data Architecture Subcommittee, June 9, 2011. – “Big Health Data”: One of the Top Submissions for HealthyPeople.gov Challenge, March 14, 2011. Finalist in the Health Data Initiative Forum, June 9, 2011. – DoD: RFI for Data Analysis and Collaboration Tool to Support the DoD OIG, June 28, 2011. 3 rd Annual DoD SOA and Semantic Technology Symposium, July 13, 2011. This presentation will show examples from simple (e.g. Air Force One Source) to complex (DOD Office of the Inspector General) DoD Vocabularies. 2

3 Take-Home Message Competency: Creating Competency for Shared Understanding and Interoperable Business Operations. – This track focuses on the development of knowledge and skills for SOA & Semantic projects, the handling of organizational change management, and the governance needed for and associated with such projects and initiatives. Semantic Community Knowledge-Centric Systems: – We take the data (and metadata) directly to information modeling and mashup tools where we then can apply stronger semantic analytics tools. We keep the data (structured and unstructured) and metadata (ontology) together in the knowledgebase in cloud computing tools. – We use effective standards-based approaches for real-world case studies. This presentation could also be in the other two tracks! 3

4 Abstract Several DoD vocabularies have been harvested into the cloud computing tools used by the author to produce data science products. Those are Air Force OneSource and the DoD Common Vocabulary with two vocabularies, one for the HR community and one for UCORE-SL. The purpose of the Semantic Community’s data science products are to show when/where it is practical to insert semantic technologies in support of cross-domain process and analysis, and the value/ease of using other more mature technologies for certain tasks. The practical boundaries we have found supporting data fusion and analysis for information sharing, and when in the process to maximize the value from applying semantic technologies, are discussed. 4 Note: Credit due to Robert Damashek for suggesting this topic to me.

5 Bio Dr. Brand Niemann is the Director and Senior Data Scientist of the Semantic Community. He was the former Senior Enterprise Architect and Data Scientist at the U.S. Environmental Protection Agency and co-led the Federal CIO Council’s Semantic Interoperability Community of Practice (SICOP) with Mills Davis from 2003-2008. He is currently authoring a series of Editorials for Federal Computer Week on his work and recently made Spotfire's Twitter list for his cool visualizations on government data to produce more transparent, open and collaborative business analytics applications. – http://semanticommunity.info/A_Gov_2.0_spin_on_archiving_2.0_data http://semanticommunity.info/A_Gov_2.0_spin_on_archiving_2.0_data – http://spotfireblog.tibco.com/?p=5328 http://spotfireblog.tibco.com/?p=5328 He is working as a data journalist for AOL Government due to launch July 11 th. – http://semanticommunity.info/AOL_Government http://semanticommunity.info/AOL_Government He is also helping organize the 12 th SOA for eGov Conference, October 11 th. – http://semanticommunity.info/Federal_SOA http://semanticommunity.info/Federal_SOA 5

6 Overview 1. Introductions 2. Background 3. Semantic Community Apps 4. DoD Common Vocabulary 5. Data Analysis and Collaboration Tool to Support the DoD OIG 6. Questions and Answers 7. Supplemental Slides – Recreating Other People’s App the Semantic Community Way! 6

7 1. Introductions Six QuestionsDiscussion Leader Attendee 1Attendee 2Etc. Who (are you)?Brand Niemann What (do you do)? Enterprise Architecture & Data Science When (have you done it)? 30+ years of federal service Where (did you do it)? U.S. EPA Why (did you do it)? Idealistic about public service How (do you do it)? Communities of Practice 7 Purpose: To help build your professional network especially for doing SOA and Semantic Technologies going forward. Source: http://en.wikipedia.org/wiki/Five_Wshttp://en.wikipedia.org/wiki/Five_Ws

8 2. Background My Experience with “Handling of organizational change management, and the governance needed for and associated with such projects and initiatives”: – I tried to change EPA from the inside (1980-1996). – I served a detail to the Department of Interior where I was able to start a new organization (1997-2001). – I tried to change the Federal Government in my Federal CIO Council (2002-2008) Roles. – I also tried to change EPA from the outside at the same time. – I am now enjoying being free to do what I think is best to support the Semantic Web/Linked Open Data and Semantic Technologies, but in an easier and simpler way! 8

9 2. Background Federal Semantic Interoperability Community of Practice (SICoP) 2003-2008:SICoP – Five Annual Conferences and Four Special Conferences. Federal SOA Community of Practices (SOA CoP) 2006-Present:SOA CoP – Eleven Semi-Annual Conferences. 12 th October 11 th. Only Special Recognition for Outstanding Contributions to Both SICoP and SOA CoP:Special Recognition – Arun Majumdar, Cutter Consortium/VivoMind Intelligence for Operationalizing SOA-Lessons Learned (Take Home Message: Multi- Level Model-Driven Architecture & First Order Logic). Now from the pilots at these conference come powerful new semantic analytics tools like VivoMind's Textrium and PrologIKS and Semantic Insights Research Assistant (SIRA) that can be used to mine content to produce data science products that support data journalism! 9

10 2. Background ProgramChampionCoP LeaderStandards eForms for eGovMark Forman, OMBRick Rogers, Fenestra Technologies Fenestra Technologies eGrants XML Schema and Web Services Federal SOA CoPRoy Maybury, DoDCory Casanave, Model Driven Solutions Model Driven Solutions Web Services and Open Group MDA and SoAML Federal Semantic Interoperability CoP David Wennergren, Navy CIO Rick Morris, US Army, and Mills Davis, Project10XProject10X W3C Semantic Web in Semantic Technologies Cloud Computing Desktop for OGD & Data.gov/semantic Vivek Kundra, Federal CIO Brand Niemann, US EPA and Semantic CommunitySemantic Community Web Oriented Architecure (MindTouch) Gov 2.0 Platform for Data Science ProductsGov 2.0 Platform for Data Science Products and 5 Stars of LOD5 Stars of LOD Aneesh Chopra, Federal CTO Tim Berners-Lee, W3C Director Brand Niemann, US EPA and Semantic CommunitySemantic Community Open and Quality Data Visualizations (Spotfire) 10 My Experience with “development of knowledge and skills for SOA & Semantic projects”.

11 2. Background Integrate and then automatically semantify with sitemap.org and schema.org, respectively, in the MindTouch Technical Communication Suite. – See Build SemTech 2011 and SemWeb.com Archives in the Cloud.Build SemTech 2011 and SemWeb.com Archives in the Cloud “My Triple” is: Web (Hendler) vs Data (Fox) Science, Metadata (Hendler) vs Metadata+Data (Niemann), and Little (Hendler) vs Stronger (Sowa) Semantics: – (Subject) Data Science (Predicate) is (Object) Metadata+Data and Stronger Semantics. Our Next Events will Focus on Building Knowledge-Centric Systems for “Big Data”. – See Build Health Data Infrastructure and Apps in the Cloud for the recent Health Data Initiative Forum.Build Health Data Infrastructure and Apps in the Cloud 11

12 2. Background Ontology 101: An Introduction to Knowledge Representation, the Web Ontology Language (OWL)& Ontology Development, Elisa Kendall, Thematix, and Deborah McGuinness, RPI/McGuinness Associates. SemTech 2011, June 6, 2011.June 6, 2011 – Knowledge bases, databases, & ontology An ontology is a conceptual model of some aspect of a particular universe of discourse (or a domain of discourse). Typically ontologies contain only "rarified" or "special" individuals, metadata, representing elemental concepts critical to the domain. – A knowledge base is a persistent repository for: Ontology & metadata representing individuals, facts, & rules about how they can be combined or relate to one another. Metadata, facts & rules only - in some applications and frameworks the ontology is separately maintained. – Most inference engines require in-memory deductive databases for efficient reasoning (including commercially available reasoners). – A knowledge base may be implemented in a physical, external databases, such as a relational database, but reasoning is typically done on a subset (partition) of that knowledge base in memory. 12

13 2. Background Detailed Roadmap: – Reuse Content: Text from Desktop, CMS, Web, etc. Tables from Excel, RDBs, etc. “Big Data” from Google, Cray, etc. – Exploit Expertise and Training: SemTech 2011 Interviews, Sponsors, Exhibitors, Vendors, Participants, & Presentations. – Reproduce Other’s Demos: New York Data Mine, Library of Congress, DoD, Top Quadrant, Cambridge Semantics, Revelytx, etc. – Work in the Semantic Community Data Science Product Way: Started with NCOIC Semantic Wiki (Semantic Media Wiki, Visual Knowledge, and MindTouch): – http://semanticommunity.info/Network_Centricity/Semantic_Wikis http://semanticommunity.info/Network_Centricity/Semantic_Wikis Do the data applications while the semantic technologies are maturing then semantify with SIRA, Be Informed, etc. Data Integration with Linked Open Data and Semantics (Five Star Government Data) instead of Semantic Data Integration (Jim Hendler, George Thomas, etc.). 13

14 2. Background Detailed Roadmap (continued): – Build Knowledge-Centric Systems: The concepts, definitions, and instances are together: – Typically instances are not in the ontology, but in the knowledgebase-We put them together - see My EPA Ontology with the sitemap and the schema to support search and inference. Information Architecture, then Data Architecture: – Text as is, give structure, or extract entities. – Tables with URLs and key fields for joins. – Other Resources with URLs. 14

15 2. Background 15 http://semanticommunity.info/EPA/EPA_Ontology The Ontology and Knowledgebase are Together in a Knowledge-Centric System!

16 2. Background Recent Example from SemTech 2011 Sounds Like the Semantic Community Way: – How Hollywood Learned to Love the Semantic Web, Chris Testa, Ad.ly, Inc.: This 5 step process to integrating linked data sets you up to do long term advanced Semantic Analysis with Reasoning, Machine Learning, and so much more: – Understand what your linked “things” are, – Choose a linked dataset, – Reconcile your things, – Build business intelligence, and – Feedback and maintenance. 16 http://www.slideshare.net/testac/how-hollywood-learned-to-love-the-semantic-web

17 2. Background Another Good Lesson from SemTech 2011 That Supports the Semantic Community Way: – When Good Semantic Data and Tools Turn Bad, Baden Hughes, Technology Strategist, Researcher, Entrepreneur and Investor: Much of the value proposition of semantic technology centers on the idea that data is more accessible, interlinked and open for re-use, and that as a result, gains can be made through increased information leverage, better informed decision making, lower operational latency etc. However, with the rapid emergence of large public or semi-public data collections, and the increased prevalence of semantic machinery by which this data can be collected, integrated and analyzed, there is also the risk that the very same semantic data and technologies can be applied for other less desirable purposes. This presentation will consider a number of case studies where availability of enterprise and personal information in the public or semi-public web, combined with freely available semantic analytics tools can be used to determine ‘interesting’ facts about organizations and individuals, at a level of confidence that will be alarming. I Agree! 17 http://semtech2011.semanticweb.com/sessionPop.cfm?confid=62&proposalid=3807

18 3. Semantic Community Apps GeneralWeb SiteBest Content - Centralized Best Content - Distributed US Federal Government (1) Community Sandbox (2) Annual Statistical Abstract (3) and EPA Report on the Environment (4) FedStats.net (5) TOGAF (6)EA Principals, Inc. (7) Training Materials (8) Ecosystem of Frameworks (9) SEMIC.EU (10)Web Site (11)EuroStats (12) and European Environment State and Outlook (13) Global Data Catalog and Data Services (14) Key: See next slide for Key. 18 Source: http://semanticommunity.info/Build_SEMIC.EU_in_the_Cloudhttp://semanticommunity.info/Build_SEMIC.EU_in_the_Cloud Some Best Practice Examples of Semantic Interoperability Interfaces* *The term "interoperable interface" comes from the recent Report to the President and Congress "Designing a Digital Future: Federally Funded Research and Development in Networking and Information Technology", Executive Office of the President and the President's Council of Advisors on Science and Technology, December 2010 (see excerpts in the wiki). excerpts in the wiki

19 3. Semantic Community Apps Key: 1.http://usa.govhttp://usa.gov 2.http://semanticommunity.nethttp://semanticommunity.net 3.http://semanticommunity.info/2010_Annual_Statistical_Abstracthttp://semanticommunity.info/2010_Annual_Statistical_Abstract 4.http://semanticommunity.info/EPA/EPA_Ontologyhttp://semanticommunity.info/EPA/EPA_Ontology 5.No longer operational – see http://www.sdi.govhttp://www.sdi.gov 6.The Open Group Architecture Framework - http://pubs.opengroup.org/architecture/togaf9- doc/arch/index.htmlhttp://pubs.opengroup.org/architecture/togaf9- doc/arch/index.html 7.http://eaprincipals.com/index.htmhttp://eaprincipals.com/index.htm 8.http://semanticommunity.info/Build_TOGAF_in_the_Cloudhttp://semanticommunity.info/Build_TOGAF_in_the_Cloud 9.http://semanticommunity.info/Build_TOGAF_in_the_Cloud#Alternative_enterprise_archite cture_frameworkshttp://semanticommunity.info/Build_TOGAF_in_the_Cloud#Alternative_enterprise_archite cture_frameworks 10.Semantic Interoperability Centre Europe 11.http://www.semic.eu/http://www.semic.eu/ 12.http://ec.europa.eu/eurostathttp://ec.europa.eu/eurostat 13.http://eea.europa.euhttp://eea.europa.eu 14.http://semanticommunity.info/Data.gov/An_Open_Data_Public_Dataset_Catalogs_Faceted _Browserhttp://semanticommunity.info/Data.gov/An_Open_Data_Public_Dataset_Catalogs_Faceted _Browser 19 http://semanticommunity.info/Build_SEMIC.EU_in_the_Cloud

20 3. Semantic Community Apps 20 Spotfire Web Player A Best Practice Examples of Semantic Interoperability Interfaces Everything within four clicks: Click 1-See your data Click 2: Sort/Search your data Click 3: Download your data Click 4: Share your data (e.g. iPad)

21 3. Semantic Community Apps Build EPA in the Cloud Build Data.gov in the Cloud Build NIEM and UCORE Semantic Layer in the Cloud Build the Federal Data Architecture Subcommittee in the Cloud Build FAA NextGen SWIM in the Cloud Build Alion Science in the Cloud Build Air Force One Source in the Cloud Build UDEF in the Cloud Build TOGAF in the Cloud Build SEMIC.EU in the Cloud Etc. DoD-SemTech 2011 Oracle-SemTech 2011 Binary-Robert Damashek gave me the idea for this! Spry-SemTech 2011 iTKO-SOA for e-Gov at MITRE IBM-SemTech 2011 & SOA for e-Gov at MITRE ManTech- ASC-SemTech 2011 BBN-SemTech 2011 BMA-?? (DoD??) SRA-?? HHS-SemTech 2011 (Supplemental Slides) SPAWAR-July 26-28 Conference AMDOCS-SemTech 2011 Alion-Example App Expert Systems-SemTech 2011 IMSC-SemTech 2011 Modus 21-?? JPDO-Example App DovelTech- IDA-SemTech 2011 ORBIS-SemTech 2011 MITRE-SemTech 2011 NATO-NCOIC SOA Systems-Previous Conferences 21 Participants in This Conference Semantic Community Apps Give us your data and we will reproduce your app!

22 3. Semantic Community Apps 22 http://semanticommunity.info/Build_DoD_Vocabularies_in_the_Cloud/Proposal_Demo#Step_5._Inventory_and_Visualization_of_Spotfire_Assets Spotfire: About 100 Semantic Community: About 100

23 3. Semantic Community Apps DEFENSE INTELLIGENCE AGENCY SELECTS TIBCO SPOTFIRE ANALYTICS SOLUTIONS FOR DEPARTMENT OF DEFENSE INTELLIGENCE INFORMATION SYSTEM COMMUNITY – DIA/DoDIIS Acquires the Power of Spotfire Analytics and Data Visualization Capabilities – Somerville, Mass., May 12, 2011 – TIBCO Software Inc. (NASDAQ: TIBX) today announced that the Defense Intelligence Agency (DIA) has selected the TIBCO Spotfire® Analytics Platform for the Department of Defense Intelligence Information System (DoDIIS). By leveraging predictive analytics of Spotfire, users like the DIA will gain the power to make quick, real-time decisions that will in turn give them an advantage over others. – With more than 16,500 military and civilian employees worldwide, DIA is a major producer and manager of foreign military intelligence. Within the Department of Defense and the Intelligence Community, DIA provides military intelligence to personnel in the field, defense policymakers, and force planners in support of U.S. military planning and operations, and weapon systems acquisition. – “Years ago, the Abrams tank marked a new technological era on the warfront but tomorrow’s battlefield will be fought in the information realm,” said Rock Gnatovich, senior vice president and chief operating officer for Spotfire, TIBCO Software Inc. “Achieving the concept of ‘the two-second advantage’ – or the ability to capture the right information at the right time and act on it preemptively for a competitive advantage – is absolutely critical to the time-sensitive nature of today’s global intelligence environment.” 23 http://spotfire.tibco.com/about-spotfire/news-room/press-releases/2011/05_12_11-Defense-Intelligence-Agency.aspx

24 3. Semantic Community Apps Smart Mapping: Automatic Creation of Information Models: – Spotfire 3.3 Information Services users can automatically generate 1-to-1 mappings of the existing tables and columns in their Data Sources. Just generate a Data Source in Spotfire, then right click it and select “Create Default Information Model…” This helps a lot when the work has already been done to nicely model and expose tables for business applications such as Spotfire, so the mapping step is more about transparency than transformation. For example, if you use Spotfire Application Data Services, you do the work in ADS to expose Spotfire-ready tables and columns, so a simple transparent mapping of those elements through Spotfire Information Services can now be accomplished in one click. Note that the automated creation will work through nested levels of data objects in the data source you supply. – The result is a folder structure that matches the catalogs, schemas etc. that were selected with a column element for each column and an information link for each table containing those column elements. Procedures will get a procedure element and an information link of their own if they return data. – See next slide. 24 http://semanticommunity.info/@api/deki/files/10975/=Whats_New_in_Spotfire_3.3.pdf

25 3. Semantic Community Apps 25

26 3. Semantic Community Apps The Open Group San Diego Conference, February 7-11, 2011 - Enabling Semantic Interoperability Through Next Generation UDEF (joint session with IEEE), Monday, February 7, 6:30 - 9:00 pm: – The Open Group's Universal Data Element Framework (UDEF) standard provides a framework for categorizing, naming, and indexing enterprise data elements. The current version of UDEF is based on an international data management standard (ISO/IEC 11179) and supports enterprise architecture frameworks such as The Open Group Architecture Framework (TOGAF), as well as, recent architecture approaches such as Service Oriented Architecture (SOA) and Cloud Computing. The UDEF standard is also consistent with the "Data-Information-Knowledge-Wisdom (DIKW)" knowledge hierarchy model and related DIKW-oriented data management standards (e.g. TechAmerica/ANSI GEIA-859-2009 and DMG-1, DAMA Data-Management Book-of-Knowledge). UDEF has also made considerable progress as an emerging international standard for providing element level semantic interoperability. For example, the UDEF standard currently provides multi-language support for 11179 compliant object and property UDEF taxonomies (English, Dutch, and French) with additional language support in progress (Chinese, German, Spanish). – Based on the lessons learned from the development and use of the current version of UDEF, the Semantic Interoperability Working Group (SIWG) within The Open Group has begun to explore how to address next generation semantic interoperability needs across different existing information exchange standards, vocabularies, and web resources (e.g. UCore 2.0, DISA Metadata Registry, Air Force Vocab OneSource). The briefing will primarily focus on initial progress of this SIWG effort. – In addition to highlighting the goals and objectives for a "Next Generation UDEF" capability, the briefing will also highlight the need, benefits, and potential payoff for a variety of applications, such as rapidly deployable sensor-networks, inter-operable unmanned sensor systems, and mobile platforms (e.g. robotics). (NOTE: BOLDING IS MINE) – https://www.opengroup.org/sandiego2011/udef-ieee-joint-session.htm https://www.opengroup.org/sandiego2011/udef-ieee-joint-session.htm 26

27 3. Semantic Community Apps 27 http://semanticommunity.info/UDEF

28 3. Semantic Community Apps 28 http://semanticommunity.info/Air_Force_OneSource I did not find the triple store!

29 3. Semantic Community Apps 29 http://semanticommunity.info/Universal_Core_Semantic_Layer

30 4. DoD Common Vocabulary The mission of the Enterprise Information Web (EIW) project is to create an extensible analytical capability built on top of a federation of information systems across the Department of Defense and provide information visibility and access: – Archives: All wikis and vocabularies relevant to the HR EIW project. – Business Process Area: Semantic models for the HRM Domain. – CHRIS Reference Ontology: ?. – Retirements and Separations: DIMHRS Ontology. – HR Analytics: Queries the HR Domain Ontology. – HR Domain Ontology: Central Knowledgebase for Concepts and Terminology within the DoD HR Domain. – Knowledge Center: EIW Training Materials – ODSE Sample Database: Multiple Vocabularies. – Ontology Repository: An important contribution in the overall goal of data integration across the HR domain. 30 https://www.commonvocabulary.army.mil/ui/groups/HR_EIW Sample Content Included in Next Section

31 5. Data Analysis and Collaboration Tool to Support the DoD OIG The mission of the Department of Defense, Office of the Inspector General (DOD OIG) is to promote integrity, accountability, and improvement of Department of Defense personnel, programs, and operations to support the Department’s mission and serve the public interest. Each goal of the DOD OIG requires personnel to perform analysis using structured and unstructured data, both government and non-government sources, and in a wide variety of file formats. Personnel and data sources are spread throughout the globe, requiring teams to acquire data in a remote access storage system for use. Personnel access analysis tools remotely using laptops running Windows XP (SP3) with dual core processors, 3GB RAM, and 50GB memory. The DOD OIG has recognized a need to improve the efficiency and effectiveness of how data is ingested, shared and analyzed across the organization. As well as the need to explore advanced analysis capabilities to better assist personnel in identifying fraud, waste, and abuse in the Department. 31 http://semanticommunity.info/Build_DoD_Vocabularies_in_the_Cloud/Proposal_Demo#BACKGROUND Note: Bolding is mine.

32 5. Data Analysis and Collaboration Tool to Support the DoD OIG 32 http://semanticommunity.info/@api/deki/files/12766/=SAR_Insert_MAR_11.pdf Insert for Report to Congress 2011.

33 5. Data Analysis and Collaboration Tool to Support the DoD OIG Semantic Community Workflow: – 5.1 Information Architecture of Public Web Pages in Spreadsheets as Linked Open Data. – 5.2 Public Reports (Web and PDF) in Wiki as Linked Open Data. – 5.3 Desktop and Network Databases in Wiki and Spreadsheets in Linked Open Data Format. – 5.4 Spreadsheets in Spotfire as Linked Open Data. – 5.5 Spreadsheets in Semantic Insights Research Assistant for Semantic Search, Report Writing, and Ontology Development. 33

34 5. Data Analysis and Collaboration Tool to Support the DoD OIG 34 http://www.dodig.mil/

35 5. Data Analysis and Collaboration Tool to Support the DoD OIG 35 http://semanticommunity.info/@api/deki/files/12769/=DoDOIG.xlsx 5.1 Information Architecture of Public Web Pages in Spreadsheets as Linked Open Data. Tabs (12): Cover Page Press Room Publications 2011 DoD IG Appendices A, F, & I Report to Congress Statistical Highlights Table 3.1 & Figures 3.1 & 3.2

36 5. Data Analysis and Collaboration Tool to Support the DoD OIG MindTouch makes the world's most respected social knowledge base. They power purpose-built help 2.0 communities that connect companies with their customers. Millions use their software every day. Many of the world's most respected brands rely on MindTouch including NASA, SAIC, Booz Allen, Microsoft, Cisco, Washington Post, Viacom, the New York Times, AXA, Timberland and HCA. Innovative companies like RightScale, ExactTarget and Mozilla have standardized on MindTouch for their documentation strategy. The open source.NET Web Oriented Architecture Framework (WOAF) is redefining how enterprise software is built. MindTouch is a recognized expert in both open source and Enterprise 2.0 technologies. The MindTouch Productivity Tools bridge Microsoft office and your desktop for all Windows applications. Have your users continue to work with the applications they're familiar with, instead of forcing them to learn a new tool with our document management solution. With the MindTouch Desktop Suite, you'll save time and money by not having to train users on a new system. 36 http://www.mindtouch.com/

37 5. Data Analysis and Collaboration Tool to Support the DoD OIG 37 http://semanticommunity.info/Build_DoD_Vocabularies_in_the_Cloud/2011_DOD_IG_Semiannual_Report_to_Congress 5.2 Public Reports (Web and PDF) in Wiki as Linked Open Data.

38 5. Data Analysis and Collaboration Tool to Support the DoD OIG 38 5.3 Desktop and Network Databases in Wiki and Spreadsheets in Linked Open Data Format. http://www.mindtouch.com/add-ons/desktop_suite?product-refer=desktop-suite

39 5. Data Analysis and Collaboration Tool to Support the DoD OIG 39 http://semanticommunity.info/Build_TOGAF_in_the_Cloud/TOGAF_9_Course_Directory 5.3 Desktop and Network Databases in Wiki and Spreadsheets in Linked Open Data Format.

40 5. Data Analysis and Collaboration Tool to Support the DoD OIG 40 http://help.mindtouch.us/MindTouch_Pro_Member_Guide/MindTouch_Desktop_Suite

41 5. Data Analysis and Collaboration Tool to Support the DoD OIG 41 PC Desktop Spotfire Spreadsheets in Spotfire as Linked Open Data. 5.4 Spreadsheets in Spotfire as Linked Open Data.

42 5. Data Analysis and Collaboration Tool to Support the DoD OIG 42 Spotfire Web PlayerWeb Player Recall Section 4. DoD Common Vocabulary 5.4 Spreadsheets in Spotfire as Linked Open Data.

43 5. Data Analysis and Collaboration Tool to Support the DoD OIG 43. http://www.semanticinsights.com/company/presentation.htm 5.5 Spreadsheets in Semantic Insights Research Assistant for Semantic Search, Report Writing, and Ontology Development.

44 5. Data Analysis and Collaboration Tool to Support the DoD OIG 44 http://www.semanticinsights.com/company/SI%20Fact%20Sheet.pdf SIRA can be used to find similarity between current and past events that are expressed or hinted at in text. SIRA can be used to find relationships of people, places, things and activities that may be expressed or hinted at in text.

45 6. Questions and Answers Sound Byte: Bring the data and the metadata back together and do the data science first to accomplish a business need and lay a solid foundation for integration and application of semantic technologies. Questions about the steps I followed? Questions about the results I produced? See Supplemental Slides for the Data Science Approach to Semantic Web/Technology Pilots. 45

46 7. Supplemental Slides 7.1 Semantic Technology Training: Building Knowledge-Centric Systems – KM 2011 – SemTech 2011 7.2 W3C Government Linked Data Working Group – Clinical Quality Linked Data on Health.data.gov – Build Clinical Quality Linked Data on Health.data.gov in the Cloud – Hospital Compare Downloadable Database Example of "5 Star Government Data“ 7.3 Library of Congress Project Recollection and Digital Preservation Initiative 7.4 Elsevier/Tetherless World Health and Life Sciences Hackathon (27-28 June 2011) – Build TWC in the Cloud – Build NCI CLASS in the Cloud – Build the NYC Data Mine Health in the Cloud – Build SciVerse Apps in the Cloud (IN PROCESS) 7.5 Be Informed (IN PROCESS) 46

47 7.1 Semantic Technology Training: Building Knowledge-Centric Systems 47 http://semanticommunity.info/FOSE_Institute/Knowledge_Management

48 7.1 Semantic Technology Training: Building Knowledge-Centric Systems 48 http://semanticommunity.info/Semantic_Technology_Conferences

49 7.2 W3C Government Linked Data Working Group The mission of the Government Linked Data (GLD) Working Group is to provide standards and other information which help governments around the world publish their data as effective and usable Linked Data using Semantic Web technologies. This group will develop standards-track documents and maintain a community website in order to help governments at all levels (from small towns to nations) share their data as high quality ("five-star") linked data. The Working Group will construct and maintain an online directory of the government linked data community. "Cookbook" Advice Site The group will produce Best Practices for Publishing Linked Data. The group will develop Standard Vocabularies. First Face-to-Face Meeting, June 29-30 th, NSF, Arlington, VA. 49 http://www.w3.org/2011/gld/charter

50 7.2 Open Public Dataset Catalogs Faceted Browser 50 http://semanticommunity.info/Data.gov/An_Open_Data_Public_Dataset_Catalogs_Faceted_Browser

51 7.2 Linked Data Cookbook Linked Data is an evolving set of techniques for publishing and consuming data on the Web. Learn how Linked Data can turn the Web into a distributed database and how you can participate. In this session, Bernadette Hyland takes the mystery out of Linked Data by summarizing seven steps to prepare your data sets as Linked Data and announce it so others will use it. – Model without context: There is a Process: Identify, Model, Name, Describe, Convert, Publish, and Maintain. I Disagree! Participants will understand the actual steps to produce high quality, useful data sets that can be modeled, transformed, documented and available on the Linked Data cloud. We'll discuss a recent government agency that did just this in less than 12 weeks. Best practices for data publishing as well as the "social contract" one makes as a publisher will be discussed. – Better to make progress with something rather than do nothing because we cannot be comprehensive and complete. I Disagree! Bernadette oversees strategy for Talis‘ North American clients. She brings a strong background in commercial and government data management strategies, coupled with expertise in leading high-growth software organizations. Prior to joining Talis, Bernadette was CEO of several profitable Internet companies delivering scalable Web-based solutions for the enterprise, including Zepheira LLC and Tucana Technologies Inc., a pioneer in the emerging semantic technology community. 51 http://semtech2011.semanticweb.com/sessionPop.cfm?confid=62&proposalid=3822

52 7.2 Linked Data Cookbook 1. Leverage what exists. – Obtain data extracts (i.e., databases and/or spreadsheets) or create data in a way that can be replicated. 2. Model data without context to allow for reuse and easier merging of data sets. – With LD, application logic does not drive the data schema, concepts, etc. 3. Look for real world objects of interest (e.g., people, places, things, locations, etc.) and model them. – Use common sense to decide whether or not to make link. I Disagree! 4. Connect data from different sources and authoritative vocabularies (see list of popular vocabularies below). – Put aside immediate needs of any application. I Disagree! – Don’t think about how an application will use your data. I Disagree! 5. Write a script or process to convert the data set repeatedly. 6. Publish to the Web and announce it! (more details shortly). 7. Maintenance strategy (more details in the social contract at the end). 52 http://www.slideshare.net/bhylandwood/bernadette-hyland-semtech-2011-west-linked-data-cookbook

53 7.2 Linked Data Cookbook Guidelines for merging: – URIs name the resources we are describing. – Two people using the same URI are describing the same thing. – The same URI in two datasets means the same thing. – Graphs from several different sources can be merged. – Resources with the same URI are considered identical. – No limitations on which graphs can be merged. For a government agency... a data policy is “a must”: – specify data quality and retention, treatment of data thru secondary sources, restrictions for use, frequency of updates, public participation, and applicability of this data policy. I Agree! 53 http://www.slideshare.net/bhylandwood/bernadette-hyland-semtech-2011-west-linked-data-cookbook

54 7.2 Linked Data Cookbook 54 http://www.slideshare.net/bhylandwood/bernadette-hyland-semtech-2011-west-linked-data-cookbook

55 7.2 Clinical Quality Linked Data on Health.data.gov 55 http://www.data.gov/communities/node/81/blogs/4920 See Next Slide

56 7.2 Clinical Quality Linked Data on Health.data.gov 56 http://health.data.gov/def/hospital/Hospital

57 7.2 Clinical Quality Linked Data on Health.data.gov 57 http://health.data.gov/doc/hospital/393303.csv

58 7.2 Clinical Quality Linked Data on Health.data.gov 58 http://www.slideshare.net/george.thomas.name/clinical-quality-linked-data-on-healthdatagov

59 7.2 Health data innovation 'at a crawl' The health care data community should step up its efforts to innovate to help improve the nation’s health outcomes and reduce costs, Health and Human Services Secretary Kathleen Sebelius said at the department’s second Health Data Initiative Forum on June 9. “Use tools and use data,” Sebelius said at the forum held at the National Institute of Medicine campus in Bethesda, Md. “Do it more, do it better and do it faster.” Sebelius said Americans experience a “triple loss” due to having the highest public health care costs, highest private health care costs, and only mediocre health outcomes. The goal of the conference was to present 45 winning health care IT applications developed with HHS’ newly-available data sets within the last several months. HHS CTO Todd Park called the event a “Health Data Palooza” that would showcase innovation in health IT. – PerlDiverInc and Semantic Community were one of the finalists! 59 http://fcw.com/articles/2011/06/09/nation-needs-more-health-data-innovation-sebelius-says-at-forum.aspx

60 PearlDiver Data Engine & Semantic Community Data Visualization Benjamin YoungBrand Niemann PearlDiver Technologies Inc.Semantic Community Health Data Initiative Forum Submission Medicare Zombie Hunter

61 7.2 Build Clinical Quality Linked Data on Health.data.gov in the Cloud 61 http://semanticommunity.info/Semantic_Technology_Conferences/Clinical_Quality_Linked_Data_on_Health.data.gov

62 7.2 Build Clinical Quality Linked Data on Health.data.gov in the Cloud 62 http://semanticommunity.info/Semantic_Technology_Conferences/Clinical_Quality_Linked_Data_on_Health.data.gov/Hospital_Compare_Downloadable_Database_Metadata

63 7.2 Build Clinical Quality Linked Data on Health.data.gov in the Cloud 63 PC Desktop Spotfire

64 7.2 Build Clinical Quality Linked Data on Health.data.gov in the Cloud 64 Spotfire Web PlayerWeb Player

65 7.3 Library of Congress Project Recollection and Digital Preservation Initiative 65 The Libraries of Congress & MIT are developing a Semantic Web Browser (Exhibit and now Exhibit 3) to do essentially what Spotfire already does!

66 7.3 Library of Congress Project Recollection and Digital Preservation Initiative 66 PC Desktop Spotfire

67 7.3 Library of Congress Project Recollection and Digital Preservation Initiative 67 http://semanticommunity.info/Semantic_Technology_Conferences/Library_of_Congress

68 7.3 Library of Congress Project Recollection and Digital Preservation Initiative 68 Spotfire Web PlayerWeb Player Interoperability Interface!

69 7.4 Elsevier/Tetherless World Health and Life Sciences Hackathon (27-28 June 2011) 69 http://semanticommunity.info/Build_TWC_in_the_Cloud

70 7.4 NYC Data Web 70 http://knoodl.com/ui/groups/NYC_Homepage

71 7.4 NYC Data Web 71 http://semanticommunity.info/Semantic_Technology_Conferences/NY_Data_Mine/Revelytix Quote: Ontology architecture is a new aspect of system architecture and development, to our knowledge it has not been employed anywhere else in DOD.

72 7.4 NYC Data Web 72 http://semanticommunity.info/Semantic_Technology_Conferences/NY_Data_Mine/Revelytix#Dashboard

73 7.4 NYC Data Web 73 PC Desktop Spotfire

74 7.4 NYC Data Web 74 PC Desktop Spotfire

75 7.5 Be Informed A recent paper describes the formalism and rationale that Be Informed applies to business process modeling. It explains how and why goal- oriented modeling differs from more conventional business process modeling which is procedural. In the near-term, there is applicability for many government agencies, especially for those exploring semantic approaches. For example, Dennis Wisnoski advocates semantic web (RDF & OWL) standards for modeling data integration, and a dialect of BPMN for modeling processes. The metaphor for processes is an electronic circuit specification that uses standard building blocks. "We all know what those primitives mean." Previous, costly attempts at business process modeling were failures in part because there was no standard at the primitive level. However, as this paper makes clear, just having unambiguous primitives is only part of what is needed to specify and manage complex and dynamic business processes. Modeling flow in swim lanes is less agile than modeling goals, activities, and pre and post conditions. 75 Source: Mills Davis, Project10x, July 5, 2011.

76 7.5 Be Informed 76 Source: Specifying Flexible Business Processes using Pre and Post Conditions, Jeroen van Grondelle and Menno Gulpers, Be Informed BV, Apeldoorn, The Netherlands, 13 pp. Fig. 1. Summary of the Meta Model for Capturing Business Processes


Download ppt "Build DoD Vocabularies in the Cloud 3 rd Annual SOA & Semantic Technology Symposium: Interoperable Business Operations Through Shared Understanding Dr."

Similar presentations


Ads by Google