Frank van Harmelen Vrije Universiteit Amsterdam The Web of data and LarKC’s role in it Creative Commons License: allowed to share & remix, but must attribute.

Presentation transcript:

Frank van Harmelen Vrije Universiteit Amsterdam The Web of data and LarKC’s role in it Creative Commons License: allowed to share & remix, but must attribute & non-commercial

The Current Information Universe
– linked web-pages, written by people, written for people, used only by people...
– Many of these pages already come from data, usable by computers!
– But we can’t link the data...
The Future Information Universe
– linked data, usable by computers!
– useful for people!

How far away is this? Not very far away!
Rapidly growing Linked Open Data cloud: already many billions of facts & rules
– Encyclopedia
– Geographic names (millions)
– names of artists & art works (10.000’s)
– scientific bibliographies
– hierarchical dictionaries (UK, FR, NL)
– life-science databases
– any CD ever recorded
– (almost) every book sold by Amazon
– basic facts on every country on the planet
– common sense rules & facts ( ’s)
It gets bigger every month

Full Web-style decoupling: re-usability, independence
All identifiers are URLs (= on the Web)
– Allows total decoupling of data, vocabulary and meta-data
– Example: x [ IsOfType ] T, with different owners & locations
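As a minimal sketch of this decoupling, the Python below models a fact as a (subject, predicate, object) triple whose three parts are all URLs. Every URL and host name here is hypothetical and chosen only for illustration; the point is that the data item, the vocabulary term and the relation can each be published by a different owner and still be combined without copying.

```python
from urllib.parse import urlparse

# Hypothetical URLs: each host stands for a different owner and location.
X = "http://data.example.org/item/x"                 # the data item "x"
T = "http://vocab.example.net/term/T"                # the vocabulary term "T"
IS_OF_TYPE = "http://meta.example.com/rel/IsOfType"  # the relation between them

# A fact is a (subject, predicate, object) triple of URLs.
triples = {(X, IS_OF_TYPE, T)}


def owners(triple):
    """Return the host that publishes each part of a triple."""
    return tuple(urlparse(url).netloc for url in triple)


def types_of(subject, facts):
    """Look up every type asserted for a subject."""
    return {o for (s, p, o) in facts if s == subject and p == IS_OF_TYPE}


print(owners((X, IS_OF_TYPE, T)))
print(types_of(X, triples))
```

Because the identifiers are plain URLs, a third party could add further triples about `X` without coordinating with any of the three publishers.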

For the first time ever, it is now possible to re-use somebody else's knowledge base:
– without having to talk to them first (syntax, semantics)
– without having to make copies
Rapid growth: "billion triple challenge" (= machine-reason with a billion facts and rules)
– 2006: “where do we get a billion facts from?”
– 2008: “which billion shall we choose!”

What to do when success is becoming a problem?
The Large Knowledge Collider: a platform for infinitely scalable reasoning on the data-web

Infinite scalability?
– parallelisation: cluster computing
– distribution: “self-computing semantic Web”
– approximation: “almost” is often good enough, gets better with more resources

First result: MaRVIN
MaRVIN scales by:
– distribution (over many nodes)
– approximation (sound but incomplete)
– anytime convergence (more complete over time)
brain the size of a planet
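The three properties listed for MaRVIN can be illustrated with a toy example. The sketch below is not MaRVIN's actual algorithm; it is an assumed, simplified stand-in that computes the transitive closure of a relation in rounds. The facts are split across a number of simulated "nodes", every intermediate snapshot contains only valid conclusions (sound but incomplete), and later snapshots contain more of them (anytime convergence). The function name and the `nodes` parameter are hypothetical.

```python
def anytime_closure(facts, nodes=2):
    """Yield growing snapshots of the transitive closure of `facts`.

    Each snapshot is sound (it holds only pairs that really follow from
    the input) but may be incomplete until the final one is reached.
    """
    known = set(facts)
    while True:
        yield frozenset(known)
        ordered = sorted(known)
        derived = set()
        for n in range(nodes):
            chunk = ordered[n::nodes]  # this simulated node's share of the facts
            derived |= {(a, d) for (a, b) in chunk
                        for (c, d) in ordered if b == c}
        if derived <= known:  # nothing new: the closure is complete
            return
        known |= derived


chain = {("a", "b"), ("b", "c"), ("c", "d"), ("d", "e")}
for round_no, snapshot in enumerate(anytime_closure(chain)):
    print(round_no, len(snapshot))
```

A caller that is short on time can stop consuming the generator after any round and still use the snapshot it has, which is the sense in which “almost” can be good enough.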

The consortium: 14 partners, 50 people

The project: 10M€ budget, 3.5 years, 80 person years, 3 case studies, 14 partners

Use case: Drug Discovery
Problem: pharmaceutical R&D in early clinical development is stagnating
FDA white paper Innovation or Stagnation (March 2004):
– “developers have no choice but to use the tools of the last century to assess this century's candidate solutions.”
– “industry scientists often lack cross-cutting information about an entire product area, or information about techniques that may be used in areas other than theirs”
“Show me any potential liver toxicity associated with the compound’s drug class, target, structure and disease.”
– Genetics: “Show me all liver toxicity associated with the target or the pathway.”
– Chemistry: “Show me all liver toxicity associated with compounds with similar structure”
– Literature: “Show me all liver toxicity from the public literature and internal reports that are related to the drug class, disease and patient population”
Current NCBI: linking but no inference
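The difference between "linking" and "inference" in the last line of this slide can be sketched in a few lines of Python. All compound, class, target and pathway names below are invented toy data, not taken from any real database, and the traversal is only one simple way to infer: a direct lookup finds nothing recorded on the compound itself, while following its links to the drug class, target and pathway surfaces the toxicity reports attached to those.

```python
# Hypothetical toy facts; none of these names come from a real database.
LINKS = {
    ("compound_A", "in_class", "class_X"),
    ("compound_A", "hits_target", "target_T"),
    ("target_T", "part_of", "pathway_P"),
    ("class_X", "reported_effect", "liver_toxicity"),
    ("pathway_P", "reported_effect", "liver_toxicity"),
}


def direct_effects(entity):
    """Linking only: effects recorded on the entity itself."""
    return {o for (s, p, o) in LINKS if s == entity and p == "reported_effect"}


def inferred_effects(entity):
    """Inference: follow links to related entities and collect their effects.

    Returns a dict mapping each effect to the entities it was reported on.
    """
    seen, todo, found = set(), [entity], {}
    while todo:
        current = todo.pop()
        if current in seen:
            continue
        seen.add(current)
        for effect in direct_effects(current):
            found.setdefault(effect, set()).add(current)
        todo.extend(o for (s, p, o) in LINKS
                    if s == current and p != "reported_effect")
    return found


print(direct_effects("compound_A"))
print(inferred_effects("compound_A"))
```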

Use Case: City on-line
Our cities face many challenges
Urban Computing is the ICT way to address them:
– Is public transportation where the people are?
– Which landmarks attract more people?
– Where are people concentrating?
– Where is traffic moving?
– Which location attracts most people right now?
– Is public transportation where people will be?
improve the quality of life

Is anybody doing this for real?
OpenCalais:
– enrich text (news items) with semantic meta-data
– recognise people, places, events, organisations,...
– useful for searching, selecting, personalising, aggregating, summarising, etc
From early ’09:
– identify “people, places, events, organisations,...” by linking to the Open Data cloud
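To make the idea of enriching text with links concrete, here is a toy annotator in Python. It is not the OpenCalais API and does not call any real service; it assumes a small hand-made lookup table (`GAZETTEER`, a hypothetical name) that maps surface strings to an entity type and a made-up identifier URL, and it prefers the longest matching name.

```python
import re

# Hypothetical gazetteer: surface form -> (entity type, identifier URL).
# The URLs are invented placeholders, not real Open Data identifiers.
GAZETTEER = {
    "Frank van Harmelen": ("person", "http://data.example.org/person/frank-van-harmelen"),
    "Vrije Universiteit Amsterdam": ("organisation", "http://data.example.org/org/vu-amsterdam"),
    "Amsterdam": ("place", "http://data.example.org/place/amsterdam"),
}


def annotate(text):
    """Return (surface form, type, URL) for each known entity found in text."""
    # Try longer names first so "Vrije Universiteit Amsterdam" is not
    # split into a shorter match on "Amsterdam".
    names = sorted(GAZETTEER, key=len, reverse=True)
    pattern = re.compile("|".join(re.escape(name) for name in names))
    return [(m.group(0), *GAZETTEER[m.group(0)]) for m in pattern.finditer(text)]


sentence = "Frank van Harmelen works at Vrije Universiteit Amsterdam, in Amsterdam."
for surface, kind, url in annotate(sentence):
    print(kind, surface, url)
```

Once each mention carries a URL rather than a bare string, the searching, selecting and aggregating uses listed above can work on identifiers shared with other datasets.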

Summarising
– The Information Universe of the Future will be a Web of Data
– This Web of Data is rapidly taking shape
– There are compelling use-cases
– Industrial take-up is beginning to happen
– We are building new infrastructure to deal with required scale

Contact Info
– Want to ask questions?
– Want to play with LarKC?
– Want to contribute plugins?
– Want to run a use-case?