Www.sti-innsbruck.at © Copyright 2012 STI INNSBRUCK www.sti-innsbruck.at Apache Stanbol.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

Co-funded by the European Union Semantic CMS Community Project Review Meeting Luxemburg, Knowledge Representation and Reasoning.
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
Semantic Indexing and Search for Content Management Systems Suat Gönül, SRDC A. Anil Sinaci, SRDC prepared by presented.
AHRT: The Automated Human Resources Tool BY Roi Ceren Muthukumaran Chandrasekaran.
SPICE! An Ontology Based Web Application By Angela Maduko and Felicia Jones Final Presentation For CSCI8350: Enterprise Integration.
Building and Analyzing Social Networks Web Data and Semantics in Social Network Applications Dr. Bhavani Thuraisingham February 15, 2013.
Research topics Semantic Web - Spring 2007 Computer Engineering Department Sharif University of Technology.
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
The user entered the query “What is the historical relation between Greek and Roma”. Here are the query’s results. The user clicked the topic “Roman copies.
AceMedia Personal content management in a mobile environment Jonathan Teh Motorola Labs.
IST NeOn-project.org The Semantic Web is growing… #SW Pages Lee, J., Goodwin, R. (2004) The Semantic.
Overview of Search Engines
© Copyright 2008 STI INNSBRUCK Rhizomer “The Rhizomer Semantic Content Management System” Roberto Garcia, Juan.
Using Java in Linked Data Applications Fuming Shih Oct 12.
Xpantrac connection with IDEAL Sloane Neidig, Samantha Johnson, David Cabrera, Erika Hoffman CS /6/2014.
MDC Open Information Model West Virginia University CS486 Presentation Feb 18, 2000 Lijian Liu (OIM:
Thesaurusmanagement Quickstart Introduction. What are controlled vocabularies? organized arrangement of words and phrases used to index content and/or.
Managing Large RDF Graphs (Infinite Graph) Vaibhav Khadilkar Department of Computer Science, The University of Texas at Dallas FEARLESS engineering.
What Can Do for You! Fabian Christ
PREMIS Tools and Services Rebecca Guenther Network Development & MARC Standards Office, Library of Congress NDIIPP Partners Meeting July 21,
Semantic Web. Course Content
BiodiversityWorld GRID Workshop NeSC, Edinburgh – 30 June and 1 July 2005 Metadata Agents and Semantic Mediation Mikhaila Burgess Cardiff University.
Co-funded by the European Union Semantic CMS Community Presentation and Interaction Components VIE.js Copyright IKS Consortium 1 Tilman Becker DFKI GmbH.
Survey of Semantic Annotation Platforms
Entity Recognition via Querying DBpedia ElShaimaa Ali.
Provenance Metadata for Shared Product Model Databases Etiel Petrinja, Vlado Stankovski & Žiga Turk University of Ljubljana Faculty of Civil and Geodetic.
GCMD/IDN STATUS AND PLANS Stephen Wharton CWIC Meeting February19, 2015.
Practical Project of the 2006 Joint International Master’s Degree.
University of Economics Prague Information Extraction (WP6) Martin Labský MedIEQ meeting Helsinki, 24th October 2006.
SemSearch: A Search Engine for the Semantic Web Yuangui Lei, Victoria Uren, Enrico Motta Knowledge Media Institute The Open University EKAW 2006 Presented.
11 CORE Architecture Mauro Bruno, Monica Scannapieco, Carlo Vaccari, Giulia Vaste Antonino Virgillito, Diego Zardetto (Istat)
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
A bad case of content reuse Validator Website to Validate License Violations Validator – Only requires the URI of the site to check This work by Oshani.
A bad case of content reuse Validator Website to Validate License Violations Validator – Only requires the URI of the site to check for a license violation.
2007. Software Engineering Laboratory, School of Computer Science S E Web-Harvest Web-Harvest: Open Source Web Data Extraction tool 이재정 Software Engineering.
Oracle Database 11g Semantics Overview Xavier Lopez, Ph.D., Dir. Of Product Mgt., Spatial & Semantic Technologies Souripriya Das, Ph.D., Consultant Member.
Technology – Broad View Aspects that play a role when integrating archives leave the details of some core topics to the 2. day Bernhard Neumair:Base Technologies.
Efficient RDF Storage and Retrieval in Jena2 Written by: Kevin Wilkinson, Craig Sayers, Harumi Kuno, Dave Reynolds Presented by: Umer Fareed 파리드.
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
© Copyright 2013 STI INNSBRUCK “How to put an annotation in HTML?” Ioannis Stavrakantonakis.
© Geodise Project, University of Southampton, Knowledge Management in Geodise Geodise Knowledge Management Team Barry Tao, Colin Puleston, Liming.
Semantic Enhancement: Key to Massive and Heterogeneous Data Pools Violeta Damjanovic, Thomas Kurz, Rupert Westenthaler, Wernher Behrendt, Andreas Gruber,
Trustworthy Semantic Webs Dr. Bhavani Thuraisingham The University of Texas at Dallas Lecture #4 Vision for Semantic Web.
User Profiling using Semantic Web Group members: Ashwin Somaiah Asha Stephen Charlie Sudharshan Reddy.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
112/14/2015 Discovery of Composable Web Services Presented by: Duygu ÇELİK Submitted by: Duygu ÇELİK & Vassilya ABDULOVA Submitted to: Assoc.Prof.Dr.Atilla.
Eurostat SDMX and Global Standardisation Marco Pellegrino Eurostat, Statistical Office of the European Union Bangkok,
Semantic web Bootstrapping & Annotation Hassan Sayyadi Semantic web research laboratory Computer department Sharif university of.
THE SEMANTIC WEB By Conrad Williams. Contents  What is the Semantic Web?  Technologies  XML  RDF  OWL  Implementations  Social Networking  Scholarly.
Trait ontology approach Marie-Angélique LAPORTE NCEAS June 7 th 2010.
Co-funded by the European Union Semantic CMS Community Reference Architecture for Semantic CMS Copyright IKS Consortium 1 Lecturer Organization Date of.
Steven Perry Dave Vieglais. W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics Overview WASABI is a framework for.
VIVO architecture March 1, Major Components Vitro is a general-purpose Web-based application leveraging semantic standards VIVO is a customized.
Semantic Interoperability in GIS N. L. Sarda Suman Somavarapu.
Sesame A generic architecture for storing and querying RDF and RDFs Written by Jeen Broekstra, Arjohn Kampman Summarized by Gihyun Gong.
Extended Metadata Registries and Semantics (Part 2: Implementation) Karlo Berket Ecoterm IV Environmental Terminology Workshop April 18, 2007 Diplomatic.
26/02/ WSMO – UDDI Semantics Review Taxonomies and Value Sets Discussion Paper Max Voskob – February 2004 UDDI Spec TC V4 Requirements.
Infrastructure and Workflow for the Formal Evaluation of Semantic Search Technologies Stuart N. Wrigley 1, Raúl García-Castro 2 and Cassia Trojahn 3 1.
An Alfresco Apache Stanbol Integration (port of OpenCalais Integration) Steve Reiner CTO Integrated Semantics.
Knowledge Representation and Reasoning in IKS
Laurea Magistrale in Scienze di Internet
Laurea Magistrale in Scienze di Internet
Analyzing and Securing Social Networks
Experience Management
Knowledge Based Workflow Building Architecture
PREMIS Tools and Services
LOD reference architecture
Chaitali Gupta, Madhusudhan Govindaraju
Lab 2: Information Retrieval
Presentation transcript:

© Copyright 2012 STI INNSBRUCK Apache Stanbol

Overview Features overview Components –Stanbol Content Enhancer –Stanbol Entity Hub –Stanbol Content Hub –Stanbol Ontology Technologies 2

Features Apache Stanbol provides a set of reusable components for semantic content management. Apache Stanbol's main features are: –Content Enhancement Services that add semantic information to “non-semantic” pieces of content.Content Enhancement –Reasoning Services that are able to retrieve additional semantic information about the content based on the semantic information retrieved via content enhancement.Reasoning –Knowledge Models Services that are used to define and manipulate the data models (e.g. ontologies) that are used to store the semantic information.Knowledge Models –Persistence Services that store (or cache) semantic information, i.e. enhanced content, entities, facts, and make it searchable.Persistence 3

Components Enhancer: Extracts Knowledge from parsed Content Entityhub: Manage Entities and Topics of Interest to your Domain Contenthub: Semantic Indexing / Search over your - semantic enhanced - Content CMS Adapter: Sync. your CMS with Apache Stanbol (JCR/CMIS) Ontology Manager: Manage you formal Domain Knowledge Reasoners & Rules: Apply Domain Knowledge to improve / validate extracted. Information. Refactor / refine knowledge to align it to public schemas such as schema.org 4

Stanbol Content Enhancer Entity Tagging - replacing text based tags such as "Bob Marley" with entities - dbpedia:Bob_Marley - to improve content search and categorization.dbpedia:Bob_Marley Entity Disambiguation - enhance the entity tagging experience by explicit support for disambiguation between different suggested entities. This allows users to explicitly link to Paris (Texas), Bob Marley (Comedian) or in between any other entities that do share similar labels. Entity Checker - interact with extracted entities similar as with todays spellchecker: Show extracted/suggested dirtily within the content; Allow users to interact with suggestions and to disambiguate between different matches if necessary; Support search for additional/other entities. 5

Stanbol Content Enhancer (II) 6

Stanbol Content Enhancer (III) Support for domain specific vocabularies 7

Stanbol Content Enhancer (IV) The following Languages are supported for Named Entity Recognition - and can therefore be used for Named entity Linking: –English (via NamedEntityTaggingEngine, OpenCalais)NamedEntityTaggingEngineOpenCalais –Spansh (via NamedEntityTaggingEngine, OpenCalais)NamedEntityTaggingEngineOpenCalais –Dutch ((via NamedEntityTaggingEngine)NamedEntityTaggingEngine –French (via CELI NER engine, OpenCalais)OpenCalais –Italien (via CELI NER engine) For the following languages NLP support is available to improve results when using the Keyword Extraction Engine: –Danish –Dutch –English –German –Portuguese –Spanish –Swedish 8

Stanbol Content Enhancer (V) 9

Stanbol Entity Hub Responsible for providing the information about Entities relevant to the users domain. The following figure tries to provide an overview about the features of the Entityhub. 10

Stanbol Content Hub Add Semantic Search to your CMS –RESTful Faceted Search Interface –Related Keyword Search using Entityhub, Ontonet or Wordnet –Improve Search by Semantic Indexing Use the Stanbol Contenthub for semantic indexing 11

Stanbol Ontology Manage your Ontologies –and use/combine them in Scopes Reasoning –on volatile Data loaded into a Sessions –consistency check / classification / enrichment –RDFS, OWL and OWL - 2 Support for background Jobs –for long running reasoning tasks 12

Stanbol Ontology 13

Stanbol Ontology (Rules) Stanbol Rules –Recipes: Manage a set of Rules that are executed together –Rules are converted to SWRL,Jena Rules or SPARQL CONSTRUCT depending on the available RuleEngine Typical Use Cases –integrity checks for imported Data –harmonize Vocabularies e.g. simple SEO by using schema.org 14

Technologies Functionalities are provided as RESTful services returning results as RDF (Resource Description Language) and JSON.RDFJSON –Apache Stanbol also supports the use of JSON-LD.JSON-LD Apache Stanbol can be run as a standalone application (packaged as a runable JAR) or as an web application (packaged as a WAR file) deployable in servlet containers such as Apache Tomcat. Written in Java based on the OSGi as component framework.OSGi Implemented using frameworks such as –Apache Solr - for semantic search; Apache Solr –Apache Tika - for plain text and metadata extraction; Apache Tika –Apache OpenNLP - for natural language processing; Apache OpenNLP –Apache Clerezza and Apache Jena - as RDF and storage frameworks; Apache ClerezzaApache Jena –Apache Felix as OSGi framework and Apache Felix –Apache Sling for deployment.Apache Sling 15

Technologies (II) Stanbol Components provide –RESTful API –Java API and OSGI services Stanbol Components do NOT depend on each other –however they can be easily combined 16

Live DEMO 17