Syntax and semantics >AMYLASEE1 TGCATNGY A very simple FASTA file.

Slides:



Advertisements
Similar presentations
DC Architecture WG meeting Monday Sept 12 Slot 1: Slot 2: Location: Seminar Room 4.1.E01.
Advertisements

Introduction to linked data Gordon Dunsire Presented at the Cataloguing and Indexing Group Scotland seminar Linked data and the Semantic Web: what have.
An introduction to RDF and library linked data Gordon Dunsire Presented at the Dewey Decimal Classification Executive Briefing 15 Sep 2011, London.
Semantic Web Thanks to folks at LAIT lab Sources include :
The Semantic Web – WEEK 4: RDF
CS570 Artificial Intelligence Semantic Web & Ontology 2
Using the Semantic Web to Construct an Ontology- Based Repository for Software Patterns Scott Henninger Computer Science and Engineering University of.
Ontology Notes are from:
Ontologies and the Semantic Web by Ian Horrocks presented by Thomas Packer 1.
CS 330 Programming Languages 09 / 13 / 2007 Instructor: Michael Eckmann.
CS 330 Programming Languages 09 / 18 / 2007 Instructor: Michael Eckmann.
RDF: Data Description With HTML, the Web is for reading With XML, the Web is for processing Necessary to know  who wrote this?  who owns it?  who authorised.
Module 2b: Modeling Information Objects and Relationships IMT530: Organization of Information Resources Winter, 2007 Michael Crandall.
The Semantic Web An Emerging Successor of the Web By Tim Berners-Lee, James Hendler and Ora Lassila Presentedby Veera Chandra Sekhar.
Introduction to XML This material is based heavily on the tutorial by the same name at
4/20/2017.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Nancy Ide Vassar College USA Resource Definition Framework A Tutorial EUROLAN 2003 July 28 - August 8 Bucharest - Romania.
Metadata Standards and Applications 4. Metadata Syntaxes and Containers.
Amarnath Gupta Univ. of California San Diego. An Abstract Question There is no concrete answer …but …
XP New Perspectives on XML Tutorial 4 1 XML Schema Tutorial – Carey ISBN Working with Namespaces and Schemas.
RDF: Concepts and Abstract Syntax W3C Recommendation 10 February Michael Felderer Digital Enterprise.
RDF (Resource Description Framework) Why?. XML XML is a metalanguage that allows users to define markup XML separates content and structure from formatting.
INF 384 C, Spring 2009 Ontologies Knowledge representation to support computer reasoning.
Logics for Data and Knowledge Representation
ISBD for the Semantic Web: namespaces, elements, vocabularies, application profile Gordon Dunsire Presented at Centar zu Stalno Stručno Usavršavanje (CSSU),
By: Dan Johnson & Jena Block. RDF definition What is Semantic web? Search Engine Example What is RDF? Triples Vocabularies RDF/XML Why RDF?
Integrating Live Plant Images with Other Types of Biodiversity Records Steve Baskauf Vanderbilt Dept. of Biological Sciences
Of 33 lecture 3: xml and xml schema. of 33 XML, RDF, RDF Schema overview XML – simple introduction and XML Schema RDF – basics, language RDF Schema –
The LOM RDF binding – update Mikael Nilsson The Knowledge Management.
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
Information Interchange on the Semantic Web an interactive talk by Piotr Kaminski, University of Victoria
ISBN Chapter 3 Describing Semantics -Attribute Grammars -Dynamic Semantics.
Of 41 lecture 4: rdf – basics and language. of 41 RDF basic ideas the fundamental concepts of RDF  resources  properties  statements ece 720, winter.
Semantic Web - an introduction By Daniel Wu (danielwujr)
RDF and XML 인공지능 연구실 한기덕. 2 개요  1. Basic of RDF  2. Example of RDF  3. How XML Namespaces Work  4. The Abbreviated RDF Syntax  5. RDF Resource Collections.
Semantic web course – Computer Engineering Department – Sharif Univ. of Technology – Fall Knowledge Representation Semantic Web - Fall 2005 Computer.
The Semantic Web and expert metadata: pull apart then bring together Presented at 12.seminar Arhivi, Knjižnice, Muzeji Nov 2008, Pore č, Croatia.
Chapter 7: Resource Description Framework (RDF) Service-Oriented Computing: Semantics, Processes, Agents – Munindar P. Singh and Michael N. Huhns, Wiley,
EEL 5937 Ontologies EEL 5937 Multi Agent Systems Lecture 5, Jan 23 th, 2003 Lotzi Bölöni.
Tutorial 13 Validating Documents with Schemas
SPINNING THE SEMANTIC WEB APPLICATIONS FOR THE MODERN ERA LIBRARIES
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
Understanding RDF. 2/30 What is RDF? Resource Description Framework is an XML-based language to describe resources. A common understanding of a resource.
Chapter 7: Resource Description Framework (RDF) Service-Oriented Computing: Semantics, Processes, Agents – Munindar P. Singh and Michael N. Huhns, Wiley,
Internet & World Wide Web How to Program, 5/e. © by Pearson Education, Inc. All Rights Reserved.2.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Metadata : an overview XML and Educational Metadata, SBU, London, 10 July 2001 Pete Johnston UKOLN, University of Bath Bath, BA2 7AY UKOLN is supported.
Semantic Phyloinformatic Web Services Using the EvoInfo Stack Speaker: John Harney LSDIS Lab, Dept. of Computer Science, University of Georgia Mentor(s):
Problems with XML & XML Schemas XML falls apart on the Scalability design goal. 1.The order in which elements appear in an XML document is significant.
CS621 : Artificial Intelligence Pushpak Bhattacharyya CSE Dept., IIT Bombay Lecture 12 RDF, OWL, Minimax.
Working with XML. Markup Languages Text-based languages based on SGML Text-based languages based on SGML SGML = Standard Generalized Markup Language SGML.
Semantic Web 06 T 0006 YOSHIYUKI Osawa. Problem of current web  limits of search engines Most web pages are only groups of character strings. Most web.
Chapter 5 The Semantic Web 1. The Semantic Web  Initiated by Tim Berners-Lee, the inventor of the World Wide Web.  A common framework that allows data.
Semantic Web In Depth Resource Description Framework Dr Nicholas Gibbins –
26/02/ WSMO – UDDI Semantics Review Taxonomies and Value Sets Discussion Paper Max Voskob – February 2004 UDDI Spec TC V4 Requirements.
Setting the stage: linked data concepts Moving-Away-From-MARC-a-thon.
Linked Data & Semantic Web Technology The Semantic Web Part 4. Resource Description Framework (1) Dr. Myungjin Lee.
OWL (Ontology Web Language and Applications) Maw-Sheng Horng Department of Mathematics and Information Education National Taipei University of Education.
Service-Oriented Computing: Semantics, Processes, Agents
The Semantic Web By: Maulik Parikh.
Building the Semantic Web
Service-Oriented Computing: Semantics, Processes, Agents
Service-Oriented Computing: Semantics, Processes, Agents
XML QUESTIONS AND ANSWERS
Grid Computing 7700 Fall 2005 Lecture 18: Semantic Grid
Ontology.
Grid Computing 7700 Fall 2005 Lecture 18: Semantic Grid
Ontology.
A Knowledge Representation Language for Internet Applications
Presentation transcript:

Syntax and semantics >AMYLASEE1 TGCATNGY A very simple FASTA file

FASTA syntax >AMYLASEE1 TGCATNGY FASTA syntax in Backus-Naur notation ::= | ::= “>” ::= { } ::= | ::= | “” Identifier Sequence A very simple FASTA file

The syntax rules do not fix meaning >AMYLASEE1 TGCATNGY Do I denote a protein, “Amylase E1”, or a person, “Amy Lasee, I”? What kind of sequence am I? TGCATNGY = Threonine-Glycine-Cysteine-Alanine-Threonine... TGCATNGY = Thymine-Guanosine-Cytosine-Adenine-Thymine... What is the relationship between the identifier and the sequence? Is Amy Lasee the sample donor? The experimenter? The owner? Is Amylase E1 a gene or a protein name? Is it arbitrary? Is it unique?

... but a fixed meaning requires clear syntax >AMYLASEE1 TGCATNGY AMYLASEE1 TGCATNGY So the first step to fixing the semantics is to make the syntax more explicit. XML can help. An xml-ified version of FASTA

Extensible syntax >AMYLASEE1|NP_523768|GO_ TGCATNGY AMYLASEE1 NP_ TGCATNGY Its easy to expand the XML unambiguously to include other elements that may be useful (GO term, GB identifier) but this is all just associating strings with other strings.

Implicit vs. explicit semantics >AMYLASEE1 TGCATNGY AMYLASEE1 NP_ TGCATNGY AMYLASEE1 NP_ TGCATNGY Notice that the semantics are implicit in the tags. To see this, lets replace “go_function” with “string3”. The two versions are syntactically identical. What differentiates them? The answer is that a human expert supplies the semantics by recognizing “go_function” as a reference to Gene Ontology molecular function annotations, whereas “string3” means nothing to us, though it has the same value.

Referring to the source AMYLASEE1 TGCATNGY So, lets try something new-- we will make a direct informational link to the GO concept. This gives us a human-readable definition that seems to resolve the “amylase vs. Amy Lasee” question, and it gives us machine-accessible relations, e.g., a machine can navigate the GO hierarchy to learn that amylase activity is_a glycogenase activity. But there is still something missing...

Making implicit semantics explicit How did we know how to interpret this? To a computer, the tagged values are just three different strings, with no semantics. However, an expert human can supply semantics by combining background knowledge with cues hidden in the tags. In this case, we infer that string3 is an URL with the GO function for this sequence. To specify the same meaning to a computer, we need to make explicit many things: 1.That string3 is operationally a URL (subject to URL protocols) 2.That this URL is the source of a thing that is an ontology concept 3.That the concept (“foo456”) is associated with the entity “foo123” 4.That foo456 is_function_of foo123 (or, foo123 has_function foo456) AMYLASEE1 TGCATNGY

Describing the world with triples In other words, we want to specify a subject-predicate-object triple: Sequence{ name=“amylaseE1” sequence=“TGCATNGY” } alpha amylase activity has_molecular_function subjectobject predicate Anyone Can Make Statements About Any Resource To facilitate operation at Internet scale, RDF is an open-world framework that allows anyone to make statements about any resource. In general, it is not assumed that complete information about any resource is available. RDF does not prevent anyone from making assertions that are nonsensical or inconsistent with other statements, or the world as people see it. Designers of applications that use RDF should be aware of this and may design their applications to tolerate incomplete or inconsistent sources of information. From the RDF spec:

Specifying an RDF triple <fasta_archive xmlns:rdf=“ xmlns:bfo=“ > AMYLASEE1 TGCATNGY The ultra-succinct form of the RDF triple syntax is or in more familiar language we might say

Using nexml syntax Not finished. Nexml provides 2 ways to express semantics: 1.Built-in links to CDAO (SAWSDL links in schema) 2.Ad hoc references to external namespaces in

Built-in links to CDAO Not finished. Example (“Edge”) of SAWSDL links in schema

References to external namespaces using elements Not finished. Will explain some examples from wiki.

some things to express (see wiki) Attaching a concept to an element Attaching annotation or an external resource to an element Attaching a concept to an element through a relation Attaching a taxon identifier to an OTU through a relation Identifying specimens within collections Literature References Example 1: associate a reference with a tree (or other) element Example 2: associate a reference with a record Associating an OBO phenotype with a character state

What can I do with semantics? Not finished. 1 thing to do is to make semantics clear to human users. Another thing is to make this accessible to computers. What can the computers do? If you have software to read your files and reconstruct the RDF triples as statements in the ontology language, then you can carry out reasoning in the ontology language. Examples (taxonomy; types of chars; anatomical relations of chars)