Some questions -What is metadata? -Data about data.


1 Some questions -What is metadata? -Data about data

2 Some questions -How do we know it is metadata? -Intuition or marked as metadata

3 Some questions -How does a machine know that it reads metadata? -Marked as metadata, formalized in e.g. RDF(S) or OWL
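
A minimal sketch of what "marked as metadata" can mean for a machine, assuming plain Python tuples stand in for RDF triples (subject, predicate, object). The `rdf:type` and `rdfs:subClassOf` URIs are the standard ones; the `http://example.org/` namespace, the sample resources, and the `types_of` helper are hypothetical and only illustrate the idea.

```python
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
RDFS_SUBCLASS_OF = "http://www.w3.org/2000/01/rdf-schema#subClassOf"
DC_TITLE = "http://purl.org/dc/elements/1.1/title"
EX = "http://example.org/"  # hypothetical namespace for this sketch

# Each statement is explicitly marked with a predicate the machine can recognize.
triples = {
    (EX + "slides", RDF_TYPE, EX + "Presentation"),
    (EX + "Presentation", RDFS_SUBCLASS_OF, EX + "Document"),
    (EX + "slides", DC_TITLE, "Some questions"),
}


def types_of(resource, triples):
    """Return the declared classes of a resource plus all their superclasses."""
    found = {o for s, p, o in triples if s == resource and p == RDF_TYPE}
    frontier = list(found)
    while frontier:
        cls = frontier.pop()
        for s, p, o in triples:
            if s == cls and p == RDFS_SUBCLASS_OF and o not in found:
                found.add(o)
                frontier.append(o)
    return found
```

Because the class hierarchy is itself written as triples, `types_of(EX + "slides", triples)` can conclude that the slides are also a `Document` without that fact being stated directly.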

4 Some questions -How can we extract metadata? -Manually -Known places in structured documents
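
One way to read metadata from "known places" in a structured document is sketched below, assuming the document is HTML and the known places are the `<title>` element and `<meta name="..." content="...">` tags. The `MetaExtractor` class, the `extract_metadata` function, and the sample page (including its author name) are made up for illustration; only the standard-library `html.parser` module is used.

```python
from html.parser import HTMLParser


class MetaExtractor(HTMLParser):
    """Collects the <title> text and <meta name=... content=...> pairs."""

    def __init__(self):
        super().__init__()
        self.metadata = {}
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "title":
            self._in_title = True
        elif tag == "meta" and "name" in attrs and "content" in attrs:
            self.metadata[attrs["name"]] = attrs["content"]

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.metadata["title"] = self.metadata.get("title", "") + data


def extract_metadata(html):
    parser = MetaExtractor()
    parser.feed(html)
    parser.close()
    return parser.metadata


# Made-up sample document for the sketch.
SAMPLE = """<html><head>
<title>Some questions</title>
<meta name="author" content="Jane Doe">
<meta name="keywords" content="metadata, ontology">
</head><body><p>Slide text</p></body></html>"""
```

Calling `extract_metadata(SAMPLE)` returns a dictionary with the keys `title`, `author`, and `keywords`; the body text is ignored because it is not in one of the known places.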

5 Some questions -How can we use metadata? -Annotate data -Find relationships (later)

6 Some questions -How do we annotate data with metadata? -Manually (e.g. write XML tags) -Identify instances automatically, then machine annotates
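
The automatic route (identify instances, then let the machine write the tags) could look roughly like this, assuming instances are recognized by exact lookup in a small table of known names. The `KNOWN_INSTANCES` table, its class labels, and the `annotate` function are hypothetical; real systems use far more robust recognition.

```python
import re
from xml.sax.saxutils import escape

# Hypothetical lookup table: surface form -> class label used as the XML tag.
KNOWN_INSTANCES = {
    "Christopher Thomas": "Person",
    "LSDIS": "Organization",
}


def annotate(text, instances=KNOWN_INSTANCES):
    """Wrap every known instance found in text in an XML tag named after its class."""
    # Longest names first so a longer match wins over a shorter overlapping one.
    names = sorted(instances, key=len, reverse=True)
    pattern = re.compile("|".join(re.escape(n) for n in names))
    parts, last = [], 0
    for match in pattern.finditer(text):
        parts.append(escape(text[last:match.start()]))
        label = instances[match.group(0)]
        parts.append(f"<{label}>{escape(match.group(0))}</{label}>")
        last = match.end()
    parts.append(escape(text[last:]))
    return "".join(parts)
```

For example, `annotate("Christopher Thomas appears in the LSDIS ontology.")` wraps the name in `<Person>` tags and "LSDIS" in `<Organization>` tags, leaving the rest of the sentence unchanged.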

7 Some questions -Problems with automatic identification -Disambiguation -Same name, different entities -Which “Christopher Thomas”? -Same entity, different role -“Christopher Thomas” can be an entity in the LSDIS ontology and also in the Friendster FOAF ontology. Not yet merged.
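
The "same name, different entities" problem can be made visible with a small sketch, assuming each entity carries an identifier in addition to its display name. The identifiers, the extra "Jane Doe" record, and the `ambiguous_names` function are invented for illustration; the sketch only detects ambiguity and does not resolve or merge anything.

```python
from collections import defaultdict

# Hypothetical records: (identifier, display name, source ontology).
ENTITIES = [
    ("lsdis:person/42", "Christopher Thomas", "LSDIS"),
    ("foaf:user/7", "Christopher Thomas", "Friendster FOAF"),
    ("lsdis:person/43", "Jane Doe", "LSDIS"),
]


def ambiguous_names(entities):
    """Return names that refer to more than one identifier."""
    by_name = defaultdict(set)
    for identifier, name, _source in entities:
        by_name[name].add(identifier)
    return {name: sorted(ids) for name, ids in by_name.items() if len(ids) > 1}
```

Here `ambiguous_names(ENTITIES)` reports "Christopher Thomas" with two identifiers. Whether those two records describe one person in two roles or two different people cannot be decided from the name alone.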

8 Taxonomies -What is a taxonomy? -From Greek ταξινομία from the words taxis = order and nomos = law -Hierarchical classification of things -Mathematically, a taxonomy is a tree structure of classifications for a given set of objects
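
The tree view of a taxonomy can be sketched as a mapping from each class to its single parent. The animal classes below are made-up sample data, and `path_to_root` and `is_a` are hypothetical helper names.

```python
# Each class has exactly one parent; the root has None. This makes the structure a tree.
PARENT = {
    "Animal": None,
    "Mammal": "Animal",
    "Bird": "Animal",
    "Dog": "Mammal",
    "Cat": "Mammal",
}


def path_to_root(node, parent=PARENT):
    """Return the chain of classes from node up to the root of the taxonomy."""
    path = [node]
    while parent[path[-1]] is not None:
        path.append(parent[path[-1]])
    return path


def is_a(node, ancestor, parent=PARENT):
    """True if ancestor is node itself or lies on its path to the root."""
    return ancestor in path_to_root(node, parent)
```

With this data, `path_to_root("Dog")` walks `Dog`, `Mammal`, `Animal`, and `is_a("Bird", "Mammal")` is false because the two classes sit on different branches.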

9 Ontologies -What is an Ontology? -In computer science, an ontology is the attempt to formulate an exhaustive and rigorous conceptual schema within a given domain: a typically hierarchical data structure containing all the relevant entities and their relationships and rules (theorems, regulations) within that domain

10 Machine Learning -What is Machine Learning? -an area of artificial intelligence concerned with the development of techniques which allow computers to "learn"

11 Machine Learning techniques –supervised learning --- where the algorithm generates a function that maps inputs to desired outputs. One standard formulation of the supervised learning task is the classification problem: the learner is required to learn (to approximate the behavior of) a function which maps a vector into one of several classes by looking at several input-output examples of the function.
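
The classification formulation above can be sketched with a 1-nearest-neighbor learner, chosen here only because it is short; the slides do not prescribe a particular algorithm at this point. The training vectors, the labels "small" and "large", and the function names are made up.

```python
import math


def nearest_neighbor_classifier(examples):
    """Build a function mapping a vector to the label of its closest training example.

    examples is a list of (vector, label) pairs: the input-output examples.
    """
    def classify(vector):
        _, label = min(examples, key=lambda ex: math.dist(ex[0], vector))
        return label

    return classify


# Made-up input-output examples: 2-D vectors with one of two class labels.
TRAINING = [
    ((1.0, 1.0), "small"),
    ((1.5, 2.0), "small"),
    ((8.0, 8.0), "large"),
    ((9.0, 7.5), "large"),
]

classify = nearest_neighbor_classifier(TRAINING)
```

The returned `classify` function is the learned approximation: it was never given a rule, only examples, yet it assigns a class to vectors it has not seen, such as `classify((2.0, 1.0))`.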

12 Machine Learning techniques –unsupervised learning --- which models a set of inputs: labeled examples are not available. –reinforcement learning --- where the algorithm learns a policy of how to act given an observation of the world. Every action has some impact in the environment, and the environment provides feedback that guides the learning algorithm.

13 Classification –Supervised Learning –Reinforcement Learning –Artificial Neural Networks –Nearest Neighbor/Bayesian approaches Group entities around a point of reference

14 Machine Learning techniques Clustering –Unsupervised –Try to find functions that split a dataset in a meaningful way –Needs an evaluation function that tells what is meaningful and what is not.
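
A minimal sketch of clustering driven by an evaluation function, assuming one-dimensional data and a split into exactly two groups. The evaluation function used here, the within-group sum of squared errors, is one common choice and an assumption of this sketch, as are the sample values and function names.

```python
def within_group_sse(groups):
    """Evaluation function: total squared distance of values to their group mean.

    Lower scores mean tighter, and in this sense more meaningful, groups.
    """
    total = 0.0
    for group in groups:
        mean = sum(group) / len(group)
        total += sum((x - mean) ** 2 for x in group)
    return total


def best_split(values):
    """Try every threshold split of the sorted values and keep the best-scoring one."""
    ordered = sorted(values)
    best = None
    for i in range(1, len(ordered)):
        left, right = ordered[:i], ordered[i:]
        score = within_group_sse([left, right])
        if best is None or score < best[0]:
            threshold = (left[-1] + right[0]) / 2
            best = (score, threshold, left, right)
    return best


# Made-up unlabeled data with two visible groups.
DATA = [1.0, 1.2, 0.8, 9.0, 9.5, 10.1]
score, threshold, left, right = best_split(DATA)
```

No labels are supplied; the split between the low and the high values is found only because the evaluation function scores it better than every other candidate split.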

