UMBC an Honors University in Maryland 1 Adding Semantics to Social Websites for Citizen Science Pranam Kolari University of Maryland, Baltimore County.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

UMBC an Honors University in Maryland The Semantic Web … It Just Might Work. Joel Sachs Joint work with: Cyndy Parr, Andriy Parafiynyk,
UMBC an Honors University in Maryland Examples of Integrating Ecological Information on the Semantic Web Joel Sachs and Cynthia Simms Parr contact:
Social Media.
Jennifer A. Dunne Santa Fe Institute Pacific Ecoinformatics & Computational Ecology Lab Rich William, Neo Martinez, et al. Challenges.
Research topics Semantic Web - Spring 2007 Computer Engineering Department Sharif University of Technology.
What is the Internet? Internet: The Internet, in simplest terms, is the large group of millions of computers around the world that are all connected to.
UMBC AN HONORS UNIVERSITY IN MARYLAND Future Research Challenges and Needed Resources for The Web, Semantics and Data Mining Tim Finin UMBC, Baltimore.
CSCI 572 Project Presentation Mohsen Taheriyan Semantic Search on FOAF profiles.
Open Statistics: Envisioning a Statistical Knowledge Network Ben Shneiderman Founding Director ( ), Human-Computer Interaction.
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
Department of Computer Science, University of Maryland, College Park 1 Sharath Srinivas - CMSC 818Z, Spring 2007 Semantic Web and Knowledge Representation.
Samad Paydar Web Technology Laboratory Computer Engineering Department Ferdowsi University of Mashhad 1389/11/20 An Introduction to the Semantic Web.
WEB 2.0: Definitions, glossary, tools and uses. Use web 2.0 tools to create vibrant learning communities.
Web Content Management Systems. Lecture Contents Web Content Management Systems Non-technical users manage content Workflow management system Different.
Web 2.0: Concepts and Applications 2 Publishing Online.
PDF Wikispaces Blogging PBWorks You are now ready to cut the red ribbon and unveil your project to your intended audience.
Configuring Social Media, Google Analytics, and Gadgets Lila Bronson Training Manager, OmniUpdate, Inc.
Proprietary & Confidential The Thread That Ties it All Together Voicethread and Discovery Education Jennifer Dorman denblogs.com/jendorman.
Semantic Web outlook and trends May The Past 24 Odd Years 1984 Lenat’s Cyc vision 1989 TBL’s Web vision 1991 DARPA Knowledge Sharing Effort 1996.
Finding knowledge, data and answers on the Semantic Web
OCLC Online Computer Library Center CONTENTdm ® Digital Collection Management Software Ron Gardner, OCLC Digital Services Consultant ICOLC Meeting April.
 The ability to develop step by step procedures for solving problems  She uses algorithmic thinking by setting up her charts.
Lushan Han, Tim Finin, Cynthia Parr, Joel Sachs, and Anupam Joshi RDF123: from Spreadsheets to RDF.
What is the Internet? Internet: The Internet, in simplest terms, is the large group of millions of computers around the world that are all connected to.
Web 2.0: Concepts and Applications 2 Publishing Online.
PUBLISHING ONLINE Chapter 2. Overview Blogs and wikis are two Web 2.0 tools that allow users to publish content online Blogs function as online journals.
UMBC an Honors University in Maryland 1 Search Engines for Semantic Web Knowledge Tim Finin University of Maryland, Baltimore County Joint work with Li.
Research support was provided by NSF, award NSF-ITR-IIS , PI Tim Finin, UMBC. SPIRE Semantic Prototypes in Research Ecoinfomatics Approach We are.
Future Learning Landscapes Yvan Peter – Université Lille 1 Serge Garlatti – Telecom Bretagne.
Search Engine Architecture
UMBC an Honors University in Maryland 1 Search Engines for Semantic Web Knowledge Tim Finin University of Maryland, Baltimore County Joint work with Li.
UMBC an Honors University in Maryland 1 Information Integration and the Semantic Web Finding knowledge, data and answers Tim Finin University of Maryland,
You sexy beast. Ok, inappropriate. How about: Web of links to Web of Meaning Hello Semantic Web!
A Short Tutorial to Semantic Media Wiki (SMW) [[date:: July 21, 2009 ]] At [[part of:: Web Science Summer Research Week ]] By [[has speaker:: Jie Bao ]]
UMBC an Honors University in Maryland 1 Finding knowledge, data and answers on the Semantic Web Tim Finin University of Maryland, Baltimore County
UMBC an Honors University in Maryland 1 Information Integration and the Semantic Web Finding knowledge, data and answers Tim Finin 1, Anupam Joshi 1, Li.
UMBC an Honors University in Maryland 1 Using the Semantic Web to Support Ecoinformatics Andriy Parafiynyk University of Maryland, Baltimore County
User Profiling using Semantic Web Group members: Ashwin Somaiah Asha Stephen Charlie Sudharshan Reddy.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
ESIP Semantic Web Products and Services ‘triples’ “tutorial” aka sausage making ESIP SW Cluster, Jan ed.
1 © Xchanging 2010 no part of this document may be circulated, quoted or reproduced without prior written approval of Xchanging. MOSS Training – UI customization.
IBM Lotus Software © 2006 IBM Corporation IBM Lotus Notes Domino Blog Template Steve Castledine.
Web Design and Development. World Wide Web  World Wide Web (WWW or W3), collection of globally distributed text and multimedia documents and files 
Assembling Biological Inventories for Analysis Robert J. Meese, Ph.D. University of California, Davis (530) Presented by Andrea.
Web 2.0 Ali Ghandour Based on slides from: Clara Ko, EuropeanPWN Amsterdam.
UMBC an Honors University in Maryland 1 Finding and Ranking Knowledge on the Semantic Web Li Ding, Rong Pan, Tim Finin, Anupam Joshi, Yun Peng and Pranam.
Lessons learned from Semantic Wiki Jie Bao and Li Ding June 19, 2008.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
Kendra Hunter & Charde Johnson EDUC Dr. M. Kariuki.
Making Software Agents Smarter Tim Finin University of Maryland, Baltimore County ICAART 2010, 22 January 2010
UMBC an Honors University in Maryland 1 Searching for Knowledge and Data on the Semantic Web Tim Finin University of Maryland, Baltimore County
Selected Semantic Web UMBC CoBrA – Context Broker Architecture  Using OWL to define ontologies for context modeling and reasoning  Taking.
1 Web Services for Semantic Interoperability and Integration Tim Finin University of Maryland, Baltimore County Dagstuhl, 20 September 2004
Spire Semantic Prototypes In Ecoinformaics UMBC CS UMBC CS UMD MIND SWAP UMD MIND SWAP UMBC GEST UMBC GEST NASA GSFC NASA GSFC RMBL Peace RMBL Peace UC.
Chapter 8: Web Analytics, Web Mining, and Social Analytics
Swoogle: A Semantic Web Search and Metadata Engine Li Ding, Tim Finin, Anupam Joshi, Rong Pan, R. Scott Cost, Yun Peng Pavan Reddivari, Vishal Doshi, Joel.
Top 10 Technology Tools for Teaching and Learning
Overview Blogs and wikis are two Web 2.0 tools that allow users to publish content online Blogs function as online journals Wikis are collections of searchable,
Finding knowledge, data and answers on the Semantic Web
SWD = SWO + SWI SWD Rank SWD IR Engine
Web Services for Semantic Interoperability and Integration
Presented by ebiqity UMBC Nov, 2004
RDF123 RDF123 is an application and web service to generate RDF data from spreadsheets Graphically create/edit spreadsheet to RDF map MAP map + spreadsheet.
Visit Swoogle web site at
denblogs.com/jendorman
Overview Blogs and wikis are two Web 2.0 tools that allow users to publish content online Blogs function as online journals Wikis are collections of searchable,
Enabling Semantic Ecoblogging and Bioblitzes
OntoRank for RDF documents
Presented By S.Yamuna AP/CSE
Presentation transcript:

UMBC an Honors University in Maryland 1 Adding Semantics to Social Websites for Citizen Science Pranam Kolari University of Maryland, Baltimore County Joint work with Andriy Parafiynyk, Tim Finin, Cynthia Parr, Joel Sachs, and Lushan Han  This work was partially supported by DARPA contract F , NSF grants CCR and IIS

2 This talk Motivation Swoogle Semantic Web search engine Social Semantic Web Conclusions

3 Social media describes the online technologies and practices that people use to share opinions, insights, experiences, and perspectives and engage with each other. Wikipedia 07 SOCIAL MEDIA

4 Social Media for agents Today social media supports information sharing among communities of people - enables Citizen Journalism An infrastructure based on pings, feeds, content aggregators, and filters (e.g. pipes) aids scalability Social media now accounts for ~1/3 of new Web content! We need to explore how networks of agents can use the same strategies to share data and knowledge

5 This talk Motivation Swoogle Semantic Web search engine Social Semantic Web Conclusions

6 Google has made us smarter

7 But what about our agents? tell register Agents still have a very minimal understanding of text and images.

8 But what about our agents? A Google for knowledge on the Semantic Web is needed by software agents and programs Swoogle tell register

9 Running since summer M RDF docs, 434M triples, 10K ontologies, 15K namespaces, 1.5M classes, 185K properties, 49M instances, 800 registered users Running since summer M RDF docs, 434M triples, 10K ontologies, 15K namespaces, 1.5M classes, 185K properties, 49M instances, 800 registered users

10 Analysis Index Discovery IR Indexer Search Services Semantic Web metadata Web Service Web Server Candidate URLs Bounded Web Crawler Google Crawler SwoogleBot SWD Indexer Ranking document cache SWD classifier human machine htmlrdf/xml … the Web Semantic Web Information flowSwoogle‘s web interface Swoogle Architecture pings Archive

11 Applications and use cases Supporting Semantic Web developers –Ontology designers, vocabulary discovery, who’s using my ontologies or data?, use analysis, errors, statistics, etc. Searching specialized collections –Spire: aggregating observations and data from biologists –InferenceWeb: searching over and enhancing proofs –SemNews: Text Meaning of news stories Supporting SW tools –Triple shop: finding data for SPARQL queries 1 2 3

12 2 An NSF ITR collaborative project with University of Maryland, Baltimore County University of Maryland, College Park U. Of California, Davis Rocky Mountain Biological Laboratory An NSF ITR collaborative project with University of Maryland, Baltimore County University of Maryland, College Park U. Of California, Davis Rocky Mountain Biological Laboratory

13 An invasive species scenario Nile Tilapia fish have been found in a California lake. Can this invasive species thrive in this environment? If so, what will be the likely consequences for the ecology? So…we need to understand the effects of introducing this fish into the food web of a typical California lake

14 Food Webs A food web models the trophic (feeding) relationships between organisms in an ecology –Food web simulators explore consequences of ecological changes, i.e., species introduction or removal –Food web are constructed from studies of a location’s species inventory and the known trophic relations. Goal: automatically construct a food web for a new species using existing data and knowledge ELVIS: Ecosystem Location Visualization and Information System

15 East River Valley Trophic Web

16 The problem We have data on what species are known to be in the location and can further restrict and fill in with other ecological models => Maybe we can mine social media for species observations data? But we don’t know which of these the Nile Tilapia eats of who might eat it. We can reason from taxonomic data (similar species) and known natural history data (size, mass, habitat, etc.) to fill in the gaps.

17 Food Web Constructor Predict food web links using database and taxonomic reasoning. In an new estuary, Nile Tilapia could compete with ostracods (green) to eat algae. Predators (red) and prey (blue) of ostracods may be affected

18 Status ELVIS (Ecosystem Location Visualization and Information System) as an integrated set of web services for constructing food webs for a given location. Background ontologies –SpireEcoConcepts: concepts and properties to represent food webs, and ELVIS related tasks, inputs and outputs –ETHAN (Evolutionary Trees and Natural History) Concepts and properties for ‘natural history’ information on species derived from data in the Animal diversity web and other taxonomic sources. 250K classes on plants and animals

19 This talk Motivation Swoogle Semantic Web search engine Social Semantic Web Conclusions

20 Social media sites have become the biggest source of new content on the Web Blogs, Wikis, Photo sites, forums, etc. Accounting for ~1/3 of new Web content

21 Social media sites embrace new ways of letting users add semantic information Shows users the potential of semantics This graph shows the uptake of tags in blogs

22 Social Media and the Semantic Web Many are exploring how Semantic Web technology can work with social media Social media like blogs are typically temporally organized –valued for their timely and dynamic information! If static pages form the Web ’ s long term memory, then the Blogosphere is its stream of consciousness Maybe we can (1) help people publish data in RDF on their blogs, (2) mine social media sites for useful information, (3) exploit new infrastructure ideas for sharing Semantic Web data.

23 The OWL icon links to the data in RDF A BioBlitz involves going out to an area and recording every organism you see

24 Here’s the post’s RDF data

25 A good Semantic Web opportunity We want to make it easy for scientists to enter and collect information from social media –Professionals, students and amateurs! Some early examples –SPOTter – a tool to add Semantic Web data to blogs –Splickr – a system to mine Flickr for images of organisms –RDF123 – an application and Web service to render spreadsheets as RDF data

26 SPOTter: SPire Observation Tool We’ve developed some simple components to help people add RDF data to blogs and ping Swoogle to get it indexed. SPOTter is an initial prototype that uses the ETHAN ontology and is being used in some BioBlitz activities with students. We’re working toward a version that uses Twitter so that people can make the blog entries from the cell phones via SMS –The SPOTter agent will get the entries (via RSS) and index the data

27 SPOTter button Once entered, the data is embedded into the blog post and Swoogle is pinged to index it

28 Prototype SPOTter Search engine We can draw a bounding box on the map and find observations An RSS feed provided for each query

29 Flickr The Flickr “photo sharing” site has millions of photographs –Many of plants and animals Most of them have descriptions, timestamps, tags and even geo-tags –Flickr has even introduced “machine tags” that can be mapped into RDF Any Flickr users (humans or bots) can add comments and annotations There’s a good API It could be a good source of ecological information

30

31

32 Results for people and machines

33 RDF123 An application and web service to generate RDF data from spreadsheets MAP DATA Graphically create & edit spreadsheet to RDF map map + spreadsheet => RDF data Some metadata can Be embedded in spreadsheet CSV or Google doc See

34 RDF123 The Bioblitz project needed a way to collect and share observational data from students Spreadsheets selected as a common data format and templates developed RDF123 application and web service developed to ease exporting the data as RDF for a Maryland BioBlitz group –Supports a web service to generate RDF given URLs for the sheet and map –Works on CSV files and also Google spreadsheets

UMBC an Honors University in Maryland 35 A map provides a template for an RDF subgraph for each row

36 The map is also represented in RDF

37 Here’s the RDF that’s produced from the spreadsheet

38 Metadata, including the URI of a map, can be embedded in the spreadsheet

39 Ping and Feed Design Pattern The Web uses a ping and feed design pattern that is a variant of publish and subscribe It accounts for the scalable, smooth function of the Blogosphere and related social media systems Pings push and feeds pull We can use the same approach to managing volumes of Semantic Web data

40 Pings and Feeds in the Blogosphere Content provider send pings to ping servers when they have a new item Ping servers aggregate pings and stream them to aggregators and indexers, like Google Indexing sites retrieve new items from content provider’s feed Ping Server C1 C2 C3 pings Search Engine

41 Pings and Feeds in the Semantic Web Content provider send pings to ping-the- semantic-web when they have new RDF data PTSW aggregates pings and streams them to SW aggregators and indexers, like Swoogle Indexing sites retrieve new RDF data from content provider’s feed PTSW C1 C2 C3 pings Swoogle

42 Semantic Web Feeds drive Mashups As in the regular web, sites and query engines use feeds to capture queries Accessing a feed runs the query and produces a list of the first N results (usually 10 ≤ N ≤ 20) Such query feeds can drive mashups Systems like Yahoo pipes make it easy to compose feeds

43 This talk Motivation Swoogle Semantic Web search engine Social Semantic Web Conclusions

44 Conclusion The web will contain the world’s knowledge in forms accessible to people and computers –We need better ways to discover, index, search and reason over SW knowledge SW search engines address different tasks than html search engines –So they require different techniques and APIs Swoogle like systems can help create consensus ontologies and foster best practices Social media provide new challenges and opportunities for the Semantic Web

45 Annotated in OWL For more information