Presentation is loading. Please wait.

Presentation is loading. Please wait.

Almaden Services Research Almaden Research Center, San Jose, CA 6 July 2006 Ontologizing the ONTOLOG content Tools, techniques, and approaches panel E.

Similar presentations


Presentation on theme: "Almaden Services Research Almaden Research Center, San Jose, CA 6 July 2006 Ontologizing the ONTOLOG content Tools, techniques, and approaches panel E."— Presentation transcript:

1 Almaden Services Research Almaden Research Center, San Jose, CA 6 July 2006 Ontologizing the ONTOLOG content Tools, techniques, and approaches panel E. Michael (Max) Maximilien Almaden Services Research

2 Almaden Services Research 2 Almaden Research Center, San Jose, CA6 July 2006 Agenda Todays web ONTOLOG community of practice What have we done so far? Where do we want to go? Introduce panelists –Scott Spangler, IBM Almaden Services Research –Peter Mika, Free University of Amsterdam –Conor Shankey, VisualKnowledge, Inc., Vancouver, Canada –John Boz Handy-Bosma, IBM Global Services Initial questions

3 Almaden Services Research 3 Almaden Research Center, San Jose, CA6 July 2006 Todays web User contributed content, e.g., blogs, audio and video podcasts –Highly unstructured –Highly idiosyncratic Collaborative platforms for content creation, e.g., wikis Idiosyncratic web resource annotations leads to folksonomies Wisdom of the crowds –Instant feedback –Issues with collusion and smart mob effect [H. Rheingold, 2002] –Ratings and reputation, e.g., digg.com, eBay.com Exposing content and processes as services –RSS feeds for data aggregation –SOAP, REST, and other services for data and process programming

4 Almaden Services Research 4 Almaden Research Center, San Jose, CA6 July 2006 ONTOLOG community of practice Experts in formal and semi-formal knowledge representation Highly educated researchers – academia and industry Content contribution – structured but diverse –Wiki pages for presentation preparation and summary –Presentations (PPT and PDF) by participants and moderators –MP3 audio files of sessions – aggregated as podcast feeds –Biographies of members and participants Can we use our own knowledge and approaches to give structure to ONTOLOG body of knowledge?

5 Almaden Services Research 5 Almaden Research Center, San Jose, CA6 July 2006 What have we done so far? Discussed use cases Discussed architectural approaches Have an initial taxonomy [ Bedford and Smith] Started discussing tools and approaches Not clear what exactly we want to achieve? However, common direction seems to be to: –Achieve a better categorization of content –Invite user participation (a la web 2.0) –Take advantage of ontology expertise –Reconcile web 2.0 loose semantic efforts with more formalize approaches

6 Almaden Services Research 6 Almaden Research Center, San Jose, CA6 July 2006 Where do we want to go? For todays panel we have a good cross-section – includes experts –Unstructured text mining and automated taxonomy and categorization –Folksonomies and original research in reconciling ontologies and social networks –Research semantic tools –Semantic wikis and tools –Faceted-based logic, search, and associated tools Audience can contribute ontologists view Previous talk (Tim Redmond) discussed the Protégé tool and how to extend it Future talk (Pat Cassidy) will discuss more formalized approaches to ontological engineering

7 Almaden Services Research 7 Almaden Research Center, San Jose, CA6 July 2006 Scott Spangler, IBM Almaden Services Research, San Jose, CA Senior Technical Staff member at IBM Almaden Services Research 15 years developing applications for –Statistical data analysis –Knowledge-based systems –Text mining –Business intelligence (BI) 14 patents in text mining and BI BS in mathematics from MIT Masters in computer science from UT Austin

8 Almaden Services Research 8 Almaden Research Center, San Jose, CA6 July 2006 Peter Mika, Free University of Amsterdam, Netherlands Ph.D. candidate in computer science at Free University, Amsterdam, The Netherlands –Social networks and folksonomy –Semantic Web and ontology Best paper award at ISWC 2005 for paper entitled Ontologies are us: A unified model of social networks and semantics Winner of Semantic Web Challenge at ISWC 2004 for Flink systemFlink Co-chair of Semantic Web Challenge 2006 Author of various semantic related tool and research –openacademia.org, Elmo, SWAP, WonderWeb, OnToKnowledge, and moreopenacademia.orgElmo See Peters Web site:

9 Almaden Services Research 9 Almaden Research Center, San Jose, CA6 July 2006 Conor Shankey, VisualKnowledge, Vancouver, Canada CEO of VisualKnowledge Enterprise class ontology lifecycle management platform –Flexible metadata support, e.g., OWL and RDF –Ontology federation –Support for transactions and multithreading –Pluggable micro-inference engine Long history of successful ontological applications Various successful projects using VisualKnowledge tools

10 Almaden Services Research 10 Almaden Research Center, San Jose, CA6 July 2006 John Boz Handy-Bosma, IBM Global Services, Austin, TX IBM Master Inventor –More than 60 patents issued or pending –15 th plateau level –Recipient of several IBM awards for inventions incorporated into products Expertise –Faceted-based logic –Application of facet-based search Senior IT Architect, IBM Austin Assignee to IBM Almaden Services Research Visiting Faculty, University of Texas at Austin

11 Almaden Services Research 11 Almaden Research Center, San Jose, CA6 July 2006 Initial questions Is automated taxonomy and categorization of unstructured text good enough? What are best text mining techniques for wiki-based content? What active or passive role do you see for users of the ONTOLOG forum to help better categorize content? Importance of user ratings? Do we need more active users in order for ratings to reflect wisdom? How can facetted-logic and -search help? Role of ontological engineering tools? Role and application of web 2.0 tools, services, and techniques? E.g., podzinger.com

12 Almaden Services Research 12 Almaden Research Center, San Jose, CA6 July 2006 Thank You Merci Grazie Gracias Obrigado Danke Japanese English French Russian German Italian Spanish Brazilian Portuguese Arabic Traditional Chinese Simplified Chinese Hindi Tamil Thai Korean

13 Almaden Services Research 13 Almaden Research Center, San Jose, CA6 July 2006 Backup slides

14 Almaden Services Research 14 Almaden Research Center, San Jose, CA6 July 2006 Approach and thesis Two key issues in just ontologizing content –Lack of pragmatism in the goals of ontologies –Heterogeneity of usage and use cases Summary of approach –Simple tagging for human collaboration (folksonomies) as well as rating systems for contents parts –Convert audio automatically into annotated text transcripts –Mining tools to automate annotation of content and infer taxonomies –Ontology for outline of content Secret sauce is in how we combine the semantics, i.e., algorithms, to solve the use cases

15 Almaden Services Research 15 Almaden Research Center, San Jose, CA6 July 2006 Tagging and ratings – Human collaboration Tagging –Idiosyncratic –Results in bag of tags forming folksonomies –Various available services, e.g., and so onhttp://del.icio.ushttp://flikr.com –Need incentives for humans, e.g., easier search –Evolving into some form of ontology (see Peter Mikas paper Ontologies are us: A unified model of social networks and semantics at ICSW 2005) Ratings –Enables feedback –Rate the ratings to avoid collusion –Similar to Amazons rating system, and eBay.coms reputation system (various works in literature)http://digg.com

16 Almaden Services Research 16 Almaden Research Center, San Jose, CA6 July 2006 Audio content Automated transcript –Use Web services to convert audio to text transcript –Some Web services, e.g., also annotate the transcript and do more than close captionshttp://podzinger.com –May involve human collaboration to gradually improve content (especially resolving context errors) Issues –ONTOLOG audio (Podcast) have some low quality MP3s –Static noise and voice storms

17 Almaden Services Research 17 Almaden Research Center, San Jose, CA6 July 2006 Mining Automatic annotation of content –Mature tool set in UIMA –Others (?) –Generate initial taxonomy –Continual process to update annotation Dr. David Ferrucci (IBM Research) lead architect of UIMA project to present to community on May 11, 2006

18 Almaden Services Research 18 Almaden Research Center, San Jose, CA6 July 2006 Ontology Outline –Create initial outline of site content with some ontology –Reuse existing ontology IMO this ontology can be specific to ONTOLOG and therefore not necessarily a upper ontology What are the primary goals for this outline and ontology? –Cataloguing –Search (why not just use Google services?) –Statistics (why not just use Amazons Alexa services?) –Others (?)


Download ppt "Almaden Services Research Almaden Research Center, San Jose, CA 6 July 2006 Ontologizing the ONTOLOG content Tools, techniques, and approaches panel E."

Similar presentations


Ads by Google