Some facets of knowledge management in mathematics Wolfram Sperber (Zentralblatt Math) Patrick Ion (Math Reviews) Facets of Knowledge Organization A tribute.

Slides:



Advertisements
Similar presentations
Ontology Assessment – Proposed Framework and Methodology.
Advertisements

Serving the interests of e-resource users in Research, Higher and Further Education Libraries ELIN: Electronic Library Information Navigator Ian Mayfield.
THE STEPS OF SEARCH You have opened a new veterinary clinic in a small town, and want people in the vicinity to know about it. You need some new ideas.
Developing a Metadata Exchange Format for Mathematical Literature David Ruddy Project Euclid Cornell University Library DML 2010 Paris 7 July 2010.
Query Languages. Information Retrieval Concerned with the: Representation of Storage of Organization of, and Access to Information items.
An Introduction to MODS: The Metadata Object Description Schema Tech Talk By Daniel Gelaw Alemneh October 17, 2007 October 17, 2007.
2012 Zentralblatt MATH / ZBMath Training Guide. Training Guide Zentralblatt MATH / ZBMath | 2012 | 2 Why an Abstracting and Indexing Service in Mathematics?
Corey A Harper DC2006 October 4, 2006 Authority Control for the Semantic Web Encoding Library of Congress Subject Headings (LCSH) in SKOS.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
IR Models: Structural Models
Introduction to Library Research Gabriela Scherrer Reference Librarian for English Languages and Literatures, University Library of Bern.
Copyright: Bruno Buchberger “Reading” Working with the Literature Bruno Buchberger Part of the Block Course „Working Techniques“ in the Frame of.
Introduction to Library Research Gabriela Scherrer Reference Librarian for English Languages and Literatures, University Library of Bern.
MANISHA VERMA, VASUDEVA VARMA PATENT SEARCH USING IPC CLASSIFICATION VECTORS.
Aligning Thesauri for an integrated Access to Cultural Heritage Collections Antoine ISAAC (including slides by Frank van Harmelen) STITCH Project UDC Conference.
1 Using Scopus for Literature Research. 2 Why Scopus?  A comprehensive abstract and citation database of peer- reviewed literature and quality web sources.
Antonella De Robbio, Dario Maguolo Mathematics Library – University Library System University of Padova – ITALY Mathematics Subject Classification and.
The RDF meta model: a closer look Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations.
CSE 730 Information Retrieval of Biomedical Data The use of medical lexicon in biomedical IR.
SemanTic Interoperability To access Cultural Heritage Frank van Harmelen Henk Matthezing Peter Wittenburg Marjolein van Gendt Antoine Isaac Lourens van.
GL12 Conf. Dec. 6-7, 2010NTL, Prague, Czech Republic Extending the “Facets” concept by applying NLP tools to catalog records of scientific literature *E.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
PREMIS Tools and Services Rebecca Guenther Network Development & MARC Standards Office, Library of Congress NDIIPP Partners Meeting July 21,
Introduction to Library Research Gabriela Scherrer Reference Librarian for English Languages and Literatures, University Library of Bern.
Robert Roggenbuck Universität Osnabrück/ IWI Osnabrück Wolfram Sperber Konrad-Zuse- Zentrum für Informationstec hnik Berlin (ZIB) Osnabrück,
The Metadata Object Description Schema (MODS) NISO Metadata Workshop May 20, 2004 Rebecca Guenther Network Development and MARC Standards Office Library.
LIS510 lecture 3 Thomas Krichel information storage & retrieval this area is now more know as information retrieval when I dealt with it I.
Information Systems & Semantic Web University of Koblenz ▪ Landau, Germany Semantic Web - Multimedia Annotation – Steffen Staab
Chapter 1: “Reading” Working with the Literature Copyright Bruno Buchberger 2004 No parts of this file may be copied or stored without written permission.
Vocabularies in the VO Alasdair J G Gray Norman Gray Iadh Ounis.
By: Dan Johnson & Jena Block. RDF definition What is Semantic web? Search Engine Example What is RDF? Triples Vocabularies RDF/XML Why RDF?
Meta Tagging / Metadata Lindsay Berard Assisted by: Li Li.
-1- Philipp Heim, Thomas Ertl, Jürgen Ziegler Facet Graphs: Complex Semantic Querying Made Easy Philipp Heim 1, Thomas Ertl 1 and Jürgen Ziegler 2 1 Visualization.
MD9.6 Release: Highlights Increased the character limit for all URL resources to 600 characters. Data_Center/Service_Provider Data_Set_Citation/Service_Citation.
Semantic Network as Continuous System Technical University of Košice doc. Ing. Kristína Machová, PhD. Ing. Stanislav Dvorščák WIKT 2010.
Terminology and documentation*  Object of the study of terminology:  analysis and description of the units representing specialized knowledge in specialized.
Evolving MARC 21 for the future Rebecca Guenther CCS Forum, ALA Annual July 10, 2009.
PREMIS Controlled vocabularies Rebecca Guenther Sr. Networking & Standards Specialist, Library of Congress PREMIS Implementation Fair San.
Semantic web course – Computer Engineering Department – Sharif Univ. of Technology – Fall Knowledge Representation Semantic Web - Fall 2005 Computer.
Artificial Neural Network Building Using WEKA Software
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. Externally growing self-organizing maps and its application to database visualization and exploration.
AGROVOC Thesaurus. 1980s: developed as multilingual structured thesaurus for agricultural terminology (“rice”) : parallel effort to express thesaurus.
Metadata Common Vocabulary a journey from a glossary to an ontology of statistical metadata, and back Sérgio Bacelar
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Strategies for subject navigation of linked Web sites using RDF topic maps Carol Jean Godby Devon Smith OCLC Online Computer Library Center Knowledge Technologies.
Patent Searching Basics Patrick M. Torre, Ph.D. November 18, 2015.
Metadata and Meta tag. What is metadata? What does metadata do? Metadata schemes What is meta tag? Meta tag example Table of Content.
 Steps to locate MathSciNet Database: 1. Go to library.astate.edu and go to the article database section 2. Then select the ‘M’ in the A-Z section 3.
Citation Searching Isabel Holowaty Juliet Ralph
How to Write Literature Review ww.ePowerPoint.com
CP3024 Lecture 12 Search Engines. What is the main WWW problem?  With an estimated 800 million web pages finding the one you want is difficult!
The Semantic Web. What is the Semantic Web? The Semantic Web is an extension of the current Web in which information is given well-defined meaning, enabling.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
Differences and distinctions: metadata types and their uses Stephen Winch Information Architecture Officer, SLIC.
Part of the Cronos Group 4C/kZen 4 th EcoTerm meeting, Vienna, April 18, 2007 Jef Vanbockryck Research & Development “Risk Assessment ontologies and data.
How "Next Generation" Are We? A Snapshot of the Current State of OPACs in U.S. and Canadian Academic Libraries Melissa A. Hofmann and Sharon Yang, Moore.
Jean-Yves Le Meur - CERN Geneva Switzerland - GL'99 Conference 1.
Integrated Departmental Information Service IDIS provides integration in three aspects Integrate relational querying and text retrieval Integrate search.
GoRelations: an Intuitive Query System for DBPedia Lushan Han and Tim Finin 15 November 2011
September 2003, 7 th EDG Conference, Heidelberg – Roberta Faggian, CERN/IT CERN – European Organization for Nuclear Research The GRACE Project GRid enabled.
Semantic Web. P2 Introduction Information management facilities not keeping pace with the capacity of our information storage. –Information Overload –haphazardly.
Online Information and Education Conference 2004, Bangkok Dr. Britta Woldering, German National Library Metadata development in The European Library.
Some basic concepts Week 1 Lecture notes INF 384C: Organizing Information Spring 2016 Karen Wickett UT School of Information.
“Reading” Working with the Literature
Designing Cross-Language Information Retrieval System using various Techniques of Query Expansion and Indexing for Improved Performance  Hello everyone,
WHAT DOES THE FUTURE HOLD? Ann Ellis Dec. 18, 2000
PREMIS Tools and Services
Introduction of KNS55 Platform
Introduction to Information Retrieval
Presentation transcript:

Some facets of knowledge management in mathematics Wolfram Sperber (Zentralblatt Math) Patrick Ion (Math Reviews) Facets of Knowledge Organization A tribute to Professor Brian Vickery ISKO UK biennal conference 4 th -5 th July 2011 London

Agenda A state-of-the-art analysis Enrichment of the MSC - new approaches: SKOS and a controlled vocabulary for mathematics Conclusions and Outlook

State of the art Zentralblatt Math and Math Reviews: the leading reviewing journals in mathematics and its applications coverage: more than 3,000,000 bibliographic entries of mathematical publications (journal articles, monographs, textbooks from 1820 up to now systematic analysis of the whole literature of mathematics

Facets of content analysis Biblio- graphic meta- data Authors, title, source,... Semantic meta- data Reviews/ abstract, keywords, classifi- cation Linked meta- Data References, networks of authors, coauthors … Social meta- data Comments, questions... ( in Zentralblatt Math and Math Reviews)

Different levels of semantic metadata Reviews (individual) Keywords (semi-formal, but no controlled vocabulary exists for mathematics up to now) Classification (formal, no degrees of freedom)

Classification in mathematics: Mathematics Subject Scheme (MSC) Features of the MSC: a topic-specific classification scheme nodes: more than 6,000 nodes (63 on the top level, more than 500 on the second level, more than 5,000 on the third level) relations: hierarchical relations are the most important relations within the MSC, but there are further two types of similarity relations: See also and For … see …

Printed and electronic versions of the MSC up to 2010 the master of the MSC was the printed version (advantages: nearly linear reading), but is only of limited use for the retrieval in the database reasons: too many groups, too complex, not intuitive for the most users, it is much simpler to search for keywords and names there is an electronic master of MSC 2010 (TeX- encoded), the TeX-encoded MSC is not machine -understandable

Restrictions and deficits of the MSC (I) the TeX-encoded version doesn't use standards for a semantic analysis of the structure of the MSC, so it is not interoperable with other classification schemes; the classes are defined only by their labels and their location within the MSC the labels of the classes are not unique the MSC is heterogeneous: the classes have different types, especially: modeling,mathematical objects, theories and methods, etc.

Agenda A state-of-the-art analysis Enrichment of the MSC - new approaches: SKOS and a controlled vocabulary for mathematics Conclusions and Outlook

Enrichment of the MSC: Transformation to SKOS Create a SKOS-encoded form of the MSC (SKOS – Simple Knowledge Organization Scheme) Why SKOS? SKOS provides a standardized vocabulary for classification schemes, thesauri, etc. SKOS is based on XML and RDF, this means SKOS can be extended to individual requirements, e.g., formula analysis in mathematics

a 1:1 translation form the TeX-master to a SKOS master (so we can model the MSC given by its classes and hierarchical relations) the result: we have the same content as in the current version, but  the content is encoded in a machine-understan- dable way (so it can be used by other schemes and applications)  the scheme is extensible: we can add further information The first step: Encoding of the MSC in SKOS:

SKOS snapshot

u p to now: the MSC model is just a classical graph model overlapping of classes couldn't be modeled in the printed form, but now we can do it! the idea is very simple: we use the terminology used in mathematical publications and add this information to the scheme Enhancement of the MSC model

Description of MSC classes by terms The idea: each class will be characterized by a (weighted) vector of terms In more detail: Using machine-learning tools Step 1: Vocabulary in MSC and other sources provide a start vocabulary Step 2: Analysis of existing keywords in the databases Step 3: Keyword extraction of the (classified) information of the databases ZBMATH and MathSciNet

The usual problems relevant terms are typically phrases, not single words synonyms and homonyms different grammatical forms of phrases abbreviations which are often used Controlled (semi-automatic) processing

The (fictional) result for MSC class Ordinary Differential Equations (MSC 34-XX) TermsOccurences Linear ODE371 Nonlinear ODE1072 Fractional ODE96 Stability781 Periodic Solutions37...

Benefits a precise and dynamic characterization of the MSC classes a controlled vocabulary for mathematics  a tool which can be used for  clustering of documents (similarity analysis of documents,(semi-)automatic keyword extraction and classification,  keyword generation by authors,  sophisticated retrieval features (MSC as a hidden method for retrieval)

Further steps a rigid facet structure for the MSC (reducing the size of the MSC) a typing of the MSC classes mathematical modeling, mathematical objects (e.g., ordinary differential equations), theories and methods (e.g., K-theory), qualitative aspects (e.g., stability) applications (within mathematics and in other fields) formula analysis and formula search (MathML)

Agenda A state-of-the-art analysis Enrichment of the MSC - new approaches: SKOS and a controlled vocabulary for mathematics Conclusions and Outlook

knowledge management in mathematics is at a turning point, we need new machine-based methods for content analysis and a new quality of service using standards formats for the MSC (e.g., methods of Semantic Web allowing a machine-processing of semantic information) enhancement of the MSC by combining different (classical) methods of semantic analysis (e.g., classification, controlled vocabularies, etc.) We are on the way!

Thanks!