GL12 Conf. Dec. 6-7, 2010NTL, Prague, Czech Republic Extending the “Facets” concept by applying NLP tools to catalog records of scientific literature *E.

Slides:



Advertisements
Similar presentations
Christiane Stock Emmanuelle Rocklin Aurélie Cordier
Advertisements

a Terminological and Statistical Approach
PUMA & MetaPub Open Access to Italian CNR Repositories in the Perspective of the European Digital Repository Infrastructure GL9 - NINTH INTERNATIONAL CONFERENCE.
Strategies and activities undertaken in Italy for diffusion and dissemination of Minerva products Ministerial NEtwoRk for Valorising Activities in digitisation.
GMD German National Research Center for Information Technology Darmstadt University of Technology Perspectives and Priorities for Digital Libraries Research.
How the University Library can help you with your term paper
Introduction to Library Research Gabriela Scherrer Reference Librarian for English Languages and Literatures, University Library of Bern.
FAO and UNESCO-IOC/IODE Combine Efforts in their Support of Open Access Written by Marc Goovaerts, U. Hasselt, BE.
Converging parallel universes Library services as building blocks of digital humanities research 42nd LIBER Annual Conference Munich June 2013 Gregor Horstkemper.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
Introduction to Library Research Gabriela Scherrer Reference Librarian for English Languages and Literatures, University Library of Bern.
Introduction to Library Research Gabriela Scherrer Reference Librarian for English Languages and Literatures, University Library of Bern.
Antonella De Robbio, Dario Maguolo Mathematics Library – University Library System University of Padova – ITALY Mathematics Subject Classification and.
How the University Library can help you with your term paper Computer Science SC Hester Mountifield Science Library x 8050
Conference papers & proceedings. Many conference papers are published in journals and some may be released before a conference takes place. Other papers.
Tools and resources supporting the cultural tourism Istituto di Linguistica Computazionale “Antonio Zampolli” CNR - Pisa GL14: November 28, Sassolini.
At the NATIONAL TRANSPORTATION LIBRARY CIL 2007 Washington, DC Joyce W. Koeneman Digital Librarian, NTL Research and Innovative Technology Administration.
Cluj Napoca, 28 August IEEE International Conference on Intelligent Computer Communication and Processing Digital Libraries Workshop Towards.
Grey Literature, E-Repositories and Evaluation of Academic & Research Institutes. The case study of BPI e-repository Maria V. Kitsiou - Head Librarian,
Biological Science Database Proquest WEDAD AL-HUSAINAN ISD/NSTIC Kuwait Institute for Scientific Research November/2012.
CONTI’2008, 5-6 June 2008, TIMISOARA 1 Towards a digital content management system Gheorghe Sebestyen-Pal, Tünde Bálint, Bogdan Moscaliuc, Agnes Sebestyen-Pal.
Key integrating concepts Groups Formal Community Groups Ad-hoc special purpose/ interest groups Fine-grained access control and membership Linked All content.
Data Exchange Tools (DExT) DExT PROJECTAN OPEN EXCHANGE FORMAT FOR DATA enables long-term preservation and re-use of metadata,
Teaching Metadata and Networked Information Organization & Retrieval The UNT SLIS Experience William E. Moen School of Library and Information Sciences.
Introduction to Library Research Gabriela Scherrer Reference Librarian for English Languages and Literatures, University Library of Bern.
GL14 - Fourteenth International Conference on Grey Literature National Research Council Rome, Italy November 2012 Rosa Di Cesare, Marianna Nobile.
Literature in Theory & Practice Frederic Murray Assistant Professor MLIS, University of British Columbia BA, Political Science, University of Iowa Instructional.
IL Step 1: Sources of Information Information Literacy 1.
Thomson Scientific October 2006 ISI Web of Knowledge Autumn updates.
Bio-Medical Information Retrieval from Net By Sukhdev Singh.
Knowledge Representation and Indexing Using the Unified Medical Language System Kenneth Baclawski* Joseph “Jay” Cigna* Mieczyslaw M. Kokar* Peter Major.
GEO: a special collection for Earth Science community *Stefania Biagioni, *Silvia Giannini, **Cecilia Giussani *CNR-ISTI, **CNR-IGG Pisa, Italy GL13 Conference,
NCBI/WHO PubMed/Hinari Course Introduction Session #1, Sept 13, 2005 Session #2, Sept 14, 2005 Internet Concepts and Scientific Literature Resources Ho.
E - Physical Sciences & Engineering Jeff Pache IEE
CBSOR,Indian Statistical Institute 30th March 07, ISI,Kokata 1 Digital Repository support for Consortium Dr. Devika P. Madalli Documentation Research &
The comparative study of grey literature organisation and approach: Two countries, similar and different Primož Južnič, Petra Myškov á, Richard Pap í k.
GREY LITERATURE AND COMPUTATIONAL LINGUISTICS: FROM PAPER TO NET Claudia Marzi, Gabriella Pardelli, Manuela Sassi Istituto di Linguistica Computazionale.
Presented by Dr. S. C. Jindal Librarian Central Science Library University of Delhi Delhi Information Competency.
3.1. Types of scientific reports and their purpose oral form - it is not obligatory, could be very formal and it can not be used for the justification.
Open Access - an introduction, Aleppo, December Open Access – an introduction Ian Johnson.
RESEARCH – DOING AND ANALYSING Gavin Coney Thomson Reuters May 2009.
SPRINGER ONLINE
CODE (Committee on Digital Environment) July 26, 2000 Rice University THE NET OF THE 21st CENTURY: Concepts across the Interspace Bruce Schatz CANIS Laboratory.
XXIII International Symposium on Nuclear Electronics & Computing NEC’11 JINR DOCUMENT SERVER: Current Status and Future Plans I.Filozova, S.Kuniaev, G.Musulmanbekov,
Iana Atanassova Research: – Information retrieval in scientific publications exploiting semantic annotations and linguistic knowledge bases – Ranking algorithms.
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
Reference Collections: Collection Characteristics.
Find Research Data b2find.eudat.eu B2FIND User Training How to find data objects and collections using EUDAT’s B2FIND This work is licensed.
GL15 Grey Literature Bratislava 2-3 december 2013 Industrial Philology: problems and techniques of data and archives preservation for future generations.
INGENTA GATEWAY PORTAL
Digital Library Services team Indico Workshop - CERN – Invenio: a possible search system for Indico.
Knowledge Support for Modeling and Simulation Michal Ševčenko Czech Technical University in Prague.
Jean-Yves Le Meur - CERN Geneva Switzerland - GL'99 Conference 1.
Semantic networks for improved access to biomedical databases Istituto di Linguistica Computazionale “Antonio Zampolli” CNR – Pisa Sassolini Eva, Cucurullo.
How can I use a digital library to support my teaching? Find good resources to enhance existing curriculum  Search special collections aimed at your interests.
ONTOLOGY LIBRARIES: A STUDY FROM ONTOFIER AND ONTOLOGIST PERSPECTIVES Debashis Naskar 1 and Biswanath Dutta 2 DSIC, Universitat Politècnica de València.
WP5: Semantic Multimedia
Contents Module 6: E-journal, E-books and Internet Resources
3. Scientific literature, Internet online resources
Digital library and OR 21 October 2002 Members’ Council
Exploring Scholarly Data with Rexplore
Introduction of KNS55 Platform
3. Scientific literature, Internet online resources
Objectives, activities, and results of the database Lituanistika
3. Scientific literature, Internet online resources
Open archives for Library and Information Science
Networked Information Resources
Web archives as a research subject
3. Scientific literature, Internet online resources
AUC’s Role In Facilitating Access To Knowledge In The Arab World
Presentation transcript:

GL12 Conf. Dec. 6-7, 2010NTL, Prague, Czech Republic Extending the “Facets” concept by applying NLP tools to catalog records of scientific literature *E. Picchi, *M. Sassi, **S. Biagioni, **S. Giannini *Institute of Computational Linguistics **Institute of Information Science and Technologies CNR, Italy

GL12 Conf. Dec. 6-7, 2010NTL, Prague, Czech Republic The contest National Research Council of Italy - CNR Institute of Computational Linguistics DBTficio Laboratory Models and methods for the natural languages processing, monolingual and multilingual prototype applications Institute of Information Science and Technologies Networked Multimedia Information Systems Laboratory & Library Digital Library management systems

GL12 Conf. Dec. 6-7, 2010NTL, Prague, Czech Republic The target The target is to present the prototype of an “intelligent” navigation system named DBT&Facets, which has been implemented on the full bibliographic records of the documents archived in the PUMA digital library of the Italian National Research Council (CNR) The system has been implemented by integrating the core textual search engine (known as DBT and developed by ILC) with the TextPower (TP) technology.

GL12 Conf. Dec. 6-7, 2010NTL, Prague, Czech Republic PUMA repositories PUMA is a user-focused, service–oriented infrastructure which manages an increasing number of CNR institutional repositories containing about 25,000 published or open access documents in a wide variety of disciplines PUMA archives the metadata (qualified DC + administrative metadata) and the full texts of the following document types:  Published literature: journal article, conference and workshop paper, book and contribution to book, guest editorial  Grey Literature: conference presentation, workshop and meeting paper, communication poster/abstract, pre print, technical report, project report, internal note, PHD thesis, guest editorial, other materials (eg. courses, tutorials etc...)

GL12 Conf. Dec. 6-7, 2010NTL, Prague, Czech Republic TP-Text Power Technology TP is based on NLP techniques and linguistic resources used to create tools for the evaluation, analysis, classification and browsing of information related to the domains of scientific literature The extraction of implicit knowledge from the texts through which TP can enrich the documents, is a specialization of the "Facets" technology

GL12 Conf. Dec. 6-7, 2010NTL, Prague, Czech Republic The “facet” concept… … is peculiar of Archives and Library Science field, but is also used in Information retrieval systems. In Library Science the term "facet" identifies the elements of a structured material such as library catalogs, which are characterized by the code of the field and its contents TP extends the facet concept by extracting “field + content” pairs not only from structured fields but also from free text, eg. abstracts, using a linguistic-statistical approach to annotate relevant terminology, named entities, etc. The enriched text can be queried, analysed, and classified using “DBT&Facets”

GL12 Conf. Dec. 6-7, 2010NTL, Prague, Czech Republic …The prototype PUMA Multidisciplinary Repositories Ca Records & Abstracts enriched by and elaborated by “Intelligent” navigation system DBT&Facets is an advanced search tool that permits the user to query and refine their results, and to identify particular relations between them

GL12 Conf. Dec. 6-7, 2010NTL, Prague, Czech Republic The Puma query sample Simple Search for “grey”

GL12 Conf. Dec. 6-7, 2010NTL, Prague, Czech Republic The prototype query sample 1 CATEGORIES ACollections BInstitute CAuthor DLanguage of Summary ELanguage of Document FFree Subjects GYear of Publication

GL12 Conf. Dec. 6-7, 2010NTL, Prague, Czech Republic The prototype query sample 2 CATEGORIES HSub-Type IType JOther Language of Summary KEdited by LTitle of Journal MTitle of Event NTitle OAbstract in english

GL12 Conf. Dec. 6-7, 2010NTL, Prague, Czech Republic The graph built on a selected author

GL12 Conf. Dec. 6-7, 2010NTL, Prague, Czech Republic Conclusions In an open domain like scientific documentation, our approach based on the criteria of “semantic similarity” is useful – and perhaps more objective than one based on hierarchical elements - as it makes it possible to link different types of information, also across domains if necessary The aim of the project has been to structure a knowledge system of domain-specific information which assists the user by suggesting possible directions for their search