Ontology-based information retrieval of scientific information Natalia V. Loukachevitch Laboratory of Information Resources Analysis Research Computing.

Presentation transcript:

Ontology-based information retrieval of scientific information
Natalia V. Loukachevitch
Laboratory of Information Resources Analysis
Research Computing Center of Moscow State University (MGU NIVC)

Thematic Search of Scientific Information
- Knowledge-based (ontology-based) search
- Use of synonyms
- Automatic query expansion
- Automatic analysis of query results
- Help in interactive search
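The "use of synonyms" and "automatic query expansion" items above can be illustrated with a minimal sketch. It is not the system described in the slides: the toy thesaurus, its entries, and the function name `expand_query` are all assumptions made for illustration.

```python
from typing import Dict, List, Set

# Hypothetical toy thesaurus: concept name -> terms treated as synonyms.
# The entries are illustrative only, not taken from the real thesaurus.
TOY_THESAURUS: Dict[str, Set[str]] = {
    "tax": {"tax", "taxation", "levy"},
    "draft law": {"draft law", "bill"},
}


def expand_query(terms: List[str], thesaurus: Dict[str, Set[str]]) -> List[str]:
    """Return the query terms plus every synonym that shares a concept with them.

    Original terms come first; duplicates are dropped; order is deterministic.
    """
    expanded: List[str] = []
    seen: Set[str] = set()
    for term in terms:
        key = term.lower()
        candidates = [key]
        for synonyms in thesaurus.values():
            if key in synonyms:
                candidates.extend(sorted(synonyms))
        for candidate in candidates:
            if candidate not in seen:
                seen.add(candidate)
                expanded.append(candidate)
    return expanded


if __name__ == "__main__":
    print(expand_query(["bill", "reform"], TOY_THESAURUS))
```

A term that is not in the thesaurus (here "reform") passes through unchanged, so expansion never removes anything from the user's query.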

Bilingual Sociopolitical Thesaurus
The thesaurus development is based on three methodologies:
- methods of construction of information-retrieval thesauri (information-retrieval context, analysis of terminology, terminology-based concepts, a small set of relation types)
- development of wordnets for various languages (word-based concepts, detailed sets of synonyms, description of ambiguous text expressions)
- ontology and formal ontology research (strictness of relations description, necessity of many-step inference)
Size: 33,000 concepts, 80,000 Russian terms, 85,000 English terms
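One way to picture a bilingual concept with a small set of relation types and many-step inference is the sketch below. The `Concept` class, the single `broader` relation, and the three sample concepts are assumptions for illustration; the slides do not specify the thesaurus's actual data model or relation inventory.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Set


@dataclass
class Concept:
    """A language-independent concept with terms in both languages (hypothetical model)."""
    name: str
    english_terms: List[str] = field(default_factory=list)
    russian_terms: List[str] = field(default_factory=list)
    broader: List[str] = field(default_factory=list)  # names of more general concepts


def all_broader(name: str, concepts: Dict[str, Concept]) -> Set[str]:
    """Follow 'broader' links transitively (many-step inference)."""
    result: Set[str] = set()
    stack = list(concepts[name].broader)
    while stack:
        current = stack.pop()
        if current in result:
            continue  # already visited; also guards against cycles
        result.add(current)
        stack.extend(concepts[current].broader)
    return result


# Illustrative entries only; the relations are assumed, not quoted from the thesaurus.
CONCEPTS: Dict[str, Concept] = {
    "law": Concept("law", ["law"], ["закон"]),
    "tax legislation": Concept(
        "tax legislation", ["tax legislation"], ["налоговое законодательство"], ["law"]
    ),
    "VAT legislation": Concept(
        "VAT legislation", ["VAT legislation"], [], ["tax legislation"]
    ),
}

if __name__ == "__main__":
    print(sorted(all_broader("VAT legislation", CONCEPTS)))
```

Because inference runs over concepts rather than words, a match found through a Russian term and one found through an English term lead to the same chain of broader concepts.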

Socio-Political Domain vs. General Lexicon and Specific Lexicons
[Diagram labels: General Lexicon; Specific Lexicon; Intermediate Zone; Information Security; Aviation Ontology; Cultural Heritage; Ontology on Natural Sciences and Technology (30,000 concepts; 70,000 terms)]

Thematic Structure
[Diagram labels: tax; taxation system; tax payer; finances; economy; tax legislation; VAT legislation; law; draft law; Taxation Code; deputy minister; Ministry of Finance; finances; reform; tax reform; population; budget, estimate; finances; economy; document; government; state power; Minister of Finance; State Duma; state power; state]

Thematic representation of a text
[Diagram labels: Thematic Node i; Thematic Node j; thematic node in the text]

University Information System RUSSIA ( )
- Database of Fulltext Documents (1.5 million): Legal Acts, Newspaper Articles, Scientific Reports
- Database “Statistics of Russian Federation” (Socio-economic Statistics, Demographic Statistics, Agrarian Statistics, Urban Statistics)
- Database “Budget system of Russian Federation”

Visualisation of Data in Dynamic Tables and Maps

Unified Technology Platform (Constructor)
Converters, Processing, Interfaces, Services

Cross-Language Information Retrieval

Applications of technology
- Concept-based information retrieval (monolingual, bilingual)
- Information-retrieval systems combining word-based and concept-based search
- Concept-based automatic text categorization
- Automatic question answering
- Automatic text summarization
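The idea behind bilingual concept-based retrieval can be sketched as follows: terms in either language map to the same concept, so an English query can match Russian documents. This is only an illustration under stated assumptions. The term table, the concept identifiers, the sample documents, and the naive substring matching are all hypothetical and far simpler than a real system (for example, "tax" would also match inside "syntax").

```python
from typing import Dict, List, Set

# Hypothetical bilingual term table: term (either language) -> concept identifier.
TERM_TO_CONCEPT: Dict[str, str] = {
    "tax": "TAX",
    "налог": "TAX",
    "budget": "BUDGET",
    "бюджет": "BUDGET",
}


def concepts_in(text: str) -> Set[str]:
    """Return the concepts whose terms occur in the text (naive substring match)."""
    lowered = text.lower()
    return {concept for term, concept in TERM_TO_CONCEPT.items() if term in lowered}


def search(query: str, documents: Dict[str, str]) -> List[str]:
    """Rank document ids by the number of concepts shared with the query."""
    wanted = concepts_in(query)
    scored = []
    for doc_id, text in documents.items():
        overlap = len(wanted & concepts_in(text))
        if overlap:
            scored.append((overlap, doc_id))
    # Highest overlap first; ties broken by document id for determinism.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [doc_id for _, doc_id in scored]


# Illustrative documents in two languages.
DOCS: Dict[str, str] = {
    "ru1": "Новый налог на прибыль",
    "en1": "The federal budget was approved",
    "ru2": "Бюджет и налог",
}

if __name__ == "__main__":
    print(search("tax", DOCS))
```

Matching on concepts rather than words is what lets the same index serve monolingual and bilingual search without translating the documents.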

Main Projects
- State Duma of RF ( …)
- Central Election Commission of RF ( …)
- Legal Company “Garant” (2002 – …)
- Ministry of Education ( )
- Accounting Chamber of RF (2003 – …)
- Central Bank of RF (2006 – …)
Grants:
- MacArthur Foundation (1994, 1995, …)
- Ford Foundation (2002, 2003)
- Russian Foundation for Basic Research (9)
- Russian Foundation for the Humanities (5)
- Eurasia Foundation (2002, 2003)

Participation in International Forums
- Participation in the Text REtrieval Conference (TREC), organized by NIST and DARPA (TREC-6, TREC-8)
- Participation in the summarization conference SUMMAC, organized by NIST and DARPA (1st place)
- Cross-Language Evaluation Forum CLEF (DELOS program):
  - participation in the Steering Committee
  - provision of Russian collections for evaluation purposes
  - retrieval of domain-specific information
- Organizers of the Russian Information Retrieval Evaluation Seminar ROMIP ( )