ECPRD seminar on the net IX”, Brussels, 2011 Faceted Search Some examples of applied faceted search on websites developed by the EP Jerry.

Presentation transcript:

ECPRD seminar on the net IX”, Brussels, 2011 Faceted Search Some examples of applied faceted search on websites developed by the EP Jerry Hilbert European Parliament

Faceted search - Definition
Faceted search, also called faceted navigation or faceted browsing, is a technique for accessing a collection of information represented using a faceted classification, allowing users to explore by filtering available information.
A faceted classification system allows the assignment of multiple classifications to an object, enabling the classifications to be ordered in multiple ways, rather than in a single, pre-determined, taxonomic order. Each facet typically corresponds to the possible values of a property common to a set of digital objects.
Facets are often derived by analysis of the text of an item using entity extraction techniques or from pre-existing fields in the database such as author, descriptor, language, and format. This approach permits existing web pages, product descriptions or articles to have this extra metadata extracted and presented as a navigation facet.
Source: Wikipedia
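
The definition above can be illustrated with a minimal Python sketch. The records, field names and values are invented for illustration and do not come from any EP site; the point is only that each object carries several independent classifications, that each facet can be counted separately, and that selecting a value filters the collection.

```python
from collections import Counter

# Hypothetical records: each object carries several independent classifications.
DOCS = [
    {"id": 1, "language": "en", "format": "PDF", "author": "A"},
    {"id": 2, "language": "fr", "format": "PDF", "author": "B"},
    {"id": 3, "language": "en", "format": "Word", "author": "A"},
]


def facet_counts(docs, facet):
    """Count how many documents carry each value of one facet."""
    return Counter(doc[facet] for doc in docs)


def apply_filters(docs, selected):
    """Keep only the documents that match every selected facet value."""
    return [d for d in docs if all(d[f] == v for f, v in selected.items())]


# Drill down on language, then look at the remaining values of another facet.
hits = apply_filters(DOCS, {"language": "en"})
print(facet_counts(hits, "format"))
```

Because the facets are independent, the same collection can be explored in any order (language first, then format, or the reverse) rather than along one fixed taxonomy.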

Faceted search - Technology
Several search engines now offer faceted search. The EP uses Solr, which is based on Lucene.
Solr is an open source enterprise search platform from the Apache Lucene project. Its major features include powerful full-text search, hit highlighting, faceted search, dynamic clustering, database integration, and rich document (e.g., Word, PDF) handling. Providing distributed search and index replication, Solr is highly scalable.
Solr is written in Java and runs as a standalone full-text search server within a servlet container such as Apache Tomcat. Solr uses the Lucene Java search library at its core for full-text indexing and search, and has REST-like HTTP/XML and JSON APIs that make it easy to use from virtually any programming language. Solr's powerful external configuration allows it to be tailored to almost any type of application without Java coding, and it has an extensive plugin architecture when more advanced customization is required.
Source: Wikipedia
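
As a rough illustration of the HTTP API mentioned above, the sketch below builds a Solr search URL that asks for facet counts. The host, path and field names (`doc_type`, `language`) are assumptions made for the example and are not taken from the EP setup; the request parameters (`q`, `fq`, `wt`, `facet`, `facet.field`, `facet.mincount`) are standard Solr parameters.

```python
from urllib.parse import urlencode

# Assumption: a local Solr instance reachable at this address.
SOLR_SELECT = "http://localhost:8983/solr/select"


def build_facet_query(text, facet_fields, filters=None, fmt="xml"):
    """Return a Solr select URL asking for hits plus facet counts."""
    params = [
        ("q", text),
        ("wt", fmt),              # response writer: "xml" or "json"
        ("rows", "10"),
        ("facet", "true"),
        ("facet.mincount", "1"),  # hide facet values that would give no hits
    ]
    # One facet.field parameter per facet to compute.
    params += [("facet.field", f) for f in facet_fields]
    # One filter query per facet value the user has already selected.
    params += [("fq", f'{field}:"{value}"') for field, value in (filters or {}).items()]
    return SOLR_SELECT + "?" + urlencode(params)


url = build_facet_query("budget", ["doc_type", "language"], {"language": "en"})
print(url)
```

Only the URL is built here; sending it (for example with `urllib.request.urlopen`) requires a running Solr server.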

Faceted search - Technology
How is Lucene/Solr used? XML IN – XML OUT
- XML IN: data is structured as XML when it is submitted for indexing
- XML OUT: data is returned as XML (including facet details) as the result of a search
The search engine is also configured for free-text search:
- number of terms to match
- relevance of the terms, according to the field they are associated with
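
The XML IN / XML OUT round trip can be sketched in Python as follows. The record, the field names and the sample response are invented for illustration. The `<add><doc><field>` update message and the `facet_counts` response section follow Solr's standard XML formats. The free-text settings are shown with the DisMax parameters `qf` and `mm`, which is one way Solr exposes such tuning; whether the EP configuration used them is an assumption.

```python
import xml.etree.ElementTree as ET


def to_solr_add_xml(record):
    """XML IN: wrap one record in Solr's <add><doc> update message."""
    add = ET.Element("add")
    doc = ET.SubElement(add, "doc")
    for name, value in record.items():
        values = value if isinstance(value, list) else [value]
        for v in values:  # multi-valued fields repeat the <field> element
            field = ET.SubElement(doc, "field", name=name)
            field.text = str(v)
    return ET.tostring(add, encoding="unicode")


def parse_facets(response_xml):
    """XML OUT: read the facet counts from a Solr XML response."""
    root = ET.fromstring(response_xml)
    path = "./lst[@name='facet_counts']/lst[@name='facet_fields']/lst"
    facets = {}
    for field in root.findall(path):
        facets[field.get("name")] = {
            item.get("name"): int(item.text) for item in field.findall("int")
        }
    return facets


# Free-text tuning (assumed here to be done with the DisMax query parser).
FREE_TEXT_PARAMS = {
    "defType": "dismax",
    "qf": "title^3 summary^1",  # relevance of terms according to their field
    "mm": "2",                  # minimum number of query terms that must match
}

# Hand-written sample in the shape of a Solr XML response.
SAMPLE_RESPONSE = """<response>
  <result name="response" numFound="3" start="0"/>
  <lst name="facet_counts">
    <lst name="facet_fields">
      <lst name="language">
        <int name="en">2</int>
        <int name="fr">1</int>
      </lst>
    </lst>
  </lst>
</response>"""

print(to_solr_add_xml({"id": "doc-1", "title": "Example", "author": ["A", "B"]}))
print(parse_facets(SAMPLE_RESPONSE))
```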

Faceted search – EP websites
The coming slides show examples of faceted search as applied on websites developed by the EP:
- The Legislative Observatory of the EP
- Public Register of documents
- IPEX
- ECPRD

Faceted search – EP websites
OEIL: Legislative Observatory of the EP (old version of the site)

Faceted search – EP websites
OEIL: Legislative Observatory
OEIL contains legislative, budgetary, non-legislative and internal parliamentary procedures, such as:
- co-decision, consultation and assent procedures
- budgetary and discharge procedures
- own-initiative reports by the European Parliament
- appointments, waivers of immunity and changes to the Rules of Procedure (i.e. internal EP procedures)
- resolutions and recommendations adopted by the European Parliament
- documents forwarded for information from the Commission (during the last 9 months)

Faceted search – EP websites
OEIL: Legislative Observatory
Situation before implementing faceted search

Faceted search – EP websites
OEIL: Legislative Observatory
Challenges for applying facets in OEIL:
(1) Sequence of facets: Parliamentary term, …
(2) Protocol order for returned matches in the facets: political groups, Commission DGs, etc.
(3) Facets that return a very large number of values: rapporteurs (possibly a few hundred)
(4) Facets for structured keyword lists: Legal Basis (Treaty to Article)
(5) Length of words
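
Challenges (2), (3) and (4) can be approached on the application side, after the facet counts come back. The Python sketch below shows one possible approach, not the solution used in OEIL: the group names and the order list are placeholders, and the depth-prefixed tokens for hierarchical values are a common indexing convention assumed here for illustration.

```python
# Placeholder protocol order; the real list would come from the institution.
PROTOCOL_ORDER = ["Group A", "Group B", "Group C"]


def sort_by_protocol(counts, order):
    """Order facet values by a fixed protocol list instead of by hit count."""
    rank = {name: i for i, name in enumerate(order)}
    # Values missing from the protocol list go last, alphabetically.
    return sorted(counts.items(), key=lambda kv: (rank.get(kv[0], len(order)), kv[0]))


def top_values(counts, limit):
    """Show only the most frequent values of a large facet.

    Returns the visible values and the number of values left hidden.
    """
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return ranked[:limit], max(0, len(ranked) - limit)


def path_levels(path, sep="/"):
    """Expand a structured keyword into one indexable token per level."""
    parts = path.split(sep)
    levels = []
    for depth in range(len(parts)):
        prefix = sep.join(parts[:depth + 1])
        levels.append(str(depth) + sep + prefix)
    return levels


print(sort_by_protocol({"Group C": 5, "Group A": 1, "Other": 9}, PROTOCOL_ORDER))
print(top_values({"x": 3, "y": 5, "z": 1}, 2))
print(path_levels("Treaty/Article 1"))
```

Indexing one token per level lets the interface show treaties first and reveal articles only after a treaty has been selected.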

Faceted search – EP websites
OEIL: Legislative Observatory
Where facets can help out:
(1) Date range searches
(2) Structured references of procedures or documents
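
If the date range case were handled with Solr's range faceting, the request parameters could look like the Python sketch below. The field name `doc_date` and the dates are assumptions for illustration; `facet.range`, `facet.range.start`, `facet.range.end` and `facet.range.gap` are standard Solr parameters, and the slides do not say whether OEIL uses them.

```python
from datetime import date
from urllib.parse import urlencode


def solr_date(d):
    """Format a date as the UTC ISO 8601 timestamp Solr date fields expect."""
    return d.strftime("%Y-%m-%dT00:00:00Z")


def date_range_params(field, start, end, gap="+1YEAR"):
    """Parameters asking Solr for hit counts per time bucket on a date field."""
    return [
        ("facet", "true"),
        ("facet.range", field),
        ("facet.range.start", solr_date(start)),
        ("facet.range.end", solr_date(end)),
        ("facet.range.gap", gap),
    ]


def date_filter(field, start, end):
    """Filter query restricting hits to an inclusive date interval."""
    return ("fq", f"{field}:[{solr_date(start)} TO {solr_date(end)}]")


# Count documents per year, then narrow the hits to one year.
params = [("q", "*:*")]
params += date_range_params("doc_date", date(2009, 1, 1), date(2011, 1, 1))
params.append(date_filter("doc_date", date(2010, 1, 1), date(2010, 12, 31)))
print(urlencode(params))
```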

Faceted search – EP websites
Public Register of documents

Faceted search – EP websites
Public Register
Documents accessible through the Register
5 main categories of documents:
- Parliamentary activity
- EP general information
- From other institutions and Member States
- Documents from third parties
- Budgetary procedure
125 types of documents
List defined by EP Bureau
Table: References and Documents (All LV), as of December of four successive years (years and figures missing)

Faceted search – EP websites
Public Register / Metadata
Usually for each document:
- reference number
- title
- dates
- summary
- authorities
- authors
- relations

Faceted search – EP websites
Public Register
Situation before implementing faceted search

Faceted search – EP websites
IPEX: Interparliamentary EU Information Exchange (old version of the site)

Faceted search
IPEX: Interparliamentary EU Information Exchange
The IPEX database contains a complete catalog of Commission documents from […]. From each Commission document, users can click on "Related dossiers" and from there access national scrutiny pages. Each national scrutiny page contains documents from the individual national parliaments relating to the specific Commission document or legislative procedure.
IPEX also hosts a calendar of interparliamentary cooperation, which contains information concerning all interparliamentary meetings relating to the European Union.

Faceted search
IPEX: Interparliamentary EU Information Exchange
Challenge: how to guarantee that the result list presents the information in its context
- Dossier
- Documents
- Scrutinies
- Private forums

Faceted search – EP websites
ECPRD: European Center for Parliamentary Research and Documentation (private site)

Faceted search
ECPRD: European Center for Parliamentary Research and Documentation
Situation before implementing faceted search

Faceted search
ECPRD: European Center for Parliamentary Research and Documentation
For the next release, an extension to the current (new) search implementation is foreseen: using the key facet for Thesaurus entries as a privileged entry point to find relevant objects on the site (i.e. taking advantage of the XML-structured facet output to navigate to the right records).
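
One way to read this plan is that each value of the thesaurus facet becomes a navigation link that reruns the search restricted to that value. The Python sketch below assumes the facet counts have already been parsed from the XML output; the field name `thesaurus`, the `/search` path and the sample values are hypothetical, and the planned ECPRD implementation may differ.

```python
from urllib.parse import urlencode


def facet_links(base_url, query, field, counts):
    """Turn facet values into (label, url) pairs that drill into matching records."""
    links = []
    for value, count in sorted(counts.items()):
        # The filter query restricts the same search to one thesaurus entry.
        params = [("q", query), ("fq", f'{field}:"{value}"')]
        links.append((f"{value} ({count})", base_url + "?" + urlencode(params)))
    return links


# Hypothetical thesaurus facet as it might be parsed from the XML output.
thesaurus = {"budget": 4, "elections": 7}
for label, url in facet_links("/search", "*:*", "thesaurus", thesaurus):
    print(label, url)
```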

Faceted search
Conclusions
- One size doesn't fit all
- Advanced search may be required for pre-selection
- Challenges appear when large result lists are returned
- Site-wide searches need to recall the context of the object
- Analysis starts when indexing, not when producing result lists

Faceted search
Conclusions
- Easily comprehensible and powerful drill-up and drill-down feature
- Flexible enough to adapt to future queries
- No ‘0 result lists’ when drilling
- Facet counts show how many results to expect

Faceted search
Thanks for your attention! Questions?