Demystifying Endeca’s search results ranking Kristina Spurgin with input & support from Ben Pennell & Jeff Campbell UNC Libraries.

Slides:



Advertisements
Similar presentations
A worldwide library cooperative OCLC Online Computer Library Center OCLC CJK Users Group 2007 Annual Meeting March 24, 2007, Boston David Whitehair, OCLC.
Advertisements

History Study Center Primary and secondary sources documenting global history 2010.
R2 Library Features and Functionality Overview. The R2 Library  The R2 Library is an electronic database that enables access to digital book content.
Modern Language Association (MLA) International Bibliography Hosted by Gale Cengage Welcome to our Guided Tour Tour takes about 7 minutes. The show will.
Periodicals BooksNewspapers Reference tools Online Databases Printed Version Electronic Version Annual reports and other publications.
MARC 101 for Non-Catalogers Colorado Horizon Users Group Meeting Philip S. Miller Library Castle Rock, CO May 29, 2007.
Extending Primo beyond your ILS data source : including EAD and Graphic Sources Janet Lute ILS Coordinator Princeton University Library IGeLU 2014Oxford,
Information & Library Services Australian Education Index, British Education Index and ERIC Sally Giffen August 2006.
University of Adelaide Library Life Impact The University of Adelaide The well connected catalogue Patricia Scott, Denise Tobin and Helen Attar.
Reference and Libraries Australia Search Karen Mackney and David Ong.
April 2001Division of Library Services IDEAL® is a collection of full text journal titles. Includes 173 journal titles from Academic Press. Abstracts and.
5 th September 2003Diane Tough Content Creation at the NHM or The evolving catalogue!
Overseas Library Catalog – Request Item Overseas Library Catalog Request loaned item.
How to Read the Keyword Results Screen. A keyword search will result in.
Anatomy of the Keyword Search Results Screen. A keyword search will result in.
Searching TAL Online Developed by Northern Lights Internet Solutions Ltd. Advanced Searching.
Web of Science: An Introduction Peggy Jobe
Book Search By Subject LIBRARY LESSONS From the Research helpdesk August 2011 “Book Search by Subject ’ is licensed by NJIT Library under a Creative Commons.
Dongmei Cao 10/22/2008 class blog:
Introduction to MARC Cataloguing Part 2 Presenters: Irma Sauvola: Part 1 Dan Smith: Part 2.
Making sense of the data jumble Trinity College Library Dublin’s Discovery Solution Experience Arlene Healy & Charles Montague Digital Systems and Services.
What difference a good tool? using Endeca for a faceted catalog Emily Lynema NCSU Libraries ACRL Delaware Valley Chapter Fall Program November 3, 2006.
Web Database Design Session 6 and 7 Matakuliah: Web Database Tahun: 2008.
Intended for novice users as an introduction to the online catalog’s capabilities. The guide would be available on the New Brighton Public Library’s website.
Improving the Catalogue Interface using Endeca Tito Sierra NCSU Libraries.
CiNii Books is a service that provides information, which has been accumulated by NACSIS-CAT, on books and journals that are held in university libraries.
N EXT G EN C ATALOG F ORUM Where do we go from here?
Lecture Four: Steps 3 and 4 INST 250/4.  Does one look for facts, or opinions, or both when conducting a literature search?  What is the difference.
Project Overview Bibliographic merging, Endeca, and Web application.
Support.ebsco.com EBSCOhost Basic Searching for Academic Libraries Tutorial.
Support.ebsco.com Basic Searching for K-12 School Libraries Tutorial.
NARA’s New Authority Sources: Authority Files and Thesauri in ARC C. Jerry Simmons Authority Team Leader, Lifecycle Coordination Staff National Archives.
NCSU Libraries Kristin Antelman NCSU Libraries June 24, 2006.
NCSU Libraries Andrew Pace & Emily Lynema NCSU Libraries May 24, 2006.
© 2007 CBHL The CBHL Distributed Library The Council on Botanical and Horticultural Libraries A Guide to Content and Search Features.
Medline on OvidSP. Medline Facts Extensive MeSH thesaurus structure with many synonyms used in mapping and multidatabase searching with Embase Thesaurus.
History Study Centre Demonstration. History Study Centre A wealth of primary and secondary resources for historians. Content is selected and organised.
RSC eBook Collection April 2007 RSC eBook Collection Over 700 Books c. 8,000 chapters c. 250,000 pages 10,000 items - tables.
IL Step 3: Using Bibliographic Databases Information Literacy 1.
Basic Search Engine Optimization. What is SEO?  SEO is an abbreviation for search engine optimization.
DLOHI Digital Library of Haffkine Institute. DISCLAIMER This Digital Collection is developed and maintained purely and solely for the purpose of preservation.
Searching Voyager: #2: Finding a Book by Its Title Zale Library at Paul Quinn College David Hamrick, 2012 “Now, voyager, sail thou forth to seek and find…”
Encyclopaedia Idea1 New Library Feature Proposal 22 The Encyclopaedia.
Endeca: a faceted search solution for the library catalog Kristin Antelman & Emily Lynema UNC University Library Advisory Council June 15, 2006.
GOBI’s Basic Search with Faceted Results Conference Dial In Information: Pass Code: Spring
ITGS Databases.
Web of Science: Citation Indexes on the Web Gary Wiggins 9/29/2004.
Introduction to Information Retrieval Example of information need in the context of the world wide web: “Find all documents containing information on computer.
Comparative Labor History Research Tools & Strategies.
Information Retrieval Transfer Cycle Dania Bilal IS 530 Fall 2007.
Kemp Library See this presentation any time!
Implementation of a faceted catalog search solution Kristin Antelman & Emily Lynema NCSU Libraries Feb. 7, 2006.
 Here you will learn how to access  The Library Catalog and search for a book  Your Library and Textbook account.
Sally McCallum Library of Congress
Chapter Three Presentation: User interface How to Build a Digital Library Ian H. Witten and David Bainbridge.
A Faceted Interface to the Library Catalog Tito Sierra NCSU Libraries ALA Midwinter Meeting January 20, 2007.
Roger Mills February don’t be evil stand on the shoulders of giants.
Type in: destiny.usd259.netdestiny.usd259.net Click on the library page tab.
Destiny: Your Library’s Online Catalog Finding Books, Magazines Websites & More! Basic Search Power Search Visual Search Destiny Quest.
PubMed Basics Barbara A. Wood, MLIS Calder Library University of Miami Miller School of Medicine.
GUIDE. P UB M ED
Image Field Data Identifier ksrl.kc.sm_wyando_1885_001 Title Wyandotte, Kansas : 1885 Description sheet number: 1 Name Sanborn.
Philosopher’s Index Manual
Databases- presentation and training
Encore Implementation: One Academic Library's Experience
The National Library of Medicine and its databases
Library Content Comparison System
Internet Research Third Edition
IL Step 3: Using Bibliographic Databases
Lívia Vasas, PhD 2018 The Nation Library of Medicine and its databases Mozilla Firefox or Google Chrome Lívia Vasas, PhD.
Presentation transcript:

Demystifying Endeca’s search results ranking Kristina Spurgin with input & support from Ben Pennell & Jeff Campbell UNC Libraries

Endeca details Search Configuration and Relevance Ranking – The supported search methods and details on how results are ranked for each TRLN Endeca Data Model – The major field groups, with brief descriptions of their use, and indexing and display properties. Endeca Extract and Mappings Spreadsheet – Details on how MARC fields get mapped into Endeca fields

TRLN Endeca Search Interfaces Words anywhere (i.e. Keyword) Author Title Journal title Subject ISBN/ISSN (Publisher)

How to think about RelRank Image source

Spotting the relevancy strata Subject search relevancy strategy – Exact phrase match, starting from beginning of a single field is the gold-standard match – Subject heading search: commonplace bookcommonplace book

PubDateSort = 1700 No pub date!

A more complex search: keyword (AKA “Words anywhere”) “Searches all indexed fields, but only uses some fields to rank results.” -- Search Configuration and Relevance RankingSearch Configuration and Relevance Ranking

What fields are indexed? Guide to the TRLN Endeca Data Model gives some info Guide to the TRLN Endeca Data Model

What fields are indexed? Endeca Extract and Mappings Spreadsheet gives the detailed info. Endeca Extract and Mappings Spreadsheet

More on keyword search (AKA “Words anywhere”) “Matches in the main title, subject headings, and main author fields will be given the highest ranking.” -- Search Configuration and Relevance RankingSearch Configuration and Relevance Ranking

More on keyword search (AKA “Words anywhere”) “Queries that match as a phrase are ranked higher than those which do not.” -- Search Configuration and Relevance RankingSearch Configuration and Relevance Ranking

More on keyword search (AKA “Words anywhere”) “Exact term matches are ranked higher than those returned because of spell correction, stemming, and thesaurus lookups.” -- Search Configuration and Relevance RankingSearch Configuration and Relevance Ranking

More on keyword search (AKA “Words anywhere”) “Matches in tables of contents, summaries, or selected EAD elements are not used to determine ranking.” -- Search Configuration and Relevance RankingSearch Configuration and Relevance Ranking

An aside on keyword search (AKA “Words anywhere”)

Fields used to rank Keyword results Most important to least Main Title Main Title Normalized Title Vernacular Title Vernacular Segmented Subject Headings Subjects Normalized Subjects Vernacular Segmented Main Author Main Author Normalized Main Author Vernacular Main Author Vernacular Segmented Company Varying Titles Varying Titles Vernacular Segmented Other Authors Other Author Translation Authors Normalized Main Uniform Title Main Uniform Title Vernacular Main Uniform Title Vernacular Segmented Uniform Title Uniform Title Vernacular Uniform Title Vernacular Segmented Title Index Earlier Title Later Title Host Item Linking Uncontrolled Subject Other Titles Other Title Translation Translated as Linking Translation of Linking Series Title Index Series Statement Series Normalized Series Statement Vernacular Series Statement Vernacular Segmented Publisher Publisher Normalized Sound Recording Imprint Director Performer Credits Production Credits Biographical Sketch Related Collections Digital Collection Genre Product

Fields used to rank Title results Most important to least Title1 Title2 Title3 Title4 Main Title Main Title Normalized Journal Title Index Title Vernacular Title Vernacular Segmented Varying Titles Titles Normalized Varying Titles Vernacular Segmented Main Uniform Title Main Uniform Title Vernacular Main Uniform Title Vernacular Segmented

1 word titles 2 word titles 3 word titles

Fields used to rank Journal Title results Most important to least Journal Title Index Journal Uniform Title Journal Title Abbreviation Journal Later Title Journal Earlier Title

Fields used to rank Author results Most important to least Main Author Main Author Normalized Main Author Vernacular Main Author Vernacular Segmented Director Performer Credits Production Credits Author

Fields used to rank Subject results Most important to least Subject Headings Subjects Vernacular Segmented Subjects Normalized Genre

What is irrelevant to relevancy? Many aspects of the record are NOT considered in relevancy ranking FORMAT is the biggest surprise, it seems

And, with that whirlwind tour… Image source