Presentation is loading. Please wait.

Presentation is loading. Please wait.

Demystifying Endeca’s search results ranking Kristina Spurgin with input & support from Ben Pennell & Jeff Campbell UNC Libraries.

Similar presentations


Presentation on theme: "Demystifying Endeca’s search results ranking Kristina Spurgin with input & support from Ben Pennell & Jeff Campbell UNC Libraries."— Presentation transcript:

1 Demystifying Endeca’s search results ranking Kristina Spurgin with input & support from Ben Pennell & Jeff Campbell UNC Libraries

2

3

4

5

6 Endeca details Search Configuration and Relevance Ranking – The supported search methods and details on how results are ranked for each TRLN Endeca Data Model – The major field groups, with brief descriptions of their use, and indexing and display properties. Endeca Extract and Mappings Spreadsheet – Details on how MARC fields get mapped into Endeca fields

7 TRLN Endeca Search Interfaces Words anywhere (i.e. Keyword) Author Title Journal title Subject ISBN/ISSN (Publisher)

8 How to think about RelRank Image source

9 Spotting the relevancy strata Subject search relevancy strategy – Exact phrase match, starting from beginning of a single field is the gold-standard match – Subject heading search: commonplace bookcommonplace book

10

11 PubDateSort = 1700 No pub date!

12

13 A more complex search: keyword (AKA “Words anywhere”) “Searches all indexed fields, but only uses some fields to rank results.” -- Search Configuration and Relevance RankingSearch Configuration and Relevance Ranking

14 What fields are indexed? Guide to the TRLN Endeca Data Model gives some info Guide to the TRLN Endeca Data Model

15 What fields are indexed? Endeca Extract and Mappings Spreadsheet gives the detailed info. Endeca Extract and Mappings Spreadsheet

16 More on keyword search (AKA “Words anywhere”) “Matches in the main title, subject headings, and main author fields will be given the highest ranking.” -- Search Configuration and Relevance RankingSearch Configuration and Relevance Ranking

17

18

19 More on keyword search (AKA “Words anywhere”) “Queries that match as a phrase are ranked higher than those which do not.” -- Search Configuration and Relevance RankingSearch Configuration and Relevance Ranking

20 More on keyword search (AKA “Words anywhere”) “Exact term matches are ranked higher than those returned because of spell correction, stemming, and thesaurus lookups.” -- Search Configuration and Relevance RankingSearch Configuration and Relevance Ranking

21

22 More on keyword search (AKA “Words anywhere”) “Matches in tables of contents, summaries, or selected EAD elements are not used to determine ranking.” -- Search Configuration and Relevance RankingSearch Configuration and Relevance Ranking

23 An aside on keyword search (AKA “Words anywhere”)

24 Fields used to rank Keyword results Most important to least Main Title Main Title Normalized Title Vernacular Title Vernacular Segmented Subject Headings Subjects Normalized Subjects Vernacular Segmented Main Author Main Author Normalized Main Author Vernacular Main Author Vernacular Segmented Company Varying Titles Varying Titles Vernacular Segmented Other Authors Other Author Translation Authors Normalized Main Uniform Title Main Uniform Title Vernacular Main Uniform Title Vernacular Segmented Uniform Title Uniform Title Vernacular Uniform Title Vernacular Segmented Title Index Earlier Title Later Title Host Item Linking Uncontrolled Subject Other Titles Other Title Translation Translated as Linking Translation of Linking Series Title Index Series Statement Series Normalized Series Statement Vernacular Series Statement Vernacular Segmented Publisher Publisher Normalized Sound Recording Imprint Director Performer Credits Production Credits Biographical Sketch Related Collections Digital Collection Genre Product

25 Fields used to rank Title results Most important to least Title1 Title2 Title3 Title4 Main Title Main Title Normalized Journal Title Index Title Vernacular Title Vernacular Segmented Varying Titles Titles Normalized Varying Titles Vernacular Segmented Main Uniform Title Main Uniform Title Vernacular Main Uniform Title Vernacular Segmented

26 1 word titles 2 word titles 3 word titles

27 Fields used to rank Journal Title results Most important to least Journal Title Index Journal Uniform Title Journal Title Abbreviation Journal Later Title Journal Earlier Title

28 Fields used to rank Author results Most important to least Main Author Main Author Normalized Main Author Vernacular Main Author Vernacular Segmented Director Performer Credits Production Credits Author

29 Fields used to rank Subject results Most important to least Subject Headings Subjects Vernacular Segmented Subjects Normalized Genre

30 What is irrelevant to relevancy? Many aspects of the record are NOT considered in relevancy ranking FORMAT is the biggest surprise, it seems

31 And, with that whirlwind tour… Image source


Download ppt "Demystifying Endeca’s search results ranking Kristina Spurgin with input & support from Ben Pennell & Jeff Campbell UNC Libraries."

Similar presentations


Ads by Google