LTER IM Meeting 2008 – Benson, Boose, Bohm, Gries, Gu, Kaplan, Koskela, Laney, Porter, Remillard, Sheldon and others.

Slides:



Advertisements
Similar presentations
EQUINOX DATA DELIVERY SYSTEM May 31, 2011 –Elizabeth Hill Equinox.uwo.ca.
Advertisements

Mark Servilla & Duane Costa LTER Network Office LTER 2012 All Scientist Meeting LTER Network Office.
The North American Carbon Program Google Earth Collection Peter C. Griffith, NACP Coordinator; Lisa E. Wilcox; Amy L. Morrell, NACP Web Group Organization:
Natural Language Processing WEB SEARCH ENGINES August, 2002.
Advanced Searching Engineering Village.
Leveraging Your Taxonomy to Increase User Productivity MAIQuery and TM Navtree.
Page 1 June 2, 2015 Optimizing for Search Making it easier for users to find your content.
Information Retrieval in Practice
Interfaces for Retrieval Results. Information Retrieval Activities Selecting a collection –Talked about last class –Lists, overviews, wizards, automatic.
Automating Tasks With Macros
Essential Software Architecture Chapter Two - Introducing the Case Study Ian Gorton CS590 – Winter 2008.
Automating Tasks With Macros. 2 Design a switchboard and dialog box for a graphical user interface Database developers interact directly with Access.
INFORMATION RETRIEVAL WEEK 1 AND 2
Toward Automatic Processing and Indexing of Microfilm.
Search engines. The number of Internet hosts exceeded in in in in in
Information retrieval Finding relevant data using irrelevant keys Example: database of photographic images sorted by number, date. DBMS: Well structured.
©2006, CSA Using COS Funding Alert Automatic Notification of Relevant New Opportunities from the World’s Largest Funding Database ™ Easily Accessible Via.
Federated Searching: The ABC’s of HSE, XML, & Z39.50 Harry Samuels Product Manager Linking & Searching August 27, 2004.
Overview of Search Engines
The AdWords Toolbox All the tools you need to make your ad run more efficiently!
Long-Term Ecological Research working_groups/controlled_vocabulary Working Group: “Synthesis through data.
Databases & Data Warehouses Chapter 3 Database Processing.
Improving Data Discovery in Metadata Repositories through Semantic Search Chad Berkley 1, Shawn Bowers 2, Matt Jones 1, Mark Schildhauer 1, Josh Madin.
Based on material developed by Samantha Romanello and
 Workshops: March & May 2011 and lots of VTCs! Details at:
Systems Analysis – Analyzing Requirements.  Analyzing requirement stage identifies user information needs and new systems requirements  IS dev team.
Building Search Portals With SP2013 Search. 2 SharePoint 2013 Search  Introduction  Changes in the Architecture  Result Sources  Query Rules/Result.
Controlled Vocabulary Working Group PRESENTED BY JOHN PORTER.
LTER IMC Meeting Sept Past Activities Created list of about ~650 terms based on widely-used LTER EML Keywords Autocomplete search aid added to.
Information Retrieval and Knowledge Organisation Knut Hinkelmann.
Web Searching Basics Dr. Dania Bilal IS 530 Fall 2009.
The Business Model and Strategy of MBAA 609 R. Nakatsu.
Search Engine By Bhupendra Ratha, Lecturer School of Library and Information Science Devi Ahilya University, Indore
Controlled Vocabulary Working Group Virtual Water Cooler Session April 6-7, 2009 Moderator: John Porter rm.action?confKey=jhp7e.
XP New Perspectives on The Internet, Sixth Edition— Comprehensive Tutorial 3 1 Searching the Web Using Search Engines and Directories Effectively Tutorial.
The Internet 8th Edition Tutorial 4 Searching the Web.
Search Engines. Search Strategies Define the search topic(s) and break it down into its component parts What terms, words or phrases do you use to describe.
Search engines are used to for looking for documents. They compile their databases by employing "spiders" or "robots" to crawl through web space from.
Module 10 Administering and Configuring SharePoint Search.
The Business Model of Google MBAA 609 R. Nakatsu.
Search Engine Architecture
How do I search the Internet? Narrow your topic and its description; pull out key words and categories.
Introduction to Morpho BEAM Workshop Samantha Romanello Long Term Ecological Research University of New Mexico.
 Finalize VOCAB “Terms of Reference”  Define use cases for the keyword database and its development  Develop procedures for capturing and managing.
MetaLib 4 User Guide. 2 MetaLib 4 Access MetaLib at: – MetaLib may be used at two different levels –
Search Engines By: Faruq Hasan.
Mercury – A Service Oriented Web-based system for finding and retrieving Biogeochemical, Ecological and other land- based data National Aeronautics and.
 Enhancing User Experience  Why it is important?  Discussing user experience one-by-one.
Controlled Vocabulary Giri Palanisamy Eda C. Melendez-Colom Corinna Gries Duane Costa John Porter.
Introduction to Morpho RCN Workshop Samantha Romanello Long Term Ecological Research University of New Mexico.
Collaborative Project Database Margaret O’Brien, Corinna Gries, Wade Sheldon, Jonathan Walsh, John Porter, Sven Bohm, James Brunt, Suzanne Remillard, Ken.
What is Seo? Search Engine Optimization for Dummies.
Controlled Vocabulary Working Group Activities
Integrated Departmental Information Service IDIS provides integration in three aspects Integrate relational querying and text retrieval Integrate search.
HIVE-DRYAD Integration. For Curators Use HIVE to generate subject, taxon, and spatial terms suggestion. Curator’s needs: – Get terms suggestion from HIVE.
Controlled Vocabulary Working Group Activities
June 30, 2005 Public Web Site Search Project Update: 6/30/2005 Linda Busdiecker & Andy Nguyen Department of Information Technology.
SEMINAR ON INTERNET SEARCHING PRESENTED BY:- AVIPSA PUROHIT REGD NO GUIDED BY:- Lect. ANANYA MISHRA.
Information Retrieval in Practice
Section 10.1 Define scripting
Search Engine Architecture
Improving Data Discovery Through Semantic Search
Search Engine Architecture
Search Techniques and Advanced tools for Researchers
LTER Metadata Query Interface – Current Status and Future Challenges
Anatomy of a Search Search The Index:
Agenda What is SEO ? How Do Search Engines Work? Measuring SEO success ? On Page SEO – Basic Practices? Technical SEO - Source Code. Off Page SEO – Social.
LTER Controlled Vocabulary Virtual WaterCooler - July, 2018
Information Retrieval and Web Design
Presentation transcript:

LTER IM Meeting 2008 – Benson, Boose, Bohm, Gries, Gu, Kaplan, Koskela, Laney, Porter, Remillard, Sheldon and others

 Response to requests in VTC Aug. 2008

Duane Costa

 Goal: Enhance search results for end-user by extending the list of matching search terms to include broader/narrower/related terms  How: Query a thesaurus via web service and use the extended set of terms to expand the search; two possible approaches (see next slide)  Potential problem: Could overwhelm user with too many search results  Extended search mode could be made optional for user, toggled on/off with a checkbox  Or, user could be offered a list of additional terms to select from, where only the selected terms would be included in the extended search

 Approach #1: Extend list of user- entered terms by dynamically querying a thesaurus via web service at search time  Web service is used at time of search, adding overhead to search time  Too many search terms could severely degrade performance of Metacat search  Only terms entered by user are queried via web service (this is an advantage over Approach #2, where all terms in an EML document must be queried via web service)  Approach #2: (1) Evaluate terms in each EML document; (2) For each term, query thesaurus via web service to get additional terms; (3) Store additional terms for each document somewhere external to the document (e.g. database table)  Web services are used during “off- hours” and results are cached locally in a table  Need to decide which terms in EML document should be queried via web services; potentially many  Need a good indexing scheme to efficiently retrieve all matching terms for an EML document  Whenever an EML document is updated, the cached set of extended terms must be updated

John Porter

GOALS  To make it easier for metadata creators to use existing/accepted terms rather than making up new ones  To analyze metadata content to suggest suitable terms HOW  Interfaces  Web interface – returns string that can be cut-and-pasted into documents  Web service – accepts XML queries (tentative suggestions) and returns XML results  Technology  Compare words in documentation with existing list(s) to get initial suggestions  Expand the words that do match to include more general and more specific terms  Table of synonyms

1. Document to Scan for words 2. Select the Word(s) that might make good Keywords Fish, Bird, Forest, Carbon 3. Select Related Terms that also would make good keywords OR Salmon Suggest your own word: Anadromous species Commercial fishing Marine fishes 4. XML result to paste into document: fish Commercial fishing

 Create Preferred Word list  With tools that display list quickly  Process for adding new terms  Ordered list so present only the most important ones first  Both NET and Site relevance “permafrost”  An tools that use that list “google term list style”

 List sources  EML Keywords  EML attributes names and labels  Single words from Abstracts and titles and publications  Criteria for Ordering  How often does the term appear in metacat searches?  Number of sites using term  Number datasets that use the term (weight by total number of site datasets)  Is it in GCMD list?  Is it in NBII thesaurus and if so how many related terms?

 Periodically develop hierarchy of 500 highest rated terms  Periodially generate synonomy that includes preferred version  Best Practices on keywords

 Tools to automatically generate ranked list from sources  AJAX-based web page widget/insert that uses list  Group charged with creation of hierarchy /synonomy etc.  Get funding to do this  Scientists  Need way to code hierarchy in EML?