Improving Bioscience Literature Search Interfaces National Library of Medicine June 19, 2009 Some research reported here supported by NSF DBI-0317510 and.

Slides:



Advertisements
Similar presentations
Critical Reading Strategies: Overview of Research Process
Advertisements

Effective Searching Strategies and Techniques
MY NCBI (module 4.5). MODULE 4.5 PubMed/How to Use MY NCBI Instructions - This part of the: course is a PowerPoint demonstration intended to introduce.
Bringing Order to the Web: Automatically Categorizing Search Results Hao Chen SIMS, UC Berkeley Susan Dumais Adaptive Systems & Interactions Microsoft.
Biomedical Journals’ Standards for Digital Images in Biomedical Articles Addeane S. Caelleigh * and Kirsten D. Miles + *UVa School of Medicine; former.
DNA BLAST Lab.
Project Proposal.
Using the Literature to Make Decisions. Objectives The student will identify information based questions from a clinical scenario. The student will classify.
Nursing Research through the MCTC Library Use this hands-on session to learn effective searching for your Nursing research assignments. We will take a.
Statistical Studies: Statistical Investigations
HEY YOU IN GRADE 10! How about a few pointers for that LITERACY TEST?
Encyclopedia of Genetics, Genomics, Proteomics and Bioinformatics USC School of Medicine Library.
Empirically Validated Web Page Design Metrics Melody Y. Ivory, Rashmi R. Sinha, Marti A. Hearst UC Berkeley CHI 2001.
Research Ideas Chapter 2 Dusana Rybarova Psyc 290B May
Caption Search for Bioscience Search Interfaces Marti Hearst, Anna Divoli, Jerry Ye, Mike Wooldridge UC Berkeley School of Information ACL Workshop on.
Article Database Tutorial (and quick guide to library resources)
Automating Tasks With Macros
1 User-Centered Design at the USPTO: Application to Patent IT Modernization Marti Hearst Chief IT Strategist, USPTO May 23, 2011.
Text, Tags and Thumbnails: Latest Trends in Bioscience Literature Search Special Libraries Association Pharmaceutical & Health Technologies Division Spring.
Evidence for Showing Gene/Protein Name Suggestions in Bioscience Literature Search Interfaces Anna Divoli, Marti A. Hearst, Michael A. Wooldridge School.
Automating Assessment of Web Site Usability Marti Hearst Melody Ivory Rashmi Sinha University of California, Berkeley.
SIMS 213: User Interface Design & Development Marti Hearst Tues, Feb 4, 2003.
Using WilsonSelect. WilsonSelect (or WilsonSelectPlus) is a database of full-text articles from magazines and journals. It covers a very wide range of.
1 User Centered Design and Evaluation. 2 Overview My evaluation experience Why involve users at all? What is a user-centered approach? Evaluation strategies.
SIMS 213: User Interface Design & Development Marti Hearst Thurs, Jan 27, 2005.
MaNIS Interface Project Mayjane Co Denise Green Jane Lee Rebecca Shapley.
Evidence for Showing Gene/Protein Name Suggestions in Bioscience Literature Search Interfaces Anna Divoli, Marti A. Hearst, Michael A. Wooldridge School.
Citances and What should our UI look like? Marti Hearst SIMS, UC Berkeley Supported by NSF DBI and a gift from Genentech.
II. Visiting the Library 1 updated 12/02/09. 2 Pat’s English class visits the BCC Library to locate literary criticism on Charlotte Perkins Gilman’s story,
Finding Psychology Research Articles for Review 1.
Evaluation IMD07101: Introduction to Human Computer Interaction Brian Davison 2010/11.
‘Hints for Designing Effective Questionnaires ’
SUNIL GAHLAWAT LU GAN MIYA SYLVESTER YIRAN WANG Usability and Utility of TopicLens a Visualization System for the Exploration of Topic Models.
Accessing Research Material Contents Slide 2-15 Introduction to Library World Navigation and Research Material Slide Using and Pubmed, and Google.
ACTIVITY. THE BRIEF You need to provide solid proof to your stakeholders that your mobile website meets the needs of your audience. You have two websites.
Tutorial Guide no. 16 OVID DATABASE: GLOBAL HEALTH.
Univeristy of Tennessee Knoxville Increasing Effective Student Use of the Scientific Journal Literature Award: DUE NSF:
AELDP ACADEMIC READING. Questions Do you have any questions about academic reading?
Put it to the Test: Usability Testing of Library Web Sites Nicole Campbell, Washington State University.
Search - on the Web and Locally Related directly to Web Search Engines: Part 1 and Part 2. IEEE Computer. June & August 2006.
Title and Abstract Description of paper Summarize the paper.
XP New Perspectives on The Internet, Sixth Edition— Comprehensive Tutorial 3 1 Searching the Web Using Search Engines and Directories Effectively Tutorial.
How to Prepare an Annotated Bibliography
1.Describe in detail what you see (make observations). 2.What questions would you as a biologist ask after seeing the mold?
Usability Assessment Methods beyond Testing Chapter 7 Evaluating without users.
Searching the web Enormous amount of information –In 1994, 100 thousand pages indexed –In 1997, 100 million pages indexed –In June, 2000, 500 million pages.
Software Engineering User Interface Design Slide 1 User Interface Design.
Indexing of Tables and Figures: Scientists’ Reaction Carol Tenopir University of Tennessee web.utk.edu/~tenopir/
Our Situation as Discovery Tools Were Released 2009/10 UNCG had a federated search product (EHIS), but due to its limitations we didn’t feature it very.
Design Process … and some design inspiration. Course ReCap To make you notice interfaces, good and bad – You’ll never look at doors the same way again.
QUESTIONNAIRE DESIGN.
SPRINGER ONLINE
The Faceted Interface: Optimizing the Usability of Enterprise Search Glenn Barnett, Principal Consultant, Molecular.
ProQuest Jennifer Jackson, Regional Sales Manager, ProQuest.
CACAO Training Fall Community Assessment of Community Annotation with Ontologies (CACAO)
Copyright OpenHelix. No use or reproduction without express written consent1.
A Usability Assessment of EBSCO Discovery Service David Comeaux – Web Development Librarian Emily Frank – Instructional Technologies Librarian Mike Waugh.
UWMS Data Mining Workshop Content Analysis: Automated Summarizing Prof. Marti Hearst SIMS 202, Lecture 16.
Table of Contents – Part B HINARI Resources –Clinical Evidence –Cochrane Library –EBM Guidelines –Essential Evidence Plus –HINARI EBM Journals.
LAB REPORTS Some guidelines. Abstract Summarise your report in under 200 words What was your question? How did you investigate it? What did you find?
UNIVERSITAT DE BARCELONA Facultat de Biblioteconomia i Documentació Grau d’Informació i Documentació Research Methods Research reports Professor: Ángel.
From the initial page of the Cochrane Library, we have clicked on the Cochrane Reviews: By Topic hyperlink. This has displayed the Topics for Cochrane.
Developing systems for full-text search in biomedicine. Anna Divoli School of Information University of California, Berkeley 07 Aug 2007 University of.
Research Methods in Psychology Introduction to Psychology.
Assess usability of a Web site’s information architecture: Approximate people’s information-seeking behavior (Monte Carlo simulation) Output quantitative.
Research Skills for Your Essay Where to begin…. Starting the search task for real Finding and selecting the best resources are the key to any project.
Access to Electronic Journals and Articles in ARL Libraries By Dana M. Caudle Cecilia M. Schmitz.
Research – using the Internet and other secondary sources and Source analysis Top Tips – get ready to make your own notes!
Chapter 6. Data Collection in a Wizard-of-Oz Experiment in Reinforcement Learning for Adaptive Dialogue Systems by: Rieser & Lemon. Course: Autonomous.
Kenneth Baclawski et. al. PSB /11/7 Sa-Im Shin
Presentation transcript:

Improving Bioscience Literature Search Interfaces National Library of Medicine June 19, 2009 Some research reported here supported by NSF DBI and a gift from Genentech Marti A. Hearst Professor School of Information, UC Berkeley

NLM June 2009Marti Hearst BioText Project Goals Provide flexible, useful, appealing search for bioscientists. Focus on: Full text journal articles New language analysis algorithms New search interfaces Promote usability design to the bioinformatics community

NLM June 2009Marti Hearst The Importance of Figures and Captions Observations of biologists’ reading habits: It has often observed that biologists focus on figures+captions along with title and abstract. KDD Cup 2002 The objective was to extract only the papers that included experimental results regarding expression of gene products and to identify the genes and products for which experimental results were provided. ClearForest+Celera did well in part by focusing on figure captions, which contain critical experimental evidence.

NLM June 2009Marti Hearst

NLM June 2009Marti Hearst Our Idea Make a full text search engine for journal articles that focuses on showing figures Make it possible to search over caption text (and text that refers to captions) Try to group the figures intelligently

NLM June 2009Marti Hearst Related Work Cohen & Murphy: Parsed structure of image captions Extract facts about subcellular localization Yu et al. Created a small image taxonomy; classified images according to these with SVMs Yu & Lee: BioEx: Link sentences from an abstract to images in the same paper; show those when displaying a paper. Not focused on a full search interface; can’t search over caption text.

Digression: Designing for Usability

NLM June 2009Marti Hearst User-Centered Design Needs assessment Find out  who users are  what their goals are  what tasks they need to perform Task Analysis  Characterize what steps users need to take  Create scenarios of actual use  Decide which users and tasks to support Iterate between Designing Evaluating

NLM June 2009Marti Hearst User Interface Design is an Iterative Process Design Prototype Evaluate

NLM June 2009Marti Hearst Developing the BioText Search Interface Main idea: a search interface that meets the unique needs of bioscientists. Hypothesis: the articles’ figures should be exposed in the interface. Process: Did interviews, designed mock-up Made an initial prototype Did a pilot study Used these results to redesign Evaluated the new design Results: highly positive responses.

NLM June 2009Marti Hearst Small Details Matter UIs for search especially require great care in small details In part due to the text-heavy nature of search A tension between more information and introducing clutter How and where to place things is important People tend to scan or skim Only a small percentage reads instructions

NLM June 2009Marti Hearst Small Details Matter Example: Google spelling correction:  Used a long sentence at the top of the page: “If you didn’t find what you were looking for …”  People complained they got results, but not the right results.  In reality, the spellchecker had suggested an appropriate correction.

NLM June 2009Marti Hearst Small Details Matter The fix: Analyzed logs, saw people didn’t see the correction:  clicked on first search result,  didn’t find what they were looking for (came right back to the search page  scrolled to the bottom of the page, did not find anything  and then complained directly to Google Solution was to repeat the spelling suggestion at the bottom of the page. More adjustments: The message is shorter, and different on the top vs. the bottom Interview with Marissa Mayer by Mark Hurst:

NLM June 2009Marti Hearst Biotext: Pilot Usability Study Primary Goal: Determine whether biological researchers would find the idea of caption search and figure display to be useful or not. Secondary Goal: If yes, how best to support these features in the interface?

NLM June 2009Marti Hearst Method Told participants we were evaluating a new search interface (tip: don’t say “our” interface) Asked them to use each design on their own queries (order of presentation was varied) Had them fill out a questionnaire after each interface session Also had open-ended discussions about the designs

NLM June 2009Marti Hearst Participants

NLM June 2009Marti Hearst Captions + Figure View

NLM June 2009Marti Hearst

NLM June 2009Marti Hearst

NLM June 2009Marti Hearst Captions + Figure & Thumbnails

NLM June 2009Marti Hearst Results Captions + Figure View 7 = strongly agree 1 = strong disagree participant # participant #

NLM June 2009Marti Hearst Results 7 out of 8 said they would want to use either CF or CFT in their bioscience journal article searches The 8 th thought figures would not be useful in their tasks Many participants noted that caption search would be better for some tasks than others Two of the participants preferred CFT to CF; the rest thought CFT was too busy. Best to show all the thumbnails that correspond to a given article after full text search Best to show only the figure that corresponds to the caption in the caption search view

NLM June 2009Marti Hearst

NLM June 2009Marti Hearst

NLM June 2009Marti Hearst Results, cont. All four participants who saw the Grid view liked it, but noted that the metadata shown was insufficient; If it were changed to include title and other bibliographic data, 2 of the 4 who saw Grid said they would prefer that view over the CF view.

Current Design

NLM June 2009Marti Hearst Current Design Indexes the PubMedCentral open access journal article collection, with more than: 300 journals 129,000 articles 247,000 figures 104,000 tables

NLM June 2009Marti Hearst

NLM June 2009Marti Hearst

NLM June 2009Marti Hearst

NLM June 2009Marti Hearst

NLM June 2009Marti Hearst

NLM June 2009Marti Hearst

NLM June 2009Marti Hearst

NLM June 2009Marti Hearst

NLM June 2009Marti Hearst

NLM June 2009Marti Hearst

NLM June 2009Marti Hearst Second Study Modified, improved interface 20 participants 6 grad students, 6 postdocs, 1 faculty, 7 other  Cell or molecular biology, genetics or genomics, biochemistry, evolutionary biology, bioinformatics. All use PubMed, most as primary tool

NLM June 2009Marti Hearst Second Study Procedure: Session lasted ~1 hour Participants were shown the interface and its views, and then asked to use it and respond. They then assessed the interfaces explicitly. Measures: Focus on subjective responses. Intent to use is a reliable indicator of actual usage. ( Venkatesh & Morris 03, Sun & Zhang 06)

NLM June 2009Marti Hearst Results 19 out of 20 wanted articles’ figures alongside the full text search results. 15 out of 20 would use a caption search and figure display interface either frequently or sometimes 4 said rarely 1 said undecided.

NLM June 2009Marti Hearst Results 10 out of 20 would use a tool for searching the text of tables and their captions either frequently or sometimes 7 said they would use it rarely if at all, 2 said they would never use it 1 was undecided.

NLM June 2009Marti Hearst Results

NLM June 2009Marti Hearst Full Text View: Favorable Aspects

NLM June 2009Marti Hearst Full Text View: Unfavorable Aspects

NLM June 2009Marti Hearst Figure Caption Views: Favorable Aspects

NLM June 2009Marti Hearst Figure Caption Views: Unfavorable Aspects

NLM June 2009Marti Hearst Table View: Favorable Aspects

NLM June 2009Marti Hearst Table View: Unfavorable Aspects

NLM June 2009Marti Hearst Now Google is Doing It!

Showing Related Terms in Bioscience Literature Search Needs assessment and low-fi evaluation

NLM June 2009Marti Hearst First Questionnaire General information about how they search and what related information they want to see. 38 participants 22 grad students, 6 postdocs, 5 faculty, 5 other Systems biology, bioinformatics, genomics, biochemistry, cellular and evolutionary biology, microbiology, physiology, …

Participants’ Characteristics

NLM June 2009Marti Hearst Results Related Information Type Avg rating # selecting 1 or 2 Gene’s Synonyms4.42 Gene’s Synonyms refined by organism4.02 Gene’s Homologs3.75 Genes from same family: parents3.47 Genes from same family: children 3.64 Genes from same family: siblings 3.29 Genes this gene interacts with3.74 Diseases this gene is associated with 3.46 Chemicals/drugs this gene is associated with Localization information for this gene (Do NOT want this) (Neutral) (REALLY want this)

NLM June 2009Marti Hearst Second Questionnaire Evaluating 4 designs for gene/protein name suggestions 19 participants 4 grad students, 7 postdocs, 3 faculty, 5 other Wide range of specializations

NLM June 2009Marti Hearst Design 1: Baseline

NLM June 2009Marti Hearst Design 2: Links

NLM June 2009Marti Hearst Design 3: Checkboxes

NLM June 2009Marti Hearst Design 4: Grouped Links

NLM June 2009Marti Hearst Results DesignParticipants who rated design 1st or 2nd Average rating (1=low, 4=high) #% 3 (checkboxes) (grouped links) (links) (baseline) 001.6

NLM June 2009Marti Hearst Results: More Detail Strong desire for the search system to suggest information closely related to gene/protein names. Some interest in less closely related information. Most participants want to see organism names in conjunction with gene names. A majority of participants prefer to see term suggestions grouped by type (synonyms, homologs, etc).

NLM June 2009Marti Hearst Results: More Detail Split in preference between single-click hyperlink interaction (categories or single terms) and checkbox-style interaction. The majority of participants prefers to have the option to chose either individual names or whole groups with one click. Split in preference between the system suggesting only names that it is highly confident are related and include names that it is less confident about under a “show more” link.

NLM June 2009Marti Hearst Summary: BioText Search Studies Nearly all participants strongly desire Full text search Figure display in search results Impediments to adoption Needs to index all articles Needs to be in the primary search tool(s) Participants also want to see term suggestions that are closely related to their query.