Andreas Doms, Biotechnology Center, Technical University Dresden. Agenda: ontology-based literature search, life science literature, GoPubMed, hands-on, curation tool.

Presentation transcript:

Agenda: life science literature, ontology-based literature search, GoPubMed, hands-on, curation tool, examples. Part I: GoPubMed. Part II: practical.

Life science literature
– Researchers work in the life sciences
– The PubMed literature database contains scientific abstracts
– Annual growth ca.
– Most important source for researchers
– A lot of unstructured information
– Rapid growth
– A common vocabulary helps searching

PubMed
– Great resource if one knows what one is looking for: “Kox1” has 17 hits
– But “diabetes” will produce >
– How can the common vocabulary of the Gene Ontology be used to facilitate literature search?

Ontology-based literature search: Gene Ontology, long result lists, concept recognition, the idea of ontology-based search

The Gene Ontology

Searching in PubMed

Concept recognition: identification of ontology terms…

We all know sequence alignment

Idea: use it for concept identification

Examples
– PMID : “Primed monocytes transcribed TNF mRNA at a higher rate than freshly isolated monocytes upon activation with LPS.” (for monocyte activation (GO: ))
– PMID : “Although all nm23 proteins contain nucleoside diphosphate (NDP) kinase activity, it has not been established that the enzyme activity mediated the various functions of nm23 proteins.” (for protein kinase activity (GO: ))
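The alignment idea can be sketched on the first example sentence above. This is a minimal illustration, not GoPubMed's actual algorithm: it assumes a Smith-Waterman style local alignment over word tokens instead of residues, and the scores, the prefix-based word comparison, and all function names are hypothetical choices made for the sketch.

```python
import re

# Illustrative scores (assumed, not tuned, not taken from GoPubMed).
MATCH, MISMATCH, GAP = 2, -1, -1


def tokenize(text):
    """Split text into lowercase alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def same_word(a, b, min_prefix=6):
    """Treat two words as equal if identical, or if the shorter one
    (at least min_prefix characters) is a prefix of the longer one.
    This crudely handles plurals such as monocyte/monocytes."""
    if a == b:
        return True
    n = min(len(a), len(b))
    return n >= min_prefix and a[:n] == b[:n]


def local_alignment_score(term, sentence):
    """Best local alignment score between the words of an ontology
    term and the words of a sentence (word-level Smith-Waterman)."""
    t, s = tokenize(term), tokenize(sentence)
    prev = [0] * (len(s) + 1)
    best = 0
    for i in range(1, len(t) + 1):
        cur = [0] * (len(s) + 1)
        for j in range(1, len(s) + 1):
            step = MATCH if same_word(t[i - 1], s[j - 1]) else MISMATCH
            cur[j] = max(0, prev[j - 1] + step, prev[j] + GAP, cur[j - 1] + GAP)
            best = max(best, cur[j])
        prev = cur
    return best


def coverage(term, sentence):
    """Alignment score as a fraction of the score of a perfect match."""
    return local_alignment_score(term, sentence) / (MATCH * len(tokenize(term)))


sentence = ("Primed monocytes transcribed TNF mRNA at a higher rate than "
            "freshly isolated monocytes upon activation with LPS.")
print(coverage("monocyte activation", sentence))  # 0.75
```

The term's two words align with "monocytes upon activation": two matches and one gap, hence 3 of a possible 4 points. A threshold on this fraction would decide whether the concept counts as recognized; where to put that threshold is left open here.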

Concept recognition is difficult
– Terms are not used in their lexical base form
– Inter-annotator agreement is between 65% and 85%
– Good entity recognition systems reach an F-value of ~90%
– General concepts may be misinterpreted (“development” has 8 meanings in WordNet)
– Machine learning can achieve very good results for disambiguation, but it needs training data
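For readers unfamiliar with the F-value quoted above: it is commonly defined as the weighted harmonic mean of precision and recall. The sketch below assumes that standard definition; the function name and the counts in the example are hypothetical and do not come from the slides.

```python
def f_measure(true_positives, false_positives, false_negatives, beta=1.0):
    """F-measure from raw counts; beta=1 weights precision and recall equally."""
    predicted = true_positives + false_positives
    actual = true_positives + false_negatives
    if true_positives == 0 or predicted == 0 or actual == 0:
        return 0.0
    precision = true_positives / predicted
    recall = true_positives / actual
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)


# Hypothetical counts: 90 correct annotations, 10 spurious, 10 missed.
print(round(f_measure(90, 10, 10), 3))  # 0.9
```

With these made-up counts, precision and recall are both 0.9, so the F-value is 0.9 as well, which is the order of magnitude the slide attributes to good entity recognition systems.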

Concept recognition
– thioredoxin-disulfide reductase activity (GO: )
– small-molecule carrier or transporter (GO: )
– Endonuclease activity, active with either ribo- or deoxyribonucleic acids and producing 5’-phosphomonoesters (GO: )
– [methionine synthase] reductase activity (GO: )
– structural constituent of chorion (sensu Insecta) (GO: )

Ontology-based literature search

GoPubMed (user interface, using GoPubMed, browsing results)
Query: levamisole inhibitor

Query: helicobacter pylori
Which anatomical structure is affected by Helicobacter pylori?

Query: rab5
Which biological process is the protein Rab5 involved in, and where is it located in the cell?

Query: aspirin
Which enzymes are inhibited by aspirin?

Query: leukemia
Is “leukemia” a hot research topic?

Query: nurse [au] london [ad]
What is Paul Nurse working on?

Query: blobel g[au] new york[ad]
What is Günter Blobel working on?

Blobel donated all of the money to the Frauenkirche in Dresden.
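The author queries above use PubMed field tags: [au] restricts a term to the author field and [ad] to the affiliation field. As a hedged sketch of how such a query could be built in a script, the code below assembles the term and a request URL for NCBI's E-utilities ESearch endpoint. Using E-utilities is an assumption for illustration; the slides do not say how GoPubMed itself submits queries, and the helper names are hypothetical. No network request is made.

```python
from urllib.parse import urlencode

# NCBI E-utilities search endpoint (public PubMed API).
ESEARCH = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/esearch.fcgi"


def pubmed_term(text=None, author=None, affiliation=None):
    """Combine free text with author [au] and affiliation [ad] field tags."""
    parts = []
    if text:
        parts.append(text)
    if author:
        parts.append(f"{author}[au]")
    if affiliation:
        parts.append(f"{affiliation}[ad]")
    return " AND ".join(parts)


def esearch_url(term, retmax=20):
    """Build the ESearch URL for a PubMed query (does not send it)."""
    return ESEARCH + "?" + urlencode({"db": "pubmed", "term": term, "retmax": retmax})


term = pubmed_term(author="blobel g", affiliation="new york")
print(term)               # blobel g[au] AND new york[ad]
print(esearch_url(term))
```

The explicit AND makes the combination of the two fields visible; PubMed also combines space-separated terms with AND by default, which is what the queries on the slides rely on.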

– GoPubMed can answer biomedical questions
– GoPubMed gives a good overview of the current biomedical literature
– GoPubMed provides a bibliometric analysis

Thank you so much!