Selection of Resources for the Development of an Information Service Program in Molecular Biology and Genetics Ansuman Chattopadhyay, PhD Information Specialist.

Presentation transcript:

Selection of Resources for the Development of an Information Service Program in Molecular Biology and Genetics Ansuman Chattopadhyay, PhD Information Specialist in Molecular Biology and Genetics Health Sciences Library System University of Pittsburgh

Topics
- Multi-Step Life Sciences Research
- Literature Retrieval
- Sequence Analysis
- Laboratory Resources
- University of Pittsburgh HSLS Molecular Biology Information Service Program

Life Sciences Research: A Multi-Step Process
Hypothesis Generation; Knowledge Mining; Sequence Analysis; Laboratory Bench Work; Mol Biol Information Service

Literature Retrieval Resources
Hypothesis Generation; Knowledge Mining; Sequence Analysis; Laboratory Bench Work
- PubMed
- CellSpace Knowledge Miner
- PubGene
- Genomatix BiblioSphere

Too much information 83,130 31,596

Literature Retrieval Resources

What is CellSpace?
CellSpace is a bioinformatics tool: a knowledge mining system that automatically detects, analyzes, and reports the logical relationships between four types of terms found in the research literature:
1. molecule: proteins, genes, drugs
2. function: biological processes and disease states
3. cell type
4. organism

What is CellSpace?
CellSpace Knowledge Miner: literature association among Molecules, Functions, Cells & Systems, and Organisms
Molecules: drugs, genes, proteins
Functions: biological functions, disease states
Cells & Systems: cells, sub-cellular components, tissues, and organs

What can you do with CellSpace?
- Start with a single protein (or other molecule) and find its functions, the diseases in which it is implicated, and related molecules.
- Start with a disease or biological function and find related molecules or related functions.
- Start with two or more functions and find the related molecules that they have in common.

What can you do with CellSpace?
- Start with results from a high-throughput experiment (such as a cluster of co-regulated genes from microarray analysis) and easily find the functions that they share.
- Start with the results of proteomics experiments and quickly screen the data to distinguish published interactions from novel ones.
- View the literature that supports the connections found in CellSpace.

CellSpace Knowledge Miner
Start with a disease or biological function and find related molecules or related functions.
Example: find molecules related to apoptosis.

[Screenshot: numbered steps 1 to 5, with callouts "Drag and drop" and "Click to select"]

Find molecules associated with “apoptosis”. Results are presented with a statistical likelihood value. Get references.

CellSpace Knowledge Miner

How CellSpace Works
CellSpace computers analyze the National Library of Medicine's MEDLINE database, performing proprietary statistical correlation analyses regarding the organisms, cell types, biological processes, and molecules reported in 655 selected life science research journals. The molecular relationships extracted from the literature are then stored in the CellSpace database, which can be queried via the CellSpace user interface. The information is updated every two weeks.
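CellSpace's correlation analysis is proprietary, so the following is only an illustrative sketch of one generic way to score literature associations: count how often two terms appear in the same abstract and ask how surprising that count would be if the terms were independent (a hypergeometric tail probability). The function names and the toy abstracts are assumptions made for illustration and are not taken from CellSpace.

```python
from collections import Counter
from itertools import combinations
from math import comb


def cooccurrence_counts(docs):
    """Count single terms and term pairs across documents.

    docs: list of sets, each set holding the terms found in one abstract.
    Pairs are stored as alphabetically sorted tuples.
    """
    term_counts = Counter()
    pair_counts = Counter()
    for terms in docs:
        term_counts.update(terms)
        for a, b in combinations(sorted(terms), 2):
            pair_counts[(a, b)] += 1
    return term_counts, pair_counts


def hypergeom_pvalue(n_docs, n_a, n_b, n_ab):
    """Probability of seeing at least n_ab shared documents by chance.

    n_docs: total abstracts; n_a, n_b: abstracts mentioning each term;
    n_ab: abstracts mentioning both. Smaller values suggest a stronger
    association than independence would predict.
    """
    total = comb(n_docs, n_b)
    upper = min(n_a, n_b)
    hits = sum(
        comb(n_a, k) * comb(n_docs - n_a, n_b - k)
        for k in range(n_ab, upper + 1)
    )
    return hits / total


# Hypothetical toy corpus: each set is the term list of one abstract.
toy_docs = [
    {"tp53", "apoptosis"},
    {"tp53", "apoptosis"},
    {"tp53"},
    {"brca1"},
]
term_counts, pair_counts = cooccurrence_counts(toy_docs)
p_value = hypergeom_pvalue(
    len(toy_docs),
    term_counts["tp53"],
    term_counts["apoptosis"],
    pair_counts[("apoptosis", "tp53")],
)
```

With only four abstracts the score is not meaningful; the point is the shape of the calculation, which would be run over every term pair in a large corpus.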

PubGene
The Network Browser tool displays literature association networks for a gene. The Set Cover Article Search tool lets you search the literature using a set covering algorithm, which is particularly useful for finding literature references for large sets of terms.
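The slides do not say which set covering algorithm PubGene uses. As a rough sketch of the general idea, the standard greedy approximation repeatedly picks the article that mentions the most query terms not yet covered, so a large term set can be supported by a short reading list. The function name, article identifiers, and gene labels below are hypothetical.

```python
def greedy_set_cover(query_terms, articles):
    """Pick a small set of articles that together mention the query terms.

    query_terms: iterable of terms to cover.
    articles: dict mapping an article id to the set of terms it mentions.
    Returns (chosen_article_ids, terms_left_uncovered).
    """
    uncovered = set(query_terms)
    chosen = []
    if not articles:
        return chosen, uncovered
    while uncovered:
        # Article covering the most still-uncovered terms (first one wins ties).
        best_id = max(articles, key=lambda a: len(articles[a] & uncovered))
        gained = articles[best_id] & uncovered
        if not gained:
            break  # no remaining article mentions any uncovered term
        chosen.append(best_id)
        uncovered -= gained
    return chosen, uncovered


# Hypothetical example data.
example_articles = {
    "article_a": {"g1", "g2"},
    "article_b": {"g2", "g3", "g4"},
    "article_c": {"g1"},
}
chosen, missing = greedy_set_cover(["g1", "g2", "g3", "g4", "g5"], example_articles)
```

Greedy selection is not guaranteed to find the smallest possible article set, but it is simple and usually close, which is why it is a common default for this kind of search.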

PubGene

PubGene
The query gene is shown in a bright red font in the graph, its direct neighbors in a darker red font, and neighbors of neighbors in a black font.
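The three display tiers described above correspond to distance 0, 1, and 2 from the query gene in the literature network. A minimal sketch, assuming the network is held as a plain adjacency mapping (the data structure, function name, and gene labels are illustrative assumptions, not PubGene's implementation):

```python
def network_layers(graph, query):
    """Split a gene network into direct neighbors and neighbors of neighbors.

    graph: dict mapping each gene to the set of genes it co-occurs with.
    Returns (direct, second), both as sets that exclude the query gene.
    """
    direct = set(graph.get(query, ())) - {query}
    second = set()
    for gene in direct:
        second.update(graph.get(gene, ()))
    # Keep only genes that are exactly two steps away.
    second -= direct | {query}
    return direct, second


# Hypothetical network.
toy_graph = {
    "A": {"B", "C"},
    "B": {"A", "D"},
    "C": {"A", "B"},
    "D": {"B"},
}
direct, second = network_layers(toy_graph, "A")
```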

BiblioSphere

Resources comparison
Resource | Availability | Coverage | Update frequency
CellSpace | Commercial; 2-week free trial | 655 Medline journals | Every 2 weeks
PubGene | V2.1: free; V2.3: commercial | All Medline journals; SP: H, M, R | V2.1: once a year; V2.3: every 2 weeks
BiblioSphere | 20 uses/month free | All Medline journals, abstracts only; SP: H, M, R | Continuous

Resources comparison
Resource | Search terms
CellSpace | Mol: gene, protein, drugs; Func: biological function, disease state; cell and tissue type
PubGene | Gene name
BiblioSphere | Gene name

Information Hubs
Hypothesis Generation; Knowledge Mining; Sequence Analysis; Laboratory Bench Work
Information hubs are molecular biology and genetics resources that serve as access points for retrieving a broad range of information through a small number of selected web-based public databases.

Information Hubs
- UCSC Genome Bioinformatics Resources: Gene's detail page, Genome Browser, Family Browser, Proteome Browser
- SwissProt
- LocusLink / Entrez Gene
- GeneCards
- GeneLynx
- Incyte Proteome BioKnowledge Library
- Human Protein Reference Database
- Organism Genome Consortium sites

Information Hubs: UCSC Gene's Detail Page
- Linked resources: SwissProt, LocusLink, OMIM, GeneCards, GeneLynx, CGAP, PubMed, AceView, Mouse Genome Informatics
- Sequence: genomic, mRNA, protein
- Gene expression data
- RNA structure
- Protein structure
- GO annotations: molecular function, bio pathways, cellular component
- UCSC genome browser, UCSC Family browser, UCSC Proteome browser
- Other species

Information Hubs

Proteome BioKnowledge Library
- Expression in: organ/tissue, cell type, tumor type
- Disease
- Sequence
- Gene Ontology terms
- Protein interactions
- Protein modifications
- Gene regulation
- Literature excerpts

Resources Comparison
Resource | Availability | Type | SP Coverage | Noteworthy Features
UCSC | Free | | H, M, R, etc. | Expression, Proteome/Family Browser
SwissProt/UniProt | Free | Curated | All | Protein information
LocusLink | Free | | H, M, R, N, P, etc. | Link to NCBI resources
GeneCards | Free | | H | Expression
GeneLynx | Free | | H, M, R |
Proteome BKL | Commercial | Curated | H, M, R, Y, N, pathogenic fungi | Literature excerpts
HPRD | Free | Curated | H | Protein interaction

Genome Browsers :

Nucleic Acids Research Database Issue Molecular Database Catalog

Growth of Molecular databases

Database Catalog

Sequence Analysis
Hypothesis Generation; Knowledge Mining; Sequence Analysis; Laboratory Bench Work
- Sequence Search
- Sequence Alignment
- MolBiol Tools: restriction mapping, PCR primer design
- Sequence Manipulation

Nucleic Acids Research Database Issue Web Server Catalog

Sequence Analysis healthlinks.washington.edu/index.cfm?id=210BCCB7-511A-4C6B-8B40-DFC47AABEA7F

Sequence Analysis

DNAStar LaserGene PC/Mac

Sequence Analysis: Vector NTI
Database: DNA/RNA, Protein, Oligo, Enzyme, Gel Marker, Blast Result, Analysis Result
Software: Vector NTI core, AlignX, ContigExpress, GenomBench, BioAnnotator

Sequence Analysis
The Vector NTI Advance software suite consists of five independent yet interconnected components:
- Vector NTI core: the cornerstone application of the Vector NTI suite; provides tools for sequence analysis and molecule manipulation
- AlignX: a multiple sequence alignment tool
- ContigExpress: a DNA sequence assembly and sequencing project management tool
- GenomBench: a tool for genomic DNA sequence analysis and annotation
- BioAnnotator: a tool for functional annotation of DNAs and proteins

Sequence Analysis
Using Vector NTI, molecular biologists can:
- Perform routine sequence analysis tasks such as restriction mapping, identifying protein coding regions, finding sequence motifs, and carrying out sequence similarity searches
- Generate recombinant cloning strategies and protocols
- Design and analyze PCR primers
- Catalog a growing number of plasmids and PCR primers in order to track the origin and lineage of recombinant molecules
- Run in silico gel electrophoresis
- Perform and edit multiple sequence alignments on proteins and nucleic acids
- Create publication-quality graphics, and more
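To make the restriction mapping task concrete, here is a minimal sketch that scans a DNA sequence for a few well-known recognition sites. It is not how Vector NTI works internally; the function name and the test sequence are assumptions for illustration. It reports only where each recognition site starts, not the exact cut position, and it relies on the three listed sites being palindromic, so scanning one strand is enough.

```python
# Recognition sequences for three common restriction enzymes.
SITES = {
    "EcoRI": "GAATTC",
    "BamHI": "GGATCC",
    "HindIII": "AAGCTT",
}


def find_sites(seq, enzymes=SITES):
    """Return 1-based start positions of each enzyme's recognition site.

    seq: DNA sequence as a string (case-insensitive).
    enzymes: dict mapping enzyme name to its recognition sequence.
    """
    seq = seq.upper()
    hits = {}
    for name, site in enzymes.items():
        positions = []
        start = seq.find(site)
        while start != -1:
            positions.append(start + 1)  # convert 0-based index to 1-based
            start = seq.find(site, start + 1)
        hits[name] = positions
    return hits


# Hypothetical test sequence with two EcoRI sites and one BamHI site.
site_map = find_sites("ttGAATTCaaGGATCCggGAATTC")
```

From the site positions, fragment lengths for an in silico digest follow by sorting the positions and taking differences, once the cut offset within each site is added.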

Laboratory Resources Hypothesis generation Knowledge Mining Sequence Analysis Laboratory Bench work Protocols: Useful Laboratory Resources:

Laboratory Resources
- Basic Protocol
- Alternate Protocol
- Commentary
- Critical Parameters
- Troubleshooting
- Time Considerations
- Key References
- Internet Resources

Laboratory Resources

HSLS Mol Biol Information Service

Website Usage Report

Workshops, May 2003-April 2004
1: Information Hubs
2: Sequence Similarity Searching
3: DNA Protein Analysis Tools
4: CellSpace Knowledge Miner
5: VectorNTI

One-on-one Consultation
Total: 70

“…only half of biomedical researchers using genome databases are familiar with the tools that can be used to actually access the data.”
“…all scientists on the planet must be empowered to use these powerful databases to unravel longstanding scientific mysteries.”
Andreas D. Baxevanis & Francis S. Collins, Nature Genetics, September 2002, Vol 32