Presentation is loading. Please wait.

Presentation is loading. Please wait.

Selection of Resources for the Development of an Information Service Program in Molecular Biology and Genetics Ansuman Chattopadhyay, PhD Information Specialist.

Similar presentations


Presentation on theme: "Selection of Resources for the Development of an Information Service Program in Molecular Biology and Genetics Ansuman Chattopadhyay, PhD Information Specialist."— Presentation transcript:

1 Selection of Resources for the Development of an Information Service Program in Molecular Biology and Genetics Ansuman Chattopadhyay, PhD Information Specialist in Molecular Biology and Genetics Health Sciences Library System University of Pittsburgh

2 Topics Multi Step Life Sciences Research Literature Retrieval Sequence Analysis Laboratory Resources University of Pittsburgh HSLS Molecular Biology Information Service Program

3 Life Sciences Research- A Multi Step Process Hypothesis Generation Knowledge Mining Sequence Analysis Laboratory Bench Work Mol Biol Information Service

4 Hypothesis Generation Knowledge Mining Sequence Analysis Laboratory Bench Work Literature Retrieval Resources PubMed -- CellSpace Knowledge Miner --PubGene --Genomatix BiblioSphere

5 Too much information 83,130 31,596

6 www.cellomics.cellspace.com Literature Retrieval Resources http://www.pubgene.org http://www.genomatix.de/

7 www.cellomics.cellspace.com What is CellSpace ? C ellSpace is a bioinformatics tool-- a knowledge mining system that automatically detects, analyzes, and reports the logical relationships between four types of terms found in the research literature: 1.molecule: proteins, genes, drugs 2.function: biological processes and disease states 3.cell type 4.organism

8 ++++ ++++ ++__ ++__ Molecules Functions Cells & Systems Organisms Molecules Functions Cells & Systems Organisms Literature Association Molecules: Molecules Drugs Genes Proteins Functions: Biological Functions Disease States Cells & Systems: Cells, Sub-cellular Components,Tissues and Organs CellSpace Knowledge Miner What is CellSpace ?

9 Start with a single protein (or other molecule) and find its functions, the diseases in which it is implicated, and related molecules. Start with a disease or biological function and find related molecules, or related functions. Start with two or more functions, and find the related molecules that they have in common What you can do with CellSpace?

10 Start with results from a high-throughput experiment (such as a cluster of co-regulated genes from microarray analysis), and easily find the functions that they share. Start with the results of proteomics experiments, and quickly screen the data to distinguish published interactions from novel ones..View the literature that supports the connections found in CellSpace. What you can do with CellSpace?

11 Start with a disease or biological function and find related molecules, or related functions Find molecules related to apoptosis CellSpace Knowledge Miner

12 1 2 3 Drag and drop 5 4 Click to select

13 Find molecules associated with “apoptosis” Results are presented with statistical likelihood value Get references

14 CellSpace Knowledge Miner

15 CellSpace computers analyze the National Library of Medicine's MEDLINE database, performing proprietary statistical correlation analyses regarding the organisms, cell types, biological processes, and molecules reported in 655 selected life science research journals. The molecular relationships extracted from the literature are then stored in the CellSpace database, which can be queried via the CellSpace user interface. The information is updated every two weeks How CellSpace Works?

16 The Network Browser tool displays literature association networks for a gene. The Set Cover Article Search tool will let you search the literature using a set covering algorithm. The set covering algorithm is particularly useful to search for literature references for large sets of terms.PubGene

17 PubGene

18 The query gene is shown with bright red font in the graph, its direct neighbors are shown with darker red font, and neighbours of neighbours are shown with black font PubGene

19 BiblioSphere

20 BiblioSphere

21 BiblioSphere

22 BiblioSphere

23 BiblioSphere

24 BiblioSphere

25 Resources comparison AvailabilityCoverageUpdate frequency CellSpace Commercial 2 weeks free trial 655 Medline journals Every 2 weeks PubGene V2.1 free V2.3 - commercial All Medline Journals SP: H,M,R V2.1-once in a year V2.3- every 2 weeks BiblioSphere20 use/month free All Medline Journals Abstract only SP: H,M,R continuous

26 Resources comparison Search Terms CellSpace Mol: gene, protein, Drugs Func: Biological func, Disease state, Cell and tissue type, PubGeneGene name BibliosphereGene name

27 Information Hubs Hypothesis Generation Knowledge Mining Sequence Analysis Laboratory Bench Work The molecular biology and genetics resources that can serve as information hubs, an access point to retrieve a broad range of information through a small number of selected web-based public databases

28 Information Hubs UCSC Genome Bioinformatics Resources Gene’s detail pageGenome Browser Family Browser Proteome Browser SwissProt LocusLink / Entrez Gene Gene Cards Gene Lynx Incyte Proteome Bioknowledge Library Human Protein Reference Database Organism Genome Consortium sites

29 Information Hubs UCSC Gene’s Detail Page SwissProt LocusLink OMIM GeneCards GeneLynx CGAP PubMed AceView Mouse Genome Informatics Sequence Genomic,mRNA Protein Gene Expression Data RNA Structure Protein Structure GO Annotations Molecular function Bio pathways Cellular component UCSC genome browser UCSC Family browser UCSC Proteome browser Other Species

30 Information Hubs http://genome.ucsc.edu/cgi-bin/hgGene?hgsid=31408663&db=hg16&hgg_gene=U14680&hgg_chrom=chr17&hgg_start=41570859&hgg_end=41650551

31 Information Hubs http://www.ncbi.nlm.nih.gov/LocusLink/LocRpt.cgi?l=672

32 Information Hubs

33 http://bioinfo.weizmann.ac.il/cards-bin/carddisp?BRCA1&search=BRCA1&suff=txt

34 Information Hubs http://www.hprd.org/protein/00218

35 Information Hubs

36 Proteome BioKnowledge Library Expression in Organ/Tissue Cell Type Tumor Type Disease Sequence Gene Ontology terms Protein Interactions Protein Modifications Gene Regulation Literature Excerpts

37 Resources Comparison AvailabilityTypeSP CoverageNoteworthy Features UCSCFreeH,M,R, etcExpression, Proteome/Fam ily Browser SwissProt/ uniprot FreeCuratedALLProtein information LocusLinkFreeH,M,R,N,P etc Link to NCBI resources GeneCardsFreeHExpression GeneLynxFreeH,M,R Proteome BKLCommercialCuratedH,M,R,Y,N, Pathogenic Fungi Literature excerpts HPRDFreeCuratedHProtein interaction

38 Genome Browsers :

39 Nucleic Acids Research Database Issue http://nar.oupjournals.org/ Molecular Database Catalog

40 Growth of Molecular databases

41 Database Catalog http://www.infobiogen.fr/services/dbcat/

42 Hypothesis Generation Knowledge Mining Sequence Analysis Laboratory Bench Work Sequence Analysis Sequence Search Sequence Alignment MolBiol Tools: Restriction mapping, PCR primer design Sequence Manipulation

43 Nucleic Acids Research Database Issue http://nar.oupjournals.org/ Web Server Catalog

44 Sequence Analysis http://www.bioinformatics.vg/ http:// healthlinks.washington.edu/index.cfm?id=210BCCB7-511A-4C6B-8B40-DFC47AABEA7F http://www.hsls.pitt.edu/guides/genetics

45 Sequence Analysis http://www.bioinformatics.vg/

46 Sequence Analysis

47

48

49 DNAStar LaserGene PC/Mac

50 Sequence Analysis Vector NTI Database Software DNA/RNA Protein Oligo Enzyme Gel Marker Blast Result Analysis Result Vector NTI core AlignX ContigExpress GenomBench BioAnnotator

51 Sequence Analysis VectorNTI Advanced software suit consists of five independent yet interconnected components: Vector NTI core: the cornerstone application for Vector NTI suite, provides tools for sequence analysis and molecule manipulation. AlignX: a multiple sequence alignment tool ContigExpress: a DNA sequence assembly and sequencing project management tool GenomBench: a tool for genomic DNA sequence analysis and annotation BioAnnotator: a tool for functional annotation of DNAs and proteins

52 Sequence Analysis Using vector NTI molecular biologists can: Perform routine sequence analysis tasks such as restriction mapping, identifying protein coding regions or finding sequence motifs and carrying out sequence similarity searches Generate recombinant cloning strategies and protocols Design and analyze PCR primers Catalog a growing number of plasmids and PCR primers, in order to track the origin and lineage of recombinant molecules Run in silico gel electrophoresis Perform and edit multiple sequence alignments on proteins and nucleic acids Create publication quality graphics and more

53 Laboratory Resources Hypothesis generation Knowledge Mining Sequence Analysis Laboratory Bench work Protocols: Useful Laboratory Resources:

54 Laboratory Resources Basic Protocol Alternate Protocol Commentary Critical Parameters Troubleshooting Time Considerations Key References Internet Resources http://www.interscience.wiley.com/c_p/index.htm

55 Laboratory Resources http://researchlink.labvelocity.com/

56 HSLS Mol Biol Information Service

57 http://www.hsls.pitt.edu/guides/genetics

58 Website Usage Report http://www.hsls.pitt.edu/guides/genetics

59 Workshop 1: Information Hubs 2: Sequence Similarity Searching 3: DNA Protein Analysis Tools 4: CellSpace Knowledge Miner 5: VectorNTI May 2003-April 2004 Workshops

60 2003 2004 Total: 70 One-on-one Consultation

61 “…..only half of biomedical researchers using genome databases are familiar with the tools that can be used to actually access the data.” “….. all scientists on the planet must be empowered to use these powerful databases to unravel longstanding scientific mysteries.” atabases to unravel longstanding scientifi c… Andreas D. Baxevanis & Francis S. Collins Nature Genetics, September 2002, Vol 32


Download ppt "Selection of Resources for the Development of an Information Service Program in Molecular Biology and Genetics Ansuman Chattopadhyay, PhD Information Specialist."

Similar presentations


Ads by Google