Presentation is loading. Please wait.

Presentation is loading. Please wait.

Analysis Environments For Scientific Communities From Bases to Spaces Bruce R. Schatz Institute for Genomic Biology University of Illinois at Urbana-Champaign.

Similar presentations


Presentation on theme: "Analysis Environments For Scientific Communities From Bases to Spaces Bruce R. Schatz Institute for Genomic Biology University of Illinois at Urbana-Champaign."— Presentation transcript:

1 Analysis Environments For Scientific Communities From Bases to Spaces Bruce R. Schatz Institute for Genomic Biology University of Illinois at Urbana-Champaign schatz@uiuc.edu,www.beespace.uiuc.edu Baker Center for Bioinformatics Iowa State University October 6, 2006

2 What are Analysis Environments Functional Analysis Find the underlying Mechanisms Of Genes, Behaviors, Diseases Comparative Analysis Top-down data mining (vs Bottom-up) Multiple Sources especially literature

3 Building Analysis Environments Manual by Humans Interactionuser navigation Classificationcollection indexing Automatic by Computers Federationsearch bridges Integrationresults links

4 Trends in Analysis Environments Central versus Distributed Viewpoints The 90s Pre-Genome Entrez (NIH NCBI) versus WCS (NSF Arizona) The 00s Post-Genome GO (NIH curators) versus BeeSpace (NSF Illinois)

5 Pre-Genome Environments Focused on Syntax pre-Web WCS (Worm Community System) Search words across sources Follow links across sources Words automatic, Links manual Towards Integrated Searching

6 Post-Genome Environments Focused on Semantics post-Web BeeSpace (Honey Bee Inter Space) Navigate concepts across sources Integrate data across sources Concepts automatic, Links automatic Towards Conceptual Navigation

7 Worm Community System WCS Information: Literature BIOSIS, MEDLINE, newsletters, meetings Data Genes, Maps, Sequences, strains, cells WCS Functionality Browsingsearch, navigation Filteringselection, analysis Sharinglinking, publishing WCS: 250 users at 50 labs across Internet (1991)

8 WCS Molecular

9 WCS Cellular

10 WCS invokes gm

11 WCS vis-à-vis acedb

12 from Objects to Concepts from Syntax to Semantics Infrastructure is Interaction with Abstraction Internet is packet transmission across computers Interspace is concept navigation across repositories Towards the Interspace

13 THE THIRD WAVE OF NET EVOLUTION PACKETS OBJECTS CONCEPTS

14 Technology Engineering Electrical FORMAL INFORMAL (manual) (automatic) IEEE communities groups individuals LEVELS OF INDEXES

15 Post-Genome Informatics I Comparative Analysis within the Dry Lab of Biological Knowledge Classical Organisms have Genetic Descriptions. There will be NO more classical organisms beyond Mice and Men, Worms and Flies, Yeasts and Weeds. Must use comparative genomics on classical organisms Via sequence homologies and literature analysis.

16 Post-Genome Informatics II Functional Analysis within the Dry Lab of Biological Knowledge Automatic annotation of genes to standard classifications, e.g. Gene Ontology via homology on computed protein sequences. Automatic analysis of functions to scientific literature, e.g. concept spaces via text extractions. Thus must use functions in literature descriptions.

17 Informatics: From Bases to Spaces data Bases support genome data e.g. FlyBase has sequences and maps Genes annotated by GeneOntology and linked to biological literature information Spaces support biological literature e.g. BeeSpace uses automatically generated conceptual relationships to navigate functions

18 BeeSpace FIBR Project BeeSpace project is NSF FIBR flagship Frontiers Integrative Biological Research, $5M for 5 years at University of Illinois Analyzing Nature and Nurture in Societal Roles using honey bee as model (Functional Analysis of Social Behavior) Genomic technologies in wet lab and dry lab Bee Bee [Biology] gene expressions Space Space [Informatics] concept navigations

19

20 System Architecture

21 Concept Navigation in BeeSpace

22 V1 BeeSpace Community Collections Organism Honey Bee / Fruit Fly Song Bird / Soy Bean Behavior Social / Territorial Foraging / Nesting Development Behavioral Maturation Insect Development Insect Communication Structure Fly Genetics / Fly Biochemistry Fly Physiology / Insect Neurophysiology

23 CONCEPT SWITCHING “Concept” versus “Term” set of “semantically” equivalent terms Concept switching region to region (set to set) match term Semantic region Concept Space

24

25

26

27

28

29 BeeSpace Analysis Environment Build Concept Space of Biomedical Literature for Functional Analysis of Bee Genes -Partition Literature into Community Collections -Extract and Index Concepts within Collections -Navigate Concepts within Documents -Follow Links from Documents into Databases Locate Candidate Genes in Related Literatures then follow links into Genome Databases

30 Well Characterized Gene

31 Poorly Characterized Gene

32 Gene Summarization, BeeSpace V2

33

34 Collaboration across Users

35

36

37

38

39

40

41 Category Browse (Collection)

42

43

44 Category Browse (Search)

45

46

47 PlantSpace Examples

48

49

50

51

52

53

54

55

56 Interactive Functional Analysis BeeSpace will enable users to navigate a uniform space of diverse databases and literature sources for hypothesis development and testing, with a software system beyond a searchable database, using literature analyses to discover functional relationships between genes and behavior. Genes to Behaviors Behaviors to Genes Concepts to Concepts Clusters to Clusters Navigation across Sources

57 BeeSpace Information Sources General for All Spaces: Scientific Literature -Medline, Biosis, CAB Abstracts Genome Databases -GenBank, ProteinDataBank, ArrayExpress Special for BeeSpace : Model Organisms (heredity) -Gene Descriptions (FlyBase, WormBase) Natural Histories (environment) -BeeKeeping Books (Cornell, Harvard)

58 XSpace Information Sources Organize Genome Databases (XBase) Compute Gene Descriptions from Model Organisms Partition Scientific Literature for Organism X Compute XSpace using Semantic Indexing Boost the Functional Analysis from Special Sources Collecting Useful Data about Natural Histories e.g. CowSpace Leverage in AIPL Databases

59 Towards SoySpace Organize Genome Databases (SoyBase) Partition Scientific Literature for SoyBean Gene Descriptions from Models (TAIR) Natural Histories from Population Databases Key to Functional Analysis is Special Sources Collecting Appropriate Text about Genes Extracting Adequate Data about Histories Leverage is National Archives of germplasm and Historical Records for soybean crops

60 Towards the Interspace The Analysis Environment technology is GENERAL ! BirdSpace? BeeSpace? PigSpace? CowSpace? BehaviorSpace? BrainSpace? SoySpace? PlantSpace? BioSpace … Interspace


Download ppt "Analysis Environments For Scientific Communities From Bases to Spaces Bruce R. Schatz Institute for Genomic Biology University of Illinois at Urbana-Champaign."

Similar presentations


Ads by Google