Presentation is loading. Please wait.

Presentation is loading. Please wait.

Metabolic Pathway Databases and Tools Speaker and Schedule Update PMN (Peifen Zhang) KEGG (auto-slide show) MetaCrop (cancelled)

Similar presentations


Presentation on theme: "Metabolic Pathway Databases and Tools Speaker and Schedule Update PMN (Peifen Zhang) KEGG (auto-slide show) MetaCrop (cancelled)"— Presentation transcript:

1 Metabolic Pathway Databases and Tools Speaker and Schedule Update PMN (Peifen Zhang) KEGG (auto-slide show) MetaCrop (cancelled)

2 and Its Databases Peifen Zhang Carnegie Institution For Science Department of Plant Biology

3 Where We Are

4 PMN: - Sue Rhee (PI) - Kate Dreher (curator) - A. S. Karthikeyan (curator, alumni) - Anjo Chi (tech team) - Cynthia Lee (tech team) - Larry Ploetz (tech team) - Shanker Singh (tech team) - Bob Muller (tech team) - Vanessa Kirkup (tech team, alumni) - Tom Meyer (tech team, alumni) Key Collaborators: - Peter Karp (MetaCyc, SRI) - Ron Caspi (MetaCYc, SRI) - Lukas Mueller (SGN) - Anuradha Pujar (SGN) Who We Are

5 Outline General introduction Browse/Search/Analyze Pathway database creation and curation

6 Introducing the PMN Scope –PMN is a collection of plant metabolic pathway databases –PMN is a community for data curation Curators, editorial board, ally databases, researchers Major goals –Create metabolic pathway databases for plants For individual plant species –e.g. AraCyc (Arabidopsis thaliana) –e.g. PoplarCyc (Populus trichocarpa) Combining data for all plant species - PlantCyc –Create a computational prediction “pipeline”: Start with protein sequences for a specific plant species End with a comprehensive set of predicted enzyme functions and associated metabolic pathways

7 PMN Databases AraCyc, PoplarCyc, and more to come –Single-species –Comprehensive collection of pathways in a particular species –Complete collection of enzymes, known or predicted, in that species PlantCyc –Multiple-species –Comprehensive collection of pathways for all plants –Representative collection of known enzymes in plants

8 PMN Database Content Statistics PlantCyc 4.0AraCyc 7.0PoplarCyc 2.0 Pathways685369288 Enzymes1105855063420 Reactions292924181707 Compounds296627191397 Organisms34311* Valuable plant natural products, many are specialized metabolites that are limited to a few species or genus. medicinal: e.g. artemisinin and quinine (treatment of malaria), codeine and morphine (pain-killer), ginsenosides (cardio-protectant), lupenol (antiinflammatory), taxol and vinblastine (anti-cancer) industrial materials: e.g. resin and rubber food flavor and scents: e.g. capsaicin and piperine (chili and pepper flavor), geranyl acetate (aroma of rose) and menthol (mint).

9 Other Plant Databases Accessible From PMN DatabaseSpeciesSourceCuration status RiceCyc ***RiceGramenesome curation SorghumCycSorghumGrameneno curation MedicCyc ***MedicagoNoble Foundationsome curation LycoCyc ***TomatoSol Genomics Networksome curation PotatoCycPotatoSol Genomics Networkno curation CapCycPepperSol Genomics Networkno curation NicotianaCycTobaccoSol Genomics Networkno curation PetuniaCycPetuniaSol Genomics Networkno curation CoffeaCycCoffeeSol Genomics Networkno curation *** Significant numbers of genes from these databases have been integrated into PlantCyc

10 Browse/Search/Analyze

11 Browsing the PMN Data

12 Browsing Pathways

13 Quick Search Quick search bar choline

14 Searching in PMN databases

15 Specific Data Type Search, Pathway Search For example, find pathways that includes a specific intermediate ornithine

16 A Typical Pathway Detail Page

17 Pathway Enzyme Gene Reaction Compound Evidence Codes A Typical Pathway Detail Page Upstream pathway

18 Conventions Used in Curation and Data Presentation A pathway, as drawn in the text books, is a functional unit, regulated as a unit Pathway displayed is expected to operate as such in the individual species listed

19

20 Conventions Used in Curation and Data Presentation Pathway, as drawn in the text books, is a functional unit, regulated as a unit Pathway displayed is expected to operate as such in the individual species shown Alternative routes that have been observed in different organisms are curated separately as pathway variants

21

22 Pathway, as drawn in the text books, is a functional unit, regulated as a unit Pathway displayed is expected to operate as such in the individual species shown Alternative routes that have been observed in different organisms are curated separately as pathway variants Mosaics combined of alternative routes from several different species are curated as Superpathways Connected pathways, extended networks, are curated as Superpathways Conventions Used in Curation and Data Presentation

23

24 Linking to Other Data Detail Pages Compound

25 Synonyms Appears as Product Appears as Reactant Compound Detail Pages Molecular Weight / Formula Smiles / InChI

26 Pathway(s) Arabidopsis Enzyme: phosphatidyltransferase Enzyme detail pages Evidence Inhibitors, Kinetic Parameters, etc. Summary Reaction

27 The Global Overview Map

28

29 Visualizing Omics Data Visualize and interpret large scale omics data in a metabolism context: Gene expression data Proteomic data Metabolic profiling data Reaction flux data

30 Input File Format For The Omics Viewer Tab-delimited text file At1g777601.152.33.22.151.531.75 At2g133600.7-0.530-0.730.03-0.72 At3g10230-1.1-0.051.051.151.250.05 At3g10230-0.65-0.581.131.230.67-0.12 At3g01120-1.08-0.15-1.2-1.15-1.15-0.58 At3g015000.07-0.72-0.68-1.4-1.93-9.23 At3g024700.03-0.530.581.280.551.4 At3g024700.55-0.120.620.65-0.051.22 At3g025800.6-0.550.080.55-2.2-1.65 At3g025801.150.70.03-0.6-2.4-1.65 At3g02780-1.150.050.1-0.08-0.57-0.28 At3g04120-0.15-1.550.12-0.30.231.77 At3g04120-0.15-1.50.05-0.320.251.7 At3g04120-0.07-0.850.1-0.750.21.55 At3g048701.05-1.08-0.05-1.10.05-1.33 At3g04940-0.85-0.1-0.85-1.3-1.83-1.3 At3g07420-0.68-0.12-0.7-0.10-0.1 At3g10850-0.6-0.78-0.65-0.72-2.08-1.67 At3g13790-0.21.81.651.751.772.05 At3g137901.11.751.951.951.852.32 At3g144150.25-0.53-0.55-1.1-1.2-1.22

31 Omics Viewer

32 Red: Enhanced expression over my threshold (i.e. 2-fold change) Yellow: Repressed expression over my threshold Blue: Not significantly changed over my threshold Omics Viewer: Color Coding Gene Expression Levels

33 Omics Viewer: generating a table of individual pathways exceeding certain threshold

34 Comparing Across Species –Use Metabolic Map

35 Comparing Across Species

36

37

38 PMN BLAST DataSets experimental all kingdoms experimental or computational plants only

39 Online Tutorials

40 Data Downloads Complete databases Custom flat files Custom BLAST dataset

41 Download and Install a Local Copy of the PMN Databases Run robust live database query by scripts, via Perl, Java, LISP interfaces Edit with private data Access to additional features not available on web mode Free, open database license Pathway Tools Software (SRI)

42 Developing The PMN

43 Creating Single-Species Databases –New sets of DNA sequences become available Genomes are sequenced Large EST data sets are created –Unigene builds are generated –PMN pipeline predicts enzyme functions Based on sequence similarity to known enzymes, enzymes with experimental or literature support –Set of predicted enzymes is used to predict metabolic pathways The pathway prediction software (Pathway Tools) uses: –Enzyme functional annotations –A reference set of pathways (e.g. PlantCyc) –Curators validate predicted pathways in the new database Remove incorrect information and add additional data

44 Pathway Tools Annotated Sequences AT1G69370 chorismate mutase prephenate aminotransferase arogenate dehydratase chorismateprephenateL-arogenateL-phenylalanine 5.4.99.5 2.6.1.79 4.2.1.91 Enzyme functions Protein sequences chorismate mutase PlantCyc AT1G69370 BLAST RESD chorismateprephenateL-arogenateL-phenylalanine 5.4.99.5 2.6.1.79 4.2.1.91 AT1G69370 chorismate mutase

45 Identifiers Used in Automated Enzyme Annotation and Enzyme to Pathway Mapping Complete EC number –e.g. 2.1.1.128 Unique PlantCyc reaction id, when complete EC is not assigned –e.g. RXN-0981 GO term id –e.g. GO:560010 2.1.1.128 RXN-0981

46 Manual Curation –Who Curators identify, read and enter information from published journal articles –What Add missing pathways Update existing pathways Create new reactions Add compound structures Add missing enzymes Curate enzyme properties, kinetic data Remove false-positive pathway predictions Remove false-positive enzyme annotations

47 Submitting Data To Us

48 Community Gratitude

49 curator@plantcyc.org meet Kate, Booth# 219 www.plantcyc.org


Download ppt "Metabolic Pathway Databases and Tools Speaker and Schedule Update PMN (Peifen Zhang) KEGG (auto-slide show) MetaCrop (cancelled)"

Similar presentations


Ads by Google