Protein Ontology: Addressing the need for precision in representing protein networks Darren A. Natale, Ph.D. Protein Science Team Lead, PIR Research Assistant.

Slides:



Advertisements
Similar presentations
Annotation of Gene Function …and how thats useful to you.
Advertisements

Chapter 11 Cell Communication
Ch 17 Gene Expression I: Transcription
Molecular Basis for Relationship between Genotype and Phenotype DNA RNA protein genotype function organism phenotype DNA sequence amino acid sequence transcription.
Lecture 4: DNA transcription
SIGNALING FROM THE CELL SURFACE TO THE NUCLEUS
Endo. 4 Detecting and signalling Cell surface receptors: G protein linked and tyrosine kinase receptors: second messengers, phosphorylating kinases, activation.
Chapter 5 Ligand gated ion channels, intracellular receptors and phosphorylation cascades.
Relationship between Genotype and Phenotype
A tumor suppressor gene
Developing a protein-interactions ontology Esther Ratsch European Media Laboratory.
Transforming Growth Factor-Beta Receptor 2 TGF-β receptor 2
EBI is an Outstation of the European Molecular Biology Laboratory. UniProt Jennifer McDowall, Ph.D. Senior InterPro Curator Protein Sequence Database:
Step 1 of Protein Synthesis
Colinearity of Gene and Protein DNA RNA protein genotype function organism phenotype DNA sequence amino acid sequence transcription translation.
DNA and Chromosome Structure. Chromosomal Structure of the Genetic Material.
Express yourself That darn ribosome Mighty Mighty Proteins Mutants RNA to the Rescue
Cell Signaling A __________________________is a series of steps by which a signal on a cell’s surface is converted into a ________________________________________________.
Cell signaling Lecture 8. Transforming growth factor (TGF β) Receptors/Smad pathway BMP7 TGF β1, TGF β2, TGF β3 Dpp Inhibins Activins TGF β receptors.
Control of Gene Expression Eukaryotes. Eukaryotic Gene Expression Some genes are expressed in all cells all the time. These so-called housekeeping genes.
Integration of PRO and UniProtKB Amherst, NY May 16, 2013 Cathy H. Wu, Ph.D. PRO-PO-GO Meeting.
B. Signal Transduction Pathway (cell signaling)
1 Biomedical Ontologies Bio-Trac 40 (Protein Bioinformatics) October 8, 2009 Zhang-Zhi Hu, M.D. Associate Professor Department of Oncology Department of.
Anastasia Nikolskaya PIR (Protein Information Resource) Georgetown University Medical Center
Signaling Pathways That Control Gene Activity TGFβ Receptors and the Direct Activation of Smads Presented By: Todd Lindsey.
1 Bio-Trac 40 (Protein Bioinformatics) October 9, 2008 Zhang-Zhi Hu, M.D. Research Associate Professor Protein Information Resource, Department of Biochemistry.
Control of Gene Expression
1 Bio-Trac 40 (Protein Bioinformatics) October 8, 2009 Zhang-Zhi Hu, M.D. Associate Professor Department of Oncology Department of Biochemistry and Molecular.
More regulating gene expression. Combinations of 3 nucleotides code for each 1 amino acid in a protein. We looked at the mechanisms of gene expression,
Response: Cell signaling leads to regulation of transcription or cytoplasmic activities Chapter 11.4.
Protein Ontology (PRO) Amherst, NY May 15, 2013 Cathy H. Wu, Ph.D. Director, Protein Information Resource (PIR) Edward G. Jefferson Chair and Director.
You have worked for 2 years to isolate a gene involved in axon guidance. You sequence the cDNA clone that contains axon guidance activity. What do you.
Molecular Basis for Relationship between Genotype and Phenotype DNA RNA protein genotype function organism phenotype DNA sequence amino acid sequence transcription.
Regulating Eukaryotic Gene Expression. Why change gene expression? Different cells need different components Responding to the environment Replacement.
The ‘regulates’ relationships Chris, David, Tanya.
Michael Cummings David Reisman University of South Carolina Gene Regulation Part 2 Chapter 9.
CHMI 4237 E Special topics in Biochemistry Eric R. Gauthier, Ph.D. Dept. Chemistry-Biochemistry Laurentian University Cell proliferation 4- Signaling to.
Gene Expression. Cell Differentiation Cell types are different because genes are expressed differently in them. Causes:  Changes in chromatin structure.
Functional Annotation and Functional Enrichment. Annotation Structural Annotation – defining the boundaries of features of interest (coding regions, regulatory.
1 Gene function annotation. 2 Outline  Functional annotation  Controlled vocabularies  Functional annotation at TAIR  Resources and tools at TAIR.
Transcription Packet #10 Chapter #8.
You can request PRO terms by using the SourceForge PRO tracker (Fig 3A) or by directly contributing to PRO by providing the information in the RACE-PRO.
Scope of the Gene Ontology Vocabularies. Compile structured vocabularies describing aspects of molecular biology Describe gene products using vocabulary.
Transcription. Recall: What is the Central Dogma of molecular genetics?
Cell Communication (Chpt. 11) Chapter 11. Overview of Cell Signaling Signaling evolved early in history of life Communicating cells may be close together.
What is so memorable about CREBBP? (RSTS; RTS, CBP CBP/MOZ FUSION GENE, CREB-binding protein)
Chapter 15- Cell Communication Part I- General signaling strategies
Crash Course!  Introduction to Molecular Biology.
AP Biology Eukaryotic Genome Control Mechanisms for Gene expression.
Homework #2 is due 10/17 Bonus #1 is due 10/24 Exam key is online Office hours: M 10/ :30am 2-5pm in Bio 6.
Chapter 14: Signaling Pathways That Control Gene Activity
Tools in Bioinformatics Ontologies and pathways. Why are ontologies needed? A free text is the best way to describe what a protein does to a human reader.
CFE Higher Biology DNA and the Genome Transcription.
Genes in development Signal transduction pathways and
TRANSCRIPTION (DNA → mRNA). Fig. 17-7a-2 Promoter Transcription unit DNA Start point RNA polymerase Initiation RNA transcript 5 5 Unwound.
BIO409/509 Cell and Molecular Biology.
User Community Interactions - Impact on PRO - Darren A. Natale, Ph.D. Protein Science Associate Team Lead, PIR Research Assistant Professor, GUMC Protein.
High-throughput data used in bioinformatics
Overview of the TGF-β signaling pathway
Developing a protein-interactions ontology
Transcription and Gene Regulation
Expression of Human Genes
“Proteomics is a science that focuses on the study of proteins: their roles, their structures, their localization, their interactions, and other factors.”
more regulating gene expression
Robert H. Oakley, PhD, John A. Cidlowski, PhD 
TGFb –Superfamily Proteins
Concept 18.2: Eukaryotic gene expression can be regulated at any stage
Overview Gene Ontology Introduction Biological network data
Smads “Freeze” When They Ski
TGFβ Signaling in Growth Control, Cancer, and Heritable Disorders
Presentation transcript:

Protein Ontology: Addressing the need for precision in representing protein networks Darren A. Natale, Ph.D. Protein Science Team Lead, PIR Research Assistant Professor, GUMC Workshop on Ontologies of Cellular Networks March 2008

IEV_ part_of IEV_ IEV_ IEV_ IEV_ IEV_ Binding of R-smad:smad4 complex and responsive element 2 Complex formation of R-smad and Smad4 5 Transcription by R-smad:smad4 1 Phosphorylation of R-smad by TGF beta receptor I 3 Nuclear import of R-smad:smad4 TGF-  signaling pathway Example from: INOH Event Ontology R-smad R-smad: Smad4 smad4 TGF beta receptor I responsive element ActionsLocations Nucleus Cytoplasm nuc lear me m b rane Roles

IMR_ Txn regulator IMR_ SMAD IMR_ Co-Smad IMR_ R-Smad IMR_ I-Smad IMR_ Smad3 IMR_ Smad2 IMR_ Smad5 IMR_ Smad1 IMR_ Smad8 is_a Example from: INOH Molecule Role Ontology IMR_ Smad4 The Roles Played IMR_ SMAD2_HUMAN sequence_of

Cellular Component: - nucleus Molecular Function: - protein binding Biological Process: - signal transduction - regulation of transcription, DNA-dependent Mothers against decapentaplegic homolog 2 Smad 2 GO annotation of SMAD2_HUMAN:

II I TGF-  TGF-beta receptor PP Smad 4 4 DNA binding 1 phosphorylation 2 complex formation Nucleus Cytoplasm Smad 2 PP Smad 4 5 Transcription Regulation PP Smad 2 Smad 4 3 nuclear translocation PP Smad 2 P P P ++ ERK1 CAMK2 P P

“normal”Cytoplasmic PRO: REACT_ TGF-  receptor phosphorylated Forms complex Nuclear Txn upregulation PRO: REACT_ ERK1 phosphorylatedForms complex Nuclear Txn upregulation++ PRO: CAMK2 phosphorylated Forms complex Cytoplasmic No Txn upregulation PRO: alternatively spliced short form Cytoplasmic PRO: REACT_ phosphorylated short form Nuclear Txn upregulationPRO: REACT_ point mutation (causative agent: large intestine carcinoma) Doesn’t form complex Cytoplasmic No Txn upregulation PRO: Smad 2 PP PP P PP P x PP SMAD2_HUMAN

%PRO: Smad2 %PRO: Smad2 isoform 1 (long form) %PRO: Smad2 isoform 1 phosphorylated form %PRO: Smad2 isoform 1, TGF-  receptor I-phosphorylated %PRO: Smad2 isoform 1, TGF-  receptor I and ERK1-phosphorylated arises_from SO: amino_acid_substitution NOT has_modification MOD: phosphorylated residue NOT has_function GO: transcription coactivator activity gives_rise_to DO: carcinoma of the large intestine %PRO: Smad2 sequence 1, TGF-  receptor I and CAMK2-phosphorylated %PRO: Smad2 sequence 2 (short form) - splice variant %PRO: Smad2 sequence 2 phosphorylated form %PRO: Smad2 sequence 2, TGF-  receptor I-phosphorylated %PRO: Smad2 sequence 3 - genetic variant related to colorectal carcinoma %PRO: Smad2 isoform 1, TGF-  receptor I and CAMK2-phosphorylated %PRO: Smad2 isoform 2 (short form) - splice variant %PRO: Smad2 isoform 2 phosphorylated form %PRO: Smad2 isoform 2, TGF-  receptor I-phosphorylated %PRO: Smad2 isoform 3 - genetic variant related to colorectal carcinoma has_modification MOD:O-phosphorylated L-serine has_modification MOD:O-phosphorylated L-threonine has_function GO: TGF-  receptor, pathway-specific cytoplasmic mediator activity has_function GO:SMAD binding has_function GO:transcription coactivator activity participates_in GO:signal transduction participates_in GO:SMAD protein heteromerization participates_in GO:regulation of transcription, DNA-dependent located_in GO:nucleus part_of GO:transcription factor complex

ProEvo ProForm GO Gene Ontology molecular function cellular component biological process participates_in part_of (for complexes) located _in (for compartments) has_function PRO protein Root Level is_a translation product of an evolutionarily-related gene translation product of a specific mRNA Family-Level Distinction In common: specific ancestor Source: PIRSF family Modification-Level Distinction In common: specific translation product Source: UniProtKB Sequence-Level Distinction In common: specific allele or splice variant Source: UniProtKB cleaved/modified translation product disease DO/UMLS Disease agent_of is_a protein modification has_modification PSI-MOD Modification SO Sequence Ontology sequence change arises_from (sequence change) gives_rise_to (effect on function) is_a protein domain has_part Pfam Domain Example: TGF-  receptor phosphorylated smad2 isoform1 is a phosphorylated smad2 isoform1 is a smad2 isoform 1 is a smad2 is a TGF-  receptor-regulated smad is a smad is a protein Modification Level Sequence Level Family Level Root Level translation product of a specific gene Gene-Level Distinction In common: specific gene Sources: PIRSF subfamily, Panther subfamily is_a Gene Level

IEV_ part_of IEV_ IEV_ IEV_ IEV_ IEV_ Binding of R-smad:smad4 complex and responsive element 2 Complex formation of R-smad and Smad4 5 Transcription by R-smad:smad4 1 Phosphorylation of R-smad by TGF beta receptor I 3 Nuclear import of R-smad:smad4 TGF-  signaling pathway Example from: INOH Event Ontology R-smad R-smad: Smad4 smad4 TGF beta receptor I responsive element ActionsLocations Nucleus nuc lear me m b rane Roles Actors smad2 smad2: PP PP PP PP PP has_participant PRO:smad4 has_participant PRO:TGF-  receptor-phosphorylated smad2 P P P P P Transcription has_participant PRO:smad4 has_participant PRO:TGF-  receptor & ERK1-phosphorylated smad2 Cytoplasm

PRO Team (so far…) Principle Investigators Cathy Wu (PIR at GUMC) Judith Blake (The Jackson Laboratory) Barry Smith (SUNY Buffalo) Curators & Developers Cecilia Arighi (PIR at GUMC) Winona Barker (PIR at GUMC) Harold Drabkin (The Jackson Laboratory) Zhang-zhi Hu (PIR at GUMC) Hongfang Liu (GUMC) Darren Natale (PIR at GUMC) Official Launch: March 31,

IMR_ Txn regulator IMR_ SMAD IMR_ R-Smad IMR_ Smad2 Smad2 vs c-Myc in INOH and PRO IMR_ Txn factor IMR_ protein IMR_ Myc IMR_ c-Myc PRO: smad PRO: R-Smad PRO: smad2 is_a PRO: protein PRO: myc PRO: c-myc

cytoskeleton component EPB42 Hypothetical example: TGM3 vs EPB42 structural protein IMR_ protein TGM3 is_a PRO: protein transglutaminaseEPB42protease TGM3 protein modifier enzyme