Interrogation of cross talk between proteins and gene regulatory networks in breast cancer. Chambers, Teressa Lee; Hiren Karathia; Sridhar Hannenhalli.


Cross Talk of Protein and Gene Regulation? [Diagram labels: ligand-receptor interaction; protein-protein interactions (PPI) and PTMs, annotated in HPRD; pathway events collected from the literature, annotated in NetPath; proteins interact with other protein molecules; transport of protein (Pn) from the cytoplasm to the nucleus; activation of TF; TF regulation of genes.]

View of EGFR Protein Signaling Pathways

NetPath: NetPath is a curated resource of signal transduction pathways in humans. At present there are 20 pathways: 10 immune signaling pathways and 10 cancer signaling pathways.

TCGA data set: 100 Normal samples, 100 Benign Cancer samples.

Proteins (protein-protein interactions) → Interfacing TF (ITF)? → TF (protein-DNA interactions) → DNA (genes)
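The idea of an interfacing TF on this slide can be sketched as a set intersection: a TF counts as an ITF if it takes part in at least one protein-protein interaction and also regulates at least one target gene. This is a minimal sketch under that assumption; the function name `interfacing_tfs` and the toy edge and target lists are illustrative and are not taken from the slides.

```python
# Toy inputs (illustrative only, not data from the presentation).
ppi_edges = [("EGFR", "GRB2"), ("GRB2", "SOS1"), ("EGFR", "STAT3"), ("SOS1", "HRAS")]
tf_targets = {"STAT3": {"MYC", "CCND1"}, "JUN": {"CCND1"}}


def interfacing_tfs(ppi_edges, tf_targets):
    """Return TFs that regulate at least one gene and appear in the PPI network."""
    # Every protein that occurs on either side of a PPI edge.
    ppi_proteins = {protein for edge in ppi_edges for protein in edge}
    # Keep TFs with a non-empty target set that also occur in the PPI network.
    return {tf for tf, targets in tf_targets.items() if targets and tf in ppi_proteins}


itfs = interfacing_tfs(ppi_edges, tf_targets)
print(sorted(itfs))
```

In this toy example JUN is excluded because it has no PPI edge, so only STAT3 links the protein layer to the gene regulatory layer.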

Questions: How can we identify context-specific ITFs? What are the correlations between ITF-centric protein networks and gene regulatory networks? Which ITF-centric paths and target genes are cancer-specific?

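Pipeline [diagram labels]: protein paths (K = 2), collected from databases and the literature by parsing XML (SBML); target genes, collected from databases and the literature; RNA-Seq data for 200 samples; statistics (STAT) on proteins and on target genes; correlation (Corr) between proteins and target genes.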
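The slides do not define K. The sketch below assumes K is the number of interaction steps (edges) in a protein path that ends at an ITF, and enumerates all simple paths of exactly that length in an undirected PPI network. The helper names `build_adjacency` and `paths_to_itf` and the toy edges are hypothetical.

```python
from collections import defaultdict


def build_adjacency(edges):
    """Build an undirected adjacency map from a list of (protein, protein) pairs."""
    adjacency = defaultdict(set)
    for a, b in edges:
        adjacency[a].add(b)
        adjacency[b].add(a)
    return adjacency


def paths_to_itf(adjacency, itf, k):
    """Return all simple paths with exactly k edges that end at the given ITF."""
    results = []

    def extend(path):
        # path is built backwards, starting from the ITF.
        if len(path) == k + 1:
            results.append(tuple(reversed(path)))
            return
        for neighbor in sorted(adjacency[path[-1]]):
            if neighbor not in path:  # keep paths simple (no repeated protein)
                extend(path + [neighbor])

    extend([itf])
    return results


# Toy network (illustrative only).
edges = [("EGFR", "GRB2"), ("GRB2", "SOS1"), ("EGFR", "STAT3"), ("JAK1", "STAT3")]
adjacency = build_adjacency(edges)
for k in (1, 2, 3):
    print(k, paths_to_itf(adjacency, "STAT3", k))
```

Building the paths backwards from the ITF keeps the search local to the ITF's neighborhood, which matters when K is small and the network is large.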

Comparisons

Comparisons Protein paths (K = 3) STAT STAT Target Genes

All Protein paths (K <= 3) STAT ** Target Genes

** Protein paths (K = 1) STAT ** Target Genes

Protein paths (K = 2) STAT ** Target Genes

** Protein paths (K = 1) ** Target Genes

** Protein paths (K = 2) Target Genes **
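The comparison slides above do not say which statistic "STAT" or the "**" markers refer to. As one possible illustration only, the sketch below scores a protein path by the mean expression of its members in each sample and computes the Pearson correlation between that score and a target gene, separately for normal and cancer samples. The names `pearson` and `path_activity` and all expression values are assumptions, not the authors' method or data.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)


def path_activity(expression, path):
    """Per-sample mean expression over the proteins on a path."""
    columns = zip(*(expression[protein] for protein in path))
    return [sum(values) / len(path) for values in columns]


# Toy expression values, one list entry per sample (illustrative only).
normal = {"EGFR": [1, 2, 3, 4], "STAT3": [3, 4, 5, 6], "MYC": [2, 4, 6, 8]}
cancer = {"EGFR": [1, 2, 3, 4], "STAT3": [3, 4, 5, 6], "MYC": [8, 6, 4, 2]}
path = ("EGFR", "STAT3")

r_normal = pearson(path_activity(normal, path), normal["MYC"])
r_cancer = pearson(path_activity(cancer, path), cancer["MYC"])
print(round(r_normal, 3), round(r_cancer, 3))
```

A large difference between the two coefficients would flag the path and target pair as a candidate for closer inspection; a real analysis would also need a significance test and multiple-testing correction.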

Validation of the analyses

Validating a candidate gene

Flow of whole experiment

THANK YOU

Molecular Cross Talk

RELATIONSHIPS OF ALL THE RESOURCES [Diagram labels: PubMed (biological scientists read full-text papers from the PubMed database); HPRD (protein- or gene-centric functional information): interaction information, annotation; gene regulation database; gene locus; NetPath (information on signaling pathways): pathway details; GenProt (genomic information with SNPs): genomic details; SNPs; dbSNP; validation of pathway candidates.]

View of EGFR induced Signaling pathways

RELATIONSHIPS OF ALL THE RESOURCES [Diagram labels: PubMed (biological scientists read full-text research papers from the PubMed database); HPRD (protein-centric functional annotations): interaction information, manual annotation; gene regulation database; gene locus; NetPath (information on signaling pathways): pathway details; GenProt (genomic information with SNPs): genomic details; SNPs; dbSNP; validation of pathway candidates.]

Biological databases: HPRD (Human Protein Reference Database); PPD (Plasma Protein Database); GenProt; NetPath (Networks and PATHways); Proteinpedia.

HPRD (Human Protein Reference Database): The HPRD is a protein information resource that provides extensive information on human proteins, including: domain architecture; protein function; protein-protein interactions; post-translational modifications (PTMs); enzyme-substrate relationships; subcellular localization; tissue expression; disease association. All of this information is collected by molecular biologists from full-text papers available in the PubMed database.

ARCHITECTURE OF HPRD

FRONT PAGE OF HPRD @ http://hprd.org

Queries in HPRD

RESULT PAGE OF QUERY

PAGE OF INDIVIDUAL PROTEIN ENTRY

Protein Sequence Information

Protein Interaction Information

Protein Post-Translational Modification & Substrate Information

GENPROT: Genprot is a graphical viewer that depicts human genomic sequence alongside its transcripts and, where available, SNPs (single nucleotide polymorphisms) in protein-coding or non-coding DNA sequences.

Genprot main interface

Integration of HPRD in GENPROT

PATH FOR KNOWLEDGE DISCOVERY: Data (biological data) → Information → Knowledge. Raw data are obtained from a database (biological database); after data manipulation, information is stored in a database (biological database; OLTP); after data mining, knowledge is stored in a knowledge base (biological data warehouse; OLAP).

DATA MINING APPROACH ON HPRD: Biological data (interpreted by biologists) → biologically meaningful information → knowledge discovered (data mining process). Raw data are obtained from a database (PubMed); after data interpretation, information is stored in a database (HPRD; OLTP); after data mining, knowledge is stored in a knowledge base (biological data warehouse; OLAP).
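The per-dimension reports that follow (chromosome, disease, localization) are roll-ups of protein records along one attribute at a time. A minimal sketch of such a roll-up is shown here; the record layout, the field names and the function `roll_up` are assumptions for illustration and do not reflect the actual HPRD schema or export format.

```python
from collections import Counter

# Toy protein records (illustrative only, not an HPRD export).
records = [
    {"protein": "EGFR", "chromosome": "7", "localization": "Plasma membrane"},
    {"protein": "BRAF", "chromosome": "7", "localization": "Cytoplasm"},
    {"protein": "STAT3", "chromosome": "17", "localization": "Cytoplasm"},
    {"protein": "MYC", "chromosome": "8", "localization": "Nucleus"},
]


def roll_up(records, dimension):
    """Count protein records for each value of the chosen dimension."""
    return Counter(record[dimension] for record in records)


by_chromosome = roll_up(records, "chromosome")
by_localization = roll_up(records, "localization")
print(by_chromosome.most_common())
print(by_localization.most_common())
```

The same function serves every report because the dimension is just a key name; a data warehouse would precompute these aggregates rather than scan the records each time.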

HPRD report on Chromosome dimension

HPRD Report on Disease dimension

HPRD report based on Localization dimension

PTM event distribution

HPRD report on Protein and cDNA length

HPRD report based on Protein expression in Tissues

RELATIONSHIPS OF ALL THE RESOURCES [Diagram labels: PubMed (biological scientists read full-text papers from the PubMed database); HPRD or PPD: interaction information, annotation; gene regulation database; gene locus; NetPath (information on signaling pathways): pathway details; GenProt (genomic information with SNPs): genomic details; SNPs; dbSNP; validation of pathway candidates.]

POSSIBLE APPLICATIONS OF HPRD IN MOLECULAR BIOLOGY: 1. Comparative interactome analysis between human and other species, using data mining and wet-laboratory approaches. 2. Designing a strategy for validating protein-protein interactions.

Example of comparative interactome analysis (ortholog analysis). Interaction data sets: Human: 25,464 PPI (HPRD); C. elegans: 5,625 PPI (Li et al., 2004); S. cerevisiae: 15,675 PPI (MIPS & DIP); D. melanogaster: 20,439 PPI (Giot et al., 2003). [Venn diagram of human, worm and fly; values shown: 25,464; 24,587; 105; 42; 36; 63; 5,625.] 21 PPI out of 36 were found to be of high confidence; 9 of these 21 PPI were taken for validation by co-immunoprecipitation (Co-IP) experiment.
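An ortholog-based comparison of this kind can be sketched as follows: each human interaction is mapped onto the other species through an ortholog table, and it counts as conserved if the mapped pair is also reported there. The function name `conserved_interactions`, the identifiers and the ortholog table are hypothetical placeholders, and the slide does not state that this exact procedure was used.

```python
def conserved_interactions(human_ppi, other_ppi, orthologs):
    """Return human PPIs whose ortholog pair also interacts in the other species."""
    # Interactions are undirected, so store each pair as a frozenset.
    other_pairs = {frozenset(pair) for pair in other_ppi}
    conserved = []
    for a, b in human_ppi:
        # Skip interactions where either partner has no known ortholog.
        if a in orthologs and b in orthologs:
            mapped = frozenset((orthologs[a], orthologs[b]))
            if mapped in other_pairs:
                conserved.append((a, b))
    return conserved


# Toy identifiers (illustrative only).
human_ppi = [("H1", "H2"), ("H2", "H3"), ("H1", "H4")]
worm_ppi = [("W2", "W1"), ("W3", "W5")]
orthologs = {"H1": "W1", "H2": "W2", "H3": "W3"}

print(conserved_interactions(human_ppi, worm_ppi, orthologs))
```

Interactions recovered in more than one species are natural candidates for experimental follow-up such as Co-IP, since independent data sets support them.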

Objectives for validating biomolecules. Specific objective 1: Analysis of protein complexes of adapter proteins that are not well characterized in the EGFR1 pathway. Experimental techniques: protein digestion and mass spectrometry. Specific objective 2: Validating the interactions/interactors of EGFR1 pathway molecules identified through large-scale yeast two-hybrid (Y2H) screens, using biochemical and functional assays. Experimental techniques: Co-IP and Western blotting.

Human Proteinpedia: Human Proteinpedia is a community portal for sharing and integration of human protein data. It is an information source for: protein-protein interactions; tissue expression; expression in cell lines; subcellular localization; enzyme-substrate relationships. These data are obtained from several kinds of experimental evidence, for example: 1. Co-immunoprecipitation. 2. Mass spectrometry. 3. Fluorescence-based experiments. 4. Immunohistochemistry. 5. Protein/peptide microarray. 6. Western blotting. 7. Yeast two-hybrid experiments.

Human Proteinpedia

QUERYING IN HUMAN PROTEINPEDIA

Results of Query

Results

Mass spectrometry results

Mass spectrometry peaks

CONCLUSION: HPRD is a knowledgebase of human proteins that can assist the biomedical community in discovering genomic, transcriptomic and proteomic information in an integrated view of sequence, function and protein interactions. This helps scientists differentiate between ordered and disordered states of human cells.

CAREER PATH. EDUCATION: B.Sc. (Biochemistry); M.Sc. (Biomedical Technology); M.Sc. (Bioinformatics). Joined the Institute of Bioinformatics as a Research Scientist; worked on designing and developing HPRD, NetPath and Genprot.

Worked under the supervision of the HPRD development team. Joined as a Ph.D. student, funded by AGAUR, in the Dept. of Biomathematics and Bioinformatics at UDL, under the supervision of Dr. Rui Alves.

“THANK YOU VERY MUCH”