By: Amira Djebbari and John Quackenbush BMC Systems Biology 2008, 2: 57 Presented by: Garron Wright April 20, 2009 CSCE 582.

Slides:



Advertisements
Similar presentations
Genomes and Proteomes genome: complete set of genetic information in organism gene sequence contains recipe for making proteins (genotype) proteome: complete.
Advertisements

A Systematic approach to the Large-Scale Analysis of Genotype- Phenotype correlations Paul Fisher Dr. Robert Stevens Prof. Andrew Brass.
Bioinformatics “Other techniques raise more questions than they answer. Bioinformatics is what answers the questions those techniques generate.” SheAvery
Bioinformatics at IU - Ketan Mane. Bioinformatics at IU What is Bioinformatics? Bioinformatics is the study of the inherent structure of biological information.
Bioinformatics Needs for the post-genomic era Dr. Erik Bongcam-Rudloff The Linnaeus Centre for Bioinformatics.
Cluster analysis of networks generated through homology: automatic identification of important protein communities involved in cancer metastasis Jonsson.
KEGG: Kyoto Encyclopedia of Genes and Genomes Susan Seo Intro to Bioinformatics Fall 2004.
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
T Monday, June 15, 2015Monday, June 15, 2015Monday, June 15, 2015Monday, June 15, 2015.
Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break 14:45 – 15:15Regulatory pathways lecture 15:15 – 15:45Exercise.
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
CISC667, F05, Lec24, Liao1 CISC 667 Intro to Bioinformatics (Fall 2005) DNA Microarray, 2d gel, MSMS, yeast 2-hybrid.
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Recap Don’t forget to – pick a paper and – me See the schedule to see what’s taken –
Proteomics Understanding Proteins in the Postgenomic Era.
DEMO CSE fall. What is GeneMANIA GeneMANIA finds other genes that are related to a set of input genes, using a very large set of functional.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
341: Introduction to Bioinformatics Dr. Natasa Przulj Deaprtment of Computing Imperial College London
Bioinformatics Jan Taylor. A bit about me Biochemistry and Molecular Biology Computer Science, Computational Biology Multivariate statistics Machine learning.
Bayesian integration of biological prior knowledge into the reconstruction of gene regulatory networks Dirk Husmeier Adriano V. Werhli.
Knowledge Integration for Gene Target Selection Graciela Gonzalez, PhD Juan C. Uribe Contact:
Knowledgebase Creation & Systems Biology: A new prospect in discovery informatics S.Shriram, Siri Technologies (Cytogenomics), Bangalore S.Shriram, Siri.
Genome-scale Metabolic Reconstruction and Modeling of Microbial Life Aaron Best, Biology Matthew DeJongh, Computer Science Nathan Tintle, Mathematics Hope.
Whole Genome Expression Analysis
Cis-regulation Trans-regulation 5 Objective: pathway reconstruction.
Using Bayesian Networks to Analyze Expression Data N. Friedman, M. Linial, I. Nachman, D. Hebrew University.
Computational protein design. Reasons to pursue the goal of protein design In medicine and industry, the ability to precisely engineer protein hormones.
Introduction to Bioinformatics Prologue. Bioinformatics Living things have the ability to store, utilize, and pass on information Bioinformatics strives.
Gene Regulatory Network Inference. Progress in Disease Treatment  Personalized medicine is becoming more prevalent for several kinds of cancer treatment.
Introduction to Bioinformatics Spring 2002 Adapted from Irit Orr Course at WIS.
Function first: a powerful approach to post-genomic drug discovery Stephen F. Betz, Susan M. Baxter and Jacquelyn S. Fetrow GeneFormatics Presented by.
Abstract Background: In this work, a candidate gene prioritization method is described, and based on protein-protein interaction network (PPIN) analysis.
CS5263 Bioinformatics Lecture 20 Practical issues in motif finding Final project.
Biological Signal Detection for Protein Function Prediction Investigators: Yang Dai Prime Grant Support: NSF Problem Statement and Motivation Technical.
Evolutionary Algorithms for Finding Optimal Gene Sets in Micro array Prediction. J. M. Deutsch Presented by: Shruti Sharma.
Genomics.
Decoding the Network Footprint of Diseases With increasing availability of data, there is significant activity directed towards correlating genomic, proteomic,
Learning Bayesian networks from postgenomic data with an improved structure MCMC sampling scheme Dirk Husmeier Marco Grzegorczyk 1) Biomathematics & Statistics.
Nuria Lopez-Bigas Methods and tools in functional genomics (microarrays) BCO17.
An overview of Bioinformatics. Cell and Central Dogma.
Class 23, 2001 CBCl/AI MIT Bioinformatics Applications and Feature Selection for SVMs S. Mukherjee.
Bioinformatics and Computational Biology
Data Mining: Knowledge Discovery in Databases Peter van der Putten ALP Group, LIACS Pre-University College Bio Informatics January
Introduction to biological molecular networks
341- INTRODUCTION TO BIOINFORMATICS Overview of the Course Material 1.
Applications of Supervised Learning in Bioinformatics Yen-Jen Oyang Dept. of Computer Science and Information Engineering.
Discovering functional interaction patterns in Protein-Protein Interactions Networks   Authors: Mehmet E Turnalp Tolga Can Presented By: Sandeep Kumar.
Nonlinear differential equation model for quantification of transcriptional regulation applied to microarray data of Saccharomyces cerevisiae Vu, T. T.,
Case Study: Characterizing Diseased States from Expression/Regulation Data Tuck et al., BMC Bioinformatics, 2006.
PLANT BIOTECHNOLOGY & GENETIC ENGINEERING (3 CREDIT HOURS) LECTURE 13 ANALYSIS OF THE TRANSCRIPTOME.
Effect of Alcohol on Brain Development NormalFetal Alcohol Syndrome.
1 Survey of Biodata Analysis from a Data Mining Perspective Peter Bajcsy Jiawei Han Lei Liu Jiong Yang.
Graduate Research with Bioinformatics Research Mentors Nancy Warter-Perez, ECE Robert Vellanoweth Chem and Biochem Fellow Sean Caonguyen 8/20/08.
Inferring Regulatory Networks from Gene Expression Data BMI/CS 776 Mark Craven April 2002.
Yiming Kang, Hien-haw Liow, Ezekiel Maier, & Michael Brent
Journal club Jun , Zhen.
APPLICATIONS OF BIOINFORMATICS IN DRUG DISCOVERY
1 Department of Engineering, 2 Department of Mathematics,
1 Department of Engineering, 2 Department of Mathematics,
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
1 Department of Engineering, 2 Department of Mathematics,
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
The Most Informative Spacing Statistic Identifies Biologically Relevant Patterns in Transcript Level Distributions Stan Pounds Department of Biostatistics.
Schedule for the Afternoon
The Future of Genetic Research
Bioinformatics Vicki & Joe.
Bioinformatics, Vol.17 Suppl.1 (ISMB 2001) Weekly Lab. Seminar
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Presentation transcript:

By: Amira Djebbari and John Quackenbush BMC Systems Biology 2008, 2: 57 Presented by: Garron Wright April 20, 2009 CSCE 582

Bioinformatics Gene Network Modeling Techniques Microarrays Seeded Bayesian Networks/a priori knowledge biasing

Confluence of biology, computer science, and information technology Ultimate goal: enable the discovery of new biological insights as well as creating global perspective that elucidates unifying principles of biology Genetic information is then used to create a comprehensive picture of cellular function in normal state to then compare to diseased and other altered states Interpretation of nucleotide sequence data, amino acid sequence data, protein domains, and protein structures Source:

Source:

Weighted matrices Boolean Networks Differential Equations Bayesian Networks

All potential network topologies must be assessed Possible Solution: Heuristic Search Algorithms Example: Greedy Hill Climbing Problem: Often find local maxima but not global

Leads to near optimization in exploring the state space (relative gene expression state in this instance) Network seed biases the search for the best topology, however it does not limit novel gene interactions from being identified How do we seed/set a priori knowledge? Pathway/interaction databases Networks deduced from published literature in PubMed High-throughput interaction screens (PPI)

Deducing Prior Knowledge From Published Literature Co-occurrence method Limited papers/literature to just 2 genes thus exhibits scale-free behavior Deducing Prior Network Structure From High-Throughput Screens Interactome data PPI Represent unbiased screens for interactions and have shown new unreported interactions Modified Depth-First Search to imply directionality to the network structure and then bootstrapping 100 times for the following cases: no priors, literature derived priors, PPI derived priors, and a combination of the two Select features with bootstrap confidence of 0.7 or greater

Bayesian Network analysis of gene expression on a leukemia study Comparison of Acute Lymphoblastic Lymphoma (ALL) to Acute Myeloid Leukemia (AML) Top 40 genes that distinguish between the two cancers Microarray does not probe entire genome, only a subset Bayesian analysis of gene expression on a second leukemia study Nearly the entire genome probed Evaluated network reconstruction on the KEGG cell cycle pathway Created a Receiver-Operator (ROC) Curve (True positive rate vs. False Positive rate)

Advent of microarray and other high throughput technology led to the expectation that the pathways/networks that would link genotype to phenotype would be discovered Using BN analysis as described here can recover some of that promise The two gene expression data sets are typical of microarray studies Domain specific knowledge of prior network seeds improves the ability of BN learning the interactions between genes These interactions can be used to reconstruct predictive networks at any confidence level Automated way to derive network graphs from a gene list, refining the graph with the expression data, and learning conditional probabilities that can be used to predict gene response to various insults Wide range of uses: mechanistic studies to drug target prioritization