Inferring Gene Regulatory Networks from Asynchronous Microarray Data David Oviatt, Dr. Mark Clement, Dr. Quinn Snell, Kenneth Sundberg Department of Computer.

Slides:



Advertisements
Similar presentations
Michael R. Smith, Mark Clement, Tony Martinez, and Quinn Snell
Advertisements

Sparsification and Sampling of Networks for Collective Classification
DREAM4 Puzzle – inferring network structure from microarray data Qiong Cheng.
Combining Classification and Model Trees for Handling Ordinal Problems D. Anyfantis, M. Karagiannopoulos S. B. Kotsiantis, P. E. Pintelas Educational Software.
Analyzing Gene Relationships for Down Syndrome with Labeled Transition Graphs Neha Rungta, Hyrum Carroll, Eric Mercer, Mark Clement, and Quinn Snell Computer.
GNUMap: Unbiased Probabilistic Mapping of Next- Generation Sequencing Reads Nathan Clement Computational Sciences Laboratory Brigham Young University Provo,
Clustered alignments of gene- expression time series data Adam A. Smith, Aaron Vollrath, Cristopher A. Bradfield and Mark Craven Department of Biosatatistics.
Daozheng Chen 1, Mustafa Bilgic 2, Lise Getoor 1, David Jacobs 1, Lilyana Mihalkova 1, Tom Yeh 1 1 Department of Computer Science, University of Maryland,
GENIE – GEne Network Inference with Ensemble of trees Van Anh Huynh-Thu Department of Electrical Engineering and Computer Science, Systems and Modeling,
7.3 Kruskal’s Algorithm. Kruskal’s Algorithm was developed by JOSEPH KRUSKAL.
DNA Microarray Bioinformatics - #27611 Program Normalization exercise (from last week) Dimension reduction theory (PCA/Clustering) Dimension reduction.
Iterative Optimization of Hierarchical Clusterings Doug Fisher Department of Computer Science, Vanderbilt University Journal of Artificial Intelligence.
Recognizing Ontology-Applicable Multiple-Record Web Documents David W. Embley Dennis Ng Li Xu Brigham Young University.
Dimension reduction : PCA and Clustering by Agnieszka S. Juncker
Evolutionary Algorithms Simon M. Lucas. The basic idea Initialise a random population of individuals repeat { evaluate select vary (e.g. mutate or crossover)
Simulation and Application on learning gene causal relationships Xin Zhang.
Dimension Reduction and Feature Selection Craig A. Struble, Ph.D. Department of Mathematics, Statistics, and Computer Science Marquette University.
Project Phase I l Due on 9/22, send me through l 2-10 Pages l Free style in writing (use 11pt font or larger) l Project description å Overview å.
Microarray analysis 2 Golan Yona. 2) Analysis of co-expression Search for similarly expressed genes experiment1 experiment2 experiment3 ……….. Gene i:
Applications of Data Mining in Microarray Data Analysis Yen-Jen Oyang Dept. of Computer Science and Information Engineering.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Genetic Regulatory Network Inference Russell Schwartz Department of Biological Sciences Carnegie Mellon University.
BEN LOCKWOOD DAVID C. LEBLANC DEPT. OF BIOLOGY, BALL STATE UNIVERSITY Growth-Climate Associations for White Ash (Fraxinus americana L.) in Monroe County,
Gene Regulatory Network Inference. Progress in Disease Treatment  Personalized medicine is becoming more prevalent for several kinds of cancer treatment.
Computational Intelligence: Methods and Applications Lecture 30 Neurofuzzy system FSM and covering algorithms. Włodzisław Duch Dept. of Informatics, UMK.
Computer Science, Software Engineering & Robotics Workshop, FGCU, April 27-28, 2012 Fault Prediction with Particle Filters by David Hatfield mentors: Dr.
PCA, Clustering and Classification by Agnieszka S. Juncker Part of the slides is adapted from Chris Workman.
Class Prediction and Discovery Using Gene Expression Data Donna K. Slonim, Pablo Tamayo, Jill P. Mesirov, Todd R. Golub, Eric S. Lander 발표자 : 이인희.
Clustering What is clustering? Also called “unsupervised learning”Also called “unsupervised learning”
Learning Linear Causal Models Oksana Kohutyuk ComS 673 Spring 2005 Department of Computer Science Iowa State University.
Dr. Engr. Sami ur Rahman Data Analysis Correlational Research.
1 Thinking Critically with Psychological Science Chapter 1.
Evolutionary Algorithms for Finding Optimal Gene Sets in Micro array Prediction. J. M. Deutsch Presented by: Shruti Sharma.
Metabolic Network Inference from Multiple Types of Genomic Data Yoshihiro Yamanishi Centre de Bio-informatique, Ecole des Mines de Paris.
LOGO iDNA-Prot|dis: Identifying DNA-Binding Proteins by Incorporating Amino Acid Distance- Pairs and Reduced Alphabet Profile into the General Pseudo Amino.
Artificial Intelligence Research Laboratory Bioinformatics and Computational Biology Program Computational Intelligence, Learning, and Discovery Program.
Extracting binary signals from microarray time-course data Debashis Sahoo 1, David L. Dill 2, Rob Tibshirani 3 and Sylvia K. Plevritis 4 1 Department of.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Model-based evaluation of clustering validation measures.
Gene expression & Clustering. Determining gene function Sequence comparison tells us if a gene is similar to another gene, e.g., in a new species –Dynamic.
341- INTRODUCTION TO BIOINFORMATICS Overview of the Course Material 1.
GeneMANIA: a real-time multiple association network integration algorithm for predicting gene function Sara Mostafavi, Debajyoti Ray, David Warde-Farley,
Distributed Spanning Tree Center for Information Security Department of Computer Science University of Tulsa, Tulsa, OK
Graph Theory Trees. WHAT YOU WILL LEARN Trees, spanning trees, and minimum-cost spanning trees.
Typically, classifiers are trained based on local features of each site in the training set of protein sequences. Thus no global sequence information is.
Tutorial 8 Gene expression analysis 1. How to interpret an expression matrix Expression data DBs - GEO Clustering –Hierarchical clustering –K-means clustering.
A comparative approach for gene network inference using time-series gene expression data Guillaume Bourque* and David Sankoff *Centre de Recherches Mathématiques,
Computational Biology Group. Class prediction of tumor samples Supervised Clustering Detection of Subgroups in a Class.
Cluster Analysis Data Mining Experiment Department of Computer Science Shenzhen Graduate School Harbin Institute of Technology.
Advanced Gene Selection Algorithms Designed for Microarray Datasets Limitation of current feature selection methods: –Ignores gene/gene interaction: single.
Scalable Learning of Collective Behavior Based on Sparse Social Dimensions Lei Tang, Huan Liu CIKM ’ 09 Speaker: Hsin-Lan, Wang Date: 2010/02/01.
Performance measures Morten Nielsen, CBS, Department of Systems Biology, DTU.
Elinor Velasquez San Francisco State University Department of Biology.
Classification Using Top Scoring Pair Based Methods Tina Gui.
Analyzing circadian expression data by harmonic regression based on autoregressive spectral estimation Rendong Yang and Zhen Su Division of Bioinformatics,
MicroArray Data Analysis Candice Quadros & Amol Kothari.
EQTLs.
Cluster Analysis II 10/03/2012.
CSE 4705 Artificial Intelligence
Regulatory network inference: use of whole brain vs
Bud Mishra Professor of Computer Science and Mathematics 12 ¦ 3 ¦ 2001
الأستاذ المساعد بقسم المناهج وطرق التدريس
Computer Science Department Brigham Young University

Kruskal’s Algorithm for finding a minimum spanning tree
Evaluation of inferred networks
Somi Jacob and Christian Bach
Reecha Khanal Mentor: Avdesh Mishra Supervisor: Dr. Md Tamjidul Hoque
Minimum Spanning Trees (MSTs)
Vectors: Adding and subtracting
RANDOM NUMBERS SET # 1:
Presentation transcript:

Inferring Gene Regulatory Networks from Asynchronous Microarray Data David Oviatt, Dr. Mark Clement, Dr. Quinn Snell, Kenneth Sundberg Department of Computer Science, Brigham Young University, Provo, UT Jared Allen, Dr. Randall Roper Department of Biology, Indiana University-Purdue University Indianapolis, Indianapolis, IN

Purpose: Use microarray data to infer the gene regulatory network of an organism

Other Methods' Unreasonable Requirements High number of samples Time series data

Problems: Scarcity of microarray data Large size of networks Noise

AIRnet: Asynchronous Inference of Regulatory networks Classify gene levels using k-means clustering Compute influence vectors (i.v.) Convert i.v.'s into a sorted list of edges Use Kruskal's algorithm to find the minimum-cost spanning tree

Influence Vectors Perform pairwise- comparisons of change in gene levels between samples, adding or subtracting from i.v. Divide i.v. by the total number of comparisons

In-silico Data DREAM3 competition - M3_Challenges Laboratory of Intelligent Systems: Thomas Schaffter and Daniel Marbach - GeneNet Weaver -

Clockwise from top left: simulated E.coli 1 network; E.coli 1 inferred correlations above 50%; simulated E.coli 2 network; E.coli 2 inferred correlations above 50%; inferred networks made using 2 bins for each gene.

Metrics Precision Recall F-score Accuracy Sensitivity Specificity MCC – Matthews Correlation Coefficient

AIRnet Compared to Random 1000 random predictions created for each test case Mean score of each metric reported for each network size

Score Summaries:

The End