Nuria Lopez-Bigas Methods and tools in functional genomics (microarrays) BCO17.

Slides:



Advertisements
Similar presentations
1 Machine Learning: Lecture 10 Unsupervised Learning (Based on Chapter 9 of Nilsson, N., Introduction to Machine Learning, 1996)
Advertisements

Gene Ontology John Pinney
Multiple Criteria for Evaluating Land Cover Classification Algorithms Summary of a paper by R.S. DeFries and Jonathan Cheung-Wai Chan April, 2000 Remote.
Microarrays Dr Peter Smooker,
Gene ontology & hypergeometric test Simon Rasmussen CBS - DTU.
Bio277 Lab 2: Clustering and Classification of Microarray Data Jess Mar Department of Biostatistics Quackenbush Lab DFCI
Lesson 8: Machine Learning (and the Legionella as a case study) Biological Sequences Analysis, MTA.
Introduction to Computational Biology Topics. Molecular Data Definition of data  DNA/RNA  Protein  Expression Basics of programming in Matlab  Vectors.
Dimension reduction : PCA and Clustering Slides by Agnieszka Juncker and Chris Workman.
Gene Set Analysis 09/24/07. From individual gene to gene sets Finding a list of differentially expressed genes is only the starting point. Suppose we.
GEPAS -Gene Expression Pattern Analysis Suite Hongli Li Computer Science Department UMASS Lowell
Gene Expression and Networks. 2 Microarray Analysis Unsupervised -Partion Methods K-means SOM (Self Organizing Maps -Hierarchical Clustering Supervised.
Multidimensional Analysis If you are comparing more than two conditions (for example 10 types of cancer) or if you are looking at a time series (cell cycle.
Introduction to Hierarchical Clustering Analysis Pengyu Hong 09/16/2005.
Microarray analysis 2 Golan Yona. 2) Analysis of co-expression Search for similarly expressed genes experiment1 experiment2 experiment3 ……….. Gene i:
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
GCB/CIS 535 Microarray Topics John Tobias November 15 th, 2004.
Data Mining – Intro.
Analysis of GO annotation at cluster level by H. Bjørn Nielsen Slides from Agnieszka S. Juncker.
Introduction to Data Mining Engineering Group in ACL.
Microarray Data Analysis Illumina Gene Expression Data Analysis Yun Lian.
Microarray Gene Expression Data Analysis A.Venkatesh CBBL Functional Genomics Chapter: 07.
CS Machine Learning. What is Machine Learning? Adapt to / learn from data  To optimize a performance function Can be used to:  Extract knowledge.
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence From Data Mining To Knowledge.
Data Mining Chun-Hung Chou
Intrusion Detection Jie Lin. Outline Introduction A Frame for Intrusion Detection System Intrusion Detection Techniques Ideas for Improving Intrusion.
Whole Genome Expression Analysis
Data Mining Joyeeta Dutta-Moscato July 10, Wherever we have large amounts of data, we have the need for building systems capable of learning information.
Analysis and Management of Microarray Data Dr G. P. S. Raghava.
From motif search to gene expression analysis
Introduction to DNA Microarray Technology Steen Knudsen Uma Chandran.
ArrayCluster: an analytic tool for clustering, data visualization and module finder on gene expression profiles 組員:李祥豪 謝紹陽 江建霖.
Gene Regulatory Network Inference. Progress in Disease Treatment  Personalized medicine is becoming more prevalent for several kinds of cancer treatment.
More on Microarrays Chitta Baral Arizona State University.
Data Mining Chapter 1 Introduction -- Basic Data Mining Tasks -- Related Concepts -- Data Mining Techniques.
Introduction to machine learning and data mining 1 iCSC2014, Juan López González, University of Oviedo Introduction to machine learning Juan López González.
Data Mining: Classification & Predication Hosam Al-Samarraie, PhD. Centre for Instructional Technology & Multimedia Universiti Sains Malaysia.
Microarray - Leukemia vs. normal GeneChip System.
Gene expression analysis
A Short Overview of Microarrays Tex Thompson Spring 2005.
S. F. Molaeezadeh-31 may 2008Gene expression modeling through positive Boolean functions 1 Seminar Title: Gene expression modeling through positive Boolean.
1 Machine Learning 1.Where does machine learning fit in computer science? 2.What is machine learning? 3.Where can machine learning be applied? 4.Should.
Microarrays and Gene Expression Analysis. 2 Gene Expression Data Microarray experiments Applications Data analysis Gene Expression Databases.
Analysis and Management of Microarray Data Previous Workshops –Computer Aided Drug Design –Public Domain Resources in Biology –Application of Computer.
Biological Signal Detection for Protein Function Prediction Investigators: Yang Dai Prime Grant Support: NSF Problem Statement and Motivation Technical.
Analysis of GO annotation at cluster level by Agnieszka S. Juncker.
Gene Expression and Networks. 2 Microarray Analysis Supervised Methods -Analysis of variance -Discriminate analysis -Support Vector Machine (SVM) Unsupervised.
An Overview of Clustering Methods Michael D. Kane, Ph.D.
Bioinformatics MEDC601 Lecture by Brad Windle Ph# Office: Massey Cancer Center, Goodwin Labs Room 319 Web site for lecture:
Course Work Project Project title “Data Analysis Methods for Microarray Based Gene Expression Analysis” Sushil Kumar Singh (batch ) IBAB, Bangalore.
GeWorkbench John Watkinson Columbia University. geWorkbench The bioinformatics platform of the National Center for the Multi-scale Analysis of Genomic.
Gene set analyses of genomic datasets Andreas Schlicker Jelle ten Hoeve Lodewyk Wessels.
Introduction to Microarrays Kellie J. Archer, Ph.D. Assistant Professor Department of Biostatistics
Computational Approaches for Biomarker Discovery SubbaLakshmiswetha Patchamatla.
Clustering.
1 Unsupervised Learning and Clustering Shyh-Kang Jeng Department of Electrical Engineering/ Graduate Institute of Communication/ Graduate Institute of.
GeWorkbench Overview Support Team Molecular Analysis Tools Knowledge Center Columbia University and The Broad Institute of MIT and Harvard.
Data Mining and Decision Support
Gene expression. Gene Expression 2 protein RNA DNA.
Tutorial 8 Gene expression analysis 1. How to interpret an expression matrix Expression data DBs - GEO Clustering –Hierarchical clustering –K-means clustering.
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
Statistical Analysis for Expression Experiments Heather Adams BeeSpace Doctoral Forum Thursday May 21, 2009.
Effect of Alcohol on Brain Development NormalFetal Alcohol Syndrome.
SUPERVISED AND UNSUPERVISED LEARNING Presentation by Ege Saygıner CENG 784.
Biological data representation and data mining Xin Chen
An Artificial Intelligence Approach to Precision Oncology
Analysis of GO annotation at cluster level by Agnieszka S. Juncker
Classification and Prediction
Dimension reduction : PCA and Clustering
Gene Expression Analysis
Presentation transcript:

Nuria Lopez-Bigas Methods and tools in functional genomics (microarrays) BCO17

What are microarrays?

Microarray data analysis is the step that will allow us to extract biological meaning to high-throughput data generated with the experiment. Microarray data analysis

Microarray DATA Normalized data Data preprocession and normalization

Normalization and Noise: Normalization Some kind of normalization is usually required when comparing more than one microarray experiment. Adjust to account for differences in overall brightness of slides Normalize relative to housekeeping genes Noise Refers to variability and reproducibility of microarray experiments Intra and inter-microarray variations can significantly skew interpretation of data Sample collection is very important. If comparing two conditions you must control for all variables other than the one you are trying to measure Technical noise can result from imperfections in the chip. Both biological and technical replicates are required to measure and control these sources of noise Microarray data analysis

Differential expression Microarray DATANormalized data Data preprocession and normalization Data analysis

Microarray data analysis Differential expressionGO,KEGG…analysis Microarray DATANormalized data Data preprocession and normalization Data analysis

The Gene Ontology project provides a controlled vocabulary to describe gene and gene product attributes in any organism. The Ontologies Cellular component Biological process Molecular function BROWSER::AMIGO TOOLS Gene Ontology

Gene Ontology::Tools FUNC-EXPRESSION

KEGG

Microarray data analysis Differential expression GO,KEGG…analysis Classification Microarray DATANormalized data Data preprocession and normalization Data analysis

Classification Support vectors machines Desition trees

Microarray data analysis Differential expression GO,KEGG…analysis Classification Clustering Microarray DATANormalized data Data preprocession and normalization Data analysis

Supervised versus Unsupervised: Supervised Analysis to determine genes that fit a predetermined pattern Usually used to find genes with expression levels that are significantly different between groups of samples or finding genes that accurately predict a characteristic of the sample Two popular supervised techniques would be nearest-neighbour analysis and support vector machines. Unsupervised Analysis to characterize the components of a data set without a priori input or knowledge of a training signal Try to find internal structure or relationships in data without trying to predict some ‘correct answer’. Three classes: 1. Feature determination: Look for genes with interesting patterns Eg. Principal-components analysis 2. Cluster determination: Determine groups of genes with similar expression patterns eg. Nearest-neighbour clustering, self-organizing maps, k-means clustering, 2d hierarchical clustering 3. Network determination: Determine graphs representing gene-gene or gene-phenotype interactions. Eg. Boolean networks, Bayesian networks, relevance networks Clustering & Classification

Cooper Breast Cancer Res :158

Microarray data analysis Differential expression GO,KEGG…analysis Clustering Classification Promoter analysis Microarray DATANormalized data Data preprocession and normalization Data analysis

Promoter analysis::TFBS TRANSFAC

Promoter analysis::Tools

Microarray data analysis Differential expression GO,KEGG…analysis Clustering Classification Promoter analysis Reverse engineering Microarray DATANormalized data Data preprocession and normalization Data analysis

Reverse engineering

Microarray data analysis Differential expression GO,KEGG…analysis Clustering Classification Promoter analysis Reverse engineering Microarray DATANormalized data Data preprocession and normalization Data analysis