Visualization and analysis of large data collections: a case study applied to confocal microscopy data Wim de Leeuw, Swammerdam Institute for Life Sciences,

Slides:



Advertisements
Similar presentations
High Resolution studies
Advertisements

TWO STEP EQUATIONS 1. SOLVE FOR X 2. DO THE ADDITION STEP FIRST
QUALITY CONTROL TOOLS FOR PROCESS IMPROVEMENT
V-Detector: A Negative Selection Algorithm Zhou Ji, advised by Prof. Dasgupta Computer Science Research Day The University of Memphis March 25, 2005.
Hands-On Problem Solving with Remote Electron Microscopy George Motter Andrea Harmer Lehigh University.
Combining Like Terms. Only combine terms that are exactly the same!! Whats the same mean? –If numbers have a variable, then you can combine only ones.
Introduction ATMCP and Performance Dominique Colin de Verdière (CENA) Bernard Miaillier (Eurocontrol) TIM9 - ATMCP-RTSP May 2002.
Coordinate Plane Practice The following presentation provides practice in two skillsThe following presentation provides practice in two skills –Graphing.
0 - 0.
MULT. INTEGERS 1. IF THE SIGNS ARE THE SAME THE ANSWER IS POSITIVE 2. IF THE SIGNS ARE DIFFERENT THE ANSWER IS NEGATIVE.
Addition Facts
Year 6 mental test 5 second questions
5th Intensive Course on Soil Micromorphology Naples th - 14th September Image Analysis Lecture 1 Introduction.
Session # 2 SWE 211 – Introduction to Software Engineering Lect. Amanullah Quadri 2. Fact Finding & Techniques.
Chapter 4: Basic Estimation Techniques
1 ECE 495 – Integrated System Design I Introduction to Image Processing ECE 495, Spring 2013.
1. 2 GUIDELINES 1. Identify the variable(s) of interest (the focus) and the population of the study. 2. Develop a detailed plan for collecting data. If.
LOGO Regression Analysis Lecturer: Dr. Bo Yuan
McGraw-Hill/Irwin McGraw-Hill/Irwin Copyright © 2009 by The McGraw-Hill Companies, Inc. All rights reserved.
KONNEX Scientific Conference, 5th October 2004 KONNEX Configuration modes 1 KONNEX Configuration modes in the frame of a modular communication stack Petar.
Non-Parametric Statistics
Matthias Wimmer, Bernd Radig, Michael Beetz Chair for Image Understanding Computer Science TU München, Germany A Person and Context.
1 UNSD/FSO workshop on Integrated Economic Statistics Jean-François Fracheboud Federal Department of Home Affairs FDHA Federal Statistical Office Re-engineering.
Experimental Measurements and their Uncertainties
Take from Ten First Subtraction Strategy -9 Click on a number below to go directly to that type of subtraction problems
The world leader in serving science TQ ANALYST SOFTWARE Putting your applications on target.
Template V.24, 1-Mar-2007 Creating Value with the Infor MES & QM Jay Melville Sr. Manufacturing Consultant Mike LeRoy Business Unit Manager, May 30, 2007.
Hours Listening To Music In A Week! David Burgueño, Nestor Garcia, Rodrigo Martinez.
Devising Secure Sockets Layer-Based Distributed Systems: A Performance-Aware Approach Norman Lim, Shikharesh Majumdar,Vineet Srivastava, Dept. of Systems.
Past Tense Probe. Past Tense Probe Past Tense Probe – Practice 1.
Who are the Experts?Simon KampaSlide 1 Who are the Experts? Simon Kampa IAM Group University of Southampton
Addition 1’s to 20.
25 seconds left…...
Copyright © 2011 Pearson Education, Inc. Putting Statistics to Work.
CSTA K-12 Computer Science Standards (rev 2011)
Test B, 100 Subtraction Facts
Week 1.
Analyzing Genes and Genomes
Vanderbilt Business Objects Users Group 1 Linking Data from Multiple Sources.
1 Cross-Correlations and Cleaning Up Data Jessica Ferguson.
CSE 473/573 Computer Vision and Image Processing (CVIP) Ifeoma Nwogu Lecture 27 – Overview of probability concepts 1.
Chapter 15: Statistics Section 15.1: Formulating Statistical Questions, Gathering Data, and Using Samples.
Bottoms Up Factoring. Start with the X-box 3-9 Product Sum
How Cells Obtain Energy from Food
Hydrological information systems Svein Taksdal Head of section, Section for Hydroinformatics Hydrology department Norwegian Water Resources and Energy.
Migration of a large survey onto a micro-economic platform Val Cox April 2014.
Image Analysis Phases Image pre-processing –Noise suppression, linear and non-linear filters, deconvolution, etc. Image segmentation –Detection of objects.
Applications of Visualization and Data Clustering to 3D Gene Expression Data Oliver Rübel 1,2,3,7, Gunther H. Weber 3,7, Min-Yu Huang 1,7, E. Wes Bethel.
Modeling Process Quality
Computational Biology: A Measurement Perspective Alden Dima Information Technology Laboratory
Genomic Arrays: Tools for cancer gene discovery Ian Roberts MRC Cancer Cell Unit Hutchison MRC Research Centre
Dimension reduction : PCA and Clustering Agnieszka S. Juncker Slides: Christopher Workman and Agnieszka S. Juncker Center for Biological Sequence Analysis.
Automatic Analysis of BrdU Incorporation for Replication Timing Assays Intern: Albert F. Cervantes Advisors: York Marahrens Moira Regelson Silvia Diaz-Perez.
Analysis of Drug-Gene Interaction Data Florian Ganglberger Sebastian Nijman Lab.
Image processing for selected biological experiments J. Schier, B. Kovář ÚTIA AV ČR, v.v.i.
Copyright 2000, Media Cybernetics, L.P. Array-Pro ® Analyzer Software.
© The Catholic University of America Dept of Biomedical Engineering ENGR 104: Lecture 2 Statistical Analysis Using Matlab Lecturers: Dr. Binh Tran.
Epigenetic Analysis BIOS Statistics for Systems Biology Spring 2008.
Protein Lysate Microarrays Clay Scott Ryan McConnell Shannon Neeley.
Data Preprocessing Dr. Bernard Chen Ph.D. University of Central Arkansas Fall 2010.
Directions for Hypertext Research: Exploring the Design Space for Interactive Scholarly Communication John J. Leggett & Frank M. Shipman Department of.
Introduction to Statistical Analysis of Gene Expression Data Feng Hong Beespace meeting April 20, 2005.
AdvisorStudent Dr. Jia Li Shaojun Liu Dept. of Computer Science and Engineering, Oakland University Automatic 3D Image Segmentation of Internal Lung Structures.
Image registration aligns the common features of two images. The open-source Insight Toolkit (ITK, funded by the National Library of Medicine) provides.
High throughput biology data management and data intensive computing drivers George Michaels.
Microarray: An Introduction
M7Plus Unit-10: Statistics CMAPP Days (Compacted Days 1 – 5 )
2-1 Data Summary and Display 2-1 Data Summary and Display.
Statistical Data Analysis
Figure 1. Multiple levels of chromatin folding
Presentation transcript:

Visualization and analysis of large data collections: a case study applied to confocal microscopy data Wim de Leeuw, Swammerdam Institute for Life Sciences, Amsterdam Pernette Verschure, Swammerdam Institute for Life Sciences, Amsterdam Robert van Liere, Center for Mathematics and Computer Science, Amsterdam

2 Motivation (1): Context: cell biology experiments Phenomenon captured using digital microscopy Experiment characteristics: Biological diversity Not all biological parameters can be controlled Many measurements needed

3 Motivation (2): Visualization and analysis of collections of data sets High variability Non-trivial information extraction (eg segmentation) Noise Visualization Modes: Interactive vs Batch Interactive control+feedback vs static settings of parameters Time consuming vs multiple data sets processed simultaneously Aim: combine advantages of Interactive and Batch Visualization

4 Agenda Biological Problem Chromatin structure and gene control Visualization Problem Data collection description Analysis with visual summaries

5 Chromatin Structure and Gene Control Chromatin Structure Low level : DNA, nucleosomes, 30 nm fiber High level: fiber folding Gene control Regulation of gene activity Biological research question: Relation chromatin structure and gene control Is there, what is, when, etc....

6 Experiment Question: influence of Hetrochromatin protein 1 on chromatin structure? Approach: Prepare collection of cells with a specific region Control group: target GFP to the region HP1 group : target GFP/HP1 to the region Observe regions with confocal microscope Data analysis question: Identify and quantify the differences between control and HP1 group

7 Collection of data sets 60 data sets (30 control group, 30 HP1 group) Each data set: 512 x 512 x 32 Sample images: Control group (left) HP1 group (right) Data analysis questions: Accurately detect region of interest Quantify region attributes (size, roughness, roundness, etc) What are the attribute differences in the control and HP1 groups ?

8 Diversity of the collection

9 Interactive Visualization of Collection Advantages Control over visualization tools and parameters Segmentation Attribute computations Direct feedback Disadvantages Laborious Error prone

10 Batch processing of collection Advantage All sets are processed automatically A-priori parameter settings Disadvantage No feedback on the process

11 Visual Summaries Definition: a user defined compact visual representation of the data during (batch) processing Governing idea: the visual summary is used to visualize the steps in batch process Examples: General strategy: Interactive setup (determine parameters, attributes, etc) Batch processing using setup Information visualization with visual summaries

12 User Interface using Visual Summaries

13 Discriminating groups Red: HP1 sets, Green: control Region granularity vs number of spots in region Granularity attribute Average intensity gradient of region Plot tells us: Large variation, some outliers HP1 and control seem different

14 Large variation, some outliers Brush / link outliers Investigate visual summary Problems with data set Corrupt data

15 HP1 and control seem different Further analysis Histograms Box plots Statistical tests Wilcoxon Wilcoxon tells us that there is indeed a significant difference

16 Lessons learned Showing a significant difference in granularity vs number of spots tells us that the HP1 effects the structure of chromatin. The effect is that chromatin is condensed in a number of compact regions. Biological significant result. Two papers published Strategy for analysis of collections of confocal data sets Interactive visualization and batch processing are both needed Information visualization is used for the analysis of batch output Visual summaries are used to link back to original data set or previous steps in batch process Strategy has been implemented as the Argos system

17 Generality Argos has been used for the analysis of an experiment consisting of confocal data sets Argos has been used for the analysis of micro array data