Flow Data Analysis Challenges Deck from Amgen Attendees Bioinformatics/ Biostatistics Molecular Computational Biology Sciences John Gosink Cheng SuKatie.

Slides:



Advertisements
Similar presentations
Building a Panel Select markers Select conjugates Method to build panel performance criteria.
Advertisements

Design of Experiments Lecture I
Isoparametric Elements Element Stiffness Matrices
Working with Statisticians At some point, a statistician is likely to be asked to analyze your data. This can lead to much unhappiness.
Quality Control in Immunophenotyping Dr. N. Varma Prof. & Head – Hematology Post Graduate Institute of Medical Education & Research (PGIMER), Chandigarh.
Advances In Flow cytometry Amina Kariminia Ph.D.
John Porter Why this presentation? The forms data take for analysis are often different than the forms data take for archival storage Spreadsheets are.
Virtual Dart: An Augmented Reality Game on Mobile Device Supervisor: Professor Michael R. Lyu Prepared by: Lai Chung Sum Siu Ho Tung.
Validity, Sampling & Experimental Control Psych 231: Research Methods in Psychology.
Statistics for the Social Sciences
David Adams ATLAS DIAL Distributed Interactive Analysis of Large datasets David Adams BNL March 25, 2003 CHEP 2003 Data Analysis Environment and Visualization.
Kate Milova MolGen retreat March 24, Microarray experiments: Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
Kate Milova MolGen retreat March 24, Microarray experiments. Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
Microsoft Office Illustrated Inserting Illustrations, Objects, and Media Clips.
Presenting information
Status of calorimeter simulations Mikhail Prokudin, ITEP.
Visual Documentation v User Interface Active class (for selection and some processes)
Basics of Flow Cytometry Holden Maecker. Outline Definitions, what can be measured by flow cytometry Fluidics: Sheath and sample streams, flow cells,
Classification and Prediction: Regression Analysis
Rebecca Boger Earth and Environmental Sciences Brooklyn College.
ICCS e-newsletter CSI Summer 2011 Jeannine T. Holden, MD Emory University School of Medicine Atlanta, GA.
Flow Cytometry and Reproducible Analysis Cliburn Chan Department of Biostatistics and Bioinformatics, DUMC.
JumpStart the Regulatory Review: Applying the Right Tools at the Right Time to the Right Audience Lilliam Rosario, Ph.D. Director Office of Computational.
Protein Tertiary Structure Prediction
Annex I: Methods & Tools prepared by some members of the ICH Q9 EWG for example only; not an official policy/guidance July 2006, slide 1 ICH Q9 QUALITY.
Net Optics Confidential and Proprietary Net Optics appTap Intelligent Access and Monitoring Architecture Solutions.
Hydrologic Modeling: Verification, Validation, Calibration, and Sensitivity Analysis Fritz R. Fiedler, P.E., Ph.D.
National Immune Monitoring Laboratory University of Montreal, Quebec, Canada Biosystemix Ltd. Biosystemix, Ltd.
ArrayCluster: an analytic tool for clustering, data visualization and module finder on gene expression profiles 組員:李祥豪 謝紹陽 江建霖.
Flow cytometry to evaluate vaccine-induced T cell responses: standardized analysis of large numbers of FCS files Stephen De Rosa, M.D. HVTN Laboratory.
Basic Principles, Instrumentation, and Practices
Outline 3  PWA overview Computational challenges in Partial Wave Analysis Comparison of new and old PWA software design - performance issues Maciej Swat.
Areej Jouhar & Hafsa El-Zain Biostatistics BIOS 101 Foundation year.
Computational Diagnostics: A Novel Approach to Viewing Medical Data 5th International Conference on Coordinated & Multiple Views in Exploratory Visualization.
Immunological Images and the ImmPort Database and Analysis Portal Anna Maria Masci Department of Immunology Duke University Buffalo, 24 June 2014.
Quality Assurance How do you know your results are correct? How confident are you?
Computer Science 210 Computer Organization More on Assembler.
Source catalog generation Aim: Build the LAT source catalog (1, 3, 5 years) Jean Ballet, CEA SaclayGSFC, 29 June 2005 Four main functions: Find unknown.
1 Research Question  Can a vision-based mobile robot  with limited computation and memory,  and rapidly varying camera positions,  operate autonomously.
Final project – Computational Biology RNA Quantification מגישים: מיכל סימון חיים בן שימול בהנחיית: יהודה ברודי ד"ר ירון שב-טל.
Data Preprocessing Compiled By: Umair Yaqub Lecturer Govt. Murray College Sialkot.
Validation Defination Establishing documentary evidence which provides a high degree of assurance that specification process will consistently produce.
DTC Quantitative Methods Summary of some SPSS commands Weeks 1 & 2, January 2012.
1 End-to-End Learning for Automatic Cell Phenotyping Paolo Emilio Barbano, Koray Kavukcuoglu, Marco Scoffier, Yann LeCun April 26, 2006.
U.S. Department of the Interior U.S. Geological Survey Automatic Generation of Parameter Inputs and Visualization of Model Outputs for AGNPS using GIS.
1 Circuitscape Capstone Presentation Team Circuitscape Katie Rankin Mike Schulte Carl Reniker Sean Collins.
1 PEER Session 02/04/15. 2  Multiple good data management software options exist – quantitative (e.g., SPSS), qualitative (e.g, atlas.ti), mixed (e.g.,
Approaches to Intrusion Detection statistical anomaly detection – threshold – profile based rule-based detection – anomaly – penetration identification.
1 Giuseppe Romeo Voronoi based Source Detection. 2 Voronoi cell The Voronoi tessellation is constructed as follows: for each data point  i (also called.
1 Collecting and Interpreting Quantitative Data Deborah K. van Alphen and Robert W. Lingard California State University, Northridge.
1 ITN clinical trial analysis design – Transplantation example Cross-sectional and longitudinal clinical trials to assess tolerance Disease areas: Transplantation.
1 SOFTWARE FOR VERIFICATION OF HOMOGENEITY RESULTS FROM AUTOMATED URINE ANALYZERS IN REAL TIME Waldemar Volanski Ademir Luiz do Prado Geraldo Picheth et.
Setting Up Copy Number Variation Assays
Vanessa L. Peachee, MS, PhD Director, Immunotoxicology March 14, 2016
Greg Finak, Ph.D., Senior Staff Scientist Marylou Ingram ISAC Scholar
Flow Cytometry Halima Moncrieffe, University College London, UK IL-17
Computer Science 210 Computer Organization
Flowcytometry.
Dr Lindi Coetzee Ms Keshendree Moodley Prof D.K. Glencross
Data challenges in the pharmaceutical industry
Computer Science 210 Computer Organization
Memory B cells IgM only Marginal zone B cells without plasmablasts
Machine Learning Feature Creation and Selection
Lab 2 Data Manipulation and Descriptive Stats in R
Assessing Quality of Paradata to Better Understand the Data Collection Process for CAPI Social Surveys François Laflamme Milana Karaganis European Conference.
Kathleen R. Bartemes, BA, Gail M. Kephart, BS, Stephanie J
Accelerating Research in Life Sciences
The Art of Flow Cytometry
TransCAD Working with Matrices 2019/4/29.
Collecting and Interpreting Quantitative Data
Presentation transcript:

Flow Data Analysis Challenges Deck from Amgen Attendees Bioinformatics/ Biostatistics Molecular Computational Biology Sciences John Gosink Cheng SuKatie Newhall Hugh Rand Bill Rees Mark DalphinGary Means Wednesday, September 20, 2006

For Internal Use Only. Amgen Confidential. 2 Sample and meta-data tracking can be complicated Misc cytokines Misc drugs Stimulation/inhibition combinations Blood samples Multiple cell types FCS files Cell events 1,000 – 10, – – 10 10,000 – 100,000 5 – 20 blood samples stimulations / sample cell types/mix cell events/ cell type channels/cell 5,000 samples x 50 stims/sample x 7 cell-types/cocktail x 5 Mbytes/FCS file  10 terabytes An FCS file Approx. the size of an Affymetrix Microarray.CEL file Need a relational database and associated code infrastructure John Gosink, Bioinformatics/Computational Biology, Amgen

For Internal Use Only. Amgen Confidential. 3 Some meta-data that we need to capture, store, and index (let alone the actual FCS files/data) Sample meta-data Sample ID Sample to well mapping Stimulation conditions Dilutions Reagent meta-data Reagent batches Labeling scheme Machine meta-data (FCS format currently captures most) measurement windows PMT settings compensation (and matrix) transformation Gating parameters coordinates thresholds gate hierarchy John Gosink, Bioinformatics/Computational Biology, Amgen

For Internal Use Only. Amgen Confidential. 4 More interesting questions involve natural cell populations and their variation Catalog of all cell types - What are their distributions in all of flow parameter space - How to standardize between samples and runs What are fruitful approaches to characterizing these distributions -Baseline catagorization - Number of “typical” cell volumes (archetypes) - Location of archetypes - Shapes of archetypes - Relationships of cell counts in the archetypes - Characterization of the “void” - How empty is the void - How smooth is the void Detection of novel (sub) populations and unforseen changes John Gosink, Bioinformatics/Computational Biology, Amgen

For Internal Use Only. Amgen Confidential. 5  Question: How do we best quantitate multiple overlapping peaks  One Approach: Fit peaks as a sum of small numbers of basis set functions.  Issues: Basis set choice, sensitivity, accuracy, … Separation of Overlapping Peaks Hugh Rand, Bioinformatics/Computational Biology, Amgen

For Internal Use Only. Amgen Confidential. 6 Example histograms Noisy Overlap Small Peaks Shape More Overlap Hugh Rand, Bioinformatics/Computational Biology, Amgen

For Internal Use Only. Amgen Confidential. 7 Receptor Occupancy Assay and Analysis Unlabeled Drug Ab Labeled Drug Ab Labeled Recpt Ab Labeled Isotype Ctrl Ab Unlabeled 0 – sat’d dose Flow Cytometer Labeled Ab Labeled anti-recpt Ab Labeled isotype ctrl Cell with specific and non-specific receptors Cell with specific and non-specific receptors. Ab induces more recpt. Some Drug in Animal No Drug in Animal Mark Dalphin, Bioinformatics/Computational Biology, Amgen

For Internal Use Only. Amgen Confidential. 8 Some math… Simple form, without non-specific binding Add non-specific binding and things are not so tidy Mark Dalphin, Bioinformatics/Computational Biology, Amgen

For Internal Use Only. Amgen Confidential. 9 Problems with receptor occupancy assays  Even with 1:1 conjugates, MFI varies significantly from Ab to Ab against the same receptor  “Can’t see less than 1,000 receptors per cell”  Large variability from instrument to instrument and run to run  Why doesn’t this behave like a well-controlled physical experiment; why is it “semi-quantitative”?  I’d like to see: –Easy loading of data-sets and meta-data –Module to compute occupancy –Some way to look at associated binding curves Mark Dalphin, Bioinformatics/Computational Biology, Amgen

For Internal Use Only. Amgen Confidential. 10 Gating Sensitivity  If gates change slightly, will results change?  Reasons for considering gating sensitivity: – Quantitative analysis of the responses – Gating is done per individual samples – Gating is somewhat subjective, even auto-gating – Multiple gates used – Subgroups of small size Cheng Su, Biostatistics, Amgen

For Internal Use Only. Amgen Confidential. 11 Gating Sensitivity Analysis  Sensitivity Analysis – Get new gates by moving the boundary of gates – Conduct analysis – Compare the results  Challenges – software/system: to import the gate boundary – methodology: methods to automate gate movement and compare results Cheng Su, Biostatistics, Amgen

For Internal Use Only. Amgen Confidential. 12 System Outline Samples LSRII XMLFCS Gating B Cells T Cells NK Cells Analysis (R,SAS,Java,…) Result Check against Cheng Su, Biostatistics, Amgen

For Internal Use Only. Amgen Confidential. 13 How to move what we do in proprietary graphical tools into a more high-throughput environments?  Question: Are there applications available that can accommodate the size of FCS files that I generate, allow me to compare data across a plate, and provide data output in an acceptable format?  Problem: Currently using a 9-color, 12-parameter antibody panel in whole blood (and it’s only getting bigger!) –FCS file size = 10,000 to 30,000 KB –Analysis time = 8 hours for 32 samples/wells –Export time = minutes for 32 FCS files –Output = at least 7 gated files for each FCS file Katie Newhall, Molecular Sciences, Amgen

For Internal Use Only. Amgen Confidential. 14 How to move what we do in proprietary graphical tools into a more high-throughput environments?  Potential solutions –Analysis Automated gating Sample flagging Comparison of samples across a plate Output of histogram statistics in an excel format –Export time Gating information and experimental metadata exported with FCS/TXT files Katie Newhall, Molecular Sciences, Amgen

For Internal Use Only. Amgen Confidential. 15 immunophenotyping  experiment: – 80 clinical whole blood samples – no ex vivo manipulation – 4 dose cohorts – 38 3-color, RBClyse/no-wash stains – parameter FCS files  What populations of events change in some way as a function of drug dose or disease state or changes in other populations? Bill Rees, Molecular Sciences, Amgen

For Internal Use Only. Amgen Confidential. 16 An immunophenotyping panel T cells B cells NK cells monocytes Bill Rees, Molecular Sciences, Amgen

For Internal Use Only. Amgen Confidential. 17 Immunophenotyping  I will not deal with this 2-dimensions at a time – time – too many populations in each stain, only some do I know to look for – don’t know what I’m looking for with minimal biological insight  Issues: –definitions of terms –Metrics, e.g. MFI and %CD45+ events, % responders –Linking raw data to other study data/protocols and to analysis product –Autogating with visual QC –Can the identification of the major cell types (operationally defined by robust stains, e.g. CD3+ CD8+ CD56-) be automated to incrementally reduce the analysis time? Bill Rees, Molecular Sciences, Amgen

For Internal Use Only. Amgen Confidential. 18 Whole blood stimulation assays where leukocytes are evaluated for phosphoprotein pathway activation inhibition Note: This is the region where notes could be placed Gary Means, Molecular Sciences, Amgen

For Internal Use Only. Amgen Confidential. 19 Process Cells Whole blood Stimulate Labeling Flow Sample Data File Soft- ware T cell Granulocyte NKB cell Monocyte DN Lymphocyte CD4+/CD8+CD8+CD4+ CD8+ memory CD4+ memory Use bioinformatics tools to evaluate coordinate regulation of at multiple different intracellular targets 11 gates x 4 targets x 96 wells Problem? Each set of gated data must be independently exported and kept linked to the experimental process metadata Gary Means, Molecular Sciences, Amgen

For Internal Use Only. Amgen Confidential. 20  Automatically export events with additional columns which contain all of the gating information associated with each event.  Metadata must be inextricably associated with the experimental results. Solutions? Gary Means, Molecular Sciences, Amgen