Presentation on theme: "RTI International RTI International is a trade name of Research Triangle Institute. www.rti.org Designing a high quality metabolomics experiment Grier."— Presentation transcript:
RTI International RTI International is a trade name of Research Triangle Institute. Designing a high quality metabolomics experiment Grier P Page Ph.D. Senior Statistical Geneticist RTI International Atlanta Office
RTI International Metabolomics is Powerful and Central
RTI International Designing a good study
RTI International RTI International is a trade name of Research Triangle Institute. Errors Errors Everywhere
UMSA Analysis Insulin Resistant Insulin Sensitive Day 1 Day 2
RTI International Understand the strengths and weaknesses of each step of the experiments. Take these strengths and weaknesses into account in your design. Primary consideration of good experimental design
RTI International RTI International is a trade name of Research Triangle Institute.
From Drug Discov Today Sep 1;10(17):
RTI International State the Question and Articulate the Goals
RTI International The Myth That Metabolomics does not need a Hypothesis There always needs to be a biological question in the experiment. If there is not even a question don’t bother. The question could be nebulous: What happens to the metabolome of this tissue when I apply Drug A. The purpose of the question is to drive the experimental design. Make sure the samples answer the question: Cause vs. effect.
Design Issues Known sources of non-biological error (not exhaustive) that must be addressed – Technician / post-doc – Reagent lot – Temperature – Protocol – Date – Location – Cage/ Field positions
RTI International Experimental Design
RTI International Biological replication is essential. Two types of replication – Biological replication – samples from different individuals are analyzed – Technical replication – same sample measured repeatedly Technical replicates allow only the effects of measurement variability to be estimated and reduced, whereas biological replicates allow this to be done for both measurement variability and biological differences between cases. Almost all experiments that use statistical inference require biological replication.
RTI International How many replicates? Controlled experiments – cell lines, mice, rats 8-12 per group. Human studies – discovery 20+ per group For predictive models – 100+ per group, need model building and validation sets The more the better, always.
RTI International Experimental Conduct All experiments are subject to non- biological variability that can confound any study
RTI International Control Everything! Know what you are doing Practice!
RTI International What if you can’t control or make all things uniform Randomize Orthogonalize
RTI International What are Orthogonalization and Randomization ? Orthogonalization- spreading the biological sources of error evenly across the non-biological sources of error. – Maximally powerful for known sources of error. Randomization – spear the biological sources of error at random across the non-biological sources of error. – Useful for controlling for unknown sources of error
RTI International Examples of Orthogonalization and Randomization ? Sample #TreatmentVariety OrderSample OrderSample The experiment Orthogonalize Randomize
RTI International RTI International is a trade name of Research Triangle Institute. Statistical analyses have assumptions too
RTI International Statistical analyses Supervised analyses – linear models etc – Assume IID (independently identically distibuted) – Normality – Sometimes can rely on central limit – ‘Weird’ variances – Using fold change alone as a statistic alone is not valid. – ‘Shrinkage’ and or use of Bayes can be a good thing. False-discovery rate is a good alternative to conventional multiple-testing approaches. Pathway testing is desirable.
RTI International Classification Supervised classification – Supervised-classification procedures require independent cross-validation. – See MAQC-II recommendations Nat Biotechnol August ; 28(8): 827–838. doi: /nbt Wholly separate model building and validation stages. Can be 3 stage with multiple models tested Unsupervised classification – Unsupervised classification should be validated using resampling-based procedures.
RTI International Unsupervised classification - continued Unsupervised analysis methods – Cluster analysis – Principle components – Separability analysis All have assumptions and input parameters and changing them results in very different answers
Sample size estimation for metabolomics studies
RTI International There is strength in numbers — power and sample size. Unsupervised analyses – Principal components, clustering, heat maps and variants – These are actually data transformations or data display rather than hypothesis testing, thus unclear if sample size estimation is appropriate or even possible. – Stability of clustering may be appropriate to think about. Garge et al 2005 suggested 50+ samples for any stability.
RTI International Sample size in supervised experiments Supervised analyses – Linear models and variants – Methods are still evolving, but we suggest the approach we developed for microarrays may be appropriate for metabolomics (being evaluated)
RTI International is a trade name of Research Triangle Institute. Metabolomics does not reveal everything and different technologies show different things
RTI International Technology and detection evolves over time.
RTI International Technologies are not perfect in agreement
RTI International The human urine metabolome
RTI International Sample, Image and Data Quality Checking
Metabolite quality Still evolving field RTI is one of the Metabolomics Reference Standards Synthesis Centers
RTI International Know your data - What should it look like
These are OK
These are not OK
RTI International One bad sample can contaminate an experiment
Histogram of p-values
Potentially Bad Data
Histogram of p-values with bad data removed
RTI International Quality of Database, Bioinformatics and Interpretative tools
RTI International Just because a database says something does not mean it is right. Read the evidence. Databases are biased. Databases are incomplete Databases have lots of data Understand data before you use it Database are useful! Understand what databases include, don’t include, and assumptions
RTI International RTI International is a trade name of Research Triangle Institute. Issues in the Annotation of Genes, proteins, metabolites
RTI International Annotation is inconsistent across sources
RTI International RTI International is a trade name of Research Triangle Institute. Issues with pathway data
TCA cycle from Ingenuity
TCA from GeneMAPP
TCA cycle from Ingenuity
RTI International RTI International is a trade name of Research Triangle Institute. Share Your Data Use shared data!
RTI International Metabolomics WorkBench
RTI International MetaboLights
RTI International Practice compendium research – to allow others to replicate your work Many high profile omic studies are not even technically reproducible Overshare your data and show work
RTI International Limited in the literature so far. Some work on tissue and species metabolomes. Use metabolomics databases
RTI International Design your experiment well Conduct your experiment well Control for non-biological sources of error Know what is good and bad quality data at each stage including metabolite, image, data, and annotation If you are aware of these issues and control for them highly powerful and reproducible metabolite experimentation is possible. Else you get garbage Share your data and use shared data Summary
RTI International The MicroArray Quality Control (MAQC)-II study of common practices for the development and validation of microarray based predictive models. Nat Biotechnol August ; 28(8): 827–838. Microarray data analysis: from disarray to consolidation and consensus. Nat Rev Genet Jan;7(1):55-65.Nat Rev Genet. Baggerly K. "Disclose all data in publications." Nature Sep 23;467(7314):401. PMID: Repeatability of published microarray gene expression analyses. Nat Genet Feb;41(2): Nat Genet. A design and statistical perspective on microarray gene expression studies in nutrition: the need for playful creativity and scientific hard-mindedness. Nutrition Nov-Dec;19(11-12): Nutrition. 39 Steps. From Drug Discov Today Sep 1;10(17): References
If time allows
RTI International RTI International is a trade name of Research Triangle Institute. RTI Regional Comprehensive Metabolomics Resource Core (RTI RCMRC) Susan Sumner, PhD Director RTI RCMRC Discovery Sciences Proteomics and Metabolomics Programs RTI International
Contact Information for the RTI RCMRC Susan C.J. Sumner, PhD Director RTI RCMRC Senior Scientist nanoSafety RTI International Discovery Sciences 3040 Cornwallis Drive Research Triangle Park North Carolina (office) (cell) Jason P. Burgess, PhD Program Coordinator, RTI RCMRC Associate Director, Discovery Sciences RTI International 3040 Cornwallis Drive Research Triangle Park North Carolina (office)
RTI International MS and NMR Instruments at RTI and DHMRI RTIDHMRI Mass Spectrometers (38) LC-MS 136 GC-MS 43 GC x GC-TOF-MS 11 ICP-MS 61 MALDI ToF/ToF 21 NMR (6) 24
RTI International Some RTI Metabolomics Applications and Pilots Experience with adolescent and adult human subject research, animal model and cell based research, e.g., Apoptosis- cells Drug induced liver injury- animal models in utero exposure to chemicals and fetal imprinting- animal models Dietary exposure and imprinting- animal models NAFLD - pediatric obesity; microbiome Weight Loss- pediatric obesity Preterm delivery- human subjects Response to vaccine- human subjects Nicotine withdrawal- human subjects Colon cancer- human subjects
RTI International Pilot and Feasibility Studies The aim of the pilot and feasibility program is to foster collaborations and promote the use of metabolomics. Studies will be selected through an application process. – Application involves abstract, description of samples available (matrix type, volume, type and duration of storage, sample processing, freeze thaws, etc), description of phenotypes, and plan for subsequent grant/contract submissions for metabolomics analysis beyond initial pilot study. Applications may also include technology development. Applications must agree to deposit data in DRCC, coauthor publications, and submit joint grant/contract proposals. Deadlines being defined