Nat. Rev. Nephrol. doi: /nrneph

Slides:



Advertisements
Similar presentations
DESCRIBING DISTRIBUTION NUMERICALLY
Advertisements

Measures of Variation Sample range Sample variance Sample standard deviation Sample interquartile range.
Measures of Dispersion
Visualizing and Exploring Data Summary statistics for data (mean, median, mode, quartile, variance, skewnes) Distribution of values for single variables.
Measures of Dispersion or Measures of Variability
Jan Shapes of distributions… “Statistics” for one quantitative variable… Mean and median Percentiles Standard deviations Transforming data… Rescale:
Measures of Center and Variation
Slides by JOHN LOUCKS St. Edward’s University.
Describing Distributions Numerically
Statistics: Use Graphs to Show Data Box Plots.
CHAPTER 39 Cumulative Frequency. Cumulative Frequency Tables The cumulative frequency is the running total of the frequency up to the end of each class.
© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 12 Describing Data.
Programming in R Describing Univariate and Multivariate data.
Quantitative Skills: Data Analysis and Graphing.
Enter these data into your calculator!!!
Exploratory Data Analysis. Computing Science, University of Aberdeen2 Introduction Applying data mining (InfoVis as well) techniques requires gaining.
Objectives 1.2 Describing distributions with numbers
ITEC6310 Research Methods in Information Technology Instructor: Prof. Z. Yang Course Website: c6310.htm Office:
Methods for Describing Sets of Data
Quantitative Skills 1: Graphing
Review Measures of central tendency
Percentiles and Box – and – Whisker Plots Measures of central tendency show us the spread of data. Mean and standard deviation are useful with every day.
Dr. Serhat Eren 1 CHAPTER 6 NUMERICAL DESCRIPTORS OF DATA.
Statistics Chapter 1: Exploring Data. 1.1 Displaying Distributions with Graphs Individuals Objects that are described by a set of data Variables Any characteristic.
Chapter 8 Making Sense of Data in Six Sigma and Lean
Numerical Measures of Variability
STATISTICS. What is the difference between descriptive and inferential statistics? Descriptive Statistics: Describe data Help us organize bits of data.
UNIT #1 CHAPTERS BY JEREMY GREEN, ADAM PAQUETTEY, AND MATT STAUB.
Vocabulary to know: *statistics *data *outlier *mean *median *mode * range.
Numerical descriptions of distributions
Statistics with TI-Nspire™ Technology Module E Lesson 1: Elementary concepts.
Statistics and Data Analysis
COMPUTATIONAL FORMULAS AND IQR’S. Compare the following heights in inches: BoysGirls
© 2012 W.H. Freeman and Company Lecture 2 – Aug 29.
Chapter 14 Statistics and Data Analysis. Data Analysis Chart Types Frequency Distribution.
(Unit 6) Formulas and Definitions:. Association. A connection between data values.
Exploratory Data Analysis
Methods for Describing Sets of Data
Figure 2-7 (p. 47) A bar graph showing the distribution of personality types in a sample of college students. Because personality type is a discrete variable.
Measures of Position – Quartiles and Percentiles
Exploring Data: Summary Statistics and Visualizations
Quantitative Data Continued
BAE 6520 Applied Environmental Statistics
Notes 13.2 Measures of Center & Spread
EXPLORATORY DATA ANALYSIS and DESCRIPTIVE STATISTICS
STAT 4030 – Programming in R STATISTICS MODULE: Basic Data Analysis
BAE 5333 Applied Water Resources Statistics
Numerical descriptions of distributions
Chapter 5 : Describing Distributions Numerically I
Calculating Median and Quartiles
Shoe Sizes.
Description of Data (Summary and Variability measures)
Chapter 3 Describing Data Using Numerical Measures
Bead Array–Based microRNA Expression Profiling of Peripheral Blood and the Impact of Different RNA Isolation Approaches  Andrea Gaarz, Svenja Debey-Pascher,
HMI 7530– Programming in R STATISTICS MODULE: Basic Data Analysis
Validation and Comparison of Pharmacogenetics-Based Warfarin Dosing Algorithms for Application of Pharmacogenetic Testing  Nitin Roper, Barry Storer,
Noninvasive Molecular Monitoring in Multiple Myeloma Patients Using Cell-Free Tumor DNA  Giulia Biancon, Silvia Gimondi, Antonio Vendramin, Cristiana.
Displaying Distributions with Graphs
Box-and-Whisker Plots
Nat. Rev. Nephrol. doi: /nrneph
Triplex High-Resolution Melting Assay for the Simultaneous Assessment of IFNL3 rs , ABCB11 rs , and RNF7 rs Genotypes in Chronic.
Copyright © 2016 Elsevier Inc. All rights reserved.
Box-and-Whisker Plots
Box-and-Whisker Plots
MCC6.SP.5c, MCC9-12.S.ID.1, MCC9-12.S.1D.2 and MCC9-12.S.ID.3
Jon Wakefield  The American Journal of Human Genetics 
A and B, ROC curves of the proteomics panel diagnosis.
Ch 1 Review Questions.
Find the Mean of the following numbers.
Lesson Plan Day 1 Lesson Plan Day 2 Lesson Plan Day 3
Presentation transcript:

Nat. Rev. Nephrol. doi:10.1038/nrneph.2017.30 Figure 2 Data presentation and common biostatistical approaches in metabolomics Figure 2 | Data presentation and common biostatistical approaches in metabolomics. a | A boxplot for a chosen metabolite. Each box corresponds to samples collected on a different day of the experiment. Boxplots describe the data by quartiles (Q1 and Q3) for which the point of origin is the median of the data. Extreme outliers are defined by values in the range of Q1 – 1.5*(Q3 – Q1) and Q3 + 1.5*(Q3 – Q1). If data points below the limit of detection can be defined as extreme outliers they should be discarded. Each box contains 75% of the values for the samples collected on the relevant day. The median (Q2) is depicted as a dark blue bar and outliers are shown as bright blue squares. The whiskers show one standard deviation above and below the mean. The P value is given for the difference of all data in the boxes (Kruskal–Wallis U test). b | Data distribution. The frequency of the data distribution might be normal or other (for example skewed or bimodal). The upper panels show example histograms, whereas the lower panels show the corresponding quartile–quartile (Q-Q) plots. c | Principal component analysis (PCA) can be applied to large datasets, which can include metabolite concentrations as well as further phenotyping data. The PCA graph shows differences between samples based on multiple metabolites ranked according to contribution strength into eigenvalue/eigenvector pairs. The eigenvectors are the solutions for which the maximum variance can be calculated by the PCA algorithm and the eigenvalues are the spread of the variance (the length of the eigenvector). The eigenvalue/eigenvector pair is calculated for each variable in the data. The highest ranked metabolite combinations are in the PC1 axis followed by PC2, PC3 etc. Here data from the same experiment as part a are shown for all metabolites after calculation of eigenvectors and eigenvalues. Each dot represents a different sample. d | Partial least squares regression discriminant analysis (PLS-DA) provides information on differences between samples, which are calculated for all metabolites. The graph shows a comparison of nuclear magnetic resonance data from urine samples collected in the morning and afternoon. t[1] and t[2] represent the linear regression model and the response vector, respectively. Each dot represents a different sample. e | Orthogonal partial last squares regression is used to reduce noise in PLS data. This graph shows the same data as in part d. The first principal component (t[1]P) is plotted against the first orthogonal component (t[2]O). f | A receiver operating characteristic curve is used to analyse the performance of a diagnostic assay by plotting the true positive discovery rate (sensitivity) versus the false positive rate (1-specificity). The best theoretical performance (specificity 1 and selectivity 1) is shown by the dashed line. Different biostatistics models (including several combinations of metabolites) are represented by the solid lines. The area under the curve can be calculated for each model and is given as a percentage. Permission to reproduce parts d and e obtained from American Chemical Society © Wagner, S. et al. Anal. Chem. 79, 2918–2926 (2007). Hocher, B. & Adamski, J. (2017) Metabolomics for clinical use and research in chronic kidney disease Nat. Rev. Nephrol. doi:10.1038/nrneph.2017.30