1 Data Analysis © 2005 Germaine G Cornelissen-Guillaume. Copying and distribution of this document is permitted in any medium, provided this notice is.

Slides:



Advertisements
Similar presentations
Modeling of Data. Basic Bayes theorem Bayes theorem relates the conditional probabilities of two events A, and B: A might be a hypothesis and B might.
Advertisements

11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Managerial Economics in a Global Economy
Hypothesis Testing Steps in Hypothesis Testing:
CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Objectives (BPS chapter 24)
Chapter 8 Estimation: Additional Topics
EPIDEMIOLOGY AND BIOSTATISTICS DEPT Esimating Population Value with Hypothesis Testing.
Point estimation, interval estimation
Chapter 6 Introduction to Sampling Distributions
Curve-Fitting Regression
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 6 Introduction to Sampling Distributions.
Analysis of Differential Expression T-test ANOVA Non-parametric methods Correlation Regression.
Development of Empirical Models From Process Data
Transforms What does the word transform mean?. Transforms What does the word transform mean? –Changing something into another thing.
1 Simple Linear Regression Chapter Introduction In this chapter we examine the relationship among interval variables via a mathematical equation.
Inferences About Process Quality
8-1 Introduction In the previous chapter we illustrated how a parameter can be estimated from sample data. However, it is important to understand how.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 7 Statistical Intervals Based on a Single Sample.
INFERENTIAL STATISTICS. By using the sample statistics, researchers are usually interested in making inferences about the population. INFERENTIAL STATISICS.
Correlation and Regression Analysis
1 Simple Linear Regression 1. review of least squares procedure 2. inference for least squares lines.
Objectives of Multiple Regression
Inference for regression - Simple linear regression
5-1 Introduction 5-2 Inference on the Means of Two Populations, Variances Known Assumptions.
AP STATISTICS LESSON 11 – 2 (DAY 1) Comparing Two Means.
BPS - 3rd Ed. Chapter 211 Inference for Regression.
1 CSI5388: Functional Elements of Statistics for Machine Learning Part I.
ESTIMATION. STATISTICAL INFERENCE It is the procedure where inference about a population is made on the basis of the results obtained from a sample drawn.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
PROBABILITY (6MTCOAE205) Chapter 6 Estimation. Confidence Intervals Contents of this chapter: Confidence Intervals for the Population Mean, μ when Population.
Lecture 12 Statistical Inference (Estimation) Point and Interval estimation By Aziza Munir.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 15 Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple.
Physics 114: Exam 2 Review Lectures 11-16
Statistics for clinicians Biostatistics course by Kevin E. Kip, Ph.D., FAHA Professor and Executive Director, Research Center University of South Florida,
Curve-Fitting Regression
Business Statistics for Managerial Decision Farideh Dehkordi-Vakil.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
AMBULATORY BLOOD PRESSURE MONITORING HEALTH WATCH IN BRNO, CZECH REPUBLIC Homolka P., Siegelová J., Cornélissen G.*, Fišer B., Dušek J., Vank P., Mašek.
Chapter 5 Parameter estimation. What is sample inference? Distinguish between managerial & financial accounting. Understand how managers can use accounting.
© Copyright McGraw-Hill 2000
MARKETING RESEARCH CHAPTER 18 :Correlation and Regression.
CHAPTER SEVEN ESTIMATION. 7.1 A Point Estimate: A point estimate of some population parameter is a single value of a statistic (parameter space). For.
Simple linear regression Tron Anders Moger
Statistics Sampling Intervals for a Single Sample Contents, figures, and exercises come from the textbook: Applied Statistics and Probability for Engineers,
The Phoenix Project Usage – Germaine Cornélissen © 2007 Germaine G Cornelissen-Guillaume. Copying and distribution of this document is permitted in any.
Correlation & Regression Analysis
Chapter 8: Simple Linear Regression Yang Zhenlin.
IMPORTANCE OF STATISTICS MR.CHITHRAVEL.V ASST.PROFESSOR ACN.
Chapter 8. Process and Measurement System Capability Analysis
Statistics Sampling Distributions and Point Estimation of Parameters Contents, figures, and exercises come from the textbook: Applied Statistics and Probability.
ESTIMATION METHODS We know how to calculate confidence intervals for estimates of  and  2 Now, we need procedures to calculate  and  2, themselves.
Chapter 8 Estimation ©. Estimator and Estimate estimator estimate An estimator of a population parameter is a random variable that depends on the sample.
HYPOTHESIS TESTING FOR DIFFERENCES BETWEEN MEANS AND BETWEEN PROPORTIONS.
6-1 Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Confidence Intervals Dr. Amjad El-Shanti MD, PMH,Dr PH University of Palestine 2016.
BPS - 5th Ed. Chapter 231 Inference for Regression.
CORRELATION-REGULATION ANALYSIS Томский политехнический университет.
Statistics 350 Lecture 2. Today Last Day: Section Today: Section 1.6 Homework #1: Chapter 1 Problems (page 33-38): 2, 5, 6, 7, 22, 26, 33, 34,
The simple linear regression model and parameter estimation
Chapter 7. Classification and Prediction
ESTIMATION.
Two-Sample Hypothesis Testing
CONCEPTS OF ESTIMATION
Econ 3790: Business and Economics Statistics
The Simple Linear Regression Model: Specification and Estimation
Parametric Methods Berlin Chen, 2005 References:
ESTIMATION.
MGS 3100 Business Analysis Regression Feb 18, 2016
Presentation transcript:

1 Data Analysis © 2005 Germaine G Cornelissen-Guillaume. Copying and distribution of this document is permitted in any medium, provided this notice is preserved.

2 Germaine Cornélissen University of Minnesota Phoenix presentation July 10, 2005

3 Procedure Data are collected at intervals (e.g., 30 min) around the clock for at least 24 hours, preferable for 7 days or longer. Data are analyzed by cosinor (e.g., least squares fit of cosine curves with periods of 24 and 12 hours) to derive estimates of MESOR, amplitude, and acrophase. Data are analyzed non-parametrically.

4 Ambulatory monitors can be obtained with a great reduction in cost with chronobiologic analyses from the University of Minnesota with results interpreted in the light of gender- and age-adjusted archived norms in clinical health. Opportunity

5 Procedure Data are collected at intervals (e.g., 30 min) around the clock for at least 24 hours, preferable for 7 days or longer. Data are retrieved from the monitor by means of an interface (RS232 port). Data are analyzed non-parametrically.

6

7 Procedure Data are collected at intervals (e.g., 30 min) around the clock for at least 24 hours, preferable for 7 days or longer. Data are retrieved from the monitor by means of an interface (RS232 port). Data are analyzed by cosinor (e.g., least squares fit of cosine curves with periods of 24 and 12 hours) to derive estimates of MESOR, amplitude, and acrophase.

8 Let us consider first the case of a single component model. SINGLE COSINOR METHOD

9 Y(t) = M + Acos(2  t/  +  ) + e(t) Y i are data collected at times t i (i=1,..., N) M is the MESOR (midline estimating statistic of rhythm) 2A is the double amplitude (a measure of the extent of predictable change within a cycle)  is the acrophase (a measure of the timing of overall high values)  is the period (duration of one cycle) e i is the error term, assumed to be independent, normally distributed with mean zero and unknown constant variance  2.

10 When the period is known, the model can be rewritten as Y(t) = M +  X +  Z + e(t) where  = Acos  and  = -Asin  and X = cos(2  t/  ) and Z = sin(2  t/  ) The model is linear in its parameters M,  and .

11 The principle of the least squares method is to find values M,  and  such that the residual sum of squares  i e 2 is minimal, or equivalently its first-order derivative is equal to zero. The estimation procedure consists of: 1. Differentiating  i e 2 with respect to each parameter 2. Equating these derivatives to zero to obtain the “normal equations” 3. Solving the system of equations thus obtained for the parameters M,  and .

12

13

14

15 g

16

17 Acrophases are expressed in negative degrees, with 360°≡24 hours and 0° set to local midnight.

18

19

20

21

22

23

24 Acrophases are expressed in negative degrees, with 360°≡24 hours and 0° set to local midnight.

25

26

27

28

29

30 PR = MSS/(MSS+RSS) = R 2.

31

32

33 Results from the parametric approach are reported in the sphygmochron where they are compared with 90% prediction limits from healthy peers matched by gender and age.

34 The double amplitude of systolic blood pressure is above the upper 95% prediction limit: systolic CHAT is diagnosed.

35 MESOR 2A

36

37 Procedure Data are collected at intervals (e.g., 30 min) around the clock for at least 24 hours, preferable for 7 days or longer. Data are retrieved from the monitor by means of an interface (RS232 port). Data are analyzed by cosinor (e.g., least squares fit of cosine curves with periods of 24 and 12 hours) to derive estimates of MESOR, amplitude, and acrophase. Data are analyzed non-parametrically.

38

39

40

41 Time-Specified Reference Limits (Chronodesms) Derived as 90% prediction limits for given timepoint (or short – e.g., 30-min interval) using data from healthy peers matched by gender and age. Let Y 1, …, Y n be a sample randomly drawn from a normal population with mean  and variance  2. A tolerance interval containing a proportion p of the population is defined by  ± z  /2  where z  /2 is the upper (1-  /2) fractile of the standard normal curve (e.g., 1.96 for  = 0.05). This interval covers a fraction (1-  ) of the population.

42 Time-Specified Reference Limits (Chronodesms) In practice,  and  are not known and need to be estimated from the sample:  = Y = 1/n.  j Y j  2 = s 2 = 1/(n-1).  j (Y j – Y) 2 Using these estimates to derive a tolerance interval will generally not cover an exact fraction (1-  ) of the population (because the mean and variance are not known), but it will do so on the average. Y ± t  /2; (n-1) s [(n+1)/n] 1/2 (t = Student distribution).

43 Time-Specified Reference Limits (Chronodesms) This interval may also be interpreted as having probability (1-  ) of including any single additional observation from the same population. In this sense, the interval is a prediction interval or a tolerance interval of  -expectation.

44 Nonparametric endpoints The data are stacked over a single 24-hour span (all data collected at or around 00:00 are averaged, as are all data collected at or around 00:30, …, and at or around 23:30). Practically, averages are computed over consecutive 30-min intervals after the data from different days have been stacked.

45

46 Nonparametric endpoints A scan from 00:00 to 24:00 is conducted to determine whether the profile at that time is within the prediction interval or whether it is above the upper 95% prediction limit. If so, using linear interpolation, the amount of time the profile was excessive is estimated and summed over the 24-hour span. The total is divided by 24 hours to yield the percentage time elevation (PTE).

47 For each (e.g., 30-min) interval, there are 4 different possibilities.

48

49

50

51

52 Nonparametric endpoints The amount of excess (BP: hyperbaric index or HBI; HR: tachycardic index or TCI) is calculated by numerical integration in a way similar to the calculation of the PTE. Again, there are 4 possibilities depending on whether the profile is completely below or above the limit or whether it rises above or drops below the limit.

53

54

55

56

57 Nonparametric endpoints The timing of excess is calculated as the “center of gravity” of the area of excess: t E = arctan(s/c) where s =  j hbi j sin φ j /  j hbi j c =  j hbi j cos φ j /  j hbi j φ j = -  t j where hbi j are the partial HBIs and t j are the (e.g., 30-min) intervals’ midpoints.

58 Nonparametric endpoints Because the excess does not necessarily occur during a single daily episode, the ‘center of gravity’ may not invariably correspond to a reliable estimate of the timing of excess. In order to check on this possibility, fractionated HBI and TCI are also computed over consecutive intervals of 1 or 3 hours. Information about the timing of excess is useful for treatment scheduling.

59

60

61

62

63 Comments Reference values are now computed in terms of clock hours. It would be preferable to compute them in relation to the usual rest-activity schedule of each individual. Hence the merit of a diary. Reference values are now computed for a finite number of groups specified by gender and age with a special set of reference values for pregnant women. It would be preferable to determine overall trends with age for women, men and pregnant women in terms of the MESOR and circadian amplitude and acrophase to have a continuum of values. When repeated profiles are obtained for the same person, additional methods are available such as parameter tests and cumulative sums (CUSUMs) that can assess any changes in rhythm characteristics as a function of time (e.g., in relation to a given intervention such as medication).

64

65 EXAMPLE: SBP in healthy women

66 Comments Reference values are now computed in terms of clock hours. It would be preferable to compute them in relation to the usual rest-activity schedule of each individual. Hence the merit of a diary. Reference values are now computed for a finite number of groups specified by gender and age with a special set of reference values for pregnant women. It would be preferable to determine overall trends with age for women, men and pregnant women in terms of the MESOR and circadian amplitude and acrophase to have a continuum of values. When repeated profiles are obtained for the same person, additional methods are available such as parameter tests and cumulative sums (CUSUMs) that can assess any changes in rhythm characteristics as a function of time (e.g., in relation to a given intervention such as medication).

67

68

69

70

71

72