Presentation is loading. Please wait.

Presentation is loading. Please wait.

Statistical model for count data Speaker : Tzu-Chun Lo Advisor : Yao-Ting Huang.

Similar presentations


Presentation on theme: "Statistical model for count data Speaker : Tzu-Chun Lo Advisor : Yao-Ting Huang."— Presentation transcript:

1 Statistical model for count data Speaker : Tzu-Chun Lo Advisor : Yao-Ting Huang

2 Outline Why use statistical model Target ▫Gene expression Binomial distribution ▫Poisson distribution Over dispersion Negative binomial ▫Chi-square approximation Conclusion

3 Statistics model A statistical model is a probability distribution constructed to enable inferences to be drawn or decisions made from data. Population sample Inference Make a decision : Hypothesis testing designer consumer We have to choose a statistics model for sample (mean, variance) We (mean, variance) size

4 Target Gene expression ▫We like to use statistical model to test an observed difference in read counts is significant. Look like a significant region How about this Can we sure ? Noise or not

5 Count data A type of data in which the observations can take only the non-negative integer values {0, 1, 2, 3,...}, and where these integers arise from counting rather than ranking. An individual piece of count data is often termed a count variable. Binomial Poisson Negative binomial All of them are this type

6 Binomial distribution

7 33 goals 110 shots in this season Success : 0.3 Fail : 0.7 What is the probability if he scored 6 goals in 10 shots

8 Binomial distribution 0 1 2 3 4 5 6 7 8 9 10 6

9 Poisson distribution

10

11 e = 2.718281828…

12 Poisson Games goals Goals of game01234567 Poisson0.51.62.5 1.81.10.60.2 Raw data12222011

13 The presence of greater variability (statistical dispersion) in a data set than would be expected based on a given simple statistical model. Overdispersion

14 Negative binomial

15

16 Parameter estimation

17 Approximate control limits Chi-square approximation

18 Example = 67.0

19

20

21 Conclusion Thanks for attention

22 Statistics model Suitable type ▫Which distribution should we use Parameters ▫Get some information from data Inference ▫What do we want to know ▫How could we make a decision  Hypothesis testing

23 Statistics model Suitable type ▫Binomial distribution Parameters ▫n = 10, p = 0.7 Inference ▫2 successes

24 Multinomial distribution The analog of the Bernoulli distribution is the categorical distribution, where each trial results in exactly one of some fixed finite number k of possible outcomes. http://en.wikipedia.org/wiki/Multinomial_distr ibutionhttp://en.wikipedia.org/wiki/Multinomial_distr ibution

25 Trinomial distribution

26 Count data A type of data in which the observations can take only the non-negative integer values {0, 1, 2, 3,...}, and where these integers arise from counting rather than ranking. We tend to use fixed fractions of genes. The probability that reads appeared in this region The number of read counts in this interval (Binomial distribution) (Poisson distribution)

27

28 Poisson example

29 Negative binomial


Download ppt "Statistical model for count data Speaker : Tzu-Chun Lo Advisor : Yao-Ting Huang."

Similar presentations


Ads by Google