In silico cis-analysis promoter analysis - Promoters and cis-elements - Searching for patterns - Searching redundant patterns.


1 In silico cis-analysis promoter analysis - Promoters and cis-elements - Searching for patterns - Searching redundant patterns

2 What is a promoter, and why care about it? [Figure; label: CDS]

3 A little about these cis-elements
- Transcription factors (TFs) often bind to them
- They are normally 5-10 bp long
- They are often located in the region from the TSS and upstream
- They often come in clusters, i.e. they must be arranged in some ‘syntax’ to be functional
- We assume they are shared by the promoters in a regulon
- They are not always 100% conserved
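Because the elements are short and not always 100% conserved, a pattern search usually has to tolerate mismatches and look at both strands. The following is a minimal sketch of such a scan, not the method used in the presentation; the function names `reverse_complement` and `find_sites` and the `max_mismatches` parameter are assumptions made for illustration.

```python
def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def find_sites(promoter, pattern, max_mismatches=0):
    """Scan both strands of `promoter` for `pattern`.

    Returns (start, strand, matched_window) tuples, where `start` is the
    0-based index on the given (plus-strand) sequence. A palindromic
    pattern is reported once per strand.
    """
    hits = []
    width = len(pattern)
    for strand, pat in (("+", pattern), ("-", reverse_complement(pattern))):
        for start in range(len(promoter) - width + 1):
            window = promoter[start:start + width]
            # Count the positions where window and pattern disagree.
            mismatches = sum(a != b for a, b in zip(window, pat))
            if mismatches <= max_mismatches:
                hits.append((start, strand, window))
    return hits


# Hypothetical promoter fragment, for illustration only.
print(find_sites("AAATTGACTCCC", "TTGACT"))
```

Raising `max_mismatches` finds less conserved copies at the cost of more chance hits, which is why the statistical tricks later in the talk are needed.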

4 Sounds hard to find? It is hard.

5 Some tricks make it possible
Regulon (cluster)
- Compare the promoters from a regulon to all other promoters, i.e. use a negative set (all other promoters in species X)
- This allows us to use hypergeometric statistics
Ranked list of genes
- Look at the distribution of sequences with a given pattern along a ranking, i.e. is the pattern overrepresented in promoters with low (or high) p-values in a microarray experiment?
- This allows us to use the Kolmogorov-Smirnov test
[Figure: Hypergeometric statistic. 10x and 5x; take out 6 balls; 1x and 5x; p=0.04]
[Figure: Kolmogorov-Smirnov, here a test for deviations from a uniform distribution of patterns along a sorted list of promoters. Axes: occurrence; promoters ranked by e.g. p-value]
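The two tests on this slide can be sketched in a few lines of standard-library Python. This is an illustrative sketch under stated assumptions, not the presenters' code: the function names, parameter names, and example numbers are hypothetical. The first function gives the hypergeometric upper tail (the chance of seeing at least that many pattern-containing promoters in a regulon drawn at random), and the second gives the Kolmogorov-Smirnov statistic D for the ranks of pattern-containing promoters against a uniform distribution. It returns only D, not a p-value.

```python
from math import comb


def hypergeom_tail(hits, regulon_size, total_hits, total_promoters):
    """P(X >= hits) when `regulon_size` promoters are drawn without
    replacement from `total_promoters`, of which `total_hits` contain
    the pattern (regulon plus negative set together)."""
    upper = min(total_hits, regulon_size)
    favourable = sum(
        comb(total_hits, i) * comb(total_promoters - total_hits, regulon_size - i)
        for i in range(hits, upper + 1)
    )
    return favourable / comb(total_promoters, regulon_size)


def ks_uniform_statistic(ranks, n_promoters):
    """One-sample KS statistic D for 1-based `ranks` of the promoters
    that contain the pattern, within a sorted list of `n_promoters`."""
    # Map each rank to the midpoint of its slot in the interval (0, 1).
    xs = sorted((r - 0.5) / n_promoters for r in ranks)
    m = len(xs)
    # Largest gap between the empirical CDF and the uniform CDF F(x) = x.
    return max(max((i + 1) / m - x, x - i / m) for i, x in enumerate(xs))


# Hypothetical numbers: 6 of 17 regulon promoters carry the pattern,
# versus 100 of 1000 promoters overall.
print(hypergeom_tail(6, 17, 100, 1000))
# Hypothetical ranks: the pattern sits in the top five of 100 promoters.
print(ks_uniform_statistic([1, 2, 3, 4, 5], 100))
```

A D close to 1 means the pattern is concentrated at one end of the ranking, while a D near 0 means it is spread evenly. If SciPy is available, `scipy.stats.kstest` also reports a p-value for D.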

6 A cis-element as it might be seen by a TF

7 So, the elements may not always be 100% conserved
- If only we had a weight matrix to describe our pattern, we could find less conserved patterns
- With a Gibbs sampler we can work backwards: we give it promoters and it builds a weight matrix

Weight matrix. Motif model (residue frequency x 100):
POS    A    C    G    T   Info
1     89    .    .    .    1.0
2      .    .   92    .    2.0
3      .    .    .   94    1.2
4      .   92    .    .    2.0
5     94    .    .    .    1.2
6     94    .    .    .    1.2
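To make the weight-matrix idea concrete, here is a minimal sketch of how a residue-frequency matrix and a per-position information score could be computed from a set of aligned sites, such as those a Gibbs sampler settles on. It is not the Gibbs sampler itself. The function names, the pseudocount, and the background frequencies are assumptions, and the information measure (relative entropy against a background, in bits) is one common choice that may differ from the one behind the Info column on the slide.

```python
from math import log2

ALPHABET = "ACGT"


def frequency_matrix(sites, pseudocount=1.0):
    """Build one {base: frequency} dict per position from equal-length
    aligned sites. The pseudocount is an assumption that avoids zeros."""
    width = len(sites[0])
    matrix = []
    for pos in range(width):
        counts = {base: pseudocount for base in ALPHABET}
        for site in sites:
            counts[site[pos]] += 1
        total = sum(counts.values())
        matrix.append({base: counts[base] / total for base in ALPHABET})
    return matrix


def information_content(column, background):
    """Relative entropy (bits) of one matrix column versus background."""
    return sum(p * log2(p / background[base])
               for base, p in column.items() if p > 0)


# Hypothetical aligned sites and a uniform background, for illustration.
sites = ["AGTCAA", "AGTCAA", "AGTCAT", "TGTCAA"]
uniform = {base: 0.25 for base in ALPHABET}
for pos, column in enumerate(frequency_matrix(sites), start=1):
    print(pos, round(information_content(column, uniform), 2))
```

With a uniform background, a fully conserved position scores 2 bits and a position with no preference scores 0, so summing over positions gives a single number for comparing motifs.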

8 [Figure: mRNAs overexpressed in the mpk4 mutant]

9 Gibbs sample evaluation: TTGACT
Monte Carlo: Gibbs sampling on 1000 randomly sampled sets of 17 Arabidopsis promoters, evaluated by information content.

10 Gibbs sample evaluation: GACTTTTC
Monte Carlo: Gibbs sampling on 1000 randomly sampled sets of 17 Arabidopsis promoters, evaluated by information content. Here for 8 bp patterns.
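The Monte Carlo evaluation on slides 9 and 10 can be sketched as follows: score the motif found in the real promoter set, then repeat the same search on many randomly drawn sets of the same size and ask how often chance does at least as well. This is a sketch under assumptions, not the presenters' pipeline. `score_fn` is a hypothetical stand-in for "run the Gibbs sampler on this set and return the information content of the best motif", and the add-one correction in the p-value is a choice made here.

```python
import random


def empirical_p_value(observed_score, all_promoters, score_fn,
                      set_size=17, n_trials=1000, seed=0):
    """Fraction of random promoter sets whose score reaches the observed one.

    `all_promoters` is a list of promoter sequences; `score_fn` takes a
    list of `set_size` promoters and returns a number (hypothetical).
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    null_scores = [score_fn(rng.sample(all_promoters, set_size))
                   for _ in range(n_trials)]
    n_as_good = sum(1 for score in null_scores if score >= observed_score)
    # Add one to numerator and denominator so the estimate is never zero.
    return (n_as_good + 1) / (n_trials + 1)
```

With 1000 random sets the smallest p-value this can report is 1/1001, so more trials are needed to resolve very strong motifs.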

11 Lebel et al. 1998 Arabidopsis PR1 promoter

12 Exams for C27614: 3rd and 7th of December
- You must attend both exams
- The exam will take place in aud. 51, building 208
- Each exam lasts 2 hours
- Both exams are open-book exams
- You can bring any book and a pocket calculator, but no computers

