Some principles and examples related to evaluation of sequence similarities with help of length equivalent measures (ELEMS) Jaroslav Kubrycht and Karel.

2 Some principles and examples related to evaluation of sequence similarities with help of length equivalent measures (ELEMS). Jaroslav Kubrycht and Karel Sigler. Prague, 30 November 2006

3 Examples and kinds of column identities derived by ELEMS

4 [Figure: example sequences LIATR, ISARV, LWIRCC, LWSITV, ISAIRC, LSATR, LIWIC, LISRC, IWATV, LWSICR.] Minimum aa numbers limiting ELEMS(RDA)-derived levels: CCBE aa, high occurrence aa, template motif aa, questionable aa. Cysteine exhibits the same numbers for both template motif and questionable aa (see our pdf file).

5 Examples of amino acid similarities and their contradictory dissimilarities in sequence block columns

6 Questionable amino acids A and V, which are convertible via a single triplet mutation and present in the same column (cooperating pairs), achieve a mixed high occurrence level. [Figure: example columns containing A, V and G.] On the other hand, collocated template amino acids A and G without a mutation relationship form contradictory pairs, which in fact diminish the overall extent of aa similarities in their block.

7 Length equivalents as products of probabilistic compression

8 The probability of the amino acids present in the left column can be represented by a complete column similarity of non-integer height, i.e. by the vertical length equivalent of the column (LE_A). [Figure: columns of A residues.] In the given case, ELEMS(RDA) determines a high occurrence level of aa similarity, with LE_A = 3.095.

9 In addition to LE_A, we also define the mean compressed height of whole sequence blocks, i.e. LE_TM. Both height-related (vertical) length equivalents are restricted by the same numerical limits in ELEMS, which distinguish different kinds of similarities.
restricted aa | description of lower limit | interval
questionable (gray zone) | random aa/chain | 1.0-1.5
template motif | fuzzy-related point among random aa/chain and double sequence similarity | 1.5-3.0
cohesive | three compressed aa/chains represent minimum sticking stage of cohesion | 3.0-(SL+2)
CBCE | for details see our pdf file | > SL+2
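The interval limits in the table above can be read as a simple lookup. The following Python sketch is illustrative only: the function name `classify_length_equivalent`, the choice to treat each lower limit as inclusive, and the SL value used in the example are assumptions that are not stated in the slides.

```python
def classify_length_equivalent(le: float, sl: float) -> str:
    """Map a vertical length equivalent (LE_A or LE_TM) to its ELEMS level.

    The interval limits are taken from the table on this slide.
    Treating each lower limit as inclusive is an assumption of this sketch.
    `sl` is the specific limit (SL) used in the upper bound SL + 2.
    """
    if le < 1.0:
        raise ValueError("values below 1.0 are not covered by the table")
    if le < 1.5:
        return "questionable (gray zone)"
    if le < 3.0:
        return "template motif"
    if le <= sl + 2:
        return "cohesive"
    return "CBCE"


# Hypothetical SL value, chosen only to make the example concrete.
example_sl = 5.0
print(classify_length_equivalent(2.0, example_sl))  # template motif
```

With these assumptions, a length equivalent of 2 (two chains) falls into the template motif interval 1.5-3.0.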

10 A similar compression principle is also used to process a gapped sequence block. We thus obtain a compressed block with columns containing only identical/similar aa and exhibiting a non-integer height given by LE_TM. However, the first floor of the given oblong block belongs to a random chain (in light orange) of the template motif. Only the upper area determines the HLE value. This means that: HLE = (LE_TM - 1) × n. [Figure labels: HLE, random chain.]
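The HLE formula on this slide can be evaluated directly. In the minimal sketch below, the function name `hle_from_block` and the example numbers are hypothetical, and `n` is assumed to be the number of columns in the compressed block, which the slide does not state explicitly.

```python
def hle_from_block(le_tm: float, n: int) -> float:
    """Compute HLE = (LE_TM - 1) * n, as given on the slide.

    le_tm: mean compressed height of the sequence block (LE_TM).
    n:     assumed here to be the number of columns in the compressed block.
    The subtraction of 1 removes the "first floor", i.e. the random chain.
    """
    if le_tm < 1.0:
        raise ValueError("LE_TM below 1.0 leaves no area above the random chain")
    if n < 0:
        raise ValueError("n must be non-negative")
    return (le_tm - 1.0) * n


# Hypothetical block: mean compressed height 2.5 over 10 columns.
print(hle_from_block(2.5, 10))  # 15.0
```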

11 Mild modification in case of double sequence similarity
Double sequence similarity uses only a single value of LE_A (LE_A = 2), which follows from the presence of only two chains in the corresponding sequence block. Since this similarity has no alternative chain, the corresponding alignment is accompanied by an increased frequency of losses of column similarities in comparison with multiple sequence alignments. This, together with LE_A values higher than necessary, led us to avoid restricting the mean length equivalent (LE_TM) value in double sequence similarity, while still keeping the HLE evaluation. Nevertheless, some agreement between BLAST and ELEMS is demonstrated in WP3.2.2.

12 Alternatively, we can represent HLE as a single chain of non-integer length HLE. This raises the question of the minimal length of a chain exhibiting a mean aa probability (or score) identical with that of the template motif related to HLE. The corresponding minimum value of non-integer length (SL, i.e. specific limit) can be determined using several statistical procedures. [Figure labels: specific limit (SL); HLE chain of sufficient length, i.e. HLE > SL.]

13 RBS as unifying value

14 The ratio of HLE to SL is independent of any probability differences. Moreover, this ratio provides a simple and illustrative insight into the difference from the minimum significant value. Consequently, we suppose that this value may represent an interesting density-related parameter, which may complement the bit score evaluation. The given ratio was named relative block similarity (RBS). RBS is thus determined by the formula: RBS = HLE/SL
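As a small illustration of the ratio of HLE to SL described on this slide, the sketch below computes RBS and compares it with 1, which corresponds to the condition HLE > SL from slide 12. The function name and the numeric inputs are hypothetical and are not taken from the presentation.

```python
def relative_block_similarity(hle: float, sl: float) -> float:
    """Return RBS, the ratio of HLE to the specific limit SL."""
    if sl <= 0:
        raise ValueError("SL must be positive")
    return hle / sl


# Hypothetical values: HLE = 15.0 and SL = 6.0.
rbs = relative_block_similarity(15.0, 6.0)
print(rbs)        # 2.5
print(rbs > 1.0)  # True, i.e. HLE exceeds SL in this example
```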

15 Thank you for visiting our web page. If you have any questions, our e-mail addresses are: jkub@post.cz and sigler@biomed.cas.cz. You are invited.

