Ta-18 ATGAAGACCTTCCTCATCTTTGCTCTCCTCGCCACTGCAGCGACAAGTGCCATTGCACAAATGGAGACTAGCCGCGTCCCTGGTTTGGAGAAACCATGGCAG 102
Ta G TG T
M K/R T F L I/V F A L L A T/I A A T S A I A Q M E T S R V P G L E K P W Q 34
CAGCAACCATTATCACCACAACAACAACCACCATGTTCACAGCAACAACAACCACTTCCGCAGCAACAACAACCAATTATTATACTGCAGCAACCACCATTT
Q Q P L S P Q Q Q P P C S Q Q Q Q P L P Q Q Q Q P I I I L Q Q P P F 68
TCGCAGCAACAACAACCAGTTCTACCGCAGCAACAACAACCAGTTATTATACTACAACAACCACCATTTTTGGAGCAACAACAACCAGTTCTACCACAACAA
S Q Q Q Q P V L P Q Q Q Q P V I I L Q Q P P F L E Q Q Q P V L P Q Q 102
CCATCATTTTCACAACAACAACAACAACAACAACAACAACCACCATTTTTGGAGCAACAACAACCAGTTCTACCACAACAACCATCATTTTCACAACAACAA
P S F S Q Q Q Q Q Q Q Q Q P P F L E Q Q Q P V L P Q Q P S F S Q Q Q 136
CAACAACAACAACAACCATTTCCGCAGCAGCAACAACCATCTTCACAACAACAACCTTTTCCACAACAACACCAACATCTTCTGCAACAACAAATCCCTGTT
Q Q Q Q Q P F P Q Q Q Q P S S Q Q Q P F P Q Q H Q H L L Q Q Q I P V 170
GTTCAACCATCCGTTTTGCAGCAGCTACACCCATGCAAGGTATTCCTCCAGCAGCAGTGCAGCCATGTGGCAATGTCGCAACGTCTTGCTAGGTCGCAAATG
V Q P S V L Q Q L H P C K V F L Q Q Q C S H V A M S Q R L A R S Q M 204
TGGCAGCAGAGCAGTTGCCATGTGATGCAGCAACAATGTTGCCAGCAGCTGCCGCAAATCCCCGAACAATCCCGCTATGAGGCAATCCGTACCATCGTCTAC
W Q Q S S C H V M Q Q Q C C Q Q L P Q I P E Q S R Y E A I R T I V Y 238
TCCATCATCCTGCAAGAACAACAACAGGGGTTTGTCCAACCTCAGCAGCAACAACCCCAACAGTTGGGCCAAGGTGTCTCCCAACCCCAACAGCAGTCGCAG
S I I L Q E Q Q Q G F V Q P Q Q Q Q P Q Q L G Q G V S Q P Q Q Q S Q 272
CAACAGCAGCTCGGACAGTGTTCTTTCCAACAACCTCAACAACAACAACTGGGTCAGCAGCCTCAACAACAACAGATACCACAGGGTACATTCTTGCAGCCA
Q Q Q L G Q C S F Q Q P Q Q Q Q L G Q Q P Q Q Q Q I P Q G T F L Q P 306
CACCAGATATCTCAACTTGAGGTGATGACTTCCATTGCACTCCGTACCCTGCCAACGATATGCGGTGTCAATGTGCCGTTGTACAGCTCCACCACTAGTGTG
H Q I S Q L E V M T S I A L R T L P T I C G V N V P L Y S S T T S V 340
CCATTCGGCATTGGAACCGGAGTTGGTGGCTACTGATAA
P F G I G T G V G G Y * * 351
a Signal peptide Mature protein

Ta-24 ATGAAGACATTCCTCGTCTTTGCCCTCCTCGCCATTGTGGCGACAAGTGTCATTGCGCAGATGGAGACTAGCTGCATCCCTGGTTTGGAGAGACCATGGCAG 102
Ta G---C-----T
M K/R T F L V F A L L A I V A T S V I A Q M E T S C I P G L E R P W Q 34
CAGCAACCATTACCACCACAACAGACATTATTTCCACAACAACAACCATTTCCACAACAACAACAACCACCATTTTCACAACAACAACCATCATTTTCGCAG
Q Q P L P P Q Q T L F P Q Q Q P F P Q Q Q Q P P F S Q Q Q P S F S Q 68
CAACAACCACCATTTTCGCAGCAACAACCAATTCTACCGCAGCAACCACCATTTTCACAGCAACAACAACCAGCTCTACCGCAACAATCACCATTTTTGCAG
Q Q P P F S Q Q Q P I L P Q Q P P F S Q Q Q Q P A L P Q Q S P F L Q 102
CAACAACAACTAGTTTTACCTCCACAACAACAACACCAACAGCTTCTGCAACAACAAATCCCTATTGTTCAACCATCCGTTTTGCAGCAGCTAAACCCATGC
Q Q Q L V L P P Q Q Q H Q Q L L Q Q Q I P I V Q P S V L Q Q L N P C 136
AAGGTATTCCTCCAGCAGAAGTGCAGCCCTGTAGCAATGCCACAACGTCTTGCTAGGTCGCAAATGTGGCAGCAGAGCAGTTGCCATGTGATGCAACAACAA
K V F L Q Q K C S P V A M P Q R L A R S Q M W Q Q S S C H V M Q Q Q 170
TGTTGCCAGCAGTTGCCGCAAATCCCCGAACAATCCCGCTATGAGGCAATCCGTGCCATCACCTACTCCATCATCCTGCAAGAACAACAACAGGGTTTTGTC
C C Q Q L P Q I P E Q S R Y E A I R A I T Y S I I L Q E Q Q Q G F V 204
CAACCTCAGCAGCAACAGCCCCAACAGTCGGGTCAAGGTGTCTCCCAATCCCAACAGCAGTCGCAGCAGCAGCTCGGACAATGTTCTTTCCAACAACCTCAA
A
Q P Q Q Q Q P Q Q S G Q G V S Q S Q Q Q S Q Q Q L G Q C S F Q Q P Q 238
CAGCAACTGGGTCAACAGCCTCAACAACAACAAGTACTACAGGGTACCTTTTTGCAACCACACCAGATAGCTCACCTTGAGGTGATGACTTCCATTGCACTC
Q Q L G Q Q P Q Q Q Q V L Q G T F L Q P H Q I A H L E V M T S I A L 272
CGTACCCTGCCAACGATGTGCAGCGTCAATGTGCCGTTGTACAGCTCCACCACTAGTGTGCCATTCAGCGTTGGCACCGGAGTTGGTGTCTACTGATAA
R T L P T M C S V N V P L Y S S T T S V P F S V G T G V G V Y * * 303
Signal peptide Mature protein b
Supplementary Fig. 2 Alignments of the deduced amino acid sequences of four groups of genes that encode the same mature proteins. The three, two and two sequences sharing the N-terminal sequence METSCIP- are shown in a, b and c, respectively. The two sequences sharing the N-terminal sequence METSRVP- are shown in d. Short bars indicate nucleotides that are identical among (or between) the compared sequences; * indicates stop codons.

Ta-17 ATGAAGACCTTCCTCATCTTTGCCCTCCTCGCCATTGTGGCGACAAGTGTCATTGCGCAGATGGAGACTAGCTGCATCCCTGGTTTGGAGAGACCATGGCAG 102
Ta C-G C
M K T F L/P I/V F A L L/P A I V A T S V I A Q M E T S C I P G L E R P W Q 34
CAGCAACCATTACCACCACAACAGACATTATTTCCACAACAACAACCATTTCCACAACAACAACAACCACCATTTTCACAACAACAACCATCATTTTCGCAG
Q Q P L P P Q Q T L F P Q Q Q P F P Q Q Q Q P P F S Q Q Q P S F S Q 68
CAACAACCACCATTTTCGCAGCAACAACCAATTCTACCGCAGCAACCACCATTTTCACAGCAACAACAACCAGCTCTACCGCAACAATCACCATTTTTGCAG
Q Q P P F S Q Q Q P I L P Q Q P P F S Q Q Q Q P A L P Q Q S P F L Q 102
CAACAACAACTAGTTTTACCTCCACAACAACAACACCAACAGCTTCTGCAGCAACAAATCCCTATTGTTCAACCATCCGTTTTGCAGCAGCTAAACCCATGC
A
Q Q Q L V L P P Q Q Q H Q Q L L Q Q Q I P I V Q P S V L Q Q L N P C 136
AAGGTATTCCTCCAGCAGAAGTGCAGCCCTGTAGCAATGCCACAACGTCTTGCTAGGTCGCAAATGTGGCAGCAGAGCAGTTGCCATGTGATGCAACAACAA
K V F L Q Q K C S P V A M P Q R L A R S Q M W Q Q S S C H V M Q Q Q 170
TGTTGCCAGCAGTTGCCGCAAATCCCCGAACAATCCCGCTATGAGGCAATCCGTGCCATCACCTACTCCATCATCCTGCAAGAACAACAACAGGGTTTTGTC
C C Q Q L P Q I P E Q S R Y E A I R A I T Y S I I L Q E Q Q Q G F V 204
CAACCTCAGCAGCAACAACCCCAACAGTCGGGTCAAGGTGTCTCCCAATCCCAACAGCAGTCGCAGCAGCAGCTCGGACAATGTTCTTTCCAACAACCTCAA
Q P Q Q Q Q P Q Q S G Q G V S Q S Q Q Q S Q Q Q L G Q C S F Q Q P Q 238
CAGCAACTGGGTCAACAGCCTCAACAACAACAAGTACTACAGGGTACCTTTTTGCAACCACACCAGATAGCTCACCTTGAGGTGATGACTTCCATTGCACTC
Q Q L G Q Q P Q Q Q Q V L Q G T F L Q P H Q I A H L E V M T S I A L 272
CGTACCCTGCCAACGATGTGCAGCGTCAATGTGCCGTTGTACAGCTCCACCACTAGTGTGCCATTCAGCGTAGGCACCGGAGTTGGTGCCTAA
T
R T L P T M C S V N V P L Y S S T T S V P F S V G T G V G A * 303
Signal peptide Mature protein c

Ta-4 ATGAAGACCTTCCCCGTCTTTGCCCTCCTCGCCATTGTGGCGACAAGTGTCATTGCGCAGATGGAGACTAGCTGCATCCCTGGTTTGGAGAGACCATGGCAG 102
Ta T-A
Ta A----TT
M K T F L/P I/V F A L L A I V A T S V I A Q M E T S C I P G L E R P W Q 34
CAGCAACCATTACCACCACAACAGACATTATTTCCACAACAACAACCATTTCCACAACAACAACAACCACCATTTTCACAACAACAACCATCATTTTCGCAG
Q Q P L P P Q Q T L F P Q Q Q P F P Q Q Q Q P P F S Q Q Q P S F S Q 68
CAACAACCACCATTTTCGCAGCAACAACCAATTCTACCGCAGCAACCACCATTTTCACAGCAACAACAACCAGCTCTACCGCAACAATCACCATTTTTGCAG
Q Q P P F S Q Q Q P I L P Q Q P P F S Q Q Q Q P A L P Q Q S P F L Q 102
CAACAACAACTAGTTTTACCTCCACAACAACAACACCAACAGCTTCTGCAACAACAAATCCCTATTGTTCAACCATCCGTTCTGCAGCAGCTAAACCCATGC
T
Q Q Q L V L P P Q Q Q H Q Q L L Q Q Q I P I V Q P S V L Q Q L N P C 136
AAGGTATTCCTCCAGCAGAAGTGCAGCCCTGTAGCAATGCCACAACGTCTTGCTAGGTCGCAAATGTGGCAGCAGAGCAGTTGCCATGTGATGCAACAACAA
K V F L Q Q K C S P V A M P Q R L A R S Q M W Q Q S S C H V M Q Q Q 170
TGTTGCCAGCAGTTGCCGCAAATCCCCGAACAATCCCGCTATGAGGCAATCCGTGCCATCACCTACTCCATCATCCTGCAAGAACAACAACAGGGTTTTGTC
C C Q Q L P Q I P E Q S R Y E A I R A I T Y S I I L Q E Q Q Q G F V 204
CAACCTCAGCAGCAACAACCCCAACAGTCGGGTCAAGGTGTCTCCCAATCCCAACAGCAGTCGCAGCAGCAGCTCGGACAATGTTCTTTCCAACAACCTCAA
Q P Q Q Q Q P Q Q S G Q G V S Q S Q Q Q S Q Q Q L G Q C S F Q Q P Q 238
CAGCAACTGGGTCAACAGCCTCAACAACAACAAGTACTACAGGGTACCTTTTTGCAACCACACCAGATAGCTCACCTTGAGGTGATGACTTCCATTGCACTC
Q Q L G Q Q P Q Q Q Q V L Q G T F L Q P H Q I A H L E V M T S I A L 272
CGTACCCTGCCAACGATGTGCAGCGTCAATGTGCCGTTGTACAGCTCCACCACTAGTGTGCCATTCAGCGTTGGCACCGGAGTTGCTGCCTACTGATAA
R T L P T M C S V N V P L Y S S T T S V P F S V G T G V A A Y * * 305
Signal peptide Mature protein d
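
The amino acid rows in each panel are deduced from the nucleotide rows above them. As a minimal sketch of that step, assuming the standard genetic code (the figure does not state which translation table or software was used), the first 102 nucleotides of Ta-18 can be translated as follows. The names `CODON_TABLE`, `translate` and `ta18_first_row` are illustrative and not taken from the source.

```python
from itertools import product

# Standard genetic code, listed in the conventional T, C, A, G base order.
# Assumption: the figure's translations use this standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # codons starting with T
    "LLLLPPPPHHQQRRRR"  # codons starting with C
    "IIIMTTTTNNKKSSRR"  # codons starting with A
    "VVVVAAAADDEEGGGG"  # codons starting with G
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(nucleotides: str) -> str:
    """Translate an in-frame DNA string codon by codon; '*' marks a stop codon."""
    seq = nucleotides.upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First nucleotide row of Ta-18, copied from the figure (positions 1 to 102).
ta18_first_row = (
    "ATGAAGACCTTCCTCATCTTTGCTCTCCTCGCCACTGCAGCGACAAGTGCCATTGCACAAATGGAGACTAGC"
    "CGCGTCCCTGGTTTGGAGAAACCATGGCAG"
)

# Print the residues space-separated, as in the figure's amino acid rows.
print(" ".join(translate(ta18_first_row)))
```

Positions written as K/R, I/V or T/I in the figure are residues that differ between the aligned sequences; this sketch translates only the single sequence it is given, so it reports the Ta-18 residue at each of those positions.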