Presentation is loading. Please wait.

Presentation is loading. Please wait.

Mutations and Epimutations A story of two cultivars and their children. Matteo Pellegrini.

Similar presentations


Presentation on theme: "Mutations and Epimutations A story of two cultivars and their children. Matteo Pellegrini."— Presentation transcript:

1 Mutations and Epimutations A story of two cultivars and their children. Matteo Pellegrini

2 Nipponbare and 93-11 Nipponbare: – Oryza sativa japonica Primarily Japan, China, Indonesia Agronomic differences: Days to heading 93-11 – Oryza sativa indica India, Bangladesh, Nepal, China Submerged growth Agronomic differences: Seed fertility Long grain Taller (83 cm)

3 Why Study Crosses? Crosses of Indica and Japonica are often sterile Show hybrid vigor in agronomic traits

4 Overview Identify SNPs between ecotypes. – SNP generation Identify epiMutations between ecotypes. – Identify methyl-inheritance Identify allele-specific expression Identify RNA editing P F1 NPB9311 2 rice ecotypes: Nipponbare and 93-11 Generated BS-seq data for NPB, 93-11, and 2 reciprocal crosses

5 Detecting Cytosine Methylation A, C unmethylated, C methylated, G, T ? … m mm … … ACCCGTACCCGATTAG … … ATCTGTATCCGATTAG …  Apply sodium bisulfite and amplify:  Unmethylated C → T, methylated C (and A/G/T) unchanged  Try to align new sequence to known reference; compare

6 Mapping Approach: BS Seeker Chen et al (2010) BMC Bioinformatics BS reads are C/T converted, so normal aligners are not applicable Three letter alignment: AATCGTA CTAATCGCAG G BS read: Ref. genome: TTAATTGTAGG AATTGTA Convert C to T AATTGTA TTAATTGTAGG Bowtie mapping CTAATCGCAG G AATCGTA Restore to 4 letters m m u u Compare alignments

7 Methylation levels at single-base resolution 7  Calculate methylation level at each covered cytosine  Methylation level= #C/(#C+#T) 5’ --attgagacatcctagcgcgtggtgacaataata—- 3’ ttttagcgcgtggtg cattttagtgcgtgg tagtgcgtggtg 3/(3+0)=100% 1/(1+2)=33.3% Ref. genome:

8 Workflow Alignments – BS-Seeker mapping of NPB and 9311 samples to NPB reference genome. – Maps 9311 genome to NPB coordinates Parent genomes – Each read generates a small implied sequence fragment. – Use this to generate a parent genome. F1 read matching Map reads to NPB reference genome to get location. Compare each read to NPB and 9311 parent genomes and determine better match.

9 SNP parent1 parent2 Methylation level at CG sites Methylation level at CG sites BS-seq parent1/parent2 Detecting Alelle-Specific methylation

10 Library statistics Methyl-Seq ReadsMapped% MappedCoverage NPB 298M 134M45% 17.58 93-11 157M 74M47% 10.14 NPB x 93-11 594M 279M47% 20.04 -NPB 6.51 -93-11 6.08 93-11 x NPB 543M236M43% 25.77 -NPB 7.45 -93-11 6.59 RNA-Seq NPB 42M 17M42% 93-11 42M 13M31% NPB x 93-11 48M 12M26% -NPB -93-11 93-11 x NPB 43M 11M25% -NPB -93-11

11 Identifying SNPs If sites: – > 3 reads/strand – > 90% agreement within ecotype – Strands agree with each other (compensate for Cs). – (obviously) disagree with each other. Will miss indels, dups, inversions, other chr rearrangements. Will miss long runs of SNPs ( > 3 within ~55 bp) (BS-seeker limit)

12 SNPs - NPB vs 93-11 1,209,456 mutations / 306,106,830 sites with mutual base calls ~ 1/253 bases Mostly (73%) C->T (or G- >A if C->T on opposite strand) or T->C & A->G if in other 93-11 ACGT A 86,677,300 42,553 216,135 42,513 C 43,336 65,771,387 34,146 226,045 G 34,146 65,771,387 43,336 T 42,513 216,135 42,553 86,677,300

13 SNPs - NPB vs F1 (9N-NPB) 12 mutations Are these real or false? Similar numbers amongst all F1 comparisons ACGT A 3,188,414 - 3 - C - 2,695,005 - 3 G 2 - 2,548,205 - T - 4 - 3,253,196

14 Identifying epimutations Use the binomial dist. to build min, max, and mean pct methylation at each C. Confidence intervals at 5% are min, max As # of reads ^, interval size v Reads Min/max

15 Identifying epimutations (cont) Called different if: – mean(sample1) max(sample1)

16 1 in 300 CG sites spontaneously mutate across one generation Epimutation rate

17 Epimutation clusters 9311 parent NPB parent NPB cross 9311 cross

18 Epimutation clusters II 9311 parent NPB parent NPB cross 9311 cross

19 Epimutations are enriched in regions where parents differ Half of the epimutations between parents and crosses occur at sites where parents differ

20 Epimutations (continued) Epimutations within genes – 498 genes were significantly enriched for epimutations – GO Term x-ecotypes indicates: ATP synthesizing related activity (ATP synthesis coupled proton transport, hydrogen transport, ion transmembrane transport, etc).

21 Expression Many genes (~7800/25640) are differentially expressed between ecotypes. GO term: choroplast related terms, response to cadmiumion.

22 Expression cont. Across generations, only 78 genes differentially expressed Of these only 2 were differentially expressed in the parents

23 Allele Specific Expression 681 examples of allele specific expression Partially explain hybrid vigor? NPB parent NPB cross 9311 parent 9311 cross NPB cross 9311 cross

24 Allele-Specific Genes Accumulate Mutations SNP Density All genesAllele-specific genes And are also enriched for differentially methylated sites

25 Allele-specific Expression cont. And are also enriched for differentially methylated sites

26 RNA Editing Cytidine deamination : C to U Adenosine deaminase: A to I (G)

27 How Widespread Recent studies indicate that RNA editing may be more widespread than originally thought Others have disputed this claim (Schrider et al, PlosOne) In plants RNA editing is thought to take place in the mitochondria and plastids Is there editing in nuclear genes? Science. 2011 Jul 1;333(6038):53-8.

28 RNA Editing in Rice NPB - RNA ACGT NPB - DNA A5535334690730632219 C4758443628242797054 G3777243743826364213 T2210322769495577323 Initially we found lots of examples….

29 On Closer Inspection… Alignments are often off by one or more bases at splice sites

30 But a Few Real Ones Remain?

31 But more Filtering Should be done… Position of edit site along read

32 Current Numbers

33 Conclusions Epimutation rates are one in 300 cytosines across one generation – Clusters of epimutations are present – Are enriched in sites where parental epigenomes differ Allele-specific expression is widespread and associated with – Increased SNP densities – Higher differential methylation Find some evidence for RNA editing but…

34 Acknowledgements – Krishna Chodavarapu (Pellegrini Lab) – Suhua Feng (Steve Jacobsen Lab) – Blake Myers, Guo-liang Wang, Yulin Jia


Download ppt "Mutations and Epimutations A story of two cultivars and their children. Matteo Pellegrini."

Similar presentations


Ads by Google