Genome-wide Analysis of Pre-mRNA 3′ End Processing Reveals a Decisive Role of Human Cleavage Factor I in the Regulation of 3′ UTR Length

1 Genome-wide Analysis of Pre-mRNA 3′ End Processing Reveals a Decisive Role of Human Cleavage Factor I in the Regulation of 3′ UTR Length. Georges Martin, Andreas R. Gruber, Walter Keller, Mihaela Zavolan. Cell Reports, Volume 1, Issue 6, Pages 753-763 (June 2012). DOI: 10.1016/j.celrep.2012.05.003. Copyright © 2012 The Authors

2 Cell Reports 2012 1, 753-763. DOI: 10.1016/j.celrep.2012.05.003
Copyright © 2012 The Authors

3 Figure 1. Outline of the A-seq Method Used to Map Binding Sites of 3′ End Cleavage Sites. Further details are provided in Experimental Procedures. See also Figure S1.

4 Figure 2. Relationship between Polyadenylation Motifs and Cleavage Sites. (A and B) Nucleotide composition (A) and frequency of the two most common poly(A) signals (B), as a function of distance relative to the dominant cleavage sites, which are anchored at 0. (C and D) Distributions of poly(A) hexamer scores (see Experimental Procedures) (C) and fraction of reads derived from the first, second, third, or fourth cleavage site (D) for genes with 1, 2, 3, and 4 identified cleavage sites. Cleavage sites are sorted from most proximal (left) to most distal (right). Distributions are summarized as box plots, with boxes indicating the interquartile range, the black horizontal line the median, and the whiskers delimiting the interval of 1.5 times the interquartile range, centered at the median. Points outside of this interval are shown as circles. (E) The type of poly(A) signal (strong: AAUAAA; medium: AUUAAA and AGUAAA; weak: all other motifs described in Beaudoing et al., 2000) found at alternative cleavage sites in genes with 1, 2, or 3 tandem cleavage sites (CSs), as well as their conservation in mouse, indicated by the type of poly(A) signal identified in the orthologous mouse regions (details in Experimental Procedures). See also Figure S2.
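
The three signal classes named in panel E can be written as a small lookup. The Python sketch below is illustrative only: the function name `classify_poly_a_signal` and the `other_motifs` parameter are assumptions, not part of the paper. The weaker motifs from Beaudoing et al., 2000 are not listed in the caption, so the caller has to supply them.

```python
# Strong and medium classes exactly as stated in the Figure 2E caption.
STRONG_SIGNAL = "AAUAAA"
MEDIUM_SIGNALS = frozenset({"AUUAAA", "AGUAAA"})


def classify_poly_a_signal(hexamer, other_motifs=frozenset()):
    """Return 'strong', 'medium', 'weak', or None for a 6-nt motif.

    other_motifs is a hypothetical parameter: the set of remaining
    motifs from Beaudoing et al., 2000, supplied by the caller.
    """
    # Accept DNA or RNA spelling, in either case.
    motif = hexamer.upper().replace("T", "U")
    if motif == STRONG_SIGNAL:
        return "strong"
    if motif in MEDIUM_SIGNALS:
        return "medium"
    if motif in other_motifs:
        return "weak"
    return None


# Assumed example entry for the weak class; replace with the published list.
example_weak = {"UAUAAA"}
label = classify_poly_a_signal("AATAAA", example_weak)
```

Passing the weak-motif list in as an argument keeps the sketch from hard-coding a list that the caption only cites.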

5 Figure 3. Positional Profiles of Binding of 3′ End Processing Factors Relative to Dominant Cleavage Sites. (A) Average density of reads from PAR-CLIP samples of CF Im (top), CPSF (middle), and CstF (bottom) proteins in the vicinity of the 3,000 most abundant dominant cleavage sites. (B) The span of the region in which the density of reads is within 1% of the density at the peak for each factor. Positions are indicated in nucleotides relative to the cleavage site, which is located at 0. See also Figure S3.

6 Figure 4. Sequence Motifs That Are Most Enriched in the Binding Sites of 3′ End Processing Factors. The MEME-identified motifs that were represented in at least 50 of the 500 most abundantly isolated sites of various 3′ end processing factors are shown. For each motif, the number of sites among the top 500 that contained it and the E-value are indicated. See also Figure S4.

7 Figure 5. Scatter Plots of Proximal/Distal Poly(A) Site Usage Ratio in Pairs of A-seq Samples. (A) A-seq samples prepared from cells treated as indicated were used to infer poly(A) site usage. Each dot represents one gene that had more than one cleavage site in a terminal exon. The proximal/distal ratios were calculated as Σ(number of A-seq reads at all 3′ end processing regions except the distal one)/(number of A-seq reads at the most distal 3′ end processing region). (B) Effects of CF Im68 siRNA treatment on the poly(A) site choice (northern blot panel, NB, and genome browser panel showing the A-seq results) and protein levels (WB panel) of RNF11. A comparison of proximal/distal site usage ratios between northern blots (dark gray columns; quantification done with the ImageJ software) and A-seq (light gray columns) is shown in panel “NB and A-seq p/d ratios,” where no-si indicates no siRNA treatment, si-A is siRNA scrambled control A, si-64 is siCstF-64, and si-68 is siCF Im68 treatment. See also Figure S5.
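
The proximal/distal ratio defined in panel A can be sketched in a few lines of Python. This is a minimal illustration, not the authors' code: the function name, the input layout (read counts ordered from most proximal to most distal site), and the choice to return `None` when the distal site has no reads are all assumptions.

```python
def proximal_distal_ratio(site_reads):
    """Ratio of reads at all non-distal sites to reads at the distal site.

    site_reads: A-seq read counts for one gene's terminal exon, ordered
    from the most proximal to the most distal 3' end processing region
    (hypothetical input layout).
    """
    if len(site_reads) < 2:
        raise ValueError("a gene needs at least two cleavage sites")
    distal = site_reads[-1]
    if distal == 0:
        # Assumption: the ratio is undefined without distal reads.
        return None
    # Sum over every region except the most distal one.
    return sum(site_reads[:-1]) / distal


# Hypothetical counts for a gene with three tandem sites.
ratio = proximal_distal_ratio([30, 10, 20])  # (30 + 10) / 20
```

Comparing this ratio for the same gene between two samples gives one point in a scatter plot of the kind the caption describes.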

8 Figure 6. A Model of the Effect of CF Im68 Concentration on the Choice of the Cleavage Site. Lack of deposition of CF Im at proximal sites hinders the cleavage and stimulates transcription toward distal cleavage sites. Deposition of CF Im at optimal, distal sites releases the block on the CPSF cleavage activity, allowing formation of the 3′ end. Absence of CF Im from some 3′ end complexes when its concentration is low results in no inhibition being sensed at the proximal site, where CPSF can cleave to produce a mature 3′ end. For simplicity, additional factors and the RNA polymerase are not depicted.

9 Figure S1. Density of Reads Originating from the SMNDC1 Locus Viewed on the CLIPZ Genome Browser, Related to Figure 1. (A) The coordinates of the locus are shown at the top of the figure, followed by the gene structure inferred from various GenBank transcripts. Tracks corresponding to CLIP and A-seq samples follow. The read copy number in a given sample is shown in reads per million on the scales to the right of the tracks. The first density track corresponds to the A-seq sample generated from HEK293 cells cultured under normal conditions and shows that the three poly(A) sites from polyA-DB are used to various degrees. Tracks for CLIP densities corresponding to various 3′ end processing factors follow. AR or FR indicates the antibody used for IP, where A is a protein-specific antibody and F is a FLAG-specific antibody used with FLAG-tagged proteins. R stands for the partial RNase I digest. (B) A-seq and CLIP reads near the poly(A) site of the PAPOLG transcript. The upper panel depicts density plots for CF Im25, CF Im59, and CF Im68. Below is a section from the CLIPZ genome browser showing a subset of reads obtained from CF Im CLIP experiments. UGUA motifs are highlighted in pink; the samples from which the reads were derived are shown on the right side.

10 Figure S2. Northern Blots Showing the Relative Usage of Alternative Cleavage Sites with and without Knockdown of 3′ End Processing Polypeptides CstF-64 and CF Im68, Related to Figure 2. (A) Numbers on the left-hand side of the blots are sizes of transcripts (in kilobases). (B) Comparison of A-seq to northern blot results. Northern blots were quantified with ImageJ and corrected to signals from β-actin reprobing of the filters (not shown). For A-seq, read counts were taken from the CLIPZ database. Numbers to the left are arbitrary units. No-si is no siRNA, si-A is siRNA scrambled control A, si-64 is siCstF-64, and si-68 is siCF Im68 treatment. No-si ratios of both northern blots and A-seq were set to 1. (C) The relative usage of the corresponding sites determined with A-seq, as seen on the CLIPZ web server, is shown. 3′ UTR regions from various transcripts in GenBank are shown at the top of the figures, followed by annotated cleavage sites from polyA-DB. Tracks illustrating the density of reads obtained in individual A-seq samples from these genomic regions follow.

11 Figure S3. Distribution of Binding Sites of 3′ End Processing Factors Relative to the Cleavage Site from Replicate CLIP and of HITS-CLIP Experiments and to Dominant Cleavage Sites with and without UGUA Motifs, Related to Figure 3. (A) Left panel: Averaged density of reads from replicate CLIP libraries prepared for CPSF proteins, also listed in Table S3. Right panel: Read density from libraries obtained with the HITS-CLIP method for CPSF-73 and CPSF-160 involving UV-crosslinking with 254 nm light in the absence of 4-thio-uridine. Positions are indicated in nucleotides relative to the cleavage site, which is located at 0. (B) Upper two panels: Read densities for CF Im components around cleavage sites without (left) and with (right) UGUA motifs within the 100 nts upstream of the CS. Lower two panels: UGUA frequencies for cleavage sites with and without UGUA motifs in the 100 nts upstream of the CS.

12 Figure S4. Positional Relationship between the Enriched Sequence Motifs (UGUA in Panel A, AAUAAA in Panel B, and UGUSU in Panel C) and the Most Frequently Crosslinked Nucleotides in Binding Sites of Individual Factors, Related to Figure 4. The 500 most abundantly CLIPed sites for each individual factor were used for these plots. The sites were anchored at the most frequently crosslinked nucleotide, as determined by the number of crosslink-diagnostic (T-to-C) mutations (Hafner et al., 2010; Kishore et al., 2011) in the sequenced reads associated with the sites. The positions where the start of individual motifs occurred with respect to the most abundantly crosslinked nucleotide were used to generate the histograms.
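
The anchoring step described in this caption, where motif start positions are measured relative to the most frequently crosslinked nucleotide, can be sketched as follows. This is only an illustration under stated assumptions: the function names, the 0-based `anchor` index, and the example site are hypothetical, and the sketch matches exact motifs only, so a degenerate motif such as UGUSU would have to be expanded into its concrete variants first.

```python
from collections import Counter


def motif_offsets(sequence, anchor, motif):
    """Start positions of `motif` in `sequence`, relative to `anchor`.

    anchor: 0-based index of the most frequently crosslinked nucleotide
    (hypothetical input; the paper derives it from T-to-C mutations).
    Overlapping occurrences are reported as well.
    """
    offsets = []
    start = sequence.find(motif)
    while start != -1:
        offsets.append(start - anchor)
        # Advance by one position so overlapping matches are found.
        start = sequence.find(motif, start + 1)
    return offsets


def offset_histogram(sites, motif):
    """Count motif start offsets over many (sequence, anchor) sites."""
    counts = Counter()
    for sequence, anchor in sites:
        counts.update(motif_offsets(sequence, anchor, motif))
    return counts


# Hypothetical binding site with the crosslinked nucleotide at index 6.
offsets = motif_offsets("AAUGUAAAUGUA", 6, "UGUA")  # -> [-4, 2]
```

Negative offsets mean the motif starts upstream of the crosslinked nucleotide, and positive offsets mean it starts downstream.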

13 Figure S5. Quantification of Poly(A)-Containing RNA in Total RNA from siRNA-Treated HEK293 Cells, Related to Figure 5. (A) Cells were stained with trypan blue and live cells counted after three days (blue columns). Total RNA was extracted, and 500, 167, and 50 ng of total RNA were assayed for poly(A)+ RNA (red columns show the fraction of poly(A)-containing/total RNA; see Extended Experimental Procedures). (B) The numbers of reads (from a standardized sample size of 1′000′000) that were obtained at various types of sites in the indicated samples are summarized as box plots. The box indicates the interquartile range (the median is shown as a black horizontal line). Whiskers indicate the interval of 1.5 times the interquartile range, centered on the median. Points outside of this range are shown as circles. Reads that mapped to the introns of transcripts that had two or more poly(A) sites in the same exon were used to compute the usage of intronic cleavage sites in various libraries.

