Presentation is loading. Please wait.

Presentation is loading. Please wait.

Comparative Genomics Gene Regulatory Networks (GRNs) Anil Jegga Biomedical Informatics Contact Information: Anil Jegga Biomedical Informatics Room # 232,

Similar presentations


Presentation on theme: "Comparative Genomics Gene Regulatory Networks (GRNs) Anil Jegga Biomedical Informatics Contact Information: Anil Jegga Biomedical Informatics Room # 232,"— Presentation transcript:

1 Comparative Genomics Gene Regulatory Networks (GRNs) Anil Jegga Biomedical Informatics Contact Information: Anil Jegga Biomedical Informatics Room # 232, S Building 10th Floor CCHMC Homepage: http://anil.cchmc.org Tel: 513-636-0261 E-mail: anil.jegga@cchmc.org Session 2: February 24, 2012 2/24/20121Jegga Biomedical Informatics Additional exercise available at: http://anil.cchmc.org/grn.html

2 Session 1: Overview of GRNs (Feb 23) a.Computational Approaches b.Cis-Element Identification c.Comparative Genomics d.Regulatory region variations e.p53 case study Session 2: Database Session (Feb 24) a.Genome Browsers b.Promoter Analysis, TFBS Search c.Co-regulated gene analysis 2/24/20122Jegga Biomedical Informatics

3 Session 2 (Databases/Servers) Feb 24, 2012 a.Genome Browsers b.Promoter Analysis, TFBS Search c.Co-regulated gene analysis

4 Genome Browser (http://genome.ucsc.edu) 2/24/20124Jegga Biomedical Informatics

5 1 45 6 3 2 Genome Browser (http://genome.ucsc.edu) Genome Browser Gateway choices: 1.Select Clade 2.Select genome/species: You can search only one species at a time 3.Assembly: the official backbone DNA sequence 4.Position: location in the genome to examine or search term (gene symbol, accession number, etc.) 5.Image width: how many pixels in display window; 5000 max 6.Configure: make fonts bigger + other options 2/24/20125Jegga Biomedical Informatics

6 Genome Browser (http://genome.ucsc.edu) 2/24/20126Jegga Biomedical Informatics

7 Genome Browser (http://genome.ucsc.edu) Explore the tracks 2/24/20127Jegga Biomedical Informatics

8 2/24/20128Jegga Biomedical Informatics

9 2/24/2012Jegga Biomedical Informatics9 What if I want to download promoter sequences for several genes at a time?

10 Genome Browser (http://genome.ucsc.edu) 2/24/201210Jegga Biomedical Informatics

11 Genome Browser (http://genome.ucsc.edu) 1 1 2 2 3 3 4 4 5 5 6 6 2/24/201211Jegga Biomedical Informatics

12 2/24/201212Jegga Biomedical Informatics

13 2/24/2012Jegga Biomedical Informatics13 Other Genome Browsers: ENSEMBL http://www.ensembl.org

14 2/24/2012Jegga Biomedical Informatics14 I have a promoter sequence and how do I scan it for known TFBSs?

15 2/24/2012Jegga Biomedical Informatics15 JASPAR: http://jaspar.genereg.net

16 2/24/2012Jegga Biomedical Informatics16 JASPAR: http://jaspar.genereg.net

17 2/24/2012Jegga Biomedical Informatics17 JASPAR: http://jaspar.genereg.net

18 2/24/2012Jegga Biomedical Informatics18 Gene-Regulation: http://www.gene-regulation.com Need to have an account (free for academic use)

19 2/24/2012Jegga Biomedical Informatics19 How can I identify putative regulatory regions for a gene or microRNA?

20 I have found a miRNA enriched in my gene list or I am interested in a specific gene and I want to identify putative regulatory regions for miRNA/gene GenomeTrafac: http://genometrafac.cchmc.org 2/24/201220Jegga Biomedical Informatics

21 GenomeTrafac: http://genometrafac.cchmc.org 2/24/201221Jegga Biomedical Informatics

22 GenomeTrafac: http://genometrafac.cchmc.org 2/24/201222Jegga Biomedical Informatics

23 GenomeTrafac: http://genometrafac.cchmc.org 2/24/201223Jegga Biomedical Informatics

24 GenomeTrafac: http://genometrafac.cchmc.org 2/24/201224Jegga Biomedical Informatics

25 GenomeTrafac: http://genometrafac.cchmc.org 2/24/201225Jegga Biomedical Informatics

26 2/24/2012Jegga Biomedical Informatics26 DCODE: http://www.dcode.org/

27 2/24/2012Jegga Biomedical Informatics27 ECR Browser: http://ecrbrowser.dcode.org/ Multispecies (not limited to pairwise comparisons)

28 2/24/2012Jegga Biomedical Informatics28 ECR Browser: http://ecrbrowser.dcode.org/

29 2/24/2012Jegga Biomedical Informatics29 ECR Browser: http://ecrbrowser.dcode.org/

30 2/24/2012Jegga Biomedical Informatics30 ECR Browser: http://ecrbrowser.dcode.org/

31 2/24/2012Jegga Biomedical Informatics31 ECR Browser: http://ecrbrowser.dcode.org/

32 RESOURCES - URLs: Summary Application/ResourceURL Genome Browserhttp://genome.ucsc.edu JASPARhttp://jaspar.genereg.net/ Gene Regulationhttp://www.gene-regulation.com GenomeTrafachttp://genometrafac.cchmc.org DCODEhttp://www.dcode.org/ 2/24/201232Jegga Biomedical Informatics

33 I have a list of co-expressed mRNAs (Transcriptome)…. I want to find the shared cis-elements – Known and Novel  Known transcription factor binding sites (TFBS)  Conserved oPOSSUM DiRE  Non-conserved Pscan MatInspector (*Licensed)  Unknown TFBS or Novel motifs  Conserved oPOSSUM Weeder-H  Non-conserved MEME Weeder 1.Each of these applications support different forms of input. Very few support probeset IDs. 2.Red Font: Input sequence required; Do not support gene symbols, gene IDs, or accession numbers. The advantage is you can use them for scanning sequences from any species. 3.*Licensed software: We have access to the licensed version. 1.Each of these applications support different forms of input. Very few support probeset IDs. 2.Red Font: Input sequence required; Do not support gene symbols, gene IDs, or accession numbers. The advantage is you can use them for scanning sequences from any species. 3.*Licensed software: We have access to the licensed version. 2/24/201233Jegga Biomedical Informatics

34 I have a list of co-expressed mRNAs (Transcriptome)…. I want to find the shared cis-elements – Known and Novel  Known transcription factor binding sites (TFBS)  Conserved oPOSSUM DiRE  Non-conserved Pscan MatInspector (*Licensed)  Unknown TFBS or Novel motifs  Conserved oPOSSUM Weeder-H  Non-conserved MEME Weeder 2/24/201234Jegga Biomedical Informatics

35 oPOSSUM (http://burgundy.cmmt.ubc.ca/oPOSSUM/) Supports human and mouse 2/24/201235Jegga Biomedical Informatics

36 oPOSSUM (http://www.cisreg.ca/oPOSSUM) Disadvantage: Supports either human or mouse only 2/24/201236Jegga Biomedical Informatics

37 oPOSSUM (http://www.cisreg.ca/oPOSSUM) The JASPAR PHYLOFACTS database consists of 174 profiles that were extracted from phylogenetically conserved gene upstream elements. They are a mix of known and as of yet undefined motifs. When should it be used? They are useful when one expects that other factors might determine promoter characteristics and/or tissue specificity. The JASPAR PHYLOFACTS database consists of 174 profiles that were extracted from phylogenetically conserved gene upstream elements. They are a mix of known and as of yet undefined motifs. When should it be used? They are useful when one expects that other factors might determine promoter characteristics and/or tissue specificity. 2/24/201237Jegga Biomedical Informatics

38 oPOSSUM (http://www.cisreg.ca/oPOSSUM) The Z-score statistic reflects the occurrence of the TFBS in the promoters of the co-expressed set compared to background. The Fisher statistic reflects the proportion of genes that contain the TFBS compared to background. 2/24/201238Jegga Biomedical Informatics

39 oPOSSUM (http://www.cisreg.ca/oPOSSUM) 2/24/201239Jegga Biomedical Informatics

40 2/24/201240Jegga Biomedical Informatics

41 oPOSSUM (http://www.cisreg.ca/oPOSSUM) 2/24/201241Jegga Biomedical Informatics

42 DiRE (http://dire.dcode.org/) 2/24/201242Jegga Biomedical Informatics

43 DiRE (http://dire.dcode.org/) ECR-Browser (http://ecrbrowser.dcode.org/) 2/24/201243Jegga Biomedical Informatics

44 Pscan (http://159.149.109.9/pscan) 2/24/201244Jegga Biomedical Informatics

45 Pscan (http://159.149.109.9/pscan) 2/24/201245Jegga Biomedical Informatics

46 Pscan (http://159.149.109.9/pscan) 2/24/201246Jegga Biomedical Informatics

47 Pscan (http://159.149.109.9/pscan) 2/24/201247Jegga Biomedical Informatics

48 Pscan (http://159.149.109.9/pscan) 2/24/201248Jegga Biomedical Informatics

49 Pscan (http://159.149.109.9/pscan) Comparing different input gene sets : 1.In the detailed output for a given matrix, you can compare the results obtained with the matrix on the gene set just submitted with the results the matrix had produced on another gene set. The latter could be a "negative" gene set (or vice versa ). 2.To perform the comparison, you have to fill in the "Compare with..." box fields with mean, standard deviation and sample size values of the other analysis - for the current one you can find them in the "Sample Data Statistics" box or in the overall text output that can be downloaded from the main output page. 3.Warning: Make sure that the values you input are correct, and especially that they were obtained by using the same matrix. Once you have clicked the "Go!" button, an output window will pop up and report if either of the two means is significantly higher than the other, together with a confidence p- value computed with a Welch t-test. 2/24/201249Jegga Biomedical Informatics

50 I have a list of co-expressed mRNAs (Transcriptome)…. I want to find the shared cis-elements – Known and Novel  Known transcription factor binding sites (TFBS)  Conserved oPOSSUM DiRE  Non-conserved Pscan MatInspector (*Licensed)  Unknown TFBS or Novel motifs  Conserved oPOSSUM Weeder-H  Non-conserved MEME Weeder 2/24/201250Jegga Biomedical Informatics

51 oPOSSUM (http://www.cisreg.ca/oPOSSUM) 2/24/201251Jegga Biomedical Informatics

52 oPOSSUM (http://www.cisreg.ca/oPOSSUM) The JASPAR PHYLOFACTS database consists of 174 profiles that were extracted from phylogenetically conserved gene upstream elements. They are a mix of known and as of yet undefined motifs. When should it be used? They are useful when one expects that other factors might determine promoter characteristics and/or tissue specificity. The JASPAR PHYLOFACTS database consists of 174 profiles that were extracted from phylogenetically conserved gene upstream elements. They are a mix of known and as of yet undefined motifs. When should it be used? They are useful when one expects that other factors might determine promoter characteristics and/or tissue specificity. 2/24/201252Jegga Biomedical Informatics

53 oPOSSUM (http://www.cisreg.ca/oPOSSUM) 2/24/201253Jegga Biomedical Informatics

54 oPOSSUM (http://www.cisreg.ca/oPOSSUM) 2/24/201254Jegga Biomedical Informatics

55 I have a list of co-expressed mRNAs (Transcriptome)…. I want to find the shared cis-elements – Known and Novel  Known transcription factor binding sites (TFBS)  Conserved oPOSSUM DiRE  Non-conserved Pscan MatInspector (*Licensed)  Unknown TFBS or Novel motifs  Conserved oPOSSUM Weeder-H  Non-conserved MEME Weeder 1.Each of these applications support different forms of input. Very few support probeset IDs. 2.Red Font: Input sequence required; Do not support gene symbols, gene IDs, or accession numbers. The advantage is you can use them for scanning sequences from any species. 3.*Licensed software: We have access to the licensed version. 1.Each of these applications support different forms of input. Very few support probeset IDs. 2.Red Font: Input sequence required; Do not support gene symbols, gene IDs, or accession numbers. The advantage is you can use them for scanning sequences from any species. 3.*Licensed software: We have access to the licensed version. How to fetch promoter/upstream sequence – single/multiple? 2/24/201255Jegga Biomedical Informatics

56 I have a list of co-expressed mRNAs (Transcriptome)…. I want to find the shared cis-elements – Known and Novel  Known transcription factor binding sites (TFBS)  Conserved oPOSSUM DiRE  Non-conserved Pscan MatInspector (*Licensed)  Unknown TFBS or Novel motifs  Conserved oPOSSUM Weeder-H  Non-conserved MEME Weeder 1.Each of these applications support different forms of input. Very few support probeset IDs. 2.Red Font: Input sequence required; Do not support gene symbols, gene IDs, or accession numbers. The advantage is you can use them for scanning sequences from any species. 3.*Licensed software: We have access to the licensed version. 1.Each of these applications support different forms of input. Very few support probeset IDs. 2.Red Font: Input sequence required; Do not support gene symbols, gene IDs, or accession numbers. The advantage is you can use them for scanning sequences from any species. 3.*Licensed software: We have access to the licensed version. Use the fetched promoter/upstream sequences for the following analyses 2/24/201256Jegga Biomedical Informatics

57 WeederH (http://159.149.109.9/pscan) 1.Supports large number of species. 2.Does not support multiple sequences (multifasta) input. You have to enter each sequence separately. 3.Good for small number of sequences where you expect a potential novel (or not included in the TFBS libraries) conserved motif. 2/24/201257Jegga Biomedical Informatics

58 Weeder (http://159.149.109.9/modtools/) Do not use Groupwise mail when submitting large number of sequences because the results are sent “in the mail” and not as an attachment. And Groupwise mail truncates messages if they are very long. Use Gmail instead. A link to the results page used to be sent earlier. 2/24/201258Jegga Biomedical Informatics

59 Weeder (http://159.149.109.9/modtools/) 2/24/201259Jegga Biomedical Informatics

60 MEME (http://meme.sdsc.edu) MEME takes as input a group of DNA or protein sequences and outputs as many motifs as requested. MEME uses statistical modeling techniques to automatically choose the best width, number of occurrences, and description for each motif. Your MEME results consist of: your MEME results in HTML format your MEME results in XML format your MEME results in TEXT format and the MAST results of searching your input sequences for the motifs found by MEME using MAST. 2/24/201260Jegga Biomedical Informatics

61 MEME (http://meme.sdsc.edu) 2/24/201261Jegga Biomedical Informatics

62 TOMTOM can be used to find out if an overrepresented motif in your sequences matches or is similar to a known TFBS MEME (http://meme.sdsc.edu) 2/24/201262Jegga Biomedical Informatics

63 Summary Cis-Element Finding Matrix CONSERVEDNON-CONSERVED KNOWN TFBS oPOSSUM DiRE Pscan MatInspector* NOVEL/UNKNOWN TFBS OR MOTIFS oPOSSUM WEEDER-H MEME WEEDER 2/24/201263Jegga Biomedical Informatics

64 RESOURCES - URLs: Summary Application/ResourceURL oPOSSUMhttp://burgundy.cmmt.ubc.ca/oPOSSUM/ DiREhttp://dire.dcode.org/ Weeder-Hhttp://159.149.109.9/modtools/ Weederhttp://159.149.109.9/modtools/ Pscanhttp://159.149.109.9/pscan MEMEhttp://meme.sdsc.edu/ MatInspectorhttp://www.genomatix.de/ Genome Browserhttp://genome.ucsc.edu ECR Browserhttp://ecrbrowser.dcode.org 2/24/201264Jegga Biomedical Informatics Additional exercise available at: http://anil.cchmc.org/grn.html


Download ppt "Comparative Genomics Gene Regulatory Networks (GRNs) Anil Jegga Biomedical Informatics Contact Information: Anil Jegga Biomedical Informatics Room # 232,"

Similar presentations


Ads by Google