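Presenter: Kim Tae-hyung (김태형), second-year master's student. Vol. 11, Issue 3, 389-404, March 2001. Comparative DNA Sequence Analysis of Mouse and Human Protocadherin Gene Clusters (comparative DNA sequence analysis of the human and mouse PCDH gene clusters)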



Experiment
- Designed 19 pairs of PCR primers to amplify genomic DNA containing the homologous mouse protocadherin genes
- Screened a mouse BAC genomic DNA library (RPCI-23) and isolated 21 BAC clones containing sequences of the mouse protocadherin gene clusters
- Selected seven minimally overlapping clones for DNA sequencing (RPCI-23_193o23, 6p18, 72c14, 92d17, 161o8, 56b11, and 19k11)
- Compared the sequences of cDNAs with those of the genomic DNA

[Figure: protocadherin gene clusters on human chromosome 5q31 and mouse chromosome 18. Legend: BAC clone; Pcdha variable region exons; Pcdhb variable region exons; Pcdhg variable region exons; C-type Pcdh variable region exons; relic or pseudogene.]

Comparison of the Organization of the Mouse and Human Pcdha Gene Clusters
- Consensus splice sites were confirmed at the ends of all 14 mouse Pcdha variable region exons
- The first 12 mouse Pcdha genes are highly similar to each other
- Like the corresponding human genes, the mouse Pcdha-C1 and -C2 genes are more similar to each other than to the 12 upstream Pcdha genes
- Constant region exons 1, 2, and 3 are 92%, 99%, and 89% identical, respectively, between mouse and human

Comparison of the Organization of Human and Mouse Pcdhg Gene Clusters
- The mouse cluster contains 22 Pcdhg variable region exons, with three small constant region exons in the region downstream
- The constant region exon sequences are highly conserved between mouse and human: constant region exons 1, 2, and 3 have 95%, 90%, and 80% identity, respectively, at the nucleotide level
- Every mouse Pcdhg gene has an orthologous human gene except the mouse Pcdhg-b8 gene
- Mouse has a relic sequence at the location corresponding to the human Pcdhg-b3 gene
- As in the Pcdha gene cluster, the last three Pcdhg genes (C3, C4, and C5) are conserved between mouse and human

Comparison of the Organization of Human and Mouse Pcdhb Gene Clusters
- A single large exon encodes an 818-aa protein containing a signal peptide
- The protein is highly similar to the human Pcdhb1 protein: 88% identity and 92% similarity with no gaps over the entire length
- The sequence covering the gap between the human Pcdhb8 and Pcdhb9 genes contains only one additional Pcdhb gene (therefore designated Pcdhb8a)
- Mouse has six more Pcdhb genes than human, so the Pcdhb locus is expanded in mouse relative to human
- The Pcdhb proteins have highly conserved extracellular and transmembrane domains
- The conserved Pcdhb 5' splice sites do function. However, neither the cell type in which this splicing occurs nor the target 3' splice site has been identified

Evolutionary Relationships among Members of the Human and Mouse Pcdha, Pcdhb, and Pcdhg Genes
- Members of the Pcdhg genes are strictly conserved between mouse and human
- The mouse ortholog of the human Pcdhg-b3 gene has degenerated into a relic sequence, and the human ortholog of mouse Pcdhg-b8 has become a pseudogene
- The orthologous genes are arranged in the same order and orientation
- The C-type protocadherin genes (the last two Pcdha genes and the last three Pcdhg genes) are more similar to each other, and are separated from the corresponding upstream genes by a very large intergenic region (>40 kb) in both mouse and human
- The Pcdha and Pcdhg gene clusters have constant region exons that are highly conserved between mouse and human
- The Pcdhb gene cluster has no constant region exons in either mouse or human

The Distribution of CpG Islands Corresponds to the Locations of the Variable Region Exons
- Sequences around the translation start sites of mouse and human protocadherin variable region exons revealed a high density of CpG dinucleotides, suggesting that they are CpG islands
- The entire human and mouse gene clusters were searched for CpG islands using the CpGplot program
- The ratio peaks correlate with the positions of protocadherin variable region exons but not constant region exons
- This distribution supports the proposal that each variable region exon has its own promoter, with a transcriptional start site located upstream of each variable region exon
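The kind of scan described on this slide can be illustrated with a minimal Python sketch. It is not the CpGplot program itself: it only computes, per sliding window, the GC fraction and the ratio of observed to expected CpG dinucleotides. The function names, the 100-bp window, and the thresholds (GC fraction of at least 0.5, observed/expected ratio of at least 0.6) are assumptions chosen for illustration and are not taken from the slides.

```python
def cpg_windows(seq, window=100, step=1):
    """Yield (start, gc_fraction, obs_exp_ratio) for each sliding window.

    Expected CpG count is (#C * #G) / window length; the ratio is
    observed CpG count divided by that expectation.
    """
    seq = seq.upper()
    for start in range(0, len(seq) - window + 1, step):
        w = seq[start:start + window]
        c = w.count("C")
        g = w.count("G")
        observed = w.count("CG")
        gc_fraction = (c + g) / window
        expected = c * g / window
        ratio = observed / expected if expected else 0.0
        yield start, gc_fraction, ratio


def cpg_island_windows(seq, window=100, step=1, min_gc=0.5, min_ratio=0.6):
    """Return start positions of windows passing both (assumed) thresholds."""
    return [
        start
        for start, gc, ratio in cpg_windows(seq, window, step)
        if gc >= min_gc and ratio >= min_ratio
    ]


if __name__ == "__main__":
    # Toy sequence: AT-rich flanks around a CpG-rich core (hypothetical data).
    toy = "AT" * 100 + "CG" * 100 + "AT" * 100
    print(cpg_island_windows(toy, window=100, step=50))
```

Plotting the ratio against position and overlaying the exon coordinates would show whether peaks coincide with variable region exons, which is the comparison the slide describes.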

[Figure: labels are mouse genome sequence, mouse annotation, and human identity.]

[Figure: orthologous versus paralogous comparisons, human vs. mouse.]

Noncoding Sequence Conservation Within the Variable Region of Mouse and Human
- The PipMaker program (Schwartz et al.) was used for a systematic analysis of these sequences
- First two relics (r1 and r2) in the mouse Pcdha gene cluster
- The most striking feature is the occurrence of highly conserved sequences (70% identity and longer than 100 bp) upstream of each variable region exon
- The 5' flanking sequences of orthologous variable region exons have a significantly higher percent identity than the corresponding paralogous sequences within the Pcdha and Pcdhg gene clusters in both mouse and human
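The conservation criterion on this slide (70% identity over at least 100 bp) can be sketched as a sliding-window scan over a pairwise alignment. This is only an illustration and not how PipMaker works internally: the input is assumed to be two already-aligned strings of equal length with `-` for gaps, and the function name and merging rule are hypothetical.

```python
def conserved_segments(aligned_a, aligned_b, window=100, min_identity=0.70):
    """Return merged (start, end) alignment-column ranges in which every
    `window`-column stretch has at least `min_identity` identical bases.

    Gap columns ('-') never count as matches.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    n = len(aligned_a)
    segments = []
    if n < window:
        return segments

    matches = [
        a == b and a != "-"
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
    ]
    count = sum(matches[:window])  # matches in the first window
    for start in range(n - window + 1):
        if start > 0:
            # Slide the window by one column.
            count += matches[start + window - 1] - matches[start - 1]
        if count / window >= min_identity:
            end = start + window
            if segments and start <= segments[-1][1]:
                # Overlaps or touches the previous segment: extend it.
                segments[-1] = (segments[-1][0], end)
            else:
                segments.append((start, end))
    return segments


if __name__ == "__main__":
    # Hypothetical alignment: 100 identical columns followed by 100 mismatches.
    a = "ACGT" * 25 + "A" * 100
    b = "ACGT" * 25 + "C" * 100
    print(conserved_segments(a, b))
```

Running the same scan on orthologous and on paralogous pairs of 5' flanking sequences would give the two sets of identities that the last bullet compares.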

Identification of a DNA Sequence Motif Upstream of Protocadherin Variable Region Exons
- A version of the Gibbs sampler program called GibbsDNA was used
- The motif cannot be found in transcription factor binding site databases
- Neither the human nor the mouse Pcdhb1 gene has the motif
- All three gene clusters revealed a common core sequence, "CGCT"
- The locations of the motifs strongly suggest that they are important for the regulation of protocadherin gene expression
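Once a core sequence such as "CGCT" is known, its positions in upstream sequences can be checked with a simple scan. The sketch below is not GibbsDNA and does no motif discovery; it only searches both strands for a given core. The function names and the example sequence are hypothetical.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def find_core(seq, core="CGCT"):
    """Return (position, strand) for each occurrence of `core` in `seq`.

    Strand '+' means the core itself was found; '-' means its reverse
    complement was found at that position on the given strand.
    """
    seq = seq.upper()
    core = core.upper()
    rc = reverse_complement(core)
    k = len(core)
    hits = []
    for i in range(len(seq) - k + 1):
        kmer = seq[i:i + k]
        if kmer == core:
            hits.append((i, "+"))
        elif kmer == rc:
            hits.append((i, "-"))
    return hits


if __name__ == "__main__":
    # Hypothetical upstream sequence, for illustration only.
    print(find_core("TTCGCTAAAGCGTT"))
```

A four-base core occurs often by chance, so a scan like this only locates candidate sites; it does not by itself show that a site is functional.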

DISCUSSION
- The overall genomic organization of the three protocadherin gene clusters is highly conserved between mouse and human
- Interspersed repeats occupy 41% and 36% of the genomic sequences in the protocadherin loci in mouse and human, respectively, compared with 30% in the human T-cell receptor locus
- The proportion of SINEs is much higher than that of LINEs
- Orthologous mouse and human gene pairs were identified in the Pcdha and Pcdhg gene clusters
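A repeat percentage of the kind quoted above can be computed from a masked sequence. The sketch below assumes soft-masked input, in which repeats are lowercase and the rest is uppercase (the form RepeatMasker produces with its `-xsmall` option). The function name and the example are hypothetical, and nothing here reproduces the numbers on the slide.

```python
def repeat_fraction(soft_masked_seq):
    """Return the fraction of bases that are lowercase (soft-masked).

    Non-letter characters such as newlines are ignored; uppercase
    letters, including N, count as unmasked bases.
    """
    bases = [ch for ch in soft_masked_seq if ch.isalpha()]
    if not bases:
        return 0.0
    masked = sum(1 for ch in bases if ch.islower())
    return masked / len(bases)


if __name__ == "__main__":
    # Hypothetical soft-masked fragment: 4 of 10 bases are masked.
    print(f"{repeat_fraction('ACGTacgtAC'):.0%}")
```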

METHODS
- Mouse BAC Isolation and Sequencing
  – Nineteen PCR primer pairs were designed to screen a mouse BAC library (RPCI-23)
- Phylogenetic Analysis
  – PAUP (Phylogenetic Analysis Using Parsimony), version
- Sequence Analysis
  – Annotation: aligned by using the multiple sequence alignment program Pileup
  – Comparison: RepeatMasker, PipMaker