Structure and function of genome

Slides:

Advertisements

Similar presentations

Genomics – The Language of DNA Honors Genetics 2006.

Advertisements

第八章轴系零件 § 8-1 键、销及其连接一、键连接二、销连接 § 8-2 轴一、轴的分类和应用二、轴的结构和轴上零件的固定

The Organization of Cellular Genomes Complexity of Genomes Chromosomes and Chromatin Sequences of Genomes Bioinformatics As we have discussed for the last.

Chapter 7b - Transposable elements:

第一章基因和基因组及基因工程的概念第一节基因的概念第二节基因组第三节基因工程的定义和研究内容第四节基因工程的发展史.

The Transport of Molecules into and out of the Nucleus Through an aqueous channels whose diamenter is adjustable Bidirection: import and export Active.

GENETIC-CONCEPTS.

HistCite 结果分析示例罗昭锋. By:SC 可能原因：文献年度过窄，少有相互引用.

2.2 结构的抗力抗力及其不定因素材料强度的标准值材料强度的设计值.

地理信息系统概述. 数据和信息 (Data & Information) 数据原始事实如：员工姓名，数据可以有数值、图形、声音、视觉数据等信息以一定规则组织在一起的事实的集合。

吉林大学远程教育课件主讲人 : 杨凤杰学时： 64 ( 第六十二讲 ) 离散数学. 最后，我们构造能识别 A 的 Kleene 闭包 A* 的自动机 M A* =(S A* ， I ， f A* ， s A* ， F A* ) ，令 S A* 包括所有的 S A 的状态以及一个附加的状态 s.

1 为了更好的揭示随机现象的规律性并利用数学工具描述其规律, 有必要引入随机变量来描述随机试验的不同结果例电话总机某段时间内接到的电话次数, 可用一个变量 X 来描述例检测一件产品可能出现的两个结果, 也可以用一个变量来描述第五章随机变量及其分布函数.

11-8. 电解质溶液的活度和活度系数电解质是有能力形成可以自由移动的离子的物质. 理想溶液体系分子间相互作用实际溶液体系 ( 非电解质 ) 部分电离学说 (1878 年 ) 弱电解质溶液体系离子间相互作用 (1923 年 ) 强电解质溶液体系.

第十一章曲线回归第一节曲线的类型与特点第二节曲线方程的配置第三节多项式回归.

第一章病毒的结构第一节病毒的大小、形态结构和化学组成第二节病毒的分类.

第二章随机变量及其分布第一节随机变量及其分布函数一、随机变量用数量来表示试验的基本事件定义 1 设试验的基本空间为，，如果对试验的每一个基本事件，规定一个实数记作与之对应，这样就得到一个定义在基本空间上的一个单值实函数，称变量为随机变量．随机变量常用字母、、等表示．或用.

数学系 University of Science and Technology of China DEPARTMENT OF MATHEMATICS 第 3 章曲线拟合的最小二乘法给出一组离散点，确定一个函数逼近原函数，插值是这样的一种手段。在实际中，数据不可避免的会有误差，插值函数会将这些误差也包括在内。

聚合物在生物高分子分离中的应用王延梅中国科学技术大学高分子科学与工程系 Tel

一、染色质 chromatin 二、染色体 chromosome 三、人类染色体的正常核型 the normal human karyotype 四、染色体的多态性 chromosome polymorphism 第二节遗传的细胞基础 Cellular Basis of Inheritance.

第二章贝叶斯决策理论 3学时.

非均相物系的分离沉降速度球形颗粒的：一、自由沉降二、沉降速度的计算三、直径计算 1. 试差法 2. 摩擦数群法四、非球形颗粒的自由沉降 1. 当量直径 de ：与颗粒体积相等的圆球直径 V P — 颗粒的实际体积 2. 球形度  s ： S—— 与颗粒实际体积相等的球形表面积.

化学系 3 班何萍物质的分离原理 世世界上任何物质，其存在形式几乎均以混合物状态存在。分离过程就是将混合物分成两种或多种性质不同的纯物质的过程。 分分子蒸馏技术是一种特殊的液－液分离技术。

第一节相图基本知识 1 三元相图的主要特点（1）是立体图形，主要由曲面构成；（2）可发生四相平衡转变；（3）一、二、三相区为一空间。

第九章核糖体 Robinson ＆ Brown （ 1953 ）发现于植物细胞， Palacle （ 1955 ）发现于动物细胞， Roberts （ 1958 ）建议命名为核糖核蛋白体（ ribosome ），简称核糖体。核糖体是所有类型的细胞内合成蛋白质的工厂，在一个旺盛生长的细菌中，大约有.

吉林大学远程教育课件主讲人 : 杨凤杰学时： 64 ( 第五十三讲 ) 离散数学. 定义设 G= （ V ， T ， S ， P ）是一个语法结构，由 G 产生的语言（或者说 G 的语言）是由初始状态 S 演绎出来的所有终止符的集合，记为 L （ G ） ={w  T *

RT-PCR 扬州大学生物科学与技术学院. 背景介绍 DNA 存在于细胞核中并编码了基因转录 : 双链 DNA 解链后利用其中一条链（编码链）合成信使 RNA （ mRNA ） mRNA 从细胞核转移到细胞质中 mRNA 结合上核糖体开始翻译成蛋白质蛋白执行基因的功能.

第三章病毒的遗传与进化第一节突变第二节诱变第三节基因重组第四节病毒基因产物间的相互作用.

编译原理总结. 基本概念  编译器、解释器  编译过程、各过程的功能  编译器在程序执行过程中的作用  编译器的实现途径.

§8-3 电场强度一、电场近代物理证明：电场是一种物质。它具有能量、动量、质量。电荷电场电荷电场对外的表现 : 1) 电场中的电荷要受到电场力的作用 ; 2) 电场力可移动电荷作功.

1 分子生物学技术. 2 第一节、重组 DNA 技术 - 基因工程第二节、分子杂交及相关技术第三节、聚合酶链反应的原理和应用第四节、基因定位的常用方法.

Department of Mathematics 第二章解析函数第一节解析函数的概念与 C-R 条件第二节初等解析函数第三节初等多值函数.

氧族元素第一课时. 氧族元素包含元素氧族元素包括氧 ( 8 O) 、硫 ( 16 S) 、硒 ( Se) 、碲 ( Te) 、钋 ( Po) 等氧 ( 8 O) 、硫 ( 16 S) 、硒 ( Se) 、碲 ( Te) 、钋 ( Po) 等氧族元素。它们的最外层电子、化学性质相似统称为.

Eukaryotic Gene Expression The “More Complex” Genome.

Human Genetics The Human Genome 1.

Selfish DNA Honors Genetics.

Chapter 2 Chromosome Action 第二章遗传的染色体基础 2.1 Sexual Reproduction 2.2 Chromosome Morphology and Number Configuration and structure 形态与结构 Number.

Chapter 11-b Transposon.

Genome Organization & Evolution. Chromosomes Genes are always in genomic structures (chromosomes) – never ‘free floating’ Bacterial genomes are circular.

Genetics: Chromosome Organization. Chromosomes: Structures that contain the genetic material (DNA) Genome – complete set of genetic material in a particular.

Chapter 21 Eukaryotic Genome Sequences

1 、如果 x ＋ 5 ＞ 4 ，那么两边都可得 x ＞－ 1 2 、在－ 3y ＞－ 4 的两边都乘以 7 可得 3 、在不等式 — x≤5 的两边都乘以－ 1 可得 4 、将－ 7x — 6 ＜ 8 移项可得。 5 、将 5 + a ＞－ 2 a 移项可得。 6 、将－ 8x ＜ 0.

BACTERIAL TRANSPOSONS

名探柯南在侦查一个特大盗窃集团过程中，获得藏有宝物的密码箱，密码究竟是什么呢？请看信息： ABCDEF( 每个字母表示一个数字 ) A ：是所有自然数的因数 B ：既有因数 5 ，又是 5 的倍数 C ：既是偶数又是质数 D ：既是奇数又是合数 EF ：是 2 、 3 、 5 的最小公倍数.

MeiosisMeiosis 减数分裂 Learning Objectives Definition of MeiosisDefinition of Meiosis Processes of MeiosisProcesses of Meiosis Significance of MeiosisSignificance.

§10.2 对偶空间一、对偶空间与对偶基二、对偶空间的有关结果三、例题讲析.

《高中生命科学》有丝分裂澄衷高级中学邓敏 2008 年 12 月. 人的胚胎发育过程受精卵个体个体.

请同学们仔细观察下列两幅图有什么共同特点？如果两个图形不仅形状相同，而且每组对应点所在的直线都经过同一点, 那么这样的两个图形叫做位似图形, 这个点叫做位似中心.

7 生产费用在完工产品与在产品之间分配的核算. 2 第七章生产费用在完工产品与在产品之间的分配  知识点 :  理解在产品的概念  掌握生产费用在完工产品与在产品之间的分配.

Mobile DNA  Transposons By Anna Purna

Lecture 10 Genes, genomes and chromosomes

力的合成力的合成一、力的合成二、力的平行四边形上一页下一页目录退出. 一、力的合成 O. O. 1. 合力与分力我们常常用一个力来代替几个力。如果这个力单独作用在物体上的效果与原来几个力共同作用在物体上的效果完全一样，那么，这一个力就叫做那几个力的合力，而那几个力就是这个力的分力。

河南济源市沁园中学前进中的沁园中学欢迎您 ! 温故知新： 1 、什么是原子？ 2 、原子是怎样构成的？ 3 、原子带电吗？为什么？

David Sadava H. Craig Heller Gordon H. Orians William K. Purves David M. Hillis Biologia.blu B – Le basi molecolari della vita e dell’evoluzione The Eukaryotic.

个体精子卵细胞父亲受精卵母亲人类生活史问题：人类产生配子（精、卵细胞）是不是有丝分裂？

The Secret of Life! DNA. 2/4/20162 SOMETHING HAPPENS GENE PROTEIN.

八. 真核生物的转录㈠特点 ① 转录单元为单顺反子（ single cistron ），每个蛋白质基因都有自身的启动子，从而造成在功能上相关而又独立的基因之间具有更复杂的调控系统。 ② RNA 聚合酶的高度分工，由 3 种不同的酶催化转录不同的 RNA 。 ③ 需要基本转录因子与转录调控因子的参与，这.

第 11 章旋转电机交流绕组的电势和磁势内容提要内容提要  旋转磁场是交流电机工作的基础。  在交流电机理论中有两种旋转磁场： (1) 机械旋转磁场（二极机械旋转磁场，四极机械旋转磁场） (2) 电气旋转磁场（二极电气旋转磁场，四极电气旋转磁场）二极机械旋转磁场四极机械旋转磁场二极电气旋转磁场四极电气旋转磁场.

欢迎使用《工程流体力学》多媒体授课系统燕山大学《工程流体力学》课程组. 第九章缝隙流动概述 9.1 两固定平板间的层流流动 9.2 具有相对运动的两平行平板间的缝隙流动 9.3 环形缝隙中的层流流动.

1 第三章数列数列的概念考点搜索 ●数列的概念 ●数列通项公式的求解方法 ●用函数的观点理解数列高考猜想以递推数列、新情境下的数列为载体, 重点考查数列的通项及性质, 是近年来高考的热点, 也是考题难点之所在.

第二节. 广告牌为什么会被风吹倒？结构的稳定性：指结构在负载的作用下维持其原有平衡状态的能力。它是结构的重要性质之一。

第九章核糖体 Robinson ＆ Brown （ 1953 ）发现于植物细胞。 Palacle （ 1955 ）发现于动物细胞。 Roberts （ 1958 ）建议命名为核糖核蛋白（ ribosome ），简称核糖体。核糖体是细胞内合成蛋白质的工厂，在一个旺盛生长的细菌中，大约有

你知道多细胞动物和人的生长发育是从什么细胞开始的吗 ? 受精卵分化肌肉细胞上皮细胞人体的各种细胞图.

Aim: How is DNA organized in a eukaryotic cell?. Why is the control of gene expression more complex in eukaryotes than prokaryotes ? Eukaryotes have:

Chromosome Organization & Molecular Structure. Chromosomes & Genomes Chromosomes complexes of DNA & proteins – chromatin Viral – linear, circular; DNA.

高频电子线路高频电子线路主讲元辉 5.5 晶体振荡器石英晶体振荡器的频率稳定度 1 、石英晶体谐振器具有很高的标准性。、石英晶体谐振器与有源器件的接入系数通常近似如下受外界不稳定因素的影响少。 3 、石英晶体谐振器具有非常高的值。维持振荡频率稳定不变的能力极强。

Organization of prokaryotic, eukaryotic and viral genomes

Genomes and their evolution

Transposable Elements

SGN23 The Organization of the Human Genome

Gene Density and Noncoding DNA

Presentation transcript:

Structure and function of genome

Genome and Gene gene is the basic functional unit of heredity in a living organism. Its nature is the nucleic sequence encoding a polypeptide or protein . Gene determines amino acid sequences of a polypeptide, and also determines the cell-specific traits. rRNA, tRNA, also have their own gene. genome is the entirety of an organism's hereditary information. It is encoded either in DNA or, for many types of virus, in RNA. The genome includes both the genes and the non-coding sequences of the DNA of haploid. The human genome contains 24 chromosomes.

Section 1 genome of virus The main types of virus genome : Double-stranded DNA: SV40, adenovirus, herpes virus. Single-stranded DNA: parvovirus, M13 phage Double-stranded RNA: retrovirus. Plus-strand RNA: polio virus, corona virus. Minus-strand RNA: rabies virus, influenza virus, measles virus. Reverse transcription virus: specific taxa, such as HIV, HCV.

genome of SV40 virus Double-stranded circular DNA Regions of early genes and late genes. Early genes: T antigen and t antigen. Late gene: VP1, VP2, VP3. there are regulatory regions between early genes and late genes : including origin of replication, promoter, enhancer.

genome of SV40 virus There is the phenomenon of alternative splicing and overlapped genes

retrovirus Carrying two identical positive-stranded RNA. bind two tRNA in host cell The structural proteins of virus : envelope protein(env), Capsid protein (gag). Reverse transcriptase (pol)

genome of Retrovirus Coding region containing three genes: gag, pol and env. Non-coding region: R region: 20 ~ 80 nucleotide repeats. PBS: primer binding sites, binding to tRNA as a primer. U region :promoter in U3 and polyadenylation signal in U5. provirus: long terminal repeat (LTR ) in the end.

genome is simple, with a small number of coding genes genome is simple, with a small number of coding genes. The genome size of different viruses varied significantly. Hepatitis B virus: 3.2kb; pox virus: 300kb. Genomes of different viruses have individual natures of nucleic acid. Most of genes is single copy with single-stranded nucleic acid. Retrovirus: diploid. Influenza virus: 8 single-stranded RNA. Retrovirus: 10 double-stranded RNA.

Most of the genome is coding sequences, a part of it is regulatory sequence, and very few of it is structure sequence. Replication and transcription of virus depend on host cells. Eukaryotic viruses can contain introns, while bacteria and viruses not. Alternative splicing happened in viral genome and produced several kinds of mRNA from one transcript. gene overlapping is common. A sequence may have two kinds of open reading frame, resulting protein with very different amino acid.

Section 2 Genome of prokaryotes Containing the complete set of genome to ensure their own metabolism and reproduction. The survival of mycoplasma, chlamydia etc depends on the host. Containing genes which can regulate their own growth and metabolism based on the environmental change. No differentiation and development in prokaryotes ,and the number of gene is smaller than that of eukyrocytes.

Genome of E. coli Size of E.coli:4.6 × 106bp, 4288 ORF; 2584 operons. The average size of the gene:951bp. The average interval between genes:118bp

features of prokaryotic genome Genome usually consists of double-stranded circular DNA. Prokaryotic DNA does not form a chromosome. There is no nucleus, but there is a nucleoid where DNA concentrate. The average size of genes is around 106~107 bp. The number of genes is fewer

the features of structure and function of prokaryotic genome An operon is a functioning unit of genomic material containing a cluster of genes under the control of a single regulatory signal or promoter. The genes are transcribed together into a mRNA strand and either translated together in the cytoplasm. Polycistronic mRNA:a single mRNA molecule that codes for more than one protein

The majority of the genome is single sequence, and rarely duplicated The majority of the genome is single sequence, and rarely duplicated. rRNA gene are multiple copies. Isozymes in genome: E. coli has three acetolactate synthase, and two branches mutase. The majority of sequences are coding sequence, with a very few non-coding sequences. There is a certain regulatory sequences which often contained inverted repeat. Most genes are in the state of expression.

Plasmid DNA A plasmid is an extra chromosomal DNA molecule separate from the chromosomal DNA which is capable of replicating independently from the chromosomal DNA. In many cases, it is circular and double-stranded. Plasmids usually occur naturally in bacteria, Its size varies from 1.5 to 15 kb.

to classify plasmids is by function. There are 3 main classes: Fertility-F-plasmids, which contain tra-genes. They are capable of conjugation (transfer of genetic material between bacteria which are touching). Resistance-(R)plasmids, which contain genes that can build a resistance against antibiotics or poisons and help bacteria produce pili. Col-plasmids, which contain genes that code for (determine the production of) bacteriocins, proteins that can kill other bacteria.

F factor Conjugation: transfer of genetic material between bacteria which are touching

Transposable elements Transposable elements ：the genetic material of genome that can move independently They can cause the changes of genome structure and gene sequences

The types of transposable elements insertion sequence transposon transposable bacteriophage。

Insertion sequence An insertion sequence is a short DNA sequence that acts as a simple transposable element. IS have two major characteristics: they are small, generally around 700 to 2500 bp in length only code for proteins implicated in the transposition. These proteins are usually the transposase which catalyses the enzymatic reaction allowing the IS to move, and also one regulatory protein which either stimulates or inhibits the transposition activity. The coding region in an IS is usually flanked by inverted repeats. Frequency of translocation is 10-7

Transposon Transposons are sequences of DNA that can move around to different positions within the genome of a single cell, a process called transposition. transposons, which carry transposase gene and accessory genes such as antibiotic resistance genes In the process, they can cause mutations and change the amount of DNA in the genome. Transposons were also once called jumping genes, and are examples of mobile genetic elements. They were discovered by Barbara McClintock early in her career, for which she was awarded a Nobel Prize in 1983.

Genetic effects of transposable elements 1 The transposition of a transposable element is not movement of itself, but to copy a new copy of the gene. 2 When transposition occurred, the target sequence doubled, and located on both sides of transposable elements to form direct repeat sequences 3 form the co-integrate in the process of transposition 4 chromosomal aberrations possibly occurred 5 transposable elements can be excised from the original location

Transposons are mutagens Transposons are mutagens. They can damage the genome of their host cell in different ways: A transposon that inserts itself into a functional gene will most likely disable that gene. After a transposon leaves a gene, the resulting gap will probably not be repaired correctly. Multiple copies of the same sequence, such as Alu sequences can hinder precise chromosomal pairing during mitosis and meiosis, resulting in unequal crossovers, one of the main reasons for chromosome duplication. Diseases that are often caused by transposons include hemophilia A and B, severe combined immunodeficiency, porphyria, predisposition to cancer, and Duchenne muscular dystrophy. Additionally, many transposons contain promoters which drive transcription of their own transposase. These promoters can cause aberrant expression of linked genes, causing disease or mutant phenotypes

Significance of bacterial genomics research To shed more light on the characteristics of pathogenic microorganisms and pathogenic mechanism. To provide more convenient tools for the discovery of disease-causing genes To Reveal more pathogen-specific sequence, and to improve the accuracy of identification of pathogens. To provide a basis for the discovery of vaccines and screening of durgs

Section 3 Eukaryotic genomes Most eukaryotes are multi-cellular organisms, with the complex phenomenon of differentiation and development. Eukaryotes have more genes and more complex regulation mechanism than that in prokaryotes Eukaryotes have a nucleus, and the genome in the nucleus bind to histone proteins to form chromatin. Mitochondria and chloroplasts of the eukaryotic also have their own genetic material.

There are 280 kinds of eukaryotic genome project, of which 19 kinds have been completed, including 3 kinds of plants, 9 kinds of fungi, 3 kinds of protozoa, Caenorhabditis elegans, Drosophila, mouse, human

The structural characteristics of eukaryotic genomes Linear double-stranded DNA, and each species has a fixed number of chromosomes. eukaryotic cells are generally diploid. Yeast has both haploid and diploid states. Haploid and polyploid widely exist in eukaryotic species . Structure of eukaryotic genomes is complex, and the number of genes is large. The size of the human genome is about 1000 times bigger than that of E. coli. The number of human genes is about 10 times more than that of E. coli.

An mRNA molecule is said to be monocistronic when it contains the genetic information to translate only a single protein. polycistronic mRNA carries the information of several genes, which are translated into several proteins. These proteins usually have a related function and are grouped and regulated together in an opero rRNA and tRNA mRNA are polycistronic. There is no operon, and function-related genes are often sparse in different parts of the genome. α-globin gene locates in chromosome 16. β-globin gene locates in chromosome 11.

The vast majority of genome is non-coding sequence, with the role of forming structure and regulation. coding sequences less than 10%. Size of the human genome is 3 × 109 bp, with only 3 × 104 genes, the average size of genes is 105 bp. Containing a large number of repetitive sequences. Highly repetitive sequences: 105 or more Moderately repetitive sequence :10-104 Single-copy sequence: less than 10

The structural characteristics of eukaryotic genomes Eukaryotic genes are split genes An intron is a DNA region within a gene that is not translated into protein The non-coding sequences within genes. exon can refer to the sequence in the DNA or its RNA transcript

The structural characteristics of eukaryotic genomes A gene family is a set of genes with a known homology. They are generally biochemically similar. Globin gene family (α, β, γ, δ, ε, ζ). Superfamily: gene members shared structural homology and different function.

The structural characteristics of eukaryotic genomes A gene cluster is a set of two or more genes that serve to encode for the same or similar products. Because populations from a common ancestor tend to possess the same varieties of gene clusters, they are useful for tracing back recent evolutionary history. Histone gene cluster: 5 kinds of genes clustered in tandem, and there are multiple copies.

α-globin gene cluster An example of a gene cluster is the Human a-globin gene cluster, which contains 3 functional genes and 3 non-functional gene for similar proteins α1, α2: α gene duplicate with adult expression. ξ: embryonic genes. ψξ, ψα1, ψα2: pseudo-genes, 75% homology with α, accumulate much mutations ,so it can not be expressed.

β -globin gene cluster ε: expressed in early embryonic stage. γ: in embryonic stage. δ: express at birth with a extremely low level. β: key protein expressed in adult. ψβ, ψβ1: pseudogene.

structural characteristics of eukaryotic genomes Eukaryotic genomes are highly variable. During meiosis, association and exchange occurred in homologous chromosome Eukaryotic genomes also have mobile genetic material. transposon The human genome contains a large number of transposon, most of which have been inactivated by mutation.

The structure of eukaryotic genomes

the feature of structure of the human genome

Features of human genome Genes There are estimated ca. 54,000 human protein-coding genes. The number of human genes seems to be less than a factor of two greater than that of many much simpler organisms, such as the roundworm and the fruit fly. human cells make extensive use of alternative splicing to produce several different proteins from a single gene, and the human proteome is thought to be much larger than those of the afore mentioned organisms Besides, most human genes have multiple exons, and human introns are frequently much longer than the flanking exons Human genes are distributed unevenly across the chromosomes. Each chromosome contains various gene-rich and gene-poor regions, which seem to be correlated with chromosome bands and GC-content. The significance of these nonrandom patterns of gene density is not well understood.In addition to protein coding genes, the human genome contains thousands of RNA genes, including tRNA, rRNA, microRNA, and other non-coding RNA genes.

The composition of the human genome The known coding sequence is only about 1.5%, there are a large number of interval sequence between the genes, insertion sequence and repetitive sequence within the gene. Coding sequence: coding proteins and a variety of RNA, and part of the coding sequences is. Non-coding sequences include: Regulatory sequences: promoter, enhancer and so on. Intron: it also contain regulatory sequences. Interval sequence: Junction area between genes. Repetitive sequences.

the repetitive sequences of the human genome Inverted repeat sequence Tandem repeat sequence: satellites, small satellites, mini-satellite, micro-satellite DNA. Gene cluster: group proteins, rRNA, tRNA and so on. Interspersed repeated sequence: Alu family, Kpn family and so on. Single-copy sequence: gene coding sequences and spacer sequences.

Satellite DNA consists of highly repetitive DNA, and is so called because repetitions of a short DNA sequence tend to produce a different frequency of the nucleotides adenine, cytosine, guanine and thymine, and thus have a different density from bulk DNA - such that they form a second or 'satellite' band when genomic DNA is separated on a density gradient. Type Size of repeat unit (bp) Location α (alphoid DNA) 171 All chromosomes β 68 Centromeres of chromosomes 1, 9, 13, 14, 15, 21, 22 and Y Satellite 1 25-48 Centromeres and other regions in heterochromatin of most chromosomes Satellite 2 5 Most chromosomes Satellite 3

A minisatellite is a section of DNA that consists of a short series of bases 10–60bp.These occur at more than 1000 locations in the human genome. Some minisatellites contain a central (or "core") sequence of letters “GGGCAGGANG” (where N can be any base) or more generally a strand bias with purines (Adenosine (A) and Guanine (G)) on one strand and pyrimidines (Cytosine (C) and Thymine (T)) on the other. It has been proposed that this sequence per se encourages chromosomes to swap DNA. In alternative models, it is the presence of a neighbouring cis-acting meiotic double-strand break hotspot which is the primary cause of minisatellite repeat copy number variations. Somatic changes are suggested to result from replication difficulties (which might include replication slippage, among other phenomena).

Microsatellites, Simple Sequence Repeats (SSRs), or tandem repeats, are repeating sequences of 1-6 base pairs of DNA.[1] Microsatellites are typically neutral and co-dominant. They are used as molecular markers in genetics, for kinship, population and other studies. They can also be used to study gene duplication or deletion

1. 反向重复顺序 Inverted repeat sequence 亦称倒位重复顺序（inverted repeats sequence）。两端反向重复，可形成发卡结构。无插入：GGTACC 有插入：GGTNNN…NNNACC 人类基因组有约 5％的反向重复顺序，大部分以单拷贝形式散布于整个基因组。常见于蛋白结合区与转录调控区。 Also known as inverted repeat sequence Inverted repeats at both ends can form a hairpin. Without insertion: GGTACC with insertion: GGTNNN ... NNNACC There is about 5% inverted repeat sequence in human genome , and the majority is the form of single copy spersed in the whole genome. Commonly found in protein-binding regions and the transcriptional regulatory region.

2. 串联重复顺序 Tandem repeat sequence 串联重复序列是一个固定的重复单位头尾相连形成的重复。串联重复序列约占基因组的 10%。将基因组打断后进行密度梯度离心时发现，称卫星 DNA。组蛋白基因，rRNA 基因等也属串联重复序列。 Tandem repeat sequence is duplication formed by a fixed repeat which is connected end to end Tandem repeat sequences account for about 10% of the genome. satellite DNA. Histone genes, rRNA genes also are tandem repeat.

卫星 DNA Satellite DNA 重复次数非常高，可达数百万。每一个重复序列簇有数千重复单元。按序列特征可分为Ⅰ、Ⅱ、Ⅲ、Ⅳ、α、β。每种类型有不同家族，其核心序列不同。原位杂交证实：各组卫星 DNA 主要位于异染色质，特别是中心粒。但很少具有染色体特异性。 II 和 III 分布于几乎分布于所有染色体。一些卫星 DNA 具有染色体特异性和区域特异性。 β 存在于 Y 染色体等的着丝粒区域。 α分布于所有染色体的着丝粒区域 Repetition number is very high, up to several millions. There are thousands of repeatitive units in each repeat cluster. It can be divided into Ⅰ, Ⅱ, Ⅲ, Ⅳ, α, β by sequence features . Each type has a different family whith different from its core sequence. In situ hybridization confirmed: Satellite DNA in each group are mainly located in heterochromatin, in particular the centriole. But rarely has a chromosome-specific. II and III is found in almost distributed in all chromosomes. Some satellite DNA has the chromosome-specific and regional specificity. β exists in the centromere region of Y chromosome. α is found in all the centromere region of chromosome 。

小卫星 DNA Small satellite DNA 可变数目串联重复： variable number of tandem repeats,VNTR 6－70 bp，串联成簇，重复几到几十次，个体间重复次数高度可变。端粒：位于染色体末端，具有保护作用。 TTAGGG 组成的重复序列，往往重复数千倍 variable number of tandem repeats : variable number of tandem repeats, VNTR 6-70 bp, tandem clustered, repeat a few to dozens of times, the repeated number is of highly variable among individuals. Telomere: At the end of chromosome and has a protective effect. The repeat sequences composed of TTAGGG often repeated thousands of times

微卫星 DNA Microsatellite DNA 又称短串联重复： short tandem repeats, STR 1-4 bp 串联重复。 2 bp 重复最常见，一般为 (AC)n 或 (TG)n。重复 10－60。当 n 大于 14 时，个体间重复次数高度可变。 STR 在基因组分布非常广泛。占约 5％，平均每 30－50 kb 就有一个 STR 序列。 Also known as short tandem repeats: short tandem repeats, STR 1-4 bp tandem repeats. the most common appearance is 2 bp duplication and is usually (AC) n or (TG) n. Repeat 10-60. When n is greater than 14, repeated number among individuals is highly variable. STR are widely distributed in the genome. Accounted for about 5%, there is a STR sequence averaging 30-50 kb.

3. 散在重复顺序 Interspersed repetitive sequence Interspersed repeated sequence。分散而非成簇，散布于整个基因组。约占基因组的 20％，包括一些重复基因，但大多数为非编码序列。多数散在重复序列是 retrotransposon，具有末端重复序列，但非 LTR。在哺乳类，按照其长度大致有两个家族： SINES: short interspersed nuclear elements LINES: long interspersed nuclear elements

SINES (short interspersed elements ) 在人类基因组，最常见的是 Alu 家族。人类基因组中含量最丰富的中度重复，有 70-100 万的 Alu 位点。平均 5kb 就有一个，约占基因组的 10％。具有很强的种属特异性，是人类基因组的标志。可被 AluI 分解为 130bp 与 170bp 两个片段，因而得名。 In the human genome, the most common one is the Alu family. Human genome, the most abundance is moderately repetition, and there are 70-100 million Alu site. There is one in average 5kb, about 10% of the genome. It is with highly species-specific and is a sign of the human genome. It can be divided into two 130bp and 170bp fragments by AluI, so the name comes out.

Alu 家 Alu family Alu 具有与 7SL RNA 同源的区域，可转录并参与翻译与蛋白质转运等的调控。 Alu 是一种不能自主转位的 retrotransposon，具有末端正向重复序列，但不编码转位相关基因。 Alu has the homologous regions with7SL RNA, may be involved in transcription and the regulation of translation , transport of protein and so on. Alu is an retrotransposon without ability of independent transposition It has a positive terminal repeat sequence, but do not encode transposition-related genes. Alu 家 Alu family

LINES (Long interspersed elements ) 在人类基因组中，最常见的是 L1 element 。约有 50000 个拷贝，占基因组的 15%。是一种自主转位的 retrotransposon，编码转位相关基因。 L1 有多种成员，在人类称 L1Hs/Kpn family 长度 6.4 kb，但很多有缺失。可被 KpnI 分解为 4 个片段，因而得名。 In the human genome, the L1 elemen is most common. There are about 50,000 copies, accounting for 15% of the genome. Is a kind of retrotransposonwith ability of independent transposition can code transposition-related genes. There are many members of the L1, called L1Hs/Kpn family in humans. With the Length of 6.4 kb, but many are missing. It can be broken down into four segments by KpnI, so is name comes out.

人类基因组的组成 The composition of the human genome

人类基因组的结构示意图 Schematic diagram of the human genome

（二）线粒体 DNA Mitochondrial DNA 长 16569 bp 的双链环状分子。共编码 2 个 rRNA，22 个 tRNA，13 个氧化磷酸化相关多肽。特征：母系遗传。遗传异质性。突变积累至一定比例才能产生效应，域值效应。基因排列紧密，对致变因素敏感。 It is the double-stranded circular molecule with the lenth of16569 bp. It totally encodes 2 rRNA, 22 Ge tRNA, 13 Ge oxidative phosphorylation related peptides. Features: Maternal inheritance. Genetic heterogeneity. Only when the mutation accumulate to certain proportion can the effect be generated,that is threshold effect. Gene arrange closely and is sensitive to the factors leading to mutations .

（三）DNA 多态性 DNA polymorphism 在特定的基因组位点，出现多种等位基因的现象。位点多态性：碱基组成差异造成，单核苷酸多态性 (SNP)。限制性片段长度多态性(RFLP)。 restriction fragment length polymorphism 串联重复多态性：DNA指纹。线粒体 DNA 多态性：人类起源的线索。 Multiple alleles can appear in specific genomic loci Polymorphism: Differences in base composition cause the single nucleotide polymorphism (SNP). Restriction fragment length polymorphism (RFLP). Tandem repeat polymorphism: DNA fingerprinting. Mitochondrial DNA polymorphism:it clues to human origins.

易感基因与环境的相互作用 interactions of susceptibility gene and environment HIV 与受体 CCR5 ApoE4与AD Asyn duplication与PD HIV and the receptor CCR5, ApoE4 and AD Asyn duplication and PD