Presentation is loading. Please wait.

Presentation is loading. Please wait.

GENBANK FILE FORMAT LOCUS –LOCUS NAME Is usually the first letter of the genus and species name, followed by the accession number –SEQUENCE LENGTH Number.

Similar presentations


Presentation on theme: "GENBANK FILE FORMAT LOCUS –LOCUS NAME Is usually the first letter of the genus and species name, followed by the accession number –SEQUENCE LENGTH Number."— Presentation transcript:

1 GENBANK FILE FORMAT LOCUS –LOCUS NAME Is usually the first letter of the genus and species name, followed by the accession number –SEQUENCE LENGTH Number of nucleotide base pairs in the sequence record –MOLECULE TYPE The type of molecule that was sequenced. Example: DNA –GENBANK DIVISION The three letter abbreviation that describes a records division –MODIFICATION DATE Date of last modification –Example:

2 GENBANK FILE FORMAT DEFINITION –Brief description of sequence –Includes information such as Source organism Gene name / protein name Sequence function ACCESSION –Unique identifier for a sequence record VERSION –Is in the format accession.version –GI GenInfo Identifier –Sequence identification number A new GI is assigned if sequence is altered KEYWORDS –Word or phrase describing the sequence –Are generally present in older records –If entry has no keyword, the field contains a period

3 GENBANK FILE FORMAT SOURCE –Organism name in an abbreviated form –ORGANISM Formal scientific genus and species name Lineage information according to the phylogenetic classification scheme REFERENCE –Publications that discuss reports on sequence data –Contains information such as: AUTHORS TITLE JOURNAL PUBMED –Refers to a PubMed identifier to link to a corresponding record

4 GENBANK FILE FORMAT FEATURES –Biologically significant regions in the sequence –SOURCE length of sequence scientific name of source organism Taxon ID (Taxonomy reference) –GENE Name assigned to the region of biological interest –CDS Coding Sequence Includes amino acid translation ORIGIN –May be blank –Or may give a local pointer to the sequence start

5 GENBANK RECORD LAYOUT

6 Genbank Record Summary Genbank is an annotated collection of all publicly available biological sequences The Genbank record format must be flexible enough to allow for biological data from numerous sources to be integrated without difficulty Genbank records contain comprehensive information on an entry, including information on the source, distinguishing characteristics, and information on journal articles pertaining to the entry


Download ppt "GENBANK FILE FORMAT LOCUS –LOCUS NAME Is usually the first letter of the genus and species name, followed by the accession number –SEQUENCE LENGTH Number."

Similar presentations


Ads by Google