Protein Sequences. The Genetic Code The natural extension of the genetic code…

Slides:



Advertisements
Similar presentations
Chemistry 2100 Lecture 10.
Advertisements

Review.
Amino Acids PHC 211.  Characteristics and Structures of amino acids  Classification of Amino Acids  Essential and Nonessential Amino Acids  Levels.
A Ala Alanine Alanine is a small, hydrophobic
Review of Basic Principles of Chemistry, Amino Acids and Proteins Brian Kuhlman: The material presented here is available on the.
Lactate dehydrogenase + 38 ATP + 2 ATP. How does lactate dehydrogenase perform its catalytic function ?
Review: Amino Acid Side Chains Aliphatic- Ala, Val, Leu, Ile, Gly Polar- Ser, Thr, Cys, Met, [Tyr, Trp] Acidic (and conjugate amide)- Asp, Asn, Glu, Gln.
Metabolic fuels and Dietary components Lecture - 2 By Dr. Abdulrahman Al-Ajlan.
FUNDAMENTALS OF MOLECULAR BIOLOGY Introduction -Molecular Biology, Cell, Molecule, Chemical Bonding Macromolecule -Class -Chemical structure -Forms Important.
5’ C 3’ OH (free) 1’ C 5’ PO4 (free) DNA is a linear polymer of nucleotide subunits joined together by phosphodiester bonds - covalent bonds between.
An overview of amino acid structure Topic 2. Biomacromolecule A naturally occurring substance of large molecular weight e.g. Protein, DNA, lipids etc.
Protein-a chemical view A chain of amino acids folded in 3D Picture from on-line biology bookon-line biology book Peptide Protein backbone N / C terminal.
1 Levels of Protein Structure Primary to Quaternary Structure.
Amino Acids and Proteins 1.What is an amino acid / protein 2.Where are they found 3.Properties of the amino acids 4.How are proteins synthesized 1.Transcription.
Amino Acids, Peptides, Protein Primary Structure Chapter 3.
Amino Acids, Peptides, Protein Primary Structure
Basic protein structure and stability I: Formation of peptide bonds/ properties of amino acids Biochem 565, Fall /25/08.
Amino Acids, Peptides, Protein Primary Structure
Lipids A. Classified based on solubility (like dissolves like) 1. insoluble in polar solvents 2. soluble in nonpolar solvents 3. lipids are hydrophobic.
ProteinStructuralDatabases. Proteins are built from amino-acids. Introduction H | NH2-c-CO2H | R.
Protein: Linear chain of amino acids called residues (4 in this toy protein) Ser Trp Leu O N N N N O O C C C C O O CαCα CαCα CαCα CαCα Lys H H H H H The.
Protein Structure Elements Primary to Quaternary Structure.
Protein Structure FDSC400. Protein Functions Biological?Food?
You Must Know How the sequence and subcomponents of proteins determine their properties. The cellular functions of proteins. (Brief – we will come back.
Amino Acids, Peptides, and Proteins.. Classification of Amino Acids.
Lesson today Protein Teacher : Isroli Laboratory : Animal Physiology and Biochemistry Faculty : Animal Agriculture Diponegoro University.
Proteins. The central role of proteins in the chemistry of life Proteins have a variety of functions. Structural proteins make up the physical structure.
Marlou Snelleman 2012 Proteins and amino acids. Overview Proteins Primary structure Secondary structure Tertiary structure Quaternary structure Amino.
Amino Acids and Peptides
Proteins: Levels of Protein Structure Conformation of Peptide Group
Lecture 1 Amino acids.
Proteins are polymers of amino acids.
Proteins account for more than 50% of the dry mass of most cells
Chapter Three Amino Acids and Peptides
Proteins account for more than 50% of the dry mass of most cells
What are proteins? Proteins are important; e.g. for catalyzing and regulating biochemical reactions, transporting molecules, … Linear polymer chain composed.
The most important secondary structural elements of proteins are: A. α-Helix B. Pleated-sheet structures C. β Turns The most common secondary structures.
BIOCHEMISTRY REVIEW Overview of Biomolecules Chapter 4 Protein Sequence.
1.Overall amino acid structure 2.Amino acid stereochemistry 3.Amino acid sidechain structure & classification 4.‘Non-standard’ amino acids 5.Amino acid.
AMINO ACIDS.
The Organic Chemistry of Amino Acids,
Amino Acids & Side Groups Polar Charged ◦ ACIDIC negatively charged amino acids  ASP & GLU R group with a 2nd COOH that ionizes* above pH 7.02nd COOH.
Proteins – Amides from Amino Acids
Secondary structure prediction
Amino Acids are the building units of proteins
1 10/26/2015 MOLECULES. 2 10/26/2015 H 2 N-CH-C-OH O R Monomer E.g. protein Monomer vs polymer amino acid monomer R is a side group.
Amino acids Met dank aan Dr. Detke.
Amino acids. Essential Amino Acids 10 amino acids not synthesized by the body arg, his, ile, leu, lys, met, phe, thr, trp, val Must obtain from the diet.
Amino acids structure. Configuration of Amino Acids.
Marlou Snelleman 2011 Proteins and amino acids. Overview Proteins Primary structure Secondary structure Tertiary structure Quaternary structure Amino.
Proteins.
Proteins Structure of proteins Proteins are made of C, H, O and nitrogen and may have sulfur. The monomers of proteins are amino acids An amino acid.
Chapter 3 Proteins.
Biochemistry I Chapter 4 Amino Acids revised 9/5/2013
X-ray detection xray/facilities.html.
Amino Acids  Amino Acids are the building units of proteins. Proteins are polymers of amino acids linked together by what is called “ Peptide bond” (see.
Levels of Protein Structure. Why is the structure of proteins (and the other organic nutrients) important to learn?
Levels of Protein Structure. Why is the structure of proteins (and the other organic nutrients) important to learn?
Proteins chemistry. Proteins What are the many functions of proteins? What are amino acids? What are zwitterions? What determines the characteristics.
PHR202: Biochemistry and Molecular Biology Mir Ishruna Muniyat.
Amino Acid & Basic Classification
Protein Structure FDSC400. Protein Functions Biological?Food?
Haixu Tang School of Inforamtics
Fig. 5-UN1  carbon Amino group Carboxyl group.
Packet #9 Supplement.
Amino Acids Amine group -NH2 Carboxylic group -COOH
Packet #9 Supplement.
Proteins Genetic information in DNA codes specifically for the production of proteins Cells have thousands of different proteins, each with a specific.
Chapter Three Amino Acids and Peptides
Levels of Protein Structure
Presentation transcript:

Protein Sequences

The Genetic Code

The natural extension of the genetic code…

1.Overall amino acid structure 2.Amino acid stereochemistry 3.Amino acid sidechain structure & classification 4.‘Non-standard’ amino acids 5.Amino acid ionization 6.Formation of the peptide bond 7.Disulfide bonds 8.Comparing protein sequences to describe evolutionary processes.

Q: How many amino acids are there?   

 The twenty alpha-amino acids that are encoded by the genetic code share the generic structure…

Atom nomenclature within amino acids (as used within the PDB) CA CB C O N OG1CG2

77 CACBCGCDCE NZ C O, OXT N

ATOM 1 N PRO A N ATOM 2 CA PRO A C ATOM 3 C PRO A C ATOM 4 O PRO A O ATOM 5 CB PRO A C ATOM 6 CG PRO A C ATOM 7 CD PRO A C ATOM 8 N LYS A N ATOM 9 CA LYS A C ATOM 10 C LYS A C ATOM 11 O LYS A O ATOM 12 CB LYS A C ATOM 13 CG LYS A C ATOM 14 CD LYS A C ATOM 15 CE LYS A C ATOM 16 NZ LYS A N ATOM 17 N ALA A N ATOM 18 CA ALA A C The.pdb file format Atom numberAtom nameResidue nameChain IDResidue number X-coordinateY-coordinateZ-coordinateOccupancy B-factor (aka Temp factor) Atom type Record name

Lys Arg To Do: Learn how to name the atoms of all amino acids. Hint: look at any generic PDB file to get a list of atom types. -The alpha carbon (CA) is immediately adjacent the most oxidized carbon (which is the CO 2 - in amino acids) -All the other heavy nuclei are named according to the Greek alphabet. -Put otherwise, LYS can be described by: CA, CB, CG, CD, CE, and NZ. Atom nomenclature within amino acids (as used within the PDB)

Numbers are used to discriminate between similar positions… CB CG OD1 ND2 CB CG ND1 CE1NE2 CD2 Here are some harder examples… CB CG CD2 CE2 CZ OH CD1 CE2 CB CG CD2 CD1 NE1 CE2 CH2 CE3 CZ2 CZ3 CB CD2 CD1 CG CB OG1CG2

Side-chain torsion angles -With the exception of Ala and Gly, all sidechains also have torsion angles. -To Do on your own: -Count the # of chi’s in each amino acid. -Determine why Ala doesn’t have a chi angle.

1.Overall amino acid structure 2.Amino acid stereochemistry 3.Amino acid sidechain structure & classification 4.‘Non-standard’ amino acids 5.Amino acid ionization 6.Formation of the peptide bond 7.Disulfide bonds 8.Comparing protein sequences to describe evolutionary processes.

Fischer projection

1.Overall amino acid structure 2.Amino acid stereochemistry 3.Amino acid sidechain structure & classification 4.‘Non-standard’ amino acids 5.Amino acid ionization 6.Formation of the peptide bond 7.Disulfide bonds 8.Comparing protein sequences to describe evolutionary processes.

Terminologies Hydrophobic: Amino acids are those with side chains that do not like to reside in an aqueous environment. Hence, these amino acids buried within the hydrophobic core of the protein. –Aliphatic: Hydrophobic group that contains only carbon or hydrogen atoms. –Aromatic: A side chain is considered aromatic when it contains an aromatic ring system. Polar: Polar amino acids are those with side-chains that prefer to reside in an aqueous environment and hence can be generally found exposed on the surface of a protein.

It’s actually a bit more complicated…

-OH -SH Twenty Amino acids Hydrophobic (non polar) Polar Polar NeutralCharged Aromatic (PHE, TRP) Aliphatic (ALA, VAL, LEU, ILE, MET, PRO) AmideAcidic Basic (ASN, GLN) (THR, SER) (CYS) (ASP, GLU) (HIS, LYS,ARG) TYR: Amphipathic GLY: Unclassifiable HINT: You should definitely know this!!!

1.Overall amino acid structure 2.Amino acid stereochemistry 3.Amino acid sidechain structure & classification 4.‘Non-standard’ amino acids 5.Amino acid ionization 6.Formation of the peptide bond 7.Disulfide bonds 8.Comparing protein sequences to describe evolutionary processes.

Not uncommon amino acids in biochemistry, but they are not encoded within the genetic code (meaning not incorporated into proteins)…

1.Overall amino acid structure 2.Amino acid stereochemistry 3.Amino acid sidechain structure & classification 4.‘Non-standard’ amino acids 5.Amino acid ionization 6.Formation of the peptide bond 7.Disulfide bonds 8.Comparing protein sequences to describe evolutionary processes.

1.Overall amino acid structure 2.Amino acid stereochemistry 3.Amino acid sidechain structure & classification 4.‘Non-standard’ amino acids 5.Amino acid ionization 6.Formation of the peptide bond 7.Disulfide bonds 8.Comparing protein sequences to describe evolutionary processes.

Primary structure = the complete set of covalent bonds within a protein

Polypeptides Linear arrangement of n amino acid residues linked by peptide bonds. Polymers composed of two, three, a few, and many amino acid residues are called as dipeptides, tripeptides, oligopeptides and polypeptides. Proteins are molecules that consist of one or more polypeptide chains.

Q: why is the pentapeptide SGYAL different than LAYGS?

Amino acid to Dipeptide Amino Acid 1 Amino Acid 2 Peptide bond is the amide linkage that is formed between two amino acids, which results in (net) release of a molecule of water (H 2 O). The four atoms in the yellow box form a rigid planar unit and, as we will see next, there is no rotation around the C-N bond. Peptide bond Note: this chemistry will not work as drawn!

The peptide bond has a partial double bond character, estimated at 40% under typical conditions. It is this fact that makes the peptide bond planar and rigid.

A quick aside… A horrible leaving group A viable leaving group + +..

1.Overall amino acid structure 2.Amino acid stereochemistry 3.Amino acid sidechain structure & classification 4.‘Non-standard’ amino acids 5.Amino acid ionization 6.Formation of the peptide bond 7.Disulfide bonds 8.Comparing protein sequences to describe evolutionary processes.

--The primary structure is a complete description of the covalent bond network within a protein. --This is almost(!) completely described by the sequence of amino acids. --If you know that the protein is AVG…, you can look up the structures of A, V and G, plus what you know about peptide bonding allows you to complete the covalent bond structure. --So, when does the primary structure not fully describe the covalent bond network? --BTW, this is a HUGE pet peeve of mine…there is no such thing as a primary sequence, despite its rather common usage (including in journal article titles…UGG!). A primary sequence implies a secondary sequence, which is nonsense. While there is of course primary, secondary, tertiary and quaternary structures, there is only the “sequence”.

1.Overall amino acid structure 2.Amino acid stereochemistry 3.Amino acid sidechain structure & classification 4.‘Non-standard’ amino acids 5.Amino acid ionization 6.Formation of the peptide bond 7.Disulfide bonds 8.Comparing protein sequences to describe evolutionary processes.

Multiple sequence alignments Given the sequences: INDUSTRY INTERESTING IMPORTANT One example of a MSA is:But is it better than: IN-DUST--RYINDU--ST-RYINTERESTING IMPOR--TANTIMPOR-T-ANT

Multiple sequence alignments I-N-DU-ST-RYI--NDU-ST-RY- I-NTERESTINGI--NTERESTING IMPO-R--TANTI-MPO-R--TANT IN-DUTS--RYINDU--ST-RYINTERESTING IMPOR--TANTIMPOR-T-ANT I-NDUS--T-RY-I-N--D--U-S-T-RY INT-ERES-TINGI-N-TE-RE-S-TING IMPOR--TAN--T-M-PO--RTA-NT---

Multiple sequence alignments Possible MSAEntire column can NOT have only gaps! I-N-DU-ST-RYI--NDU-ST-RY- I-NTERESTINGI--NTERESTING IMPO-R--TANTI-MPO-R--TANT Can NOT move residues aroundPossible IN-DUTS--RYINDU--ST-RYINTERESTING IMPOR--TANTIMPOR-T-ANT Very few matches!Too many gaps! I-NDUS--T-RY-I-N--D--U-S-T-RY INT-ERES-TINGI-N-TE-RE-S-TING IMPOR--TAN--TIM-PO--RTA-NT---

Which alignment pairs make the most sense? AVGTLE VLASID AVGTLE EKWVKV VS. A-VT-G-R-L-E AA-TA-Q-V-IE AVTG-RLE AATAQ-IE VS. AVWF----VLIM ALWFAMVFILIM ESQG----KTD DTQADGKCRTD VS. More similar amino acids Fewer gapsGap location makes more sense because gaps are less frequent in nonpolar regions.

A multiple sequence alignment: -CAPSRPLNENDDGR-QAFELIGTAVNM... -CVPGRGEMEHDD-RDQVLELFGTVVNL... -AVPKRAALQNDDGR-QGWELYGTVSAQ... -AVPTKMNCFNDDGR-QSVNLIGTVSGN... -ILPARTSMCNDDGR-QTIEMKGTPAGG... --APGK--NGHKLV--Q-FELKGTYSRT... AFAPRRIKMVNKLGR-QNFTLLGTFERT... AYRPDRCNTCNKLGR-QDVELMGTDART... -YRPEEWFGENKLGR-QSAELIGTDERS... --APL-ETYWPKLGR-QTGALAGTNSAV... --RPY-KAGWNKLGR-QSYELGGTNPYI PARAKNMG---R-QSYHL--TMEWQ... Chothia & Lesk. EMBO J. 5: (1986).

AN EXAMPLE MULTIPLE SEQUENCE ALIGNMENT. Conserved residues are indicated by color. Note that gaps tend to cluster together. Also gaps at the N- and C-terminal ends are more common. Why?

REGULAR EXPRESSIONS AND SEQUENCE LOGOS. Regular expressions provide a coarse-grain summary of an alignment segment. Sequence logos essentially due the same, but without information loss (cf.

A PHYLOGENETIC TREE DESCRIBES AN EVOLUTIONARY PROCESS. But from a more pragmatic viewpoint, it also visually describes the similarities and dissimilarities between sequences within a multiple alignment.