20 Amino acids, their single-letter data-base codes (SLC), and their corresponding DNA codons |
Amino Acid |
SLC | DNA codons |
Isoleucine |
I |
ATT, ATC, ATA |
| Leucine | L |
CTT, CTC, CTA, CTG, TTA, TTG |
| Valine | V |
GTT, GTC, GTA, GTG |
| Phenylalanine | F |
TTT, TTC |
| Methionine | M | ATG |
| Cysteine | C | TGT, TGC |
| Alanine | A |
GCT, GCC, GCA, GCG |
| Glycine | G |
GGT, GGC, GGA, GGG |
| Proline | P |
CCT, CCC, CCA, CCG |
| Threonine | T |
ACT, ACC, ACA, ACG |
| Serine | S | TCT, TCC, TCA, TCG, AGT, AGC |
Tyrosine |
Y | TAT, TAC |
| Tryptophan | W | TGG |
| Glutamine | Q | CAA, CAG |
| Asparagine | N | AAT, AAC |
| Histidine | H |
CAT, CAC |
| Glutamic acid | E |
GAA, GAG |
Aspartic acid |
D |
GAT, GAC |
| Lysine | K |
AAA, AAG |
| Arginine | R |
CGT, CGC, CGA, CGG, AGA, AGG |
| Stop codons | Stop | TAA, TAG, TGA |
| In this table, the twenty amino acids found in proteins are listed, along with the single-letter code used to represent these amino acids in protein data bases. The DNA codons representing each amino acid are also listed. All 64 possible 3-letter combinations of the DNA coding units T, C, A and G are used either to encode one of these amino acids or as one of the three stop codons that signals the end of a sequence. While DNA can be decoded unambiguously, it is not possible to predict a DNA sequence from its protein sequence. Because most amino acids have multiple codons, a number of possible DNA sequences might represent the same protein sequence. | ||