30431 Introduktion til Bioinformatik
David Ussery
Tuesday, 17 April, 2001
cokeloreb.gif


DNA Structural Atlases for Sequenced Genomes


Overview

This lecture is about DNA structural analysis for the Escherichia coli genomes. In a real sense, it is a continuation of last Friday's talk. In the first part ("DNA is like Coca-cola"), I will talk about DNA Structures, and their possible biological meanings. Then, in the second half, I will talk about DNA structures and promoters (in E. coli of course as an example), and discuss structural organisation of promoters in bacterial genomes in general.

As with the last lecture, I have made separate file, containing specific LEARNING OBJECTIVES for this lecture, as well as a "self-test quiz", which I recommend having a look at, BEFORE the lecture, if possible.




DNA is like Coca-Cola
coke_logo.gif      
Coke bottle
Coke DNA Solubility
water water
-
Sugar (sucrose)  Sugar (dexoyribose)
VERY 
High
Phosphate 
(PO4- acid)
phosphate
moderate
caffeine bases 
(A,T,C,G)
extremely
low


Compare the structures of Caffeine: Caffeine

with Adenine: Adenine
Here's the structure of caffeine, flipped:

Caffeine is a "base analogue" of Adenine, and in fact can sometimes be incorporated into a growing DNA chain, instead of Adenine.  Caffeine is a weak mutagen, for this reason.

The VERY MOST important property contributing to DNA helix stability is the stacking of the base-pairs on top of one another, due to hydrophobic forces.  (Remember, the bases "hate" water, and are not very soluble.)  Free bases will stack on top of each other and form a helix in solution!  This type of process is called "self-assembly", where you just throw something in solution, and it fits together on its own, with no extra work needed.

DNAmovDNAmovDNAmovDNAmovDNAmovDNAmov


Properties of individual dinucleotide base pairs
Dinucleotide 
base pairs
Stacking energies 
(Kcal/mol bp)
twist angle
bp/turn
(GC).(GC)
-14.59
40.0
9.0
(AC).(GT)
-10.51
34.4
10.5
(TC).(GA)
-9.81
36.9
9.8
(CG).(CG)
-9.61
29.8
12.1
(GG).(CC)
-8.26
33.7
10.7
(AT).(AT)
-6.57
32.1
11.4
(TG).(CA)
-6.57
34.5
10.<
(AG).(CT)
-6.78
27.9
13.0
(AA).(TT)
-5.37
35.6
10.1
(TA).(TA)
-3.82
36.0
10.0
The base stacking energies are from: Ornstein,R.L., Rein,R., Breen,D.L., and MacElroy,R.D.,Biopolymers, 17:2341-2360,(1978).  The helical twist angles (and their implied bp/turn) are from Kabsch,Sander, and Trifonov,Nucleic Acids Research,10:1097-1104,(1982).



The Family of DNAs

A-, B-, and Z-DNAs
A-DNA (left), B-DNA (middle) and Z-DNA (right) -- 12 bp each
From Dickerson et al. in Cold Spring Harbor Symposium for Quantitative Biology (1982) v47 p13-24.





3 families of DNA helices:



A-DNA conformation

A-DNA family - this is most common for double stranded RNA, RNA/DNA hybrids, as well as for certain DNA sequences, such as long stretches of purines.















B-DNA conformation

B-DNA family - DNA exists in the "B-DNA form", most of the time inside the cells of living organisms. This is the classical "Watson-Crick" structure.





















Z-DNA conformation

Z-DNA family - this is much more rare than the other two families, although certains sequences (such as runs of GC repeats (GCGCGC)) can form Z-DNA easily.




















A-DNA and Z-DNA in Sequenced Chromosomes From All 5 Kingdoms



OrganismKingdomSizeA-DNA
Rn or Yn
where n>4 bp
A-DNA Atlas
Z-DNA
(YR)6 or (RY)6
Z-DNA Atlas
length dist.
plot
E. coli K-12Monera
(Bacteria)
4,639,221 bp
(complete)
16.7
A-DNA Atlas
0.1
Z-DNA Atlas
plot
P. abyssiMonera
(Archaea)
1,765,111 bp
(complete)
33.4
A-DNA Atlas
0.0
Z-DNA Atlas
plot
S. pombe chr. Fungi
(yeast)
5,325,148 bp
(~12 Mbp total)
25.7
A-DNA Atlas
0.2
Z-DNA Atlas
plot
L. major chr. 3Protista
(protozoa)
384,499 bp
(~40 Mbp total)
21.3
A-DNA Atlas
3.1
Z-DNA Atlas
plot
A. thaliana chr. 1 (bottom half)Plantae
(thale cress)
14,668,8831 bp
(~100 Mbp total)
27.0
-
0.5
-
plot
H. sapiens chr. 22Animalae
(humans)
34,601,435 bp
(~3000 Mbp total)
35.0
-
0.6
-
plot
Expected values
-
n bp
18.75%
-
0.32%
-
-



Link to a Table with comparison of purine and pyr/pur stretches in more than 700 sequenced chromosomes.


What is? Background - What is a "DNA Structural Atlas"?

DNA helix



three views of E. coli Three Views of Escherichia coli:


view 1 The DNA Structural Atlas     Structural Atlas

view 2 The Base-Composition Atlas     Base Atlas

view 3 The DNA Repeat Atlas     Repeat Atlas

      the combined view The Genome Atlas (combined view)     Genome Atlas
                  note: this has a "zoom" function!



Part 2 of the lecture - DNA structures and promoters



MORE ABOUT DNA:

A timeline of DNA/genetics

A List of Nobel Prizes won for work in DNA/genetics

BOOKS on DNA structures

A list of more than a THOUSAND BOOKS ABOUT DNA



Back to the lecture notes





back Back to Dave's Homepage Chromosome icon


last updated 19 April, 2001 by Dave Ussery