DTU Course number 27011
Introduction to Bioinformatics
David Ussery
Tuesday, 11 November, 2001 (Part 2)
cokeloreb.gif


DNA Structural Atlases for Sequenced Genomes


Overview

This lecture is about DNA structural analysis for the Escherichia coli genomes. In a real sense, it is a continuation of the previous talk from earlier today. In the first part ("DNA is like Coca-cola"), I will talk about DNA Structures, and their possible biological meanings. Then, in the second half, I will talk about DNA structures and promoters (in E. coli of course as an example), and discuss structural organisation of promoters in bacterial genomes in general.

As with the last lecture, I have made separate file, containing specific LEARNING OBJECTIVES for this lecture, as well as a "self-test quiz", which I recommend having a look at, BEFORE the lecture, if possible.




DNA is like Coca-Cola
coke_logo.gif      
Coke bottle
Coke DNA Solubility
water water
-
Sugar (sucrose)  Sugar (dexoyribose)
VERY 
High
Phosphate 
(PO4- acid)
phosphate
moderate
caffeine bases 
(A,T,C,G)
extremely
low


Compare the structures of Caffeine: Caffeine

with Adenine: Adenine
Here's the structure of caffeine, flipped:

Caffeine is a "base analogue" of Adenine, and in fact can sometimes be incorporated into a growing DNA chain, instead of Adenine.  Caffeine is a weak mutagen, for this reason.

The VERY MOST important property contributing to DNA helix stability is the stacking of the base-pairs on top of one another, due to hydrophobic forces.  (Remember, the bases "hate" water, and are not very soluble.)  Free bases will stack on top of each other and form a helix in solution!  This type of process is called "self-assembly", where you just throw something in solution, and it fits together on its own, with no extra work needed.

DNAmovDNAmovDNAmovDNAmovDNAmovDNAmov


Properties of individual dinucleotide base pairs
Dinucleotide 
base pairs
Stacking energies 
(Kcal/mol bp)
twist angle
bp/turn
(GC).(GC)
-14.59
40.0
9.0
(AC).(GT)
-10.51
34.4
10.5
(TC).(GA)
-9.81
36.9
9.8
(CG).(CG)
-9.61
29.8
12.1
(GG).(CC)
-8.26
33.7
10.7
(AT).(AT)
-6.57
32.1
11.4
(TG).(CA)
-6.57
34.5
10.<
(AG).(CT)
-6.78
27.9
13.0
(AA).(TT)
-5.37
35.6
10.1
(TA).(TA)
-3.82
36.0
10.0
The base stacking energies are from: Ornstein,R.L., Rein,R., Breen,D.L., and MacElroy,R.D.,Biopolymers, 17:2341-2360,(1978).  The helical twist angles (and their implied bp/turn) are from Kabsch,Sander, and Trifonov,Nucleic Acids Research,10:1097-1104,(1982).



The Family of DNAs

A-, B-, and Z-DNAs
A-DNA (left), B-DNA (middle) and Z-DNA (right) -- 12 bp each
From Dickerson et al. in Cold Spring Harbor Symposium for Quantitative Biology (1982) v47 p13-24.





3 families of DNA helices:



A-DNA conformation

A-DNA family - this is most common for double stranded RNA, RNA/DNA hybrids, as well as for certain DNA sequences, such as long stretches of purines.















B-DNA conformation

B-DNA family - DNA exists in the "B-DNA form", most of the time inside the cells of living organisms. This is the classical "Watson-Crick" structure.





















Z-DNA conformation

Z-DNA family - this is much more rare than the other two families, although certains sequences (such as runs of GC repeats (GCGCGC)) can form Z-DNA easily.




















A-DNA and Z-DNA in Sequenced Chromosomes From All 5 Kingdoms



OrganismKingdomSizeA-DNA
Rn or Yn
where n>4 bp
A-DNA Atlas
Z-DNA
(YR)6 or (RY)6
Z-DNA Atlas
length dist.
plot
E. coli K-12Monera
(Bacteria)
4,639,221 bp
(complete)
16.7
A-DNA Atlas
0.1
Z-DNA Atlas
plot
P. abyssiMonera
(Archaea)
1,765,111 bp
(complete)
33.4
A-DNA Atlas
0.0
Z-DNA Atlas
plot
S. pombe chr. Fungi
(yeast)
5,325,148 bp
(~12 Mbp total)
25.7
A-DNA Atlas
0.2
Z-DNA Atlas
plot
L. major chr. 3Protista
(protozoa)
384,499 bp
(~40 Mbp total)
21.3
A-DNA Atlas
3.1
Z-DNA Atlas
plot
A. thaliana chr. 1 (bottom half)Plantae
(thale cress)
14,668,8831 bp
(~100 Mbp total)
27.0
-
0.5
-
plot
H. sapiens chr. 22Animalae
(humans)
34,601,435 bp
(~3000 Mbp total)
35.0
-
0.6
-
plot
Expected values
-
n bp
18.75%
-
0.32%
-
-



What is? Background - What is a "DNA Structural Atlas"?

DNA helix



three views of E. coli Three Views of Escherichia coli:


view 1 The DNA Structural Atlas     Structural Atlas

view 2 The Base-Composition Atlas     Base Atlas

view 3 The DNA Repeat Atlas     Repeat Atlas

      the combined view The Genome Atlas (combined view)     Genome Atlas
                  note: this has a "zoom" function!




Analysis of E. coli Promoters





Transcriptional Regulation by different sigma-factors.
 
 

E.coli RNA Polymerase subunits
Gene
Mass 
KDa
Function 
-35 Sequence
separ.
-10 Sequence
#molecules
per cell
# genes
regulated
rpoA
40
alpha subunit
-
-
~4000
-
rpoB
155
beta subnit
-
-
-
~2000
-
rpoC
160
beta' subunit
-
-
-
~2000
-
rpoD
70
sigma70 
General
TTGACA
16-18 bp
TATAAT
600
3851
rpoN
54
sigma54 
Nitrogen
 CTGGNA
6 bp 
 TTGCA
~50
177
rpoS
38
sigma38 
Stationary
 ?
 -
 ?
200 (log)
600 
(stat.)
68
rpoH
32
sigma32 
Heat shock
 CCCTTGAA
 13-15 bp
 CCCGATNT
~50
75
fliA
28
sigma28 
Flagellar
CTAAA
 15 bp
 GCCGATAA
~20
13
rpoE
24
sigma24
High temp.
heat shock
?
 -
 ?
< 5
4 ?
fecI
19
sigma19
iron transp.
?
-
?
< 5
2 ?

 
 


  • There are about 1500 - 2000 copies of RNAP holoenzyme per cell (that is, a, b/b')

  •  
  • For bacteria growing in "log phase":
      • ~600 copies of RpoD (s70)
      • ~200 copies of RpoS (s38)

      •  
  • [RpoS] increases to ~600 copies per cell in stationary phase or osmotic shock.

Logo plots for E. coli Sigma factors.

cluster analysis




Structural profiles for E. coli Sigma70 and Sigma38 promoters

sigma promoters


Table of structural profiles for 17 different sequenced genomes

Promoters Table





Link to more atlases


References:

Papers relevant to this lecture (handed out in class)

    Today (Tuesday, 17 April, 2001)

  1. Anders Gorm Pedersen, Lars Juhl Jensen, Hans-Henrik Stærfeldt, Søren Brunak, and David W. Ussery, "A DNA Structural Atlas for Escherichia coli", Journal of Molecular Biology, 299 (#4), 907-930, (2000).     [cover]

  2. Link to JMB online version of this article.        PDF file     [PubMed]

  3. David W. Ussery, "DNA Structure: A-, B-, and Z-DNA Families", manuscript submitted to The Encyclopedia of Life Sciences, April, 2000.
  4. Lars Juhl Jensen, Carsten Friis, and David W. Ussery, "Three Views of Microbial Genomes", Research in Microbiology, 150, pages 773-777, 1999.
  5.    [cover]     [PubMed]        PDF file

  6. David W. Ussery, "DNA Denaturation", manuscript submitted to The Encyclopedia of Genetics, September, 2000.



  7. Friday (6 April, 2001)

  8. David W. Ussery, "Genome Databases", The Encyclopedia of Genetics, in press, April, 2001.
  9. Ussery,D.W., Larsen,T.S., Wilkes,K.T., Friis,C., Worning,P., Krogh,A., Brunak,S. "Genome Organisation and Chromatin Structure in Escherichia coli", Biochimie,83:201-212, (2001).
  10. Carsten Friis, Lars Juhl Jensen, and David W. Ussery, "Visualisation of Pathogenicity Regions in Bacteria", Genetica, 108:47-51, 2000.
  11. David W. Ussery, "Bioinformatics2000 Meeting Report", Genome Biology, 1, (#3), 1-2, 2000.



Other references

  • Richard R. Sinden, Christopher E. Pearson, Vladimir N. Potoman, and David W. Ussery, "DNA: Structure and Function", Advances in Genome Biology, 5A:1-141, (1998).
  • Ussery,D.W., Higgins,C.F., and Bloshoy,A., "Environmental Influences on DNA Curvature", J. Biomolecular Structure & Dynamics,16:811-823, (1999).[PubMed]



  • Link to a list of recent papers and talks on DNA structures.



    Books about DNA:

    Davies, Kevin, "CRACKING THE GENOME: Inside the Race to Unlock Human DNA - Craig Ventor, Francis Collins, James Watson, and the Story of the Greatest Scientific Discovery of Our Time", (The Free Press, New York, 2001).      Amazon      Barnes&Noble

    Watson, James D. "A PASSION FOR DNA: Genes, Genomes, and Society", (Oxford University Press, Oxford, 2000).      Amazon      Barnes&Noble

    Sinden, Richard R., "DNA: STRUCTURE and FUNCTION", (Academic Press, New York, 1994).      Amazon      Barnes&Noble

    Calladine,C.R., Drew,H.R., "Understanding DNA: The Molecule and How It Works", (2nd edition, Academic Press, San Diego, 1997).      Amazon      Barnes&Noble



    A List of more than a thousand books about DNA




    Go to the E. coli atlas page Link to the E. coli GenomeAtlas page



    Back to the CBS homepage
    Back to Dave's Courses page

    Last modified Wednesday, 15 August, 2001 by David Ussery