CBS-dk
Estimates of the Number of Genes and Length Distributions of Genes in Microbial Genomes

See:
Marie Skovgaard, Lars Juhl Jensen, Søren Brunak, David Ussery and Anders Krogh
On the total number of genes and their length distribution in complete microbial genomes
TRENDS in Genetics 17(8):425-428, 2001


The plot shows the degree to which each genome is over-annotated, according to our estimates, as a function of GC content.

Table of sequenced genomes

Columns:

%AT: The percentage of A+T
Organism: Name of organism with link to a home page
Reference: Link to Medline and time of publication
GenBank acc. no.: Accession Number with link to GenBank
Size (bp): Size of genome in base pairs
Number of genes/Annotated: Number of genes annotated in GenBank
Number of genes/Alignment: Estimate from SwissProt matches
Number of genes/Stop Triplet Estimate: Estimate from stop frequencies
Length Distribution: Link to a plot of length distributions

%AT | Organism | Reference | GenBank acc. no. | Size (bp) | Genes: Annotated | Genes: Alignment Estimate | Genes: Stop Triplet Estimate
44 | Aeropyrum pernix strain K1 | [PubMed] Apr, 1999 | Aero_p | 1,669,695 | 2694 | 1376 | 1423
50 | Methanobacterium thermoautotrophicum strain deltaH | [PubMed] Nov, 1997 | AE000666 | 1,751,377 | 1869 | 1466 | 1535
51 | Archaeoglobus fulgidus strain DSM4304 | [PubMed] Nov, 1997 | AE000782 | 2,178,400 | 2407 | 1818 | 1927
55 | Pyrococcus abyssi strain GE5 | - Apr, 1998 | AL096836 | 1,765,118 | 1765 | 1497 | 1635
58 | Pyrococcus horikoshii strain OT3 | [PubMed] Apr, 1998 | Pyro_h | 1,738,505 | 2064 | 1448 | 1616
69 | Methanococcus jannaschii strain DSM 2661 | [PubMed] Aug, 1996 | L77117 | 1,664,970 | 2694 | 1376 | 1423
33 | Deinococcus radiodurans R1 | [PubMed] Nov, 1999 | AE000513 AE001825 | 3,060,986 | 2937 | 2323 | 1904
33 | Pseudomonas aeruginosa strain PAO1 | [PubMed] Jun, 2000 | AE004091 | 6,261,170 | 5565 | 4753 | 3508
34 | Mycobacterium tuberculosis strain H37Rv | [PubMed] Jun, 1998 | AL123456 | 4,411,529 | 3918 | 3410 | 2537
47 | Treponema pallidum strain Nichols | [PubMed] Jul, 1998 | AE000520 | 1,138,011 | 1031 | 920 | 820
47 | Xylella fastidiosa clone 9a5c | [PubMed] Jul, 2000 | AE003849 | 2,679,306 | 2766 | 1770 | 1792
48 | Neisseria meningitidis serogroup A, strain Z2491 | [PubMed] Mar, 2000 | AL162759 | 2,182,497 | 2025 | 1530 | 1500
48 | Neisseria meningitidis serogroup B, strain MC58 | [PubMed] Apr, 2000 | AE002098 | 2,182,497 | 2121 | 1539 | 1447
48 | Vibrio cholerae strain N16961 | [PubMed] Aug, 2000 | AE003852 AE003853 | 4,033,460 | 3828 | 2991 | 2931
49 | Escherichia coli strain K-12, isolate MG1655 | [PubMed] Sep, 1997 | U00096 | 4,639,221 | 4289 | 3771 | 3463
52 | Synechocystis sp. strain PCC6803 | [PubMed] Sep, 1996 | AB001339 | 3,573,470 | 3169 | 2559 | 2550
54 | Thermotoga maritima strain MSB8 | [PubMed] May, 1999 | AE000512 | 1,860,725 | 1846 | 1576 | 1564
56 | Bacillus subtilis strain 168 | [PubMed] Nov, 1997 | AL009126 | 4,214,814 | 4100 | 3263 | 3330
57 | Aquifex aeolicus strain VF5 | [PubMed] Mar, 1998 | AE000657 | 1,551,335 | 1522 | 1337 | 1412
59 | Chlamydia pneumoniae strain CWL029 | [PubMed] Apr, 1999 | AE001363 | 1,230,230 | 1052 | 903 | 909
59 | Chlamydophila pneumoniae strain AR 39 | [PubMed] Mar, 2000 | AE00216 | 1,229,853 | 997 | 790 | 913
59 | Chlamydophila pneumoniae strain J138 | [PubMed] Mar, 2000 | BA000008 | 1,229,853 | 1070 | 921 | 910
59 | Chlamydia trachomatis strain D/UW-3/Cx | [PubMed] Nov, 1998 | AE001273 | 1,042,519 | 894 | 772 | 754
60 | Chlamydia muridarum (MoPn) strain Nigg | [PubMed] Mar, 2000 | AE002160 | 1,069,412 | 818 | 698 | 763
60 | Mycoplasma pneumoniae strain M129 | [PubMed] Nov, 1996 | U00089 | 816,394 | 677 | 610 | 617
61 | Helicobacter pylori strain 26695 | [PubMed] Aug, 1997 | AE000511 | 1,667,867 | 1566 | 1303 | 1384
61 | Helicobacter pylori strain J99 | [PubMed] Jan, 1999 | AE001439 | 1,643,831 | 1491 | 1316 | 1351
62 | Haemophilus influenzae Rd strain KW20 | [PubMed] Jul, 1995 | L42023 | 1,830,138 | 1745 | 3771 | 3463
68 | Mycoplasma genitalium strain G-37 | [PubMed] Oct, 1995 | L43967 | 580,074 | 480 | 461 | 474
69 | Campylobacter jejuni strain NCTC 11168 | - Feb, 2000 | AL111168 | 1,643,831 | 1654 | 1420 | 1494
71 | Borrelia burgdorferi strain B31 | [PubMed] Dec, 1997 | AE000783 | 910,724 | 850 | 756 | 772
71 | Rickettsia prowazekii strain Madrid E | [PubMed] Nov, 1998 | AJ235269 | 1,111,523 | 834 | 759 | 795
75 | Ureaplasma urealyticum strain serovar 3 | Jan, 2000 | Uu | 751,719 | 613 | 564 | 556
62 | Saccharomyces cerevisiae | [PubMed] May, 1997 | U00091 Y13134 X59720 Z71256 U00092 D50617 Y13135 U00093 Z47047 Y13136 Y13137 Y13138 Z71257 Y13139 Y13140 U00094 | 12,057,849 | 6269 | 5560 | 5728
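
As a rough illustration of the over-annotation that the plot summarizes, the annotated gene count can be compared with the two estimates for a few rows of the table above. The sketch below is not necessarily the calculation behind the plot: the formula (annotated minus estimate, divided by estimate) and the names ROWS and over_annotation_pct are assumptions made for illustration only.

```python
# Illustrative sketch only: the excess formula below is an assumption,
# not necessarily the measure used for the plot.

# (organism, annotated, alignment estimate, stop triplet estimate),
# copied from the table above.
ROWS = [
    ("Aeropyrum pernix", 2694, 1376, 1423),
    ("Escherichia coli", 4289, 3771, 3463),
    ("Mycoplasma genitalium", 480, 461, 474),
]


def over_annotation_pct(annotated: int, estimate: int) -> float:
    """Excess of annotated genes over an estimate, as a percentage of the estimate."""
    return 100.0 * (annotated - estimate) / estimate


for name, annotated, alignment, stop_triplet in ROWS:
    print(
        f"{name}: "
        f"{over_annotation_pct(annotated, alignment):.1f}% vs alignment estimate, "
        f"{over_annotation_pct(annotated, stop_triplet):.1f}% vs stop triplet estimate"
    )
```

For these three rows, Aeropyrum pernix shows by far the largest excess of annotated genes over either estimate, while Mycoplasma genitalium lies close to both.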


Last modified on Thursday, 26 August, 2001, by Anders Krogh