Consider
the following "typical" (?) family:
1 tcgactcgat ctttcatggg
tcaaataaac acagaattga tagaatattt taataatcta
61 acaaagatga tttaataaat atttattaaa
ttttgttctc tagaaataag caatacactt
121 ttttaaggag ccatgaagca tttagaaaaa
aattgatcat atattatatc tcaaagaaac
181 catattaaat tatgaaatat aaagtgttaa
cagcctccat gttctgccta cactgaaata
241 gagctagaca ttaatagcaa aagtttaaat
gcaaaaatca gtagcttgga aaagagaaat
301 aatgataaca aaaaccaaat ttttttctaa
aaactccaga ttcaaacagg aaaggaaact
361 atataataaa ctgttaggag agtaagaact
ctatgtatca atattttaaa tggctaaagc
421 tgtatttagc agaaaattaa tagccctgag
aactttcatt attaaaggag agtaacaaaa
481 tcaatctgtg gaaaataaaa ctaatgaatt
aataaataga gactcctttt ttttctgtta
541 aataagacca acaagcaaat aatatttatt
taatgagcta gaaataggag gagcttacgc
601 agcataatac aagatattta ctggaaacct
gtagcaaggt atgttattta acgttaacgt
661 tgtaaatatt actttcactg ccacaaatga
gatgagtgct tgtgattttt gtaaaagtta
721 aatcccaaag aacaatttct gcttcccatg
tagtctgtaa accctcacaa aacacaaaca
781 gtagcaacaa caaaactctc tgagggatct
gaagaatgaa caaaggcagc aagattctcg
841 aagggggttg aagacacctg gaagtagtat
aggttaaatt tcttatgact attatcctga
901 ggtcaagcca caggtagggc cacacagagt
ggctgaaact cctgtaggaa gctcagtctt
961 tctggcctta tgaaccaggg gacagagttt
ggagcaaccc cagctactgg aaagtaaggg
1021 aggaatgtca ttaaaaagag ggccagaaaa
ggagcattac aaatttttgg tgtaaattct
1081 gctcaaatct ctggctgacc tttggaatat
acatacacag gcagaccaaa gcagtcagtg
1141 aatttgataa tatcaatata gattttctct
atgaagaaca gagattgtat ttttaaaaag
1201 gattaaaaaa aaagcaacca gagcctcagg
ttcctatggg ataattcttt taacatatga
1261 gtaattggtt ccccagaagg aaaggtgata
gagaatgggt cataaaaata ttaggttggt
1321 gcaaaagtaa tcgtggtttt gcaattactt
ttcattacaa aacccacaat tacttttgtg
1381 ccaacctaat atttgaagaa acagtggccc
cacacacttc ccaaaattgg tgagagataa
1441 atgtacacgt tcaagatgct gaggcacatt
gtaatcaaac tgctgaaaac ccaagacact
1501 atcttgaaaa tagccggaga aaaacaacac
actatataca tgaaactgca ctgctaaata
1561 ctgccaacct accttcaaaa actatgaaag
caagaagaca atgaaacaat atcttcaatg
1621 tgctgtaagg agataagact atccaatccc
gaagtccata tccaaagata ctcatacttc
1681 aggaacggag tcaaaataaa gttgcattca
gaaaaacaaa aacagaatat ttattgccag
1741 cagaccaact tacactgtaa aaaatgcttt
gttctttggg atgacaagaa ataatagatg
1801 aaaactcaga tccttatgta aaaataaaga
aatcataaat agtaagataa atataaaaat
1861 caattttttc ctcctaatgt tttaagaata
tgtatgtttg tttaagtaaa aactatagca
1921 tcatttttat gggatttata atacaggtag
atgtaatttc aatgacaact tgaatttaga
1981 gggagtttgt aaatagattc atacatgtgt
gaggtttcca tattgtatgt aaatagtata
2041 atagtaattc taggtagact gtaaaaagag
aggttggatt gaattcctgg taatatgtgt
2101 ttaaatggaa tgaatactcg ttcttagaat
gaagagaagt gtgcagaatg gaactttgca
2161 aacccttact ggcccatatg tcacggggtg
gagatgtcac cattttctat atggctccag
2221 atcaagcaat ctcttgacaa aaagcaagct
ggcttcgagt aaccttcaaa cggagtgtac
2281 caggccaatg cagaagacat aacatggttt
gctttctttc tcccgtccat tttagtatct
2341 taagtaccag cagatagaag gtccagaaaa
aagaaagagg atgatattgt aggggaggcc
2401 aaattttctc tcttagggtt ttttagctgg
gcctgaggat tgaattgaca taaggcagat
2461 cagcgggaga aaagcataca aatttaattc
attcaatttt tatgtgagca caagagccct
2521 cttaaggaaa tgaaacccaa agacagagtt
gaaagttgaa cacctatgta ctgaattggg
2581 aaaggaatag aaaattgtga gtatgtgaca
aagccaacgg gcttgggcta gggaggttaa
2641 ttgggtagag aagtaactag gaagataagg
gttactttaa taaggtgtgt ttgttcagat
2701 ttctctcagc atcaacttct cgtctttgat
gataagaacg ctactttccc tctggcgtag
2761 ggaggacaag ttatctcctg tttctaagga
aaagaagaaa gctcagagta tattttttgt
2821 atctgctgtt tttccaagtg tctttaagct
caaaatagtc aatatactag agcaacatat
2881 tttggagtat tgtgttctga actccttcaa
tctttagcat ttctcttctg tcattcttca
2941 ggctcttgag cacttttgag aaactaatat
gaatgggaaa gatgtaatat ttactcccag
3001 gaaggttgct ttggtgatct agtttgtgaa
ctagactcag gaaactgtct gggtattgga
3061 aaggaagtca cggatgcgca atggaatttg
taaaggggtg gcacagctct tgaatactct
3121 ccttggaaga gaccctggca tggagagaac
agagtattaa cctcactctg aggttgggaa
3181 gactaaggga atctgaggtg tgaggaggga
tggagacgac aggtcacaca atgtgatgga
3241 aaaccgcaca tggagtggaa agaacatgaa
ctctggagtc tggaactcct atatctggac
3301 ttgacaccgt attctgctat gtggccttcg
gtgagtgtgc caagtagaat attggctctg
3361 caaacgtatc cacatcaaat ttctagagct
tgtgaatgtg accttgtttg caaaaaaaat
3421 ggtccttgca gatatgatta aattgaggat
cttgagatga ggaaatcatc ctggattatt
3481 ttgggagacc taaatgctat tacaagagtc
ccaccaacca ccagaagctg aaagtggcaa
3541 ggaatggaat ccctcctaga gcctctggag
ggatctttgc tcttaaacat cttaattttg
3601 ggcttctggc ttccagagct gtgagagcca
caaagtttgt ggttattttt tattgcagcc
3661 acagtgaaga agataataaa gccagctact
gacctctctg aatctcagtg ccctcatctg
3721 taagatagaa tcagctagct actttatagc
atttttccag gattaaatga gacatgtaaa
3781 atagaaaaca aacaaacaag caaacaaaga
aaagataagg aacatgctga gaaaaaaata
3841 tagacacccc ttgtttgatg ttactaattg
attcctgatg atcagatgta ttacccaaaa
3901 atgatttcct tgagacagtt tgccattact
taaatgggaa aaaatgtgtt atatgtcaaa
3961 ccaaccctct agtgaatttc ttgaagtgta
agtaaaacac agtaaaataa ccatagtaga
4021 acaataatgt cacactgaga cttaaaatgc
aggcatacct atatctcaga gatactgcat
4081 gttcacttcc agaccatcac aataaagtga
atattgcaat aaaacaaggc acacaaactt
4141 ttttgtttcc cagtgaatat aaaagttatg
tttacagtct atagtagtct agtatgcaat
4201 agcattataa aaaaaatatg ttatcttaat
ttaaaaattt tttttgctaa aaatactggt
4261 gatcattttg aaaccctcca ggtaattcca
taatcctttc ctgcctggtt ggaagggtcn
4321 tttgcctttg atattttatt ggcttgccga
actggtatca aactgggtgg ttgttgaagg
4381 gcagaaatgg ctgtggcaat ttgtttaaag
taagaaaaca gtaaagtttt caacatcagt
4441 ggcttcttgc tttcacgaat gacttctctg
ttgcatacag cactgtttgc cagcatttta
4501 cccacaataa aactcctttc aaaatgagtg
aatcctctcc aacccttctt tatcaactaa
4561 gtttatgtaa tattctaaag cttttgttgt
catttcaacg atgttcataa catcttcttc
4621 aggagtagat tttacctcaa gaaatccttt
tctttgctta cccgtaagaa gcaactcctc
4681 actcattcaa gttttattgt aaaattacag
taattcagtc acatcttcaa gctttacttc
4741 tgattctagt tgtcgctatt tcttccacat
cttcagctcc ttcctccact gaagtcttga
4801 acccctcaaa gtcatccatg agagttggaa
tcaacttctc ccaaacacct tttcatgtta
4861 atattttgtt gtcctcccat aaatcatgaa
tgttctcaat gacatccagc atggtgaatc
4921 ctttccagaa ggttttcatt ttactttgcc
cagatcaatc tgataaatca ctatgtatag
4981 tcttacaaat gtattttctt aaataatgag
aacttgaaag tcaaaattac tccttcaccc
5041 atgggctgca gaatggatgc tgtgttggca
ggcatgaaaa caacattcat ttccttttac
5101 atctccatca gaactcttgg gtgaccaggt
acattttcag tgagcagtaa tattttgaaa
5161 ggaatctttt ttctgagcaa taggtctcaa
gagagggctt aaaatattca gtaaaccata
5221 ctataaacag atgtactgtc atctaggctt
tcctgtttct tttcagagaa catgaagagt
5281 tggtgtagta tcattcttaa gggccttagc
attttcagaa tggtcaatga gcattggctt
5341 caacttaaag tcaccagctg cgttagcagc
taacaaaaga atcagcctgt tctttgaaga
5401 tttgaagcca ggtatcgact tctcctctct
agctaggaaa atcctagatg gcatcttctt
5461 ccaacagagg gctgtttcat ctctcttgaa
aatctgcctt tggccgggtg tggtggctta
5521 cccctgtaat cccagcattg tgggaggccg
aggcaggcaa ataatgaggt caggagttcg
5581 agaccagcct agccaacatg gggaaaccat
gtctctacta aaaatacaaa aaattagcag
5641 ggcgtagtgg tgggcacctg taatcccagg
tacttgggag gctgaggcag gagaattgct
5701 tgaacccggg aggcagaggt tgcagtgagc
tgagatcgca ccattgcact ccagccctag
5761 tgacagagtg agactatgtt tcaaaaaaaa
aaaaaaaata tgctgtttag tgtagccacc
5821 ttcattagtt atcttagcta gatcttctgg
ataacttgct gcagcatcta tataagcact
5881 tcctgctttg ccttttatgt tctggagatg
gcttctctcc ttgaacctca tgaatcagcc
5941 tctgctatct tcaaactttt cttctgcagc
ttctttacct cttttagcct tcatagaatt
6001 gaatatagtt agggcctttc tctggtttaa
gccttgggtt aagggaatgt tgtggctagt
6061 ttgatcttct atccagacca ctcaaacttt
cccatatcct caacaagcct gtttcacttt
6121 cttatcattt gtatgttcat tggagtagca
cttttaattt ccttcaagaa cttttccttt
6181 gcattcacaa cttggctatt tggtgcaaga
ggcctagctt tcatcctagc tcagcttttg
6241 acatgccttc ctcgccaagc ttaatcattt
ctaatttttg atctacaggg agagacatgg
6301 gactcttcct ttcacttgaa cacttagaag
ctgttgtctg cttattagtt ggactaattt
6361 caatttgttg tgtctcaggg aatagggaag
cctgaggaga gggaaaaatg gctagtcagt
6421 gcagcagtca gaacacacaa acatttactg
ataaagttca ccatcttata ttgggcccca
6481 taacacttac aataggaaca tctaagatca
ctgatcacag atcaccacaa cagacataat
6541 aataatgaat tagtttgaaa tattgggaga
attaccaaat tgtgatacat agacgtgaaa
6601 tgaacacgtg ctgttggaaa aatggtgcca
atagacttgc tagatgcaaa gttgccacaa
6661 accttcaact tgtaaaacca cattatctgt
gaagtgcaat gaaacaaagt gcagtaaaac
6721 aaggtatgcc tgtacttaaa aaaattatat
cctcgatcac ttgtacttaa aaaaattata
6781 tccttgatga cttctacaat taacatgtta
taaaatgttt gctttgtacg tagtagctat
6841 tctcacttgt tttgttttct tctctacaat
tgtcttcagt gttttctcaa cagtttgggg
6901 atttcaacaa taagcaaaaa aagtaaaatg
gggaaggcac tgggaaatgt gctctagtgc
6961 ctggtggcca agtgtggctc acactgatgg
tatgatggcc ttttagcatc agcaacttct
7021 gaaatgggtg tttcgtgtgt tgtcactcag
cttgctttcc aaaatatcta tatattattc
7081 tgatgttcta ctgaatatgt tttgaaaata
tcttgatgat tttcctatat ttccttttat
7141 tgaaagtttt gcagagtttg gtattataaa
tctaaagttt tgctacatga acattggcaa
7201 aaaaaaaaaa aaatagatga tgaaatttgg
attttaaaaa ccaagagtca ttttaccatg
7261 aaaatgttaa tcgaggcaac atcattcagg
ttatgcctgt atttgcagaa tgtttcatag
7321 tagaaaatta atatgaaata taaaaagagt
taatacaaac aagtaatcag aaataaaaat
7381 caatagaaaa ataactaagg ggtctgaaga
aacataaatt ggagaagcaa cagaaatgga
7441 caattaatga atgaaagaat gtacaactgc
aaaagtaatc aggaaatgcc ttctaaaatc
7501 agcacagtat agctttttac atctatccaa
ttgtcaaaat taaagggaaa aataaatcta
7561 acaattctag gtgttggaga ggttgtaaag
caatgaagat tatcattcac agagcaaact
7621 ggtacagcca ctgtggaata tagtgtggca
tcatccagaa tagttgaaga catgcaactc
7681 ccatggccca catgccaagg ggaatgtgca
caacatactt acaatagtat ctttttgata
7741 attcaggaaa acgtacatgt tcaccaatag
ctaatacata aatattatta ttcaactaag
7801 agacgatttt cgaaaagcaa aaaaagaatg
agtaaaaaca acatatagta acaagaataa
7861 ctataaatat atgttgagta aagatgtttt
gaaaatgagt ttcagaagaa tgcaatctaa
7921 taccattata taacaattta aaacatgcaa
aaaaaacccc atcattttca gctagttggc
7981 aaatgtagtg gaattgggca tggggatatt
ctccctagaa aaagaatttt ttcatgtttt
8041 ttgatctagt aatctcattt atagaaatct
ttcttaaaga aatagagatg caggcaaaaa
8101 ttcattatta tactttggta ttatagtatt
tatagaataa ttaagtactc taaattatct
8161 gtcagtaaac tgtgaaattg ttaaataaat
tatggaacat aaggtagtgt tttaaaatct
8221 ttcaaaaatt atatctaata atatgggaaa
attgcattgc tatgtaatcc gaatttggtt
8281 taaaaattaa acaaatagca aaaaaagaag
gcaaatatat aaataagact gggaactata
8341 aaaacactat caatgtttat ctctgagtga
tagaataata tgtgattttt aaaaatgttt
8401 ttctgtaatt ctcaaattag gtgcagtaag
cagacatgac ttctacaatc ataacaggta
8461 gcttcattga cctgtttcta aaatgggtag
cactttttaa ggattgaagt tgttttgtcc
8521 agttggacag attcttatcc ctttgtcttt
tcttgtattg ccaacttctg cctatacttg
8581 aaatagtagg gcagatttct atatgtaagc
atttaaaggg tctgcaagaa atactaagcc
8641 atgtaggaac tgtgaatatt aaatgagata
atgaatgaaa agaagcctag cacagtccct
8701 agcccaagta agtacctatt attattatta
tcattgactt tactacggct ttggtgttat
8761 gtcacagaca aaagtttgca aaccactttt
ttttttgctt gttttttttt tttaaggact
8821 caaaacagtt acctttgagt actaaagtcc
ttaagatgga aaacgagatg atatttatta
8881 tgccactgat atgtttcagg aagagcttgt
ggagagagca gacaaaacaa atggtaaatc
8941 ttgttcttag aaagaaagag ctatctgtcc
ccattacagc cagctgtcat gcccaggctg
9001 gagccactct tgggacaaaa ttagggacac
ttgaaaatgg taaatcttgg aaactgagag
9061 ctctgtattt ttcaacttag gatcataatc
tattaacccc tatttctatc ccagtttttc
9121 tcttggataa ctgatttcat ttgtgcaact
tgcattgaga aggtcaggat aaggcctgga
9181 aattgtcaaa ctctagattt tgatgacatt
ttggtaactc cttacaggaa aatctatcag
9241 ttacatggtc caattgtagg ggtgatattt
actaattttg cattctattg aacagcgata
9301 tataatggaa tgtatgttgt gaatcttcaa
taagtgtttc ttttctctta taacccaatg
9361 aattgacctc tataataaca tgcagttgct
aaatgaaaaa agaattgtat ttaagcagaa
9421 aaaagtttta gaaaatgtga ttttgtatgc
caaaatatgc atattagtag cttttatgag
9481 aagtgtcctt gaaggaaaat atggtgaatt
tgtatctggt tcaggtcttc ttttaataac
9541 cagaaaacaa ggctgaaaca agcacaaggc
agttctatct gtttgaatga tataaattgg
9601 ttgttaaaaa taaaaataaa atgaaaatga
aaaccaagat aacaatacca aggataagtt
9661 aaacaaaaag ttggtttttt gaaaagatga
acaaaattga taagcctctg gctgtactaa
9721 ccaagaaaag atccaaataa aagtcagaaa
tgaaaaagaa gacattacaa atgataccat
9781 aggaatacaa aagatgatca gagactacta
tgaagaagtg tacactcaca atctagataa
9841 tgtagaggaa attgacaaat tcttgtaaac
atatgactgc ccaagattga accagaaaga
9901 aatagaaatc ttaaacagac caataatgaa
tagtaagatt gaatcaataa ttttaaaacc
9961 ttccccctaa aatgcccagg actggatggc
ttcatagcca aattctacaa aatgtgcaaa
10021 gaagaactga tactcatcct actgaaactc ttccaaaaaa
atcaagatga ttaaataaag
10081 aaaatgtgct ctgtgtgtgt atataagtgt atacatatgt
atatatatac acacacacac
10141 atacacatat acatatgtat acacatatat acacacacac
cacatgtata tacatataca
10201 catatataca tacagcacat tttatttatg tatatgtgtg
tgtatatatg tatacataat
10261 agaatactat tcagccacag aaaatgaatg aaaacatgtc
ttttacaaca acatggatgg
10321 aataggaggc cattatctta agtgaaaaaa ctcagaaaca
tagtgaaata ccacatgtcc
10381 tcacacgtgg gagctgaata atgtgtgtat gtagacatag
agtatagaat aatagtcact
10441 ggagactcag aatgatgata gggtgggagg ggagtgaggt
agtacaatgt acactattta
10501 ggtgttgatt acattaaaag cccagacttg accactatgc
ggtatatcca tataacaaaa
10561 ctgcacttgt acccctttaa tttacgtaat taaaaaattg
gctgggcaca gtggctcatg
10621 cctgtaatcc aaacactttg ggaggcccag gcgggcagat
caattgagcc caggagttgg
10681 agactagcct gggcaacatg gcaaaacacc acctctacca
aaaaaataca aaaattagct
10741 gggcatggtg ccacatgcct gtaatcccag ctactatgga
ggctcaggtg agaggatccc
10801 ttgaacccag gaggcggaag ttgtagtgag ccaagatggc
gccacggcat tccagcctag
10861 gtgacattat gagaccctgt ctcaaaaaca acaataacaa
caataacaac aaaagaaaaa
10921 attatgctca ctaaaagaaa aaaacagtaa ttaaaaaaaa
tgagttcttg tggtgaaaca
10981 taatggcacc tcctggttcc ttcttgcatt tgaagattat
agatgaggat tttcagggaa
11041 tggtcaagat caaaacctat ttggtctccc aagattttcc
aggagaggct gtaggcctct
11101 tcctagctta cctggcatgt acatactaac tgcaggtagc
agctgtatga agttgttggg
11161 taactcattt ttggttgtgt tctctaggtg acacattagg
catttttgtc tacctttaag
11221 tcctccgttc tcacctctgg tagagtgtaa actccataat
gagagagact gcttctgact
11281 tgtccacctc tgtaccctaa tttaatgaaa acatgttgaa
caaataaaga ataaacctag
11341 tggtggtccc aatcatccac ctgcccttcc aaccatcgtc
ctttccagat tgcagccttc
11401 tatatcagaa gtgtcagcct ttgttttctt gctctgggaa
gtgatgattt aagttccaca
11461 ccaccaaagg attgtcaaat acacacaaca aaaataaact
tgagctaaaa ttgtgatcta
11521 atatgcagag agattgaaaa agctaagaaa ataaattata
ccactttggc catgcagata
11581 caggaaccat ctggaagcat ctcagcaacc tgaaagggag
catctgatgc ctcgaattgc
11641 cctgtccgtc agctttagct gctattccca tttgcttctc
ttaaatgtat ttaacttggg
11701 acaagttggg agccaccttc tcttttctga ggtgatctta
gaagattgat atacgtaagt
11761 tgcagcttgt ttggattgtt tatctgtatt taaaggctct
accaggcatt gtattttgct
11821 gttaagataa aatcaataaa taaaaactta aaaaagaacc
agactaatcc ttgaacattc
11881 ctaaacatat actctctcct gctttcaaaa aggaacactg
cttaatgttt acttattgta
11941 gtcttctgag aatttgattt gctgtgattg cacctttatt
caaaagtaca gaacctaaaa
12001 atggtacaaa gaaagtttat ttcaaatcag atgcatttgc
aagccagtgt cattaaaaaa
12061 catgctgttg gtaaggctga ggtcagagtc tatcatacag
acagatgtga tttattgcaa
12121 aagagagccg actactgctt aggtatcagg tgccaccaat
acataactac caccaacgaa
12181 aggctggttg tgatgacggt attggaagct ggtgagtcag
actcattaca attttcatgt
12241 aatctctaaa cttccttgtt aaactatttt ctactcttcg
taacttttga aaagagagaa
12301 tatgggagaa cttgagcttc ctctttctcc ttctctggct
tcttttgttc ccgggcagag
12361 acattagcta gcagtcagat atctgagaac atggtaagag
aaataagagc aggaagcctc
12421 ggcaactcac ttgataatta tttttcatca tttagtagca
ggtggtagaa taaataggat
12481 catttgtaaa gttcaagtgt tgatggtggt agtggagaga
gatactaaag cattttagtt
12541 ttcaagccac aatgaagtaa ctggtatcat tcttgacctc
atttcatgaa taactataaa
12601 attagataaa atgtatgcgg caactgtttt cagaaattgg
ctaacaggca atgcaggctg
12661 tgacccttga gacaagtaaa cccctgaggc aagccttgca
ttcacttggc tttctgccta
12721 ggggtgcatt cactggactg gctagtaagg aagtggggcc
aancattaca tgcttgtttc
12781 aatgaactga taaggcagag tttgtacttt gttgctgctg
aggggctgga attagcagag
12841 tagggtactg aataagagag atgcatgagg taggcatgtg
gaggtagtgt acatggaggt
12901 ttctttgcag gtcattgtct gaaggattga ctatgtataa
gcagggcaag acaccattaa
12961 ggcctagcag agaaggttca ctgtgggacc aagagccgaa
caaagacact agtggttgcc
13021 cagagatggg agacgttgga attcttagtc tgagagggca
gatttccctg agaacctggg
13081 cattcagttg aaaccccaga aaggccagcc tcaggagtga
gacctgtaca ctaccataag
13141 gctgtgttct acaactaagg acaaaattga actagaccag
ttctaacaat gaaaggttca
13201 agagaatatt ttagaaatgt aatttcctgc cagaaaaaaa
ctcagtaccc tataaggaag
13261 acaatataat gcagattccc taaatgtgtc gccctgtgtt
tagcatacaa caaaaaaatt
13321 actggacgtg tgaagaggaa gctccattaa tcagaaaaaa
gcaataaata gaaacagact
13381 ttgaaataac tcagacatta gaattagaaa atgaggaatt
taaaataact agataagtat
13441 attcaaggac ttgacataaa atgggtgaca taaaatgaag
aggcgggaaa tctcagagaa
13501 acaaatgtaa actataataa agaaacaagt ctatattata
gaactgcaaa taataatttc
13561 caaaatgaaa agtttactga atggcttagc aacatattag
aaactacaga aaagatcagg
13621 aaactttaag acagatcagt agaaattgtc tgatctgaag
aacaggagga aaaaatagaa
13681 aaaatatatg aatcttcagt gatctgtggg gcaaaagcga
gaagtctaac atttgtacaa
13741 tttgaggtcc agaaatgagg atagagagaa tgagggagga
aaaaaaatag tttaaaaaaa
13801 tgaacaaatt ttcccaaata tagtgaaaaa atcaaactac
aggttgaaga tgtttagcaa
13861 accaaagcag gataaaaaca aagaaaacca tacctggatg
tatgacggcc aactgataaa
13921 aatccaaaag taaagaagaa agctgctgag gatgggagga
gtggagagac attacatata
13981 ggagaacaac agtaagagtt attttagact cctaaataaa
aactattcaa gccagaagac
14041 aatagaatga catttttaaa ttcttttttt aaatttaaat
ttattttatt ttaaattctg
14101 ggaaacgtgt gcaggatgtg tagctttgtt acatatgtaa
acatttgcca tggtggtttg
14161 ctgtacctat caaaccatca cctaggtatt aagccacaca
tgcattagct atttatcctg
14221 atgctctccc tcccaccgct gccccccatc ccaccccaga
caggccccag tgtgtgttgt
14281 ttccctccct gtggccatgt gttctcattg gtcagctccc
acttatgagt gagaacatgc
14341 agtgtttggt tttctgttcc tgtgttagct tgttgaggat
gatctcttcc agctctatcc
14401 atgcccctgc aaaggacatg atctaattcc tttttatggc
tgcataaaca aaacctaagt
14461 tttagataaa caaaacctaa gaaaatgtac tgctagtcaa
cccgcattat aagaaatgtt
14521 aaagcaaatt cttcagccta aaagaaagtg aaagcaaatg
gaaacagatt aacaggaagg
14581 aatgaggagc acagaaatgg taaatatatg gaataatata
tatattttta aactgtcttt
14641 attttctgtt aagctcttac ggggtaactt actatttaaa
gcaaaactca tgacaatgta
14701 ttgaaaggat tataatgcaa atagaagtaa aacatgtgac
agcataatgg acaaggaagt
14761 aataaatgtc aggttcttat attttacatg agatagtata
atattaattt taggtagatt
14821 ttgataagat aaggatgcat attgttagtc ctagagcatg
cacgcacaca cacacacaca
14881 cacacacaca cacacacaca cacacacaca cacacacatg
ctacaaagag gtataattaa
14941 aaaccctaat agtggtataa agatggaaac tttagaaata
tgcattttaa cagaaagaag
15001 gcaagaaaga aggaagaagg aaacaaaaaa agatggcata
aatattcaga tggtagactt
15061 aaaactgaat tatatcatta cattaaatat aaatggacta
aatgtcccaa ttaatagtca
15121 tagattgtca atatggatac tgtcaggcct ctgagcccaa
gccaagccat cgcatcccct
15181 gtgacttgca cgtatacgcc cagatggccc gaagtaactg
aagaatcaca aaagaagtga
15241 atatgccctg cctcacctta actgatgaca ttccatcaca
aaagaagtgt aaatggctgg
15301 tccttgcctt aagtgatgac attaccttgt gaaagtcctt
ttcctagctc atcctggctc
15361 aaaaagcacc cccactgagc accttgcgac ccccactcct
gccagccaga gaacaaaccc
15421 cctttgactg taattttcct ttacctaccc aaatcctata
aaatggcccc acccttatct
15481 cccttcgctg actctctttt tggactcagc ccgcctgcac
ccaggtgaaa taaacagcca
15541 tgttgctcac acaaagcctg tttggtggtc tcttcacacg
gacgtgcatg aaagatacaa
15601 aataaataaa acccaataat atgctgacta caacatatgc
actttaaata caaaaacaca
15661 aagagttaaa agtaaaagga atacaaatat atgttataga
aataattttt agaaagccaa
15721 tgtggctaca tcaattacac aaagtaaact ttaagacaag
aaggtttacc aaagataaag
15781 agggacatat cataatgata aaggggtcaa ttaagaacaa
ataataattc caattttttt
15841 ttcaagacag agttttgctc ttgttgccca ggatggagtg
caatggcatg gtctcagctc
15901 cctgcaatct tcttctcctg gtttcaagtg attctgctgc
ctcagcctgc caagtagctg
15961 ggattacagg tgcctgccac tacacccggc taatttttgt
gttttttttt tttttttagt
16021 agagatgggg tttcaccatg ttggtcaggc tggtcacgaa
ctcctgacct cagctgatcc
16081 acttgcctca gcctcccaaa atgctgggat tacaggcatg
agccactgca cccggccaat
16141 aattctaaat gtttactcaa ctaaaacttt aagaactttc
agatatagga atcaagagcg
16201 ggattttttt ttcaatattg tattatgaaa aaattaaacg
tatatgaaaa ctgaagtaat
16261 tttatagtga acaccctgtg taaccaccgc ttggtgtcta
acattaacat tctactgtgt
16321 ttattatatt acatttgtac ctatctgtta ccccttcttt
ctatttgtca attcatcagc
16381 tgcagtgatt ttgaaactct actctgtaaa gttccaggaa
ttgtgtgtgt gtgtgtatgt
16441 gtgtattttg gaagtggggt agtgttttga gaacactagt
agtcctcatt tgaccaaata
16501 atttatactt ttatctatta ttctatctta tatattggac
ttctgtttag gttacctgaa
16561 aacatggttt tgtgagtaaa ggaaaatttt gaaagctttt
atcaagggta ttctgagggt
16621 aatccactct cccaatctag ttaagcataa tctagaaacc
tgaccatgtg ttccttgaag
16681 acactgagtg agtgtaccca atttggcagc ttttgtgaga
gcaaggacca tccttcattt
16741 ttaattccct tcaggttcat gcctacagtg aaaagtcagt
acatcttgct ggatgaatga
16801 aggtgggcaa tgtatttatt ctatataata ttgtattttg
tgggcacacc tgtgagaata
16861 tgacacaggt tggacagttc tacctggctc ttagctatgt
taattaccaa ctaattatga
16921 ggttgtttta ttttcagaac cccgttaagt ccatttatta
attagaataa gactataaat
16981 tccctctgtt taatccaaat ttcactatct ggcatccccc
aaatcagaaa tgttaacact
17041 ggaacattgc ttgctttcaa attaaatagg taagatttga
agagaacttt ggagtttaag
17101 aggttacatc taatgtgttc taaatattag gactcgtttt
tcttctgatt acacccctac
17161 aaagtgccaa gcctttgata gctcattttg tgttttctga
agttctagtt ctcctggtca
17221 cattctccag caaggccctg tgtattaagt gtaaccagta
atggaaaatc ctctacatag
17281 aatgtactca gaaaatttga ggagcttaag cacttgactt
ctctttggaa acacagtgga
17341 tatttgtgtc gctactgtac acaaaggagg agacactgag
aaataaagaa ggttatccca
17401 agtccccagt aaaacaggat gaggttttta ggtttgagga
aagtgcctca ttcactccat
17461 tcctgcctca ggatactgaa aggcaggttt ctaggctgag
ttttctactg taggaaatcc
17521 tattttttac ttatcaagaa tgcattagag ttggaagaat
tacagtcttc ctacaccata
17581 tgctaatgtt aattccagct ggattaaagc aataaataaa
tgtataaaat caatcattag
17641 aaacccagaa tagaaatgag tatttatccc tactggagtg
agaggggact tcataaactt
17701 gaagacaaag agaaagatcg tttctgaaaa gagtaataga
tttgactcca ttaaaaatta
17761 aaacttctgt ttgccaaaaa gcaaatccac acagtgataa
gtatgtgcca cagatatgag
17821 ggacaagtga ctaggaatct taaaagacca agccaattga
taacaatgca tgattacaat
17881 ggttaaatgg gaggtcaggc atccattgtt ttcctgccta
atgctgttat tttcaattct
17941 ttcttgctca gtgggagaac tcaattgccc agggtctctt
cctgcttgct tttctatcat
18001 tcctatttct gagccccagc aggggctggg ttcttcttct
cttcttcttc cttcttcttc
18061 ttcttcttct cttcttcttc cttcttcttc cttcttcttc
cttcttcttc cttcttcttc
18121 ttcttcttct cttcttcttc cttcttcttc cttcttcttc
cttcttcttc cttcttcttc
18181 ttcttcttct cttcttcttc cttcttcttc cttcttcttc
cttcttcttc cttcttcttc
18241 ttcttcttct cttcttcttc cttcttcttc cttcttcttc
cttcttcttc cttcttcttc
18301 ttcttcttct cttcttcttc cttcttcttc cttcttcttc
cttcttcttc cttcttcttc
18361 ttcttcttct cttcttcttc cttcttcttc cttcttcttc
cttcttcttc cttcttcttc
18421 ttcttcttct cttcttcttc cttcttcttc cttcttcttc
cttcttcttc cttcttcttc
18481 ttcttcttct cttcttcttc cttcttcttc cttcttcttc
cttcttcttc cttcttcttc
18541 ttcttcttct cttcttcttc cttcttcttc cttcttcttc
cttcttcttc cttcttcttc
18601 ttcttcttct cttcttcttc cttcttcttc cttctacatt
tccatgggca tttcccctgc
18661 ctggagggct ttctagctct cacatgcaat cttcagtgat
gttttcgtga ggtctatttg
18721 gggattccca gcaatgttgt tctgtgccat ttgttattag
gtatgggtgt tttatgctca
18781 tctttcattg gacaaatgat tgttttattt ctagaaaaga
aaattcatca tgttccccta
18841 cttcctatct tggatgcttg aaacatacat gtatgtgtat
gttagtgtgt atatatgtgt
18901 gtgtttgtgt gttcaaggtt ttaagctaga agcgggtaat
ggctttagga agtaatggtg
18961 gggaaatctg tcacagcatt atcacatgtt cccatctgct
gaaagcagaa aaatgattgt
19021 gagattgtga tgaccagagg gaagtgatga ccaagtgagc
ttatgttctg aaccaggttt
19081 ctgtaaatca cattaagcaa gtttctccct gttactccta
tgtccatttg caaggatgaa
10141 cacttataag tcatgtttac attctaccaa caaccactct
tttaacaggt aggtcttgtt
10201 ccattgtaca gccccagaga aaatatgcat gaacattcat
tcatgtataa gtacctttaa
10261 ggaagaatct gctttgttca aggcactgta aaatacacaa
agataaatta ccaagatgtt
10321 tcaaatccgt ggatgtacac aaataattat aataagatcg
agtcgactcc tttagtgagg
10381 gttaattgag ctcgcggccg cgagctctaa
This sequence has undergone a major mutational event. Please identify the following:
What type of mutation is this?
What is the name of the original locus?
What types of mutational events are commonly associated with this locus? (How does this relate to your answer to question # 6)
What
other sequence(s) are involved with this locus (that might not have been
involved before the mutation event....)
You
have been asked to help piece together what has happened at the following
crime scene: In the past 3 days, about 28,000 people in New York city have
suddenly died of apparent bacterial infection. Police have localised
the contamination to one of the subway stations. Inspection of some
broken glass from the rail track shows the apparent source: a light bulb
has been cut open and filled with freeze-dried bacteria, and then taped
to the rail. When the train came through, breaking the glass, everyone
was exposed to the highly contageous bacteria. You have managed to
sequence some of the toxin, and come up with the following bit of amino
acid sequence:
What
does this sequence tell you, in terms of possible mechanisms of how this
bacteria might have come about?? Do you think it is reasonable that
this is just an "accident" of nature, or do you think it is likely that
this was engineered by a person using standard recombinant DNA methodologies?
You have the following sequence cloned into a plasmid, in Escherichia coli; however, you seem to have a difficult time in keeping this clone stable. Please explain what types of problems might explain the instability.
1 ttaatgatta ttatgatgta aagttaaaag aatctcggtg atgctgccaa cttactgatt
61 tagtgtatga
tggtgttttt gaggtgctcc agtggcttct gtttctatca gctgtccctc
121 ctgttcagct
actgacgggg tggtgcgtaa cggcaaaagc accgccggac atcagcgcta
181 tctctgctct
cactgccgta aaacatggca actgcagttc acttacaccg cttctcaacc
241 cggtacgcac
cagaaaatca ttgatatggc catgaatggc gttggatgcc gggcaactgc
301 ccgcattatg
ggcgttggcc tcaacacgat tttacgtcac ttaaaaaact caggccgcag
361 tcggtaacct
cgcgcataca gccgggcagt gacgtcatcg tctgcgcgga aatggacgaa
421 cagtggggct
atgtcggggc taaatcgcgc cagcgctggc tgttttacgc gtatgacagt
481 ctccggaaga
cggttgttgc gcacgtattc ggtgaacgca ctatggcgac gctggggcgt
541 cttatgagcc
tgctgtcacc ctttgacgtg gtgatatgga tgacggatgg ctggccgctg
601 tatgaatccc
gcctgaaggg aaagctgcac gtaatcagca agcgatatac gcggcgaatt
661 gagcggcata
acctgaatct gaggcagcac ctggcacggc tgggacggaa gtcgctgtcg
721 ttctaaaaat
cggtggagct gcatgacaaa gtcatcgggc attatctgaa cataaaacac
781 tatcaataag
ttggagtcat tacccctggc gtccagcaca tcacgcgact agatgggcat
841 cttattcttc
tttgcttcaa tcctagagct tcttccagag aggatttaaa gtccatcatc
901 tcaaccagtg
agttgtagcc aaaggatgca gtgtgcaaat agtggtcggc atctcgtggc
961 gatgagatta
cgtaggtgta ccagtttgtc agcccctaag agcaagagtg ttcagtccgt
1001 tctacataag
tgccgatttt acactggctt gcggttggga tcacttgctt tgccgcgtca
1061 tgtccgccag
acgtgaggtg aaggacgttt tttctggcaa aataaccgtg gcaaaaggtg
1121 ataaataaca
acatttctca tttgactgtt agcatccgaa taataacttt gtgagcggtc
1181 ggtggtttag
cgattcgtgg cgatacccta aaggcagcca ataccctccg ggacggcgta
1241 attttgctta
aagttatatt tttcgacagc ggaacattta gcaaatgggc aaaaatcgcg
1301 ccatgctttc
cttcgttttg tcttttatat catagactta tttttgttta atgaatgttt
1361 aagatatgta
ttatttgagt aatctattaa tcagtaggct tttttatttt tattcccggc
1421 gtccagcgct
ataatctgcg gacgattttc tttgtggaat gtgcattgta gagcatgagt
1481 cgtttatccc
acctgtcgga acatagcgcc gccgccgtgt gcaaagggcc atcaagcctg
1541 aacgcgcgcg
gtcttcctgg cggtgaagac gatgtctctt cttatctttc cccacgtcgt
1601 aaacgctcct
ttctgacggt tgccagcatc ggctgggcgc ccgttgcgct tttttcggtg
1661 gcggtaaggg
catgcgtttt gccctgcgta attgagaaag tcctgatggt taacccggcg
1721 gcaaatgcgg
aaagtgtgat gcgtttggcg gccatgtgag ggaagataca tgttaaaagc

To use this, all you have to do is paste in
your sequence, and then click on the particular line of interest.
In this case, the RED lines are the best matches. Notice that you
have an insertion (hint, hint) into the middle of the gene. The red
line on the next line is the "insert" (likely a transposable element of
some sort). Please try this out and hopefully you can get it to work....

Back to the syllabus