50 points Total.
PLEASE read the questions fully before you
try to answer them!!
1.
(10 pts.) You
have sequenced the following human gene, and would like to clone it.
Please tell me what the name of this gene is, on what chromosome it is
located, and how you might go about trying to clone the gene.
1 gccggcctga ccatggcgtc
gggcgcgggg ctgggtgcgg tcgtcgaggg ctggcggcgg
61 cggcgggagg acgcgcgggc ggcgctggga
ctgctgggcc ggctgcccgt gctgcccgtg
121 gcggcggcag ccgagttgcc ccctgtgccc
gggggacccc gcggcccggg cgagttggcc
181 aagtacgggc tgccggggct ggcgcagctc
aagagccgcg agtcgtacgt gctgtgctac
241 gacccgcgca cccgcggcgc gctctgggtg
gtggagcagc tgcgacccga gcgtctccgc
301 ggcgacggcg accggcgcga gtgcgacttc
cgcgaggacg actcggtgca cgcgtaccac
361 cgtgccacca acgtcgacta ccgcggcagt
ggcttcgacc gcggtcacct ggccgccgcc
421 gccaaccacc gctggagcca gaaggccatg
gacgacacgt tctacctgag caaagtcgcg
481 ccccaggtga cccacctcaa ccagaatgcc
tggaacaacc tggagaaata tagccgcagc
541 ttgacccgca gctaccaaaa cgtctatgtc
tgcacagggc cactcttcct gcccaggaca
601 gaggctgatg ggaaatccta cgtaaagtac
caggtcatcg gcaagaacca cgtggcagtg
661 cccacacact tcttcaaggt gctgatcctg
gaggcagcag gtggccaaat tgagctccgc
721 acctacgtga tgcccaacgc acctgtggat
gaggccatcc cactggagcg cttcctggtg
781 cccatcgaga gcattgagcg ggcttcgggg
ctgctctttg tgccaaacat cctggcgcgg
841 gcaggcagcc tcaaggccat cacggcgggc
agtaagtgag ggtggagccc agtgagactg
Consider
the following "typical" (?) family:

2. (10 pts.) You work for an insurance company, and have been given the following sequence, isolated the DNA from the youngest child. What does this mean, in terms of the health prospects of this child?
1 tcgactcgat ctttcatggg
tcaaataaac acagaattga tagaatattt taataatcta
61 acaaagatga tttaataaat atttattaaa
ttttgttctc tagaaataag caatacactt
121 ttttaaggag ccatgaagca tttagaaaaa
aattgatcat atattatatc tcaaagaaac
181 catattaaat tatgaaatat aaagtgttaa
cagcctccat gttctgccta cactgaaata
241 gagctagaca ttaatagcaa aagtttaaat
gcaaaaatca gtagcttgga aaagagaaat
301 aatgataaca aaaaccaaat ttttttctaa
aaactccaga ttcaaacagg aaaggaaact
361 atataataaa ctgttaggag agtaagaact
ctatgtatca atattttaaa tggctaaagc
421 tgtatttagc agaaaattaa tagccctgag
aactttcatt attaaaggag agtaacaaaa
481 tcaatctgtg gaaaataaaa ctaatgaatt
aataaataga gactcctttt ttttctgtta
541 aataagacca acaagcaaat aatatttatt
taatgagcta gaaataggag gagcttacgc
601 agcataatac aagatattta ctggaaacct
gtagcaaggt atgttattta acgttaacgt
661 tgtaaatatt actttcactg ccacaaatga
gatgagtgct tgtgattttt gtaaaagtta
721 aatcccaaag aacaatttct gcttcccatg
tagtctgtaa accctcacaa aacacaaaca
781 gtagcaacaa caaaactctc tgagggatct
gaagaatgaa caaaggcagc aagattctcg
841 aagggggttg aagacacctg gaagtagtat
aggttaaatt tcttatgact attatcctga
901 ggtcaagcca caggtagggc cacacagagt
ggctgaaact cctgtaggaa gctcagtctt
961 tctggcctta tgaaccaggg gacagagttt
ggagcaaccc cagctactgg aaagtaaggg
1021 aggaatgtca ttaaaaagag ggccagaaaa
ggagcattac aaatttttgg tgtaaattct
1081 gctcaaatct ctggctgacc tttggaatat
acatacacag gcagaccaaa gcagtcagtg
1141 aatttgataa tatcaatata gattttctct
atgaagaaca gagattgtat ttttaaaaag
1201 gattaaaaaa aaagcaacca gagcctcagg
ttcctatggg ataattcttt taacatatga
1261 gtaattggtt ccccagaagg aaaggtgata
gagaatgggt cataaaaata ttaggttggt
1321 gcaaaagtaa tcgtggtttt gcaattactt
ttcattacaa aacccacaat tacttttgtg
1381 ccaacctaat atttgaagaa acagtggccc
cacacacttc ccaaaattgg tgagagataa
1441 atgtacacgt tcaagatgct gaggcacatt
gtaatcaaac tgctgaaaac ccaagacact
1501 atcttgaaaa tagccggaga aaaacaacac
actatataca tgaaactgca ctgctaaata
1561 ctgccaacct accttcaaaa actatgaaag
caagaagaca atgaaacaat atcttcaatg
1621 tgctgtaagg agataagact atccaatccc
gaagtccata tccaaagata ctcatacttc
1681 aggaacggag tcaaaataaa gttgcattca
gaaaaacaaa aacagaatat ttattgccag
1741 cagaccaact tacactgtaa aaaatgcttt
gttctttggg atgacaagaa ataatagatg
1801 aaaactcaga tccttatgta aaaataaaga
aatcataaat agtaagataa atataaaaat
1861 caattttttc ctcctaatgt tttaagaata
tgtatgtttg tttaagtaaa aactatagca
1921 tcatttttat gggatttata atacaggtag
atgtaatttc aatgacaact tgaatttaga
1981 gggagtttgt aaatagattc atacatgtgt
gaggtttcca tattgtatgt aaatagtata
2041 atagtaattc taggtagact gtaaaaagag
aggttggatt gaattcctgg taatatgtgt
2101 ttaaatggaa tgaatactcg ttcttagaat
gaagagaagt gtgcagaatg gaactttgca
2161 aacccttact ggcccatatg tcacggggtg
gagatgtcac cattttctat atggctccag
2221 atcaagcaat ctcttgacaa aaagcaagct
ggcttcgagt aaccttcaaa cggagtgtac
2281 caggccaatg cagaagacat aacatggttt
gctttctttc tcccgtccat tttagtatct
2341 taagtaccag cagatagaag gtccagaaaa
aagaaagagg atgatattgt aggggaggcc
2401 aaattttctc tcttagggtt ttttagctgg
gcctgaggat tgaattgaca taaggcagat
2461 cagcgggaga aaagcataca aatttaattc
attcaatttt tatgtgagca caagagccct
2521 cttaaggaaa tgaaacccaa agacagagtt
gaaagttgaa cacctatgta ctgaattggg
2581 aaaggaatag aaaattgtga gtatgtgaca
aagccaacgg gcttgggcta gggaggttaa
2641 ttgggtagag aagtaactag gaagataagg
gttactttaa taaggtgtgt ttgttcagat
2701 ttctctcagc atcaacttct cgtctttgat
gataagaacg ctactttccc tctggcgtag
2761 ggaggacaag ttatctcctg tttctaagga
aaagaagaaa gctcagagta tattttttgt
2821 atctgctgtt tttccaagtg tctttaagct
caaaatagtc aatatactag agcaacatat
2881 tttggagtat tgtgttctga actccttcaa
tctttagcat ttctcttctg tcattcttca
2941 ggctcttgag cacttttgag aaactaatat
gaatgggaaa gatgtaatat ttactcccag
3001 gaaggttgct ttggtgatct agtttgtgaa
ctagactcag gaaactgtct gggtattgga
3061 aaggaagtca cggatgcgca atggaatttg
taaaggggtg gcacagctct tgaatactct
3121 ccttggaaga gaccctggca tggagagaac
agagtattaa cctcactctg aggttgggaa
3181 gactaaggga atctgaggtg tgaggaggga
tggagacgac aggtcacaca atgtgatgga
3241 aaaccgcaca tggagtggaa agaacatgaa
ctctggagtc tggaactcct atatctggac
3301 ttgacaccgt attctgctat gtggccttcg
gtgagtgtgc caagtagaat attggctctg
3361 caaacgtatc cacatcaaat ttctagagct
tgtgaatgtg accttgtttg caaaaaaaat
3421 ggtccttgca gatatgatta aattgaggat
cttgagatga ggaaatcatc ctggattatt
3481 ttgggagacc taaatgctat tacaagagtc
ccaccaacca ccagaagctg aaagtggcaa
3541 ggaatggaat ccctcctaga gcctctggag
ggatctttgc tcttaaacat cttaattttg
3601 ggcttctggc ttccagagct gtgagagcca
caaagtttgt ggttattttt tattgcagcc
3661 acagtgaaga agataataaa gccagctact
gacctctctg aatctcagtg ccctcatctg
3721 taagatagaa tcagctagct actttatagc
atttttccag gattaaatga gacatgtaaa
3781 atagaaaaca aacaaacaag caaacaaaga
aaagataagg aacatgctga gaaaaaaata
3841 tagacacccc ttgtttgatg ttactaattg
attcctgatg atcagatgta ttacccaaaa
3901 atgatttcct tgagacagtt tgccattact
taaatgggaa aaaatgtgtt atatgtcaaa
3961 ccaaccctct agtgaatttc ttgaagtgta
agtaaaacac agtaaaataa ccatagtaga
4021 acaataatgt cacactgaga cttaaaatgc
aggcatacct atatctcaga gatactgcat
4081 gttcacttcc agaccatcac aataaagtga
atattgcaat aaaacaaggc acacaaactt
4141 ttttgtttcc cagtgaatat aaaagttatg
tttacagtct atagtagtct agtatgcaat
4201 agcattataa aaaaaatatg ttatcttaat
ttaaaaattt tttttgctaa aaatactggt
4261 gatcattttg aaaccctcca ggtaattcca
taatcctttc ctgcctggtt ggaagggtcn
4321 tttgcctttg atattttatt ggcttgccga
actggtatca aactgggtgg ttgttgaagg
4381 gcagaaatgg ctgtggcaat ttgtttaaag
taagaaaaca gtaaagtttt caacatcagt
4441 ggcttcttgc tttcacgaat gacttctctg
ttgcatacag cactgtttgc cagcatttta
4501 cccacaataa aactcctttc aaaatgagtg
aatcctctcc aacccttctt tatcaactaa
4561 gtttatgtaa tattctaaag cttttgttgt
catttcaacg atgttcataa catcttcttc
4621 aggagtagat tttacctcaa gaaatccttt
tctttgctta cccgtaagaa gcaactcctc
4681 actcattcaa gttttattgt aaaattacag
taattcagtc acatcttcaa gctttacttc
4741 tgattctagt tgtcgctatt tcttccacat
cttcagctcc ttcctccact gaagtcttga
4801 acccctcaaa gtcatccatg agagttggaa
tcaacttctc ccaaacacct tttcatgtta
4861 atattttgtt gtcctcccat aaatcatgaa
tgttctcaat gacatccagc atggtgaatc
4921 ctttccagaa ggttttcatt ttactttgcc
cagatcaatc tgataaatca ctatgtatag
4981 tcttacaaat gtattttctt aaataatgag
aacttgaaag tcaaaattac tccttcaccc
5041 atgggctgca gaatggatgc tgtgttggca
ggcatgaaaa caacattcat ttccttttac
5101 atctccatca gaactcttgg gtgaccaggt
acattttcag tgagcagtaa tattttgaaa
5161 ggaatctttt ttctgagcaa taggtctcaa
gagagggctt aaaatattca gtaaaccata
5221 ctataaacag atgtactgtc atctaggctt
tcctgtttct tttcagagaa catgaagagt
5281 tggtgtagta tcattcttaa gggccttagc
attttcagaa tggtcaatga gcattggctt
5341 caacttaaag tcaccagctg cgttagcagc
taacaaaaga atcagcctgt tctttgaaga
5401 tttgaagcca ggtatcgact tctcctctct
agctaggaaa atcctagatg gcatcttctt
5461 ccaacagagg gctgtttcat ctctcttgaa
aatctgcctt tggccgggtg tggtggctta
5521 cccctgtaat cccagcattg tgggaggccg
aggcaggcaa ataatgaggt caggagttcg
5581 agaccagcct agccaacatg gggaaaccat
gtctctacta aaaatacaaa aaattagcag
5641 ggcgtagtgg tgggcacctg taatcccagg
tacttgggag gctgaggcag gagaattgct
5701 tgaacccggg aggcagaggt tgcagtgagc
tgagatcgca ccattgcact ccagccctag
5761 tgacagagtg agactatgtt tcaaaaaaaa
aaaaaaaata tgctgtttag tgtagccacc
5821 ttcattagtt atcttagcta gatcttctgg
ataacttgct gcagcatcta tataagcact
5881 tcctgctttg ccttttatgt tctggagatg
gcttctctcc ttgaacctca tgaatcagcc
5941 tctgctatct tcaaactttt cttctgcagc
ttctttacct cttttagcct tcatagaatt
6001 gaatatagtt agggcctttc tctggtttaa
gccttgggtt aagggaatgt tgtggctagt
6061 ttgatcttct atccagacca ctcaaacttt
cccatatcct caacaagcct gtttcacttt
6121 cttatcattt gtatgttcat tggagtagca
cttttaattt ccttcaagaa cttttccttt
6181 gcattcacaa cttggctatt tggtgcaaga
ggcctagctt tcatcctagc tcagcttttg
6241 acatgccttc ctcgccaagc ttaatcattt
ctaatttttg atctacaggg agagacatgg
6301 gactcttcct ttcacttgaa cacttagaag
ctgttgtctg cttattagtt ggactaattt
6361 caatttgttg tgtctcaggg aatagggaag
cctgaggaga gggaaaaatg gctagtcagt
6421 gcagcagtca gaacacacaa acatttactg
ataaagttca ccatcttata ttgggcccca
6481 taacacttac aataggaaca tctaagatca
ctgatcacag atcaccacaa cagacataat
6541 aataatgaat tagtttgaaa tattgggaga
attaccaaat tgtgatacat agacgtgaaa
6601 tgaacacgtg ctgttggaaa aatggtgcca
atagacttgc tagatgcaaa gttgccacaa
6661 accttcaact tgtaaaacca cattatctgt
gaagtgcaat gaaacaaagt gcagtaaaac
6721 aaggtatgcc tgtacttaaa aaaattatat
cctcgatcac ttgtacttaa aaaaattata
6781 tccttgatga cttctacaat taacatgtta
taaaatgttt gctttgtacg tagtagctat
6841 tctcacttgt tttgttttct tctctacaat
tgtcttcagt gttttctcaa cagtttgggg
6901 atttcaacaa taagcaaaaa aagtaaaatg
gggaaggcac tgggaaatgt gctctagtgc
6961 ctggtggcca agtgtggctc acactgatgg
tatgatggcc ttttagcatc agcaacttct
7021 gaaatgggtg tttcgtgtgt tgtcactcag
cttgctttcc aaaatatcta tatattattc
7081 tgatgttcta ctgaatatgt tttgaaaata
tcttgatgat tttcctatat ttccttttat
7141 tgaaagtttt gcagagtttg gtattataaa
tctaaagttt tgctacatga acattggcaa
7201 aaaaaaaaaa aaatagatga tgaaatttgg
attttaaaaa ccaagagtca ttttaccatg
7261 aaaatgttaa tcgaggcaac atcattcagg
ttatgcctgt atttgcagaa tgtttcatag
7321 tagaaaatta atatgaaata taaaaagagt
taatacaaac aagtaatcag aaataaaaat
7381 caatagaaaa ataactaagg ggtctgaaga
aacataaatt ggagaagcaa cagaaatgga
7441 caattaatga atgaaagaat gtacaactgc
aaaagtaatc aggaaatgcc ttctaaaatc
7501 agcacagtat agctttttac atctatccaa
ttgtcaaaat taaagggaaa aataaatcta
7561 acaattctag gtgttggaga ggttgtaaag
caatgaagat tatcattcac agagcaaact
7621 ggtacagcca ctgtggaata tagtgtggca
tcatccagaa tagttgaaga catgcaactc
7681 ccatggccca catgccaagg ggaatgtgca
caacatactt acaatagtat ctttttgata
7741 attcaggaaa acgtacatgt tcaccaatag
ctaatacata aatattatta ttcaactaag
7801 agacgatttt cgaaaagcaa aaaaagaatg
agtaaaaaca acatatagta acaagaataa
7861 ctataaatat atgttgagta aagatgtttt
gaaaatgagt ttcagaagaa tgcaatctaa
7921 taccattata taacaattta aaacatgcaa
aaaaaacccc atcattttca gctagttggc
7981 aaatgtagtg gaattgggca tggggatatt
ctccctagaa aaagaatttt ttcatgtttt
8041 ttgatctagt aatctcattt atagaaatct
ttcttaaaga aatagagatg caggcaaaaa
8101 ttcattatta tactttggta ttatagtatt
tatagaataa ttaagtactc taaattatct
8161 gtcagtaaac tgtgaaattg ttaaataaat
tatggaacat aaggtagtgt tttaaaatct
8221 ttcaaaaatt atatctaata atatgggaaa
attgcattgc tatgtaatcc gaatttggtt
8281 taaaaattaa acaaatagca aaaaaagaag
gcaaatatat aaataagact gggaactata
8341 aaaacactat caatgtttat ctctgagtga
tagaataata tgtgattttt aaaaatgttt
8401 ttctgtaatt ctcaaattag gtgcagtaag
cagacatgac ttctacaatc ataacaggta
8461 gcttcattga cctgtttcta aaatgggtag
cactttttaa ggattgaagt tgttttgtcc
8521 agttggacag attcttatcc ctttgtcttt
tcttgtattg ccaacttctg cctatacttg
8581 aaatagtagg gcagatttct atatgtaagc
atttaaaggg tctgcaagaa atactaagcc
8641 atgtaggaac tgtgaatatt aaatgagata
atgaatgaaa agaagcctag cacagtccct
8701 agcccaagta agtacctatt attattatta
tcattgactt tactacggct ttggtgttat
8761 gtcacagaca aaagtttgca aaccactttt
ttttttgctt gttttttttt tttaaggact
8821 caaaacagtt acctttgagt actaaagtcc
ttaagatgga aaacgagatg atatttatta
8881 tgccactgat atgtttcagg aagagcttgt
ggagagagca gacaaaacaa atggtaaatc
8941 ttgttcttag aaagaaagag ctatctgtcc
ccattacagc cagctgtcat gcccaggctg
9001 gagccactct tgggacaaaa ttagggacac
ttgaaaatgg taaatcttgg aaactgagag
9061 ctctgtattt ttcaacttag gatcataatc
tattaacccc tatttctatc ccagtttttc
9121 tcttggataa ctgatttcat ttgtgcaact
tgcattgaga aggtcaggat aaggcctgga
9181 aattgtcaaa ctctagattt tgatgacatt
ttggtaactc cttacaggaa aatctatcag
9241 ttacatggtc caattgtagg ggtgatattt
actaattttg cattctattg aacagcgata
9301 tataatggaa tgtatgttgt gaatcttcaa
taagtgtttc ttttctctta taacccaatg
9361 aattgacctc tataataaca tgcagttgct
aaatgaaaaa agaattgtat ttaagcagaa
9421 aaaagtttta gaaaatgtga ttttgtatgc
caaaatatgc atattagtag cttttatgag
9481 aagtgtcctt gaaggaaaat atggtgaatt
tgtatctggt tcaggtcttc ttttaataac
9541 cagaaaacaa ggctgaaaca agcacaaggc
agttctatct gtttgaatga tataaattgg
9601 ttgttaaaaa taaaaataaa atgaaaatga
aaaccaagat aacaatacca aggataagtt
9661 aaacaaaaag ttggtttttt gaaaagatga
acaaaattga taagcctctg gctgtactaa
9721 ccaagaaaag atccaaataa aagtcagaaa
tgaaaaagaa gacattacaa atgataccat
9781 aggaatacaa aagatgatca gagactacta
tgaagaagtg tacactcaca atctagataa
9841 tgtagaggaa attgacaaat tcttgtaaac
atatgactgc ccaagattga accagaaaga
9901 aatagaaatc ttaaacagac caataatgaa
tagtaagatt gaatcaataa ttttaaaacc
9961 ttccccctaa aatgcccagg actggatggc
ttcatagcca aattctacaa aatgtgcaaa
10021 gaagaactga tactcatcct actgaaactc ttccaaaaaa
atcaagatga ttaaataaag
10081 aaaatgtgct ctgtgtgtgt atataagtgt atacatatgt
atatatatac acacacacac
10141 atacacatat acatatgtat acacatatat acacacacac
cacatgtata tacatataca
10201 catatataca tacagcacat tttatttatg tatatgtgtg
tgtatatatg tatacataat
10261 agaatactat tcagccacag aaaatgaatg aaaacatgtc
ttttacaaca acatggatgg
10321 aataggaggc cattatctta agtgaaaaaa ctcagaaaca
tagtgaaata ccacatgtcc
10381 tcacacgtgg gagctgaata atgtgtgtat gtagacatag
agtatagaat aatagtcact
10441 ggagactcag aatgatgata gggtgggagg ggagtgaggt
agtacaatgt acactattta
10501 ggtgttgatt acattaaaag cccagacttg accactatgc
ggtatatcca tataacaaaa
10561 ctgcacttgt acccctttaa tttacgtaat taaaaaattg
gctgggcaca gtggctcatg
10621 cctgtaatcc aaacactttg ggaggcccag gcgggcagat
caattgagcc caggagttgg
10681 agactagcct gggcaacatg gcaaaacacc acctctacca
aaaaaataca aaaattagct
10741 gggcatggtg ccacatgcct gtaatcccag ctactatgga
ggctcaggtg agaggatccc
10801 ttgaacccag gaggcggaag ttgtagtgag ccaagatggc
gccacggcat tccagcctag
10861 gtgacattat gagaccctgt ctcaaaaaca acaataacaa
caataacaac aaaagaaaaa
10921 attatgctca ctaaaagaaa aaaacagtaa ttaaaaaaaa
tgagttcttg tggtgaaaca
10981 taatggcacc tcctggttcc ttcttgcatt tgaagattat
agatgaggat tttcagggaa
11041 tggtcaagat caaaacctat ttggtctccc aagattttcc
aggagaggct gtaggcctct
11101 tcctagctta cctggcatgt acatactaac tgcaggtagc
agctgtatga agttgttggg
11161 taactcattt ttggttgtgt tctctaggtg acacattagg
catttttgtc tacctttaag
11221 tcctccgttc tcacctctgg tagagtgtaa actccataat
gagagagact gcttctgact
11281 tgtccacctc tgtaccctaa tttaatgaaa acatgttgaa
caaataaaga ataaacctag
11341 tggtggtccc aatcatccac ctgcccttcc aaccatcgtc
ctttccagat tgcagccttc
11401 tatatcagaa gtgtcagcct ttgttttctt gctctgggaa
gtgatgattt aagttccaca
11461 ccaccaaagg attgtcaaat acacacaaca aaaataaact
tgagctaaaa ttgtgatcta
11521 atatgcagag agattgaaaa agctaagaaa ataaattata
ccactttggc catgcagata
11581 caggaaccat ctggaagcat ctcagcaacc tgaaagggag
catctgatgc ctcgaattgc
11641 cctgtccgtc agctttagct gctattccca tttgcttctc
ttaaatgtat ttaacttggg
11701 acaagttggg agccaccttc tcttttctga ggtgatctta
gaagattgat atacgtaagt
11761 tgcagcttgt ttggattgtt tatctgtatt taaaggctct
accaggcatt gtattttgct
11821 gttaagataa aatcaataaa taaaaactta aaaaagaacc
agactaatcc ttgaacattc
11881 ctaaacatat actctctcct gctttcaaaa aggaacactg
cttaatgttt acttattgta
11941 gtcttctgag aatttgattt gctgtgattg cacctttatt
caaaagtaca gaacctaaaa
12001 atggtacaaa gaaagtttat ttcaaatcag atgcatttgc
aagccagtgt cattaaaaaa
12061 catgctgttg gtaaggctga ggtcagagtc tatcatacag
acagatgtga tttattgcaa
12121 aagagagccg actactgctt aggtatcagg tgccaccaat
acataactac caccaacgaa
12181 aggctggttg tgatgacggt attggaagct ggtgagtcag
actcattaca attttcatgt
12241 aatctctaaa cttccttgtt aaactatttt ctactcttcg
taacttttga aaagagagaa
12301 tatgggagaa cttgagcttc ctctttctcc ttctctggct
tcttttgttc ccgggcagag
12361 acattagcta gcagtcagat atctgagaac atggtaagag
aaataagagc aggaagcctc
12421 ggcaactcac ttgataatta tttttcatca tttagtagca
ggtggtagaa taaataggat
12481 catttgtaaa gttcaagtgt tgatggtggt agtggagaga
gatactaaag cattttagtt
12541 ttcaagccac aatgaagtaa ctggtatcat tcttgacctc
atttcatgaa taactataaa
12601 attagataaa atgtatgcgg caactgtttt cagaaattgg
ctaacaggca atgcaggctg
12661 tgacccttga gacaagtaaa cccctgaggc aagccttgca
ttcacttggc tttctgccta
12721 ggggtgcatt cactggactg gctagtaagg aagtggggcc
aancattaca tgcttgtttc
12781 aatgaactga taaggcagag tttgtacttt gttgctgctg
aggggctgga attagcagag
12841 tagggtactg aataagagag atgcatgagg taggcatgtg
gaggtagtgt acatggaggt
12901 ttctttgcag gtcattgtct gaaggattga ctatgtataa
gcagggcaag acaccattaa
12961 ggcctagcag agaaggttca ctgtgggacc aagagccgaa
caaagacact agtggttgcc
13021 cagagatggg agacgttgga attcttagtc tgagagggca
gatttccctg agaacctggg
13081 cattcagttg aaaccccaga aaggccagcc tcaggagtga
gacctgtaca ctaccataag
13141 gctgtgttct acaactaagg acaaaattga actagaccag
ttctaacaat gaaaggttca
13201 agagaatatt ttagaaatgt aatttcctgc cagaaaaaaa
ctcagtaccc tataaggaag
13261 acaatataat gcagattccc taaatgtgtc gccctgtgtt
tagcatacaa caaaaaaatt
13321 actggacgtg tgaagaggaa gctccattaa tcagaaaaaa
gcaataaata gaaacagact
13381 ttgaaataac tcagacatta gaattagaaa atgaggaatt
taaaataact agataagtat
13441 attcaaggac ttgacataaa atgggtgaca taaaatgaag
aggcgggaaa tctcagagaa
13501 acaaatgtaa actataataa agaaacaagt ctatattata
gaactgcaaa taataatttc
13561 caaaatgaaa agtttactga atggcttagc aacatattag
aaactacaga aaagatcagg
13621 aaactttaag acagatcagt agaaattgtc tgatctgaag
aacaggagga aaaaatagaa
13681 aaaatatatg aatcttcagt gatctgtggg gcaaaagcga
gaagtctaac atttgtacaa
13741 tttgaggtcc agaaatgagg atagagagaa tgagggagga
aaaaaaatag tttaaaaaaa
13801 tgaacaaatt ttcccaaata tagtgaaaaa atcaaactac
aggttgaaga tgtttagcaa
13861 accaaagcag gataaaaaca aagaaaacca tacctggatg
tatgacggcc aactgataaa
13921 aatccaaaag taaagaagaa agctgctgag gatgggagga
gtggagagac attacatata
13981 ggagaacaac agtaagagtt attttagact cctaaataaa
aactattcaa gccagaagac
14041 aatagaatga catttttaaa ttcttttttt aaatttaaat
ttattttatt ttaaattctg
14101 ggaaacgtgt gcaggatgtg tagctttgtt acatatgtaa
acatttgcca tggtggtttg
14161 ctgtacctat caaaccatca cctaggtatt aagccacaca
tgcattagct atttatcctg
14221 atgctctccc tcccaccgct gccccccatc ccaccccaga
caggccccag tgtgtgttgt
14281 ttccctccct gtggccatgt gttctcattg gtcagctccc
acttatgagt gagaacatgc
14341 agtgtttggt tttctgttcc tgtgttagct tgttgaggat
gatctcttcc agctctatcc
14401 atgcccctgc aaaggacatg atctaattcc tttttatggc
tgcataaaca aaacctaagt
14461 tttagataaa caaaacctaa gaaaatgtac tgctagtcaa
cccgcattat aagaaatgtt
14521 aaagcaaatt cttcagccta aaagaaagtg aaagcaaatg
gaaacagatt aacaggaagg
14581 aatgaggagc acagaaatgg taaatatatg gaataatata
tatattttta aactgtcttt
14641 attttctgtt aagctcttac ggggtaactt actatttaaa
gcaaaactca tgacaatgta
14701 ttgaaaggat tataatgcaa atagaagtaa aacatgtgac
agcataatgg acaaggaagt
14761 aataaatgtc aggttcttat attttacatg agatagtata
atattaattt taggtagatt
14821 ttgataagat aaggatgcat attgttagtc ctagagcatg
cacgcacaca cacacacaca
14881 cacacacaca cacacacaca cacacacaca cacacacatg
ctacaaagag gtataattaa
14941 aaaccctaat agtggtataa agatggaaac tttagaaata
tgcattttaa cagaaagaag
15001 gcaagaaaga aggaagaagg aaacaaaaaa agatggcata
aatattcaga tggtagactt
15061 aaaactgaat tatatcatta cattaaatat aaatggacta
aatgtcccaa ttaatagtca
15121 tagattgtca atatggatac tgtcaggcct ctgagcccaa
gccaagccat cgcatcccct
15181 gtgacttgca cgtatacgcc cagatggccc gaagtaactg
aagaatcaca aaagaagtga
15241 atatgccctg cctcacctta actgatgaca ttccatcaca
aaagaagtgt aaatggctgg
15301 tccttgcctt aagtgatgac attaccttgt gaaagtcctt
ttcctagctc atcctggctc
15361 aaaaagcacc cccactgagc accttgcgac ccccactcct
gccagccaga gaacaaaccc
15421 cctttgactg taattttcct ttacctaccc aaatcctata
aaatggcccc acccttatct
15481 cccttcgctg actctctttt tggactcagc ccgcctgcac
ccaggtgaaa taaacagcca
15541 tgttgctcac acaaagcctg tttggtggtc tcttcacacg
gacgtgcatg aaagatacaa
15601 aataaataaa acccaataat atgctgacta caacatatgc
actttaaata caaaaacaca
15661 aagagttaaa agtaaaagga atacaaatat atgttataga
aataattttt agaaagccaa
15721 tgtggctaca tcaattacac aaagtaaact ttaagacaag
aaggtttacc aaagataaag
15781 agggacatat cataatgata aaggggtcaa ttaagaacaa
ataataattc caattttttt
15841 ttcaagacag agttttgctc ttgttgccca ggatggagtg
caatggcatg gtctcagctc
15901 cctgcaatct tcttctcctg gtttcaagtg attctgctgc
ctcagcctgc caagtagctg
15961 ggattacagg tgcctgccac tacacccggc taatttttgt
gttttttttt tttttttagt
16021 agagatgggg tttcaccatg ttggtcaggc tggtcacgaa
ctcctgacct cagctgatcc
16081 acttgcctca gcctcccaaa atgctgggat tacaggcatg
agccactgca cccggccaat
16141 aattctaaat gtttactcaa ctaaaacttt aagaactttc
agatatagga atcaagagcg
16201 ggattttttt ttcaatattg tattatgaaa aaattaaacg
tatatgaaaa ctgaagtaat
16261 tttatagtga acaccctgtg taaccaccgc ttggtgtcta
acattaacat tctactgtgt
16321 ttattatatt acatttgtac ctatctgtta ccccttcttt
ctatttgtca attcatcagc
16381 tgcagtgatt ttgaaactct actctgtaaa gttccaggaa
ttgtgtgtgt gtgtgtatgt
16441 gtgtattttg gaagtggggt agtgttttga gaacactagt
agtcctcatt tgaccaaata
16501 atttatactt ttatctatta ttctatctta tatattggac
ttctgtttag gttacctgaa
16561 aacatggttt tgtgagtaaa ggaaaatttt gaaagctttt
atcaagggta ttctgagggt
16621 aatccactct cccaatctag ttaagcataa tctagaaacc
tgaccatgtg ttccttgaag
16681 acactgagtg agtgtaccca atttggcagc ttttgtgaga
gcaaggacca tccttcattt
16741 ttaattccct tcaggttcat gcctacagtg aaaagtcagt
acatcttgct ggatgaatga
16801 aggtgggcaa tgtatttatt ctatataata ttgtattttg
tgggcacacc tgtgagaata
16861 tgacacaggt tggacagttc tacctggctc ttagctatgt
taattaccaa ctaattatga
16921 ggttgtttta ttttcagaac cccgttaagt ccatttatta
attagaataa gactataaat
16981 tccctctgtt taatccaaat ttcactatct ggcatccccc
aaatcagaaa tgttaacact
17041 ggaacattgc ttgctttcaa attaaatagg taagatttga
agagaacttt ggagtttaag
17101 aggttacatc taatgtgttc taaatattag gactcgtttt
tcttctgatt acacccctac
17161 aaagtgccaa gcctttgata gctcattttg tgttttctga
agttctagtt ctcctggtca
17221 cattctccag caaggccctg tgtattaagt gtaaccagta
atggaaaatc ctctacatag
17281 aatgtactca gaaaatttga ggagcttaag cacttgactt
ctctttggaa acacagtgga
17341 tatttgtgtc gctactgtac acaaaggagg agacactgag
aaataaagaa ggttatccca
17401 agtccccagt aaaacaggat gaggttttta ggtttgagga
aagtgcctca ttcactccat
17461 tcctgcctca ggatactgaa aggcaggttt ctaggctgag
ttttctactg taggaaatcc
17521 tattttttac ttatcaagaa tgcattagag ttggaagaat
tacagtcttc ctacaccata
17581 tgctaatgtt aattccagct ggattaaagc aataaataaa
tgtataaaat caatcattag
17641 aaacccagaa tagaaatgag tatttatccc tactggagtg
agaggggact tcataaactt
17701 gaagacaaag agaaagatcg tttctgaaaa gagtaataga
tttgactcca ttaaaaatta
17761 aaacttctgt ttgccaaaaa gcaaatccac acagtgataa
gtatgtgcca cagatatgag
17821 ggacaagtga ctaggaatct taaaagacca agccaattga
taacaatgca tgattacaat
17881 ggttaaatgg gaggtcaggc atccattgtt ttcctgccta
atgctgttat tttcaattct
17941 ttcttgctca gtgggagaac tcaattgccc agggtctctt
cctgcttgct tttctatcat
18001 tcctatttct gagccccagc aggggctggg ttcttcttct
cttcttcttc cttcttcttc
18061 ttcttcttct cttcttcttc cttcttcttc cttcttcttc
cttcttcttc cttcttcttc
18121 ttcttcttct cttcttcttc cttcttcttc cttcttcttc
cttcttcttc cttcttcttc
18181 ttcttcttct cttcttcttc cttcttcttc cttcttcttc
cttcttcttc cttcttcttc
18241 ttcttcttct cttcttcttc cttcttcttc cttcttcttc
cttcttcttc cttcttcttc
18301 ttcttcttct cttcttcttc cttcttcttc cttcttcttc
cttcttcttc cttcttcttc
18361 ttcttcttct cttcttcttc cttcttcttc cttcttcttc
cttcttcttc cttcttcttc
18421 ttcttcttct cttcttcttc cttcttcttc cttcttcttc
cttcttcttc cttcttcttc
18481 ttcttcttct cttcttcttc cttcttcttc cttcttcttc
cttcttcttc cttcttcttc
18541 ttcttcttct cttcttcttc cttcttcttc cttcttcttc
cttcttcttc cttcttcttc
18601 ttcttcttct cttcttcttc cttcttcttc cttctacatt
tccatgggca tttcccctgc
18661 ctggagggct ttctagctct cacatgcaat cttcagtgat
gttttcgtga ggtctatttg
18721 gggattccca gcaatgttgt tctgtgccat ttgttattag
gtatgggtgt tttatgctca
18781 tctttcattg gacaaatgat tgttttattt ctagaaaaga
aaattcatca tgttccccta
18841 cttcctatct tggatgcttg aaacatacat gtatgtgtat
gttagtgtgt atatatgtgt
18901 gtgtttgtgt gttcaaggtt ttaagctaga agcgggtaat
ggctttagga agtaatggtg
18961 gggaaatctg tcacagcatt atcacatgtt cccatctgct
gaaagcagaa aaatgattgt
19021 gagattgtga tgaccagagg gaagtgatga ccaagtgagc
ttatgttctg aaccaggttt
19081 ctgtaaatca cattaagcaa gtttctccct gttactccta
tgtccatttg caaggatgaa
10141 cacttataag tcatgtttac attctaccaa caaccactct
tttaacaggt aggtcttgtt
10201 ccattgtaca gccccagaga aaatatgcat gaacattcat
tcatgtataa gtacctttaa
10261 ggaagaatct gctttgttca aggcactgta aaatacacaa
agataaatta ccaagatgtt
10321 tcaaatccgt ggatgtacac aaataattat aataagatcg
agtcgactcc tttagtgagg
10381 gttaattgag ctcgcggccg cgagctctaa
3. (10 pts.) This sequence has undergone a major mutational event. Please identify the following:
What type of mutation is this?
What is the name of the original gene?
What does it do?
What
other sequence(s) are involved with this gene (that might not have been
involved before the mutation event....)
4.
(10 pts.) You
have been asked to help piece together what has happened at the following
crime scene: In the past 3 days, about 28,000 people in New York city have
suddenly died of apparent bacterial infection. Police have localised
the contamination to one of the subway stations. Inspection of some
broken glass from the rail track shows the apparent source: a light bulb
has been cut open and filled with freeze-dried bacteria, and then taped
to the rail. When the train came through, breaking the glass, everyone
was exposed to the highly contageous bacteria. You have managed to
sequence some of the toxin, and come up with the following bit of amino
acid sequence:
5.
(10 pts.) You
instruct your research team to try and design some PCR primers, based on
the sequence from problem #8, and eventually, after several weeks of sleepless
nights, come up with the following sequence:
1 gttaactgtg gtggttgtca
ccgcccatta cacggcatac agctatatcg agccttttgt
61 acaaaacatt gcgggattca gcgccaactt
tgccacggca ttactgttat tactcggtgg
121 tgcgggcatt attggcagcg tgattttcgg
taaactgggt aatcagtatg cgtctgcgtt
181 ggtgagtacg gcgattgcgc tgttgctggt
gtgcctggca ttgctgttac ctgcggcgaa
241 cagtgaaata cacctcgggg tgctgagtat
tttctggggg atcgcgatga tgatcatcgg
301 gcttggtatg caggttaaag tgctggcgct
ggcaccagat gctaccgacg tcgcgatggc
361 gctattctcc ggcatattta atattggaat
cggggcgggt gcgttggtag gtaatcaggt
421 gagtttgcac tggtcaatgt cgatgattgg
ttatgtgggc gcggtgcctg cttttgccgc
481 gttaatttgg tcaatcatta tatttcgccg
ctggccagtg acactcgaag aacagacgca
541 atagttgaaa ggcccattcg ggcctttttt
aatggtacgt tttaatgatt tccaggatgc
601 cgttaataat aaactgcaca cccatacata
ccagcaggaa tcccatcaga cgggagatcg
661 cttcaatgcc acccttgccc accagccgca
taattgcgcc ggagctgcgt aggcttcccc
721 acaaaataac cgccaccagg aaaaagatca
gcggcggcgc aaccatcagt acccaatcag
781 cgaaggttga actctgacgc actgtggacg
ccgagctaat aatcatcgct atggttcccg
841 gaccggcagt acttggcatt gccagcggca
caaaggcaat attggcactg ggttcatctt
901 ccagctcttc cgacttgctt ttcgcctccg
gtgaatcaat cgctttctgt tgcggaaaga
961 gcatccgaaa accgataaac gcgacgatta
agccgcctgc aattcgcaga ccgggaatcg
1021 aaatgccaaa tgtatccatc accagttgcc
cggcgtaata cgccaccatc atgatggcaa
1081 atacgtacac cgaggccatc aacgactgac
gattacgttc ggcactgttc atgttgcctg
1141 ccaggccaag aaataacgcg acagttgtta
atgggttagc taacggcagc aacaccacca
1201 gccccaggcc aattgcttta aacaaatcta
acattggtgg ttgttatcct gtgtatctgg
1261 gttatcagcg aaaagtataa ggggtaaaca
aggataaagt gtcactcttt agctagcctt
1321 gcatcgcatt gaacaaaact tgaaccgatt
tagcaaaacg tggcatcggt caattcattc
1381 atttgactta tacttgcctg ggcaatatta
tcccctgcaa ctaattactt gccagggcaa
1441 ctaatgtgaa aagtaccagc gatctgttca
atgaaattat tccattgggt cgcttaatcc
1501 atatggttaa tcagaagaaa gatcgcctgc
ttaacgagta tctgtctccg ctggatatta
1561 ccgcggcaca gtttaaggtg ctctgctcta
tccgctgcgc ggcgtgtatt actccggttg
1621 aactgaaaaa ggtattgtcg gtcgacctgg
gagcactgac ccgtatgctg gatcgcctgg
1681 tctgtaaagg ctgggtggaa aggttgccga
acccgaatga caagcgcggc gtactggtaa
1741 aacttaccac cggcggcgcg gcaatatgtg
aacaatgcca tcaattagtt ggccaggacc
1801 tgcaccaaga attaacaaaa aacctgacgg
cggacgaagt ggcaacactt gagtatttgc
1861 ttaagaaagt cctgccgtaa acaaaaaaga
ggtatgacga tgtccagacg caatactgac
1921 gctattacca ttcatagcat tttggactgg
atcgaggaca acctggaatc gccactgtca
1981 ctggagaaag tgtcagagcg ttcgggttac
tccaaatggc acctgcaacg gatgtttaaa
2041 aaagaaaccg gtcattcatt aggccaatac
atccgcagcc gtaagatgac ggaaatcgcg
2101 caaaagctga aggaaagtaa cgagccgata
ctctatctgg cagaacgata tggcttcgag
2161 tcgcaacaaa ctctgacccg aaccttcaaa
aattactttg atgttccgcc gcataaatac
2221 cggatgacca atatgcaggg cgaatcgcgc
tttttacatc cattaaatca ttacaacagc
2281 tagttgaaaa cgtgacaacg tcactgaggc
aatcatgaaa ccactttcat ccgcaatagc
2341 agctgcgctt attctctttt ccgcgcaggg
cgttgcggaa caaaccacgc agccagttgt
2401 tacttcttgt gccaatgtcg tggttgttcc
cccatcgcag gaacacccac cgtttgattt
2461 aaatcacatg ggtactggca gtgataagtc
ggatgcgctc ggcgtgccct attataatca
2521 acacgctatg tagtttgttc tggccccgac
atctcggggc ttattaactt cccaccttta
2581 ccgctttacg ccaccgcaag ccaaatacat
tgatatacag cccggtcata atgagcaccg
2641 cacctaaaaa ttgcagaccc gttaagcgtt
catccaacaa tagtgccgca cttgccagtc
2701 ctactacggg caccagtaac gataacggtg
caacccgcca ggtttcatag cgtcccagta
2761 acgtccccca gatcccataa ccaacaattg
tcgccacaaa cgccagatac atcagagaca
2821 agatggtggt catatcgata gtaaccagac
tgtgaatcat ggttgcggaa ccatcgagaa
2881 tcagcgaggc aacaaagaag ggaatgattg
ggattaaagc gctccagatt accagcgaca
2941 tcaccgccgg acgcgttgag tgcgacatga
tctttttatt gaagatgttg ccacacgccc
3001 aactaaatgc tgccgccagg gtcaacataa
agccgagcat cgccacatgc tgaccgttca
3061 gactatcttc gattaacacc agtacgccaa
aaatcgctaa ggcgatcccc gccaattgtt
3121 tgccatgcag tcgctccccg aaagtaaacg
cgccaagcat gatagtaaaa aacgcctgtg
3181 cctgtaacac cagcgaagcc agtccagcag
gcataccgaa gttaatggca caaaaaagaa
3241 aagcaaactg cgcaaaactg atggttaatc
cataccccag cagcaaattc agtggtactt
3301 tcggtcgtgc gacaaaaaag atagccggaa
aagcgaccag cataaagcgc aaaccggcca
3361 gcatcagcgt ggcatgttat gaagccccac
tttgatgacc acaaaattta gcccccatac
3421 gaccactacc agtagcgcca acaccccatc
ttttcgcgac attctaccgc ctctgaattt
3481 catcttttgt aagcaatcaa cttagctgaa
tttacttttc tttaacagtt gattcgttag
3541 tcgccggtta cgacggcatt aatgcgcaaa
taagtcgcta tacttcggat ttttgccatg
3601 ctatttcttt acatctctaa aacaaaacat
aacgaaacgc actgccggac agacaaatga
3661 acttatccct acgacgctct accagcgccc
ttcttgcctc gtcgttgtta ttaaccatcg
3721 gacgcggcgc taccgtgcca tttatgacca
tttacttgag tcgccagtac agcctgagtg
3781 tcgatctaat cggttatgcg atgacaattg
cgctcactat tggcgtcgtt tttagcctcg
3841 gttttggtat cctggcggat aagttcgaca
agaaacgcta tatgttactg gcaattaccg
3901 ccttcgccag cggttttatt gccattactt
tagtgaataa cgtgacgctg gttgtgctct
3961 tttttgccct cattaactgc gcctattctg
tttttgctac cgtgctgaaa gcctggtttg
4021 ccgacaatct ttcgtccacc agcaaaacga
aaatcttctc aatcaactac accatgctaa
4081 acattggctg accatcggtc cgccgctcgg
cacgctgttg gtaatgcaga gcatcaatct
4141 gcccttctgg ctggcagcta tctgttccgc
gtttcccatg cttttcattc aaatttgggt
4201 aaagcgcagc gagaaaatca tcgccacgga
aacaggcagt gtctggtcgc cgaaagtttt
4261 attacaagat aaagcactgt tgtggtttac
ctgctctggt tttctggctt cttttgtaag
4321 cggcgcattt gcttcatgca tttcacaata
tgtgatggtg attgctgatg gggattttgc
4381 cgaaaaggtg gtcgcggttg ttcttccggt
gaatgctgcc atggtggtta cgttgcaata
4441 ttccgtgggc cgccgactta acccggctaa
catccgcgcg ctgatgacag caggcaccct
4501 ctgtttcgtc atcggtctgg tcggttttat
tttttccggc aacagcctgc tattgtgggg
4561 tatgtcagct gcggtattta ctgtcggtga
aatcatttat gcgccgggcg agtatatgtt
4621 gattgaccat attgcgccgc cagaaatgaa
agccagctat ttttccgccc agtctttagg
4681 ctggcttggt gccgcgatta acccattagt
gagtggcgta gtgctaacca gcctgccgcc
4741 ttcctcgctg tttgtcatct tagcgttggt
gatcattgct gcgtgggtgc tgatgttaaa
4801 agggattcga gcaagaccgt gggggcagcc
cgcgctttgt tgatttaagt cgaacacaat
4861 aaagatttaa ttcagccttc gtttaggtta
cctctgctaa tatctttctc attgagatga
4921 aaattaaggt aagcgaggaa acacaccaca
ccataaacgg aggcaaataa tgctgggtaa
4981 tatgaatgtt tttatggccg tactgggaat
aattttattt tctggttttc tggccgcgta
5041 tttcagccac aaatgggatg actaatgaac
ggagataatc cctcacctaa ccggcccctt
5101 gttacagttg tgtacaaggg gcctgatttt
tatgacggcg aaaaaaaacc gccagtaaac
5161 cggcggtgaa tgcttgcatg gatagatttg
tgttttgctt ttacgctaac aggcattttc
5221 ctgcactgat aacgaatcgt tgacacagta
gcatcagttt tctcaatgaa tgttaaacgg
5281 agcttaaact cggttaatca cattttgttc
gtcaataaac atgcagcgat ttcttccggt
5341 ttgcttaccc tcatacattg cccggtccgc
tcttccaatg accacatcca gaggctcttc
5401 aggaaatgcg cgactcacac ctgctgtcac
ggtaatgttg atatgccctt cagaatgtgt
5461 gatggcatgg ttatcgacta actggcaaat
tctgacacct gcacgacatg cttcttcatc
5521 attagccgct ttgacaataa tgataaattc
ttcgcccccg tagcgataaa ccgtttcgta
5581 atcacgcgtc caactggcta agtaagttgc
cagggtgcgt aatactacat cgccgattaa
5641 atgcccgtag tatcattaac caatttaaat
cggtcaatat ccaacaacat taaataaaga
5701 ttcagaggct cagcgttgcg taactgatga
tcaaaggatt catcaagaac ccgacgaccc
5761 ggcaatcccg tcaaaacatc catattgcta
cggatcgtca gcaaataaat tttgtaatcg
5821 gttaatgccg cagtaaaaga aagcaacccc
tcctgaaagg cgtcgaaatg cgcgtcctgc
5881 cagtgatttt caacaatagc cagcattaat
tcccgaccac agttatgcat atgttgatgg
5941 gcagaatcca ttagccgaac gtaaggtaat
tcatcgttat cgagtggccc cagatgatca
6001 atccaccgac caaactggca cagtccataa
gaatggttat ccgttatttc tggcttactg
6061 gcatctctcg cgaccacgct gtgaaacata
ctcaccagcc actggtagtg ggcatcgata
6121 gccttattga gatttaacaa gatggcatca
atttccgttg tcttcttgat cattgccact
6181 cctttttcac agttccttgt gcgcgctatt
ctaacgagag aaaagcaaaa ttacgtcaat
6241 attttcatag aaatccgaag ttatgagtca
tctctgagat aacattgtga tttaaaacaa
6301 aatcagcgga taaaaaagtg tttaattctg
taaattacct ctgcattatc gtaaataaaa
6361 ggatgacaaa tagcataacc caatacccta
atggcccagt agttcaggcc atcaggctaa
6421 tttattttta tttctgcaaa tgagtgaccc
gaacgacggc cggcgcgctt ttcttatcca
6481 gactgccact aatgttgatc atctggtccg
gctgaacttc tcgtccatca aagacggccg
6541 caggaataac gacattaatt tcaccgctct
tatcgcgaaa aacgtaacgg tcctctcctt
6601 tgtgagaaat caaattaccg cgtagtgaaa
ccgaagcgcc atcgtgcatg gtttttgcga
6661 aatcaacggt catttttttt gcatcatcgg
ttccgcgata gccatcttct attgcatgag
6721 gcggcggtgg cgctgcatcc tgttttaaac
cgccctggtc atctgccaac gcataaggca
6781 tgacaagaaa acttgctaat acaatggcct
gaaatttcat actaactcct taattgcgtt
6841 tggtttgact tattaagtct ggttgctatt
tttataattg ccaaataaga atattgccaa
6901 ttgttataag gcatttaaaa tcagccaact
agctgtcaaa tatacagaga atttaactca
6961 ctaaagttaa gaagattgaa aagtcttaaa
catattttca gaataatcgg atttatatgt
7021 ttgaaaatta ttatattgga cgagcataca
gaaaaagcaa atcaccttta catataaaag
7081 cgtggacaaa aaacagtgaa cattaataga
gataaaattg tacaacttgt agataccgat
7141 actattgaaa acctgacatc cgcgttgagt
caaagactta tcgcggatca attacgctta
7201 actaccgccg aatcatgcac cggcggtaag
ttggctagcg ccctgtgtgc agctgaagat
7261 acacccaaat tttacggtgc aggctttgtt
actttcaccg atcaggcaaa gatgaaaatc
7321 ctcagcgtaa gccagcaatc tcttgaacga
tattctgcgg tgagtgagaa agtggcagca
7381 gaaatggcaa ccggtgccat agagcgtgcg
gatgctgatg tcagtattgc cattaccggc
7441 tacggcggac cggagggcgg tgaagatggt
acgccagcgg gtaccgtctg gtttgcgtgg
7501 catattaaag gccagaacta cactgcggtt
atgcattttg ctggcgactg cgaaacggta
7561 ttagctttag cggtgaggtt tgccctcgcc
cagctgctgc aattactgct ataaccaggc
7621 tggcctggcg atatctcagg ccagccattg
gtggtgttta tatgttcaag ccacgatgtt
7681 gcagcatcgg cataatctta ggtgccttac
cgcgccattg tcgatacagg cgttccagat
7741 cttcgctgtt acctctggaa aggatcgcct
cgcgaaaacg cagcccattt tcacgcgtta
7801 atccgccctg ctcaacaaac cactgataac
catcatcggc caacatttgc gtccacagat
7861 aagcgtaata acctgcaggc actaacgtca
aaaattctag gagggcatga acggcatgta
7921 ggatgcccca tttggtaaag aaatcatgca
gcaatcttac ttacggcaag aggagtttgc
7981 aggggcaatg actgaagtac tgcggaattc
gccttatcgc tgtgatatat catcacttaa
8041 tagccccgat gcagagtggc gaaaagtgat
ttttaaggcc attgatgaat ccttatcgaa
8101 aaaaatgggg cgaactcagc caactcagta
taggtttctt ttcggccaat ctccaacagt
8161 atttatgaat gggttatctg ctgcaacaaa
tggctcccct gactttgttg cttttaaatc
8221 agagttaatt cagctaatta aggagcgagg
acaatattgg gaaaaaatgc ctgaaatttg
8281 gctaggccgt ttctttagaa tagaagaagg
gttggctaca tcatttatga gaaacgtgta
8341 tcctgatttc ccaccaatca acgatacaag
aatgacatgg aatcacacaa aaataatggc
8401 ctcagatggt actgaagctc ttgttggtgg
acataacatg aacatggatc tatttagaaa
8461 ttatccacct gttcatgatg tatcaattat
cactcatggt tcttctgctt atggctccca
8521 gctatatctt aacgaactat ggtcatgtaa
ttcagattta ctaaaaaaag aatattttga
8581 ttatgaaagc atgatgtggg cggtcggaac
aaagttctat gataagcctg aagatccgct
8641 taaaagctca gttgctatga attatatgaa
gcaacggcaa gaggacctac tcaacttgca
8701 tgaaaacttt aatcagaagg tagcgactcg
tattagtgaa tacgaaaaca tggaagagta
8761 taaaaaagca gacagagttt tatcagtagg
taaatattgg acaggaccta atatggagca
8821 tgactaccaa agagggtctg aaataatgaa
agagcaactg ataaaaaatg ctaagcgcat
8881 aattagaatt tcacagcaag atctcgtgag
tgcttggaaa aaaaaatgga aagaccactt
8941 tacgtgtaat tggattattg aggctttgtt
agaaaataaa gatcttcata ttcatgttgt
9001 agtctctgct ctagatgcag cagctggagc
tggtgatcag tactcatttg gttctggagc
9061 agaacggacc tatgaattat ttaagtatta
cctaacccat gatattgata ccgatgaagt
9121 attagacgat cctgatggta gccgtgctga
tgccttaaaa agaatattga ttgcaccatt
9181 cttctttaca gataaagtac ctgatgaaaa
tacaattgaa ggcgaaacct acaagtggcc
9241 tgatttagaa caaagcgctt atactgcaac
acttaagcaa aaaccacttt cggaaaaacc
9301 cccgcatcaa ggtattattg gtagtgcact
aatgtcagca attaaaggta gtggactttt
9361 ctatcctaaa gtccctgttg cacctggtaa
tcacgccaaa ttaatgatta ttgacgatga
9421 gttgtacgtt gttggatcag ataatcttta
tcccggttat ctgtcagagt ttgactattt
9481 agtcgaagga aaagatgcag ttaatgaatt
aatgaaatct tactgggaac cattgtggaa
9541 atattctagc ccacatgcat ttccaaaatt
aagcccaaat tgatgactag aatacactag
9601 taagtaataa tcagtcataa atcacataaa
ccatcatgct aataatcatg atggttatca
9661 gcgcccaata cgcaaaccgc ctctccccgc
gcgttggccg attcattaat gcagctggca
9721 cgacaggttt cccgactgga aagcgggcag
tgagcgcaac gcaattaatg tgagttagct
9781 cactcattag gcaccccagg ctttacactt
tatgcttccg gctcgtatgt tgtgtggaat
9841 tgtgagcgga taacaatttc acacaggaaa
cagctatgac catgattacg aattcccggg
9901 gatccgtcga cctgcagcca agcttggcac
tggccgtcgt tttacaacgt cgtgactggg
9961 aaaaccctgg cgttacccaa cttaatcgcc
ttgcagcaca tccccctttc gccagctggc
10021 gtaatagcga agaggcccgc accgatcgcc cttcccaaca
gttgcgcagc ctgaatggcg
10081 aatggcgcct gatgcggtat tttctcctta cgcatctgtg
cggtatttca caccgcatat
10141 ggtgcactct cagtacaatc tgctctgatg ccgcatagtt
aagccagccc cgacacccgc
10201 caacacccgc tgacgcgccc tgacgggctt gtctgctccc
ggcatccgct tacagacaag
10261 ctgtgaccgt ctccgggagc tgcatgtgtc agaggttttc
accgtcatca ccgaaacgcg
10321 cgagacgaaa gggcctcgtg atacgcctat ttttataggt
taatgtcatg ataataatgg
10381 tttcttagac gtcaggtggc acttttcggg gaaatgtgcg
cggaacccct atttgtttat
10441 ttttctaaat acattcaaat atgtatccgc tcatgagaca
ataaccctga taaatgcttc
10501 aataatattg aaaaaggaag agtatgagta ttcaacattt
ccgtgtcgcc cttattccct
10561 tttttgcggc attttgcctt cctgtttttg ctcacccaga
aacgctggtg aaagtaaaag
10621 atgctgaaga tcagttgggt gcacgagtgg gttacatcga
actggatctc aacagcggta
10681 agatccttga gagttttcgc cccgaagaac gttttccaat
gatgagcact tttaaagttc
10721 tgctatgtgg cgcggtatta tcccgtattg acgccgggca
agagcaactc ggtcgccgca
10781 tacactattc tcagaatgac ttggttgagt actcaccagt
cacagaaaag catcttacgg
10841 atggcatgac agtaagagaa ttatgcagtg ctgccataac
catgagtgat aacactgcgg
10901 ccaacttact tctgacaacg atcggaggac cgaaggagct
aaccgctttt ttgcacaaca
10961 tgggggatca tgtaactcgc cttgatcgtt gggaaccgga
gctgaatgaa gccataccaa
11001 acgacgagcg tgacaccacg atgcctgtag caatggcaac
aacgttgcgc aaactattaa
11061 ctggcgaact acttactcta gcttcccggc aacaattaat
agactggatg gaggcggata
11121 aagttgcagg accacttctg cgctcggccc ttccggctgg
ctggtttatt gctgataaat
11181 ctggagccgg tgagcgtggg tctcgcggta tcattgcagc
actggggcca gatggtaagc
11241 cctcccgtat cgtagttatc tacacgacgg ggagtcaggc
aactatggat gaacgaaata
11321 gacagatcgc tgagataggt gcctcactga ttaagcattg
gtaactgtca gaccaagttt
11381 actcatatat actttagatt gatttaaaac ttcattttta
atttaaaagg atctaggtga
11421 agatcctttt tgataatctc atgaccaaaa tcccttaacg
tgagttttcg ttccactgag
11481 cgtcagaccc cgtagaaaag atcaaaggat cttcttgaga
tccttttttt ctgcgcgtaa
11541 tctgctgctt gcaaacaaaa aaaccaccgc taccagcggt
ggtttgtttg ccggatcaag
11601 agctaccaac tctttttccg aaggtaactg gcttcagcag
agcgcagata ccaaatactg
11661 tccttctagt gtagccgtag ttaggccacc acttcaagaa
ctctgtagca ccgcctacat
11721 acctcgctct gctaatcctg ttaccagtgg ctgctgccag
tggcgataag tcgtgtctta
11781 ccgggttgga ctcaagacga tagttaccgg ataaggcgca
gcggtcgggc tgaacggggg
11841 gttcgtgcac acagcccagc ttggagcgaa cgacctacac
cgaactgaga tacctacagc
11901 gtgagctatg agaaagcgcc acgcttcccg aagggagaaa
ggcggacagg tatccggtaa
11961 gcggcagggt cggaacagga gagcgcacga gggagcttcc
agggggaaac gcctggtatc
12021 tttatagtcc tgtcgggttt cgccacctct gacttgagcg
tcgatttttg tgatgctcgt
12081 caggggggcg gagcctatgg aaaaacgcca gcaacgcggc
ctttttacgg ttcctggcct
12141 tttgctggcc ttttgctcac atgttctttc ctgcgttatc
ccctgattct gtggataacc
12201 gtattaccgc ctttgagtga gctgataccg ctcgccgcag
ccgaacgacc gagcgcagcg
12261 agtcagtgag cgaggaagcg gaaga
What
does this sequence tell you, in terms of possible mechanisms of how this
bacteria might have come about?? Do you think it is reasonable that
this is just an "accident" of nature, or do you think it is likely that
this was engineered by a person using standard recombinant DNA methodologies?


To use this, all you have to do is paste in
your sequence, and then click on the particular line of interest.
In this case, the RED lines are the best matches. Notice that you
have an insertion (hint, hint) into the middle of the gene. The red
line on the next line is the "insert" (likely a transposable element of
some sort). Please try this out and hopefully you can get it to work....


Last modified on: 19 February, 2000 by Dave Ussery