72 2W8X.A mol:aa MEMBRANE PROTEIN FNCNKREGPCSQRSLCECDPNLQLGRHSDQLWHYNLRTNRCERGGYRDNCNSHSSSGACVMACERIHHHHHH ...TTTTT..TTTT....TTTT............TTTTT......TTTTTTT.................... 135 2W9Y.A mol:aa LIPID TRANSPORT GAMSVASLPEVKNFFPTEQLEFSSSITADEKPVLHEVFQKHSCGEMIDEVSKKHPELGKRLATVLEGNKKRLDGLSPAAV EYAKKLIHMVTTTLCSLTVGKPIDDADAKRLHQEFQSLSSEDQAALRKNNPDIKF TTTT......................TTTTT...........TTTT.........................TTTT..... .................................................TTTT.. 141 2WCJ.A mol:aa TRANSPORT PROTEIN TAEVMSHVTAHFGKTLEECREESGLSVDILDEFKHFWSDDFDVVHRELGCAIICMSNKFSLMDDDVRMHHVNMDEYIKSF PNGQVLAEKMVKLIHNCEKQFDTETDDCTRVVKVAACFKEDSRKEGIAPEVAMVEAVIEKY .........................TTTTTTTT....TTTT.....................TTTT.............T TTT.................TTTT................................TTTT. 148 2WCR.A mol:aa IMMUNE SYSTEM NSFLQDVPYWMLQNRSEYITQGVDSSHIVDGKKTEEIEKIATKRATIRVAQNIVHKLKEAYLSKTNRIKQKITNEMFIQM TQPIYDSLMNVDRLGIYINPNNEEVFALVRARGFDKDALSEGLHKMSLDNQAVSILVAKVEEIFKDSV ...TTTT..........TTTT.......TTTT..............................TTTT.............. ..................TTTTT............................................. 120 2WFB.A mol:aa BIOSYNTHETIC PROTEIN ASHMQRIAVTAEGPGLDGLVDPRFGRAAGFVVVDAATMAAEYVDNGASQTLSHGAGINAAQVLAKSGAGVLLTGYVGPKA FQALQAAGIKVGQDLEGLTVRQAVQRFLDGQVPMAAGPNK ..............TTTT....TTTTT......TTTTT............TTTT.......................... ..............TTTT...................... 153 2WFO.A mol:aa VIRAL PROTEIN ELPSLCMLNNSFYYMKGGANIFLIRVSDVSVLMKEYDVSVYEPEDLGNCLNKSDSSWAIHWFSIALGHDWLMDPPMLCRN KTKKEGSNIQFNISKADESRVYGKKIRNGMRHLFRGFYDPCEEGKVCYVTINQCGDPSSFEYCGTNYLSKCQF ........TTTT....TTTT............TTTTT...............................TTTTT......T TTTTTT.........TTTTT.............TTTT....TTTT.......TTTT..TTTTTT......... 96 2WJ5.A mol:aa CHAPERONE GAMAQVPTDPGYFSVLLDVKHFSPEEISVKVVGDHVEVHARHEERPDEHGFIAREFHRRYRLPPGVDPAAVTSALSPEGV LSIQATPASAQASLPS ..................TTTT........TTTTT.........TTTTTT............TTTTTTTTT....TTTT. ................ 138 2WJ9.A mol:aa HYDROLASE INHIBITOR TSSACAPETGLQQLVATIVPDEQRISFWPQHFGLIPQWVTLEPRVFGWMDRLCCIWNLYTLNNGGAFMAPEETWVLFNAM NGNRAEMSPEAAGIAACLMTYSHHACRTECYAMTVHYYRLRDYALQHPECSAIMRIID ...............................TTTTTTT......................TTTT.............TTT TT............................................TTTT........ 204 2WJR.A mol:aa TRANSPORT PROTEIN ALDVRGGYRSGSHAYETRLKVSEGWQNGWWASMESNTWNTINDVQVEVNYAIKLDDQWTVRPGMLTHFSSNGTRYGPYVK LSWDATKDLNFGIRYRYDWKAYRQQDLSGDMSRDNVHRWDGYVTYHINSDFTFAWQTTLYSKQNDYRYANHKKWATENAF VLQYHMTPDITPYIEYDYLDRQGVYNGRDNLSENSYRIGVSFKL ........TTTTT...........TTTT..........................TTTT..........TTTT........ ...TTTTTT................TTTT..................TTTT................TTTT......... ....TTTTTT........TTTT.TTTTT................ 191 2WL1.A mol:aa SIGNALING PROTEIN NVPELIGAQAHAVNVILDAETAYPNLIFSDDLKSVRLGNKWERLPDGPQRFDSCIIVLGSPSFLSGRRYWEVEVGDKTAW ILGACKTSISRKGNMTLSPENGYWVVIMMKENEYQASSVPPTRLLIKEPPKRVGIFVDYRVGSISFYNVTARSHIYTFAS CSFSGPLQPIFSPGTRDGGKNTAPLTICPVG .................TTTTTTTTT..TTTT......TTTT....TTTTTTTT...................TTTT... .....TTTTTTTT................TTTT....TTTT....TTTTTTTT....TTTTT.....TTTTT.....TTT T.................TTTT......... 269 2WNF.A mol:aa TRANSFERASE PCTCTRCIEEQRVSAWFDERFNRSQPLLTAKNAHLEEDTYKWWLRLQREKQPNNLNDTIRELFQVVPGNVDPLLEKRLVS CRRCAVVGNSGNLKESYYGPQIDSHDFVLRNKAPTEGFEADVGSKTTHHFVYPESFRELAQEVSILVPFKTTDLEWVISA TTTGRISHTYVPVPAKIKVKKEKILIYHPAFIKYVFDRWLQGHGRYPSTGILSVIFSLHICDEVDLYGFGADSKGNWHHY WEGVHDGDFESNVTTILASINKIRIFKGR ...TTTTTTTTTTTT.............TTTTT..........TTTTTTTTT..............TTTTTTTTTT..TT TT..........TTTT........TTTT......TTTTT............TTTTT...TTTT................T TTT....TTTTT.TTTT......................TTTTTTTT.............TTTT.TTTT..TTTT...TT TT..................TTTT..... 48 2WPV.B mol:aa PROTEIN BINDING GPEHEFVSKFLTLATLTEPKLPKSYTKPLKDVTNLGVPLPTLKYKYKQ ..............TTTTTTTTTTT....................... 189 2WQF.A mol:aa OXIDOREDUCTASE SFIKSLENRRTIYALGRNVQDEEKVIETIKEAVRFSPTAFNSQTGRLLILTGDAQDKLWDEIVAPELKAAMEAAKLDGFK AAFGTILFFEDQAVVKNLQEQFALYADNFPVWSEQGSGIISVNVWTALAELGLGANLQHYNPLIDEAVAKEWNLPESWKL RGQLVFGSIEAPAGEKTFMDDADRFIVAK ...........TTTT...TTTT.......................................................... ........................TTTT...............................TTTTTT.........TTTT.. ............................. 93 2WTP.A mol:aa METAL BINDING PROTEIN STATAQAMAKRHATLYGDPAGQSQASRIIDVKPGMRYVNVDSGETVAFRAGEKIVAWTFAQMVRDTSVDLGLLMPDLPGS AGVRVYIDRSDLF ...............................TTTT.....TTTT....TTTTT........TTTT........TTTTTTT TTT......TTTT 50 2WUJ.A mol:aa CELL CYCLE LTPNDIHNKTFTKSFRGYDEDEVNEFLAQVRKDYEIVLRKKTELEAKVNE .....TTTT....TTTT................................. 402 2WUU.A mol:aa TRANSFERASE AHAFWSTQPVPQTEDETEKIVFAGPMDEPKTVADIPEEPYPIASTFEWWTPNMEAADDIHAIYELLRDNYVESMFRFNYS EEFLQWALCPPSYIPDWHVAVRRKADKKLLAFIAGVPVTLRMGTPKYMKVKAQEKGQEEEAAKYDAPRHICEINFLCVHK QLREKRLAPILIKEVTRRVNRTNVWQAVYTAGVLLPTPYASGQYFHRSLNPEKLVEIRFSGIPAQYQKFQNPMAMLKRNY QLPNAPKNSGLREMKPSDVPQVRRILMNYLDNFDVGPVFSDAEISHYLLPRDGVVFTYVVENDKKVTDFFSFYRIPSTVI ILNAAYVHYYAATSMPLHQLILDLLIVAHSRGFDVCNMVEILDNRSFVEQLKFGAGDGHLRYYFYNWAYPKIKPSQVALV ML .TTTT.....................................TTTT.....TTTT......................... .........TTTT.........TTTTT..................................TTTT............... .TTTT................................TTTT......TTTT......TTTT....TTTTTTT........ ..TTTT.TTTT.....................TTTT..............TTTTT.....TTTTT............... ...........TTTT.........................TTTT..TTTTTT...........TTTT............. .. 213 2WUX.A mol:aa VIRAL PROTEIN DYSYRPTIGRTYVYDNKYYKNLDAVIKNAPLDNYLVAEDPFLGPGKNQKLTLFKEIRNVKPDTMKLVVGWKGKEFYRETW TRFMEDSFPIVNDQEVMDVFLVVNMRPTRPNRCYKFLAQHALRCDPDYVPHDVIRIVEPSWVGSNNEYRISLAKYTNSFE QFIDRVIWENFYKPIVYIGTDSAEEEEILLEVSLVFKVKEFAPDAPLFTGPAY TTTT........TTTTT.....................TTTTT................TTTT................. ......TTTT..................TTTT..................TTTT.TTTT...TTTT.............. ................................................TTTT. 709 2WVX.A mol:aa HYDROLASE KDWTQYVNPLGSQSTFELSTGNTYPAIARPWGNFWTPQTGKGDGWQYTYTANKIRGFKQTHQPSPWINDYGQFSIPIVGQ PVFDEEKRASWFAHKGEVATPYYYKVYLAEHDIVTETPTERAVLFRFTFPENDHSYVVVDAFDKGSYIKIIPEENKIIGY TTRNSGGVPENFKNYFIIEFDKPFTYKATVENGNLQENVAEQTTDHAGAIIGFKTRKGEQVNARIASSFISFEQAAANNE LGKDNIEQLAQKGKDAWNQVLGKIEVEGGNLDQYRTFYSCLYRSLLFPRKFYELDANGQPIHYSPYNGQVLPGYFTDTGF WDTFRCLFPLLNLYPSVNKEQEGLINTYLESGFFPEWASPGHRGCVGNNSASILVDAYKGVKVDDIKTLYEGLIHGTENV HPEVSSTGRLGYEYYNKLGYVPYDVKINENAARTLEYAYDDWCIYRLAKELKRPKKEISLFAKRANYKNLFDKESKLRGR NEDGTFQSPFSPLKWGDAFTEGNSWHYTWSVFHDPQGLIDLGGKEFVTDSVFAVPPIFDDSYYGQVIHEIRETVNGNYAH GNQPIQHIYLYDYAGQPWKAQYWLRQVDRYTPGPDGYCGDEDNGQTSAWYVFSALGFYPVCPGTDEYVGTPLFKKATLHF ENGNSLVIDAPNNSTENFYIDSSFNGADHTKNYLRHEDLFKGGTIKVDSNRPNLNRGTKEEDPYSFSKE ..........TTTTTTTTTTTT......TTTT..........TTTTTTTTT............TTTTT............ ...TTTTTT..........TTTT....TTTTT......TTTT....................TTTT.............. ........TTTT.......TTTTTTTT..TTTTT.TTTT....TTTT........TTTT........TTTT.......TT TTTT.........................................TTTT.....TTTT.....TTTTT............ ..TTTTT...........................TTTTTTTT....................TTTT.............. TTTTTTTTTTTT.........TTTT.TTTT.........................................TTTTT.... TTTT..TTTTTTTT.TTTTTTT....TTTTTTTTT........................TTTTT..............TT TT..............................TTTT....TTTTT.............TTTTTT........TTTT...T TTT......TTTTTTTTT....TTTTT.........................TTTTTTTTTT..TTTTT 137 2WY3.B mol:aa IMMUNE SYSTEM/VIRAL PROTEIN VDLGSKSSNSTCRLNVTELASIHPGETWTLHGMCISICYYENVTEDEIIGVAFTWQHNESVVDLWLYQNDTVIRNFSDIT TNILQDGLKMRTVPVTKLYTSRMVTNLTVGRYDCLRCENGTTKIIERLYVRLGSLYP ....TTTTT..............TTTT..TTTT......TTTTTTTT.......TTTTTTT.....TTTTTTTTTTTTTT TT.......TTTT...........TTTTT.......TTTTT............TTTT 146 2WZO.A mol:aa CELL CYCLE GRPVFPIGLGGLTVYSLGEIITDRPGFHDESAIYPVGYCSTRIYASMKCPDQKCLYTCQIKDGGVQPQFEIVPEDDPQNA IVSSSADACHAELLRTISTTMGKLMPNLLPAGADFFGFSHPAIHNLIQSCPGARKCINYQWVKFDV ........TTTT........TTTTT...TTTT..TTTT.......TTTTTTT..........TTTTT.....TTTTT... ........................TTTT........TTTT.........TTTT..TTTT....... 86 2ZXY.A mol:aa OXYGEN BINDING, TRANSPORT PROTEIN ADGKAIFQQKGCGSCHQANVDTVGPSLKKIAQAYAGKEDQLIKFLKGEAPAIVDPAKEAIMKPQLTMLKGLSDAELKALA DFILSH ................TTTTTTTTT........TTTT.............TTTTT......................... ...... 147 3A0Z.A mol:aa TRANSFERASE MEFTEFNLNELIREVYVLFEEKIRKMNIDFCFETDNEDLRVEADRTRIKQVLINLVQNAIEATGENGKIKITSEDMYTKV RVSVWNSGPPIPEELKEKIFSPFFTTGLGLSICRKIIEDEHGGKIWTENRENGVVFIFEIPKTPEKR .................................TTTTTT........................TTTT........TTTT. ..............TTTT..TTTT.........................TTTT.......TTTT... 79 3A4R.A mol:aa TRANSCRIPTION GPLGSQELRLRVQGKEKHQMLEISLSPDSPLKVLMSHYEEAMGLSGHKLSFFFDGTKLSGKELPADLGLESGDLIEVWG TTTT...........TTTT......TTTT..............TTTT....TTTTT..TTTT.......TTTT...... 154 3A57.A mol:aa TOXIN GSDEILFVVRDTTFNTNAPVNVEVSDFWTNRNVKRKPYKDVYGQSVFTTSGTKWLTSYMTVNINDKDYTMAAVSGYKHGH SAVFVKSDQVQLQHSYDSVASFVGEDEDSIPSKMYLDETPEYFVNVEAYESGSGNILVMCISNKESFFECKHQQ ...............TTTT.........TTTTTTTT.TTTTTTT.......TTTT......TTTTT.........TTTTT ............................TTTT......TTTT........TTTT........TTTTT....... 103 3A5P.A mol:aa SUGAR BINDING PROTEIN VWSVQIVDNAGLGANLALYPSGNSSTVPRYVTVTGYAPITFSEIGPKTVHQSWYITVHNGDDRAFQLGYEGGGVATATFT AGGNVSISTGFGDAQHLTLKKLA .......TTTT......TTTT...........TTTT.....................TTTTT.......TTTT......T TTTT................... 353 3AAP.A mol:aa HYDROLASE KHSCIAVIDAGSTGSRLHIYSYDTDDTNTPIHIEEIWNKKIKPGFASIQPNSVTIDAYLTMLLADAPIHNIPVYFYATAG MRLLPQSQQKKYYDELEYWFRQQSQWQLVEAKTITGNDEALFDWLAVNYKLDTLKSVQNKSVGVMDMGGASVQIVFPMPK NAEISKHNQVELNIYGQNINLYVHSFLGLGQTEMSHQFLNSPSCFANDYPLPDGESGQGNAPSCKEEVTSLMNSVHKVNQ QIQPLLALNPVNEWYSIGGISNLASSQLFHFENSELTNQSLLQQGDNQICHQQWDILNGQYPDDEYLYQYCLLSSYYYAL MVDGYGINPNQTIHYIPPEQNLDWTIGVVLHRA ..........TTTT..........TTTT..................................TTTT.............. ....................................................TTTT...........TTTT......... TTTT........TTTTT........TTTT........TTTT....TTTT.TTTT.......................... ..........TTTT...........TTTT.TTTTT.........................TTTTTTT............. .......TTTT...................... 507 3FOT.A mol:aa TRANSFERASE LPPLVPALYRWKSTGSSGRQVQRRCVGAEAIVGLEEKNRRALYDLYIATSLRNIAPASTLLTLQNLKEMFELALLDARFE HPECACTVSWDDEVPAIITYESPESNESARDWARGCIHVQPTAKSALDLWSEMEEGRAAANNTPSKSIELFLLSDVSTDS TPIPQDATVEILFHSNHLFWDGIGCRKFVGDLFRLVGSYIGRSDSREMKKIQWGQEIKNLSPPVVDSLKLDINTLGSEFD DKCTEYTSALVANYKSRGMKFQPGLALPRCVIHKLSADESIDIVKAVKTRLGPGFTISHLTQAAIVLALLDHLKLSDDEV FISPTSVDGRRWLREDIASNFYAMCQTAAVVRIENLKSITVSHKDEKELQVRALESACRNIKKSYRQWLENPFLQALGLR VHNFEASYLHAKPIPFEGEANPLFISDGINERFIPHEIKQTATGENVLSVESIDFVVNQSLPYLAIRLDSWRDASTLNII YNDANYTEAEVQKYLQSIVEFMLAFRL ..............TTTT......................TTTT...........TTTT..................... ......................TTTT.......................................TTTT........TTT T..TTTT...............................TTTT.........TTTTTT....................... ...................................................TTTT....................TTTT. .................TTTT....................TTTT.........................TTTT...... ...........TTTTTTTT....................TTTTT.........................TTTTT...... .TTTTT..................... 101 3FOV.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION SAEASAADYLERQGYRILARRFKTRCGEIDLVAQRDALVAFVEVKARAYAVTPRQQSRIVAAAEAWLSRHPEHASELRFD AILIAPNTAPRHLPGAFDATP .......................TTTT.......TTTT.......................................... ....TTTT......TTTT... 313 3FQG.A mol:aa PROTEIN BINDING MLREFSFYDVPPAHVPPVSEPLEIACYSLSRDRELLLDDSKLSYYYPPPLFSDLNTGFPNRFHPPKSDPDPISIVKDVLM TKGIQMNSSFLTWRGLITKIMCAPLDPRNHWETYLVMDPTSGIIMMEERTNQDRMCYWGYKFEAISTLPEIWDAQDVVPD EQYCSIVKINIGKSKLILAGEVDCIWDKKPCENPNLHYVELKTSKKYPLENYGMRKKLLKYWAQSFLLGIGRIIIGFRDD NGILIEMKELFTHQIPKMLRPYFKPNDWTPNRLLVVLEHALEWIKQTVKQHPPSTEFTLSYTGGSKLVLRQII .............................TTTT.....TTTT......TTTTTTTTTTTTTT.................. .....................TTTTTTTT........TTTTT...........................TTTTT...TTT TT........TTTT.................................TTTT...........................TT TT.....................TTTT........................TTTT.................. 186 3FRR.A mol:aa PROTEIN BINDING LGSGFKAERLRVNLRLVINRLKLLEKKKTELAQKARKEIADYLAAGKDERARIRVEHIIREDYLVEAMEILELYCDLLLA RFGLIQSMKELDSGLAESVSTLIWAAPRLQSEVAELKIVADQLCAKYSKEYGKLCRTNQIGTVNDRLMHKLSVEAPPKIL VERYLIEIAKNYNVPYEPDSVVMAEA ................................................................................ ............................TTTTTT.......................TTTT................... .......................... 109 3FSO.A mol:aa CELL ADHESION MRDVVSFEQPEFSVSRGDQVARIPVIRRVLDGGKSQVSYRTQDGTAQGNRDYIPVEGELLFQPGEAWKELQVKLLELRQV RRFHVQLSNPKFGAHLGQPHSTTIIIRDP TTTT......................................TTTTTTTTTT.........TTTT............... ..........TTTT..TTTTTT....... 218 3FSS.A mol:aa CHAPERONE TNTIFKLEGVSVLSPLRKKLDLVFYLSNVDGSPVITLLKGNDRELSIYQLNKNIKASFLPVPEKPNLIYLFTYTSCEDNK FSEPVVTLNKENTLNQFKKLGLLDSNVTDFEKCVEYIRKQAILTGFKISNPFVKINSFHLQCHRGTKEGTLYFLPDHIIF GFKKPILLLDASDIESITYSSITRLTFNASLVTKDGEKYEFSIDQTEYAKIDDYVKRK ............TTTTT.........TTTTT......TTTTT....TTTT..........TTTTTTT............. .......................TTTT......................TTTT.........TTTTT......TTTT... ..TTTT................TTTT......TTTT...................... 291 3FWK.A mol:aa TRANSFERASE GAMVMRLGDAAELCYNLTSSYLQIAAESDSIIAQTQRAINTTKSILINETFPKWSPLNGEISFSYNGGKDCQVLLLLYLS CLWEYYIVKLPTVFIDHDDTFKTLENFIEETSLRYSLSLYESDRDKCETMAEAFETFLQVFPETKAIVIGIRHTDPFGEH LKPIQKTDANWPDFYRLQPLLHWNLANIWSFLLYSNEPICELYRYGFTSLGNVEETLPNPHLRKDKNSTPLKLNFEWEIE NRYKHNEVTKAEPIPIADEDLVKIENLHEDYYPGWYLVDDKLERAGRIKKK ..........................TTTT...................TTTTTTTTTTTTT.................. ................TTTT......................TTTT..............TTTT.......TTTTTTTTT TTTTT..TTTT......TTTTTT.......................TTTT.TTTTT........TTTT....TTTT.... .....TTTTT...............................TTTT...... 137 3FWZ.A mol:aa MEMBRANE PROTEIN SNAVDICNHALLVGYGRVGSLLGEKLLASDIPLVVIETSRTRVDELRERGVRAVLGNAANEEIQLAHLECAKWLILTIPN GYEAGEIVASARAKNPDIEIIARAHYDDEVAYITERGANQVVGEREIARTLELLETP .....TTTT...............................................TTTTTTTTTTT............. ..............TTTT...................TTTT................ 84 3FX7.A mol:aa UNKNOWN FUNCTION QMDTEEVREFVGHLERFKELLREEVNSLSNHFHNLESWRDARRDKFSEVLDNLKSTFNEFDEAAQEQIAWLKERIRVLEE DYLE ..................................TTTT.......................................... .... 93 3FYB.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION GADIDQASKTEEAAAFRHLLRHLDEHKDVQNIDLIQADFCRNCLAKWLEAATEQGVELDYDGAREYVYGPFAEWKTLYQK PASEAQLAAFEAK .......................TTTTTTTTTTTTTTT.......................................... ............. 82 3FYM.A mol:aa DNA BINDING PROTEIN KTVGEALKGRRERLGMTLTELEQRTGIKREMLVHIENNEFDQLPNKNYSEGFIRKYASVVNIEPNQLIQAHQDEIPSNQA EW ..........................................TTTT........................TTTTT..... .. 387 3G02.A mol:aa HYDROLASE KAFAKFPSSASISPNPFTVSIPDEQLDDLKTLVRLSKIAPPTYESLQADGRFGITSEWLTTMREKWLSEFDWRPFEARLN SFPQFTTEIEGLTIHFAALFSEREDAVPIALLHGWPGSFVEFYPILQLFREEYTPETLPFHLVVPSLPGYTFSSGPPLDK DFGLMDNARVVDQLMKDLGFGSGYIIQGGDIGSFVGRLLGVGFDACKAVHLNFCNMSAPPEGPSIESLSAAEKEGIARME KFMTDGYAYAMEHSTRPSTIGHVLSSSPIALLAWIGEKYLQWVDKPLPSETILEMVSLYWLTESFPRAIHTYREWVPTTP YQKELYIHKPFGFSFFPKDLVPVPRSWIATTGNLVFFRDHAEGGHFAALERPRELKTDLTAFVEQVW TTTT..TTTT....................................TTTTTTT........................... .......TTTTT..........TTTT.......TTTT................TTTTT........TTTTTTT...TTTT ...................TTTT...................TTTT............TTTT.................. ..........................................TTTT............................TTTTTT TTTTTT.........TTTTTTT.....................TTTT.................... 138 3G0M.A mol:aa HYDROLASE MAALPDKEKLLRNFTRCANWEEKYLYIIELGQRLAELNPQDRNPQNTIHGCQSQVWIVMRRNANGIIELQGDSDAAIVKG LMAVVFILYHQMTAQDIVHFDVRPWFEKMALAQHLTPSRSQGLEAMIRAIRAKAATLS TTTT.........................................................TTTT............... ........TTTT.............................................. 87 3G1J.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION GVDRDYLQSEYGVLKAGQCYKVVRSFRDYRNINYERGDVRFLGSNFVPYESGLSLFFDKNGSERQILCVRPEFQEIAHHL DSYFCKL ..........TTTTTTTT...TTTT..TTTT...TTTT........TTTTT......TTTTT.......TTTTTTTTTTT ....... 77 3G21.A mol:aa VIRAL PROTEIN AGPWADIMQGPSESFVDFANRLIKAVEGSDLPPSARAPVIIDCFRQKSQPDIQQLIRTAPSTLTTPGEIIKYVLDRQ .........TTTT..............................................TTTT.............. 90 3G2B.A mol:aa BIOSYNTHETIC PROTEIN TISRDSCPALRAGVRLQHDRARDQWVLLAPERVVELDDIALVVAQRYDGTQSLAQIAQTLAAEFDADASEIETDVIELTT TLHQKRLLRL ..TTTT....TTTT....TTTTT........................TTTT............................. .......... 146 3G2S.A mol:aa PROTEIN TRANSPORT EPAMEPETLEARINRATNPLNKELDWASINGFCEQLNEDFEGPPLATRLLAHKIQSPQEWEAIQALTVLETCMKSCGKRF HDEVGKFRFLNELIKVVSPKYLGSRTSEKVKNKILELLYSWTVGLPEEVKIAEAYQMLKKQGIVKS .................TTTTTTT..............TTTT.............TTTT..................... .................TTTTT......................TTTT.................. 51 3G36.A mol:aa NUCLEAR PROTEIN VDLQSLPTRAYLDQTVVPILLQGAVLAKERPPNPIEFLASYLLKNKAQFED ............TTTTT.............TTTT................. 274 3G3T.A mol:aa BIOSYNTHETIC PROTEIN FVRQTTKYWVHPDNITELKLIILKHLPVLEREDSAITSIYFDNENLDLYYGRLRKDEGAEAHRLRWYGGMSTDTIFVERK THREDWTGEKSVKARFALKERHVNDFLKGKYTVDQVFAKMRKEGKKPMNEIENLEALASEIQYVMLKKKLRPVVRSFYNR TAFQLPGDARVRISLDTELTMVREDNFDGVDRTHKNWRRTDIGVDWPFKQLDDKDICRFPYAVLEVKLQTQLGQEPPEWV RELVGSHLVEPVPKFSKFIHGVATLLNDKVDSIP ..........................................TTTT.........TTTT.........TTTT........ ............................TTTT................................................ ....TTTT.................TTTT.TTTTTTTTTTTTTTTTTTTTT...................TTTT...... .....TTTT................TTTTTTTT. 383 3G5B.A mol:aa APOPTOSIS PGSSVSGTFGCLGGRLTIPGTGVSLLVPNGAIPQGKFYDLYLRINKTESTLPLSEGSQTVLSPSVTCGPTGLLLCRPVVL TVPHCAEVIAGDWIFQLKTQAHQGHWEEVVTLDEETLNTPCYCQLEAKSCHILLDQLGTYVFTGESYSRSAVKRLQLAIF APALCTSLEYSLRVYCLEDTPAALKEVLELERTLGGYLVEEPKPLLFKDSYHNLRLSLHDIPHAHWRSKLLAKYQEIPFY HVWNGSQKALHCTFTLERHSLASTEFTCKVCVRQVEGEGQIFQLHTTLTTQLGPYAFKIPLSIRQKICNSLDAPNSRGND WRLLAQKLSMDRYLNYFATKASPTGVILDLWEARQQDDGDLNSLASALEEMGKSEMLVAMTTD TTTTT....TTTT....TTTT......TTTTTTTTT..................TTTT...............TTTT... ......TTTT.........TTTTTT.....TTTTTTTTT......TTTT....TTTT..........TTTT......... .TTTT........................................................TTTTT.............. ......TTTT.........TTTTT.........TTTT..............TTTTTT.................TTTT.. ..........TTTT.....TTTT........................................ 149 3G7G.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION EMNYEEVFSITITVDKPILIGQDDIVGRRQLIPIISGKVSGNNFNGKVLPGGIDSQIVRPDGKCELSARYAIRLDDGAAI YIENNGIRTVPDEYIEAVKSGEFVDPNAYYFRTIPTFETYSPKYKWMMNHIFVCCASRENVLLKFYKIS ......................TTTTT.............TTTT..............TTTT...........TTTT... ..................................................................... 133 3GA3.A mol:aa HYDROLASE AKHYKNNPSLITFLCKNCSVLACSGEDIHVIEKMHHVNMTPEFKELYIVRENKALQKKCADYQINGEIICKCGQAWGTMM VHKGLDLPCLKIRNFVVVFKNNSTKKQYKKWVELPITFPNLDYSELEHHHHHH TTTT..........TTTTT..........TTTTTT................TTTTT..TTTT.......TTTT....... TTTTT.............TTTTT.............................. 156 3GA4.A mol:aa TRANSFERASE IDDILQLKDDTGVITVTADNYPLLSRGVPGYFNILYITMRGTNSNGMSCQLCHDFEKTYHAVADVIRSQAPQSLNLFFTV DVNEVPQLVKDLKLQNVPHLVVYPPAESNKQSQFEWKTSPFYQYSLVPENAENTLQFGDFLAKILNISITVPQAFN ........TTTT....TTTTT......TTTT...........TTTT.......................TTTT....... TTTTTT............................TTTTT..........TTTT...................TTTT 253 3GAE.A mol:aa NUCLEAR PROTEIN GMKVLPVKQYLIMENYNPDTIFNGIVKINSNEKTFDDEILAQIGGALHDIDESWELLLSFANTIRSNWEIKTPAYDIVRL IVKKLPYSSDIKDYIEEGLGNKNITLTMLTVRILVNCFNNENWGVKLLESNQVYKSIFETIDTEFSQASAKQSQNLAIAV STLIFNYSALVTKGNSDLELLPIVADAINTKYGPLEEYQECEEAAYRLTVAYGNLATVEPTLRQFANSVTWLANIKRSYG NVPRFKDIFDDLS .TTTTTT............................................................TTTT......... .................TTTTTTT...............TTTTT.........TTTT.......TTTT............ ................TTTT...........TTTT.............................TTTTT.........TT TT........... 159 3GBW.A mol:aa LIGASE SLEDYSVVNRFESHGGGWGYSAHSVEAIRFSADTDILLGGLGLFGGRGEYTAKIKLFELGPDGGDHETDGDLLAETDVLA YDCAAREKYAFDEPVLLQAGWWYVAWARVSGPSSDCGSHGQASITTDDGVIFQFKSSKKSNNGTDVNAGQIPQLLYRLP ....................TTTT...................................TTTTTTTT............. ...TTTT...TTTT...TTTT...................TTTT.TTTT.......TTTTTTT.TTTT........... 126 3GBY.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION NASVTFSYLAETDYPVFTLGGSTADAARRLAASGCACAPVLDGERYLGVHLSRLLEGRKGWPTVKEKLGEELLETVRSYR PGEQLFDNLISVAAAKCSVVPLADEDGRYEGVVSRKRILGFLAERI TTTT......TTTT...TTTT...................TTTTT.........TTTTTTT.TTTT.............T TTT...........TTTTTT...TTTT................... 124 3GDW.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION SNANVGVFVLHGDSTASSLKTAQELLGTSIGTANPLTEVQTYEQLRNQVITQKESLNNGILLLTDGSLNSFGNLFEETGI RTKAITTSTIVLEAIRASVGRSLEDIYQNIQLSFESVVREQFRS ...........TTTTTTT...................TTTT..............TTTT..........TTTT....... ................TTTT........................ 102 3GE3.E mol:aa OXIDOREDUCTASE STLADQALHNNNVGPIIRAGDLVEPVIETAEIDNPGKEITVEDRRAYVRIAAEGELILTRKTLEEQLGRPFNMQELEINL ASFAGQIQADEDQIRFYFDKTM ...................TTTT..........TTTT......TTTTT...TTTT......................... .........TTTT......... 176 3GFP.A mol:aa HYDROLASE NVDAIKQLYDCKNEADKFDVLTELYGLTIGSSIIFVATKKTANVLYGKLKSEGHEVSILHGDLQTQERDRLIDDFREGRS KVLITTNVLARGIDIPTVSVVNYDLPTLANGQADPATYIHRIGRTGRFGRKGVAISFVHDKNSFNILSAIQKYFGDIETR VPTDDWDEVEKIVKKV TTTTT......................................................TTTT..............TTT T....TTTTTTTTTT......TTTT..TTTT..............TTTTT......................TTTT.... .TTTTT.......... 101 3GI7.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION NVETDQQTFACAAFNKQVAERELQSAYDELIERRDQFGDEAGLSRIEAAEKVWSQLRDADCKVETHAEQPGSNAYQIAWN SCIAQRSDERAEYLRSLGSQN TTTTT............................TTTTTTTTTT.....................TTTTTTTT........ ..................... 141 3GIX.A mol:aa SPLICING SFLLPKLTSKKEVDQAIKSTAEKVLVLRFGRDEDPVCLQLDDILSKTSSDLSKMAAIYLVDVDQTAVYTQYFDISYIPST VFFFNGQHMKVDYGSPDHTKFVGSFKTKQDFIDLIEVIYRGAMRGKLIVQSPIDPKNIPKY TTTT...............TTTT.......TTTT.............TTTTTTTT.....TTTTT..........TTTT. ..TTTTT.........TTTT......................................... 225 3GKJ.A mol:aa TRANSPORT PROTEIN QSCVWYGECGIAYGDKRYNCEYSGPPKPLPKDGYDLVQELCPGFFFGQVSLCCDVRQLQTLKDNLQLPLQFLSRCPSCFY NLLNLFCELTCSPRQSQFLQVTATEDYVDPVTNQTKTNVKELQYYVGQSFANAMYNACRDVEAPSSNDKALGLLCGKDAD ACQATNWIEYMFNKDNGQAPFTITPVFSDFPVHGMEPMNNATKGCDESVDEVTAPCSCQDCSIVC ............TTTT.............................TTTT......................TTTTT.... ...........TTTT.............TTTTT........................TTTT.TTTT...........TTT TT.............TTTTTTT........TTTTT........TTTT..TTTT............ 143 3GMG.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION ARLAALSILVGAVGATGPGVMITIDDPGPGVAPEVMIDVINELRAAGAEAIQINDAHRSVRVGVDTWVVGVPGSLTVDTK VLSPPYSILAIGDPPTLAAAMNIPGGAQDGVKRVGGRMVVQQADRVDVTALRQPKQHQYAQPV .........................TTTT..................TTTT...TTTT....TTTT....TTTTTTTTTT ......................TTTT..............................TTTT... 288 3GN6.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION FNPWTDAALDTIVNQALTLYAERVVPAHHDAFLAAIDTVSAKLRVLPGFLSLALKQSGDSTVKNYPETYKGVLATAYLDG VAAGTQPYFYNLFVRFADGRAARAAGFEALFETHIHPLLHAADGPELLAYRAVLQSVVAGDRHAIYRGAEEIRSFLRRPV ELPERETVTVENHVVPEDKHAAWEPQVAILLQVAQDTFEPQDEPSGVGLPGARDNRYYRKALSTEILRNAHADGGLRAYI HGVWESVWDHENSHLDPRFLAAAGPVGAAAVVGPVEPFYLTRRLVVAD .TTTT........................................TTTT...................TTTTTTTT.... ............................................................TTTT................ ...............TTTTT.................TTTTTTTTT.....TTTT.............TTTTTTTT.... .............................TTTT............... 243 3GNE.A mol:aa LYASE NVISTLDLNLLTKGGGSWNVDGVNMKKSAVTTFDGKRVVKAVYDKNSGTSANPGVGGFSFSAVPDGLNKNAITFAWEVFY PKGFDFARGGKHGGTFIGHGAASGYQHSKTGASNRIMWQEKGGVIDYIYPPSDLKQKIPGLDPEGHGIGFFQDDFKNALK YDVWNRIEIGTKMNTFKNGIPQLDGESYVIVNGKKEVLKRINWSRSPDLLISRFDWNTFFGGPLPSPKNQVAYFTNFQMK KYE .........TTTTTT.........TTTT...TTTTT.......TTTT.TTTT...........TTTTTTTT......... TTTT.TTTT.............TTTT.TTTT...................TTTT...TTTT.........TTTTTTTTTT TTT............TTTTT.........TTTTT...........TTTTT..........TTTTT............... ... 211 3GNZ.P mol:aa TOXIN AVINHDAVPVWPQPEPADATQALAVRFKPQLDVVNGCQPYPAVDPQGNTSGGLKPSQAAACRDMSKAQVYSRSGTYNGYY AIMYSWYMPKDSPSTGIGHRHDWENVVVWLDNAASANIVALSASAHSGYKKSFPADKSYLDGITAKISYKSTWPLDHELG FTTSAGKQQPLIQWEQMTQAARDALESTDFGNANVPFKSNFQDKLVKAFFQ ...........................................TTTT...........................TTTTT. ............TTTT...............TTTTT........TTTT...TTTT....TTTTT.......TTTT..... .............................TTTT.TTTTTTT.......... 80 3GOE.A mol:aa RECOMBINATION, REPLICATION HHHHHHKLITLLLRSSKSEDLRLSIPVDFTVKDLIKRYCTEVKISFHERIRLEFEGEWLDPNDQVQSTELEDEDQVSVVL TTTT..........TTTT.......TTTT.................TTTT..TTTTT..TTTT.......TTTT...... 132 3GP4.A mol:aa TRANSCRIPTION REGULATOR SLNIKEASEKSGVSADTIRYYERIGLIPPIHRNESGVRKFGAEDLRWILFTRQMRRAGLSIEALIDYLALFREGEHTLEA RAELLKKQRIELKNRIDVMQEALDRLDFKIDNYDTHLIPAQEELKDFNVERS ........................TTTT....TTTT............................................ ................................................TTTT 155 3GP6.A mol:aa TRANSFERASE MNADEWMTTFRENIAQTWQQPEHYDLYIPAITWHARFAYNERPWGGGFGLSRWDEKGNWHGLYAMAFKDSWNKWEPIAGY GWESTWRPLADENFHLGLGFTAGVTARDNWNYIPLPVLLPLASVGYGPVTFQMTYIPGTYNNGNVYFAWMRFQFL .................................TTTT................TTTT...........TTTT........ ........TTTTTT...............TTTT............TTTT.........TTTTT............ 163 3GQQ.A mol:aa SPLICING KQPIGPEDVLGLQRITGDYLCSPEENIYKIDFVRFKIRDDSGTVLFEIKKPPNAGRFVRYQFTPAFLRLRQVGATVEFTV GDKPVNNFRIERHYFRNQLLKSFDFHFGFCIPSSKNTCEHIYDFPPLSEELISEIRHPYETQSDSFYFVDDRLVHNKADY SYS ........TTTT.............TTTT.......................TTTT........................ ....TTTT.....TTTTT............TTTT....................TTTTTTTT.....TTTTT........ ... 98 3GWH.A mol:aa TRANSCRIPTION HSQLMAQLVEVIEDSFQMKVNKESVNYLRLIRHIRFTIERIKKEEPTKEPEKLMLLLKNEYPLCYNTAWKLIKILQQTLK KPVHEAEAVYLTLHLIPI ....................TTTT........................................................ ..............TTTT 113 3GWN.A mol:aa OXIDOREDUCTASE ANGLITKIWGTAGWTFNHAVTFGYPLNPTSDDKRRYKNYFISLGDVLPCRLCRESYKKFITTGKTALTNEVLRNRHTLTK WFYDVHNAVNNKLEVDYGLSYEDVVNKYESFRA ........................TTTT..............................TTTT.................. ................................. 91 3GXW.A mol:aa TRANSCRIPTION SHHRVINHPYYFPFNGKQAEDYLRSKERGDFVIRQSSRGDDHLAITWKLDKDLFQHVDIQEGKVLVVEGQRYHDLDQIIV EYLQNKIRLLN .......TTTT...............TTTT.....TTTTTTT.......TTTT............TTTTT.......... ........... 144 3GY9.A mol:aa TRANSFERASE DVTIERVNDFDGYNWLPLLAKSSQEGFQLVERLRNRREESFQEDGEAFVALSTTNQVLACGGYKQSGQARTGRIRHVYVL PEARSHGIGTALLEKISEAFLTYDRLVLYSEQADPFYQGLGFQLVSGEKITHTLDKTAFADSNR ......TTTT.................TTTTTTTTTTTTT..TTTT.....TTTT.........TTTTTTT......... ..TTTT................TTTT...TTTT.............TTTT..........TTTT 146 3GZB.A mol:aa LYASE GFASLVIPVSAQANSGEPQEQQLAVKYDALTEHDYKTLITFYNRDSIFFDKTANRKYTGGRFIIDFLERAHQGVLEYDFN IEHYNAGSLVVIGNYHFKGPGEQFGKPGKIIDVAIPAVTSLKLDLNRRVTEHVDLIDYQTSDQLAQ ..........TTTTT............TTTTTT.........TTTT...TTTTT................TTTT...... ....TTTTT...............................................TTTTTTTTT. 155 3H05.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION MKKIAIFGSAFNPPSLGHKSVIESLSHFDLVLLEPSIMLDYPIRCKLVDAFIKDMGLSNVQRSDLEQALYSVTTYALLEK IQEIYPTADITFVIGPDNFFKFAKFYKAEEITERWTVMACPEKVKIRSTDIRNALIEGKDISTYTTPTVSELLLN TTTT......TTTT..........TTTTTTT.........................TTTT.................... ....TTTT................TTTT.............TTTT...............TTTTT.......... 238 3H0W.A mol:aa LYASE SMFVSKRRFILKTCGTTLLLKALVPLLKLARDYSGFDSIQSFFYSRKNFMKPSHQGYPHRNFQEEIEFLNAIFPNGAAYC MGRMNSDCWYLYTLDFDQTLEILMSELDPAVMDQFYMKDGVTAKDVTRESGIRDLIPGSVIDATMFNPCGYSMNGMKSDG TYWTIHITPEPEFSYVSFETNLSQTSYDDLIRKVVEVFKPGKFVTTLFVNQKIEGFKRLDCQSAMFNDYNFVFTSFAK ....TTTT.....TTTT..............TTTT.............TTTT...TTTTT.........TTTTTTT.... ..TTTT...............................TTTT..............TTTT......TTTT.......TTTT .......................TTTT............TTTT.........TTTT.........TTTT......... 288 3H20.A mol:aa REPLICATION RTLQAIGRQLKAMGCERFDIGVRDATTGQMMNREWSAAEVLQNTPWLKRMNAQGNDVYIRPAEQERHGLVLVDDLSEFDL DDMKAEGREPALVVETSPKNYQAWVKVADAAGGELRGQIARTLASEYDADPASADSRHYGRLAGFTNRKKHTTYQPWVLL RESKGKTATAGPALVQQAGQQIEQAQRQQEKARRLASLERRTALDEYRSEMAGLVKRFGDDLSKCDFIAAQKLASRGRSA EEIGKAMAEASPALAERKHEADYIERTVSKVMGLPSVQLARAELARAP ..............TTTT.....TTTTT.................................TTTT............... .........TTTT.TTTTTT.......TTTT.......................TTTT...................... .......TTTT..............................................TTTTT.................. ................................................ 225 3H3L.A mol:aa HYDROLASE QALDSDGIPTGGEWITFDGKTLNGWRGYCRQDVPLGWVVEDGSITYKGSDNKADTGFGDLIYDKKFKNFVFEIEWKIDKA GNSGIFYTAQEIEGTPIYYSSPEYQLLDNENPDAWEGCDGNRQAGAVYDIPDPQPVKPYGNWNKTRIVVYNQRVIHYNDV KILEFQFGTPVWRALVDHSKFSKFSTSPEKCPEAYDLLQCGKQPGYIGQDHGYGVCFRNIRIKEL ...TTTT..........TTTTTTTT.TTTT...TTTT.TTTTT.....TTTT.............TTTT........TTT T..........TTTT............TTTTTTTTTTTTTTTTTTT....TTTT...TTTT.......TTTTT....... .....TTTT.........TTTTTTT.TTTT...TTTT.TTTTTT....TTTTTTT.......... 139 3H51.A mol:aa PROTEIN BINDING GVHYTDKAALPADGEAREVAALFDTWNAALATGNPHKVADLYAPDGVLLPTVSNEVRASREQIENYFEFLTKKPKGVINY RTVRLLDDDSAVDAGVYTFTLTDKNGKKSDVQARYTFVYEKRDGKWLIINHHSSAPEVD .................................................TTTT........................... ......TTTT............TTTT..............TTTTT.............. 168 3H5J.A mol:aa LYASE SEAFHTHSGIGVPLRRSNVDTDQIIPAVFLKRVTRTGFEDGLFAGWRSDPAFVLNLSPFDRGSVLVAGPDFGTGSSREHA VWALMDYGFRVVISSRFGDIFRGNAGKAGLLAAEVAQDDVELLWKLIEQSPGLEITANLQDRIITAATVVLPFKIDDHSA WRLLEGLD ...............TTTT.........TTTT........TTTTTT..TTTT...TTTT........TTTTTT....... .............TTTT................................TTTT....TTTTT...TTTT........... ........ 56 3H6P.C mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION GYAGTLQSLGADIASEQAVLSSAWQGDTGITYQGWQTQWNQALEDLVRAYQSMSGT ........................TTTT............................ 152 3H6R.A mol:aa HYDROLASE/HYDROLASE INHIBITOR MASLEDGTYRLRAVTTHNPDPGVGGEYATVEGARQPVKAEPSTPPFSEQQIWQVTRNSDGQYTIKYQGLNAPFEYGFSYD QLEPNAPVIAGDPKEYILQLVPSTADVYIIRAPIQRVGVDVEVGVQGNTLVYKFFPVDGSGGDRPAWRFTRE ..............TTTTTTT..........TTTT.......TTTTT.........TTTT....TTTT..TTTT....TT TTTTTT..............TTTTTTT........TTTT.....TTTTT....................... 95 3H7H.B mol:aa TRANSCRIPTION MDPNLWTVKCKIGEERATAISLMRKFIAYQFTDTPLQIKSVVAPEHVKGYIYVEAYKQTHVKQAIEGVGNLRLGYWNQQM VPIKEMTDVLKVVKE ..........TTTTT.............TTTTTTT........TTTT.......TTTT......TTTT.....TTTT... ....TTTT....... 182 3H8T.A mol:aa HEME-BINDING PROTEIN EAVTKTVTIDASKYETWQYFSFSKGEVVNVTDYKNDLNWDMALHRYDVRLNCGESGKGKGGAVFSGKTEMDQATTVPTDG YTVDVLGRITVKYEMGPDGHQMEYEEQGFSEVITGKKNAQGFASGGWLEFSHGPAGPTYKLSKRVFFVRGADGNIAKVQF TDYQDAELKKGVITFTYTYPVK ............TTTT....TTTTT....TTTT.........TTTTT................................. ...............TTTT..................TTTT..........TTTTT.............TTTT....... ....TTTTTTTT.......... 134 3H96.A mol:aa FLAVOPROTEIN DWNSQVIQEFRANGGRVGGNFEGAPVLVHHVGRKTGKAAVTPYLPSDDDPGTIYVFASKAGAASNPAWYYNLTTAGTAQV EVGTETYAVGVTEVTGEDRDRIYSEQARRYPGFADYEKKTAGIRTIPVLALTRT ............TTTT....TTTT.......TTTTT.........TTTTTTT............................ .TTTT..................................TTTTT.......... 106 3H9W.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION KAIPWKINWQTAFEYIGPQIEALLGWPQGSWKSVEDWATRHPEDQEWVVNFCVKQSECGVDHEADYRALHRDGHYVWIRD VVHVVRDDSGEVEALIGFFDISLEHH .......TTTT..........................................................TTTT....... ......TTTT................ 162 3HA2.A mol:aa OXIDOREDUCTASE QTLIIVAHPELARSNTQPFFKAAIENFSNVTWHPLVADFNVEQEQSLLLQNDRIILEFPLYWYSAPALLKQWDTVTTKFA TGHQYALEGKELGIVVSTGDNGNAFQAGAAEKFTISELRPFEAFANKTKYLPILAVHQFLYLEPDAQQRLLVAYQQYATN VG .......TTTTTTTTTT......TTTTTTT.....TTTT...........TTTT.....TTTTT................ TTTTTTTTTT...............TTTTTTT.TTTTT.................TTTT..................... .. 48 3HE5.B mol:aa DE NOVO PROTEIN GSARNAYLRKKIARLKKDNLQLERDEQNLEKIIANLRDEIARLENEVA TTTT............................................ 65 3HFO.A mol:aa RNA BINDING PROTEIN DSGLPSVRQVQLLIKDQTPVEIKLLTGDSLFGTIRWQDTDGLGLVDDSERSTIVRLAAIAYITPR .TTTT..................TTTT..........TTTT....TTTT................ 111 3HH1.A mol:aa TRANSFERASE AHKGTLYVVATPLGNLDDTFRAVNTLRNAGAIACEDTRRTSILLKHFGIEGKRLVSYHSFNEERAVRQVIELLEEGSDVA LVTDAGTPAISDPGYTASAAHAAGLPVVPVP ..............TTTT..........TTTT.TTTT....................TTTTT.................. ...TTTT........................ 282 3HL1.A mol:aa METAL BINDING PROTEIN PPVWTLPRLYQHFQGAIDLELWTIPYYLTVLYSIKDPTTVPYRLIQAAVYQELHAQLVSNIANAYGYSPTLSAPEYVGTA VPHIDFDLDTPNPTSIFTPYSAELGPLDLTRVNTCLIEYPEWRTQREPDLADDVTDYGSIGEFYDALRVGEQLRGHVRGN QKQDENSPPLTVTESGDAGFLQALTLVDIIVDQGEGQAWPHFQRFDFIRRPNWPGVYTGVTDPPAGSPGAEAQARLIADF AGFLDILNGFSGGGAPPAFGVQAKLGGDILSCWKLGAVPRYS .................................TTTTTT.....................................TTTT TTTT....TTTT..TTTTTT..............TTTTT...........TTTT................TTTT...TTT TTT........................................................TTTTTTTT............. .......................................... 277 3HN0.A mol:aa TRANSPORT PROTEIN DTVIKVSVLRGPSVIAFADWLENPPIIDNKKVQVKVVDSPDLAQALLIKQETDIAVLPINAANLYNKGIKIKLAGCPIWG TLYLVEKTPLKEPALYVFGNGTTPDILTRYYLGRQRLDYPLNYAFNTAGEITQGILAGKVNRAVLGEPFLSIALRKDSSL RITADLNHLTDNDTLGFAQTAVVYTPTEKYRIAFEDALRASCQKAVRYPKETIHSLEEHGIFAQGALTPKSIERCKIYYL SAIEAKDAVGFLRLIEQYEPKAVGGRLPDAGFIPEKQ .........................TTTTT.....................TTTT......................... ......TTTTTTT.....TTTT....................TTTT...................TTTT.......TTTT .......TTTTTTTTT...........TTTT............................TTTTTTTTT............ TTTTTTTTT.............TTTT........... 199 3HN5.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION ESLTGRVYNGEALQLRGNEAVQLQLYQHGYAKHDPINVYVNQDGYSANLFDGEYQITKSGNGPWTSEGRDTINVTVAGNT VQDVEVTPYYLVRDAQTLEGNKVNASFKVEKVAGGGIDRVFFLSTTQFVNDAEHNVDRYDETDNLDAYDETGKLYTFATR DYTDNSFQTALKRGTLFGRICIWPKGSDQGIYSKVIRLK .......TTTT.....TTTT.....TTTT...........TTTT.............TTTT....TTTT.......TTTT ..................TTTT........TTTT...............TTTTTTTT....TTTT....TTTT....... .TTTT..................TTTT............ 251 3HO6.A mol:aa TOXIN GVDFNKNTALDKNYLLNNKIPSNNGSKNYVHYIIQLQGDDISYEATCNLFSKNPKNSIIIQRNMNESAKSYFLSDDGESI LELNKYRIPERLKNKEKVKVTFIGHGKDEFNTSEFARLSVDSLSNEISSFLDTIKLDISPKNVEVNLLGCNMFSYDFNVE ETYPGKLLLSIMDKITSTLPDVNKNSITIGANQYEVRINSEGRKELLAHSGKWINKEEAIMSDLSSKEYIFFDSIDNKLK AKSKNIPGLAS ..TTTT...................................................................TTTT... ...TTTT....TTTT...........TTTT...TTTTT...................................TTTT... ..................TTTT................TTTT.....TTTT.....................TTTTT... ..TTTTTTT.. 184 3HOI.A mol:aa OXIDOREDUCTASE GAERTIQLPKPDNRAGLLKALSERHSTREYASKALSNTDLSDLLWAANGINRSSEGKRTAPSANRQDIDIYVVLPQGTYL YDAKGHKLNLISEGDHRSAVAGGQAFVNNAPVSLVLVSDLSKLGDAKSNHVQLGADAGIVSQNISLFCSAARLATVPRAS DLVRLKAALKLKDTQPNHPVGYFK ..TTTT...........................................TTTTTTT.TTTT............TTTT... .TTTTT..............TTTT.....TTTT...........TTTT................................ ...........TTTT......... 155 3HPC.X mol:aa PROTEIN TRANSPORT SVSVDLNVDPSLQIDIPDALSERDKVKFTVHTKTTLPTFQSPEFSVTRQHEDFVWLHDTLTETTDYAGLIIPPAPTKPDF DGPREKMQKLGEGEGSMTKEEFAKMKQELEAEYLAVFKKTVSSHEVFLQRLSSHPVLSKDRNFHVFLEYDQDLSV ......TTTTTTT.......TTTTT..........TTTTTTT.......................TTTT........... .....................................................TTTT..............TTTT 250 3HR0.A mol:aa TRANSPORT PROTEIN STDEAKMSFLVTLNNVEVCSENISTLKKTLESDCTKLFSQGIGGEQAQAKFDSCLSDLAAVSNKFRDLLQEGLTELNSTA IKPQVQPWINSFFSVSHNIEEEEFNDYEANDPWVQQFILNLEQQMAEFKASLSPVIYDSLTGLMTSLVAVELEKVVLKST FNRLGGLQFDKELRSLIAYLTTVTTWTIRDKFARLSQMATILNLERVTEILDYWGPNSGPLTWRLTPAEVRQVLALRIDF RSEDIKRLRL .....................................TTTT....................................... ...............TTTT............................................................. ...........................................TTTT..........TTTT...............TTTT .......... 91 3HRL.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION ASEAEAKLWQHLRAGRLNGYKFRRQQPGNYIVDFCVTPKLIVEADGVYDHARTVYLNSLGFTVLRFWNHEILQQTNDVLA EILRVLQELEK ..............TTTTTT..TTTT........TTTTT......................................... ........... 317 3HRR.A mol:aa TRANSCRIPTION EIKTTTTLHRVVEETTKPLGATLVVETDISRKDVNGLARGHLVDGIPLCTPSFYADIAMQVGQYSMQRLRGLVDVSDMVV DKALVPHGKGPQLLRTTLTMEWPPKAAATTRSAKVKFATYFKLDTEHASCTVRFTSDAQLKSLRRSVSEYKTHIRQLHDG HAKGQFMRYNRKTGYKLMSSMARFNPDYMLLDYLVLNEAENEAASGVDFSLGSSEGTFAAHPAHVDAITQVAGFAMNAND NVDIEKQVYVNHGWDSFQIYQPLDNSKSYQVYTKMGQANDLVHGDVVVLDGEQIVAFFRGLTLRSVPRGALRVVLQT ....TTTT........TTTT.......TTTTTTTT......TTTTT.................................. ......................TTTT...................................................... .................TTTTT...................................TTTT.................TT TTTTTTT................TTTT.....................TTTTT........................ 56 3HSH.A mol:aa PROTEIN BINDING GSSGVRLWATRQAMLGQVHEVPEGWLIFVAEQEELYVRVQNGFRKVQLEARTPLPR .....................TTTT...TTTTT.....TTTT.............. 270 3HTV.A mol:aa TRANSFERASE KQHNVVAGVDGATHIRFCLRTAEGETLHCEKKRTAEVIAPGLVSGIGEIDEQLRRFNARCHGLVGFPALVSKDKRTIIST PNLPLTAADLYDLADKLENTLNCPVEFSRDVNLQLSWDVVENRLTQQLVLAAYLGTGGFAVWNGAPWTGAHGVAGEETNC SGALRRWYEQQPRNYPLRDLFVHAENAPFVQSLLENAARAIATSINLFDPDAVILGGGVDPAFPRETLVATQKYLRRPLP HQVVRFIAASSSDFNGAQGAAILAHQRFLP ..........TTTT......TTTT.............TTTT.............................TTTT...... .........TTTT..............................TTTT......TTTT...........TTTTTTTTTTTT T......................TTTT................................................TTTTT TTTT.......TTTT............... 167 3HVS.A mol:aa OXIDOREDUCTASE SQTVHFQGNPVTVANSIPQAGSKAQTFTLVAKDLSDVTLGQFAGKRKVLNIFPSIDTGVCAASVRKFNQLATEIDNTVVL CISADLPFAQSRFCGAEGLNNVITLSTFRNAEFLQAYGVAIADGPLKGLAARAVVVIDENDNVIFSQLVDEITTEPDYEA ALAVLKA ....TTTTT.........TTTT........TTTT.......TTTT......TTTTTT................TTTT... ............TTTTTTTTTT....TTTTT...........TTTTTTT........TTTT.......TTTTTT...... ....... 276 3HWP.A mol:aa HYDROLASE RNTPFTYFSLPQKLFLRNQAAVRNKPYAKYFRSERVPLSAVRKIQQGPALEDTLTPSIEDINRLLEPDFVSEESGYALLP GPAYVQSRKFFPGCTAQFKWWFIWHPAESERYTLWFPYAHVSNPCVHHQRLRDESLSFEERLYGNTFCASEYVGDRLHLH IDFQQPASLGLNTDLYREAKIDGSVSALSLADHPEVPVSLVHLFKEVPDGYLTSRYWVGAHPSARFPGAEKAASLLKENG FGEAELETLAYEFAVHDCEFNHLASFLPDLYREFGT .....................TTTTTTT...............TTTT..................TTTT.........TT TT........TTTT.....................TTTTT....TTTT....TTTT.....TTTT......TTTTT.... .............................TTTTTTT..........TTTT.............TTTTTT........... .................................... 139 3HWU.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION QELKGKYKTPTGYLVLRHGDNVLQNLEQLARDEHIPSASFVGIGFSEATFGFYDFGRKQFDPKTYRNVEANTGSIAWKEG KPSIHAHGTVTDGTFQGAGGHLLGLTVGTGSCEITVTVYPQRLDRFVDPEIQANVLGLP TTTTTT..TTTT....TTTT.................................TTTTT......TTTT........TTTT T..........TTTT................................TTTTT....... 124 3HX8.A mol:aa ISOMERASE GQSAKEAIEAANADFVKAYNSKDAAGVASKYDDAAAFPPDARVDGRQNIQKLWQGADGISELKLTTLDVQESGDFAFESG SFSLKAPGKDSKLVDAAGKYVVVWRKGQDGGWKLYRDIWNSDPA .......................................................................TTTT..... .......TTTT...............TTTT.............. 411 3HXL.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION GVYINFKSEPQLGERGIVSPLILSWGEPGKITIEAGDDVFPKLGYSIDAQLRLINEALKRAKTLLLYRLNAGTKAAVTVG NLTVTAKWGGARGNDITLVIQENIDDETKFDVSTLVDGAELDKQTVSDIAGLAANDWVIFSGTGALTETAGAPLINGSDG AVTNQAYIDYLAAVEIFDFNTIALPSTDDALKATFTAFAKRLRDDEGKKIQVVLENYPAADYEGVISVKNGVVLADGTIL TAAQATAWVAGATAGARVNESLTYQGYDEAVDVAPRYTNAQIIAALQAGEFLFTASDNQALVEQDINTLTSFTADKGKQF AKNRVIRVLDGINNDFVRIFSKFYSNNADGRNLLKSECINYNTLQDIDAIKNFDGQTDLTVQSDVDAVYIEAYAWPVDSI EKIYVRVRIKL ..........................TTTT...TTTT.........................................TT TT...TTTT.............TTTTTTT.....TTTTT........TTTTT..TTTT...................... .....................................................TTTT....TTTT........TTTT... ................TTTT.TTTT.TTTT..TTTT..................TTTTT..TTTT.......TTTTT... ...............................................TTTT..TTTTT.....TTTT........TTTT. ........... 180 3HYN.A mol:aa NUCLEOTIDE-BINDING PROTEIN YQNANYSAFYVSEPFSESNLGANSTHDFVYYNLRWKGEDNSFPFNDAHDKTYNVRDGSDWEKTLKPRLHTRLDNSKNIIL FLSSITANSRALREENYGIGTKGLPVIVIYPDYDKKSDIVDSNGNFKKQIKDLWDKLPAFRDNSSVATLHIPCTKSVIIS ALNNEDFVNTADAEKYYYKP ...........TTTTTTTTTTT..TTTT......TTTTTTTT...TTTTTT.TTTT........................ ..TTTT.......................TTTT.......TTTT.................................... ...TTTT............. 123 3HZP.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION SSKEEILSILEAFASTERGSFFLDNATADFLFIRPSGNPLDAKGFENWSSGDLVLESAEITKVHKFELLGSNAAICVFTL GSKFTYKGTQNDDLPTVTSIFKKIDEKWKVAWQRSSGQSDTLW ..............TTTT........TTTT...TTTT............TTTTT.............TTTTTT....... ....TTTTT.............TTTTT................ 290 3I0W.A mol:aa HYDROLASE,LYASE/DNA MDFDMIEEKKDSVIVRNVENFELKDIFDCGQCFRWHRQENGNYIGIAFEKVVEVQKIGEDVVIYNINEEEFKNVWSEYFD LYRDYGEIKKELSRDPLLKKSVDFGEGIRILRQDPFEILLSFIISANNRIPMIKKCINNISEKAGKKLEYKGKIYYAFPT VDKLHEFTEKDFEECTAGFRAKYLKDTVDRIYNGELNLEYIKSLNDNECHEELKKFMGVGPQVADCIMLFSMQKYSAFPV DTWVKKAMMSLYVAPDVSLKKIRDFGREKFGSLSGFAQQYLFYYARENNI ........TTTT..TTTTTTT................TTTT....TTTTT......TTTT..TTTT.............T TTT.....................TTTT................TTTT....................TTTTT....... .................................TTTT..................TTTT..............TTTT... .............TTTT................................. 328 3I1A.A mol:aa TRANSFERASE KQPIQAQQLIELLKVHYGIDIHTAQFIQGGADTNAFAYQADSESKSYFIKLKYGYHDEINLSIIRLLHDSGIKEIIFPIH TLEAKLFQQLKHFKIIAYPFIHAPNGFTQNLTGKQWKQLGKVLRQIHETSVPISIQQQLRKEIYSPKWREIVRSFYNQIE FDNSDDKLTAAFKSFFNQNSAAIHRLVDTSEKLSKKIQPDLDKYVLCHSDIHAGNVLVGNEESIYIIDWDEPMLAPKERD LMFIGGGVGNVWNKPHEIQYFYEGYGEINVDKTILSYYRHERIVEDIAVYGQDLLSRNQNNQSRLESFKYFKEMFDPNNV VEIAFATE ..........................TTTT.TTTT......TTTT................................... TTTT.....TTTT...........TTTTTT.................................................. .TTTT..........................................TTTT.................TTTT........ ..TTTTTTTTT.............................................TTTT...............TTTT. ........ 123 3I2V.A mol:aa TRANSFERASE SRVSVTDYKRLLDSGAFHLLLDVRPQVEVDICRLPHALHIPLKHLERRDAESLKLLKEAIWEEKQGTAAVPIYVICKLGN DSQKAVKILQSLSAAQELDPLTVRDVVGGLAWAAKIDGTFPQY .................................TTTT..........................TTTT........TTTT. ...............TTTT......TTTT.......TTTT... 68 3I4O.A mol:aa TRANSLATION GAIEVEGRVVEPLPNAMFRIELENGHKVLAHISGKMRQHYIRILPEDRVVVELSPYDLSRGRIVYRYK .TTTT.......TTTTT....TTTT..................TTTT......TTTTT.......... 130 3I7M.A mol:aa HYDROLASE TKLEQIQQWTAQHHASTYLSNPKTIEYLTGFGSDPIERVLALVVFPDQDPFIFAPALEVEVIKETGWQFPVIGYLDHENP WAIADQVKQRHVNPEHVAIEKGQLQVAREALAAQFSAPSFDLDITSFIEH ............................................TTTT..........................TTTTTT TT...........TTTT..TTTTTTTTTTTTT..TTTT............ 79 3I84.A mol:aa CELL ADHESION GTVFTTVEDLGSKILLTCSLDDSTEVTGHRWLKGGVVLKEDALPGQKTEFKVDSDDQWGEYSCVFLPEPMGTANIQLHG .........TTTT..................TTTTT................................TTTT....... 145 3IBM.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION HEASRVLRERDYRWEGTEEESGARRQTLVGRPAGQEAPAFETRYFEVEPGGYTTLERHEHTHVVMVVRGHAEVVLDDRVE PLTPLDCVYIAPHAWHQIHATGANEPLGFLCIVDSDRDRPQRPDADDLARMCADPAVARRIRTEG ........TTTTTTTTT.........TTTTTTTTTTTT.........TTTT.......................TTTT.. ..TTTT....TTTT.......TTTT........TTTT............................ 94 3IC3.A mol:aa OXIDOREDUCTASE NATGPKQQPLPPDVEGREDAIEVLRAFVLDGGLSIAFRAFEDPEWGLLLVDIARHAARSYARESEYTEDEALERIVEFEA ELSRPTDTTTERTQ ..........TTTTTTTTTT.......TTTTT................................................ .............. 94 3ID1.A mol:aa HYDROLASE MVRPVVGEIAANSIAAEAQIAPGTELKAVDGIETPDWDAVRLQLVDKIGDESTTITVAPFGSDQRRDVKLDLRHWAFEPD KEDPVSSLGIRPRG .........TTTT.......TTTT...TTTTT..............TTTT........TTTT.........TTTT..TTT TT............ 138 3IDF.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION MKKLLFAIDDTEACERAAQYILDMFGKDADCTLTLIHVKPEFMLYGEAVLAAYDEIEMKEEEKAKLLTQKFSTFFTEKGI NPFVVIKEGEPVEMVLEEAKDYNLLIIGSSENSFLNKIFASHQDDFIQKAPIPVLIVK ........................TTTTTTT................................................. .............................TTTTTTT....TTTT.............. 115 3IDU.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION TDYDKLSNLTFEFPDLTVEIKGPDVVGVNKLAEYEVHVKNLGGIGVPSTKVRVYINGTLYKNWTVSLGPKEEKVLTFNWT PTQEGYRINATVDEENTVVELNENNNVATFDVSVV ...TTTT...............TTTTTTTT.......................TTTTT.........TTTT......... ..........TTTTTT...TTTTTTT......... 101 3IE4.A mol:aa IMMUNE SYSTEM YEVPKAKIDVFYPKGFEVSIPDEEGITLFAFHGKLNEEMEGLEAGTWARDIVKAKNGRWTFRDRITALKPGDTLYYWTYV IYNGLGYREDDGSFVVNGYSG ..........TTTT........TTTT.......TTTT..TTTT.TTTT.....TTTTT....TTTT..TTTT........ TTTTT................ 247 3IEE.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION VSTADIENAAEVIKYYNTSLGVLKDVKEKDVNAVLDYEQKGKTPALSAIVPPAVVSKDSAIVLNPGNCFNEETRRNLKQN YTGLFQARTEFYANFDTYLSYLKKKDVTNAKKLLDVNYQLSTQSEYKQNIFDILSPFTEQAELVLLVDNPLKAQISVRKS STQSILNLYARKHRDGPRIDLKVAELTKQLDAAKKLPVVNGHEGEKSYQAFLSQVETFIKQVKKVREKGEYSDADYDLTS AFETSII .....................................TTTTTT.TTTT.......TTTT..TTTTTTTTT.......... .................................................................TTTTTTTTTTTTTT. ..........TTTT........................TTTTTTT................................... ....... 75 3IG9.A mol:aa VIRAL PROTEIN GGYVNIKTFTHPAGEGKEVKGMEVSVPFEIYSNEHRIADAHYQTFPSEKAAYTTVVTDAADWRTKNAAMFTPTPV .............TTTT.............................TTTT.TTTT.................... 160 3IHT.A mol:aa TRANSFERASE EQSRLDLFIDRVSQRACLEHAIAQTAGLSGPVYELGLGNGRTYHHLRQHVQGREIYVFERAVASHPDSTPPEAQLILGDI RETLPATLERFGATASLVHADLGGHNREKNDRFARLISPLIEPHLAQGGLVSSDRYFEGLEELPLPPGAVVGRCFIYRRG ........................TTTT........TTTT.........TTTT.....TTTT.................. ..........TTTT.....................................TTTT.TTTT.....TTTTTTTT....... 145 3IJM.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION YSHPISLKTLVQEDDIGVNAPIIHQSVIARLTAGLYPLYQSKKIPFEPLPETLTEGYSSPVPDVLLYDHQTEEAKVIIEV CQNSGLKHDTSKIVKLIEDNAYGILEGFVFNYKTQQWLRYRLGDGGVATNSSFSEVLQVDLNTFV ..TTTT.......TTTTTTT.....................TTTT...TTTT...TTTT..TTTT..TTTTT........ ..............................TTTTT.....TTTTTTT......TTTTT....... 191 3IKB.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION ATSLEEITKAIADSQNKVFTEKNIEPLFAAPKTARINIVGQAPGIKAQESRLYWNDKSGDRLREWGVDYDTFYHSGYFAV IPDFYYPGKGKSGDLPPRKGFAQKWHQPILDLLPDIQLTILIGNYAQKYYLHQKSSVKLTDTVAHYKKYLPDYFPLVHPS PRNQIWSRHPWFEAQVVPDLKKIIQQIIQSS ..............................TTTT.................TTTT......................... .........TTTT....TTTT...........TTTT.................TTTT...........TTTTT....... ......TTTT..................... 373 3IKW.A mol:aa LYASE TAQTKNTQTLMPLTERVNVQADSARINQIIDGCWVAVGTNKPHAIQRDFTNLFDGKPSYRFELKTEDNTLEGYAKGETKG RAEFSYCYATSDDFRGLPADVYQKAQITKTVYHHGKGACPQGSSRDYEFSVYIPSSLDSNVSTIFAQWHGMPDRTLVQTP QGEVKKLTVDEFVELEKTTFFKKNVGHEKVARLDKQGNPVKDKNGKPVYKAGKPNGWLVEQGGYPPLAFGFSGGLFYIKA NSDRKWLTDKDDRCNANPGKTPVMKPLTSEYKASTIAYKLPFADFPKDCWITFRVHIDWTVYGKEAETIVKPGMLDVRMD YQEGKKVSKHIVDNEKILIGRNDEDGYYFKFGIYRVGDSTVPVCYNLAGYSER ......TTTT.......TTTTT......TTTTTT......TTTTTT.....TTTTT.......TTTT......TTTT... .............TTTT......................TTTT..........TTTTTTTT...........TTTTT.TT TT..................TTTTT........TTTT....TTTT...................TTTT..TTTTT..... ....TTTTTTTTT...TTTTTTTTT...TTTT.............TTTT...................TTTT........ ...................................TTTT.............. 464 3ILW.A mol:aa ISOMERASE VGRALPEVRDGLKPVHRRVLYAMFDSGFRPDRSHAKSARSVAETMGNYHPHGDASIYDSLVRMAQPWSLRYPLVDGQGNF GSPGNDPPAAMRYTEARLTPLAMEMLREIDEETVDFIPNYDGRVQEPTVLPSRFPNLLANGSGGIAVGMATNIPPHNLRE LADAVFWALENHDADEEETLAAVMGRVKGPDFPTAGLIVGSQGTADAYKTGRGSIRMRGVVEVEEDSRGRTSLVITELPY QVNHDNFITSIAEQVRDGKLAGISNIEDQSSDRVGLRIVIEIKRDAVAKVVINNLYKHTQLQTSFGANMLAIVDGVPRTL RLDQLIRYYVDHQLDVIVRRTTYRLRKANERAHILRGLVKALDALDEVIALIRASETVDIARAGLIELLDIDEIQAQAIL DMQLRRLAALERQRIIDDLAKIEAEIADLEDILAKPERQRGIVRDELAEIVDRHGDDRRTRIIA TTTTT.TTTTT.................TTTT................................TTTTTTTT........ .TTTT....TTTTT...........TTTT.........TTTT....TTTT.TTTT..........TTTT........... ..........TTTT.................TTTT..............................TTTT.........TT TT...............TTTTTT.......TTTTT.......TTTT............TTTTT........TTTTT.... ................................................................................ ................................................................ 50 3IM3.A mol:aa STRUCTURAL PROTEIN, SIGNALING PROTEIN SLRECELYVQKHNIQALLKDSIVQLCTARPERPMAFLREYFEKLEKEEAK .............................TTTT................. 155 3IMK.A mol:aa METAL BINDING PROTEIN GDEKPAITKIISGGQTGADRAALDFAIKHHIPYGGWVPKGRLAEGGRVPETYQLQEPTSDYSKRTEKNVLDSDGTLIISH GILKGGSALTEFFAEQYKKPCLHIDLDRISIEDAATLINSWTVSHHIQVLNIAGPRAGKDPEIYQATDLLEVFLA ..........................................TTTT..TTTT............................ ........................TTTTT.................TTTT.....TTTTTTTT............ 92 3IP4.C mol:aa LIGASE KVTREEVEHIANLARLQISPEETEEMANTLESILDFAKQNDSADTEGVEPTYHVLDLQNVLREDKAIKGIPQELALKNAK ETEDGQFKVPTI ............................................TTTT..TTTT.....................TTTTT TTTTTT...... 69 3IPF.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION NRQFLSLTGVSKVQSFDPKEILLETIQGVLSIKGEKLGIKHLDLKAGQVEVEGLIDALVYPLEHHHHHH ......TTTT......TTTT....TTTT..............TTTTT...........TTTT....... 95 3IPJ.A mol:aa TRANSFERASE SNANKYNKIANELIKIIGEDNIISITHCATRLRVMVKDREIINDKKVEKVDEVKGVFFTSGQYQIILGTGIVNKVYAEVE KMGLKTLSKKEQDEL .TTTT......................TTTT....TTTT..........TTTT.....TTTT.....TTTT......... ......TTTT..... 211 3IR4.A mol:aa OXIDOREDUCTASE SNAKLYIYDHCPFCVKARIFGLKNIPVELNVLQNDDEATPTRIGQKVPILQKDDSRYLPESDIVHYVDNLDGKPLLTGKR NPAIEEWLRKVNGYVNQLLLPRFAKSAFDEFSTPAARQYFIRKKEASSGSFDNHLAHSAGLIKKIGDDLRLLDKLIVQPN AVNGELSEDDIHLFPLLRNLTLVAGIHWPTKVADYRDNAKQTQINLLSSAI .......TTTT.....................TTTT...............TTTT.............TTTT.TTTT... ...........TTTT..............................................................TTT TTTT..................TTTT............TTTTT........ 182 3IT5.A mol:aa HYDROLASE APPSNLMQLPWRQGYSWQPNGAHSNTGSGYPYSSFDASYDWPRWGSATYSVVAAHAGTVRVLSRCQVRVTHPSGWATNYY HMDQIQVSNGQQVSADTKLGVYAGNINTALCEGGSSTGPHLHFSLLYNGAFVSLQGASFGPYRINVGTSNYDNDCRRYYF YNQSAGTTHCAFRPLYNPGLAL ..TTTT.....TTTT........TTTT..........TTTT.TTTT................TTTT....TTTT....TT TT.....TTTT..TTTT............TTTT............TTTTT...TTTTTTTTT......TTTT.TTTTT.. .TTTTT...TTTT......... 142 3IU6.A mol:aa TRANSCRIPTION MNVTLLIQELIHNLFVSVMSHQDDEGRYSDSLAEIPAVDPNFPNKPPLTFDIIRKNVENNRYRRLDLFQEHMFEVLERAR RMNRTDSEIYEDAVELQQFFIKIRDELCKNGEILLSPALSYTTKHLHNDVEKERKEKLPKEI ......................TTTT.....TTTTTTTTTTTTTT................................... ...TTTT.........................TTTT.......................... 32 3IUF.A mol:aa PROTEIN BINDING EDRDKPYACDICGKRYKNRPGLSYHYAHSHLA TTTTTT..TTTTT..............TTTT. 106 3IUO.A mol:aa HYDROLASE RVRTLANKSKKVSIVQQIDRKVALDDIAVSHGLDFPELLSEVETIVYSGTRINIDYFINEVDEDHLEDIFEYFKESTTDS LEEAQELGKDYSEEEIRLVRIKFLSE ...........................................................................TTTTT TTTTTTTTTTT............... 77 3IUW.A mol:aa RNA BINDING PROTEIN QHPTIHTLKIETEFFKAVKERRKTFEIRKNDRNFQVGDILILEEYNGYLDDECEAEVIYITDYAQREGYVVLGIELH ....................TTTT.....TTTT.TTTT.......................TTTTTTTT........ 140 3IVV.A mol:aa LIGASE SGKVVKFSYMWTINNFSFCREEMGEVIKSSTFSSGANDKLKWCLRVNPKGLDEESKDYLSLYLLLVSCPKSEVRAKFKFS ILNAKGEETKAMESQRAYRFVQGKDWGFKKFIRRDFLLDEANGLLPDDKLTLFCEVSVVQ ............TTTT.....TTTT.....................TTTT....TTTT.........TTTT......... ..TTTT.......TTTT...TTTT..................TTTT.............. 87 3IWF.A mol:aa TRANSCRIPTION REGULATOR PNILYKIDNQYPYFTKNEKKIAQFILNYPHKVVNTSQEIANQLETSSTSIIRLSKKVTPGGFNELKTRLSKFLPKEVTQY NNKLHSR .........................................................TTTT...............TTTT ....... 163 3IX3.A mol:aa TRANSCRIPTION FLELERSSGKLEWSAILQKMASDLGFSKILFGLLPKDSQDYENAFIVGNYPAAWREHYDRAGYARVDPTVSHCTQSVLPI FWEPSIYQTRKQHEFFEEASAAGLVYGLTMPLHGARGELGALSLSVEAENRAEANRFMESVLPTLWMLKDYALQSGAGLA FEH ..................................TTTT.......................................... .................................TTTT........................................... ... 233 3JQY.A mol:aa TRANSFERASE KTQDSRLKTQDSFSVDDNGSGNVFVCGDLVNSKENKVQFNGNNNKLIIEDDVECRWLTVIFRGDNNYVRIHKNSKIKGDI VATKGSKVIIGRRTTIGAGFEVVTDKCNVTIGHDCMIARDVILRASDGHPIFDIHSKKRINWAKDIIISSYVWVGRNVSI MKGVSVGSGSVIGYGSIVTKDVPSMCAAAGNPAKIIKRNIIWARTDKAELISDDKRCSSYHAKLTQLEHHHHH ...TTTT..TTTT.....TTTT......TTTT................TTTT..................TTTT...... ..TTTT....TTTT..TTTT...TTTT....TTTT..TTTT...........TTTTT.TTTT......TTTT..TTTT.. TTTT..TTTT..TTTT......TTTT...TTTT...........TTTT......................... 151 3JRN.A mol:aa PLANT PROTEIN TKYDVFLSFRGHDTRHNFISFLYKELVRRSIRTFKDDKPIEVSRFAVVVVSENYAASSWCLDELVTIMDFEKKGSITVMP IFYGVEPNHVRWQTGVLAEQFKKHASREDPEKVLKWRQALTNFAQLSGDCSGDDDSKLVDKIANEISNKKT TTTT.........TTTTT....................TTTTT.......TTTTTTTT...................... .TTTT..............................................TTTT...........TTTT. 508 3JSZ.A mol:aa TRANSFERASE SNELSKLRRFFSALNHTSEIDLHTLFDNLKSNLTLGSIEHLQEGSVTYAIIQELLKGADAQKKIESFLKGAIKNVIHPGV IKGLTPNEINWNVAKAYPEYYEHEKLPDVTFGGFKVRDSNEFKFKTNVQTSIWFSIKPELFPSKQQEALKRRREQYPGCK IRLIYSSSLLNPEANRQKAFAKKQNISLIDIDSVKTDSPLYPLIKAELANLGGGNPAAASDLCRWIPELFNEGFYVDIDL PVDSSKIVEGHQITGGVPILNGSIISEPIAPHHRRQEAVCATDIIAYANDRETQVDTVALHLKNIYDDPYTALKDTPLAQ TAFFNRCEEEGKNIFELRKGLQDAFRSDSLLELYVFLGPAKFKEVFKLKETQIKYIDDHISEFNEHDLLLHLISDNPSEI NQHTLDFGRAKVYDIAKEHYSAFYKPLVEEISGPGAIYNALGGASNFTTTHRRSTGPLPTTPPRVLQVFCDAHDKGPFVS DNIARWQTNVRELGVLNREGLSWLPSVG .............TTTTTTTT....................TTTT...........TTTT.................... ................TTTTT........TTTTT..........TTTT......TTTTTT...............TTTT. .....TTTTT.......................................................TTTTTT.....TTTT .......TTTT..................TTTT........TTTT...........................TTTT.... ...............................................................................T TTTTTTT......TTTT...........TTTTT............................................... ....TTTT.........TTTTTTT.... 173 3JTW.A mol:aa OXIDOREDUCTASE ARKVILFIASIDNYIADDQGAVDWLEKNVHGTESDDSYEKYSKIDTVIGRTTYEQVTQKLSPEKYVYADRQTYIVTSHLG EDTDKIKYWKQSPVELVKRIQKEKGKDVWIVGGAKIIDPLVQANLIDTYILTTVPIFLGSGIRLFDRLEEQVPVRLIDVY QKNELVYSIYQRG .........TTTT...TTTT...............TTTTTTTTT................TTTTTTTTTT.......... ..TTTT........................................................TTTTTT............ TTTTT........ 95 3JU0.A mol:aa DNA BINDING PROTEIN SLTDSKVKNAKSLEKEYKLTDGFGMHLLVHPNGSKYWRLSYRFEKKQRLLALGVYPAVSLADARQRRDEAKKLLAAGIDP SAKKQADNKTIQEKR ............TTTT.....TTTT....TTTT........TTTTT.......TTTTTT..................... ............... 145 3JUD.A mol:aa TRANSFERASE MALVFVYGTLKRGQPNHRVLRDGAHGSAAFRARGRTLEPYPLVIAGEHNIPWLLHLPGSGRLVEGEVYAVDERMLRFLDD FQSCPALYQRTVLRVQLLEEEPPAPTAVQCFVYSRATFPPEWAQLPHHDSYDSEGPHGLRYNPRE .......TTTTTTTTTT..................TTTT......TTTT......TTTT..................... ..TTTTTT...........TTTT........................TTTTTTTTTTT....... 84 3JXO.A mol:aa TRANSPORT PROTEIN NLYFQGMIPLEQGIEFLSVNVEEDSPVVGKKLKDLPLPRDSIIAAIVRGGVLVVPRGDTEILSGDKLYVIVSAEAKETVE ETLL ..........TTTT.......TTTTTTTTT.......TTTT.....TTTTT....TTTT..TTTT......TTTTT.... .... 178 3JYG.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION SLIYQIAKEFDFCYGHRVWSQELNPDFSLDPCLSCRHLHGHQGKVIVHLESRELQRGVTDFAHLNWFKRFIDEVLDHRFI IDIDDPLFPTLLPHFADKSALVWEEGYARVDFERIKGESSPILELYESFVVVRFVPTSESIASWLLELLRSRIQPLGVKV SSVEFLETPKSRARVYNE .................TTTT......TTTT......................TTTT.................TTTT.. .TTTTTTT......TTTTTTT.............TTTT.......................................... .....TTTTTT....... 147 3JYZ.A mol:aa STRUCTURAL PROTEIN GIDPFTVRTRVSEGLVLAEPAKLISTDGSASTADLTRATTTWNQQSNNLGASSKYVTSVLDAGNTGVITITYVADQVGLP TAGNTLILSPYINDGNTRTALATAVAAGTRGTIDWACTSASNATATAQGFTGAAGSVPQEFAPAQCR .......................TTTTT........................TTTT....TTTT...............T TTTT........TTTTT....................TTTTT......................... 189 3JZ9.A mol:aa TRANSPORT PROTEIN GHVTRIENLENAKKLWDNANSLEKGNISGYLKAANELHKFKEKNLKEDDLRPELSDKTISPKGYAILQSLWGAASDYSRA AATLTESTVEPGLVSAVNKSAFFDCKLSPNERATPDPDFKVGKSKILVGIQFIKDVADPTSKIWHNTKALNHKIAAIQKL ERSNNVNDETLESVLSSKGENLSEYLSYK .....................TTTT...............TTTT.........TTTT....................... ..TTTTT............TTTTTTTTTTTT....TTTT..................TTTT....TTTTT.......... ..................TTTT....... 100 3K0X.A mol:aa PROTEIN BINDING SAKLIFINQINDCKDGQKLRFLGCVQSYKNGILRLIDGSSSVTCDVTVVLPDVSIQKHEWLNIVGRKRQDGIVDVLLIRS AVGINLPRYRQMVSERQKCD .............TTTT...........TTTT....TTTT.....TTTTTTTT..TTTT.........TTTT........ TTTT................ 146 3K0Z.A mol:aa LYASE EVQLLKEPKPKATIDPSLSQKEATEVHAAQRFYAFWDTGKEELIPQTVTENFFDHTLPKGRPQGTEGLKFAAQNFRKIVP NIHCEIEDLLVVGDKVTARLSFTGTHNDKKIDFFAIDILHVKDGKITEDWHLEDNLTLKQQLGLIA ..............TTTT...................................TTTTTTTT...............TTTT TT.........TTTT.........TTTTT...........TTTTT..................... 330 3K2O.A mol:aa OXIDOREDUCTASE HNHKSKKRIREAKRSARPELKDSLDWTRHNYYESFSLSPAAVADNVERADALQLSVEEFVERYERPYKPVVLLNAQEGWS AQEKWTLERLKRKYRNQKFKCGEDNDGYSVKKKYYIEYESTRDDSPLYIFDSSYGEHPKRRKLLEDYKVPKFFTDDLFQY AGEKRRPPYRWFVGPPRSGTGIHIDPLGTSAWNALVQGHKRWCLFPTSTPRELIKVTRDEGGNQQDEAITWFNVIYPRTQ LPTWPPEFKPLEILQKPGETVFVPGGWWHVVLNLDTTIAITQNFASSTNFPVVWHKTVRGRPKLSRKWYRILKQEHPELA VLADSVDLQE ................TTTTT.....TTTT.................................TTTT....TTTTTTTT. .............TTTT......TTTT...............TTTT....TTTT..TTTT.................... .TTTTT........TTTT...........................TTTT..............TTTT............. TTTT...........TTTT....TTTT..................TTTTT.............................. .......... 178 3K5J.A mol:aa PROTEIN BINDING GDYNQTVLSHLQKFWKHHDIKGFTWTLGRIVEELPDFQVFQVIPNHEDEPWVYVSSGIGQFLGQEFFIISPFETPEHIET LALASASHYPDQFQLGKTVNIGRPWVEQSSFRHFLISLPYPYGQELEYDNVRFFWLLPITQTERLFLNTHSVEELETKFD EAGIDYLDINRASTVWQA ..............TTTT......TTTT.....TTTT........TTTT............................... ..TTTTT......TTTT...TTTTTTTTT.........TTTTT..................................... ....TTTTTTT....... 147 3K69.A mol:aa TRANSCRIPTION KLDFSVAVHSILYLDAHRDSKVASRELAQSLHLNPVIRNILSVLHKHGYLTGTVGKNGGYQLDLALADNLGDLYDLTIPP TISYARFITGPSKADQSPIAANISETLTDLFTVADRQYRAYYHQFTADLQADLNHHGTFLQHEQDSE ................TTTT..................................TTTT......TTTT............ TTTT..................................................TTTT......... 204 3K6O.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION TVTIAATVEKQPQYDAPYLVLDNGEKLWVVQHIVPYRDLKAGERIFGNYSFLEAGESGFAYNIRLNDYTLVPVQKIIGLN PDNDSIGNKVQIKDWPSDDYLNVRFLNFPSPQKPILNLVVNEIPWTKDGYAHLELRYNNNGSQGRLVPGVSFKLDDYSPE NSELKGIKVLVNPVDGEEKTYIFSYPLTGEDVPGFNPLDLAELK ..........TTTT......TTTT.......TTTTTTTTTTTT............TTTT....................T TTTTTTT.........TTTT........TTTT..........................TTTT...............TTT TTTT........TTTT............TTTT............ 339 3K7X.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION KWSEYANLAQQSLEKFYLADTKEQFLNNFYPTENPEEDNKVFNYWWLAHLVEVRLDAYLRTKKQADLEVAEKTYLHNKNR NGGTLIHDFYDDLWNALAAYRLYKATGKSIYLEDAQLVWQDLVDTGWNDIGGGFAWRRPQYYKNTPVNAPFIILSCWLYN ELNETKYLEWAKTYEWQTKVLVREDGFVEDGINRLEDGTIDYEWKFTYNQGVYIGANLELYRITKEAIYLDTANKTAAIS LKELTEDGIFKDEGNGGDEGLFKGIFYRYFTDLIEETANKTYRDFVLNSCQILVENAKLDGYLLGNWKEKPSGKIPYSAE LSGIALEAAKLELEHHHHH ....................TTTTT...TTTTTTT............................................T TTTT..TTTTTT............................................TTTT.................... ......................TTTT......TTTT....TTTTT................................... ....TTTTT................................................TTTTT...TTTT........... ...TTTT............ 212 3KD3.A mol:aa UNKNOWN FUNCTION KNIIFDFDSTLIKKESLELILEPILQKSPAKLKEIEYITNLGQGDISFRDSLQKRLAIASPTKQSIKEFSNKYCPNLLTD GIKELVQDLKNKGFEIWIFSGGLSESIQPFADYLNIPRENIFAVETIWNSDGSFKELDNSNGACDSKLSAFDKAKGLIDG EVIAIGDGYTDYQLYEKGYATKFIAYEHIEREKVINLSKYVARNVAELASLI ....................TTTTTTTTT..............TTTT.........................TTTTTTTT TT..............................................TTTT......TTTTTTTT.............. .................TTTTTT..............TTTT........... 75 3KDE.C mol:aa DNA BINDING PROTEIN/DNA MKYCKFCCKAVTGVKLIHVPKCAIKRKLWEQSLGCSLGENSQICDTHFNDSQWKAAKGQTFKRRRLNADAVPSKV ...TTTTT.............................TTTT.........................TTTT..... 110 3KDF.A mol:aa REPLICATION VDDLPRSRINAGLAQFIDKPVCFVGRLEKIHPTGKFILSDGEGKNGTIELEPLDEEISGIVEVVGRVTAKATILCTSYVQ FKEDSHPFDLGLYNEAVKIIHDFPQFYPLG ...............TTTT...........TTTT.....TTTT........................TTTT......... ..TTTT........................ 113 3KDF.B mol:aa REPLICATION HIVPCTISQLLSATLVDEVFRIGNVEISQVTIVGIIRHAEKAPTNIVYKIDDTAAPDVRQWVTVVPPETYVKVAGHLRSF QNKKSLVAFKIPLEDNEFTTHILEVINAHVLSK ..............TTTTT.TTTTT................TTTT....................TTTT.........TT TTT....TTTT..................TTTT 191 3KDG.A mol:aa HYDROLASE MDRVPIMYPIGQMHGTYILAQNENGLYIIDQHAAQERIKYEYFREKVGEVEPEVQEMIVPLTFHYSTNEALIIEQHKQEL ESVGVFLESFGSNSYIVRCHPAWFPKGEEAELIEEIIQQVLDSKNIDIKKLREEAAIMMSCKGNRHLRNDEIKALLDDLR STSDPFTCPHGRPIIIHHSTYEMEKMFKRVM ...........TTTTTT....TTTT.......................TTTT.............TTTT........... ..........TTTT......TTTTTTTTT.............TTTT.................................. .TTTTTTTTTT................TTTT 127 3KE7.A mol:aa ISOMERASE QKNENKTLNENIPEIISLEKEALASTDPAFVELSDTDVIYFDPSLETKIEGLEQLRTYYKGQLPPADHFDIRPVVQVAQN IAVLTFNLDSYLSDKVIKWNCTEVYRRNPDNQWKIIQTHWSYVKPLD .....TTTTTTTTT...........................TTTTTTT.................TTTT........TTT T.........TTTTT............TTTT..........TTTTTT 281 3KG9.A mol:aa LYASE LHPLLGEKLNLARIENQHHFQSYLTAESPAYLSQHQVFNKVLFPATGYLEIAAAVGKNLLTTGEQVVVSDVTIVRGLVIP ETDIKTVQTVISTLENNSYKLEIFSTSEANQWTLHAEGKIFLDSTTNTKAKIDLEQYQRECSQVIDIQQHYQQFKSRGID YGNSFQGIKQLWKGQGKALGKIALPEEIAGQATDYQLHPALLDAALQILGHAIGNTETDDKAYLPVGIDKLKQYRQTITQ VWAIVEIPENTLKGSIKLVDNQGSLLAEIEGLRVTATTADA .TTTTT....TTTTTTT.......TTTTTT.....TTTTT....................TTTT...............T TTT...........TTTT.............................................................. .............TTTT.......TTTTT.....TTTT................TTTTT..................... .......TTTT........TTTT..............TTTT 92 3KGK.A mol:aa CHAPERONE MKTLMVFDPAQALVDFSTDVQWLKQSGVQIERFNLAQQPMSFVQNEKVKAFIEASGAEGLPLLLLDGETVMAGRYPKRAE LARWFGIPLDKV .................................TTTTTT........................TTTTT............ ............ 122 3KK4.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION SNAEIFIRANQRSYSVQARSLRLHGVATSVRLEQLFWDVLEEIAARDGRVTQLIERLYDELVQYRGEAANFTSFLRVCCL RYQVLQAEGRIPADATVPIRSLDAQAVLRGLPANLYDSRPLG ...TTTTTT............TTTTT.........................................TTTT......... ........TTTT.TTTT..........TTTT........... 100 3KKF.A mol:aa OXIDOREDUCTASE GAENNVRLSRIIIDPERLEEYNAYLKEEIEVSRLEPGVLVLYAVAEKERPNHVTILEIYADEAAYKSHIATPHFKKYKEG TLDVQLELIDATPLIPGLKK ..................................TTTT.......TTTTTTT............................ ............TTTTTT.. 140 3KKG.A mol:aa LYASE GQDRSPIETQNVETVLRLFDEGWGAQDGWRDVWRETTPGFRSIFHSNQAVEGIEQAIAFNAVLFEGFPRLEVVVENVTVE GDNVVVQARLTGAQDGPFLGVPPSGQVDVPDVTLFTLADGQVIERYFTDLLAVTAISAPP ..................TTTTTTTTTTT.......TTTT...TTTT...................TTTT.........T TTT.............TTTTT...............TTTTT.......TTTTTTTTTT.. 137 3KLQ.A mol:aa CELL ADHESION STVQTSISVENVLERAGDSTPFSVALESIDAMKTIEEITIAGSGKASFSPLTFTTVGQYTYRVYQKPSQNKDYQADTTVF DVLVYVTYDEDGTLVAKVISRRAGDEEKSAITFKPKRLVKPIPPRQPDFPKTPLPLA ..............TTTT.........TTTTT...................................TTTTTT....... ........TTTT.........TTTT................................ 168 3KMI.A mol:aa MEMBRANE PROTEIN NIHKIHEVQKKLQEEVSIVLIDIADIIVNPKKENGYSRDLYTLNSLIDSSISETYDNINNTLLSDTRFFLEHDIIKSQRD ILENLYSYVSQLNSTPPQAHILSAFIHKIGYTEFEAETGNLLLEELKRLISKNQPLPVDRTEFENRAILFLCLTELKQFL VNRKHAQL ............................TTTT................................................ ..................................TTTT.......................................... ........ 224 3KOG.A mol:aa MEMBRANE PROTEIN TPVNAKFIITPVVIDATTGTDVTQSAEISFSKGNGTYEGTPELASESININAKYKGTGSASVTIPALKAGQFGAKEVTII LSENFFAQEESSNSQIETTKHSGFKNNTSDYWYYITVTYTKKEGSEVIKNDYEGDDSEIKNIIDAYNKGVREDKVTLNDV QVLAHSRFSVFVDYKTTSVYQIIEKSPDGNPVASFTVDSYNTIVSPKNEQIPGHGHAPSHGHGH ..............TTTTT..........TTTT.....TTTTT.........TTTT...........TTTT......... .TTTT........................................................................... ..TTTT............................................TTTT.....TTTT. 193 3KOS.A mol:aa TRANSCRIPTION QEKLKIGVVGTFAIGCLFPLLSDFKRSYPHIDLHISTHNNRVDPAAEGLDYTIRYGGGAWHDTDAQYLCSALSPLCSPTL ASQIQTPADILKFPLLRSYRRDEWALWQTVGEAPPSPTHNVVFDSSVTLEAAQAGGVAIAPVRFTHLLSSERIVQPFLTQ IDLGSYWITRLQSRPETPAREFSRWLTGVLHKT ...........................TTTT......TTTT.......TTTT.......TTTT................. ...................TTTT....TTTT....TTTT...TTTTTT......................TTTT...... ..........TTTT................... 49 3KPE.A mol:aa VIRAL PROTEIN, FUSION PROTEIN HLEGEVNKIKSALLSTNKAVVSLSNGVSVLTSKVLDLKNYIDKQLLPIV ................................................. 76 3KUC.B mol:aa GTP BINDING PROTEIN/TRANSFERASE NTIRVFLPNKQRTVVRVRNGMSLHDCLMKKLKVRGLQPECCAVFRLLHEHKGKKARLDWNTDAASLIGEELQVDFL ......TTTTT......TTTT............................TTTT....TTTT....TTTT....... 255 3KWS.A mol:aa ISOMERASE DLELKLSFQEGIAPGESLNEKLDFEKLGVVGFEPGGGGLAGRVNEIKQALNGRNIKVSAICAGFKGFILSTDPAIRKECD TKEIIAAAGELGSTGVIIVPAFNGQVPALPHTETRDFLCEQFNEGTFAAQHGTSVIFEPLNRKECFYLRQVADAASLCRD INNPGVRCGDFWHTWEETSDGAFISGGEYLQHVHVASRKRRSPGEDGDADNYINGFKGLKIGYNNYVSFECGCQGDRNVV VPAAVKLLREQWEQA ........TTTT............TTTT.......TTTT..........TTTT.............TTTT.......... .....................TTTTTTT................................TTTTTTTTT........... ..TTTT...TTTTTTTTTTT................TTTTT..TTTT................................. ............... 56 3KXT.A mol:aa DNA BINDING PROTEIN/DNA MKPVKVKTPAGKEAELVPEKVWALAPKGRKGVKIGLFKDPETGKYFRHKLPDDYPI .......TTTT......TTTT....TTTT.........TTTTT.......TTTT.. 110 3KYZ.A mol:aa TRANSFERASE YFLAPADRHYLADYARQAEDAWRREGAAGAERFRKELSAKEDTWVALVGPHLESLGSTPLSAEESSHLTFRKLDWPSRRL QDELPYVSIEFPGHPEQGRLVIQLPERLLP ................................................TTTT...................TTTT..TTT TT........TTTTT............... 47 3KZ5.A mol:aa DNA BINDING PROTEIN SSRHQFAPGATVLYKGDKMVLNLDRSRVPTECIEKIEAILKELEKPA ......TTTT....TTTT.....TTTTT................... 85 3KZD.A mol:aa SIGNALING PROTEIN GKVTHSIHIEKTYGFSLSSVEEDGIRRLYVNSVKETGLASKKGLKAGDEILEINNRAADALNSSMLKDFLSQPSLGLLVR TYPEL ....................TTTT.........TTTT.......TTTT...TTTTT........................ ..... 228 3KZP.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION KFQLFIQPKLDVLQGNIVEYEILLRDDSAVPRFPLSELEAVLADEELYLAFSEWFSEAFLDVLKKYPNDRFAINIAPQQL FYIETLHWLDKLKSESHRITVETEDIFDVPGHKRHLNANDKNAFILNKIKVIHGLGYHIAIDDVSCGLNSLERVSYLPYI IEIKFSLIHFKNIPLEDLLLFIKAWANFAQKNKLDFVVEGIETKETTLLESHGVSIFQGYLVNKPFPV ..........TTTTT............TTTT..................................TTTT........... ................................TTTT........................TTTTTTTTTTTTTT...... .........TTTT.............................TTTT...................... 518 3L0Q.B mol:aa TRANSFERASE LASYFIGVDVGTGSARAGVFDLQGRVGQASREITFKPKADFVEQSSENIWQAVCNAVRDAVNQADINPIQVKGLGFDATC SLVVLDKEGNPLTVSPSGRNEQNVIVWDHRAITQAERINATKHPVLEFVGGVISPEQTPKLLWLKQHPNTWSNVGHLFDL PDFLTWRATKDETRSLCSTVCKWTYLGHEDRWDPSYFKLVGLADLLDNNAAKIGATVKPGAPLGHGLSQRAASEGLIPGT AVSVSIIDAHAGTIGILGASGVTGENANFDRRIALIGGTSTAHASRSAHFISGIWGPYYSAILPEYWLNEGGQSATGALI DHIIQSHPCYPALLEQAKNKGETIYEALNYILRQAGEPENIAFLTNDIHLPYFHGNRSPRANPNLTGIITGLKLSTTPED ALRYLATIQALALGTRHIIETNQNGYNIDTASGGGTKNPIFVQEHANATGCALLPEESEALLGSAGTVAAGVFESLPEAA ASRIGKTVTPQTNKIKAYYDRKYRVFHQYHDHRYQALQ ..........TTTT......TTTT.............TTTT....................................... .....TTTT.....TTTTTTTTT.....TTTT................TTTT............................ .........................TTTTT...................TTTTTTTT.....TTTT..........TTTT .....................TTTT...TTTT....TTTTT.........TTTT...TTTTTTTTT......TTTT.... ......TTTT........................TTTT......TTTT...TTTT.TTTTTTTTT...........TTTT .....................TTTT.......TTTTTT................TTTTTTTTTTTTTTTTTTTTTTTTT. ............................TTTTTTTTT. 189 3L15.A mol:aa TRANSCRIPTION GLGTARLQLVEFSAFVEPQRHLFVHISQLESVDVRQIYDKFPEKKGGLRELYDRGPPHAFFLVKFWADLNWGFYGVSSQY ESLEHTLTCSSKVCSFGKQVVEKVETERAQLEDGRFVYRLLRSPCEYLVNFLHKLRQLPERYNSVLENFTILQVVTNRDT QELLLCTAYVFEVSTSERGAQHHIYRLVR ...TTTT..................................TTTTTT................................. .............TTTTT.............TTTT...........................TTTT..........TTTT T.............TTTT........... 263 3L23.A mol:aa ISOMERASE GKEIGLQIYSLSQELYKGDVAANLRKVKDGYSKLELAGYGKGAIGGVPDFKKAEDAGLKIISSHVNPVDTSISDPFKAIF KYSKEVTPKIEYWKATAADHAKLGCKYLIQPPTITTHDEAKLVCDIFNQASDVIKAEGIATGFGYHNHNEFNRVATKEQQ FKVGDQIYDLLKDTDPSKVYFEDVYWTVGQNDPVEYQKHPDRIKVLHIKDRAVFGQSGNFEIFKQYANGIKDYFVELEQP DGRTQFAGVKDCADYLIKAPFVK ................TTTT...................TTTTTTTT.TTTTTTTTT...........TTTT.TTTTT.. ..TTTTTTTT................................................TTTT.............TTTTT .....TTTTTTTTTTTTTT............TTTTTTTTTTTT......TTTTTTT.....TTTTTTTT........... ..................TTTT. 123 3L29.A mol:aa RNA BINDING PROTEIN DISAKDLRNIMYDHLPGFGTAFHQLVQVICKLGKDSNSLDIIHAEFQASLAEGDSPQCALIQITKRVPIFQDAAPPVIHI RSRGDIPRACQKSLRPVPPSPAIDAGWVCVFQLQDGKTLGLKI ................TTTT.................................................TTTT....... .................TTTT...........TTTT....... 94 3L3E.A mol:aa CELL CYCLE KPLHKVVVCVSKKLSKKQSELNGIAASLGADYRRSFDETVTHFIYQGRPNDTNREYKSVKERGVHIVSEHWLLDCAQECK HLPESLYPHTYNGS TTTTTT..........................TTTTTTTT.......TTTT............................. .......TTTT... 107 3L4H.A mol:aa PROTEIN BINDING LLLQSPAVKFITNPEFFTVLHANYSAYRVFTSSTCLKHILKVRRDARNFERYQHNRDLVNFINFADTRLELPRGWEIKTD QQGKSFFVDHNSRATTFIDPRIPLQNG .TTTT.......TTTT....TTTT........TTTTTT.............TTTT..........TTTT..TTTT....T TTT.....TTTTT.....TTTT.TTTT 163 3L51.B mol:aa CELL CYCLE GKVLDAIIQEKKSGRIPGIYGRLGDLGAIDEKYDIAISSCCHALDYIVVDSIDTAQECVNFLKKHNIGIATFIGLDKTVW AKKSKIQTPENTPRLFDLVKVKNEEIRQAFYFALRDTLVANNLDQATRVAYQRDRRWRVVTLQGQIIEQSGTSGGLEHHH HHH .............TTTT........................................................TTTTTTT TTT..............................TTTT..............TTTT.....TTTT...TTTT.......TT TT. 226 3L6T.A mol:aa HYDROLASE SNAMLDITTITRQNVTSVVGYYSDAKDDYYSKDSSFTSWQGTGAEALGLSGDVESARFKELLVGEIDTFTHMQRHVGDAK KERLGYDLTFSAPKGVSMQALIHGDKTIIEAHEKAVAAAVREAEKLAQARTTRKSVTQNTNNLVVATFRHETSRALDPDL HTHAFVMNMTQREDGQWRALKNDELMRNKMHLGDVYKQELALELTKAGYELRYNSKNNTFDMAHFS ..TTTT...........TTTT..TTTTTTTTTTTTTTT..........................TTTTTT.......... ....................TTTT.................................................TTTTT.. ...........TTTT......................................TTTTT...TTTTT 86 3L7H.A mol:aa PROTEIN TRANSPORT LKRIQSHKGVVGTIVVNNEGIPVKSTLDNTTTVQYAGLMSQLADKARSVVRDLDPSNDMTFLRVRSKKHEIMVAPDKDFI LIVIQN ......TTTT......TTTT.................................TTTT........TTTT......TTTT. ...... 150 2X2S.A mol:aa CELL ADHESION GFKGVGTYEIVPYQAPSLNLNAWEGKLEPGAVVRTYTRGDKPSDNAKWQVALVAGSGDSAEYLIINVHSGYFLTATKENH IVSTPQISPTDPSARWTIKPATTYEVFTINNKVSELGQLTVKDYSTHSGADVLSASAKTADNQKWYFDAK ...........TTTTTTT.....TTTT...........TTTT.......................TTTTT......TTTT .......TTTT........TTTT.......TTTT............TTTT.................... 236 2X2U.A mol:aa TRANSFERASE LYFSRDAYWEKLYVDQAAGTPLLYVHALRDAPEEVPSFRLGQHLYGTYRTRLHENNWIRIQEDTGLLYLQRSLDHSSWEK LSVRNRGFPLLTVYLKVFLSECQWPGCARVYFSFFNTSFPACSSLKPRELCFPETRPSFRIRENRPPGTFHQFRLLPVQF LCPQISVAYRLLEGEGLPFRSAPDSLEVSTRWALDREQREKYELVAVCTVHREEVVMVPFPVTVYDEDDSAPEFEN ..TTTTTT....TTTTTTTT........TTTTTT....................TTTT..TTTTT...TTTT........ ....TTTT..............TTTTT.........................TTTT.....TTTT............... .TTTT.....TTTTTTT....TTTT.........TTTTT..........................TTTT....... 174 2X32.A mol:aa CARBOHYDRATE-BINDING PROTEIN HLAYSLDATASFLNFVSSKKTHVLETHRFDVLSGGISTAGEAQLVIDLNSVNTGIDVRNGRMRDYLFETATYSVATVTVP VDLAAVAGLAVGEDMLVDVSATLDLHGVPGVIDTQLNVQRLSATRIMVQNQSPLLIKAADYSLEAGIETLRNLASLNVIS TTVPVDFVLFYEAP ......TTTTT.......TTTTT.............TTTT...........................TTTTTT....... .........TTTT..........TTTTT.............TTTT................................... .............. 113 2X3G.A mol:aa VIRAL PROTEIN GDLKKVLNFHFSYIYTYFITITTNYKYGDTEKIFRKFRSYIYNHDKNSHVFSIKETSNGLHYHILVFTNKKLDYSRVHKH PSHSDIRIELVPKSISDIKNVYKYLKTKKDIKS .........TTTT...............................TTTT................................ TTTT............................. 109 2X4J.A mol:aa VIRAL PROTEIN FFTNKIGCNVSSPLKHVDIVGEIVEEAVYNFLIDAGDKMCVGNKIGVWKVSRKSLYAKVPKGIGVTVYLANGRVQGRLID IGVYEVLVEEVGDIIYIHKDLVYALCWPK .TTTTTTT.....TTTT.........TTTT....TTTT....TTTT.............TTTT.....TTTT........ .TTTT...TTTTT........TTTT.... 62 2X4K.A mol:aa ISOMERASE SMMPIVNVKLLEGRSDEQLKNLVSEVTDAVEKTTGANRQAIHVVIEEMKPNHYGVAGVRKSD .....................................................TTTTTTTTT 125 2X4W.A mol:aa TRANSCRIPTION DSPLDALDLVWAKCRGYPSYPALIIDPKMPREGMFHHGVPIPVPPLEVLKLGEQMTQEAREHLYLVLFFDNKRTWQWLPR TKLVPLGVNQDLDKEKMLEGRKSNIRKSVQIAYHRALQHRSKVQG ....TTTT.....TTTT........TTTTTTTT.TTTTT..............................TTTT....... ....TTTTTT................................... 274 2X55.A mol:aa HYDROLASE QLIPNISPDSFTVAASTGMLSGKSHEMLYDAETGRKISQLDWKIKNVAILKGDISWDPYSFLTLNARGWTSLASGSGNMD DYDWMNENQSEWTDHSSHPATNVNHANEYDLNVKGWLLQDENYKAGITAGYQETRFSWTATGGSYSYNNGAYTGNFPKGV RVIGYNQRFSMPYIGLAGQYRINDFELNALFKFSDWVRAHDNDEHYMRDLTFREKTSGSRYYGTVINAGYYVTPNAKVFA EFTYSKYDESIGGDAAGISNKNYTVTAGLQYRFG ......TTTT...................TTTTT......................TTTTTT.................. .....TTTT..............................TTTT.......................TTTTTTT...TTTT .....................TTTT.............................................TTTTTT.... .................................. 189 2X5N.A mol:aa NUCLEAR PROTEIN VLEATMILIDNSEWMINGDYIPTRFEAQKDTVHMIFNQKINDNPENMCGLMTIGDNSPQVLSTLTRDYGKFLSAMHDLPV RGNAKFGDGIQIAQLALKHRENKIQRQRIVAFVGSPIVEDEKNLIRLAKRMKKNNVAIDIIHIGELSALQHFIDAANSSD SCHLVSIPPSPQLLSDLVNQSPIGQGVVA .................TTTTTTT..................TTTT.......TTTT.................TTTT.. ...................TTTTTT.....................................TTTT..........TTTT T...................TTTT..... 84 2XCJ.A mol:aa VIRAL PROTEIN SNTISEKIVLMRKSEYLSRQQLADLTGVPYGTLSYYESGRSTPPTDVMMNILQTPQFTKYTLWFMTNQIAPESGQIAPAL AHFG ......................................TTTT...................................... .... 89 2XDG.A mol:aa SIGNALING PROTEIN MLREDESACLQAAEEMPQTTLGCPATWDGLLCWPTAGSGEWVTLPCPDFFSHFSSESGAVKRDCTITGWSEPFPPYPVAC PVPLELLAE ...................TTTT....TTTT.....TTTT........................TTTT...TTTT..... ......... 138 2XDH.A mol:aa CELL ADHESION AKTTIIAGSAEAPQGSDIQVPVKIENADKVGSINLILSYPNVLEVEDVLQGSLTQNSLFDYQVEGNQIKVGIADSNGISG DGSLFYVKFRVTTLRNSHALTLQGIEIYDIDGNSVKVATINGTFRIVSQEEAHHHHHH ............TTTT.......................TTTT.......TTTTTTT......TTTT............. ............................TTTT..............TTTT........ 48 2XF7.A mol:aa VIRAL PROTEIN ESLLYGYFLDSWLDGTASEELLRVAVNAGDLTQEEADKIMSYPWGAWN .TTTT.......................TTTT..........TTTTTT 108 2XFV.A mol:aa CELL-CYCLE ALEEVVRYLGPHNEIPLTLTRDSETGHFLLKHFLPILQQYHDTGNINETNPDSFPTDEERNKLLAHYGIAVNTDDRGELW IELEKCLQLLNMLNLFGLFQDAFEFEEP .........TTTT........TTTTT.......................TTTT....................TTTT... ..............TTTTTT........ 147 2XG5.B mol:aa CHAPERONE AAFHGEVVRPACTLAMEDAWQIIDMGETPVRDLQNGFSGPERKFSLRLRNCEFNSQGGNLFSDSRIRVTFDGVRGETPDK FNLSGQAKGINLQIADVRGNIARAGKVMPAIPLTEEALDYTLRIVRNGKKLEAGNYFAVLGFRVDYE TTTT.............TTTT.................................TTTTTTTTTTTTT.........TTTT ...............TTTT...TTTT......................................... 229 2XLG.A mol:aa METAL BINDING PROTEIN IHTFDDIPMPKLADPLLIYTPANEIFDIASCSAKDIGFAIAHAQIPPGGGPMPHIHYFINEWFWTPEGGIELFHSTKQYP NMDELPVVGGAGRGDLYSIQSEPKQLIYSPNHYMHGFVNPTDKTLPIVFVWMRNEVAPDFPYHDGGMREYFQAVGPRITD LNNLPELTAFASEAPKYGINQSSYFMEYVNTISDKLPAQIAKLKNDKDLERMVEVIEAFNRGDKSVTCS ...........TTTT....TTTT.........TTTT.........TTTT......TTTT.....TTTT............ TTTTT.TTTT...........TTTT....TTTT.......................TTTT.TTTT............TTT TTTT....TTTTTT.......TTTTTTTT.................................TTTTT.. 63 2XMJ.A mol:aa CHAPERONE TIQLTVPTIACEACAEAVTKAVQNEDAQATVQVDLTSKKVTITSALGEEQLRTAIASAGHEVE .....TTTT.............TTTTTTT....TTTTT......................... 144 3AG3.D mol:aa OXIDOREDUCTASE SVVKSEDYALPSYVDRRDYPLPDVAHVKNLSASQKALKEKEKASWSSLSIDEKVELYRLKFKESFAEMNRSTNEWKTVVG AAMFFIGFTALLLIWEKHYVYGPIPHTFEEEWVAKQTKRMLDMKVAPIQGFSAKWDYDKNEWKK ......TTTT.....TTTTTTT..TTTT.................................................... ...........................................TTTTTTT.....TTTTT.... 80 3L9A.X mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION RDFFVITNSEYTFAGVHYAKGAVLHVSPTQKRAFWVIADQENFIKQVNKNIEYVEKNASPAFLQRIVEIYQVKFEGKNVH ...........TTTTT..TTTT....TTTTT................TTTT............................. 104 3LAX.A mol:aa LIGASE SNADDIILKGVNIFPIQIETILLQFKELGSDYLITLETAESNDETVEVELSQLFTDDYGRLQALTREITRQLKDEILVTP RVKLVPKGALPKSAVRVKDLRKTF ......TTTTT.............TTTT.........TTTTT........TTTT.......................... .....TTTT............... 82 3LDC.A mol:aa TRANSPORT PROTEIN VPATRILLLVLAVIIYGTAGFHFIEGESWTVSLYWTFVTIATVGYGDYSPHTPLGMYFTCTLIVLGIGTFAVAVERLLEF LI .............................................TTTT............................... .. 370 3LDU.A mol:aa TRANSFERASE KNYTLISPCFFGEKLAREITNLGYEIIKTEDGRITYKTDEFGIAKSNWLRCAERVHLKIAEFEAKSFDELFENTKRINWS RYIPYGAQFPISKASSIKSKLYSTPDVQAIVKKAIVESLKKSYLEDGLLKEDKEKYPIFVFIHKDKVTISIDTTGDALHK RGYREKKAPIRETLAAGLIYLTPWKAGRVLVDPCGSGTILIEAAIGINAPGLNREFISEKWRTLDKKIWWDVRKDAFNKI DNESKFKIYGYDIDEESIDIARENAEIAGVDEYIEFNVGDATQFKSEDEFGFIITNPPYGERLEDKDSVKQLYKELGYAF RKLKNWSYYLITSYEDFEYEFGQKADKKRKLYNGLKTNFFQYPGPKPPRN .............................TTTT.....TTTT......TTTT............................ ...TTTT........TTTTTTT.......................................TTTTT.....TTTTTTTTT T.......................TTTT.....TTTT.......TTTTTTTTTT......TTTT................ TTTT...........................................TTTT............................. ..TTTT.......TTTT.......TTTT...................... 55 3LE4.A mol:aa NUCLEAR PROTEIN PPTEPLPDGWIMTFHNSGVPVYLHRESRVVTWSRPYFLGTGSIRKHDPPLSSIPC ......TTTT....TTTT.....TTTTT.........TTTTTTTTT......... 120 3LFR.A mol:aa TRANSPORT PROTEIN LQVRDIVPRSQISIKATQTPREFLPAVIDAAHSRYPVIGESHDDVLGVLLAKDLLPLILKADGDSDDVKKLLRPATFVPE SKRLNVLLREFRANHNHAIVIDEYGGVAGLVTIEDVLEQI .......TTTT...TTTT...................TTTTTTTT..............TTTT...............TT TT...................TTTT............... 176 3LGB.A mol:aa TRANSFERASE SDEINAQSVWSEEISSNYPLCIKNLEGLKKNHHLRYYGRQQLSLFLKGIGLSADEALKFWSEAFTNTEKFNKEYRYSFRH NYGLEGNRINYKPWDCHTILSKPRPGRGDYHGCPFRDWSHERLSAELRSKLTQAQIISVLDSCQKGEYTIACTKVFETHT HIAHPNLYFERSRQLQ TTTTTTTT........................................................................ ....TTTT.................TTTT................................................... ................ 120 3LHE.A mol:aa TRANSCRIPTION REGULATOR VYGSEVESKIIEFTIVGADEIIAEKLGISVGDFVYKIIRLRIIHSIPTIEHTWPISVIPGVELGLQVGTSVVRVKGIRPD DKEKQFNLTNQDFLRVEQVAYLTDGRTFEYSYADHLPETF .TTTTTTT....................TTTT.........TTTTT.......TTTTTT..................... ........TTTT.........TTTT............... 250 3LHO.A mol:aa HYDROLASE GHTDVNALFAALWQDYIKTPSAAKIHQLLGHGAPIINDHIALRTFNIAKVNLSVLAKHFTSIGYVDSGDYKFEQKKLIAK HFEHPDPKQPKVFISELLVEEFSPEVQKSIHGLIDQVDIAATTADNFIYSGRHWDVDKATYQALLAESEYAAWVAALGYR ANHFTVSINDLPEFERIEDVNQALKQAGFVLNSSGGEVKGSPEVLLEQSSTADKVVVNFTDGDVEIPSCFYEFARRYPAN GQLYTGFVAA .............................TTTT......................................TTTTT.... ...TTTTTT..................................TTTT................................. ......TTTTTTTT..................TTTTTTT...................TTTT.................. .......... 92 3LHR.A mol:aa TRANSCRIPTION REGULATOR GSPDPEIFRQRFRQFGYQDSPGPREAVSQLRELCRLWLRPETHTKEQILELVVLEQFVAILPKELQTWVRDHHPENGEEA VTVLEDLESELD ......................................TTTTT..................................... ............ 132 3LLO.A mol:aa MOTOR PROTEIN SPSYTVLGQLPDTDVYIDIDAYEEVKEIPGIKIFQINAPIYYANSDLYSSANIHTVILDFTQVNFMDSVGVKTLAGIVKE YGDVGIYVYLAGCSAQVVNDLTSNRFFENPALKELLFHSIHDAVLGSQVREA TTTT.....TTTT....TTTTTTTT..TTTT............................TTTT................. .........TTTT...........TTTTTT................TTTT.. 422 3LM3.A mol:aa STRUCTURE GENOMICS, UNKNOWN FUNCTION EPLTIEGNRFVTLCIIRTTPWEVSRDVKLHPRDEVDWHTLEGVRALREAFATNNPNGRLTWGFTNALEDGRKNYREIRDY VVECQKKYGDEVTYFPGYFPAYLPRERVNRESEAIEIISKVGNGYRPQSIGGFLSADNLRYLAEKENIHVAHAVIWSQHN GGGADGSPSYPFYPSTEHFCKPAQGKSDFIDCVNLDGWTDFICARRSGQTGHGIDGYNSRRGVGPIETYKGWGLDLGHRE VHTEAIHFDKGLELNGFGWVANIWEAQVHEFGKDLICDAKWVTGTKERWPDTHFVTFGEFGELWRKQYKSNDDWNYRFVE RGSGLGDSYNNLEIKWFNKEFRLALLRDWHTKNSPAYVIDFTRYDLQAHEPADPSPEKPAKDWSLINKINQKALRPQDKP VLIDKLEKEDQDLIRKYYPELL ....TTTT........TTTTTTTTTTTTTTT...................TTTTTTT.......TTTTT........... ................TTTT.....................TTTT.....TTTT....................TTTT.. ..............TTTT..................................TTTT.TTTTTTT................ ...........................TTTTT................TTTT.................TTTTT...... ....TTTTTTTT.....TTTT......TTTTTTT........TTTT........TTTTT..........TTTT....... ...................... 126 3LQ9.A mol:aa SIGNALING PROTEIN DEHLCANLQLLQESLAQARLGSRRPARLLPSQLVSQVGKELLRLAYSEPCGLRGALLDVCVEQGKSCHSVGQLALDPSLV PTFQLTLVLRLDSRLWKIQGLFSSANSPSQSLTLSTGFRVIKKKLY ..................TTTT.........................TTTT...........TTTT.......TTTTTT. ...........TTTTT...TTTTTTT.................... 125 3LR2.A mol:aa STRUCTURAL PROTEIN HTTPWTNPGLAENFMNSFMQGLSSMPGFTASQLDDMSTIAQSMVQSIQSLAAQGRTSPNKLQALNMAFASSMAEIAASEE GGGSLSTKTSSIASAMSNAFLQTTGVVNQPFINEITQLVSMFAQA ..TTTTT......................................................................... .....................TTTT.................... 104 3LR4.A mol:aa TRANSFERASE AQRVALQLVAIVKLTRTALLYSDPDLRRALLQDLESNEGVRVYPREKTDKFKLQPDESVNRLIEHDIRSRLGDDTVIAQS VNDIPGVWISFKIDDDDYWVALDR .............................................TTTT.......TTTT...........TTTT....T TTTT........TTTT........ 115 3LS0.A mol:aa PHOTOSYNTHESIS TYSPEKIAQLQVYVNPIAVARDGMEKRLQGLIADQNWVDTQTYIHGPLGQLRRDMLGLASSLLPKDQDKAKTLAKEVFGH LERLDAAAKDRNGSQAKIQYQEALADFDSFLNLLP ............................................TTTTTTT............................. ................................... 135 3LUC.A mol:aa RNA BINDING PROTEIN KQFHTGIEIKVWAIACFAPQRQCTEVHLKSFTEQLRKISRDAGMPIQGQPCFCKYAQGADSVEPMFRHLKNTYAGLQLVV VILPGKTPVYAEVKRVGDTVLGMATQCVQMKNVQRTTPQTLSNLCLKINVKLGGV ........TTTT......TTTTTT.........................TTTT...................TTTT.... ...TTTT................................................ 262 3LUM.A mol:aa HYDROLASE RMEIVKIPVVVHVVWNEEEENISDAQIQSQIDILNKDFRKLNSDVSQVPSVWSNLIADLGIEFFLATKDPNGNQTTGITR TQTSVTFFTTSDEVKFASSGGEDAWPADRYLNIWVCHVLKSEIGQDILGYAQFPGGPAETDGVVIVDAAFGTTGTALPPF DKGRTATHEIGHWLNLYHIWGDELRFEDPCSRSDEVDDTPNQADPNFGCPSYPHVSCSNGPNGDMFMNYLDYVDDKCMVM FTQGQATRVNACLDGPRSSFLA ..............TTTT..................................................TTTT........ ........TTTT.............TTTTT..........TTTT........TTTT................TTTTTTTT TTT..............TTTT..TTTTTTTT....TTTT...........TTTT.TTTTTTTT.TTTTTTT......... .............TTTTT.... 152 3LUR.A mol:aa TRANSCRIPTION ACTIVATOR GEYQLQQLASLTLVGIKETYENGRQAQQHIAGFWQRCYQEGVIADLQLKNNGDLAGILGLCIPELDGKSYIAVTGDNSAD IAKYDVITLASSKYVFEAQGAVPKAVQQKEEVHHYIHQYQANTVKSAPFFELYQDGDTTSEKYITEIWPVKG ...................................................TTTT........TTTT............. ....................TTTT...............TTTTT............TTTTTTT......... 87 3LUU.A mol:aa BIOSYNTHETIC PROTEIN SDPRTQPLEIRPLISRVEVDWADGHTSRLTFEHLRVECPCAQIVTGKEHVSVVEVVPVGHYAVQLHFSDGHNTGIFTWEY LRRLDAE .TTTTT..............TTTT......................TTTT.......TTTTT....TTTT.......... ....... 99 3LWC.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION KRKFTIADASLERSPGQEADISVGNLGPITIGYGRYAPGQSLTETAVDDVIVLEGRLSVSTDGETVTAGPGEIVYPKGET VTIRSHEEGALTAYVTYPH ....................................TTTT....................TTTT....TTTT...TTTT. ................... 120 3LWG.A mol:aa UNKNOWN FUNCTION EGLLVCTRLDQNLCAELISFGSGKATVCLTPKEFMLAEDDVVHAGFIVGAASFAALCALNKKNSLISSMKVNLLAPIEIK QEIYFNATITHTSSKKSTIRVEGEFMEIKVFEGDFEILVF .....TTTT...........TTTT...........TTTTTT...................TTTT.............TTT T...........TTTTT......TTTTT............ 181 3LXR.F mol:aa SIGNALING PROTEIN/RHOA-BINDING PROTEIN NFGISLSHKRYFSGKVDEIIRCTMGKRIVKISSTKINTSILSSVSEQIGENITDWKNDEKKVYVSRVVNQCIDKFCAEHS RKIGDNLRKQIFKQVEKDYRISLDINAAQSSINHLVSGSSYFKKKMDELCEGMNRSVKNDTTSNVANLISDQFFEKNVQY IDLKKLRGNMSDYITNLESPF ..........TTTT........TTTTT.....................TTTT............................ .......................TTTT......................TTTT........................... ..................... 154 3LYD.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION PSIRYPSTEFPALTGFTVPIPETWQPDPTGTQFAARPHTPPQGFTPNIIGTVRRAATGALHNQRTELDQRATQLPDYAER GRTETTVDGFPAYHIEYAYRHHGTITIAQITLVEVSHPHAVDIIQLTATCAGDQTADYWDTFRLHADLTVQPHG ...TTTTTTTT.........TTTT......TTTT..TTTT...............TTTT..............TTTT... .....TTTTT.........TTTTT............TTTT........................TTTT..TTTT 116 3LYG.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION GNLANIVQRGWEALGAGDFDTLVTDYVEKIFIPGQADVLKGRQAFRSALDNLGEILPPGFEITGLRQLEGENEIVSIVEW KSDKIASQLSVLFKFEGDQIYEERWFVDTEQWKSVF .................................TTTT...................TTTT.........TTTT....... ..............TTTTT................. 118 3LYH.A mol:aa LYASE PHQIILLAHGSSDARWCETFEKLAEPTVESIENAAIAYELAEPSLDTIVNRAKGQGVEQFTVVPLFLAAGRHLRKDVPAI ERLEAEHGVTIRLAEPIGKNPRLGLAIRDVVKEELERS ..............................TTTT......TTTT.....................TTTT......TTTT. ...................................... 101 3LYY.A mol:aa CELL ADHESION THATSTETIHYVNEDGDQVFEDGGGKLDFTRTVTIDDVTNEVVEYGEWTPVTDDEFAAVTSPDKDGYTPDTSEVAAQKPD TDGPDGTVKDVEVTVTYTANP ............TTTT...................TTTTT.........TTTT..........TTTT.TTTTTT...... .TTTTT............... 261 3M1T.A mol:aa HYDROLASE GHSAALLQKVDELPRLPKAIAELLDVVNNEDSTVKAVSEKLSHDPVLSARVLRLANSAEVGTIDDAVVRLGQTLRTLVIA SAVVGAVPKVEGFDLADFWGNTFEVAIICQELAKRLGTLPEEAFTCGILHSIGELLIVNGDPAVAATISAAVADGADRNL EKELLGYDNAEIGALLAQSWKFTPHLVKGIQFQNHPKSAEPYSKLAGLAAKQIAADWDKIPDDERTSWLAQINILAGIKV DLGGLAEKLAKHGQGEGKQLA ............................TTTT................................................ .........TTTT...................................TTTT........................TTTT ...............................TTTTT..TTTTTTTTT.............TTTTT............... .TTTT...........TTTTT 270 3M66.A mol:aa TRANSCRIPTION DYVDHSETLQKLVLLGVDLSKIEKHPEAANLLLRLDFEKDIKQMLLFLKDVGIEDNQLGAFLTKNHAIFSEDLENLKTRV AYLHSKNFSKADVAQMVRKAPFLLNFSVERLDNRLGFFQKELELSVKKTRDLVVRLPRLLTGSLEPVKENMKVYRLELGF KHNEIQHMITRIPKMLTANKMKLTETFDFVHNVMSIPHHIIVKFPQVFNTRLFKVKERHLFLTYLGRAQYDPAKPNYISL DKLVSIPDEIFCEEIAKASVQDFEKFLKTL ................................................................TTTT............ ...................TTTT......................................................... ......................................................................TTTTTTT... .............................. 130 3M6J.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION VSDRPAGRPLTVHRNVGRWLSEILHASIRDTGVSSRIEFVRRTLHGWVREEYSETELPNAVYRNLYFPGSGKIETISECD RLKNLVRNVTDTLVENYPQGLESEALLIALDGVKLELARIRKDIEYGDPR ..........................TTTTT.....................TTTTTT...................... .................................................. 142 3M7K.A mol:aa HYDROLASE/DNA MTQCPRCQRNLAADEFYAGSSKMCKGCMTWQNLSYNANKEGHANTFTKATFLAWYGLSAQRHCGYCGISEAGFTSLHRTN PRGYHIQCLGVDRSDSFEGYSPQNARLACFICNRIKSNIFSASEMDVLGEAISKAWHGRGIA ...TTTTT......................................................TTTTT............T TTT.........TTTTTT..TTTTT..........TTTT....................... 137 3M7O.A mol:aa IMMUNE SYSTEM GWPKHTACNSGGLEVVYQSCDPLQDFGLSIDQCSKQIQSNLNIRFGIILRQDIRKLFLDITLMAKGSSILNYSYPLCEED QPKFSFCGRRKGEQIYYAGPVNNPGLDVPQGEYQLLLELYNENRATVACANATVTSS .........TTTT.......TTTT.....TTTTTTTTTTTT...........TTTT......TTTTT..........TTT TTTTTTTTTTTTT...........................TTTT............. 90 3M8J.A mol:aa TRANSCRIPTION GGDAFLLKLRESALSSGSMSEEQFFLLIGISSIHSDRVILAMKDYLVSGHSRKDVCEKYQMNNGYFSTTLGRLTRLNVLV ARLAPYYTDS ..............TTTT...........................TTTT............................... .......... 89 3M9Q.A mol:aa DNA BINDING PROTEIN LRDETPLFHKGEIVLCYEPDKSKARVLYTSKVLNVFERRNEHGLRFYEYKIHFQGWRPSYDRAVRATVLLKDTEENRQLQ RELAEAAKL ........TTTT.......TTTT................TTTT.........TTTT........................ ......... 85 3MAB.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION LANLSELPNIGKVLEQDLIKAGIKTPVELKDVGSKEAFLRIWENDSSVCMSELYALEGAVQGIRWHGLDEAKKIELKKFH QSLEG ......TTTT..................................TTTT................................ ..... 181 3MAL.A mol:aa PLANT PROTEIN VEITYGSAIKLMHEKTKFRLHSHDVPYGSGSGQQSVTGFPGVVDSNSYWIVKPVPGTTEKQGDAVKSGATIRLQHMKTRK WLHSHLHASPISGNLEVSCFGDDTNSDTGDHWKLIIEGSGKTWKQDQRVRLQHIDTSGYLHSHDKKYQRIAGGQQEVCGI REKKADNIWLAAEGVYLPLNE ...TTTT.....TTTTT...........TTTT........TTTT.........TTTT..TTTT..TTTT.....TTTTT. ........TTTTT........TTTT..................TTTT.....TTTTT.........TTTTTTTT...... ..................... 54 3MCB.A mol:aa CHAPERONE AMSKLGLRQVTGVTRVTIRKSKNILFVITKPDVYKSPASDTYIVFGEAKIEDLS .........TTTT.....TTTTTT...TTTT....TTTT............... 58 3MCB.B mol:aa CHAPERONE VNNISGIEEVNMFTNQGTVIHFNNPKVQASLAANTFTITGHAETKQLTEMLPSILNQL .............TTTT....TTTT....TTTTT...............TTTT..... 295 3MCQ.A mol:aa TRANSFERASE LIQRYFRRAHPSAVLGVGDDAALIQPSPGELAVSADLVANTHFYPNIDPWLIGWKSLAVNISDAAGAQPRWATLTIALPE ADEDWISKFAAGFFACAAQFDIALIGGDTTRGPLTISVQIGETPPGASLLRSTARADDDIWVSGPLGDAALALAAIQGRY PLSDTELAACGKALHQPQPRVVLGQALRGLAHSALDISDGLLADLGHILEHSQVGAEVWLKAIPKSEVVSAHSQEVAIQK ILSGGDDYELCFTASTQHRQQIADIGRQLSLDAVIGRITDTQQLVIHGLDDAPLT ...TTTT....TTTTT.......................TTTTT.................................... ...........................................TTTT..TTTT.TTTT...................... ..........................TTTTT.....TTTT...............................TTTT..... TTTT..................................TTTT.....TTTT.... 128 3MDP.A mol:aa NUCLEOTIDE BINDING PROTEIN ISPERLRVYRFFASLTDEQLKDIALISEEKSFPTGSVIFKENSKADNLLLLEGGVELFYSSTVCSVVPGAIFGVSSLIKP YHYTSSARATKPVRVVDINGARLRESENNQALGQVLNNVAAAVLARLH .TTTT...........................TTTT...TTTT.......................TTTT.......TTT T.......TTTT.............TTTTT.................. 302 3MG1.A mol:aa CAROTENOID BINDING PROTEIN FTIDSARGIFPNTLAADVVPATIARFSQLNAEDQLALIWFAYLEMGKTLTIAAPGAASMQLAENALKEIQAMGPLQQTQA MCDLANRADTPLCRTYASWSPNIKLGFWYRLGELMEQGFVAPIPAGYQLSANANAVLATIQGLESGQQITVLRNAVVDMG FRIAEPVVPPQDTASRTKVSIEGVTNATVLNYMDNLNANDFDTLIELFTSDGALQPPFQRPIVGKENVLRFFREECQNLK LIPERGVTEPAEDGFTQIKVTGKVQTPWFGGNVGMNIAWRFLLNPEGKIFFVAIDLLASPKE .......TTTTTT................................................................... .....................................TTTT..TTTT................................. ....................TTTT...............................TTTT................TTTT. .........................TTTTT.............TTTT...........TTTT 91 3MHS.B mol:aa HYDROLASE/TRANSCRIPTION REGULATOR/PROTEI TAQLKSQIQQYLVESGNYELISNELKARLLQEGWVDKVKDLTKSEMNINESTNFTQILSTVEPKALEMVSDSTRETVLKQ IREFLEEIVDT ................................................................................ ........... 127 3MR0.A mol:aa TRANSCRIPTION REGULATOR ERFQLAVSGASAGLWDWNPKTGAYLSPHFKKIGYEDHELPDEITESIHPDDRARVLAALKAHLEHRDTYDVEYRVRTRSG DFRWIQSRGQALWNSAGEPYRVGWIDVTDRKRDEDALRVSREELRRL .................TTTTT.................TTTT....TTTTT........................TTTT .............TTTT.............................. 135 3MSW.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION GTNNGKQFIHNDTEGGKLVCREIYANDAASGILNPVKYKYSYDTDQQKTVKSTYAWNIFKNTWETESRTVISRYETETSV EYSVWNKEKGSFDLSKKYIYITDNNNQLIAQYAYKNSRTNQWILEKDALTPIYEN TTTTT........TTTT.........TTTTT...........TTTTT.........TTTTT............TTTT... .....TTTTT............TTTT.........TTTTT.........TTTTT. 279 3MT0.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION SNAQAIRSILVVIEPDQLEGLALKRAQLIAGVTQSHLHLLVCEKRRDHSAALNDLAQELREEGYSVSTNQAWKDSLHQTI IAEQQAEGCGLIIKQHFPDNPLKKAILTPDDWKLLRFAPCPVLTKTARPWTGGKILAAVDVGNNDGEHRSLHAGIISHAY DIAGLAKATLHVISAHPTFQLSETIEARYREACRTFQAEYGFSDEQLHIEEGPADVLIPRTAQKLDAVVTVIGTVARTGL SGALIGNTAEVVLDTLESDVLVLKPDDIIAHLEELASKE .............TTTT............................................................... .................TTTTTTTTT.......................TTTT......TTTTT................ ..........................................TTTTT...................TTTT....TTTT.. ..TTTT......TTTT....................... 233 3MW8.A mol:aa LYASE GKLLLTRPEGKNAAASALDALAIPYLVEPLLSVEAAAVTQAQLDELSRADILIFISTSAVSFATPWLKDQWPKATYYAVG DATADALALQGITAERSPQATEGLLTLPSLEQVSGKQIVIVRGKGGREAADGLRLRGANVSYLEVYQRACPPLDAPASVS RWQSFGIDTIVVTSGEVLENLINLVPKDSFAWLRDCHIIVPSARVETQARKKGLRRVTNAGAANQAAVLDALG .......TTTTTT.....................................................TTTTT......... .............................TTTTTTT............................................ ......................................................................... 111 3MWZ.A mol:aa HYDROLASE INHIBITOR ELALRGGYRERSNQDDPEYLEAHYATSTWSAQQPGKTHFDTVVEVKVETQTVAGTNYRLTLKVAESTCELTSTYNKDTCQ ANANAAQRTCTTVIYRNQGEKSINSFECAAA ...........TTTTTTTTTT...........TTTT..............TTTTT.............TTTT..TTTTTT .TTTT.......................... 205 3MZO.A mol:aa HYDROLASE GGIHQYFQSLSDLENIYRCPGKFKYQEHSVAEHSYKVTSIAQFFGAVEEDAGNEVNWRALYEKALNHDYSELFIGDIKTP VKYATTELRELSEVEESTKNFISREIPATFQPIYRHLLKEGKDSTLEGKILAISDKVDLLYESFGEIQKGNPENIFVEIY SEALATIYEYREASVKYFLKEILPDLAEKGIEKTELPQLTTEITT ................TTTT...TTTT......................................TTTT........... TTTT............................................................................ .....................TTTT..TTTT.............. 255 3N0R.A mol:aa SIGNALING PROTEIN EHLLARLAPHLPYIRRYARALTGDQATGDHYVRVALEALAAGELVLDANLSPRVALYRVFHAIWLSSGAGHDQGLHAGDD AAQRLRIAPRSRQAFLLTALEGFTPTEAAQILDCDFGEVERLIGDAQAEIDAELATEVLIIEDEPVIAADIEALVRELGH DVTDIAATRGEALEAVTRRTPGLVLADIQLADGSSGIDAVKDILGRDVPVIFITAFPERLLTGERPEPTFLITKPFQPET VKAAIGQALFFHPRR .........................................TTTT.TTTT...............TTTT......TTTT. .....TTTT....................................................................... ....................TTTT..TTTTTTT.TTTTTTT............TTTT..........TTTT.TTTT.... ...........TTTT 140 3N1E.A mol:aa TRANSPORT PROTEIN DQWSMLRHFDHITKDYHDHIAEISAKLVAIMDSLFDKLLSKYEVKAPVPSPCFRNICKQMTKMHEAIFDLLPEEQTQMLF LRINASYKLHLKKQLSHLNVINDGGPQNGLVTADVAFYTGNLQALKGLKDLDLNMAEIWE ............TTTT..............................TTTT................TTTTT......... ............................................TTTTTTT......... 121 3N6Y.A mol:aa UNKNOWN FUNCTION GAQAEVRIDGPIEYGVFESRSEQNIQQTTEVPAKLGTKFGRYQLSGKQEGDTPLTLLYLTPGVVTPDGQRHDKFEVVQKL VPGAPTDVAYEFTEPHEVVKGEWRLVFQGDRLLAEKSFDVR .................................TTTT......TTTTTTTT.............TTTT............ TTTT......................TTTTT.......... 148 3N9B.A mol:aa LIGASE LLRYCVQKHDASRLHYDFRLELDGTLKSWAVPKGPCLDPAVKRLAVQVEDHPLDYADFEGSIPQAGDVIVWDRGAWTPLD DPREGLEKGHLSFALDGEKLSGRWHLIRTNLRGKQSQWFLVKAKDGEARSLDRFDVLKERPDSVLSER .........TTTTT......TTTTT.....TTTT...TTTT....................................TTT T...............TTTT..........TTTTTTT.......TTTT.TTTTT........TTTTT. 224 3NE8.A mol:aa HYDROLASE ASFRVVLDPGHGGIDGGARGVTGILEKDVTLAFARALRDELQKGSHTIVALTRDSDIFLRLSERVKKAQEFDADLFISIH ADTIDVHSLRGATVYTISDEASDAIAKSLAESENKVDLLDGLPKEDILLDLTRRETHAFSINFANNVVSNLSKSHINLIN NPHRYADFQVLKAPDVPSVLIEIGYLSNKEDEKLLNNPQWRKQAASIAYSIRQFAEYRQKIQPL ...........TTTTT...TTTT...........................TTTT..................TTTT.... ...TTTTTT........TTTT........................................................... ............TTTT.....TTTTTTTT................................... 262 3NFT.A mol:aa TRANSPORT PROTEIN AMTDDDLRAAGVDRRVPEQKLGAAIDEFASLRLPDRIDGRFVDGRRANLTVFDDARVAVRGHARAQRNLLERLETELLGG GIQPDPILQGLVDVIGQGKSDIDAYATIVEGLTKYFQSVADVMSKLQDYISAKDDKNMKIDGGKIKALIQQVIDHLPTMQ LPKGADIARWRKELGDAVSISDSGVVTINPDKLIKMRDSLPPDGTVWDTARYQAWNTAFSGQKDNIQNDVQTLVEKYSHQ NSNFDNLVKVLSGAISTLTDTA ...................................TTTTT........................................ ....................................................TTTTT....................... .TTTT.........TTTTT.TTTT.................TTTT................................... ...................... 182 3NKE.A mol:aa IMMUNE SYSTEM ARSDKLLYQAKLALDEDLRLKVVRKFELRFGEPAPARRSVEQLRGIEGSRVRATYALLAKQYGVTWNGRRYDEKGDTINQ CISAATSCLYGVTEAAILAAGYAPAIGFVHTGKPLSFVYDIADIIKFDTVVPKAFEIARRNPGEPDREVRLACRDIFRSS KTLAKLIPLIEDVLAAGEIQPP ..................................TTTT.......................................... ......................TTTTTTTTT.TTTT........TTTTTT...........TTTT............... ...................... 168 3NKG.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION APNPISIPIDLSQAGSVVEKEVKIEESWSYHLILQFAVHDRKEDGGLDGKRVWKFLGFNSYDPRDGKQVGYVDYRLAKSE LGDLIDETYDCDGTVVPIKITIHQINQDNTKKLIADNLYTKGNGSGAYTRDITTISLDKGKYIFRIENIEAFSEIGRKVD FTIYINKR .........TTTTTTT.......................TTTTTTTT..............TTTTT.............. .....TTTTTTTT............TTTT................TTTT............................... ........ 120 3NKL.A mol:aa OXIDOREDUCTASE/LYASE AKKKVLIYGAGSAGLQLANLRQGKEFHPIAFIDDDRKKHKTTQGITIYRPKYLERLIKKHCISTVLLAVPSASQVQKKVI IESLAKLHVEVLTIPNLDDLVNGKLSIGQLKEVSIDDLLG ...................TTTT..............TTTT...........................TTTT........ .........................TTTT........... 270 3NO2.A mol:aa UNKNOWN FUNCTION SPQHLLVGGSGWNKIAIINKDTKEIVWEYPLEKGWECNSVAATKAGEILFSYSKGAKITRDGRELWNIAAPAGCEQTARI LPDGNALVAWCGHPSTILEVNKGEVLSKTEFETGIERPHAQFRQINKNKKGNYLVPLFATSEVREIAPNGQLLNSVKLSG TPFSSAFLDNGDCLVACGDAHCFVQLNLESNRIVRRVNANDIEGVQLFFVAQLFPLQNGGLYICNWQGHDREAGKGKHPQ LVEIDSEGKVVWQLNDKVKFGISTICPIRE ........TTTT......TTTTT........TTTT.......TTTT.....TTTT...TTTT........TTTT...... TTTT.......TTTT................................TTTT.....TTTTT.....TTTT.......... .......TTTT...............TTTTT...............TTTT.....TTTT.......TTTTTTT.....TT TT..TTTT.......TTTTT.......... 120 3NOH.A mol:aa PEPTIDE BINDING PROTEIN SAQLEGSYIFCNPLLDKLSDEDIREQLKAFVTGKTDSIRTDTELSFDIYVSETDYALIRYADSLCERLNDAGADVQIKQY SGTLRSRAVSGKYEAFLSESDLVSTDALENADYIILDSAE ...........TTTT.................TTTT...TTTT.......TTTTT......................... ..................TTTT..............TTTT 111 3NPD.A mol:aa UNKNOWN FUNCTION GASLKDFELSKLEKVAKESSVGTPRAINEDILDQGYTVEGNQLINHLSVRASHAERRSNPDSVRSQLGDSVCSNTGYRQL LARGAILTYSFTEYKTNQPVATERFDAGSCR ...................TTTTTTTTTTTT.......TTTT..............TTTT.................... ............TTTTT.............. 230 3NQI.A mol:aa LIPID BINDING PROTEIN PQQWAGVVKVNDRGYVTFTDAAGTELIPTNTIPVTLNARAYIYCQVDEGQKSIKITLLADPTGIDATAITTPKVGESGDV TTNAPVGSLSFVSGYSTVAPFQFSENTIVLPVLYRVKNVTTTEDIKNELAKHTFTLVCYTDDIKSGDTILKLYLRYKVED EPAAIAERATRTSSFKAYEISQILREYTLKSGQTKPAKITIVAQQNEYNNKLEDTSTIEKVYEIEYKTAE ...................TTTT......TTTT.......................................TTTTTTT. ............TTTT.......TTTT....................................TTTT............. ................................TTTTTTT......TTTT.TTTTTTT............. 101 3NRF.A mol:aa UNKNOWN FUNCTION DAVVFARQGDKGSVSVGDKHFRTQAFKVRLVNAAKSEISLKNSCLVAQSAAGQSFRLDTVDEELTADTLKPGASVEGDAI FASEDDAVYGASLVRLSDRCK ...............TTTT....................TTTT.....TTTT.............TTTTTTTT....... ..TTTT..........TTTT. 102 3NRW.A mol:aa RECOMBINATION RPSLSPREARDRYLAHRQTDAADASIKSFRYRLKHFVEWAEERDITARELTGWKLDEYETFRRGSDVSPATLNGEQTLKN WLEYLARIDVVDEDLPEKVHVP ................TTTTT........................................................... ...........TTTT....... 181 3NS2.A mol:aa HORMONE RECEPTOR KGLTDEEQKTLEPVIKTYHQFEPDPTTCTSLITQRIHAPASVVWPLIRRFDNPERYKHFVKRCRLISGDGDVGSVREVTV ISGLPASTSTERLEFVDDDHRVLSFRVVGGEHRLKNYKSVTSVNEFLNQDSGKVYTVVLESYTVDIPEGNTEEDTKMFVD TVVKLNLQKLGVAATSAPMHD .......................TTTT.....................TTTTT...TTTT....TTTTTTTTTT...... .TTTT...........TTTTT..........TTTT............TTTTT..............TTTT.......... ..................... 70 3NY3.A mol:aa LIGASE LCGRVFKVGEPTYSCRDCAVDPTCVLCMECFLGSIHRDHRYRMTTSGGGGFCDCGDTEAWKEGPYCQKHE ......TTTT....TTTTTTTTTT............................TTTTTTTTTTT..TTTTT 103 3NZN.A mol:aa OXIDOREDUCTASE SNAVNLFGQKDRGNHVSGVDRGKVIMYGLSTCVWCKKTKKLLTDLGVDFDYVYVDRLEGKEEEEAVEEVRRFNPSVSFPT TIINDEKAIVGFKEKEIRESLGF ....TTTT....................TTTT........................................TTTTTTTT .TTTTTT................ 256 3O6C.A mol:aa TRANSFERASE NALLGVNIDHIAVLRQARVNDPDLLEAAFIVARHGDQITLHVREDRRHAQDFDLENIIKFCKSPVNLECALNDEILNLAL KLKPHRVTLVPEKREELTTEGGLCLNHAKLKQSIEKLQNANIEVSLFINPSLEDIEKSKILKAQFIELHTGHYANLHNAL FSNISHTAFALKELDQDKKTLQAQFEKELQNLELCAKKGLELGLKVAAGHGLNYKNVKPVVKIKEICELNIGQSIVARSV FTGLQNAILEKELIKR ...................TTTT...................TTTTTTT............................... ...TTTT..........TTTT..TTTTTTT.................................................. ......TTTT......................................TTTTTTTTTT....TTTT.............. ..........TTTTT. 310 3OAJ.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION AKKTMGIHHITAIVGHPQENTDFYAGVLGLRLVKQTVNFDDPGTYHLYFGNEGGKPGTIITFFPWAGARQGVIGDGQVGV TSYVVPKGAMAFWEKRLEKFNVPYTKIERFGEQYVEFDDPHGLHLEIVEREEGEANTWTFGEVTPDVAIKGFGGATLLSE QPDKTADLLENIMGLERVGKEGDFVRYRSAGDIGNVIDLKLTPIGRGQMGAGTVHHIAWRANDDEDQLDWQRYIASHGYG VTPVRDRNYFNAIYFREHGEILFEIATDPPGFAHDETQETMGEKLMLPVQYEPHRTQIEQGLLPFEVREL .....................................TTTTTTT......TTTTTTTTT.....TTTT.....TTTT... .....TTTT..................TTTTT......TTTT.................TTTTTTTTT..........TT TT..................TTTT.......TTTT..............TTTT........................... ......TTTT......TTTT.......TTTTTTTTTTTTTTTT...........................