95 3JU0.A mol:aa DNA BINDING PROTEIN SLTDSKVKNAKSLEKEYKLTDGFGMHLLVHPNGSKYWRLSYRFEKKQRLLALGVYPAVSLADARQRRDEAKKLLAAGIDP SAKKQADNKTIQEKR ............TTTT.....TTTT....TTTT........TTTTT.......TTTTTT..................... ............... 145 3JUD.A mol:aa TRANSFERASE MALVFVYGTLKRGQPNHRVLRDGAHGSAAFRARGRTLEPYPLVIAGEHNIPWLLHLPGSGRLVEGEVYAVDERMLRFLDD FQSCPALYQRTVLRVQLLEEEPPAPTAVQCFVYSRATFPPEWAQLPHHDSYDSEGPHGLRYNPRE .......TTTTTTTTTT..................TTTT......TTTT......TTTT..................... ..TTTTTT...........TTTT........................TTTTTTTTTTT....... 189 2WQF.A mol:aa OXIDOREDUCTASE SFIKSLENRRTIYALGRNVQDEEKVIETIKEAVRFSPTAFNSQTGRLLILTGDAQDKLWDEIVAPELKAAMEAAKLDGFK AAFGTILFFEDQAVVKNLQEQFALYADNFPVWSEQGSGIISVNVWTALAELGLGANLQHYNPLIDEAVAKEWNLPESWKL RGQLVFGSIEAPAGEKTFMDDADRFIVAK ...........TTTT...TTTT.......................................................... ........................TTTT...............................TTTTTT.........TTTT.. ............................. 127 3F8X.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION SPNAAVQSGLQEWHRIIAEADWERLPDLLAEDVVFSNPSTFDPYHGKGPLVILPAVFSVLENFQYARHFSSKSGYVLEFN ANGDELLTGVDLIEFNDAGKITDLVVRPASVVIDLSVEVGKRIAAAQ ....................................TTTT.....TTTTT.........TTTT.......TTTT...... ...............TTTT............................ 125 2X4W.A mol:aa TRANSCRIPTION DSPLDALDLVWAKCRGYPSYPALIIDPKMPREGMFHHGVPIPVPPLEVLKLGEQMTQEAREHLYLVLFFDNKRTWQWLPR TKLVPLGVNQDLDKEKMLEGRKSNIRKSVQIAYHRALQHRSKVQG ....TTTT.....TTTT........TTTTTTTT.TTTTT..............................TTTT....... ....TTTTTT................................... 109 2X4J.A mol:aa VIRAL PROTEIN FFTNKIGCNVSSPLKHVDIVGEIVEEAVYNFLIDAGDKMCVGNKIGVWKVSRKSLYAKVPKGIGVTVYLANGRVQGRLID IGVYEVLVEEVGDIIYIHKDLVYALCWPK .TTTTTTT.....TTTT.........TTTT....TTTT....TTTT.............TTTT.....TTTT........ .TTTT...TTTTT........TTTT.... 62 2X4K.A mol:aa ISOMERASE SMMPIVNVKLLEGRSDEQLKNLVSEVTDAVEKTTGANRQAIHVVIEEMKPNHYGVAGVRKSD .....................................................TTTTTTTTT 422 3LM3.A mol:aa STRUCTURE GENOMICS, UNKNOWN FUNCTION EPLTIEGNRFVTLCIIRTTPWEVSRDVKLHPRDEVDWHTLEGVRALREAFATNNPNGRLTWGFTNALEDGRKNYREIRDY VVECQKKYGDEVTYFPGYFPAYLPRERVNRESEAIEIISKVGNGYRPQSIGGFLSADNLRYLAEKENIHVAHAVIWSQHN GGGADGSPSYPFYPSTEHFCKPAQGKSDFIDCVNLDGWTDFICARRSGQTGHGIDGYNSRRGVGPIETYKGWGLDLGHRE VHTEAIHFDKGLELNGFGWVANIWEAQVHEFGKDLICDAKWVTGTKERWPDTHFVTFGEFGELWRKQYKSNDDWNYRFVE RGSGLGDSYNNLEIKWFNKEFRLALLRDWHTKNSPAYVIDFTRYDLQAHEPADPSPEKPAKDWSLINKINQKALRPQDKP VLIDKLEKEDQDLIRKYYPELL ....TTTT........TTTTTTTTTTTTTTT...................TTTTTTT.......TTTTT........... ................TTTT.....................TTTT.....TTTT....................TTTT.. ..............TTTT..................................TTTT.TTTTTTT................ ...........................TTTTT................TTTT.................TTTTT...... ....TTTTTTTT.....TTTT......TTTTTTT........TTTT........TTTTT..........TTTT....... ...................... 313 3FQG.A mol:aa PROTEIN BINDING MLREFSFYDVPPAHVPPVSEPLEIACYSLSRDRELLLDDSKLSYYYPPPLFSDLNTGFPNRFHPPKSDPDPISIVKDVLM TKGIQMNSSFLTWRGLITKIMCAPLDPRNHWETYLVMDPTSGIIMMEERTNQDRMCYWGYKFEAISTLPEIWDAQDVVPD EQYCSIVKINIGKSKLILAGEVDCIWDKKPCENPNLHYVELKTSKKYPLENYGMRKKLLKYWAQSFLLGIGRIIIGFRDD NGILIEMKELFTHQIPKMLRPYFKPNDWTPNRLLVVLEHALEWIKQTVKQHPPSTEFTLSYTGGSKLVLRQII .............................TTTT.....TTTT......TTTTTTTTTTTTTT.................. .....................TTTTTTTT........TTTTT...........................TTTTT...TTT TT........TTTT.................................TTTT...........................TT TT.....................TTTT........................TTTT.................. 154 3A57.A mol:aa TOXIN GSDEILFVVRDTTFNTNAPVNVEVSDFWTNRNVKRKPYKDVYGQSVFTTSGTKWLTSYMTVNINDKDYTMAAVSGYKHGH SAVFVKSDQVQLQHSYDSVASFVGEDEDSIPSKMYLDETPEYFVNVEAYESGSGNILVMCISNKESFFECKHQQ ...............TTTT.........TTTTTTTT.TTTTTTT.......TTTT......TTTTT.........TTTTT ............................TTTT......TTTT........TTTT........TTTTT....... 103 3A5P.A mol:aa SUGAR BINDING PROTEIN VWSVQIVDNAGLGANLALYPSGNSSTVPRYVTVTGYAPITFSEIGPKTVHQSWYITVHNGDDRAFQLGYEGGGVATATFT AGGNVSISTGFGDAQHLTLKKLA .......TTTT......TTTT...........TTTT.....................TTTTT.......TTTT......T TTTT................... 65 3FJU.B mol:aa HYDROLASE/HYDROLASE INHIBITOR VRKCLSDTDCTNGEKCVQKNKICSTIVEIQRCEKEHFTIPCKSNNDCQVWAHEKICNKGCCWDLL ..........TTTT..TTTTT............................TTTT..TTTTT..... 365 2ZQ5.A mol:aa TRANSFERASE RKDVATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGEAGLTVLGSKMNRFFLRGALVARLLSQSAWKQYPEHV DVAIKRPIFVTGLVRTGTTALHRLLGADPAHQGLHMWLAEYPQPRPPRETWESNPLYRQLDADFTQHHAENPGYTGLHFM AAYELEECWQLLRQSLHSVSYEALAHVPSYADWLSRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALMATY PDALVVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERFNAARAKYDSAQFYDVDYHDLIADPLG TVADIYRHFGLTLSDEARQAMTTHSYSLADYGLTVEMVKERFAGL .........................TTTT................................................... ...TTTT.....TTTT...........TTTT................................................. TTTT.............TTTT................................TTTTTTT...................T TTT.......................TTTTTTTTT............................................. .........................................TTTT 80 2ZQE.A mol:aa DNA BINDING PROTEIN VKEVDLRGLTVAEALLEVDQALEEARALGLSTLRLLHGKGTGALRQAIREALRRDKRVESFADAPPGEGGHGVTVVALRP .....TTTT.............................TTTT............TTTT......TTTTTT.......... 178 2ZNR.A mol:aa HYDROLASE GPGHMEGLRCVVLPEDLCHKFLQLAESNTVRGIETCGILCGKLTHNEFTITHVIVPKQSAGPDYCDMENVEELFNVQDQH DLLTLGWIHTHPTQTAFLSSVDLHTHCSYQLMLPEAIAIVCSPKHKDTGIFRLTNAGMLEVSACKKKGFHPHTKEPRLFS ICKHVLVKDIKIIVLDLR TTTT.........TTTT.........................TTTTT.............TTTT..TTTT.......... ..........TTTT..................TTTT.....................................TTTTTTT .TTTT............. 244 2G1P.A mol:aa TRANSFERASE/DNA KNRAFLKWAGGKYPLLDDIKRHLPKGECLVEPFVGAGSVFLNTDFSRYILADINSDLISLYNIVKMRTDEYVQAARELFV PETNCAEVYYQFREEFNKSQDPFRRAVLFLYLNRYGYNGLCRYNLRGEFNVPFGRYKKPYFPEAELYHFAEKAQNAFFYC ESYADSMARADDSSVVYCDPPYAPLNSFTLEQQAHLAEIAEGLVERHIPVLISNHDTMLTREWYQRAKLHVVKKVDELLA LYKP .......TTTT...................TTTTTTT.......TTTT................................ ...........................................TTTT........TTTT..................... ..........TTTT.................................................TTTT............. .... 87 3G1J.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION GVDRDYLQSEYGVLKAGQCYKVVRSFRDYRNINYERGDVRFLGSNFVPYESGLSLFFDKNGSERQILCVRPEFQEIAHHL DSYFCKL ..........TTTTTTTT...TTTT..TTTT...TTTT........TTTTT......TTTTT.......TTTTTTTTTTT ....... 339 3K7X.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION KWSEYANLAQQSLEKFYLADTKEQFLNNFYPTENPEEDNKVFNYWWLAHLVEVRLDAYLRTKKQADLEVAEKTYLHNKNR NGGTLIHDFYDDLWNALAAYRLYKATGKSIYLEDAQLVWQDLVDTGWNDIGGGFAWRRPQYYKNTPVNAPFIILSCWLYN ELNETKYLEWAKTYEWQTKVLVREDGFVEDGINRLEDGTIDYEWKFTYNQGVYIGANLELYRITKEAIYLDTANKTAAIS LKELTEDGIFKDEGNGGDEGLFKGIFYRYFTDLIEETANKTYRDFVLNSCQILVENAKLDGYLLGNWKEKPSGKIPYSAE LSGIALEAAKLELEHHHHH ....................TTTTT...TTTTTTT............................................T TTTT..TTTTTT............................................TTTT.................... ......................TTTT......TTTT....TTTTT................................... ....TTTTT................................................TTTTT...TTTT........... ...TTTT............ 132 3LLO.A mol:aa MOTOR PROTEIN SPSYTVLGQLPDTDVYIDIDAYEEVKEIPGIKIFQINAPIYYANSDLYSSANIHTVILDFTQVNFMDSVGVKTLAGIVKE YGDVGIYVYLAGCSAQVVNDLTSNRFFENPALKELLFHSIHDAVLGSQVREA TTTT.....TTTT....TTTTTTTT..TTTT............................TTTT................. .........TTTT...........TTTTTT................TTTT.. 140 3IVV.A mol:aa LIGASE SGKVVKFSYMWTINNFSFCREEMGEVIKSSTFSSGANDKLKWCLRVNPKGLDEESKDYLSLYLLLVSCPKSEVRAKFKFS ILNAKGEETKAMESQRAYRFVQGKDWGFKKFIRRDFLLDEANGLLPDDKLTLFCEVSVVQ ............TTTT.....TTTT.....................TTTT....TTTT.........TTTT......... ..TTTT.......TTTT...TTTT..................TTTT.............. 112 3ES4.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION GTPIFNISDDVDLVPAPAEGRDGGSYRRQIWQDDVENGTIVAVWAEPGIYNYAGRDLEETFVVVEGEALYSQADADPVKI GPGSIVSIAKGVPSRLEILSSFRKLATVIPKP .....TTTTTTTT....TTTT.............TTTT.................................TTTT..... TTTT....TTTT.................... 100 3K0X.A mol:aa PROTEIN BINDING SAKLIFINQINDCKDGQKLRFLGCVQSYKNGILRLIDGSSSVTCDVTVVLPDVSIQKHEWLNIVGRKRQDGIVDVLLIRS AVGINLPRYRQMVSERQKCD .............TTTT...........TTTT....TTTT.....TTTTTTTT..TTTT.........TTTT........ TTTT................ 146 3K0Z.A mol:aa LYASE EVQLLKEPKPKATIDPSLSQKEATEVHAAQRFYAFWDTGKEELIPQTVTENFFDHTLPKGRPQGTEGLKFAAQNFRKIVP NIHCEIEDLLVVGDKVTARLSFTGTHNDKKIDFFAIDILHVKDGKITEDWHLEDNLTLKQQLGLIA ..............TTTT...................................TTTTTTTT...............TTTT TT.........TTTT.........TTTTT...........TTTTT..................... 87 3IWF.A mol:aa TRANSCRIPTION REGULATOR PNILYKIDNQYPYFTKNEKKIAQFILNYPHKVVNTSQEIANQLETSSTSIIRLSKKVTPGGFNELKTRLSKFLPKEVTQY NNKLHSR .........................................................TTTT...............TTTT ....... 233 3MW8.A mol:aa LYASE GKLLLTRPEGKNAAASALDALAIPYLVEPLLSVEAAAVTQAQLDELSRADILIFISTSAVSFATPWLKDQWPKATYYAVG DATADALALQGITAERSPQATEGLLTLPSLEQVSGKQIVIVRGKGGREAADGLRLRGANVSYLEVYQRACPPLDAPASVS RWQSFGIDTIVVTSGEVLENLINLVPKDSFAWLRDCHIIVPSARVETQARKKGLRRVTNAGAANQAAVLDALG .......TTTTTT.....................................................TTTTT......... .............................TTTTTTT............................................ ......................................................................... 111 3MWZ.A mol:aa HYDROLASE INHIBITOR ELALRGGYRERSNQDDPEYLEAHYATSTWSAQQPGKTHFDTVVEVKVETQTVAGTNYRLTLKVAESTCELTSTYNKDTCQ ANANAAQRTCTTVIYRNQGEKSINSFECAAA ...........TTTTTTTTTT...........TTTT..............TTTTT.............TTTT..TTTTTT .TTTT.......................... 113 3FKA.A mol:aa UNKNOWN FUNCTION TTSEHIAALTALVETYVATRGDRPALERIFFGKASEVGHYEGELLWNSRDAFIACEDAADAETDPFWAISSVSVQGDIAL HVENDWAGRFDDFLTVLLHEGSWRIVSKVYRIR ......................................TTTTT...........TTTTTTTTT...........TTTT.. ....TTTT.........TTTTT........... 130 3FK8.A mol:aa ISOMERASE ALNLPYDEHADAWTQVKKALAAGKRTHKPTLLVFGANWCTDCRALDKSLRNQKNTALIAKHFEVVKIDVGNFDRNLELSQ AYGDPIQDGIPAVVVVNSDGKVRYTTKGGELANARKSDQGIYDFFAKITE TTTT..TTTT.........................TTTT.............................TTTTTTTT.... ........TTTT....TTTT.....TTTTTTTTTTT.............. 143 3GMG.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION ARLAALSILVGAVGATGPGVMITIDDPGPGVAPEVMIDVINELRAAGAEAIQINDAHRSVRVGVDTWVVGVPGSLTVDTK VLSPPYSILAIGDPPTLAAAMNIPGGAQDGVKRVGGRMVVQQADRVDVTALRQPKQHQYAQPV .........................TTTT..................TTTT...TTTT....TTTT....TTTTTTTTTT ......................TTTT..............................TTTT... 328 3I1A.A mol:aa TRANSFERASE KQPIQAQQLIELLKVHYGIDIHTAQFIQGGADTNAFAYQADSESKSYFIKLKYGYHDEINLSIIRLLHDSGIKEIIFPIH TLEAKLFQQLKHFKIIAYPFIHAPNGFTQNLTGKQWKQLGKVLRQIHETSVPISIQQQLRKEIYSPKWREIVRSFYNQIE FDNSDDKLTAAFKSFFNQNSAAIHRLVDTSEKLSKKIQPDLDKYVLCHSDIHAGNVLVGNEESIYIIDWDEPMLAPKERD LMFIGGGVGNVWNKPHEIQYFYEGYGEINVDKTILSYYRHERIVEDIAVYGQDLLSRNQNNQSRLESFKYFKEMFDPNNV VEIAFATE ..........................TTTT.TTTT......TTTT................................... TTTT.....TTTT...........TTTTTT.................................................. .TTTT..........................................TTTT.................TTTT........ ..TTTTTTTTT.............................................TTTT...............TTTT. ........ 82 3FBL.A mol:aa STRUCTURAL PROTEIN YHKLRLAIKEICKTDGIPNIKWGMYIAFGEKLLKSYLKMKAGSASSDMIAEYINNAISAFSSRTGISQETAQKIADFITS NY ..............................................................TTTT.............. .. 89 3FB9.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION GMSDAFTDVAKMKKIKEEIKAHEGQVVEMTLENGRKRQKNRLGKLIEVYPSLFIVEFGDVEGDKQVNVYVESFTYSDILT EKNLIHYLD .....................TTTT........TTTTTTT........TTTT.......TTTTTTTTT............ TTTT..... 137 2WY3.B mol:aa IMMUNE SYSTEM/VIRAL PROTEIN VDLGSKSSNSTCRLNVTELASIHPGETWTLHGMCISICYYENVTEDEIIGVAFTWQHNESVVDLWLYQNDTVIRNFSDIT TNILQDGLKMRTVPVTKLYTSRMVTNLTVGRYDCLRCENGTTKIIERLYVRLGSLYP ....TTTTT..............TTTT..TTTT......TTTTTTTT.......TTTTTTT.....TTTTTTTTTTTTTT TT.......TTTT...........TTTTT.......TTTTT............TTTT 302 3MG1.A mol:aa CAROTENOID BINDING PROTEIN FTIDSARGIFPNTLAADVVPATIARFSQLNAEDQLALIWFAYLEMGKTLTIAAPGAASMQLAENALKEIQAMGPLQQTQA MCDLANRADTPLCRTYASWSPNIKLGFWYRLGELMEQGFVAPIPAGYQLSANANAVLATIQGLESGQQITVLRNAVVDMG FRIAEPVVPPQDTASRTKVSIEGVTNATVLNYMDNLNANDFDTLIELFTSDGALQPPFQRPIVGKENVLRFFREECQNLK LIPERGVTEPAEDGFTQIKVTGKVQTPWFGGNVGMNIAWRFLLNPEGKIFFVAIDLLASPKE .......TTTTTT................................................................... .....................................TTTT..TTTT................................. ....................TTTT...............................TTTT................TTTT. .........................TTTTT.............TTTT...........TTTT 162 3HA2.A mol:aa OXIDOREDUCTASE QTLIIVAHPELARSNTQPFFKAAIENFSNVTWHPLVADFNVEQEQSLLLQNDRIILEFPLYWYSAPALLKQWDTVTTKFA TGHQYALEGKELGIVVSTGDNGNAFQAGAAEKFTISELRPFEAFANKTKYLPILAVHQFLYLEPDAQQRLLVAYQQYATN VG .......TTTTTTTTTT......TTTTTTT.....TTTT...........TTTT.....TTTTT................ TTTTTTTTTT...............TTTTTTT.TTTTT.................TTTT..................... .. 288 3H20.A mol:aa REPLICATION RTLQAIGRQLKAMGCERFDIGVRDATTGQMMNREWSAAEVLQNTPWLKRMNAQGNDVYIRPAEQERHGLVLVDDLSEFDL DDMKAEGREPALVVETSPKNYQAWVKVADAAGGELRGQIARTLASEYDADPASADSRHYGRLAGFTNRKKHTTYQPWVLL RESKGKTATAGPALVQQAGQQIEQAQRQQEKARRLASLERRTALDEYRSEMAGLVKRFGDDLSKCDFIAAQKLASRGRSA EEIGKAMAEASPALAERKHEADYIERTVSKVMGLPSVQLARAELARAP ..............TTTT.....TTTTT.................................TTTT............... .........TTTT.TTTTTT.......TTTT.......................TTTT...................... .......TTTT..............................................TTTTT.................. ................................................ 120 3LFR.A mol:aa TRANSPORT PROTEIN LQVRDIVPRSQISIKATQTPREFLPAVIDAAHSRYPVIGESHDDVLGVLLAKDLLPLILKADGDSDDVKKLLRPATFVPE SKRLNVLLREFRANHNHAIVIDEYGGVAGLVTIEDVLEQI .......TTTT...TTTT...................TTTTTTTT..............TTTT...............TT TT...................TTTT............... 34 2RIV.B mol:aa SIGNALING PROTEIN HPIIQIDRSFMLLILERSTRSILFLGKVVNPTEA ...............TTTTT.......TTTTTT. 168 3H5J.A mol:aa LYASE SEAFHTHSGIGVPLRRSNVDTDQIIPAVFLKRVTRTGFEDGLFAGWRSDPAFVLNLSPFDRGSVLVAGPDFGTGSSREHA VWALMDYGFRVVISSRFGDIFRGNAGKAGLLAAEVAQDDVELLWKLIEQSPGLEITANLQDRIITAATVVLPFKIDDHSA WRLLEGLD ...............TTTT.........TTTT........TTTTTT..TTTT...TTTT........TTTTTT....... .............TTTT................................TTTT....TTTTT...TTTT........... ........ 139 3H51.A mol:aa PROTEIN BINDING GVHYTDKAALPADGEAREVAALFDTWNAALATGNPHKVADLYAPDGVLLPTVSNEVRASREQIENYFEFLTKKPKGVINY RTVRLLDDDSAVDAGVYTFTLTDKNGKKSDVQARYTFVYEKRDGKWLIINHHSSAPEVD .................................................TTTT........................... ......TTTT............TTTT..............TTTTT.............. 150 2VXZ.A mol:aa VIRAL PROTEIN HSREVLVRLRDILALLADGCKTTSLIQQRLGLSHGRAKALIYVLEKEGRVTRVAFGNVALVCLSDQYRQLVDGIREVERL VTTNKLKFISPPRLHDLIIKDPQARKFFSSIIPIAHRTAIILSFLNHLLKIYGEPYVKTDETVYLTANRK ......................................................TTTT...................... .....TTTT.............................................TTTTTTTT........ 156 2VXT.I mol:aa CYTOKINE YFGKLESKLSVIRNLNDQVLFIDQGNRPLFEDMTDSDARDNAPRTIFIISMYKDSQPRGMAVTISVKAEKISTLSAENKI ISFKEMNPPDNIKDTKSDIIFFQRSVPGHDNKMQFESSSYEGYFLAAEKERDLFKLILKKEDELGDRSIMFTVQNE .............TTTT.....TTTT..............TTTTTT........TTTTT.......TTTTT......... ........TTTTTTT.TTTT.....TTTT.......TTTTTTT......TTTT.........TTTT.......... 298 1US5.A mol:aa RECEPTOR AQEFITIGSGSTTGVYFPVATGIAKLVNDANVGIRANARSTGGSVANINAINAGEFEMALAQNDIAYYAYQGCCIPAFEG KPVKTIRALAALYPEVVHVVARKDAGIRTVADLKGKRVVVGDVGSGTEQNARQILEAYGLTFDDLGQAIRVSASQGIQLM QDKRADALFYTVGLGASAIQQLALTTPIALVAVDLNRIQAIAKKYPFYVGFNIPGGTYKGVDVTTPTVAVQAMLIASERL SEETVYKFMKAVFGNLEAFKKIHPNLERFFGLEKAVKGLPIPLHPGAERFYKEAGVLK ..........TTTT.......................................TTTT.................TTTTTT T.TTTT...............TTTT.......TTTT.....TTTT...................TTTT............ ..TTTT......TTTT........................TTTTTT.......TTTTTTTT...............TTTT ............TTTT...................TTTT................... 141 2WCJ.A mol:aa TRANSPORT PROTEIN TAEVMSHVTAHFGKTLEECREESGLSVDILDEFKHFWSDDFDVVHRELGCAIICMSNKFSLMDDDVRMHHVNMDEYIKSF PNGQVLAEKMVKLIHNCEKQFDTETDDCTRVVKVAACFKEDSRKEGIAPEVAMVEAVIEKY .........................TTTTTTTT....TTTT.....................TTTT.............T TTT.................TTTT................................TTTT. 148 2WCR.A mol:aa IMMUNE SYSTEM NSFLQDVPYWMLQNRSEYITQGVDSSHIVDGKKTEEIEKIATKRATIRVAQNIVHKLKEAYLSKTNRIKQKITNEMFIQM TQPIYDSLMNVDRLGIYINPNNEEVFALVRARGFDKDALSEGLHKMSLDNQAVSILVAKVEEIFKDSV ...TTTT..........TTTT.......TTTT..............................TTTT.............. ..................TTTTT............................................. 260 1EG3.A mol:aa STRUCTURAL PROTEIN PASQHFLSTSVQGPWERAISPNKVPYYINHETQTTCWDHPKMTELYQSLADLNNVRFSAYRTAMKLRRLQKALCLDLLSL SAACDALDQHNLKQNDQPMDILQIINCLTTIYDRLEQEHNNLVNVPLCVDMCLNWLLNVYDTGRTGRIRVLSFKTGIISL CKAHLEDKYRYLFKQVASSTGFCDQRRLGLLLHDSIQIPRQLGEVASFGGSNIEPSVRSCFQFANNKPEIEAALFLDWMR LEPQSMVWLPVLHRVAAAET ...........TTTT....TTTT.....TTTTT..................TTTT......................... .............TTTT.....................TTTTTT................TTTT................ .................TTTT..........................................TTTT............. ..TTTTT............. 387 3G02.A mol:aa HYDROLASE KAFAKFPSSASISPNPFTVSIPDEQLDDLKTLVRLSKIAPPTYESLQADGRFGITSEWLTTMREKWLSEFDWRPFEARLN SFPQFTTEIEGLTIHFAALFSEREDAVPIALLHGWPGSFVEFYPILQLFREEYTPETLPFHLVVPSLPGYTFSSGPPLDK DFGLMDNARVVDQLMKDLGFGSGYIIQGGDIGSFVGRLLGVGFDACKAVHLNFCNMSAPPEGPSIESLSAAEKEGIARME KFMTDGYAYAMEHSTRPSTIGHVLSSSPIALLAWIGEKYLQWVDKPLPSETILEMVSLYWLTESFPRAIHTYREWVPTTP YQKELYIHKPFGFSFFPKDLVPVPRSWIATTGNLVFFRDHAEGGHFAALERPRELKTDLTAFVEQVW TTTT..TTTT....................................TTTTTTT........................... .......TTTTT..........TTTT.......TTTT................TTTTT........TTTTTTT...TTTT ...................TTTT...................TTTT............TTTT.................. ..........................................TTTT............................TTTTTT TTTTTT.........TTTTTTT.....................TTTT.................... 138 3G0M.A mol:aa HYDROLASE MAALPDKEKLLRNFTRCANWEEKYLYIIELGQRLAELNPQDRNPQNTIHGCQSQVWIVMRRNANGIIELQGDSDAAIVKG LMAVVFILYHQMTAQDIVHFDVRPWFEKMALAQHLTPSRSQGLEAMIRAIRAKAATLS TTTT.........................................................TTTT............... ........TTTT.............................................. 184 3HOI.A mol:aa OXIDOREDUCTASE GAERTIQLPKPDNRAGLLKALSERHSTREYASKALSNTDLSDLLWAANGINRSSEGKRTAPSANRQDIDIYVVLPQGTYL YDAKGHKLNLISEGDHRSAVAGGQAFVNNAPVSLVLVSDLSKLGDAKSNHVQLGADAGIVSQNISLFCSAARLATVPRAS DLVRLKAALKLKDTQPNHPVGYFK ..TTTT...........................................TTTTTTT.TTTT............TTTT... .TTTTT..............TTTT.....TTTT...........TTTT................................ ...........TTTT......... 251 3HO6.A mol:aa TOXIN GVDFNKNTALDKNYLLNNKIPSNNGSKNYVHYIIQLQGDDISYEATCNLFSKNPKNSIIIQRNMNESAKSYFLSDDGESI LELNKYRIPERLKNKEKVKVTFIGHGKDEFNTSEFARLSVDSLSNEISSFLDTIKLDISPKNVEVNLLGCNMFSYDFNVE ETYPGKLLLSIMDKITSTLPDVNKNSITIGANQYEVRINSEGRKELLAHSGKWINKEEAIMSDLSSKEYIFFDSIDNKLK AKSKNIPGLAS ..TTTT...................................................................TTTT... ...TTTT....TTTT...........TTTT...TTTTT...................................TTTT... ..................TTTT................TTTT.....TTTT.....................TTTTT... ..TTTTTTT.. 168 3KMI.A mol:aa MEMBRANE PROTEIN NIHKIHEVQKKLQEEVSIVLIDIADIIVNPKKENGYSRDLYTLNSLIDSSISETYDNINNTLLSDTRFFLEHDIIKSQRD ILENLYSYVSQLNSTPPQAHILSAFIHKIGYTEFEAETGNLLLEELKRLISKNQPLPVDRTEFENRAILFLCLTELKQFL VNRKHAQL ............................TTTT................................................ ..................................TTTT.......................................... ........ 317 3HRR.A mol:aa TRANSCRIPTION EIKTTTTLHRVVEETTKPLGATLVVETDISRKDVNGLARGHLVDGIPLCTPSFYADIAMQVGQYSMQRLRGLVDVSDMVV DKALVPHGKGPQLLRTTLTMEWPPKAAATTRSAKVKFATYFKLDTEHASCTVRFTSDAQLKSLRRSVSEYKTHIRQLHDG HAKGQFMRYNRKTGYKLMSSMARFNPDYMLLDYLVLNEAENEAASGVDFSLGSSEGTFAAHPAHVDAITQVAGFAMNAND NVDIEKQVYVNHGWDSFQIYQPLDNSKSYQVYTKMGQANDLVHGDVVVLDGEQIVAFFRGLTLRSVPRGALRVVLQT ....TTTT........TTTT.......TTTTTTTT......TTTTT.................................. ......................TTTT...................................................... .................TTTTT...................................TTTT.................TT TTTTTTT................TTTT.....................TTTTT........................ 250 3HR0.A mol:aa TRANSPORT PROTEIN STDEAKMSFLVTLNNVEVCSENISTLKKTLESDCTKLFSQGIGGEQAQAKFDSCLSDLAAVSNKFRDLLQEGLTELNSTA IKPQVQPWINSFFSVSHNIEEEEFNDYEANDPWVQQFILNLEQQMAEFKASLSPVIYDSLTGLMTSLVAVELEKVVLKST FNRLGGLQFDKELRSLIAYLTTVTTWTIRDKFARLSQMATILNLERVTEILDYWGPNSGPLTWRLTPAEVRQVLALRIDF RSEDIKRLRL .....................................TTTT....................................... ...............TTTT............................................................. ...........................................TTTT..........TTTT...............TTTT .......... 91 3HRL.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION ASEAEAKLWQHLRAGRLNGYKFRRQQPGNYIVDFCVTPKLIVEADGVYDHARTVYLNSLGFTVLRFWNHEILQQTNDVLA EILRVLQELEK ..............TTTTTT..TTTT........TTTTT......................................... ........... 77 3G21.A mol:aa VIRAL PROTEIN AGPWADIMQGPSESFVDFANRLIKAVEGSDLPPSARAPVIIDCFRQKSQPDIQQLIRTAPSTLTTPGEIIKYVLDRQ .........TTTT..............................................TTTT.............. 90 3G2B.A mol:aa BIOSYNTHETIC PROTEIN TISRDSCPALRAGVRLQHDRARDQWVLLAPERVVELDDIALVVAQRYDGTQSLAQIAQTLAAEFDADASEIETDVIELTT TLHQKRLLRL ..TTTT....TTTT....TTTTT........................TTTT............................. .......... 146 3G2S.A mol:aa PROTEIN TRANSPORT EPAMEPETLEARINRATNPLNKELDWASINGFCEQLNEDFEGPPLATRLLAHKIQSPQEWEAIQALTVLETCMKSCGKRF HDEVGKFRFLNELIKVVSPKYLGSRTSEKVKNKILELLYSWTVGLPEEVKIAEAYQMLKKQGIVKS .................TTTTTTT..............TTTT.............TTTT..................... .................TTTTT......................TTTT.................. 193 3KOS.A mol:aa TRANSCRIPTION QEKLKIGVVGTFAIGCLFPLLSDFKRSYPHIDLHISTHNNRVDPAAEGLDYTIRYGGGAWHDTDAQYLCSALSPLCSPTL ASQIQTPADILKFPLLRSYRRDEWALWQTVGEAPPSPTHNVVFDSSVTLEAAQAGGVAIAPVRFTHLLSSERIVQPFLTQ IDLGSYWITRLQSRPETPAREFSRWLTGVLHKT ...........................TTTT......TTTT.......TTTT.......TTTT................. ...................TTTT....TTTT....TTTT...TTTTTT......................TTTT...... ..........TTTT................... 224 3KOG.A mol:aa MEMBRANE PROTEIN TPVNAKFIITPVVIDATTGTDVTQSAEISFSKGNGTYEGTPELASESININAKYKGTGSASVTIPALKAGQFGAKEVTII LSENFFAQEESSNSQIETTKHSGFKNNTSDYWYYITVTYTKKEGSEVIKNDYEGDDSEIKNIIDAYNKGVREDKVTLNDV QVLAHSRFSVFVDYKTTSVYQIIEKSPDGNPVASFTVDSYNTIVSPKNEQIPGHGHAPSHGHGH ..............TTTTT..........TTTT.....TTTTT.........TTTT...........TTTT......... .TTTT........................................................................... ..TTTT............................................TTTT.....TTTT. 259 3C7T.A mol:aa HYDROLASE RRWVFALRHGERVDLTYGPWVPHCFENDTYVRKDLNLPLKLAHRAGGKGGYVKDTPLTRLGWFQAQLVGEGMRMAGVSIK HVYASPALRCVETAQGFLDGLRADPSVKIKVEPGLFEFKNWHMPKGIDFMTPIELCKAGLNVDMTYKPYVEMDASAETMD EFFKRGEVAMQAAVNDTEKDGGNVIFIGHAITLDQMVGALHRLRDDMEDVQPYEIGRNLLKVPYCALGAMRGKPWDVVSP PCPPSINSSSGRFDWRILI ................TTTT....TTTTT....TTTTTTTT..TTTT................................. .......................TTTT............TTTTT..................TTTT.TTTT......... ................TTTTT......................TTTTTTT...TTTTTT...TTTT.....TTTT..... ................... 121 2RH3.A mol:aa DNA BINDING PROTEIN IQVFLSARPPAPEVSKIYDNLILQYSPSKSLQMILRRALGDFENMLADGSFRAAPKSYPIPHTAFEKSIIVQTSRMFPVS LIEAARNHFDPLGLETARAFGHKLATAALACFFAREKATNS ..........TTTTTT..............................................TTTT.............. .........TTTT............................ 108 3F62.A mol:aa CYTOKINE GAMVETKCPNLDIVTSSGEFHCSGCVEHMPEFSYMYWLAKDMKSDEDTKFIEHLGDGINEDETVRTTDGGITTLRKVLHV TDTNKFAHYRFTCVLTTLDGVSKKNIWL ......TTTT.....TTTT.......TTTTTTT.......TTTT......................TTTTT........T TTTTTTTTT.......TTTT........ 238 3H0W.A mol:aa LYASE SMFVSKRRFILKTCGTTLLLKALVPLLKLARDYSGFDSIQSFFYSRKNFMKPSHQGYPHRNFQEEIEFLNAIFPNGAAYC MGRMNSDCWYLYTLDFDQTLEILMSELDPAVMDQFYMKDGVTAKDVTRESGIRDLIPGSVIDATMFNPCGYSMNGMKSDG TYWTIHITPEPEFSYVSFETNLSQTSYDDLIRKVVEVFKPGKFVTTLFVNQKIEGFKRLDCQSAMFNDYNFVFTSFAK ....TTTT.....TTTT..............TTTT.............TTTT...TTTTT.........TTTTTTT.... ..TTTT...............................TTTT..............TTTT......TTTT.......TTTT .......................TTTT............TTTT.........TTTT.........TTTT......... 155 3H05.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION MKKIAIFGSAFNPPSLGHKSVIESLSHFDLVLLEPSIMLDYPIRCKLVDAFIKDMGLSNVQRSDLEQALYSVTTYALLEK IQEIYPTADITFVIGPDNFFKFAKFYKAEEITERWTVMACPEKVKIRSTDIRNALIEGKDISTYTTPTVSELLLN TTTT......TTTT..........TTTTTTT.........................TTTT.................... ....TTTT................TTTT.............TTTT...............TTTTT.......... 310 1DC1.A mol:aa HYDROLASE/DNA KPFENHLKSVDDLKTTYEEYRAGFIAFALEKNKRSTPYIERARALKVAASVAKTPKDLLYLEDIQDALLYASGISDKAKK FLTEDDKKESINNLIENFLEPAGEEFIDELIFRYLLFQGDSLGGTMRNIAGALAQQKLTRAIISALDIANIPYKWLDSRD KKYTNWMDKPEDDYELETFAKGISWTINGKHRTLMYNITVSLVKKNVDICLFNCEPQQPEKYLLLGELKGGIDPAGADEH WKTANTALTRIRNKFSEKGLSPKTIFIGAAIEHSMAEEIWDQLQSGSLTNSANLTKTEQVGSLCRWIINI ...................................................TTTT......................... ..................TTTTT.....................................................TTTT TTT......TTTTTTT.........TTTTT.........TTTTT.................................... ...........................TTTT..............TTTT...TTTT.............. 224 3NE8.A mol:aa HYDROLASE ASFRVVLDPGHGGIDGGARGVTGILEKDVTLAFARALRDELQKGSHTIVALTRDSDIFLRLSERVKKAQEFDADLFISIH ADTIDVHSLRGATVYTISDEASDAIAKSLAESENKVDLLDGLPKEDILLDLTRRETHAFSINFANNVVSNLSKSHINLIN NPHRYADFQVLKAPDVPSVLIEIGYLSNKEDEKLLNNPQWRKQAASIAYSIRQFAEYRQKIQPL ...........TTTTT...TTTT...........................TTTT..................TTTT.... ...TTTTTT........TTTT........................................................... ............TTTT.....TTTTTTTT................................... 124 3GDW.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION SNANVGVFVLHGDSTASSLKTAQELLGTSIGTANPLTEVQTYEQLRNQVITQKESLNNGILLLTDGSLNSFGNLFEETGI RTKAITTSTIVLEAIRASVGRSLEDIYQNIQLSFESVVREQFRS ...........TTTTTTT...................TTTT..............TTTT..........TTTT....... ................TTTT........................ 107 3L4H.A mol:aa PROTEIN BINDING LLLQSPAVKFITNPEFFTVLHANYSAYRVFTSSTCLKHILKVRRDARNFERYQHNRDLVNFINFADTRLELPRGWEIKTD QQGKSFFVDHNSRATTFIDPRIPLQNG .TTTT.......TTTT....TTTT........TTTTTT.............TTTT..........TTTT..TTTT....T TTT.....TTTTT.....TTTT.TTTT 144 3GY9.A mol:aa TRANSFERASE DVTIERVNDFDGYNWLPLLAKSSQEGFQLVERLRNRREESFQEDGEAFVALSTTNQVLACGGYKQSGQARTGRIRHVYVL PEARSHGIGTALLEKISEAFLTYDRLVLYSEQADPFYQGLGFQLVSGEKITHTLDKTAFADSNR ......TTTT.................TTTTTTTTTTTTT..TTTT.....TTTT.........TTTTTTT......... ..TTTT................TTTT...TTTT.............TTTT..........TTTT 63 2XMJ.A mol:aa CHAPERONE TIQLTVPTIACEACAEAVTKAVQNEDAQATVQVDLTSKKVTITSALGEEQLRTAIASAGHEVE .....TTTT.............TTTTTTT....TTTTT......................... 84 2XCJ.A mol:aa VIRAL PROTEIN SNTISEKIVLMRKSEYLSRQQLADLTGVPYGTLSYYESGRSTPPTDVMMNILQTPQFTKYTLWFMTNQIAPESGQIAPAL AHFG ......................................TTTT...................................... .... 50 3IM3.A mol:aa STRUCTURAL PROTEIN, SIGNALING PROTEIN SLRECELYVQKHNIQALLKDSIVQLCTARPERPMAFLREYFEKLEKEEAK .............................TTTT................. 155 3IMK.A mol:aa METAL BINDING PROTEIN GDEKPAITKIISGGQTGADRAALDFAIKHHIPYGGWVPKGRLAEGGRVPETYQLQEPTSDYSKRTEKNVLDSDGTLIISH GILKGGSALTEFFAEQYKKPCLHIDLDRISIEDAATLINSWTVSHHIQVLNIAGPRAGKDPEIYQATDLLEVFLA ..........................................TTTT..TTTT............................ ........................TTTTT.................TTTT.....TTTTTTTT............ 123 3I2V.A mol:aa TRANSFERASE SRVSVTDYKRLLDSGAFHLLLDVRPQVEVDICRLPHALHIPLKHLERRDAESLKLLKEAIWEEKQGTAAVPIYVICKLGN DSQKAVKILQSLSAAQELDPLTVRDVVGGLAWAAKIDGTFPQY .................................TTTT..........................TTTT........TTTT. ...............TTTT......TTTT.......TTTT... 96 2WJ5.A mol:aa CHAPERONE GAMAQVPTDPGYFSVLLDVKHFSPEEISVKVVGDHVEVHARHEERPDEHGFIAREFHRRYRLPPGVDPAAVTSALSPEGV LSIQATPASAQASLPS ..................TTTT........TTTTT.........TTTTTT............TTTTTTTTT....TTTT. ................ 204 2WJR.A mol:aa TRANSPORT PROTEIN ALDVRGGYRSGSHAYETRLKVSEGWQNGWWASMESNTWNTINDVQVEVNYAIKLDDQWTVRPGMLTHFSSNGTRYGPYVK LSWDATKDLNFGIRYRYDWKAYRQQDLSGDMSRDNVHRWDGYVTYHINSDFTFAWQTTLYSKQNDYRYANHKKWATENAF VLQYHMTPDITPYIEYDYLDRQGVYNGRDNLSENSYRIGVSFKL ........TTTTT...........TTTT..........................TTTT..........TTTT........ ...TTTTTT................TTTT..................TTTT................TTTT......... ....TTTTTT........TTTT.TTTTT................ 138 2WJ9.A mol:aa HYDROLASE INHIBITOR TSSACAPETGLQQLVATIVPDEQRISFWPQHFGLIPQWVTLEPRVFGWMDRLCCIWNLYTLNNGGAFMAPEETWVLFNAM NGNRAEMSPEAAGIAACLMTYSHHACRTECYAMTVHYYRLRDYALQHPECSAIMRIID ...............................TTTTTTT......................TTTT.............TTT TT............................................TTTT........ 508 3JSZ.A mol:aa TRANSFERASE SNELSKLRRFFSALNHTSEIDLHTLFDNLKSNLTLGSIEHLQEGSVTYAIIQELLKGADAQKKIESFLKGAIKNVIHPGV IKGLTPNEINWNVAKAYPEYYEHEKLPDVTFGGFKVRDSNEFKFKTNVQTSIWFSIKPELFPSKQQEALKRRREQYPGCK IRLIYSSSLLNPEANRQKAFAKKQNISLIDIDSVKTDSPLYPLIKAELANLGGGNPAAASDLCRWIPELFNEGFYVDIDL PVDSSKIVEGHQITGGVPILNGSIISEPIAPHHRRQEAVCATDIIAYANDRETQVDTVALHLKNIYDDPYTALKDTPLAQ TAFFNRCEEEGKNIFELRKGLQDAFRSDSLLELYVFLGPAKFKEVFKLKETQIKYIDDHISEFNEHDLLLHLISDNPSEI NQHTLDFGRAKVYDIAKEHYSAFYKPLVEEISGPGAIYNALGGASNFTTTHRRSTGPLPTTPPRVLQVFCDAHDKGPFVS DNIARWQTNVRELGVLNREGLSWLPSVG .............TTTTTTTT....................TTTT...........TTTT.................... ................TTTTT........TTTTT..........TTTT......TTTTTT...............TTTT. .....TTTTT.......................................................TTTTTT.....TTTT .......TTTT..................TTTT........TTTT...........................TTTT.... ...............................................................................T TTTTTTT......TTTT...........TTTTT............................................... ....TTTT.........TTTTTTT.... 173 3JTW.A mol:aa OXIDOREDUCTASE ARKVILFIASIDNYIADDQGAVDWLEKNVHGTESDDSYEKYSKIDTVIGRTTYEQVTQKLSPEKYVYADRQTYIVTSHLG EDTDKIKYWKQSPVELVKRIQKEKGKDVWIVGGAKIIDPLVQANLIDTYILTTVPIFLGSGIRLFDRLEEQVPVRLIDVY QKNELVYSIYQRG .........TTTT...TTTT...............TTTTTTTTT................TTTTTTTTTT.......... ..TTTT........................................................TTTTTT............ TTTTT........ 372 3D59.A mol:aa HYDROLASE TKIPRGNGPYSVGCTDLMFDHTNKGTFLRLYYPSQDNDRLDTLWIPNKEYFWGLSKFLGTHWLMGNILRLLFGSMTTPAN WNSPLRPGEKYPLVVFSHGLGAFRTLYSAIGIDLASHGFIVAAVEHRDRSASATYYFKDQSAAEIGDKSWLYLRTLKQEE ETHIRNEQVRQRAKECSQALSLILDIDHGKPVKNALDLKFDMEQLKDSIDREKIAVIGHSFGGATVIQTLSEDQRFRCGI ALDAWMFPLGDEVYSRIPQPLFFINSEYFQYPANIIKMKKCYSPDKERKMITIRGSVHQNFADFTFATGKIIGHMLKLKG DIDSNVAIDLSNKASLAFLQKHLGLHKDFDQWDCLIEGDDENLIPGTNINTT .................TTTTTTTT.................TTTT.........................TTTT....T TTT...............TTTTTTTTTT......................TTTT.......................... .................................TTTT.......TTTT........................TTTTT... TTTT.TTTT................TTTTT............TTTT......TTTT....................TTTT .......................................TTTTTTTTTTT.. 106 3H9W.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION KAIPWKINWQTAFEYIGPQIEALLGWPQGSWKSVEDWATRHPEDQEWVVNFCVKQSECGVDHEADYRALHRDGHYVWIRD VVHVVRDDSGEVEALIGFFDISLEHH .......TTTT..........................................................TTTT....... ......TTTT................ 134 3H96.A mol:aa FLAVOPROTEIN DWNSQVIQEFRANGGRVGGNFEGAPVLVHHVGRKTGKAAVTPYLPSDDDPGTIYVFASKAGAASNPAWYYNLTTAGTAQV EVGTETYAVGVTEVTGEDRDRIYSEQARRYPGFADYEKKTAGIRTIPVLALTRT ............TTTT....TTTT.......TTTTT.........TTTTTTT............................ .TTTT..................................TTTTT.......... 118 3FH1.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION ITLHPDDRSEQTAEIRRFNDVFQLHDPAALPELIAEECVIENTVPAPDGARHAGRQACVQLWSAIATQPGTRFDLEETFV AGDRATIRWRYWADGNSVRGVNLRVQDGRIVEAGYVKG .....TTTT...................................TTTTT..................TTTT......... TTTT....................TTTTT......... 216 3CP7.A mol:aa HYDROLASE PADSPHIGKVFFSTNQGDFVCSANIVASANQSTVATAGHCLHDGNGGQFARNFVFAPAYDYGESEHGVWAAEELVTSAEW ANRGDFEHDYAFAVLETKGGTTVQQQVGTASPIAFNQPRGQYYSAYGYPAAAPFNGQELHSCHGTATNDPMGSSTQGIPC NMTGGSSGGPWFLGNGTGGAQNSTNSYGYTFLPNVMFGPYFGSGAQQNYNYASTTN TTTTTTTT.....TTTT..........TTTT............TTTT..TTTT..TTTTTTTTTTTT............. ................TTTTT................TTTT.........TTTTT.............TTTT........ ..TTTTTTT...................TTTTT....................... 154 3LYD.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION PSIRYPSTEFPALTGFTVPIPETWQPDPTGTQFAARPHTPPQGFTPNIIGTVRRAATGALHNQRTELDQRATQLPDYAER GRTETTVDGFPAYHIEYAYRHHGTITIAQITLVEVSHPHAVDIIQLTATCAGDQTADYWDTFRLHADLTVQPHG ...TTTTTTTT.........TTTT......TTTT..TTTT...............TTTT..............TTTT... .....TTTTT.........TTTTT............TTTT........................TTTT..TTTT 101 3LYY.A mol:aa CELL ADHESION THATSTETIHYVNEDGDQVFEDGGGKLDFTRTVTIDDVTNEVVEYGEWTPVTDDEFAAVTSPDKDGYTPDTSEVAAQKPD TDGPDGTVKDVEVTVTYTANP ............TTTT...................TTTTT.........TTTT..........TTTT.TTTTTT...... .TTTTT............... 116 3LYG.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION GNLANIVQRGWEALGAGDFDTLVTDYVEKIFIPGQADVLKGRQAFRSALDNLGEILPPGFEITGLRQLEGENEIVSIVEW KSDKIASQLSVLFKFEGDQIYEERWFVDTEQWKSVF .................................TTTT...................TTTT.........TTTT....... ..............TTTTT................. 118 3LYH.A mol:aa LYASE PHQIILLAHGSSDARWCETFEKLAEPTVESIENAAIAYELAEPSLDTIVNRAKGQGVEQFTVVPLFLAAGRHLRKDVPAI ERLEAEHGVTIRLAEPIGKNPRLGLAIRDVVKEELERS ..............................TTTT......TTTT.....................TTTT......TTTT. ...................................... 130 3I7M.A mol:aa HYDROLASE TKLEQIQQWTAQHHASTYLSNPKTIEYLTGFGSDPIERVLALVVFPDQDPFIFAPALEVEVIKETGWQFPVIGYLDHENP WAIADQVKQRHVNPEHVAIEKGQLQVAREALAAQFSAPSFDLDITSFIEH ............................................TTTT..........................TTTTTT TT...........TTTT..TTTTTTTTTTTTT..TTTT............ 83 3F2E.A mol:aa VIRAL PROTEIN FTAVAEQVSAVLSQYGITGPNRAIYQGFGLKVARALNRLGGGPALVNMINGLKAYYISAFNANPTVLDAVTDIITGSPTG YVS ............................................................................TTTT ... 123 3HZP.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION SSKEEILSILEAFASTERGSFFLDNATADFLFIRPSGNPLDAKGFENWSSGDLVLESAEITKVHKFELLGSNAAICVFTL GSKFTYKGTQNDDLPTVTSIFKKIDEKWKVAWQRSSGQSDTLW ..............TTTT........TTTT...TTTT............TTTTT.............TTTTTT....... ....TTTTT.............TTTTT................ 48 2XF7.A mol:aa VIRAL PROTEIN ESLLYGYFLDSWLDGTASEELLRVAVNAGDLTQEEADKIMSYPWGAWN .TTTT.......................TTTT..........TTTTTT 108 2XFV.A mol:aa CELL-CYCLE ALEEVVRYLGPHNEIPLTLTRDSETGHFLLKHFLPILQQYHDTGNINETNPDSFPTDEERNKLLAHYGIAVNTDDRGELW IELEKCLQLLNMLNLFGLFQDAFEFEEP .........TTTT........TTTTT.......................TTTT....................TTTT... ..............TTTTTT........ 228 3KZP.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION KFQLFIQPKLDVLQGNIVEYEILLRDDSAVPRFPLSELEAVLADEELYLAFSEWFSEAFLDVLKKYPNDRFAINIAPQQL FYIETLHWLDKLKSESHRITVETEDIFDVPGHKRHLNANDKNAFILNKIKVIHGLGYHIAIDDVSCGLNSLERVSYLPYI IEIKFSLIHFKNIPLEDLLLFIKAWANFAQKNKLDFVVEGIETKETTLLESHGVSIFQGYLVNKPFPV ..........TTTTT............TTTT..................................TTTT........... ................................TTTT........................TTTTTTTTTTTTTT...... .........TTTT.............................TTTT...................... 47 3KZ5.A mol:aa DNA BINDING PROTEIN SSRHQFAPGATVLYKGDKMVLNLDRSRVPTECIEKIEAILKELEKPA ......TTTT....TTTT.....TTTTT................... 85 3KZD.A mol:aa SIGNALING PROTEIN GKVTHSIHIEKTYGFSLSSVEEDGIRRLYVNSVKETGLASKKGLKAGDEILEINNRAADALNSSMLKDFLSQPSLGLLVR TYPEL ....................TTTT.........TTTT.......TTTT...TTTTT........................ ..... 191 2WL1.A mol:aa SIGNALING PROTEIN NVPELIGAQAHAVNVILDAETAYPNLIFSDDLKSVRLGNKWERLPDGPQRFDSCIIVLGSPSFLSGRRYWEVEVGDKTAW ILGACKTSISRKGNMTLSPENGYWVVIMMKENEYQASSVPPTRLLIKEPPKRVGIFVDYRVGSISFYNVTARSHIYTFAS CSFSGPLQPIFSPGTRDGGKNTAPLTICPVG .................TTTTTTTTT..TTTT......TTTT....TTTTTTTT...................TTTT... .....TTTTTTTT................TTTT....TTTT....TTTTTTTT....TTTTT.....TTTTT.....TTT T.................TTTT......... 279 3MT0.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION SNAQAIRSILVVIEPDQLEGLALKRAQLIAGVTQSHLHLLVCEKRRDHSAALNDLAQELREEGYSVSTNQAWKDSLHQTI IAEQQAEGCGLIIKQHFPDNPLKKAILTPDDWKLLRFAPCPVLTKTARPWTGGKILAAVDVGNNDGEHRSLHAGIISHAY DIAGLAKATLHVISAHPTFQLSETIEARYREACRTFQAEYGFSDEQLHIEEGPADVLIPRTAQKLDAVVTVIGTVARTGL SGALIGNTAEVVLDTLESDVLVLKPDDIIAHLEELASKE .............TTTT............................................................... .................TTTTTTTTT.......................TTTT......TTTTT................ ..........................................TTTTT...................TTTT....TTTT.. ..TTTT......TTTT....................... 176 3GFP.A mol:aa HYDROLASE NVDAIKQLYDCKNEADKFDVLTELYGLTIGSSIIFVATKKTANVLYGKLKSEGHEVSILHGDLQTQERDRLIDDFREGRS KVLITTNVLARGIDIPTVSVVNYDLPTLANGQADPATYIHRIGRTGRFGRKGVAISFVHDKNSFNILSAIQKYFGDIETR VPTDDWDEVEKIVKKV TTTTT......................................................TTTT..............TTT T....TTTTTTTTTT......TTTT..TTTT..............TTTTT......................TTTT.... .TTTTT.......... 110 3KYZ.A mol:aa TRANSFERASE YFLAPADRHYLADYARQAEDAWRREGAAGAERFRKELSAKEDTWVALVGPHLESLGSTPLSAEESSHLTFRKLDWPSRRL QDELPYVSIEFPGHPEQGRLVIQLPERLLP ................................................TTTT...................TTTT..TTT TT........TTTTT............... 255 3N0R.A mol:aa SIGNALING PROTEIN EHLLARLAPHLPYIRRYARALTGDQATGDHYVRVALEALAAGELVLDANLSPRVALYRVFHAIWLSSGAGHDQGLHAGDD AAQRLRIAPRSRQAFLLTALEGFTPTEAAQILDCDFGEVERLIGDAQAEIDAELATEVLIIEDEPVIAADIEALVRELGH DVTDIAATRGEALEAVTRRTPGLVLADIQLADGSSGIDAVKDILGRDVPVIFITAFPERLLTGERPEPTFLITKPFQPET VKAAIGQALFFHPRR .........................................TTTT.TTTT...............TTTT......TTTT. .....TTTT....................................................................... ....................TTTT..TTTTTTT.TTTTTTT............TTTT..........TTTT.TTTT.... ...........TTTT 101 3EO6.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION GAPDQQVPATALGKSSRISLDGRRSERSVILADGSHSLTLLHPGVYTLSSEVAETIRVLSGAYYHAEGANDVQELHAGDS VIPANQSYRLEVEPLDYLLSS ..................TTTTT.......TTTT...............TTTT............TTTT......TTTT. ..TTTT............... 139 3HWU.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION QELKGKYKTPTGYLVLRHGDNVLQNLEQLARDEHIPSASFVGIGFSEATFGFYDFGRKQFDPKTYRNVEANTGSIAWKEG KPSIHAHGTVTDGTFQGAGGHLLGLTVGTGSCEITVTVYPQRLDRFVDPEIQANVLGLP TTTTTT..TTTT....TTTT.................................TTTTT......TTTT........TTTT T..........TTTT................................TTTTT....... 276 3HWP.A mol:aa HYDROLASE RNTPFTYFSLPQKLFLRNQAAVRNKPYAKYFRSERVPLSAVRKIQQGPALEDTLTPSIEDINRLLEPDFVSEESGYALLP GPAYVQSRKFFPGCTAQFKWWFIWHPAESERYTLWFPYAHVSNPCVHHQRLRDESLSFEERLYGNTFCASEYVGDRLHLH IDFQQPASLGLNTDLYREAKIDGSVSALSLADHPEVPVSLVHLFKEVPDGYLTSRYWVGAHPSARFPGAEKAASLLKENG FGEAELETLAYEFAVHDCEFNHLASFLPDLYREFGT .....................TTTTTTT...............TTTT..................TTTT.........TT TT........TTTT.....................TTTTT....TTTT....TTTT.....TTTT......TTTTT.... .............................TTTTTTT..........TTTT.............TTTTTT........... .................................... 269 2WNF.A mol:aa TRANSFERASE PCTCTRCIEEQRVSAWFDERFNRSQPLLTAKNAHLEEDTYKWWLRLQREKQPNNLNDTIRELFQVVPGNVDPLLEKRLVS CRRCAVVGNSGNLKESYYGPQIDSHDFVLRNKAPTEGFEADVGSKTTHHFVYPESFRELAQEVSILVPFKTTDLEWVISA TTTGRISHTYVPVPAKIKVKKEKILIYHPAFIKYVFDRWLQGHGRYPSTGILSVIFSLHICDEVDLYGFGADSKGNWHHY WEGVHDGDFESNVTTILASINKIRIFKGR ...TTTTTTTTTTTT.............TTTTT..........TTTTTTTTT..............TTTTTTTTTT..TT TT..........TTTT........TTTT......TTTTT............TTTTT...TTTT................T TTT....TTTTT.TTTT......................TTTTTTTT.............TTTT.TTTT..TTTT...TT TT..................TTTT..... 98 3GWH.A mol:aa TRANSCRIPTION HSQLMAQLVEVIEDSFQMKVNKESVNYLRLIRHIRFTIERIKKEEPTKEPEKLMLLLKNEYPLCYNTAWKLIKILQQTLK KPVHEAEAVYLTLHLIPI ....................TTTT........................................................ ..............TTTT 113 3GWN.A mol:aa OXIDOREDUCTASE ANGLITKIWGTAGWTFNHAVTFGYPLNPTSDDKRRYKNYFISLGDVLPCRLCRESYKKFITTGKTALTNEVLRNRHTLTK WFYDVHNAVNNKLEVDYGLSYEDVVNKYESFRA ........................TTTT..............................TTTT.................. ................................. 330 3K2O.A mol:aa OXIDOREDUCTASE HNHKSKKRIREAKRSARPELKDSLDWTRHNYYESFSLSPAAVADNVERADALQLSVEEFVERYERPYKPVVLLNAQEGWS AQEKWTLERLKRKYRNQKFKCGEDNDGYSVKKKYYIEYESTRDDSPLYIFDSSYGEHPKRRKLLEDYKVPKFFTDDLFQY AGEKRRPPYRWFVGPPRSGTGIHIDPLGTSAWNALVQGHKRWCLFPTSTPRELIKVTRDEGGNQQDEAITWFNVIYPRTQ LPTWPPEFKPLEILQKPGETVFVPGGWWHVVLNLDTTIAITQNFASSTNFPVVWHKTVRGRPKLSRKWYRILKQEHPELA VLADSVDLQE ................TTTTT.....TTTT.................................TTTT....TTTTTTTT. .............TTTT......TTTT...............TTTT....TTTT..TTTT.................... .TTTTT........TTTT...........................TTTT..............TTTT............. TTTT...........TTTT....TTTT..................TTTTT.............................. .......... 175 2R16.A mol:aa CELL ADHESION, SPLICING ATVLSYDGSMFMKIQLPVVMHTEAEDVSLRFRSQRAYGILMATTSRDSADTLRLELDAGRVKLTVNLGKGPETLFAGYNL NDNEWHTVRVVRRGKSLKLTVDDQQAMTGQMAGDHTRLEFHNIETGIITERRYLSSVPSNFIGHLQSLTFNGMAYIDLCK NGDIDYCELNARFGF ......TTTT........................TTTT......TTTT.......TTTTT.....TTTT.......TTTT TT..........TTTT...TTTTT....................TTTTTTTTTTTTT...........TTTTT....... .TTTT.......... 473 3DAN.A mol:aa LYASE MDPSSKPLREIPGSYGIPFFQPIKDRLEYFYGTGGRDEYFRSRMQKYQSTVFRANMPPGPFVSSNPKVIVLLDAKSFPIL FDVSKVEKKDLFTGTYMPSTKLTGGYRVLSYLDPSEPRHAQLKNLLFFMLKNSSNRVIPQFETTYTELFEGLEAELAKNG KAAFNDVGEQAAFRFLGRAYFNSNPEETKLGTSAPTLISSWVLFNLAPTLDLGLPWFLQEPLLHTFRLPAFLIKSTYNKL YDYFQSVATPVMEQAEKLGVPKDEAVHNILFAVCFNTFGGVKILFPNTLKWIGLAGENLHTQLAEEIRGAIKSYGDGNVT LEAIEQMPLTKSVVYESLRIEPPVPPQYGKAKSNFTIESHDATFEVKKGEMLFGYQPFATKDPKVFDRPEEYVPDRFVGD GEALLKYVWWSNGPETESPTVENKQCAGKDFVVLITRLFVIELFRRYDSFEIELGESPLGAAVTLTFLKRASI ................TTTT...........TTTT.............TTTT...TTTTTTTTTTT......TTTTT... .TTTTT.TTTTTTTTT......TTTT......TTTT................TTTTT....................... ...........................TTTTTT.............................TTTT.............. ..........................................................................TTTT.. .....................TTTT.............TTTT....TTTT...........TTTTTTTTTT.TTTTTT.. ........TTTT.TTTT..TTTT..TTTT...........................TTTT.........TTTT 243 3GNE.A mol:aa LYASE NVISTLDLNLLTKGGGSWNVDGVNMKKSAVTTFDGKRVVKAVYDKNSGTSANPGVGGFSFSAVPDGLNKNAITFAWEVFY PKGFDFARGGKHGGTFIGHGAASGYQHSKTGASNRIMWQEKGGVIDYIYPPSDLKQKIPGLDPEGHGIGFFQDDFKNALK YDVWNRIEIGTKMNTFKNGIPQLDGESYVIVNGKKEVLKRINWSRSPDLLISRFDWNTFFGGPLPSPKNQVAYFTNFQMK KYE .........TTTTTT.........TTTT...TTTTT.......TTTT.TTTT...........TTTTTTTT......... TTTT.TTTT.............TTTT.TTTT...................TTTT...TTTT.........TTTTTTTTTT TTT............TTTTT.........TTTTT...........TTTTT..........TTTTT............... ... 288 3GN6.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION FNPWTDAALDTIVNQALTLYAERVVPAHHDAFLAAIDTVSAKLRVLPGFLSLALKQSGDSTVKNYPETYKGVLATAYLDG VAAGTQPYFYNLFVRFADGRAARAAGFEALFETHIHPLLHAADGPELLAYRAVLQSVVAGDRHAIYRGAEEIRSFLRRPV ELPERETVTVENHVVPEDKHAAWEPQVAILLQVAQDTFEPQDEPSGVGLPGARDNRYYRKALSTEILRNAHADGGLRAYI HGVWESVWDHENSHLDPRFLAAAGPVGAAAVVGPVEPFYLTRRLVVAD .TTTT........................................TTTT...................TTTTTTTT.... ............................................................TTTT................ ...............TTTTT.................TTTTTTTTT.....TTTT.............TTTTTTTT.... .............................TTTT............... 211 3GNZ.P mol:aa TOXIN AVINHDAVPVWPQPEPADATQALAVRFKPQLDVVNGCQPYPAVDPQGNTSGGLKPSQAAACRDMSKAQVYSRSGTYNGYY AIMYSWYMPKDSPSTGIGHRHDWENVVVWLDNAASANIVALSASAHSGYKKSFPADKSYLDGITAKISYKSTWPLDHELG FTTSAGKQQPLIQWEQMTQAARDALESTDFGNANVPFKSNFQDKLVKAFFQ ...........................................TTTT...........................TTTTT. ............TTTT...............TTTTT........TTTT...TTTT....TTTTT.......TTTT..... .............................TTTT.TTTTTTT.......... 310 3OAJ.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION AKKTMGIHHITAIVGHPQENTDFYAGVLGLRLVKQTVNFDDPGTYHLYFGNEGGKPGTIITFFPWAGARQGVIGDGQVGV TSYVVPKGAMAFWEKRLEKFNVPYTKIERFGEQYVEFDDPHGLHLEIVEREEGEANTWTFGEVTPDVAIKGFGGATLLSE QPDKTADLLENIMGLERVGKEGDFVRYRSAGDIGNVIDLKLTPIGRGQMGAGTVHHIAWRANDDEDQLDWQRYIASHGYG VTPVRDRNYFNAIYFREHGEILFEIATDPPGFAHDETQETMGEKLMLPVQYEPHRTQIEQGLLPFEVREL .....................................TTTTTTT......TTTTTTTTT.....TTTT.....TTTT... .....TTTT..................TTTTT......TTTT.................TTTTTTTTT..........TT TT..................TTTT.......TTTT..............TTTT........................... ......TTTT......TTTT.......TTTTTTTTTTTTTTTT........................... 290 3I0W.A mol:aa HYDROLASE,LYASE/DNA MDFDMIEEKKDSVIVRNVENFELKDIFDCGQCFRWHRQENGNYIGIAFEKVVEVQKIGEDVVIYNINEEEFKNVWSEYFD LYRDYGEIKKELSRDPLLKKSVDFGEGIRILRQDPFEILLSFIISANNRIPMIKKCINNISEKAGKKLEYKGKIYYAFPT VDKLHEFTEKDFEECTAGFRAKYLKDTVDRIYNGELNLEYIKSLNDNECHEELKKFMGVGPQVADCIMLFSMQKYSAFPV DTWVKKAMMSLYVAPDVSLKKIRDFGREKFGSLSGFAQQYLFYYARENNI ........TTTT..TTTTTTT................TTTT....TTTTT......TTTT..TTTT.............T TTT.....................TTTT................TTTT....................TTTTT....... .................................TTTT..................TTTT..............TTTT... .............TTTT................................. 111 3C1Q.A mol:aa TRANSPORT PROTEIN FAFKRGISTPDLALITRQLATLVQSGPLEECLRAVAEQSEKPRIRTLVAVRAKVTEGYTLSDSLGDYPHVFDELFRSVAA GEKSGHLDSVLERLADYAENRQKRSKLQQAS ..................................................................TTTTTT........ ...........................TTTT 138 2QL8.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION QDERWNHPLYTTTAINDEELEGHAYIPGGLKVQTSSPNDHPGTNPEQLLGLSLSTCLEATLEAVEKEHGLPHTGAVRVKV AFIGARAEYQFLVHAQVVKGVDFDTAKAFTNEIENRCPVSKLLKNSGNYTIETVTDFK ....TTTTTTT.......TTTT...TTTT................................................... ....TTTTT........TTTT.....................TTTTTTT......... 161 3CHM.A mol:aa PLANT PROTEIN EQKQAEIIDQLVKRASTCKSEALGPLIIEATSHPSLFAFSEILALPNVAQLEGTTDSVYLDLLRLFAHGTWGDYKCNATR LPHLSPDQILKLKQLTVLTLAESNKVLPYDTLMVELDVSNVRELEDFLINECMYAGIVRGKLDQLKRCFEVPFAAGRDLR P TTTT..........TTTT..............TTTT..............TTTTTTT....................... .......................TTTT.........................TTTT......TTTTT........TTTT. . 146 3CZ6.A mol:aa PROTEIN BINDING RSDFSNEDIYDNIDPDTISFPPKIATTDLFLPLFFHFGSTRQFDKLHEVISGDYEPSQAEKLVQDLCDETGIRKNFSTSI LTCLSGDLVFPRYFLNFKDNVNPPPNVPGIWTHDDDESLKSNDQEQIRKLVKKHGTGRERKRFFEK .........TTTT.....TTTTTTTTTTTT........TTTTT...........TTTTT..................... ...TTTT.........TTTTTTTTTTTTTT........................TTTT........ 149 3EYT.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION SNAKAPELQIQQWFNSATDLTLADLRGKVIVIEAFQLCPGCVHGIPLAQKVRAAFPEDKVAVLGLHTVFEHHEATPISLK AFLHEYRIKFPVGVDQPGDGAPRTAAYQRGTPSLLLIDKAGDLRAHHFGDVSELLLGAEIATLLGEAAP ............TTTT........TTTT.........TTTTT.............TTTTT........TTTTTT...... .................TTTT...TTTT.TTTT....TTTT............................ 146 2WZO.A mol:aa CELL CYCLE GRPVFPIGLGGLTVYSLGEIITDRPGFHDESAIYPVGYCSTRIYASMKCPDQKCLYTCQIKDGGVQPQFEIVPEDDPQNA IVSSSADACHAELLRTISTTMGKLMPNLLPAGADFFGFSHPAIHNLIQSCPGARKCINYQWVKFDV ........TTTT........TTTTT...TTTT..TTTT.......TTTTTTT..........TTTTT.....TTTTT... ........................TTTT........TTTT.........TTTT..TTTT....... 127 2VZC.A mol:aa CELL ADHESION ERDAFDTLFDHAPDKLNVVKKTLITFVNKHLNKLNLEVTELETQFADGVYLVLLMGLLEGYFVPLHSFFLTPDSFEQKVL NVSFAFELMQDGGLEKPKPRPEDIVNCDLKSTLRVLYNLFTKYRNVE .......................................TTTTTTTTT...............TTTTTTTTT........ ..........................................TTTT. 113 3FF2.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION GSNLETAKAIAAYNAQDVDTYVSYTDDACEANYRGDVVREGKEGTRSGLAAAFARWPQNHAEIKDAQQVGTYVLREHVTR GPATDGSPLVEPFDVVAVYSFEGDKCSRVEFIR ........................TTTT...TTTT.TTTT...............TTTT........TTTTT........ ..TTTT..............TTTTT........ 181 3FFV.A mol:aa PROTEIN BINDING MDDLTAQALKDFTARYCDAWHEEHKSWPLSEELYGVPSPCIISTTEDAVYWQPQPFTGEQNVNAVERAFDIVIQPTIHTF YTTQFAGDMHAQFGDIKLTLLQTWSEDDFRRVQENLIGHLVTQKRLKLPPTLFIATLEEELEVISVCNLSGEVCKETLGT RKRTHLASNLAEFLNQLKPLL .............................TTTTTTT.TTTT...TTTT...............................T TTT........TTTTT..........................................TTTT....TTTTT.....TTTT ..................... 54 3FF5.A mol:aa PROTEIN TRANSPORT GPLGSPEFREPLIATAVKFLQNSRVRQSPLATRRAFLKKKGLTDEEIDLAFQQS TTTT.................TTTT............................. 230 3NQI.A mol:aa LIPID BINDING PROTEIN PQQWAGVVKVNDRGYVTFTDAAGTELIPTNTIPVTLNARAYIYCQVDEGQKSIKITLLADPTGIDATAITTPKVGESGDV TTNAPVGSLSFVSGYSTVAPFQFSENTIVLPVLYRVKNVTTTEDIKNELAKHTFTLVCYTDDIKSGDTILKLYLRYKVED EPAAIAERATRTSSFKAYEISQILREYTLKSGQTKPAKITIVAQQNEYNNKLEDTSTIEKVYEIEYKTAE ...................TTTT......TTTT.......................................TTTTTTT. ............TTTT.......TTTT....................................TTTT............. ................................TTTTTTT......TTTT.TTTTTTT............. 147 2XG5.B mol:aa CHAPERONE AAFHGEVVRPACTLAMEDAWQIIDMGETPVRDLQNGFSGPERKFSLRLRNCEFNSQGGNLFSDSRIRVTFDGVRGETPDK FNLSGQAKGINLQIADVRGNIARAGKVMPAIPLTEEALDYTLRIVRNGKKLEAGNYFAVLGFRVDYE TTTT.............TTTT.................................TTTTTTTTTTTTT.........TTTT ...............TTTT...TTTT......................................... 111 3HH1.A mol:aa TRANSFERASE AHKGTLYVVATPLGNLDDTFRAVNTLRNAGAIACEDTRRTSILLKHFGIEGKRLVSYHSFNEERAVRQVIELLEEGSDVA LVTDAGTPAISDPGYTASAAHAAGLPVVPVP ..............TTTT..........TTTT.TTTT....................TTTTT.................. ...TTTT........................ 67 2VE8.A mol:aa TRANSPORT PROTEIN EDDPLYDEAVRFVTESRRASISAVQRKLKIGYNRAARMIEAMEMAGVVTPMNTNGSREVIAPAPVRD ..TTTT.............................................TTTT............ 66 3D2Q.A mol:aa METAL BINDING, RNA BINDING PROTEIN DRLEVCREYQRGNCNRGENDCRFAHPADSTMIDTNDNTVTVCMDYIKGRCSREKCKYFHPPAHLQA .............TTTTTTTT.....TTTT..TTTTT..............TTTT........... 181 3NS2.A mol:aa HORMONE RECEPTOR KGLTDEEQKTLEPVIKTYHQFEPDPTTCTSLITQRIHAPASVVWPLIRRFDNPERYKHFVKRCRLISGDGDVGSVREVTV ISGLPASTSTERLEFVDDDHRVLSFRVVGGEHRLKNYKSVTSVNEFLNQDSGKVYTVVLESYTVDIPEGNTEEDTKMFVD TVVKLNLQKLGVAATSAPMHD .......................TTTT.....................TTTTT...TTTT....TTTTTTTTTT...... .TTTT...........TTTTT..........TTTT............TTTTT..............TTTT.......... ..................... 40 3E7U.X mol:aa ANTIMICROBIAL PROTEIN GFGCNGPWDEDDMQCHNHCKSIKGYKGGYCAKGGFVCKCY TTTT.TTTTT...........TTTT.....TTTTT..... 102 3E7H.A mol:aa TRANSFERASE AVNFEVKDQTLMMELVPERLRGETATFDIEADGKVYVEKGRRVTARHIRQLEKDGVNFIEVPVEYIVGKVSAKDYVNEAT GELIITANQEISLEALANLSQA ...................TTTT.TTTT.TTTTTTTTTTTT...............................TTTTTTTT ...................... 193 2P4F.A mol:aa CHAPERONE KDKPYKTLDDYLKLDKIKDLSKQEVEFLWRAKWSNRDDSLVAVVPYVKTFQGMYKYAVKNPLFVLPLPREVPVELQYVQW QFAGPNTVHCLITSLAEYKLHQDFAKPHTTIQFHLDLANDKDMVLMNGQVESDSNVSLQDAQLLLLNVQRFYGAMGSETS IAKERIQLLEDFNKGSQNFDINKLIQLAQSMEN ................TTTT............TTTTTTT......................................... ...TTTT......TTTTTTTT.....TTTT....TTTT............TTTT................TTTT...... ...............TTTT.............. 191 3IKB.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION ATSLEEITKAIADSQNKVFTEKNIEPLFAAPKTARINIVGQAPGIKAQESRLYWNDKSGDRLREWGVDYDTFYHSGYFAV IPDFYYPGKGKSGDLPPRKGFAQKWHQPILDLLPDIQLTILIGNYAQKYYLHQKSSVKLTDTVAHYKKYLPDYFPLVHPS PRNQIWSRHPWFEAQVVPDLKKIIQQIIQSS ..............................TTTT.................TTTT......................... .........TTTT....TTTT...........TTTT.................TTTT...........TTTTT....... ......TTTT..................... 373 3IKW.A mol:aa LYASE TAQTKNTQTLMPLTERVNVQADSARINQIIDGCWVAVGTNKPHAIQRDFTNLFDGKPSYRFELKTEDNTLEGYAKGETKG RAEFSYCYATSDDFRGLPADVYQKAQITKTVYHHGKGACPQGSSRDYEFSVYIPSSLDSNVSTIFAQWHGMPDRTLVQTP QGEVKKLTVDEFVELEKTTFFKKNVGHEKVARLDKQGNPVKDKNGKPVYKAGKPNGWLVEQGGYPPLAFGFSGGLFYIKA NSDRKWLTDKDDRCNANPGKTPVMKPLTSEYKASTIAYKLPFADFPKDCWITFRVHIDWTVYGKEAETIVKPGMLDVRMD YQEGKKVSKHIVDNEKILIGRNDEDGYYFKFGIYRVGDSTVPVCYNLAGYSER ......TTTT.......TTTTT......TTTTTT......TTTTTT.....TTTTT.......TTTT......TTTT... .............TTTT......................TTTT..........TTTTTTTT...........TTTTT.TT TT..................TTTTT........TTTT....TTTT...................TTTT..TTTTT..... ....TTTTTTTTT...TTTTTTTTT...TTTT.............TTTT...................TTTT........ ...................................TTTT.............. 103 3NZN.A mol:aa OXIDOREDUCTASE SNAVNLFGQKDRGNHVSGVDRGKVIMYGLSTCVWCKKTKKLLTDLGVDFDYVYVDRLEGKEEEEAVEEVRRFNPSVSFPT TIINDEKAIVGFKEKEIRESLGF ....TTTT....................TTTT........................................TTTTTTTT .TTTTTT................ 101 3GI7.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION NVETDQQTFACAAFNKQVAERELQSAYDELIERRDQFGDEAGLSRIEAAEKVWSQLRDADCKVETHAEQPGSNAYQIAWN SCIAQRSDERAEYLRSLGSQN TTTTT............................TTTTTTTTTT.....................TTTTTTTT........ ..................... 141 3GIX.A mol:aa SPLICING SFLLPKLTSKKEVDQAIKSTAEKVLVLRFGRDEDPVCLQLDDILSKTSSDLSKMAAIYLVDVDQTAVYTQYFDISYIPST VFFFNGQHMKVDYGSPDHTKFVGSFKTKQDFIDLIEVIYRGAMRGKLIVQSPIDPKNIPKY TTTT...............TTTT.......TTTT.............TTTTTTTT.....TTTTT..........TTTT. ..TTTTT.........TTTT......................................... 255 3KWS.A mol:aa ISOMERASE DLELKLSFQEGIAPGESLNEKLDFEKLGVVGFEPGGGGLAGRVNEIKQALNGRNIKVSAICAGFKGFILSTDPAIRKECD TKEIIAAAGELGSTGVIIVPAFNGQVPALPHTETRDFLCEQFNEGTFAAQHGTSVIFEPLNRKECFYLRQVADAASLCRD INNPGVRCGDFWHTWEETSDGAFISGGEYLQHVHVASRKRRSPGEDGDADNYINGFKGLKIGYNNYVSFECGCQGDRNVV VPAAVKLLREQWEQA ........TTTT............TTTT.......TTTT..........TTTT.............TTTT.......... .....................TTTTTTT................................TTTTTTTTT........... ..TTTT...TTTTTTTTTTT................TTTTT..TTTT................................. ............... 92 3IP4.C mol:aa LIGASE KVTREEVEHIANLARLQISPEETEEMANTLESILDFAKQNDSADTEGVEPTYHVLDLQNVLREDKAIKGIPQELALKNAK ETEDGQFKVPTI ............................................TTTT..TTTT.....................TTTTT TTTTTT...... 95 3IPJ.A mol:aa TRANSFERASE SNANKYNKIANELIKIIGEDNIISITHCATRLRVMVKDREIINDKKVEKVDEVKGVFFTSGQYQIILGTGIVNKVYAEVE KMGLKTLSKKEQDEL .TTTT......................TTTT....TTTT..........TTTT.....TTTT.....TTTT......... ......TTTT..... 69 3IPF.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION NRQFLSLTGVSKVQSFDPKEILLETIQGVLSIKGEKLGIKHLDLKAGQVEVEGLIDALVYPLEHHHHHH ......TTTT......TTTT....TTTT..............TTTTT...........TTTT....... 76 3KUC.B mol:aa GTP BINDING PROTEIN/TRANSFERASE NTIRVFLPNKQRTVVRVRNGMSLHDCLMKKLKVRGLQPECCAVFRLLHEHKGKKARLDWNTDAASLIGEELQVDFL ......TTTTT......TTTT............................TTTT....TTTT....TTTT....... 114 3ENU.A mol:aa STRUCTURAL PROTEIN TIEVPVLTFVPVQVSAELENRGCWVKFFDKKNFQGDSLFLSGPATLPRLIGPFGYDWENKVRSVKVGPRANLTIFDNHNY RDEDKFLDAGANVANLSKEMGFFDNFRSMVLNCI .............................................TTTT.TTTTT.TTTT......TTTT......TTTT T......TTTT.TTTT......TTTT........ 143 2AQ6.A mol:aa OXIDOREDUCTASE VFDDKLLAVISGNSIGVLATIKHDGRPQLSNVQYHFDPRKLLIQVSIAEPRAKTRNLRRDPRASILVDADDGWSYAVAEG TAQLTPPAAAPDDDTVEALIALYRNIAGEHSDWDDYRQAMVTDRRVLLTLPISHVYGLPPGMR .....................TTTT...........TTTTT......TTTT.................TTTTTT...... .........TTTT................TTTT.........................TTTTT 137 3KLQ.A mol:aa CELL ADHESION STVQTSISVENVLERAGDSTPFSVALESIDAMKTIEEITIAGSGKASFSPLTFTTVGQYTYRVYQKPSQNKDYQADTTVF DVLVYVTYDEDGTLVAKVISRRAGDEEKSAITFKPKRLVKPIPPRQPDFPKTPLPLA ..............TTTT.........TTTTT...................................TTTTTT....... ........TTTT.........TTTT................................ 189 3L15.A mol:aa TRANSCRIPTION GLGTARLQLVEFSAFVEPQRHLFVHISQLESVDVRQIYDKFPEKKGGLRELYDRGPPHAFFLVKFWADLNWGFYGVSSQY ESLEHTLTCSSKVCSFGKQVVEKVETERAQLEDGRFVYRLLRSPCEYLVNFLHKLRQLPERYNSVLENFTILQVVTNRDT QELLLCTAYVFEVSTSERGAQHHIYRLVR ...TTTT..................................TTTTTT................................. .............TTTTT.............TTTT...........................TTTT..........TTTT T.............TTTT........... 148 3N9B.A mol:aa LIGASE LLRYCVQKHDASRLHYDFRLELDGTLKSWAVPKGPCLDPAVKRLAVQVEDHPLDYADFEGSIPQAGDVIVWDRGAWTPLD DPREGLEKGHLSFALDGEKLSGRWHLIRTNLRGKQSQWFLVKAKDGEARSLDRFDVLKERPDSVLSER .........TTTTT......TTTTT.....TTTT...TTTT....................................TTT T...............TTTT..........TTTTTTT.......TTTT.TTTTT........TTTTT. 270 3NO2.A mol:aa UNKNOWN FUNCTION SPQHLLVGGSGWNKIAIINKDTKEIVWEYPLEKGWECNSVAATKAGEILFSYSKGAKITRDGRELWNIAAPAGCEQTARI LPDGNALVAWCGHPSTILEVNKGEVLSKTEFETGIERPHAQFRQINKNKKGNYLVPLFATSEVREIAPNGQLLNSVKLSG TPFSSAFLDNGDCLVACGDAHCFVQLNLESNRIVRRVNANDIEGVQLFFVAQLFPLQNGGLYICNWQGHDREAGKGKHPQ LVEIDSEGKVVWQLNDKVKFGISTICPIRE ........TTTT......TTTTT........TTTT.......TTTT.....TTTT...TTTT........TTTT...... TTTT.......TTTT................................TTTT.....TTTTT.....TTTT.......... .......TTTT...............TTTTT...............TTTT.....TTTT.......TTTTTTT.....TT TT..TTTT.......TTTTT.......... 120 3NOH.A mol:aa PEPTIDE BINDING PROTEIN SAQLEGSYIFCNPLLDKLSDEDIREQLKAFVTGKTDSIRTDTELSFDIYVSETDYALIRYADSLCERLNDAGADVQIKQY SGTLRSRAVSGKYEAFLSESDLVSTDALENADYIILDSAE ...........TTTT.................TTTT...TTTT.......TTTTT......................... ..................TTTT..............TTTT 110 3KDF.A mol:aa REPLICATION VDDLPRSRINAGLAQFIDKPVCFVGRLEKIHPTGKFILSDGEGKNGTIELEPLDEEISGIVEVVGRVTAKATILCTSYVQ FKEDSHPFDLGLYNEAVKIIHDFPQFYPLG ...............TTTT...........TTTT.....TTTT........................TTTT......... ..TTTT........................ 113 3KDF.B mol:aa REPLICATION HIVPCTISQLLSATLVDEVFRIGNVEISQVTIVGIIRHAEKAPTNIVYKIDDTAAPDVRQWVTVVPPETYVKVAGHLRSF QNKKSLVAFKIPLEDNEFTTHILEVINAHVLSK ..............TTTTT.TTTTT................TTTT....................TTTT.........TT TTT....TTTT..................TTTT 75 3KDE.C mol:aa DNA BINDING PROTEIN/DNA MKYCKFCCKAVTGVKLIHVPKCAIKRKLWEQSLGCSLGENSQICDTHFNDSQWKAAKGQTFKRRRLNADAVPSKV ...TTTTT.............................TTTT.........................TTTT..... 191 3KDG.A mol:aa HYDROLASE MDRVPIMYPIGQMHGTYILAQNENGLYIIDQHAAQERIKYEYFREKVGEVEPEVQEMIVPLTFHYSTNEALIIEQHKQEL ESVGVFLESFGSNSYIVRCHPAWFPKGEEAELIEEIIQQVLDSKNIDIKKLREEAAIMMSCKGNRHLRNDEIKALLDDLR STSDPFTCPHGRPIIIHHSTYEMEKMFKRVM ...........TTTTTT....TTTT.......................TTTT.............TTTT........... ..........TTTT......TTTTTTTTT.............TTTT.................................. .TTTTTTTTTT................TTTT 212 3KD3.A mol:aa UNKNOWN FUNCTION KNIIFDFDSTLIKKESLELILEPILQKSPAKLKEIEYITNLGQGDISFRDSLQKRLAIASPTKQSIKEFSNKYCPNLLTD GIKELVQDLKNKGFEIWIFSGGLSESIQPFADYLNIPRENIFAVETIWNSDGSFKELDNSNGACDSKLSAFDKAKGLIDG EVIAIGDGYTDYQLYEKGYATKFIAYEHIEREKVINLSKYVARNVAELASLI ....................TTTTTTTTT..............TTTT.........................TTTTTTTT TT..............................................TTTT......TTTTTTTT.............. .................TTTTTT..............TTTT........... 130 3CT6.A mol:aa TRANSFERASE MTYGIVIVSHSPEIASGLKKLIREVAKNISLTAIGGLENGEIGTSFDRVMNAIEENEADNLLTFFDLGSARMNLDLVSEM TDKELTIFNVPLIEGAYTASALLEAGATFEAIKEQLEKMLIEKRSHHHHH ........TTTT...........TTTTTT.......TTTT.................TTTT................... .................................................. 75 3IG9.A mol:aa VIRAL PROTEIN GGYVNIKTFTHPAGEGKEVKGMEVSVPFEIYSNEHRIADAHYQTFPSEKAAYTTVVTDAADWRTKNAAMFTPTPV .............TTTT.............................TTTT.TTTT.................... 80 3L9A.X mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION RDFFVITNSEYTFAGVHYAKGAVLHVSPTQKRAFWVIADQENFIKQVNKNIEYVEKNASPAFLQRIVEIYQVKFEGKNVH ...........TTTTT..TTTT....TTTTT................TTTT............................. 82 3LDC.A mol:aa TRANSPORT PROTEIN VPATRILLLVLAVIIYGTAGFHFIEGESWTVSLYWTFVTIATVGYGDYSPHTPLGMYFTCTLIVLGIGTFAVAVERLLEF LI .............................................TTTT............................... .. 370 3LDU.A mol:aa TRANSFERASE KNYTLISPCFFGEKLAREITNLGYEIIKTEDGRITYKTDEFGIAKSNWLRCAERVHLKIAEFEAKSFDELFENTKRINWS RYIPYGAQFPISKASSIKSKLYSTPDVQAIVKKAIVESLKKSYLEDGLLKEDKEKYPIFVFIHKDKVTISIDTTGDALHK RGYREKKAPIRETLAAGLIYLTPWKAGRVLVDPCGSGTILIEAAIGINAPGLNREFISEKWRTLDKKIWWDVRKDAFNKI DNESKFKIYGYDIDEESIDIARENAEIAGVDEYIEFNVGDATQFKSEDEFGFIITNPPYGERLEDKDSVKQLYKELGYAF RKLKNWSYYLITSYEDFEYEFGQKADKKRKLYNGLKTNFFQYPGPKPPRN .............................TTTT.....TTTT......TTTT............................ ...TTTT........TTTTTTT.......................................TTTTT.....TTTTTTTTT T.......................TTTT.....TTTT.......TTTTTTTTTT......TTTT................ TTTT...........................................TTTT............................. ..TTTT.......TTTT.......TTTT...................... 291 3FWK.A mol:aa TRANSFERASE GAMVMRLGDAAELCYNLTSSYLQIAAESDSIIAQTQRAINTTKSILINETFPKWSPLNGEISFSYNGGKDCQVLLLLYLS CLWEYYIVKLPTVFIDHDDTFKTLENFIEETSLRYSLSLYESDRDKCETMAEAFETFLQVFPETKAIVIGIRHTDPFGEH LKPIQKTDANWPDFYRLQPLLHWNLANIWSFLLYSNEPICELYRYGFTSLGNVEETLPNPHLRKDKNSTPLKLNFEWEIE NRYKHNEVTKAEPIPIADEDLVKIENLHEDYYPGWYLVDDKLERAGRIKKK ..........................TTTT...................TTTTTTTTTTTTT.................. ................TTTT......................TTTT..............TTTT.......TTTTTTTTT TTTTT..TTTT......TTTTTT.......................TTTT.TTTTT........TTTT....TTTT.... .....TTTTT...............................TTTT...... 137 3FWZ.A mol:aa MEMBRANE PROTEIN SNAVDICNHALLVGYGRVGSLLGEKLLASDIPLVVIETSRTRVDELRERGVRAVLGNAANEEIQLAHLECAKWLILTIPN GYEAGEIVASARAKNPDIEIIARAHYDDEVAYITERGANQVVGEREIARTLELLETP .....TTTT...............................................TTTTTTTTTTT............. ..............TTTT...................TTTT................ 79 3A4R.A mol:aa TRANSCRIPTION GPLGSQELRLRVQGKEKHQMLEISLSPDSPLKVLMSHYEEAMGLSGHKLSFFFDGTKLSGKELPADLGLESGDLIEVWG TTTT...........TTTT......TTTT..............TTTT....TTTTT..TTTT.......TTTT...... 132 3BFQ.G mol:aa STRUCTURAL PROTEIN/STRUCTURAL PROTEIN AKPCTVSTTNATVDLGDLYSFSLMSAGAASAWHDVALELTNCPVGTSRVTASFSGAADSTGYYKNQGTAQNIQLELQDDS GNTLNTGATKTVQVDDSSQSAHFPLQVRALTVNGGATQGTIQAVISITYTYS .....TTTTTT.............TTTT..............TTTT...........TTTT.......TTTT.....TTT T...TTTT......TTTTT...........TTTT.................. 102 3GE3.E mol:aa OXIDOREDUCTASE STLADQALHNNNVGPIIRAGDLVEPVIETAEIDNPGKEITVEDRRAYVRIAAEGELILTRKTLEEQLGRPFNMQELEINL ASFAGQIQADEDQIRFYFDKTM ...................TTTT..........TTTT......TTTTT...TTTT......................... .........TTTT......... 262 3LUM.A mol:aa HYDROLASE RMEIVKIPVVVHVVWNEEEENISDAQIQSQIDILNKDFRKLNSDVSQVPSVWSNLIADLGIEFFLATKDPNGNQTTGITR TQTSVTFFTTSDEVKFASSGGEDAWPADRYLNIWVCHVLKSEIGQDILGYAQFPGGPAETDGVVIVDAAFGTTGTALPPF DKGRTATHEIGHWLNLYHIWGDELRFEDPCSRSDEVDDTPNQADPNFGCPSYPHVSCSNGPNGDMFMNYLDYVDDKCMVM FTQGQATRVNACLDGPRSSFLA ..............TTTT..................................................TTTT........ ........TTTT.............TTTTT..........TTTT........TTTT................TTTTTTTT TTT..............TTTT..TTTTTTTT....TTTT...........TTTT.TTTTTTTT.TTTTTTT......... .............TTTTT.... 87 3LUU.A mol:aa BIOSYNTHETIC PROTEIN SDPRTQPLEIRPLISRVEVDWADGHTSRLTFEHLRVECPCAQIVTGKEHVSVVEVVPVGHYAVQLHFSDGHNTGIFTWEY LRRLDAE .TTTTT..............TTTT......................TTTT.......TTTTT....TTTT.......... ....... 135 3LUC.A mol:aa RNA BINDING PROTEIN KQFHTGIEIKVWAIACFAPQRQCTEVHLKSFTEQLRKISRDAGMPIQGQPCFCKYAQGADSVEPMFRHLKNTYAGLQLVV VILPGKTPVYAEVKRVGDTVLGMATQCVQMKNVQRTTPQTLSNLCLKINVKLGGV ........TTTT......TTTTTT.........................TTTT...................TTTT.... ...TTTT................................................ 152 3LUR.A mol:aa TRANSCRIPTION ACTIVATOR GEYQLQQLASLTLVGIKETYENGRQAQQHIAGFWQRCYQEGVIADLQLKNNGDLAGILGLCIPELDGKSYIAVTGDNSAD IAKYDVITLASSKYVFEAQGAVPKAVQQKEEVHHYIHQYQANTVKSAPFFELYQDGDTTSEKYITEIWPVKG ...................................................TTTT........TTTT............. ....................TTTT...............TTTTT............TTTTTTT......... 142 3IU6.A mol:aa TRANSCRIPTION MNVTLLIQELIHNLFVSVMSHQDDEGRYSDSLAEIPAVDPNFPNKPPLTFDIIRKNVENNRYRRLDLFQEHMFEVLERAR RMNRTDSEIYEDAVELQQFFIKIRDELCKNGEILLSPALSYTTKHLHNDVEKERKEKLPKEI ......................TTTT.....TTTTTTTTTTTTTT................................... ...TTTT.........................TTTT.......................... 106 3IUO.A mol:aa HYDROLASE RVRTLANKSKKVSIVQQIDRKVALDDIAVSHGLDFPELLSEVETIVYSGTRINIDYFINEVDEDHLEDIFEYFKESTTDS LEEAQELGKDYSEEEIRLVRIKFLSE ...........................................................................TTTTT TTTTTTTTTTT............... 77 3IUW.A mol:aa RNA BINDING PROTEIN QHPTIHTLKIETEFFKAVKERRKTFEIRKNDRNFQVGDILILEEYNGYLDDECEAEVIYITDYAQREGYVVLGIELH ....................TTTT.....TTTT.TTTT.......................TTTTTTTT........ 32 3IUF.A mol:aa PROTEIN BINDING EDRDKPYACDICGKRYKNRPGLSYHYAHSHLA TTTTTT..TTTTT..............TTTT. 140 2OX7.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION SLIPKFRAWDTYEKELENVTPLFDDSNSIAIITDFQIKGSPGTSEIEIGSYDTTFNWDEFPYVIQSTGLKDKNGVEIFEG DILVYDAPKKYAHRRSHEIAYADGRFFWEFLDLVFCQSNILYRDGYLVIGNIHENPELLE .........TTTTT.TTTT....TTTT............TTTTT............TTTT..........TTTT...TTT T........TTTT........TTTT..TTTT..........TTTT.....TTTTTTTTTT 92 3LHR.A mol:aa TRANSCRIPTION REGULATOR GSPDPEIFRQRFRQFGYQDSPGPREAVSQLRELCRLWLRPETHTKEQILELVVLEQFVAILPKELQTWVRDHHPENGEEA VTVLEDLESELD ......................................TTTTT..................................... ............ 120 3LHE.A mol:aa TRANSCRIPTION REGULATOR VYGSEVESKIIEFTIVGADEIIAEKLGISVGDFVYKIIRLRIIHSIPTIEHTWPISVIPGVELGLQVGTSVVRVKGIRPD DKEKQFNLTNQDFLRVEQVAYLTDGRTFEYSYADHLPETF .TTTTTTT....................TTTT.........TTTTT.......TTTTTT..................... ........TTTT.........TTTT............... 250 3LHO.A mol:aa HYDROLASE GHTDVNALFAALWQDYIKTPSAAKIHQLLGHGAPIINDHIALRTFNIAKVNLSVLAKHFTSIGYVDSGDYKFEQKKLIAK HFEHPDPKQPKVFISELLVEEFSPEVQKSIHGLIDQVDIAATTADNFIYSGRHWDVDKATYQALLAESEYAAWVAALGYR ANHFTVSINDLPEFERIEDVNQALKQAGFVLNSSGGEVKGSPEVLLEQSSTADKVVVNFTDGDVEIPSCFYEFARRYPAN GQLYTGFVAA .............................TTTT......................................TTTTT.... ...TTTTTT..................................TTTT................................. ......TTTTTTTT..................TTTTTTT...................TTTT.................. .......... 115 3ER7.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION TTLDRYFDLFDASRTDEKAFDDLISLFSDEITFVLNGQEQHGIDAWKQFVRVFTANQDIKHYAGWVPSETGDTETRWAVC GKSADGSVFTQDGTDIARLNADGKIVYLANVPDDT .................................TTTTT.............TTTTT...........TTTT......... ..TTTT.............TTTT............ 149 2QZQ.A mol:aa SIGNALING PROTEIN, LIPID BINDING PROTEIN GSLLPRLPSEPGTLLTLTIEKIGLKDAGQCIDPYITVSVKDLNGIDLNPVQDTPVATRKEDTYIHFSVDVEIQRHLEKLP KGAAIFFEFKHYKPKKRFTSTKCFAFEDEIKPGPIVIELYKKPTDFKRKKLNLLTKKPLYLHLNQTLHK .......................TTTT..TTTT.......TTTT...............TTTT................T TTT.........TTTTT...........................TTTTTTT.................. 140 3N1E.A mol:aa TRANSPORT PROTEIN DQWSMLRHFDHITKDYHDHIAEISAKLVAIMDSLFDKLLSKYEVKAPVPSPCFRNICKQMTKMHEAIFDLLPEEQTQMLF LRINASYKLHLKKQLSHLNVINDGGPQNGLVTADVAFYTGNLQALKGLKDLDLNMAEIWE ............TTTT..............................TTTT................TTTTT......... ............................................TTTTTTT......... 126 3LQ9.A mol:aa SIGNALING PROTEIN DEHLCANLQLLQESLAQARLGSRRPARLLPSQLVSQVGKELLRLAYSEPCGLRGALLDVCVEQGKSCHSVGQLALDPSLV PTFQLTLVLRLDSRLWKIQGLFSSANSPSQSLTLSTGFRVIKKKLY ..................TTTT.........................TTTT...........TTTT.......TTTTTT. ...........TTTTT...TTTTTTT.................... 93 2WTP.A mol:aa METAL BINDING PROTEIN STATAQAMAKRHATLYGDPAGQSQASRIIDVKPGMRYVNVDSGETVAFRAGEKIVAWTFAQMVRDTSVDLGLLMPDLPGS AGVRVYIDRSDLF ...............................TTTT.....TTTT....TTTTT........TTTT........TTTTTTT TTT......TTTT 94 3IC3.A mol:aa OXIDOREDUCTASE NATGPKQQPLPPDVEGREDAIEVLRAFVLDGGLSIAFRAFEDPEWGLLLVDIARHAARSYARESEYTEDEALERIVEFEA ELSRPTDTTTERTQ ..........TTTTTTTTTT.......TTTTT................................................ .............. 90 3M8J.A mol:aa TRANSCRIPTION GGDAFLLKLRESALSSGSMSEEQFFLLIGISSIHSDRVILAMKDYLVSGHSRKDVCEKYQMNNGYFSTTLGRLTRLNVLV ARLAPYYTDS ..............TTTT...........................TTTT............................... .......... 154 2ZCA.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION SPGERFLDWLKRLQGQKAWTAARAAFRRSLAFPPGAYPRAPYVEPFLAKGDWRQEEREAHYLVAALYALKDGDHQVGRTL ARALWEKAQGSASVEKRFLALLEADRDQIAFRLRQAVALVEGGIDFARLLDDLLRWFSPERHVQARWAREYYGA ................................TTTTTTTTTTTT.........................TTTT.TTTT.. ........TTTT............TTTTT.........................TTTTTTT............. 214 3F7Q.A mol:aa CELL ADHESION DLGAPQNPNAKAAGSRKIHFNWLPPSGKPMGYRVKYWIQGDSESEAHLLDSKVPSVELTNLYPYCDYEMKVCAYGAQGEG PYSSLVSCRTHQEVPSEPGRLAFNVVSSTVTQLSWAEPAETNGEITAYEVCYGLVNDDNRPIGPMKKVLVDNPKNRMLLI ENLRESQPYRYTVKARNGAGWGPEREAIINLATQPKRPMSIPIIPDIPIVDAQS .............TTTT....................TTTT....................TTTT.........TTTT.. ..........................TTTT.......TTTTTTT...........TTTT.TTTT.....TTTTTT..... ...TTTT.........TTTT.......................TTTT...TTTT 126 3F7E.A mol:aa UNKNOWN FUNCTION VAVPEGYESLLERPLYGHLATVRPDGTPQVNAWFAWDGEVLRFTHTTKRQKYRNIKANPAVASVIDPDNPYRYLEVRGLV EDIVPDPTGAFYLKLNDRYDGPLTEPPADKADRVIIVVRPTAFSKQ ...TTTT...............TTTT..........TTTT.....TTTT................TTTTTTTT....... .....TTTT.................TTTT................ 93 3MHS.C mol:aa HYDROLASE/TRANSCRIPTION REGULATOR/PROTEI EETITIDSISNGILNNLLTTLIQDIVARETTQQQLLKTRYPDLRSYYFDPNGSLDINGLQKQQESSQYIHCENCGRDVSA NRLAAHLQRCLSR .......................................TTTT.....TTTT..TTTT....TTTTT...TTTTT..... ............. 56 3HSH.A mol:aa PROTEIN BINDING GSSGVRLWATRQAMLGQVHEVPEGWLIFVAEQEELYVRVQNGFRKVQLEARTPLPR .....................TTTT...TTTTT.....TTTT.............. 140 3KKG.A mol:aa LYASE GQDRSPIETQNVETVLRLFDEGWGAQDGWRDVWRETTPGFRSIFHSNQAVEGIEQAIAFNAVLFEGFPRLEVVVENVTVE GDNVVVQARLTGAQDGPFLGVPPSGQVDVPDVTLFTLADGQVIERYFTDLLAVTAISAPP ..................TTTTTTTTTTT.......TTTT...TTTT...................TTTT.........T TTT.............TTTTT...............TTTTT.......TTTTTTTTTT.. 122 3KK4.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION SNAEIFIRANQRSYSVQARSLRLHGVATSVRLEQLFWDVLEEIAARDGRVTQLIERLYDELVQYRGEAANFTSFLRVCCL RYQVLQAEGRIPADATVPIRSLDAQAVLRGLPANLYDSRPLG ...TTTTTT............TTTTT.........................................TTTT......... ........TTTT.TTTT..........TTTT........... 100 3KKF.A mol:aa OXIDOREDUCTASE GAENNVRLSRIIIDPERLEEYNAYLKEEIEVSRLEPGVLVLYAVAEKERPNHVTILEIYADEAAYKSHIATPHFKKYKEG TLDVQLELIDATPLIPGLKK ..................................TTTT.......TTTTTTT............................ ............TTTTTT.. 101 3NRF.A mol:aa UNKNOWN FUNCTION DAVVFARQGDKGSVSVGDKHFRTQAFKVRLVNAAKSEISLKNSCLVAQSAAGQSFRLDTVDEELTADTLKPGASVEGDAI FASEDDAVYGASLVRLSDRCK ...............TTTT....................TTTT.....TTTT.............TTTTTTTT....... ..TTTT..........TTTT. 102 3NRW.A mol:aa RECOMBINATION RPSLSPREARDRYLAHRQTDAADASIKSFRYRLKHFVEWAEERDITARELTGWKLDEYETFRRGSDVSPATLNGEQTLKN WLEYLARIDVVDEDLPEKVHVP ................TTTTT........................................................... ...........TTTT....... 180 3HYN.A mol:aa NUCLEOTIDE-BINDING PROTEIN YQNANYSAFYVSEPFSESNLGANSTHDFVYYNLRWKGEDNSFPFNDAHDKTYNVRDGSDWEKTLKPRLHTRLDNSKNIIL FLSSITANSRALREENYGIGTKGLPVIVIYPDYDKKSDIVDSNGNFKKQIKDLWDKLPAFRDNSSVATLHIPCTKSVIIS ALNNEDFVNTADAEKYYYKP ...........TTTTTTTTTTT..TTTT......TTTTTTTT...TTTTTT.TTTT........................ ..TTTT.......................TTTT.......TTTT.................................... ...TTTT............. 113 3EMF.A mol:aa CELL ADHESION TPVTNKLKAYGDANFNFTNNSIADAEKQVQEAYKGLLNLNEKNASDKLLVEDNTAATVGNLRKLGWVLSSKNGTRNEKSQ QVKHADEVLFEGKGGVQVTSTSENGKHTITFAL ........TTTTTTTTTTTT............TTTT.TTTT.TTTT.TTTTTTT.................TTTT..... ..TTTT......TTTT......TTTT....... 110 3EMI.A mol:aa CELL ADHESION TEVKIGAKTSVMKEKDGKLFTGKANKETNKVDGANATEDADEGKGLVTAKDVIDAVNKTGWRIKTTDANGQNGDFATVAS GTNVTFASGNGTTATVTNGTDGITVKYDAK .............TTTTT........TTTT....................................TTTT........TT TT......TTTT......TTTT........ 115 3C8L.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION GARKRLIIEGGIDQHGQEPTIAASRAVRNAIAHNALPGVWEVAGLSHPNEIIEVQVAVPYPEQVREEEVLAVLPFGRKTL TVESGGIVQGRAIPELNDKNDELIAIAAVTVLIEN ............TTTT..............TTTT.TTTT.......TTTT.......TTTT................... ................................... 244 3C8W.A mol:aa LYASE LSANSLEGVIDNEFSPAPRWLNTYPAGPYRFINREFFIIAYETDPDLLQAILPPDELLEPVVKFEFIRPDSTGFGDYTES GQVVPVRYKGEEGGFTISFLDCHAPIAGGREIWGFPKLAKPKLFVEEDTLIGILKYGSIDIAIATGYKHRPLDAEKVLES VKKPVFLLKNIPNVDGTPLVNQLTKTYLTDITVKGAWTGPGSLELHPHALAPISNLYIKKIVSVSHFITDLTLPYGKVVA DYLA ......TTTTTTTTT...TTTT................................................TTTTT..... ......TTTTT..................................TTTT.....TTTTT......TTTT........... ............TTTT...............................TTTTT............................ TTTT 211 3IR4.A mol:aa OXIDOREDUCTASE SNAKLYIYDHCPFCVKARIFGLKNIPVELNVLQNDDEATPTRIGQKVPILQKDDSRYLPESDIVHYVDNLDGKPLLTGKR NPAIEEWLRKVNGYVNQLLLPRFAKSAFDEFSTPAARQYFIRKKEASSGSFDNHLAHSAGLIKKIGDDLRLLDKLIVQPN AVNGELSEDDIHLFPLLRNLTLVAGIHWPTKVADYRDNAKQTQINLLSSAI .......TTTT.....................TTTT...............TTTT.............TTTT.TTTT... ...........TTTT..............................................................TTT TTTT..................TTTT............TTTTT........ 263 3L23.A mol:aa ISOMERASE GKEIGLQIYSLSQELYKGDVAANLRKVKDGYSKLELAGYGKGAIGGVPDFKKAEDAGLKIISSHVNPVDTSISDPFKAIF KYSKEVTPKIEYWKATAADHAKLGCKYLIQPPTITTHDEAKLVCDIFNQASDVIKAEGIATGFGYHNHNEFNRVATKEQQ FKVGDQIYDLLKDTDPSKVYFEDVYWTVGQNDPVEYQKHPDRIKVLHIKDRAVFGQSGNFEIFKQYANGIKDYFVELEQP DGRTQFAGVKDCADYLIKAPFVK ................TTTT...................TTTTTTTT.TTTTTTTTT...........TTTT.TTTTT.. ..TTTTTTTT................................................TTTT.............TTTTT .....TTTTTTTTTTTTTT............TTTTTTTTTTTT......TTTTTTT.....TTTTTTTT........... ..................TTTT. 123 3L29.A mol:aa RNA BINDING PROTEIN DISAKDLRNIMYDHLPGFGTAFHQLVQVICKLGKDSNSLDIIHAEFQASLAEGDSPQCALIQITKRVPIFQDAAPPVIHI RSRGDIPRACQKSLRPVPPSPAIDAGWVCVFQLQDGKTLGLKI ................TTTT.................................................TTTT....... .................TTTT...........TTTT....... 220 3BO6.A mol:aa LYASE GPPQMSATNEDLKTNFHSLHNQMRQMPMSHFREALDAPDYSGMRQSGFFAMSQGFQLESHGGDVFMHAHRENPQCKGDFA GDKFHISVQREQVPQAFQALSGLLFSVDSPIDKWKVTDMERVDQQSRVAVGAQFTLYVKPDQENSQYSASSLHNTRQFIE CLESRLSESGLMPGQYPESDVHPENWKYVSYRNELRSGRDGGEMQSQALREEPFYRLMAE .................................TTTT........TTTT.TTTTT..............TTTTTTT.... .........................TTTT........TTTTTTTTTTTTTTTTTT......TTTT............... ................TTTT..TTTTTTT...TTTTTTTTT................... 90 2P1M.A mol:aa SIGNALING PROTEIN LKSEAVALESQTIAPLPNVTSKILAKVIEYLILAANYLNIKNLLDLTCQTVADMIKGKTPEEIRTTFNIKNDFTPEEEEE VRRENQWAFE .........TTTTT.TTTT...................................TTTT...................... .......... 181 2ZF9.A mol:aa STRUCTURAL PROTEIN AGPAAGQAYDAGNLDVASSPVKPTLSITKKTLTAAEAPNAKVTELSVEGAADKYAATGLHIQFDPKLKLIPDEDGALATA GRAARLLELKKAEADTDNSFFTATGSSTNNGKDGVLWSFVLQVPADAQPGDKYDVQVAYQSRTTNEDLFTNVKKDEEGLL QAWTFTQGIEQGYIQVESTTS ..........TTTT....TTTT..........TTTTTTTT......TTTTTTT..........TTTT....TTTTTTTT. ...TTTTTTT.....TTTT......TTTT..............TTTTTTTT..........TTTTT....TTTT...... ..................... 174 2X32.A mol:aa CARBOHYDRATE-BINDING PROTEIN HLAYSLDATASFLNFVSSKKTHVLETHRFDVLSGGISTAGEAQLVIDLNSVNTGIDVRNGRMRDYLFETATYSVATVTVP VDLAAVAGLAVGEDMLVDVSATLDLHGVPGVIDTQLNVQRLSATRIMVQNQSPLLIKAADYSLEAGIETLRNLASLNVIS TTVPVDFVLFYEAP ......TTTTT.......TTTTT.............TTTT...........................TTTTTT....... .........TTTT..........TTTTT.............TTTT................................... .............. 113 2X3G.A mol:aa VIRAL PROTEIN GDLKKVLNFHFSYIYTYFITITTNYKYGDTEKIFRKFRSYIYNHDKNSHVFSIKETSNGLHYHILVFTNKKLDYSRVHKH PSHSDIRIELVPKSISDIKNVYKYLKTKKDIKS .........TTTT...............................TTTT................................ TTTT............................. 367 2QPX.A mol:aa HYDROLASE GDDLSEFVDQVPLLDHHCHFLIDGKVPNRDDRLAQVSTEADKDYPLADTKNRLAYHGFLALAKEFALDANNPLAANDPGY ATYNHRIFGHFHFKELLIDTGFVPDDPILDLDQTAELVGIPVKAIYRLETHAEDFLEHDNFAAWWQAFSNDVKQAKAHGF VGFSIAAYRVGLHLEPVNVIEAAAGFDTWKHSGEKRLTSKPLIDYLYHVAPFIIAQDPLQFHVGYGDADTDYLGNPLLRD YLKAFTKKGLKVVLLHCYPYHREAGYLASVFPNLYFDISLLDNLGPSGASRVFNEAVELAPYTRILFASDASTYPEYGLA ARQFKQALVAHFNQLPFVDLAQKKAWINAICWQTSAKLYHQERELRV ......................TTTTTTT...........TTTT.....................TTTTTTTTT...... ..................TTTT.TTTT..............................................TTTTTT. .............................................TTTTT................TTTT.TTTTTTT.. .............TTTTTTTT.........TTTT...TTTT...............TTTT.................... ............................................... 383 3G5B.A mol:aa APOPTOSIS PGSSVSGTFGCLGGRLTIPGTGVSLLVPNGAIPQGKFYDLYLRINKTESTLPLSEGSQTVLSPSVTCGPTGLLLCRPVVL TVPHCAEVIAGDWIFQLKTQAHQGHWEEVVTLDEETLNTPCYCQLEAKSCHILLDQLGTYVFTGESYSRSAVKRLQLAIF APALCTSLEYSLRVYCLEDTPAALKEVLELERTLGGYLVEEPKPLLFKDSYHNLRLSLHDIPHAHWRSKLLAKYQEIPFY HVWNGSQKALHCTFTLERHSLASTEFTCKVCVRQVEGEGQIFQLHTTLTTQLGPYAFKIPLSIRQKICNSLDAPNSRGND WRLLAQKLSMDRYLNYFATKASPTGVILDLWEARQQDDGDLNSLASALEEMGKSEMLVAMTTD TTTTT....TTTT....TTTT......TTTTTTTTT..................TTTT...............TTTT... ......TTTT.........TTTTTT.....TTTTTTTTT......TTTT....TTTT..........TTTT......... .TTTT........................................................TTTTT.............. ......TTTT.........TTTTT.........TTTT..............TTTTTT.................TTTT.. ..........TTTT.....TTTT........................................ 185 2ZK9.X mol:aa HYDROLASE LASVIPDVATLNSLFNQIKNESCGTSTASSPCITFRYPVDGCYARAHKMRQILMNNGYDCEKQFVYGNLKASTGTCCVAW SYHVAILVSYKNASGVTEKRIIDPSLFSSGPVTDTAWRNACVNTSCGSASVSSYANTAGNVYYRSPSNSYLYDNNLINTN CVLTKFSLLSGCSPSPAPDVSSCGF .....................TTTTTTTTTTT.TTTTT.................................TTTTT.... ...........TTTT.......TTTTTTTT...........................TTTT...TTTT....TTTT.... .....TTTT..TTTTTT..TTTT.. 270 3M66.A mol:aa TRANSCRIPTION DYVDHSETLQKLVLLGVDLSKIEKHPEAANLLLRLDFEKDIKQMLLFLKDVGIEDNQLGAFLTKNHAIFSEDLENLKTRV AYLHSKNFSKADVAQMVRKAPFLLNFSVERLDNRLGFFQKELELSVKKTRDLVVRLPRLLTGSLEPVKENMKVYRLELGF KHNEIQHMITRIPKMLTANKMKLTETFDFVHNVMSIPHHIIVKFPQVFNTRLFKVKERHLFLTYLGRAQYDPAKPNYISL DKLVSIPDEIFCEEIAKASVQDFEKFLKTL ................................................................TTTT............ ...................TTTT......................................................... ......................................................................TTTTTTT... .............................. 130 3M6J.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION VSDRPAGRPLTVHRNVGRWLSEILHASIRDTGVSSRIEFVRRTLHGWVREEYSETELPNAVYRNLYFPGSGKIETISECD RLKNLVRNVTDTLVENYPQGLESEALLIALDGVKLELARIRKDIEYGDPR ..........................TTTTT.....................TTTTTT...................... .................................................. 99 3E8O.A mol:aa OXIDOREDUCTASE TVISHGTLSASAEHAAHLRQLLVHIAQATRQEDGCLLYLVSEDLSQPGHFLITEHWDNLGAHTHLALPGVTQAIDALKHL NVTDLKITAYEAGEAINIG ..........TTTTT................TTTT.......TTTTTTT........TTTTTTTTTT............. ................... 216 3E8T.A mol:aa TRANSPORT PROTEIN VEKCNLEDSACMTSAFQQALPTFVAGLPDHGVEVMDVLDLDDFAFDLSGLQFTLKEGKLKGLKGAVIDNVKWDLKKKNIE VDFHLDATVKGHYTAGGRILILPITGDGQMKLKLKNIHIHLVVSYEMEKDAEGVDHVIFKKYTVTFDVKDNAQFGLTNLF NGNKELSDTMLTFLNQNWKQVSEEFGKPVMEAAAKKIFKNIKHFLAKVPIAEIANV ....TTTT.........................TTTTT.......TTTTT......................TTTTT... .................TTTTT...........................TTTT......................TTTT. ........................................................ 464 3ILW.A mol:aa ISOMERASE VGRALPEVRDGLKPVHRRVLYAMFDSGFRPDRSHAKSARSVAETMGNYHPHGDASIYDSLVRMAQPWSLRYPLVDGQGNF GSPGNDPPAAMRYTEARLTPLAMEMLREIDEETVDFIPNYDGRVQEPTVLPSRFPNLLANGSGGIAVGMATNIPPHNLRE LADAVFWALENHDADEEETLAAVMGRVKGPDFPTAGLIVGSQGTADAYKTGRGSIRMRGVVEVEEDSRGRTSLVITELPY QVNHDNFITSIAEQVRDGKLAGISNIEDQSSDRVGLRIVIEIKRDAVAKVVINNLYKHTQLQTSFGANMLAIVDGVPRTL RLDQLIRYYVDHQLDVIVRRTTYRLRKANERAHILRGLVKALDALDEVIALIRASETVDIARAGLIELLDIDEIQAQAIL DMQLRRLAALERQRIIDDLAKIEAEIADLEDILAKPERQRGIVRDELAEIVDRHGDDRRTRIIA TTTTT.TTTTT.................TTTT................................TTTTTTTT........ .TTTT....TTTTT...........TTTT.........TTTT....TTTT.TTTT..........TTTT........... ..........TTTT.................TTTT..............................TTTT.........TT TT...............TTTTTT.......TTTTT.......TTTT............TTTTT........TTTTT.... ................................................................................ ................................................................ 249 2ILR.A mol:aa ONCOPROTEIN AESLELPKAIQDQLPRLQQLLKTLEEAPPVELQLLHECSPSQMDLLCAQLQLPQLSDLGLLRLCTWLLALSPDLSLSNAT VLTRSLFLGRILSLTSSASRLLTTALTSFAAKYTYPVCSALLDPVLQAPGTGPAQTELLCCLVKMESLEPDAQVLMLGQI LELPWKEETFLVLQSLLERQVEMTPEKFSVLMEKLCKTTSMAYAKLMLTVMTKYQANITETQRLGLAMALEPNTTFLRKS LKAALKHLG .....................................................................TTTT....... .............TTTT..............................TTTT.............TTTT............ .......................................................................TTTTTTT.. ......... 145 3BCY.A mol:aa UNKNOWN FUNCTION TEDGETVKVFEDLQGFETFIANETEDDDFDHLHCKLNYYPPFVLHESHEDPEKISDAANSHSKKFVRHLHQHIEKHLLKD IKQAVRKPELKFHEKSKEETFDKITWHYGEETEYHGRPFKIDVQVVCTHEDAVFVDYKTHPVGAN ..TTTTT....................TTTTT..............TTTT....TTTTTTTT.................. ......TTTT.TTTT....TTTT.........TTTTT..........TTTT.........TTTT. 120 3LWG.A mol:aa UNKNOWN FUNCTION EGLLVCTRLDQNLCAELISFGSGKATVCLTPKEFMLAEDDVVHAGFIVGAASFAALCALNKKNSLISSMKVNLLAPIEIK QEIYFNATITHTSSKKSTIRVEGEFMEIKVFEGDFEILVF .....TTTT...........TTTT...........TTTTTT...................TTTT.............TTT T...........TTTTT......TTTTT............ 99 3LWC.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION KRKFTIADASLERSPGQEADISVGNLGPITIGYGRYAPGQSLTETAVDDVIVLEGRLSVSTDGETVTAGPGEIVYPKGET VTIRSHEEGALTAYVTYPH ....................................TTTT....................TTTT....TTTT...TTTT. ................... 95 3H7H.B mol:aa TRANSCRIPTION MDPNLWTVKCKIGEERATAISLMRKFIAYQFTDTPLQIKSVVAPEHVKGYIYVEAYKQTHVKQAIEGVGNLRLGYWNQQM VPIKEMTDVLKVVKE ..........TTTTT.............TTTTTTT........TTTT.......TTTT......TTTT.....TTTT... ....TTTT....... 135 2W9Y.A mol:aa LIPID TRANSPORT GAMSVASLPEVKNFFPTEQLEFSSSITADEKPVLHEVFQKHSCGEMIDEVSKKHPELGKRLATVLEGNKKRLDGLSPAAV EYAKKLIHMVTTTLCSLTVGKPIDDADAKRLHQEFQSLSSEDQAALRKNNPDIKF TTTT......................TTTTT...........TTTT.........................TTTT..... .................................................TTTT.. 202 2JK9.A mol:aa APOPTOSIS DYCKPTRLDLLLDMPPVSYDVQLLHSWNNNDRSLNVFVKEDDKLIFHRHPVAQSTDAIRGKVGYTRGLHVWQITWAMRQR GTHAVVGVATADAPLHSVGYTTLVGNNHESWGWDLGRNRLYHDGKNQPSKTYPAFLEPDETFIVPDSFLVALDMDDGTLS FIVDGQYMGVAFRGLKGKKLYPVVSAVWGHCEIRMRYLNGLD ................................TTTT..TTTTT.......TTTT.........................T TTTT.....TTTT........TTTTTTTTT...TTTTT..TTTTTTT...TTTTTTTTTT....TTTT....TTTTT... .TTTTT........TTTT.........TTTT........... 111 3NPD.A mol:aa UNKNOWN FUNCTION GASLKDFELSKLEKVAKESSVGTPRAINEDILDQGYTVEGNQLINHLSVRASHAERRSNPDSVRSQLGDSVCSNTGYRQL LARGAILTYSFTEYKTNQPVATERFDAGSCR ...................TTTTTTTTTTTT.......TTTT..............TTTT.................... ............TTTTT.............. 182 3H8T.A mol:aa HEME-BINDING PROTEIN EAVTKTVTIDASKYETWQYFSFSKGEVVNVTDYKNDLNWDMALHRYDVRLNCGESGKGKGGAVFSGKTEMDQATTVPTDG YTVDVLGRITVKYEMGPDGHQMEYEEQGFSEVITGKKNAQGFASGGWLEFSHGPAGPTYKLSKRVFFVRGADGNIAKVQF TDYQDAELKKGVITFTYTYPVK ............TTTT....TTTTT....TTTT.........TTTTT................................. ...............TTTT..................TTTT..........TTTTT.............TTTT....... ....TTTTTTTT.......... 142 3M7K.A mol:aa HYDROLASE/DNA MTQCPRCQRNLAADEFYAGSSKMCKGCMTWQNLSYNANKEGHANTFTKATFLAWYGLSAQRHCGYCGISEAGFTSLHRTN PRGYHIQCLGVDRSDSFEGYSPQNARLACFICNRIKSNIFSASEMDVLGEAISKAWHGRGIA ...TTTTT......................................................TTTTT............T TTT.........TTTTTT..TTTTT..........TTTT....................... 137 3M7O.A mol:aa IMMUNE SYSTEM GWPKHTACNSGGLEVVYQSCDPLQDFGLSIDQCSKQIQSNLNIRFGIILRQDIRKLFLDITLMAKGSSILNYSYPLCEED QPKFSFCGRRKGEQIYYAGPVNNPGLDVPQGEYQLLLELYNENRATVACANATVTSS .........TTTT.......TTTT.....TTTTTTTTTTTT...........TTTT......TTTTT..........TTT TTTTTTTTTTTTT...........................TTTT............. 226 3L6T.A mol:aa HYDROLASE SNAMLDITTITRQNVTSVVGYYSDAKDDYYSKDSSFTSWQGTGAEALGLSGDVESARFKELLVGEIDTFTHMQRHVGDAK KERLGYDLTFSAPKGVSMQALIHGDKTIIEAHEKAVAAAVREAEKLAQARTTRKSVTQNTNNLVVATFRHETSRALDPDL HTHAFVMNMTQREDGQWRALKNDELMRNKMHLGDVYKQELALELTKAGYELRYNSKNNTFDMAHFS ..TTTT...........TTTT..TTTTTTTTTTTTTTT..........................TTTTTT.......... ....................TTTT.................................................TTTTT.. ...........TTTT......................................TTTTT...TTTTT 84 3JXO.A mol:aa TRANSPORT PROTEIN NLYFQGMIPLEQGIEFLSVNVEEDSPVVGKKLKDLPLPRDSIIAAIVRGGVLVVPRGDTEILSGDKLYVIVSAEAKETVE ETLL ..........TTTT.......TTTTTTTTT.......TTTT.....TTTTT....TTTT..TTTT......TTTTT.... .... 402 2WUU.A mol:aa TRANSFERASE AHAFWSTQPVPQTEDETEKIVFAGPMDEPKTVADIPEEPYPIASTFEWWTPNMEAADDIHAIYELLRDNYVESMFRFNYS EEFLQWALCPPSYIPDWHVAVRRKADKKLLAFIAGVPVTLRMGTPKYMKVKAQEKGQEEEAAKYDAPRHICEINFLCVHK QLREKRLAPILIKEVTRRVNRTNVWQAVYTAGVLLPTPYASGQYFHRSLNPEKLVEIRFSGIPAQYQKFQNPMAMLKRNY QLPNAPKNSGLREMKPSDVPQVRRILMNYLDNFDVGPVFSDAEISHYLLPRDGVVFTYVVENDKKVTDFFSFYRIPSTVI ILNAAYVHYYAATSMPLHQLILDLLIVAHSRGFDVCNMVEILDNRSFVEQLKFGAGDGHLRYYFYNWAYPKIKPSQVALV ML .TTTT.....................................TTTT.....TTTT......................... .........TTTT.........TTTTT..................................TTTT............... .TTTT................................TTTT......TTTT......TTTT....TTTTTTT........ ..TTTT.TTTT.....................TTTT..............TTTTT.....TTTTT............... ...........TTTT.........................TTTT..TTTTTT...........TTTT............. .. 50 2WUJ.A mol:aa CELL CYCLE LTPNDIHNKTFTKSFRGYDEDEVNEFLAQVRKDYEIVLRKKTELEAKVNE .....TTTT....TTTT................................. 213 2WUX.A mol:aa VIRAL PROTEIN DYSYRPTIGRTYVYDNKYYKNLDAVIKNAPLDNYLVAEDPFLGPGKNQKLTLFKEIRNVKPDTMKLVVGWKGKEFYRETW TRFMEDSFPIVNDQEVMDVFLVVNMRPTRPNRCYKFLAQHALRCDPDYVPHDVIRIVEPSWVGSNNEYRISLAKYTNSFE QFIDRVIWENFYKPIVYIGTDSAEEEEILLEVSLVFKVKEFAPDAPLFTGPAY TTTT........TTTTT.....................TTTTT................TTTT................. ......TTTT..................TTTT..................TTTT.TTTT...TTTT.............. ................................................TTTT. 163 3L51.B mol:aa CELL CYCLE GKVLDAIIQEKKSGRIPGIYGRLGDLGAIDEKYDIAISSCCHALDYIVVDSIDTAQECVNFLKKHNIGIATFIGLDKTVW AKKSKIQTPENTPRLFDLVKVKNEEIRQAFYFALRDTLVANNLDQATRVAYQRDRRWRVVTLQGQIIEQSGTSGGLEHHH HHH .............TTTT........................................................TTTTTTT TTT..............................TTTT..............TTTT.....TTTT...TTTT.......TT TT. 71 3FAU.A mol:aa HYDROLASE SLDLHGLHVDEALEHLMRVLEKKTEEFKQNGGKPYLSVITGRRIKPAVIKYLISHSFRFSEIKPGCLKVML ...TTTT.....................................................TTTTTT..... 518 3L0Q.B mol:aa TRANSFERASE LASYFIGVDVGTGSARAGVFDLQGRVGQASREITFKPKADFVEQSSENIWQAVCNAVRDAVNQADINPIQVKGLGFDATC SLVVLDKEGNPLTVSPSGRNEQNVIVWDHRAITQAERINATKHPVLEFVGGVISPEQTPKLLWLKQHPNTWSNVGHLFDL PDFLTWRATKDETRSLCSTVCKWTYLGHEDRWDPSYFKLVGLADLLDNNAAKIGATVKPGAPLGHGLSQRAASEGLIPGT AVSVSIIDAHAGTIGILGASGVTGENANFDRRIALIGGTSTAHASRSAHFISGIWGPYYSAILPEYWLNEGGQSATGALI DHIIQSHPCYPALLEQAKNKGETIYEALNYILRQAGEPENIAFLTNDIHLPYFHGNRSPRANPNLTGIITGLKLSTTPED ALRYLATIQALALGTRHIIETNQNGYNIDTASGGGTKNPIFVQEHANATGCALLPEESEALLGSAGTVAAGVFESLPEAA ASRIGKTVTPQTNKIKAYYDRKYRVFHQYHDHRYQALQ ..........TTTT......TTTT.............TTTT....................................... .....TTTT.....TTTTTTTTT.....TTTT................TTTT............................ .........................TTTTT...................TTTTTTTT.....TTTT..........TTTT .....................TTTT...TTTT....TTTTT.........TTTT...TTTTTTTTT......TTTT.... ......TTTT........................TTTT......TTTT...TTTT.TTTTTTTTT...........TTTT .....................TTTT.......TTTTTT................TTTTTTTTTTTTTTTTTTTTTTTTT. ............................TTTTTTTTT. 133 3D85.C mol:aa IMMUNE SYSTEM/CYTOKINE PAWTQCQQLSQKLCTLAWSAHPLVDVPHIQCGDGCDPQGLRDNSQFCLQRIHQGLIFYEKLLGSDIFTGEPSLLPDSPVG QLHASLLGLSQLLQPLSPSQPWQRLLLRFKILRSLQAFVAVAARVFAHGAATL .................TTTT...............................................TTTT.TTTT... ..................................................... 295 3MCQ.A mol:aa TRANSFERASE LIQRYFRRAHPSAVLGVGDDAALIQPSPGELAVSADLVANTHFYPNIDPWLIGWKSLAVNISDAAGAQPRWATLTIALPE ADEDWISKFAAGFFACAAQFDIALIGGDTTRGPLTISVQIGETPPGASLLRSTARADDDIWVSGPLGDAALALAAIQGRY PLSDTELAACGKALHQPQPRVVLGQALRGLAHSALDISDGLLADLGHILEHSQVGAEVWLKAIPKSEVVSAHSQEVAIQK ILSGGDDYELCFTASTQHRQQIADIGRQLSLDAVIGRITDTQQLVIHGLDDAPLT ...TTTT....TTTTT.......................TTTTT.................................... ...........................................TTTT..TTTT.TTTT...................... ..........................TTTTT.....TTTT...............................TTTT..... TTTT..................................TTTT.....TTTT.... 54 3MCB.A mol:aa CHAPERONE AMSKLGLRQVTGVTRVTIRKSKNILFVITKPDVYKSPASDTYIVFGEAKIEDLS .........TTTT.....TTTTTT...TTTT....TTTT............... 58 3MCB.B mol:aa CHAPERONE VNNISGIEEVNMFTNQGTVIHFNNPKVQASLAANTFTITGHAETKQLTEMLPSILNQL .............TTTT....TTTT....TTTTT...............TTTT..... 182 2R2A.A mol:aa TOXIN AEICLITGTPGSGKTLKVSANDEFKPDENGIRRKVFTNIKGLKIPHTYIETDAKKLPKSTDEQLSAHDYEWIKKPENIGS IVIVDEAQDVWPARSAGSKIPENVQWLNTHRHQGIDIFVLTQGPKLLDQNLRTLVRKHYHIASNKGRTLLEWKICADDPV KASSAFSSIYTLDKKVYDLYES ........TTTTTTTTT.........TTTT........TTTT.........TTTTTT..TTTT.TTTT........TTTT ...TTTT.TTTT..TTTT..........TTTTTT......TTTT...........................TTTTTTTTT TTTTT.......TTTT...... 123 3DD7.A mol:aa RIBOSOME INHIBITOR MRHISPEELIALHDANISRYGGLPGMSDPGRAEAIIGRVQARVAYEEITDLFEVSATYLVATARGYIFNDANKRTALNSA LLFLRRNGVQVFDSPELADLTVGAATGEISVSSVADTLRRLYG .........................TTTTTT....................................TTTT......... .............TTTT.........TTTT............. 94 3L3E.A mol:aa CELL CYCLE KPLHKVVVCVSKKLSKKQSELNGIAASLGADYRRSFDETVTHFIYQGRPNDTNREYKSVKERGVHIVSEHWLLDCAQECK HLPESLYPHTYNGS TTTTTT..........................TTTTTTTT.......TTTT............................. .......TTTT... 121 3N6Y.A mol:aa UNKNOWN FUNCTION GAQAEVRIDGPIEYGVFESRSEQNIQQTTEVPAKLGTKFGRYQLSGKQEGDTPLTLLYLTPGVVTPDGQRHDKFEVVQKL VPGAPTDVAYEFTEPHEVVKGEWRLVFQGDRLLAEKSFDVR .................................TTTT......TTTTTTTT.............TTTT............ TTTT......................TTTTT.......... 146 3GZB.A mol:aa LYASE GFASLVIPVSAQANSGEPQEQQLAVKYDALTEHDYKTLITFYNRDSIFFDKTANRKYTGGRFIIDFLERAHQGVLEYDFN IEHYNAGSLVVIGNYHFKGPGEQFGKPGKIIDVAIPAVTSLKLDLNRRVTEHVDLIDYQTSDQLAQ ..........TTTTT............TTTTTT.........TTTT...TTTTT................TTTT...... ....TTTTT...............................................TTTTTTTTT. 131 3FM2.A mol:aa HEME-BINDING PROTEIN SHSLKDFLEACETLGTLRLIVTSSAAVLEARGKIEKLFYAELAKGKYANHTEGFEFHLNEKITQVKFETGEAKRGNFTTY AIRFLDEKQESALSLFLQWGKPGEYEPGQVEAWHTLKEKYGEVWEPLPVQL ......................TTTT...............TTTT.....TTTT.................TTTTT.... .....TTTT.........TTTTTT.TTTT...........TTTT....... 65 3FMY.A mol:aa DNA BINDING PROTEIN AETVAPEFIVKVRKKLSLTQKEASEIFGGGVNAFSRYEKGNAPHPSTIKLLRVLDKHPELLNEIR .............................TTTT................................ 79 3I84.A mol:aa CELL ADHESION GTVFTTVEDLGSKILLTCSLDDSTEVTGHRWLKGGVVLKEDALPGQKTEFKVDSDDQWGEYSCVFLPEPMGTANIQLHG .........TTTT..................TTTTT................................TTTT....... 253 2F1N.A mol:aa TOXIN GTDLTDFRVATWNLQGASATTESKWNINVRQLISGENAVDILAVQEAGSPPSTAVDTGRVIPSPGIPVRELIWNLSTNSR PQQVYIYFSAVDALGGRVNLALVSNRRADEVFVLSPVRQGGRPLLGIRIGNDAFFTAHAIAMRNNDAPALVEEVYNFFRD SRDPVHQALNWMILGDFNREPADLEMNLTVPVRRASEIISPAAATQTSQRTLDYAVAGNSVAFRPSPLQAGIVYGARRTQ ISSDHFPVGVSRR ................TTTTT.............TTTTTTTT........TTTT........TTTT.........TTTTT TT.......TTTTTTTTTT....TTTTTTTT......TTTT.......TTTT.........TTTT............... .............................................TTTT.........TTTTT........TTTTTT... ............. 104 3LAX.A mol:aa LIGASE SNADDIILKGVNIFPIQIETILLQFKELGSDYLITLETAESNDETVEVELSQLFTDDYGRLQALTREITRQLKDEILVTP RVKLVPKGALPKSAVRVKDLRKTF ......TTTTT.............TTTT.........TTTTT........TTTT.......................... .....TTTT............... 163 3IX3.A mol:aa TRANSCRIPTION FLELERSSGKLEWSAILQKMASDLGFSKILFGLLPKDSQDYENAFIVGNYPAAWREHYDRAGYARVDPTVSHCTQSVLPI FWEPSIYQTRKQHEFFEEASAAGLVYGLTMPLHGARGELGALSLSVEAENRAEANRFMESVLPTLWMLKDYALQSGAGLA FEH ..................................TTTT.......................................... .................................TTTT........................................... ... 127 3KE7.A mol:aa ISOMERASE QKNENKTLNENIPEIISLEKEALASTDPAFVELSDTDVIYFDPSLETKIEGLEQLRTYYKGQLPPADHFDIRPVVQVAQN IAVLTFNLDSYLSDKVIKWNCTEVYRRNPDNQWKIIQTHWSYVKPLD .....TTTTTTTTT...........................TTTTTTT.................TTTT........TTT T.........TTTTT............TTTT..........TTTTTT 229 2XLG.A mol:aa METAL BINDING PROTEIN IHTFDDIPMPKLADPLLIYTPANEIFDIASCSAKDIGFAIAHAQIPPGGGPMPHIHYFINEWFWTPEGGIELFHSTKQYP NMDELPVVGGAGRGDLYSIQSEPKQLIYSPNHYMHGFVNPTDKTLPIVFVWMRNEVAPDFPYHDGGMREYFQAVGPRITD LNNLPELTAFASEAPKYGINQSSYFMEYVNTISDKLPAQIAKLKNDKDLERMVEVIEAFNRGDKSVTCS ...........TTTT....TTTT.........TTTT.........TTTT......TTTT.....TTTT............ TTTTT.TTTT...........TTTT....TTTT.......................TTTT.TTTT............TTT TTTT....TTTTTT.......TTTTTTTT.................................TTTTT.. 115 3IDU.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION TDYDKLSNLTFEFPDLTVEIKGPDVVGVNKLAEYEVHVKNLGGIGVPSTKVRVYINGTLYKNWTVSLGPKEEKVLTFNWT PTQEGYRINATVDEENTVVELNENNNVATFDVSVV ...TTTT...............TTTTTTTT.......................TTTTT.........TTTT......... ..........TTTTTT...TTTTTTT......... 138 3IDF.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION MKKLLFAIDDTEACERAAQYILDMFGKDADCTLTLIHVKPEFMLYGEAVLAAYDEIEMKEEEKAKLLTQKFSTFFTEKGI NPFVVIKEGEPVEMVLEEAKDYNLLIIGSSENSFLNKIFASHQDDFIQKAPIPVLIVK ........................TTTTTTT................................................. .............................TTTTTTT....TTTT.............. 94 3ID1.A mol:aa HYDROLASE MVRPVVGEIAANSIAAEAQIAPGTELKAVDGIETPDWDAVRLQLVDKIGDESTTITVAPFGSDQRRDVKLDLRHWAFEPD KEDPVSSLGIRPRG .........TTTT.......TTTT...TTTTT..............TTTT........TTTT.........TTTT..TTT TT............ 101 3FOV.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION SAEASAADYLERQGYRILARRFKTRCGEIDLVAQRDALVAFVEVKARAYAVTPRQQSRIVAAAEAWLSRHPEHASELRFD AILIAPNTAPRHLPGAFDATP .......................TTTT.......TTTT.......................................... ....TTTT......TTTT... 507 3FOT.A mol:aa TRANSFERASE LPPLVPALYRWKSTGSSGRQVQRRCVGAEAIVGLEEKNRRALYDLYIATSLRNIAPASTLLTLQNLKEMFELALLDARFE HPECACTVSWDDEVPAIITYESPESNESARDWARGCIHVQPTAKSALDLWSEMEEGRAAANNTPSKSIELFLLSDVSTDS TPIPQDATVEILFHSNHLFWDGIGCRKFVGDLFRLVGSYIGRSDSREMKKIQWGQEIKNLSPPVVDSLKLDINTLGSEFD DKCTEYTSALVANYKSRGMKFQPGLALPRCVIHKLSADESIDIVKAVKTRLGPGFTISHLTQAAIVLALLDHLKLSDDEV FISPTSVDGRRWLREDIASNFYAMCQTAAVVRIENLKSITVSHKDEKELQVRALESACRNIKKSYRQWLENPFLQALGLR VHNFEASYLHAKPIPFEGEANPLFISDGINERFIPHEIKQTATGENVLSVESIDFVVNQSLPYLAIRLDSWRDASTLNII YNDANYTEAEVQKYLQSIVEFMLAFRL ..............TTTT......................TTTT...........TTTT..................... ......................TTTT.......................................TTTT........TTT T..TTTT...............................TTTT.........TTTTTT....................... ...................................................TTTT....................TTTT. .................TTTT....................TTTT.........................TTTT...... ...........TTTTTTTT....................TTTTT.........................TTTTT...... .TTTTT..................... 154 3FCN.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION EHKTYEADLFVWCQQQADGLRALSRSRRDLPDDLDLEHIAEEIEDGRSELREATSLVRQICVRVIASAPEAPDRARWRSE VVSWHNLLLDTITPGIDRIDIGVIWRRAVSEAKAALIEINVAPQAGLSFQAPLPADHFLDEDFDYDATVARLGP ...TTTTTT.................TTTTTTTT.................................TTTT......... ...............TTTT........................TTTT............TTTT........... 321 3D1R.A mol:aa HYDROLASE MRRELAIEFSRVTESAALAGYKWLGRGDKNTADGAAVNAMRIMLNQVNIDGTIVIGEGEIAEAPMLYIGEKVGTGRGDAV DIAVDPIEGTRMTAMGQANALAVLAVGDKGCFLNAPDMYMEKLIVGPGAKGTIDLNLPLADNLRNVAAALGKPLSELTVT ILAKPRHDAVIAEMQQLGVRVFAIPDGDVAASILTCMPDSEVDVLYGIGGAPEGVVSAAVIRALDGDMNGRLLARHDVKG DNENRRIGEQELARCKAMGIEAGKVLRLGDMARSDNVIFSATGITKGDLLEGISRKGNIATTETLLIRGKSTIRRIQSIH Y ......................TTTT..........................TTTT..TTTTTTTTTTTT.......... ......TTTT.................TTTT.................TTTT.TTTT....................... ....................................TTTT........................................ ....................TTTT..................TTTTTTTT.....TTTT........TTTT......... . 145 3IBM.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION HEASRVLRERDYRWEGTEEESGARRQTLVGRPAGQEAPAFETRYFEVEPGGYTTLERHEHTHVVMVVRGHAEVVLDDRVE PLTPLDCVYIAPHAWHQIHATGANEPLGFLCIVDSDRDRPQRPDADDLARMCADPAVARRIRTEG ........TTTTTTTTT.........TTTTTTTTTTTT.........TTTT.......................TTTT.. ..TTTT....TTTT.......TTTT........TTTT............................ 104 3EZI.A mol:aa TRANSFERASE GSAHAINKAGSLRMQSYRLLAAVPLSEKDKPLIKEMEQTAFSAELTRAAERDGQLAQLQGLQDYWRNELIPALMRAQNRE TVSADVSQFVAGLDQLVSGFDRTT .....................TTTT.....................................................TT TT...................... 144 3AG3.D mol:aa OXIDOREDUCTASE SVVKSEDYALPSYVDRRDYPLPDVAHVKNLSASQKALKEKEKASWSSLSIDEKVELYRLKFKESFAEMNRSTNEWKTVVG AAMFFIGFTALLLIWEKHYVYGPIPHTFEEEWVAKQTKRMLDMKVAPIQGFSAKWDYDKNEWKK ......TTTT.....TTTTTTT..TTTT.................................................... ...........................................TTTTTTT.....TTTTT.... 56 3H6P.C mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION GYAGTLQSLGADIASEQAVLSSAWQGDTGITYQGWQTQWNQALEDLVRAYQSMSGT ........................TTTT............................ 152 3H6R.A mol:aa HYDROLASE/HYDROLASE INHIBITOR MASLEDGTYRLRAVTTHNPDPGVGGEYATVEGARQPVKAEPSTPPFSEQQIWQVTRNSDGQYTIKYQGLNAPFEYGFSYD QLEPNAPVIAGDPKEYILQLVPSTADVYIIRAPIQRVGVDVEVGVQGNTLVYKFFPVDGSGGDRPAWRFTRE ..............TTTTTTT..........TTTT.......TTTTT.........TTTT....TTTT..TTTT....TT TTTTTT..............TTTTTTT........TTTT.....TTTTT....................... 262 3NFT.A mol:aa TRANSPORT PROTEIN AMTDDDLRAAGVDRRVPEQKLGAAIDEFASLRLPDRIDGRFVDGRRANLTVFDDARVAVRGHARAQRNLLERLETELLGG GIQPDPILQGLVDVIGQGKSDIDAYATIVEGLTKYFQSVADVMSKLQDYISAKDDKNMKIDGGKIKALIQQVIDHLPTMQ LPKGADIARWRKELGDAVSISDSGVVTINPDKLIKMRDSLPPDGTVWDTARYQAWNTAFSGQKDNIQNDVQTLVEKYSHQ NSNFDNLVKVLSGAISTLTDTA ...................................TTTTT........................................ ....................................................TTTTT....................... .TTTT.........TTTTT.TTTT.................TTTT................................... ...................... 106 2VQ4.A mol:aa HYDROLASE ASIPSSASVQLDSYNYDGSTFSGKIYVKNIAYSKKVTVVYADGSDNWNNNGNIIAASFSGPISGSNYEYWTFSASVKGIK EFYIKYEVSGKTYYDNNNSANYQVST ...TTTTTTT.....TTTTT.....................TTTT................TTTT............... ......TTTTT....TTTTT...... 86 3L7H.A mol:aa PROTEIN TRANSPORT LKRIQSHKGVVGTIVVNNEGIPVKSTLDNTTTVQYAGLMSQLADKARSVVRDLDPSNDMTFLRVRSKKHEIMVAPDKDFI LIVIQN ......TTTT......TTTT.................................TTTT........TTTT......TTTT. ...... 91 3GXW.A mol:aa TRANSCRIPTION SHHRVINHPYYFPFNGKQAEDYLRSKERGDFVIRQSSRGDDHLAITWKLDKDLFQHVDIQEGKVLVVEGQRYHDLDQIIV EYLQNKIRLLN .......TTTT...............TTTT.....TTTTTTT.......TTTT............TTTTT.......... ........... 228 2VTC.A mol:aa HYDROLASE HGQVQNFTINGQYNQGFILDYYYQKQNTGHFPNVAGWYAEDLDLGFISPDQYTTPDIVCHKNAAPGAISATAAAGSNIVF QWGPGVWPHPYGPIVTYVVECSGSCTTVNKNNLRWVKIQEAGINYNTQVWAQQDLINQGNKWTVKIPSSLRPGNYVFRHE LLAAHGASSANGMQNYPQCVNIAVTGSGTKALPAGTPATQLYKPTDPGILFNPYTTITSYTIPGPALW .......TTTTT............................TTTT......TTTT.....TTTT.........TTTT.... ..TTTT..............TTTT...................TTTTT.........TTTT.....TTTT.......... ...TTTTTTTTT..............................TTTTTTTTTTTTT............. 145 3IJM.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION YSHPISLKTLVQEDDIGVNAPIIHQSVIARLTAGLYPLYQSKKIPFEPLPETLTEGYSSPVPDVLLYDHQTEEAKVIIEV CQNSGLKHDTSKIVKLIEDNAYGILEGFVFNYKTQQWLRYRLGDGGVATNSSFSEVLQVDLNTFV ..TTTT.......TTTTTTT.....................TTTT...TTTT...TTTT..TTTT..TTTTT........ ..............................TTTTT.....TTTTTTT......TTTTT....... 80 3GOE.A mol:aa RECOMBINATION, REPLICATION HHHHHHKLITLLLRSSKSEDLRLSIPVDFTVKDLIKRYCTEVKISFHERIRLEFEGEWLDPNDQVQSTELEDEDQVSVVL TTTT..........TTTT.......TTTT.................TTTT..TTTTT..TTTT.......TTTT...... 147 3EDO.A mol:aa FLAVOPROTEIN GAKKTLILYYSWSGETKKAEKINSEIKDSELKEVKVSEGTFDADYKTSDIALDQIQGNKDFPEIQLDNIDYNNYDLILIG SPVWSGYPATPIKTLLDQKNYRGEVASFFTSAGTNHKAYVSHFNEWADGLNVIGVARDDSEVDKWSK ..........TTTTTTTT.......TTTT.......TTTTTT...............................TTTT... ..TTTTT.......................TTTT............TTTT......TTTTTT..... 343 3FGR.B mol:aa HYDROLASE SALIKLLPGGHDLLVAHNTWNSYQNMLRIIKKYRLQFREGPQEEYPLVAGNNLVFSSYPGTIFSGDDFYILGSGLVTLET TIGNKNPALWKYVQPQGCVLEWIRNVVANRLALDGATWADVFKRFNSGTYNNQWMIVDYKAFLPNGPSPGSRVLTILEQI PGMVVVADKTAELYKTTYWASYNIPYFETVFNASGLQALVAQYGDWFSYTKNPRAKIFQRDQSLVEDMDAMVRLMRYNDF LHDPLSLCEACNPKPNAENAISARSDLNPANGSYPFQALHQRAHGGIDVKVTSFTLAKYMSMLAASGPTWDQCPPFQWSK SPFHSMLHMGQPDLWMFSPIRVP .........................................TTTT..TTTTTT....TTTT..TTTT...TTTT...... .............TTTTT........................TTTT.TTTT...........TTTT.....TTTT....T TTT............................................TTTTT..........................TT TTT....TTTTTTT.TTTTTTTT.....TTTT.....................................TTTT....... TTTTTT.TTTTTTTT........ 154 3FG9.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION SNAENQKQEPLVYRRILLTVDEDDNTSSERAFRYATTLAHDYDVPLGICSVLESEDINIFDSLTPSKIQAKRKHVEDVVA EYVQLAEQRGVNQVEPLVYEGGDVDDVILEQVIPEFKPDLLVTGADTEFPHSKIAGAIGPRLARKAPISVIVVR ...TTTT................................................TTTT.TTTT................ ..........TTTT......TTTT.............TTTT..TTTT.TTTTTTT................... 91 3FGV.A mol:aa OXIDOREDUCTASE REVVIVKSTPQRGKFNAFAELVGKLVSETRDFPGCLGAYLLAPERNEQVVHIWETPDALEAYLTWRADRGDFLEINEYLE VEQDFKTYQLA ..........TTTTT................TTTT.................TTTT.........TTTT.........TT TT......... 233 3JQY.A mol:aa TRANSFERASE KTQDSRLKTQDSFSVDDNGSGNVFVCGDLVNSKENKVQFNGNNNKLIIEDDVECRWLTVIFRGDNNYVRIHKNSKIKGDI VATKGSKVIIGRRTTIGAGFEVVTDKCNVTIGHDCMIARDVILRASDGHPIFDIHSKKRINWAKDIIISSYVWVGRNVSI MKGVSVGSGSVIGYGSIVTKDVPSMCAAAGNPAKIIKRNIIWARTDKAELISDDKRCSSYHAKLTQLEHHHHH ...TTTT..TTTT.....TTTT......TTTT................TTTT..................TTTT...... ..TTTT....TTTT..TTTT...TTTT....TTTT..TTTT...........TTTTT.TTTT......TTTT..TTTT.. TTTT..TTTT..TTTT......TTTT...TTTT...........TTTT......................... 147 3K69.A mol:aa TRANSCRIPTION KLDFSVAVHSILYLDAHRDSKVASRELAQSLHLNPVIRNILSVLHKHGYLTGTVGKNGGYQLDLALADNLGDLYDLTIPP TISYARFITGPSKADQSPIAANISETLTDLFTVADRQYRAYYHQFTADLQADLNHHGTFLQHEQDSE ................TTTT..................................TTTT......TTTT............ TTTT..................................................TTTT......... 204 3K6O.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION TVTIAATVEKQPQYDAPYLVLDNGEKLWVVQHIVPYRDLKAGERIFGNYSFLEAGESGFAYNIRLNDYTLVPVQKIIGLN PDNDSIGNKVQIKDWPSDDYLNVRFLNFPSPQKPILNLVVNEIPWTKDGYAHLELRYNNNGSQGRLVPGVSFKLDDYSPE NSELKGIKVLVNPVDGEEKTYIFSYPLTGEDVPGFNPLDLAELK ..........TTTT......TTTT.......TTTTTTTTTTTT............TTTT....................T TTTTTTT.........TTTT........TTTT..........................TTTT...............TTT TTTT........TTTT............TTTT............ 185 3BMZ.A mol:aa BIOSYNTHETIC PROTEIN EPPLLPARWSSAYVSYWSPMLPDDQLTSGYCWFDYERDICRIDGLFNPWSERDTGYRLWMSEVGNAASGRTWKQKVAYGR ERTALGEQLCERPLDDETGPFAELFLPRDVLRRLGARHIGRRVVLGREADGWRYQRPGKGPSTLYLDAASGTPLRMVTGD EASRASLRDFPNVSEAEIPDAVFAA .....TTTT.......TTTTTTTT.........TTTTT.......TTTT...............TTTTT........... ..TTTT.................TTTTTTT............TTTTT........TTTTT......TTTTT........T TTTT.....TTTT............ 257 3BM3.A mol:aa HYDROLASE/DNA VRNLVIDITKKPTQNIPPTNEIIEEAITELNVDELLDRLFEKDESGEVITPSRIAKLEEKAFEIYKEYEKQVREAYLSAG YSREKLEQSFQQARFSRGGKAFEIIFTKLLNKFGIRYEHDRVIKIYDYITEGEKPAFIIPSVRTFLNDPSSAILITVKRK VRERWREAVGEAQILRNKFGDEINFWFVGFDEEFTIYSAIALDNGIDRVYVIDGRYDSLIEEIKRISDPNFNEDKYIQKI RRFSDIFDDIIQFLNKH ......TTTT...........................TTTT.TTTT.................................. .....................................TTTT....TTTT.....TTTTTTT................TTT TTTT...............TTTT.......TTTT.......TTTT......................TTTT......... ................. 709 2WVX.A mol:aa HYDROLASE KDWTQYVNPLGSQSTFELSTGNTYPAIARPWGNFWTPQTGKGDGWQYTYTANKIRGFKQTHQPSPWINDYGQFSIPIVGQ PVFDEEKRASWFAHKGEVATPYYYKVYLAEHDIVTETPTERAVLFRFTFPENDHSYVVVDAFDKGSYIKIIPEENKIIGY TTRNSGGVPENFKNYFIIEFDKPFTYKATVENGNLQENVAEQTTDHAGAIIGFKTRKGEQVNARIASSFISFEQAAANNE LGKDNIEQLAQKGKDAWNQVLGKIEVEGGNLDQYRTFYSCLYRSLLFPRKFYELDANGQPIHYSPYNGQVLPGYFTDTGF WDTFRCLFPLLNLYPSVNKEQEGLINTYLESGFFPEWASPGHRGCVGNNSASILVDAYKGVKVDDIKTLYEGLIHGTENV HPEVSSTGRLGYEYYNKLGYVPYDVKINENAARTLEYAYDDWCIYRLAKELKRPKKEISLFAKRANYKNLFDKESKLRGR NEDGTFQSPFSPLKWGDAFTEGNSWHYTWSVFHDPQGLIDLGGKEFVTDSVFAVPPIFDDSYYGQVIHEIRETVNGNYAH GNQPIQHIYLYDYAGQPWKAQYWLRQVDRYTPGPDGYCGDEDNGQTSAWYVFSALGFYPVCPGTDEYVGTPLFKKATLHF ENGNSLVIDAPNNSTENFYIDSSFNGADHTKNYLRHEDLFKGGTIKVDSNRPNLNRGTKEEDPYSFSKE ..........TTTTTTTTTTTT......TTTT..........TTTTTTTTT............TTTTT............ ...TTTTTT..........TTTT....TTTTT......TTTT....................TTTT.............. ........TTTT.......TTTTTTTT..TTTTT.TTTT....TTTT........TTTT........TTTT.......TT TTTT.........................................TTTT.....TTTT.....TTTTT............ ..TTTTT...........................TTTTTTTT....................TTTT.............. TTTTTTTTTTTT.........TTTT.TTTT.........................................TTTTT.... TTTT..TTTTTTTT.TTTTTTT....TTTTTTTTT........................TTTTT..............TT TT..............................TTTT....TTTTT.............TTTTTT........TTTT...T TTT......TTTTTTTTT....TTTTT.........................TTTTTTTTTT..TTTTT 129 3ECH.A mol:aa TRANSCRIPTION, TRANSCRIPTION REGULATION MNYPVNPDLMPALMAVFQHVRTRIQSELDCQRLDLTPPDVHVLKLIDEQRGLNLQDLGRQMCALITRKIRELEGRNLVRR QLFLTDEGLAIHLHAELIMSRVHDELFAPLTPVEQATLVHLLDQCLAAQ .....TTTT.......................................TTTT.......................TTTT. ..........................TTTT................... 124 3ECF.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION ATEKYHEILKKYFLSFETGDFSQVQFSCNLEFLSPISGNTLKGTEEVIPFLKGVTTRVAEVNISTTVEYPRASGVWQRTT KGTLYTLHNFFRLDEEGIVYVWPFDPKAVENPDALIQWLTGKDY ....................TTTT.........TTTT..............................TTTTT......TT TT...........TTTT........TTTT...........TTTT 256 3O6C.A mol:aa TRANSFERASE NALLGVNIDHIAVLRQARVNDPDLLEAAFIVARHGDQITLHVREDRRHAQDFDLENIIKFCKSPVNLECALNDEILNLAL KLKPHRVTLVPEKREELTTEGGLCLNHAKLKQSIEKLQNANIEVSLFINPSLEDIEKSKILKAQFIELHTGHYANLHNAL FSNISHTAFALKELDQDKKTLQAQFEKELQNLELCAKKGLELGLKVAAGHGLNYKNVKPVVKIKEICELNIGQSIVARSV FTGLQNAILEKELIKR ...................TTTT...................TTTTTTT............................... ...TTTT..........TTTT..TTTTTTT.................................................. ......TTTT......................................TTTTTTTTTT....TTTT.............. ..........TTTTT. 163 3GQQ.A mol:aa SPLICING KQPIGPEDVLGLQRITGDYLCSPEENIYKIDFVRFKIRDDSGTVLFEIKKPPNAGRFVRYQFTPAFLRLRQVGATVEFTV GDKPVNNFRIERHYFRNQLLKSFDFHFGFCIPSSKNTCEHIYDFPPLSEELISEIRHPYETQSDSFYFVDDRLVHNKADY SYS ........TTTT.............TTTT.......................TTTT........................ ....TTTT.....TTTTT............TTTT....................TTTTTTTT.....TTTTT........ ... 176 3LGB.A mol:aa TRANSFERASE SDEINAQSVWSEEISSNYPLCIKNLEGLKKNHHLRYYGRQQLSLFLKGIGLSADEALKFWSEAFTNTEKFNKEYRYSFRH NYGLEGNRINYKPWDCHTILSKPRPGRGDYHGCPFRDWSHERLSAELRSKLTQAQIISVLDSCQKGEYTIACTKVFETHT HIAHPNLYFERSRQLQ TTTTTTTT........................................................................ ....TTTT.................TTTT................................................... ................ 135 3MSW.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION GTNNGKQFIHNDTEGGKLVCREIYANDAASGILNPVKYKYSYDTDQQKTVKSTYAWNIFKNTWETESRTVISRYETETSV EYSVWNKEKGSFDLSKKYIYITDNNNQLIAQYAYKNSRTNQWILEKDALTPIYEN TTTTT........TTTT.........TTTTT...........TTTTT.........TTTTT............TTTT... .....TTTTT............TTTT.........TTTTT.........TTTTT. 56 3KXT.A mol:aa DNA BINDING PROTEIN/DNA MKPVKVKTPAGKEAELVPEKVWALAPKGRKGVKIGLFKDPETGKYFRHKLPDDYPI .......TTTT......TTTT....TTTT.........TTTTT.......TTTT.. 317 3BIY.A mol:aa TRANSFERASE KFSAKRLPSTRLGTFLENRVNDFLRRQNHPESGEVTVRVVHASDKTVEVKPGMKARFVDSGEMAESFPYRTKALFAFEEI DGVDLCFFGMHVQEYGSDCPPPNQRRVYISYLDSVHFFRPKCLRTAVYHEILIGYLEYVKKLGYTTGHIWACPPSEGDDY IFHCHPPDQKIPKPKRLQEWYKKMLDKAVSERIVHDYKDIFKQATEDRLTSAKELPYFEGDFWPNVLEESIKESQKLYAT MEKHKEVFFVIRLIAGPAANSLPPIVDPDPLIPCDLMDGRDAFLTLARDRHLEFSSLRRAQWSTGCMLVELHTQSQD ..TTTTT.....................TTTT.................TTTT....TTTT.................TT TTT............TTTTTTTTTTT...........TTTT.................................TTTTTT TTTTTTTTT......................TTTT......................TTTT................... ...........TTTT...TTTT..............TTTT..................................... 112 3BP6.A mol:aa SIGNALING PROTEIN SLTFYPAWLTVSEGANATFTCSLSNWSEDLMLNWNRLSPSNQTEKQAAFSNGLSQPVQDARFQIIQLPNRHDFHMNILDT RRNDSGIYLCGAISLHPKLKIEESPGAELVVT ...TTTTTT..TTTT.......TTTTTTTT.......TTTT.......TTTTT..TTTTTTTT...TTTTTT....TTTT ..............TTTTT............. 126 3GBY.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION NASVTFSYLAETDYPVFTLGGSTADAARRLAASGCACAPVLDGERYLGVHLSRLLEGRKGWPTVKEKLGEELLETVRSYR PGEQLFDNLISVAAAKCSVVPLADEDGRYEGVVSRKRILGFLAERI TTTT......TTTT...TTTT...................TTTTT.........TTTTTTT.TTTT.............T TTT...........TTTTTT...TTTT................... 159 3GBW.A mol:aa LIGASE SLEDYSVVNRFESHGGGWGYSAHSVEAIRFSADTDILLGGLGLFGGRGEYTAKIKLFELGPDGGDHETDGDLLAETDVLA YDCAAREKYAFDEPVLLQAGWWYVAWARVSGPSSDCGSHGQASITTDDGVIFQFKSSKKSNNGTDVNAGQIPQLLYRLP ....................TTTT...................................TTTTTTTT............. ...TTTT...TTTT...TTTT...................TTTT.TTTT.......TTTTTTT.TTTT........... 94 3FIA.A mol:aa PROTEIN BINDING TPFGGSLDTWAITVEERAKHDQQFHSLKPISGFITGDQARNFFFQSGLPQPVLAQIWALADNNDGRDQVEFSIAKLIKLK LQGYQLPSALPPVK TTTT.TTTTTT.................TTTTT............................................... ......TTTT.... 296 3FID.A mol:aa MEMBRANE PROTEIN SSLAISVANDDAGIFQPSLNALYGHPAADRGDYTAGLFLGYSHDLTDASQLSFHIAQDIYSPSGANKRKPEAVKGDRAFS AFLHTGLEWNSLATNWLRYRLGTDIGVIGPDAGGQEVQNRAHRIIGAEKYPAWQDQIENRYGYTAKGMVSLTPAIDILGV NVGFYPEVSAVGGNLFQYLGYGATVALGNDKTFNSDNGFGLLSRRGLIHTQKEGLIYKVFAGVERREVDKNYTLQGKTLQ TKMETVDINKTVDEYRVGATIGYSPVAFSLSLNKVTSEFRTGDDYSYINGDITFFF ........TTTT..................TTTT.........TTTTTT.......................TTTT.... ...........TTTTTT..........................................................TTTTT ............TTTT...........................TTTT..............................TTT TT......TTTT..........TTTT............TTTT.............. 84 3FX7.A mol:aa UNKNOWN FUNCTION QMDTEEVREFVGHLERFKELLREEVNSLSNHFHNLESWRDARRDKFSEVLDNLKSTFNEFDEAAQEQIAWLKERIRVLEE DYLE ..................................TTTT.......................................... .... 60 2ZSI.B mol:aa HORMONE RECEPTOR GMDELLAVLGYKVRSSEMADVAQKLEQLEVMMSNVLATETVHYNPAELYTWLDSMLTDLN ...................................TTTTT...TTTTT............ 498 3EHM.A mol:aa SUGAR BINDING PROTEIN PLKYGARFNQQRVIPIGSPSLTTGPGNDLQNTDLISSGNYIGYFGNNNNWGFNNEANWNFTDSRNYAYQNFYSQIFLPWN EIYEIAKDSDSPSEQAILEIANIVRNIAWLRATDVFGPIAYNSAGDGSIAPKFDSQEVVYRSLADLSKSVELLNTISYSV AQYDLIYNGNVQNWVKLANSLLRIVVRVHFIDETLAKEYITKALDPKNGGVIEDISSEAKIKSSDKPLLNSLASVNEYNE TRGATIWGYLDGYKDPRLSAYFTEGTYGSGSWAQTGYFPVAPTNSKSKSETSYSAKFASRPKVDSNSPLYWFRASETYFL KAEAALYNLIGGDPKTFYEQGINISFQEQGVSGVATYLSGTGKPTGLTGSNYKYGTYNHDLSIGNTSPKWDDYTGNLSKQ EEQLQKIITQKYLALYPNAVEAWTEYRRTGFPYLKPDEAAPGRIGASIEDCRVPERFRFAPTAYNSNPNAEIPTLLGGGD IGATKLWWVRSNRPKQPN ....TTTT.TTTTTT.....TTTT.........TTTT....TTTT...............TTTT................ .....TTTT..............................TTTTTTT.TTTT............................. TTTTTTTTTT.................TTTTTT............................................... ............................TTTTT.......TTTT......TTTT.........TTTT............. .......TTTT....................................TTTTTTTT....TTTTTT...TTTT..TTTT.. ..............TTTTT..........TTTT...TTTT......TTTTTTTT.......................... TTTT.TTTTTTTT..... 125 3EHG.A mol:aa TRANSFERASE GIRLKDELINIKQILEAADIMFIYEEEKWPENISLLNENILSMCLKEAVTNVVKHSQAKTCRVDIQQLWKEVVITVSDDG TFKGEENSFSKGHGLLGMRERLEFANGSLHIDTENGTKLTMAIPN .........................TTTTTTTT..................................TTTT......... ....TTTT.TTTT...................TTTTT........ 116 3EEH.A mol:aa TRANSFERASE ERRVRELTEATNDILWEFTADLSEVLVINSAYEDIWGRSVAKLRENPHDFLNGIHPEDRELMKDTMQSLMDGESADVECR VNATEEYQRWVWIQGEPITNDAGETVRVAGFARDIT ..................TTTT.......................................................... ....TTTT...........TTTT............. 138 3EER.A mol:aa OXIDOREDUCTASE STIYQTSATASAGRNGVVSTEDKLLELNLSYPKEGGSGTATNPEQLFAVGYAACFSNAILHVAREAKVALKEAPVTATVG IGPNGQGGFALSVALAAHIALEDEQARQLVTVAHQVCPYSNAVRGNIDVQVSVNGLAL .........TTTTTTT...TTTTTTT...................................................... ...TTTT...................................TTTTTT...TTTTT.. 168 3NKG.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION APNPISIPIDLSQAGSVVEKEVKIEESWSYHLILQFAVHDRKEDGGLDGKRVWKFLGFNSYDPRDGKQVGYVDYRLAKSE LGDLIDETYDCDGTVVPIKITIHQINQDNTKKLIADNLYTKGNGSGAYTRDITTISLDKGKYIFRIENIEAFSEIGRKVD FTIYINKR .........TTTTTTT.......................TTTTTTTT..............TTTTT.............. .....TTTTTTTT............TTTT................TTTT............................... ........ 120 3NKL.A mol:aa OXIDOREDUCTASE/LYASE AKKKVLIYGAGSAGLQLANLRQGKEFHPIAFIDDDRKKHKTTQGITIYRPKYLERLIKKHCISTVLLAVPSASQVQKKVI IESLAKLHVEVLTIPNLDDLVNGKLSIGQLKEVSIDDLLG ...................TTTT..............TTTT...........................TTTT........ .........................TTTT........... 182 3NKE.A mol:aa IMMUNE SYSTEM ARSDKLLYQAKLALDEDLRLKVVRKFELRFGEPAPARRSVEQLRGIEGSRVRATYALLAKQYGVTWNGRRYDEKGDTINQ CISAATSCLYGVTEAAILAAGYAPAIGFVHTGKPLSFVYDIADIIKFDTVVPKAFEIARRNPGEPDREVRLACRDIFRSS KTLAKLIPLIEDVLAAGEIQPP ..................................TTTT.......................................... ......................TTTTTTTTT.TTTT........TTTTTT...........TTTT............... ...................... 154 3F4M.A mol:aa IMMUNE SYSTEM IDETSSEVLDELYRVSKEYTHSRPQAQRVIKDLIKVAIKVAVLHRNGSFGPSELALATRFRQKLRQGAMTALSFGEVDFT FEAAVLAGLLTECRDVLLELVEHHLTPKSHGRIRHVFDHFSDPGLLTALYGPDFTQHLGKICDGLRKLLDEGKL ...........................................................................TTTT. ....................TTTTT................................................. 105 3F40.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION KTQITTRDLVLEFIHALNTENFPAAKKRLNENFTFNGPGHREGSERYNDEKKFKYVVHKFEEGNDVCLIYDINNGKTIAA SGLYHLEKGEITSLHVYFDPRPLFE ..........................................TTTTT..............TTTT............... .....TTTTT............... 75 1T07.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION GHSRTVCRKYHEELPGLDRPPYPGAKGEDIYNNVSRKAWDEWQKHQTLINERRLNNAEDRKFLQQEDKFLSGEDY ......TTTTT.....TTTT..............................................TTTTT.... 147 3JYZ.A mol:aa STRUCTURAL PROTEIN GIDPFTVRTRVSEGLVLAEPAKLISTDGSASTADLTRATTTWNQQSNNLGASSKYVTSVLDAGNTGVITITYVADQVGLP TAGNTLILSPYINDGNTRTALATAVAAGTRGTIDWACTSASNATATAQGFTGAAGSVPQEFAPAQCR .......................TTTTT........................TTTT....TTTT...............T TTTT........TTTTT....................TTTTT......................... 178 3JYG.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION SLIYQIAKEFDFCYGHRVWSQELNPDFSLDPCLSCRHLHGHQGKVIVHLESRELQRGVTDFAHLNWFKRFIDEVLDHRFI IDIDDPLFPTLLPHFADKSALVWEEGYARVDFERIKGESSPILELYESFVVVRFVPTSESIASWLLELLRSRIQPLGVKV SSVEFLETPKSRARVYNE .................TTTT......TTTT......................TTTT.................TTTT.. .TTTTTTT......TTTTTTT.............TTTT.......................................... .....TTTTTT....... 72 2W8X.A mol:aa MEMBRANE PROTEIN FNCNKREGPCSQRSLCECDPNLQLGRHSDQLWHYNLRTNRCERGGYRDNCNSHSSSGACVMACERIHHHHHH ...TTTTT..TTTT....TTTT............TTTTT......TTTTTTT.................... 140 3F9S.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION GSISSKAKEILTQFTREVWSEGNIEASDKYIAPKYTVLHDPGDPWEGRELDVAGYKERVKTLRAAFPDQCFDIQGLFADG DAVVTWLWTATHKEDIPGFPSTGKQIKSGATVYYFDGNRLTGHWQITDRLGVYQQLRQAA ...TTTT.............................TTTTTTTTTTTT.................TTTT.........TT TT.............TTTT...............TTTTT..................... 175 3F95.A mol:aa HYDROLASE SDSHPLFVRSLAKNMTWQLADTSTQKVLASGASATSGDKQSLLMQSVNLSYQEDGRGFNWRAQAALSLSYLEPTPLDSKF STGYLELKMRIDKAPEQGANLQVMCSESNCLRDIDFSSFSQLMADKSWHTLAIPLHCQPITDALRITSQNLSLAIADVAL TIKPSDDSISLTCAK ....TTTTTTTTTTT....................TTTTTTTT....TTTTTTT.........................T TTT.......TTTTTTTT.......TTTT.............TTTT.....................TTTT......... TTTTTTTT....... 182 3IT5.A mol:aa HYDROLASE APPSNLMQLPWRQGYSWQPNGAHSNTGSGYPYSSFDASYDWPRWGSATYSVVAAHAGTVRVLSRCQVRVTHPSGWATNYY HMDQIQVSNGQQVSADTKLGVYAGNINTALCEGGSSTGPHLHFSLLYNGAFVSLQGASFGPYRINVGTSNYDNDCRRYYF YNQSAGTTHCAFRPLYNPGLAL ..TTTT.....TTTT........TTTT..........TTTT.TTTT................TTTT....TTTT....TT TT.....TTTT..TTTT............TTTT............TTTTT...TTTTTTTTT......TTTT.TTTTT.. .TTTTT...TTTT......... 489 1PBY.A mol:aa OXIDOREDUCTASE VTGEEVLQNACAACHVQHEDGRWERIDAARKTPEGWDMTVTRMMRNHGVALEPEERAAIVRHLSDTRGLSLAETEERRYI LEREPVAWDEGPDTSMTQTCGRCHSYARVALQRRTPEDWKHLVNFHLGQFPTLEYQALARDRDWWGIAQAEIIPFLARTY PLGEAPDAYADDASGAYVLAGRQPGRGDYTGRLVLKKAGEDYEVTMTLDFADGSRSFSGTGRILGAGEWRATLSDGTVTI RQIFALQDGRFSGRWHDADSDVIGGRLAAVKADAAPQVLAVAPARLKIGEETQLRVAGTGLGSDLTLPEGVAGSVESAGN GVTVLKLTATGTPGPVSLELGGQKVDLVAYDRPDRISIVPDLTIARIGGNGGPIPKVPAQFEAMGWLNGPDGQPGTGDDI ALGAFPASWATDNFDEEAEKMQDAKYAGSIDDTGLFTPAEAGPNPERPMQTNNAGNLKVIATVDAEGEPLSAEAHLYATV QRFVDAPIR .................TTTT.TTTT...............................................TTTTT.. ...TTTT.............TTTT...............................TTTTTTT.................. ......................TTTTT..........TTTT........TTTT..........TTTTT......TTTT.. .....TTTTT......TTTTT.........TTTT......TTTT..TTTT.......TTTT......TTTT.......TT TT................TTTTT..............TTTTTT.....TTTT................TTTTTTTTTTT. ..............................TTTT.........TTTT.................TTTT............ .....TTTT 177 2W2R.A mol:aa VIRAL PROTEIN LSNDFFGEDDSLRYEKFRFLKTVRSNKPFRSYDDVTAAVSQWDNSYIGVGKRPFYKIIALIGSSHLQATPAVLADLNQPE YYATLTGRCFLPHRLGLIPPFNVSETFRKPFNIGIYKGTLDFTFTVSDDESNEKVPHVWEYNPKYQSQIQKEGLKFGLIL SKKATGTWVLDQLSPFK TTTTT...................TTTT..............TTTT..TTTT......................TTTTT. ................................TTTTT........................................... .TTTT........TTTT 48 2WPV.B mol:aa PROTEIN BINDING GPEHEFVSKFLTLATLTEPKLPKSYTKPLKDVTNLGVPLPTLKYKYKQ ..............TTTTTTTTTTT....................... 208 3F0P.A mol:aa LYASE MKLAPYILELLTSVNRTNGTADLLVPLLRELAKGRPVSRTTLAGILDWPAERVAAVLEQATSTEYDKDGNIIGYGLTLRE TSYVFEIDDRRLYAWCALDTLIFPALIGRTARVSSHCAATGAPVSLTVSPSEIQAVEPAGMAVSLVLPQEAADVRQSFCC HVHFFASVPTAEDWASKHQGLEGLAIVSVHEAFGLGQEFNRHLLQTMS ................TTTT.......................................TTTT..TTTT...TTTTT... ......TTTT..........................TTTTT.......TTTT...TTTTTT.......TTTT........ ....................TTTT........................ 282 3HL1.A mol:aa METAL BINDING PROTEIN PPVWTLPRLYQHFQGAIDLELWTIPYYLTVLYSIKDPTTVPYRLIQAAVYQELHAQLVSNIANAYGYSPTLSAPEYVGTA VPHIDFDLDTPNPTSIFTPYSAELGPLDLTRVNTCLIEYPEWRTQREPDLADDVTDYGSIGEFYDALRVGEQLRGHVRGN QKQDENSPPLTVTESGDAGFLQALTLVDIIVDQGEGQAWPHFQRFDFIRRPNWPGVYTGVTDPPAGSPGAEAQARLIADF AGFLDILNGFSGGGAPPAFGVQAKLGGDILSCWKLGAVPRYS .................................TTTTTT.....................................TTTT TTTT....TTTT..TTTTTT..............TTTTT...........TTTT................TTTT...TTT TTT........................................................TTTTTTTT............. .......................................... 100 2HLQ.A mol:aa TRANSFERASE ALCAFKDPYQQDLGIGESRISHENGTILCSKGSTCYGLWEKSKGDINLVKQGCWSHIGDPQECHYEECVVTTTPPSIQNG TYRFCCCSTDLCNVNFTENF ......TTTTTTTTTTT...TTTTT....TTTT.......TTTTT..........TTTTTTT..TTTT....TTTTTTTT TT......TTTT........ 147 3A0Z.A mol:aa TRANSFERASE MEFTEFNLNELIREVYVLFEEKIRKMNIDFCFETDNEDLRVEADRTRIKQVLINLVQNAIEATGENGKIKITSEDMYTKV RVSVWNSGPPIPEELKEKIFSPFFTTGLGLSICRKIIEDEHGGKIWTENRENGVVFIFEIPKTPEKR .................................TTTTTT........................TTTT........TTTT. ..............TTTT..TTTT.........................TTTT.......TTTT... 70 3NY3.A mol:aa LIGASE LCGRVFKVGEPTYSCRDCAVDPTCVLCMECFLGSIHRDHRYRMTTSGGGGFCDCGDTEAWKEGPYCQKHE ......TTTT....TTTTTTTTTT............................TTTTTTTTTTT..TTTTT 253 3GAE.A mol:aa NUCLEAR PROTEIN GMKVLPVKQYLIMENYNPDTIFNGIVKINSNEKTFDDEILAQIGGALHDIDESWELLLSFANTIRSNWEIKTPAYDIVRL IVKKLPYSSDIKDYIEEGLGNKNITLTMLTVRILVNCFNNENWGVKLLESNQVYKSIFETIDTEFSQASAKQSQNLAIAV STLIFNYSALVTKGNSDLELLPIVADAINTKYGPLEEYQECEEAAYRLTVAYGNLATVEPTLRQFANSVTWLANIKRSYG NVPRFKDIFDDLS .TTTTTT............................................................TTTT......... .................TTTTTTT...............TTTTT.........TTTT.......TTTT............ ................TTTT...........TTTT.............................TTTTT.........TT TT........... 156 3GA4.A mol:aa TRANSFERASE IDDILQLKDDTGVITVTADNYPLLSRGVPGYFNILYITMRGTNSNGMSCQLCHDFEKTYHAVADVIRSQAPQSLNLFFTV DVNEVPQLVKDLKLQNVPHLVVYPPAESNKQSQFEWKTSPFYQYSLVPENAENTLQFGDFLAKILNISITVPQAFN ........TTTT....TTTTT......TTTT...........TTTT.......................TTTT....... TTTTTT............................TTTTT..........TTTT...................TTTT 133 3GA3.A mol:aa HYDROLASE AKHYKNNPSLITFLCKNCSVLACSGEDIHVIEKMHHVNMTPEFKELYIVRENKALQKKCADYQINGEIICKCGQAWGTMM VHKGLDLPCLKIRNFVVVFKNNSTKKQYKKWVELPITFPNLDYSELEHHHHHH TTTT..........TTTTT..........TTTTTT................TTTTT..TTTT.......TTTT....... TTTTT.............TTTTT.............................. 78 2V89.A mol:aa PROTEIN BINDING SPEFGYWITCCPTCDVDINTWVPFYSTELNKPAMIYCSHGDGHWVHAQCMDLEERTLIHLSEGSNKYYCNEHVQIARA ....TTTT..TTTT..TTTTT...TTTTTTT.......TTTT..........................TTTTTTT... 101 3IE4.A mol:aa IMMUNE SYSTEM YEVPKAKIDVFYPKGFEVSIPDEEGITLFAFHGKLNEEMEGLEAGTWARDIVKAKNGRWTFRDRITALKPGDTLYYWTYV IYNGLGYREDDGSFVVNGYSG ..........TTTT........TTTT.......TTTT..TTTT.TTTT.....TTTTT....TTTT..TTTT........ TTTTT................ 247 3IEE.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION VSTADIENAAEVIKYYNTSLGVLKDVKEKDVNAVLDYEQKGKTPALSAIVPPAVVSKDSAIVLNPGNCFNEETRRNLKQN YTGLFQARTEFYANFDTYLSYLKKKDVTNAKKLLDVNYQLSTQSEYKQNIFDILSPFTEQAELVLLVDNPLKAQISVRKS STQSILNLYARKHRDGPRIDLKVAELTKQLDAAKKLPVVNGHEGEKSYQAFLSQVETFIKQVKKVREKGEYSDADYDLTS AFETSII .....................................TTTTTT.TTTT.......TTTT..TTTTTTTTT.......... .................................................................TTTTTTTTTTTTTT. ..........TTTT........................TTTTTTT................................... ....... 226 2TPS.A mol:aa THIAMIN BIOSYNTHESIS HGIRMTRISREMMKELLSVYFIMGSNNTKADPVTVVQKALKGGATLYQFREKGGDALTGEARIKFAEKAQAACREAGVPF IVNDDVELALNLKADGIHIGQEDANAKEVRAAIGDMILGVSAHTMSEVKQAEEDGADYVGLGPIYPTETKKDTRAVQGVS LIEAVRRQGISIPIVGIGGITIDNAAPVIQAGADGVSMISAISQAEDPESAARKFREEIQTYKTGR ................TTTT................................TTTT........................ .TTTT..............TTTT.........TTTT..........................TTTT.TTTTTT...TTTT ................TTTTTTTTT...................TTTT.................. 178 3K5J.A mol:aa PROTEIN BINDING GDYNQTVLSHLQKFWKHHDIKGFTWTLGRIVEELPDFQVFQVIPNHEDEPWVYVSSGIGQFLGQEFFIISPFETPEHIET LALASASHYPDQFQLGKTVNIGRPWVEQSSFRHFLISLPYPYGQELEYDNVRFFWLLPITQTERLFLNTHSVEELETKFD EAGIDYLDINRASTVWQA ..............TTTT......TTTT.....TTTT........TTTT............................... ..TTTTT......TTTT...TTTTTTTTT.........TTTTT..................................... ....TTTTTTT....... 149 3G7G.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION EMNYEEVFSITITVDKPILIGQDDIVGRRQLIPIISGKVSGNNFNGKVLPGGIDSQIVRPDGKCELSARYAIRLDDGAAI YIENNGIRTVPDEYIEAVKSGEFVDPNAYYFRTIPTFETYSPKYKWMMNHIFVCCASRENVLLKFYKIS ......................TTTTT.............TTTT..............TTTT...........TTTT... ..................................................................... 189 3JZ9.A mol:aa TRANSPORT PROTEIN GHVTRIENLENAKKLWDNANSLEKGNISGYLKAANELHKFKEKNLKEDDLRPELSDKTISPKGYAILQSLWGAASDYSRA AATLTESTVEPGLVSAVNKSAFFDCKLSPNERATPDPDFKVGKSKILVGIQFIKDVADPTSKIWHNTKALNHKIAAIQKL ERSNNVNDETLESVLSSKGENLSEYLSYK .....................TTTT...............TTTT.........TTTT....................... ..TTTTT............TTTTTTTTTTTT....TTTT..................TTTT....TTTTT.......... ..................TTTT....... 294 3CNY.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION SSKAEKDIKWGIAPIGWRNDDIPSIGKDNNLQQLLSDIVVAGFQGTEVGGFFPGPEKLNYELKLRNLEIAGQWFSSYIIR DGIEKASEAFEKHCQYLKAINAPVAVVSEQTYTIQRSDTANIFKDKPYFTDKEWDEVCKGLNHYGEIAAKYGLKVAYHHH GTGIQTKEETDRLANTDPKLVGLLYDTGHIAVSDGDYALLNAHIDRVVHVHFKDVRRSKEEECRAKGLTFQGSFLNGFTV PGDGDLDFKPVYDKLIANNYKGWIVVEAEQDPSKANPLEAQIAHRYIKQHLIEN ..................TTTTTTTTTTT...................TTTT............................ .............................TTTTTTTTTTTTTTTT................................... .TTTT...........TTTTT..........................................................T TTTT..........................TTTTTTTTT............... 92 3KGK.A mol:aa CHAPERONE MKTLMVFDPAQALVDFSTDVQWLKQSGVQIERFNLAQQPMSFVQNEKVKAFIEASGAEGLPLLLLDGETVMAGRYPKRAE LARWFGIPLDKV .................................TTTTTT........................TTTTT............ ............ 281 3KG9.A mol:aa LYASE LHPLLGEKLNLARIENQHHFQSYLTAESPAYLSQHQVFNKVLFPATGYLEIAAAVGKNLLTTGEQVVVSDVTIVRGLVIP ETDIKTVQTVISTLENNSYKLEIFSTSEANQWTLHAEGKIFLDSTTNTKAKIDLEQYQRECSQVIDIQQHYQQFKSRGID YGNSFQGIKQLWKGQGKALGKIALPEEIAGQATDYQLHPALLDAALQILGHAIGNTETDDKAYLPVGIDKLKQYRQTITQ VWAIVEIPENTLKGSIKLVDNQGSLLAEIEGLRVTATTADA .TTTTT....TTTTTTT.......TTTTTT.....TTTTT....................TTTT...............T TTT...........TTTT.............................................................. .............TTTT.......TTTTT.....TTTT................TTTTT..................... .......TTTT........TTTT..............TTTT 146 3DSB.A mol:aa TRANSFERASE ELIEIREARDDLDTIAKFNYNLAKETEGKELDDVLTKGVKALLLDERKGKYHVYTVFDKVVAQIYTYEWSDWRNGNFLWI QSVYVDKEYRRKGIFNYLFNYIKNICDKDENIVGRLYVEKENINAKATYESLNYECDYNYEYEVIH TTTT..................................................TTTTT..........TTTTT...... ........TTTT................TTTT......TTTTTTT..................... 84 3DS2.A mol:aa VIRAL PROTEIN TSILDIRQGPKEPFRDYVDRFAKTLRAEQASQEVKNWMTETLLVQNANPDCKTILKALGPGATLEEMMTACQGVGGPGHK ARVL ........TTTT..............................................TTTT........TTTT...TTT T... 155 3HPC.X mol:aa PROTEIN TRANSPORT SVSVDLNVDPSLQIDIPDALSERDKVKFTVHTKTTLPTFQSPEFSVTRQHEDFVWLHDTLTETTDYAGLIIPPAPTKPDF DGPREKMQKLGEGEGSMTKEEFAKMKQELEAEYLAVFKKTVSSHEVFLQRLSSHPVLSKDRNFHVFLEYDQDLSV ......TTTTTTT.......TTTTT..........TTTTTTT.......................TTTT........... .....................................................TTTT..............TTTT 115 3LS0.A mol:aa PHOTOSYNTHESIS TYSPEKIAQLQVYVNPIAVARDGMEKRLQGLIADQNWVDTQTYIHGPLGQLRRDMLGLASSLLPKDQDKAKTLAKEVFGH LERLDAAAKDRNGSQAKIQYQEALADFDSFLNLLP ............................................TTTTTTT............................. ................................... 132 3GP4.A mol:aa TRANSCRIPTION REGULATOR SLNIKEASEKSGVSADTIRYYERIGLIPPIHRNESGVRKFGAEDLRWILFTRQMRRAGLSIEALIDYLALFREGEHTLEA RAELLKKQRIELKNRIDVMQEALDRLDFKIDNYDTHLIPAQEELKDFNVERS ........................TTTT....TTTT............................................ ................................................TTTT 155 3GP6.A mol:aa TRANSFERASE MNADEWMTTFRENIAQTWQQPEHYDLYIPAITWHARFAYNERPWGGGFGLSRWDEKGNWHGLYAMAFKDSWNKWEPIAGY GWESTWRPLADENFHLGLGFTAGVTARDNWNYIPLPVLLPLASVGYGPVTFQMTYIPGTYNNGNVYFAWMRFQFL .................................TTTT................TTTT...........TTTT........ ........TTTTTT...............TTTT............TTTT.........TTTTT............ 138 2XDH.A mol:aa CELL ADHESION AKTTIIAGSAEAPQGSDIQVPVKIENADKVGSINLILSYPNVLEVEDVLQGSLTQNSLFDYQVEGNQIKVGIADSNGISG DGSLFYVKFRVTTLRNSHALTLQGIEIYDIDGNSVKVATINGTFRIVSQEEAHHHHHH ............TTTT.......................TTTT.......TTTTTTT......TTTT............. ............................TTTT..............TTTT........ 89 2XDG.A mol:aa SIGNALING PROTEIN MLREDESACLQAAEEMPQTTLGCPATWDGLLCWPTAGSGEWVTLPCPDFFSHFSSESGAVKRDCTITGWSEPFPPYPVAC PVPLELLAE ...................TTTT....TTTT.....TTTT........................TTTT...TTTT..... ......... 195 2ZX0.A mol:aa IMMUNE SYSTEM, SUGAR BINDING PROTEIN AISITCEGSDALLQCDGAKIHIKRANYGRRQHDVCSIGRPDNQLTDTNCLSQSSTSKMAERCGGKSECIVPASNFVFGDP CVGTYKYLDTKYSCVQQQETISSIICEGSDSQLLCDRGEIRIQRANYGRRQHDVCSIGRPHQQLKNTNCLSQSTTSKMAE RCDGKRQCIVKVSNSVFGDPCVGTYKYLDVAYTCD .....TTTT.....TTTT............TTTTTTTTT......TTTT.TTTT.......TTTTTTT..........TT TTTT...........TTTT......TTTT.....TTTT............TTTTTTTTT......TTTT.TTTT...... .TTTTTTT..........TTTTTT........... 86 2ZXY.A mol:aa OXYGEN BINDING, TRANSPORT PROTEIN ADGKAIFQQKGCGSCHQANVDTVGPSLKKIAQAYAGKEDQLIKFLKGEAPAIVDPAKEAIMKPQLTMLKGLSDAELKALA DFILSH ................TTTTTTTTT........TTTT.............TTTTT......................... ...... 68 3I4O.A mol:aa TRANSLATION GAIEVEGRVVEPLPNAMFRIELENGHKVLAHISGKMRQHYIRILPEDRVVVELSPYDLSRGRIVYRYK .TTTT.......TTTTT....TTTT..................TTTT......TTTTT.......... 160 3IHT.A mol:aa TRANSFERASE EQSRLDLFIDRVSQRACLEHAIAQTAGLSGPVYELGLGNGRTYHHLRQHVQGREIYVFERAVASHPDSTPPEAQLILGDI RETLPATLERFGATASLVHADLGGHNREKNDRFARLISPLIEPHLAQGGLVSSDRYFEGLEELPLPPGAVVGRCFIYRRG ........................TTTT........TTTT.........TTTT.....TTTT.................. ..........TTTT.....................................TTTT.TTTT.....TTTTTTTT....... 106 2VV6.A mol:aa SIGNALING PROTEIN, TRANSFERASE IPDAMIVIDGHGIIQLFSTAAERLFGWSELEAIGQNVNILMPEPDRSRHDSYISRYRTTSDPHIIGIGRIVTGKRRDGTT FPMHLSIGEMQSGGEPYFTGFVRDLT ........TTTT...................TTTT......TTTT................TTTTTTT......TTTT.. ..........TTTTT........... 87 3D3B.J mol:aa TRANSCRIPTION GPLGSMQNQRIRIRLKAFDHRLIDQATAEIVETAKRTGAQVRGPIPLPTRSRTHLRLVDIVEPTEKTVDALMRLDLAAGV DVQISLG ..TTTTTTT.......TTTT............................TTTTT......TTTT.............TTTT ....... 211 3D34.A mol:aa IMMUNE SYSTEM ICSARAPAKYSITFTGKWSQTAFPKQYPLFRPPAQWSSLLGAAHSSDYSMWRKNQYVSNGLRDFAERGEAWALMKEIEAA GEALQSVHEVFSAPAVPSGTGQTSAELEVQRRHSLVSFVVRIVPSPDWFVGVDSLDLCDGDRWREQAALDLYPYDAGTDS GFTFSSPNFATIPQDTVTEITSSSPSHPANSFYYPRLKALPPIARVTLLRL ..................TTTTTTTTT.TTTTT...........TTTT...TTTT......................... .................TTTT........TTTTT.......TTTTTTT..........TTTT.................. ..TTTT....TTTT......TTTTTTTTTTTTTTTTTTTT........... 104 3LR4.A mol:aa TRANSFERASE AQRVALQLVAIVKLTRTALLYSDPDLRRALLQDLESNEGVRVYPREKTDKFKLQPDESVNRLIEHDIRSRLGDDTVIAQS VNDIPGVWISFKIDDDDYWVALDR .............................................TTTT.......TTTT...........TTTT....T TTTT........TTTT........ 125 3LR2.A mol:aa STRUCTURAL PROTEIN HTTPWTNPGLAENFMNSFMQGLSSMPGFTASQLDDMSTIAQSMVQSIQSLAAQGRTSPNKLQALNMAFASSMAEIAASEE GGGSLSTKTSSIASAMSNAFLQTTGVVNQPFINEITQLVSMFAQA ..TTTTT......................................................................... .....................TTTT.................... 415 2QEE.A mol:aa HYDROLASE LSINSREVLAEKVKNAVNNQPVTDMHTHLFSPNFGEILLWDIDELLTYHYLVAEVMRWTDVSIEAFWAMSKREQADLIWE ELFIKRSPVSEACRGVLTCLQGLGLDPATRDLQVYREYFAKKTSEEQVDTVLQLANVSDVVMTNDPFDDNERISWLEGKQ PDSRFHAALRLDPLLNEYEQTKHRLRDWGYKVNDEWNEGSIQEVKRFLTDWIERMDPVYMAVSLPPTFSFPEESNRGRII RDCLLPVAEKHNIPFAMMIGVKKRVHPALGDAGDFVGKASMDGVEHLLREYPNNKFLVTMLSRENQHELVVLARKFSNLM IFGCWWFMNNPEIINEMTRMRMEMLGTSFIPQHSDARVLEQLIYKWHHSKSIIAEVLIDKYDDILQAGWEVTEEEIKRDV ADLFSRNFWRFVGRN .................................TTTTTT......................................... ...TTTT.........................................................TTTT............ .TTTT...........................TTTT....................TTTT....TTTTTTTT........ .......................TTTT.......................TTTT.....................TTTT. ........................TTTT.......TTTT......................................... ............... 127 2QEU.A mol:aa LYASE ATSQDILKQHAAHYESDGGLPEALVQLAEYAPETFDAYSRRTTLKSEADGAKLPLKYKHLILVVLDAIRDEPIGIVNHTR AANAGLSVDELIEGILLGIIVYGPAWGKTGRKAVTFAVEFEKELAGK ................................................................................ .......................TTTTTTT................. 411 3HXL.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION GVYINFKSEPQLGERGIVSPLILSWGEPGKITIEAGDDVFPKLGYSIDAQLRLINEALKRAKTLLLYRLNAGTKAAVTVG NLTVTAKWGGARGNDITLVIQENIDDETKFDVSTLVDGAELDKQTVSDIAGLAANDWVIFSGTGALTETAGAPLINGSDG AVTNQAYIDYLAAVEIFDFNTIALPSTDDALKATFTAFAKRLRDDEGKKIQVVLENYPAADYEGVISVKNGVVLADGTIL TAAQATAWVAGATAGARVNESLTYQGYDEAVDVAPRYTNAQIIAALQAGEFLFTASDNQALVEQDINTLTSFTADKGKQF AKNRVIRVLDGINNDFVRIFSKFYSNNADGRNLLKSECINYNTLQDIDAIKNFDGQTDLTVQSDVDAVYIEAYAWPVDSI EKIYVRVRIKL ..........................TTTT...TTTT.........................................TT TT...TTTT.............TTTTTTT.....TTTTT........TTTTT..TTTT...................... .....................................................TTTT....TTTT........TTTT... ................TTTT.TTTT.TTTT..TTTT..................TTTTT..TTTT.......TTTTT... ...............................................TTTT..TTTTT.....TTTT........TTTT. ........... 124 3HX8.A mol:aa ISOMERASE GQSAKEAIEAANADFVKAYNSKDAAGVASKYDDAAAFPPDARVDGRQNIQKLWQGADGISELKLTTLDVQESGDFAFESG SFSLKAPGKDSKLVDAAGKYVVVWRKGQDGGWKLYRDIWNSDPA .......................................................................TTTT..... .......TTTT...............TTTT.............. 89 3M9Q.A mol:aa DNA BINDING PROTEIN LRDETPLFHKGEIVLCYEPDKSKARVLYTSKVLNVFERRNEHGLRFYEYKIHFQGWRPSYDRAVRATVLLKDTEENRQLQ RELAEAAKL ........TTTT.......TTTT................TTTT.........TTTT........................ ......... 236 2X2U.A mol:aa TRANSFERASE LYFSRDAYWEKLYVDQAAGTPLLYVHALRDAPEEVPSFRLGQHLYGTYRTRLHENNWIRIQEDTGLLYLQRSLDHSSWEK LSVRNRGFPLLTVYLKVFLSECQWPGCARVYFSFFNTSFPACSSLKPRELCFPETRPSFRIRENRPPGTFHQFRLLPVQF LCPQISVAYRLLEGEGLPFRSAPDSLEVSTRWALDREQREKYELVAVCTVHREEVVMVPFPVTVYDEDDSAPEFEN ..TTTTTT....TTTTTTTT........TTTTTT....................TTTT..TTTTT...TTTT........ ....TTTT..............TTTTT.........................TTTT.....TTTT............... .TTTT.....TTTTTTT....TTTT.........TTTTT..........................TTTT....... 150 2X2S.A mol:aa CELL ADHESION GFKGVGTYEIVPYQAPSLNLNAWEGKLEPGAVVRTYTRGDKPSDNAKWQVALVAGSGDSAEYLIINVHSGYFLTATKENH IVSTPQISPTDPSARWTIKPATTYEVFTINNKVSELGQLTVKDYSTHSGADVLSASAKTADNQKWYFDAK ...........TTTTTTT.....TTTT...........TTTT.......................TTTTT......TTTT .......TTTT........TTTT.......TTTT............TTTT.................... 129 2V4X.A mol:aa VIRAL PROTEIN PVFENNNQRYYESLPFKQLKELKIACSQYGPTAPFTIAIENLGTQALPPNDWKQTARACLSGGDYLLWKSEFFEQCARIA DVNRQQGIQTSYELIGEGPYQATDTQLNFLPGAYAQISNAARQAWKRLP ....TTTT.....................TTTT............................................... .........................TTTT.................... 65 3HFO.A mol:aa RNA BINDING PROTEIN DSGLPSVRQVQLLIKDQTPVEIKLLTGDSLFGTIRWQDTDGLGLVDDSERSTIVRLAAIAYITPR .TTTT..................TTTT..........TTTT....TTTT................ 274 3G3T.A mol:aa BIOSYNTHETIC PROTEIN FVRQTTKYWVHPDNITELKLIILKHLPVLEREDSAITSIYFDNENLDLYYGRLRKDEGAEAHRLRWYGGMSTDTIFVERK THREDWTGEKSVKARFALKERHVNDFLKGKYTVDQVFAKMRKEGKKPMNEIENLEALASEIQYVMLKKKLRPVVRSFYNR TAFQLPGDARVRISLDTELTMVREDNFDGVDRTHKNWRRTDIGVDWPFKQLDDKDICRFPYAVLEVKLQTQLGQEPPEWV RELVGSHLVEPVPKFSKFIHGVATLLNDKVDSIP ..........................................TTTT.........TTTT.........TTTT........ ............................TTTT................................................ ....TTTT.................TTTT.TTTTTTTTTTTTTTTTTTTTT...................TTTT...... .....TTTT................TTTTTTTT. 51 3G36.A mol:aa NUCLEAR PROTEIN VDLQSLPTRAYLDQTVVPILLQGAVLAKERPPNPIEFLASYLLKNKAQFED ............TTTTT.............TTTT................. 55 3LE4.A mol:aa NUCLEAR PROTEIN PPTEPLPDGWIMTFHNSGVPVYLHRESRVVTWSRPYFLGTGSIRKHDPPLSSIPC ......TTTT....TTTT.....TTTTT.........TTTTTTTTT......... 252 2QJV.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION ANLLSTCTSESGNIQHISPQNAGWEYVGFDVWQLAGESITLPSDERERCLVLVAGLASVAADSFFYRIGQRSPFERIPAY SVYLPHHTEAVTAETDLELAVCSAPGFGELPVRLISPQEVGVEHRGGRNQRLVHNILPDSQLADSLLVVEVYTNAGATSS WPAHHDTAVEGQETYLEETYYHRFNPPQGFCLQRVYTDDRSLDECAVYNRDVVVPGYHPVATIAGYDNYYLNVAGPLRWR FTWEENHAWINS ..........TTTT.....................................TTTT....TTTT................. ....TTTT....TTTT..............................TTTT....TTTTTTT.TTTT.......TTTTTTT TTTT....TTTTT..........TTTTTT.......TTTTTTT....TTTT...........TTTT........TTTT.. ............ 85 3MAB.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION LANLSELPNIGKVLEQDLIKAGIKTPVELKDVGSKEAFLRIWENDSSVCMSELYALEGAVQGIRWHGLDEAKKIELKKFH QSLEG ......TTTT..................................TTTT................................ ..... 181 3MAL.A mol:aa PLANT PROTEIN VEITYGSAIKLMHEKTKFRLHSHDVPYGSGSGQQSVTGFPGVVDSNSYWIVKPVPGTTEKQGDAVKSGATIRLQHMKTRK WLHSHLHASPISGNLEVSCFGDDTNSDTGDHWKLIIEGSGKTWKQDQRVRLQHIDTSGYLHSHDKKYQRIAGGQQEVCGI REKKADNIWLAAEGVYLPLNE ...TTTT.....TTTTT...........TTTT........TTTT.........TTTT..TTTT..TTTT.....TTTTT. ........TTTTT........TTTT..................TTTT.....TTTTT.........TTTTTTTT...... ..................... 167 3HVS.A mol:aa OXIDOREDUCTASE SQTVHFQGNPVTVANSIPQAGSKAQTFTLVAKDLSDVTLGQFAGKRKVLNIFPSIDTGVCAASVRKFNQLATEIDNTVVL CISADLPFAQSRFCGAEGLNNVITLSTFRNAEFLQAYGVAIADGPLKGLAARAVVVIDENDNVIFSQLVDEITTEPDYEA ALAVLKA ....TTTTT.........TTTT........TTTT.......TTTT......TTTTTT................TTTT... ............TTTTTTTTTT....TTTTT...........TTTTTTT........TTTT.......TTTTTT...... ....... 277 3HN0.A mol:aa TRANSPORT PROTEIN DTVIKVSVLRGPSVIAFADWLENPPIIDNKKVQVKVVDSPDLAQALLIKQETDIAVLPINAANLYNKGIKIKLAGCPIWG TLYLVEKTPLKEPALYVFGNGTTPDILTRYYLGRQRLDYPLNYAFNTAGEITQGILAGKVNRAVLGEPFLSIALRKDSSL RITADLNHLTDNDTLGFAQTAVVYTPTEKYRIAFEDALRASCQKAVRYPKETIHSLEEHGIFAQGALTPKSIERCKIYYL SAIEAKDAVGFLRLIEQYEPKAVGGRLPDAGFIPEKQ .........................TTTTT.....................TTTT......................... ......TTTTTTT.....TTTT....................TTTT...................TTTT.......TTTT .......TTTTTTTTT...........TTTT............................TTTTTTTTT............ TTTTTTTTT.............TTTT........... 199 3HN5.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION ESLTGRVYNGEALQLRGNEAVQLQLYQHGYAKHDPINVYVNQDGYSANLFDGEYQITKSGNGPWTSEGRDTINVTVAGNT VQDVEVTPYYLVRDAQTLEGNKVNASFKVEKVAGGGIDRVFFLSTTQFVNDAEHNVDRYDETDNLDAYDETGKLYTFATR DYTDNSFQTALKRGTLFGRICIWPKGSDQGIYSKVIRLK .......TTTT.....TTTT.....TTTT...........TTTT.............TTTT....TTTT.......TTTT ..................TTTT........TTTT...............TTTTTTTT....TTTT....TTTT....... .TTTT..................TTTT............ 270 3HTV.A mol:aa TRANSFERASE KQHNVVAGVDGATHIRFCLRTAEGETLHCEKKRTAEVIAPGLVSGIGEIDEQLRRFNARCHGLVGFPALVSKDKRTIIST PNLPLTAADLYDLADKLENTLNCPVEFSRDVNLQLSWDVVENRLTQQLVLAAYLGTGGFAVWNGAPWTGAHGVAGEETNC SGALRRWYEQQPRNYPLRDLFVHAENAPFVQSLLENAARAIATSINLFDPDAVILGGGVDPAFPRETLVATQKYLRRPLP HQVVRFIAASSSDFNGAQGAAILAHQRFLP ..........TTTT......TTTT.............TTTT.............................TTTT...... .........TTTT..............................TTTT......TTTT...........TTTTTTTTTTTT T......................TTTT................................................TTTTT TTTT.......TTTT............... 127 3EBT.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION GSNNQTVRESYEAFHRRDLPGVLAALAPDVRWTHPDGSPYGLGGTKHGHDEVIAFIRHVPTHIAERLAPDEFIESGERIV VLGTRRVTAVNGRSATLKFVHVWRFENGRAVTFEDHFDTAEIRLITA .....................................TTTT.................................TTTT.. ........TTTT............TTTTT........TTTTTTTTT. 353 3AAP.A mol:aa HYDROLASE KHSCIAVIDAGSTGSRLHIYSYDTDDTNTPIHIEEIWNKKIKPGFASIQPNSVTIDAYLTMLLADAPIHNIPVYFYATAG MRLLPQSQQKKYYDELEYWFRQQSQWQLVEAKTITGNDEALFDWLAVNYKLDTLKSVQNKSVGVMDMGGASVQIVFPMPK NAEISKHNQVELNIYGQNINLYVHSFLGLGQTEMSHQFLNSPSCFANDYPLPDGESGQGNAPSCKEEVTSLMNSVHKVNQ QIQPLLALNPVNEWYSIGGISNLASSQLFHFENSELTNQSLLQQGDNQICHQQWDILNGQYPDDEYLYQYCLLSSYYYAL MVDGYGINPNQTIHYIPPEQNLDWTIGVVLHRA ..........TTTT..........TTTT..................................TTTT.............. ....................................................TTTT...........TTTT......... TTTT........TTTTT........TTTT........TTTT....TTTT.TTTT.......................... ..........TTTT...........TTTT.TTTTT.........................TTTTTTT............. .......TTTT...................... 151 3JRN.A mol:aa PLANT PROTEIN TKYDVFLSFRGHDTRHNFISFLYKELVRRSIRTFKDDKPIEVSRFAVVVVSENYAASSWCLDELVTIMDFEKKGSITVMP IFYGVEPNHVRWQTGVLAEQFKKHASREDPEKVLKWRQALTNFAQLSGDCSGDDDSKLVDKIANEISNKKT TTTT.........TTTTT....................TTTTT.......TTTTTTTT...................... .TTTT..............................................TTTT...........TTTT. 127 3MR0.A mol:aa TRANSCRIPTION REGULATOR ERFQLAVSGASAGLWDWNPKTGAYLSPHFKKIGYEDHELPDEITESIHPDDRARVLAALKAHLEHRDTYDVEYRVRTRSG DFRWIQSRGQALWNSAGEPYRVGWIDVTDRKRDEDALRVSREELRRL .................TTTTT.................TTTT....TTTTT........................TTTT .............TTTT.............................. 470 2W61.A mol:aa GLYCOPROTEIN GVSFEKTPAIKIVGNKFFDSESGEQFFIKGIAYQLQRSANGAFETSYIDALADPKICLRDIPFLKMLGVNTLRVYAIDPT KSHDICMEALSAEGMYVLLDLSEPDISINRENPSWDVHIFERYKSVIDAMSSFPNLLGYFAGNQVTNDHTNTFASPFVKA AIRDAKEYISHSNHRKIPVGYSTNDDAMTRDNLARYFVCGDVKADFYGINMYEWCGYSTYGTSGYRERTKEFEGYPIPVF FSEFGCNLVRPRPFTEVSALYGNKMSSVWSGGLAYMYFEEENEYGVVKINDNDGVDILPDFKNLKKEFAKADPKGITEEE YLTAKEPSVECPHIAVGVWEANEKLPETPDRSKCACLDEILPCEIVPFGAESGKYEEYFSYLCSKVDCSDILANGKTGEY GEFSDCSVEQKLSLQLSKLYCKIGANDRHCPLNDKNVYFNLESLQPLTSESICKNVFDSIRNITYNHGDY ...TTTTT....TTTT..TTTTT...............TTTTT.TTTT.............................TTT T.....................TTTTT.TTTTT................TTTTTTT.....TTTTTTTTTTT........ .........................TTTTT........TTTT.TTTT.......TTTT.............TTTT..... ......TTTTTT.............TTTT..........TTTT......TTTT........................... .TTTT.........TTTTT.....................TTTT..TTTT.......................TTTTT.. TTTTTT...........................TTTT..........TTTTTTT................ 61 2W6A.A mol:aa SIGNALING PROTEIN GPLGDGAVTLQEYLELKKALATSEAKVQQLMKVNSSLSDELRKLQREIHKLQAENLQLRQP .....TTTT.................................................... 186 3FRR.A mol:aa PROTEIN BINDING LGSGFKAERLRVNLRLVINRLKLLEKKKTELAQKARKEIADYLAAGKDERARIRVEHIIREDYLVEAMEILELYCDLLLA RFGLIQSMKELDSGLAESVSTLIWAAPRLQSEVAELKIVADQLCAKYSKEYGKLCRTNQIGTVNDRLMHKLSVEAPPKIL VERYLIEIAKNYNVPYEPDSVVMAEA ................................................................................ ............................TTTTTT.......................TTTT................... .......................... 365 3EKI.A mol:aa TPP BINDING PROTEIN QGQWDKSITFGVSEAWLNKKKGGEKVNKEVINTFLENFKKEFNKLKNANDKTKNFDDVDFKVTPIQDFTVLLNNLSTDNP ELDFGINASGKLVEFLKNNPGIITPALETTTNSFVFDKEKDKFYVDGTDSDPLVKIAKEINKIFVETPYASWTDENHKWN GNVYQSVYDPTVQANFYRGMIWIKGNDETLAKIKKAWNDKDWNTFRNFGILHGKDNSSSKFKLEETILKNHFQNKFTTLN EDRSAHPNAYKQKSADTLGTLDDFHIAFSEEGSFAWTHNKSATKPFETKANEKMEALIVTNPIPYDVGVFRKSVNQLEQN LIVQTFINLAKNKQDTYGPLLGYNGYKKIDNFQKEIVEVYEKAIK ....TTTT...........TTTTT........................TTTTTTT.......................TT TTT...............TTTTTT.........TTTTTTTTT.TTTTTTTT.....................TTTTTTTT TTT..................................................TTTT..............TTTT..... ....................TTTT.....TTTT....TTTTTTTTTT.TTTT..................TTTT...... ..............TTTT..........TTTT............. 153 2WFO.A mol:aa VIRAL PROTEIN ELPSLCMLNNSFYYMKGGANIFLIRVSDVSVLMKEYDVSVYEPEDLGNCLNKSDSSWAIHWFSIALGHDWLMDPPMLCRN KTKKEGSNIQFNISKADESRVYGKKIRNGMRHLFRGFYDPCEEGKVCYVTINQCGDPSSFEYCGTNYLSKCQF ........TTTT....TTTT............TTTTT...............................TTTTT......T TTTTTT.........TTTTT.............TTTT....TTTT.......TTTT..TTTTTT......... 120 2WFB.A mol:aa BIOSYNTHETIC PROTEIN ASHMQRIAVTAEGPGLDGLVDPRFGRAAGFVVVDAATMAAEYVDNGASQTLSHGAGINAAQVLAKSGAGVLLTGYVGPKA FQALQAAGIKVGQDLEGLTVRQAVQRFLDGQVPMAAGPNK ..............TTTT....TTTTT......TTTTT............TTTT.......................... ..............TTTT...................... 205 3MZO.A mol:aa HYDROLASE GGIHQYFQSLSDLENIYRCPGKFKYQEHSVAEHSYKVTSIAQFFGAVEEDAGNEVNWRALYEKALNHDYSELFIGDIKTP VKYATTELRELSEVEESTKNFISREIPATFQPIYRHLLKEGKDSTLEGKILAISDKVDLLYESFGEIQKGNPENIFVEIY SEALATIYEYREASVKYFLKEILPDLAEKGIEKTELPQLTTEITT ................TTTT...TTTT......................................TTTT........... TTTT............................................................................ .....................TTTT..TTTT.............. 218 3FSS.A mol:aa CHAPERONE TNTIFKLEGVSVLSPLRKKLDLVFYLSNVDGSPVITLLKGNDRELSIYQLNKNIKASFLPVPEKPNLIYLFTYTSCEDNK FSEPVVTLNKENTLNQFKKLGLLDSNVTDFEKCVEYIRKQAILTGFKISNPFVKINSFHLQCHRGTKEGTLYFLPDHIIF GFKKPILLLDASDIESITYSSITRLTFNASLVTKDGEKYEFSIDQTEYAKIDDYVKRK ............TTTTT.........TTTTT......TTTTT....TTTT..........TTTTTTT............. .......................TTTT......................TTTT.........TTTTT......TTTT... ..TTTT................TTTT......TTTT...................... 109 3FSO.A mol:aa CELL ADHESION MRDVVSFEQPEFSVSRGDQVARIPVIRRVLDGGKSQVSYRTQDGTAQGNRDYIPVEGELLFQPGEAWKELQVKLLELRQV RRFHVQLSNPKFGAHLGQPHSTTIIIRDP TTTT......................................TTTTTTTTTT.........TTTT............... ..........TTTT..TTTTTT....... 225 3GKJ.A mol:aa TRANSPORT PROTEIN QSCVWYGECGIAYGDKRYNCEYSGPPKPLPKDGYDLVQELCPGFFFGQVSLCCDVRQLQTLKDNLQLPLQFLSRCPSCFY NLLNLFCELTCSPRQSQFLQVTATEDYVDPVTNQTKTNVKELQYYVGQSFANAMYNACRDVEAPSSNDKALGLLCGKDAD ACQATNWIEYMFNKDNGQAPFTITPVFSDFPVHGMEPMNNATKGCDESVDEVTAPCSCQDCSIVC ............TTTT.............................TTTT......................TTTTT.... ...........TTTT.............TTTTT........................TTTT.TTTT...........TTT TT.............TTTTTTT........TTTTT........TTTT..TTTT............ 181 3LXR.F mol:aa SIGNALING PROTEIN/RHOA-BINDING PROTEIN NFGISLSHKRYFSGKVDEIIRCTMGKRIVKISSTKINTSILSSVSEQIGENITDWKNDEKKVYVSRVVNQCIDKFCAEHS RKIGDNLRKQIFKQVEKDYRISLDINAAQSSINHLVSGSSYFKKKMDELCEGMNRSVKNDTTSNVANLISDQFFEKNVQY IDLKKLRGNMSDYITNLESPF ..........TTTT........TTTTT.....................TTTT............................ .......................TTTT......................TTTT........................... ..................... 261 3M1T.A mol:aa HYDROLASE GHSAALLQKVDELPRLPKAIAELLDVVNNEDSTVKAVSEKLSHDPVLSARVLRLANSAEVGTIDDAVVRLGQTLRTLVIA SAVVGAVPKVEGFDLADFWGNTFEVAIICQELAKRLGTLPEEAFTCGILHSIGELLIVNGDPAVAATISAAVADGADRNL EKELLGYDNAEIGALLAQSWKFTPHLVKGIQFQNHPKSAEPYSKLAGLAAKQIAADWDKIPDDERTSWLAQINILAGIKV DLGGLAEKLAKHGQGEGKQLA ............................TTTT................................................ .........TTTT...................................TTTT........................TTTT ...............................TTTTT..TTTTTTTTT.............TTTTT............... .TTTT...........TTTTT 143 3EF8.A mol:aa LYASE TDTNLVERAIERFDYSYHLDNHPEELAALFVEDCEVSYAPNFGATGRDAYKKTLEGIGTFFRGTSHHNSNICIDFVSETE ANVRSVVLAIHRYTKERPDGILYGQYFDTVVKVDGQWKFKRRELRTTTTDYHVRAANPIGRAE .......TTTTT..........................TTTT...........TTTT.................TTTTTT ............TTTT...............TTTTT...........TTTT............ 82 3FYM.A mol:aa DNA BINDING PROTEIN KTVGEALKGRRERLGMTLTELEQRTGIKREMLVHIENNEFDQLPNKNYSEGFIRKYASVVNIEPNQLIQAHQDEIPSNQA EW ..........................................TTTT........................TTTTT..... .. 93 3FYB.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION GADIDQASKTEEAAAFRHLLRHLDEHKDVQNIDLIQADFCRNCLAKWLEAATEQGVELDYDGAREYVYGPFAEWKTLYQK PASEAQLAAFEAK .......................TTTTTTTTTTTTTTT.......................................... ............. 94 2W7N.A mol:aa TRANSCRIPTION/DNA KKRLTESQFQEAIQGLEVGQQTIEIARGVLVDGKPQATFATSLGLTRGAVSQAVHRVWAAFEDKNLPEGYARVTAVLPEH QAYIVRKWEADAKK ............TTTT..............................................TTTTTTTT.......... .............. 97 2W7A.A mol:aa RNA-BINDING PROTEIN GAGNLRLIGVPESDVENGTKLENTLQDIIQENFPNLARQANVQIQEIQRTPQRYSSRRATPRHIIVRFTKVEKEKLRAAR EKGRVTLKGKPIRLTVD ............TTTT................................TTTT..TTTT.........TTTT......... .....TTTTT....... 189 2X5N.A mol:aa NUCLEAR PROTEIN VLEATMILIDNSEWMINGDYIPTRFEAQKDTVHMIFNQKINDNPENMCGLMTIGDNSPQVLSTLTRDYGKFLSAMHDLPV RGNAKFGDGIQIAQLALKHRENKIQRQRIVAFVGSPIVEDEKNLIRLAKRMKKNNVAIDIIHIGELSALQHFIDAANSSD SCHLVSIPPSPQLLSDLVNQSPIGQGVVA .................TTTTTTT..................TTTT.......TTTT.................TTTT.. ...................TTTTTT.....................................TTTT..........TTTT T...................TTTT..... 274 2X55.A mol:aa HYDROLASE QLIPNISPDSFTVAASTGMLSGKSHEMLYDAETGRKISQLDWKIKNVAILKGDISWDPYSFLTLNARGWTSLASGSGNMD DYDWMNENQSEWTDHSSHPATNVNHANEYDLNVKGWLLQDENYKAGITAGYQETRFSWTATGGSYSYNNGAYTGNFPKGV RVIGYNQRFSMPYIGLAGQYRINDFELNALFKFSDWVRAHDNDEHYMRDLTFREKTSGSRYYGTVINAGYYVTPNAKVFA EFTYSKYDESIGGDAAGISNKNYTVTAGLQYRFG ......TTTT...................TTTTT......................TTTTTT.................. .....TTTT..............................TTTT.......................TTTTTTT...TTTT .....................TTTT.............................................TTTTTT.... .................................. 183 3BYQ.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION SLIEIRKRTLIVETTYHENGPAPAQPLKLAASCAVIRNPYAGRYEPDLPFAELRSLGTLLATELVDTLGKDNIEVYSKAA IVGVDGEEHGAVWHEAGGWARSVLGEPKAVPAVKAVATAGYRVPVHYIHASYVRSHFNSIEIGIQDAPRPREILFALVGT GARVHARLGGLTKEAVSVHDGQR ................TTTT..TTTT...........TTTTTT.TTTT................................ ..TTTT.TTTT.........TTTTTTT..........TTTT.....TTTTTTT..........TTTTTTTTT........ ..TTTT.........TTTTTTTT 82 3BYP.A mol:aa TRANSPORT PROTEIN GLPPEEVERIRAFLQERIRGRALEVHDLKTRRAGPRSFLEFHLVVRGDTPVEEAHRLCDELERALAQAFPGLQATIHVEP EG .................TTTTT..........TTTT.........TTTT...................TTTT........ .. 128 3MDP.A mol:aa NUCLEOTIDE BINDING PROTEIN ISPERLRVYRFFASLTDEQLKDIALISEEKSFPTGSVIFKENSKADNLLLLEGGVELFYSSTVCSVVPGAIFGVSSLIKP YHYTSSARATKPVRVVDINGARLRESENNQALGQVLNNVAAAVLARLH .TTTT...........................TTTT...TTTT.......................TTTT.......TTT T.......TTTT.............TTTTT.................. 225 3H3L.A mol:aa HYDROLASE QALDSDGIPTGGEWITFDGKTLNGWRGYCRQDVPLGWVVEDGSITYKGSDNKADTGFGDLIYDKKFKNFVFEIEWKIDKA GNSGIFYTAQEIEGTPIYYSSPEYQLLDNENPDAWEGCDGNRQAGAVYDIPDPQPVKPYGNWNKTRIVVYNQRVIHYNDV KILEFQFGTPVWRALVDHSKFSKFSTSPEKCPEAYDLLQCGKQPGYIGQDHGYGVCFRNIRIKEL ...TTTT..........TTTTTTTT.TTTT...TTTT.TTTTT.....TTTT.............TTTT........TTT T..........TTTT............TTTTTTTTTTTTTTTTTTT....TTTT...TTTT.......TTTTT....... .....TTTT.........TTTTTTT.TTTT...TTTT.TTTTTT....TTTTTTT.......... 123 2RBG.A mol:aa STRUCTURAL GENOMICS, UNKNOWN FUNCTION YKNILTLISVNNDNFENYFRKIFLDVRSSGSKKTTINVFTEIQYQELVTLIREALLENIDIGYELFLWKKNEVDIFLKNL EKSEVDGLLVYCDDENKVFSKIVDNLPTAIKRNLIKDFCRKLS TTTT.....................................................TTTT................... ...............TTTT.................TTTTTT. 48 3HE5.B mol:aa DE NOVO PROTEIN GSARNAYLRKKIARLKKDNLQLERDEQNLEKIIANLRDEIARLENEVA TTTT............................................ 196 3CKK.A mol:aa TRANSFERASE YPVKPEEMDWSELYPEFAQVEFADIGCGYGGLLVELSPLFPDTLILGLEIRVKVSDYVQDRIRALRAAPAGGFQNIACLR SNAMKHLPNFFYKGQLTKMFFLFPDIISPTLLAEYAYVLRVGGLVYTITDVLELHDWMCTHFEEHPLFERVPLEDLSEDP VVGHLGTSTEEGKKVLRNGGKNFPAIFRRIQDPVLQ .TTTT...TTTTTTTTT......TTTTTTT.........TTTT........................TTTT.TTTT.... .TTTTT.....TTTT.....TTTT........................................TTTT.......TTTTT TT.TTTTT.......................TTTTT