Query gi|254780956|ref|YP_003065369.1| thymidylate synthase [Candidatus Liberibacter asiaticus str. psy62] Match_columns 264 No_of_seqs 158 out of 1666 Neff 5.7 Searched_HMMs 39220 Date Mon May 30 01:44:17 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780956.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PTZ00164 bifunctional dihydrof 100.0 0 0 803.6 22.2 263 2-264 230-512 (512) 2 TIGR03284 thym_sym thymidylate 100.0 0 0 797.2 22.8 263 2-264 1-296 (296) 3 PRK01827 thyA thymidylate synt 100.0 0 0 796.4 22.7 264 1-264 1-264 (264) 4 PRK13821 thyA thymidylate synt 100.0 0 0 790.7 22.7 264 1-264 1-323 (323) 5 KOG0673 consensus 100.0 0 0 727.9 17.0 263 2-264 11-293 (293) 6 pfam00303 Thymidylat_synt Thym 100.0 0 0 705.1 22.6 261 2-264 2-262 (262) 7 COG0207 ThyA Thymidylate synth 100.0 0 0 701.0 20.7 264 1-264 3-268 (268) 8 cd00351 TS_Pyrimidine_HMase Th 100.0 0 0 556.7 17.2 214 3-217 1-215 (215) 9 PRK00956 thyA thymidylate synt 100.0 0 0 388.9 11.1 196 3-222 12-207 (208) 10 TIGR03283 thy_syn_methano thym 100.0 0 0 389.3 9.7 190 2-216 10-199 (199) 11 KOG0217 consensus 45.6 25 0.00065 16.5 3.8 87 47-134 497-618 (1125) 12 cd00988 PDZ_CTP_protease PDZ d 42.0 29 0.00074 16.2 4.1 47 189-235 26-72 (85) 13 KOG3690 consensus 41.7 29 0.00074 16.2 3.1 57 93-149 208-269 (646) 14 pfam08617 CGI-121 Kinase bindi 41.6 15 0.00039 17.8 1.6 32 47-78 48-80 (155) 15 pfam09450 DUF2019 Domain of un 41.2 26 0.00066 16.5 2.7 28 107-134 29-57 (106) 16 pfam03410 Peptidase_M44 Protei 35.0 33 0.00084 15.8 2.5 114 108-233 144-284 (590) 17 pfam01024 Colicin Colicin pore 34.6 22 0.00057 16.8 1.6 40 95-142 47-92 (187) 18 TIGR00824 EIIA-man PTS system, 30.2 18 0.00047 17.4 0.5 14 168-181 65-78 (117) 19 pfam09687 P_fal_TIGR01639 Prot 28.0 43 0.0011 15.2 2.1 25 110-135 13-37 (61) 20 pfam05613 Herpes_U15 Human her 27.6 34 0.00086 15.8 1.5 34 41-74 51-84 (110) 21 pfam08359 TetR_C_4 YsiA-like p 27.5 25 0.00064 16.6 0.8 49 1-67 66-116 (133) 22 TIGR02217 chp_TIGR02217 conser 26.0 23 0.00059 16.8 0.4 20 87-106 66-85 (220) 23 cd00989 PDZ_metalloprotease PD 25.4 55 0.0014 14.5 3.1 47 187-235 23-69 (79) 24 pfam09343 DUF2460 Conserved hy 24.4 20 0.0005 17.2 -0.2 18 89-106 66-83 (197) 25 pfam07353 Uroplakin_II Uroplak 23.1 25 0.00063 16.6 0.1 11 141-151 46-56 (184) 26 TIGR01011 rpsB_bact ribosomal 22.9 34 0.00087 15.8 0.8 10 70-79 5-14 (227) 27 TIGR02790 nickel_nikC nickel A 22.8 37 0.00095 15.5 1.0 28 155-183 89-116 (258) 28 cd06570 GH20_chitobiase-like_1 22.7 62 0.0016 14.2 2.8 32 175-206 126-157 (311) 29 pfam07958 DUF1688 Protein of u 21.8 60 0.0015 14.3 1.9 14 107-120 69-82 (413) 30 TIGR02303 HpaG-C-term 4-hydrox 21.7 54 0.0014 14.5 1.6 39 158-203 178-218 (249) 31 cd04483 hOBFC1_like hOBFC1_lik 21.6 65 0.0017 14.1 5.2 65 148-212 14-81 (92) No 1 >PTZ00164 bifunctional dihydrofolate reductase-thymidylate synthase; Provisional Probab=100.00 E-value=0 Score=803.61 Aligned_cols=263 Identities=49% Similarity=0.934 Sum_probs=257.2 Q ss_pred HHHHHHHHHHHHCCCEECCCCCCCEEEEEECEEEEECCCCCCEEEEEEHHHHHHHHHHHHHHCCCCCHHHHHHHCCCHHH Q ss_conf 57999999998639260379985347776120464423698104642022688999999985366646888640032135 Q gi|254780956|r 2 HQYLDLLRHVIKFGSDRRDRTGVGTRSTFGYQMRFDLSKGFPLLTTKKVHWKSVVHELLWFLRGDSNVSYLHRHGVSIWD 81 (264) Q Consensus 2 ~qYl~l~~~il~~G~~~~dRtG~~t~s~fg~~~~~dl~~gfPllTtkk~~~k~~i~EllW~l~G~tn~~~l~~~~~~iWd 81 (264) .|||+|+++||++|..++||||+||+|+||.||||||++|||||||||||||++++||||||+|+||++.|+++|||||| T Consensus 230 ~qYL~ll~~Il~~G~~k~dRTG~gT~S~FG~~~rfdL~~~fPlLTTKkv~~k~vv~ELLWFl~G~Tn~~~L~~~~v~IWd 309 (512) T PTZ00164 230 FQYLELIDRIMKNGNVKEDRTGVGTISKFGYQMRFDLRESFPLLTTKKVFLRGIIEELLWFIRGETNGNLLLDKNVRIWE 309 (512) T ss_pred HHHHHHHHHHHHHCCCCCCCCCCCEEEECCCEEEEECCCCCCEEEECHHHHHHHHHHHHHHHCCCCCHHHHHHCCCEECC T ss_conf 99999999999828947897989748866978888601597515753046999999999980388547879866830378 Q ss_pred HHHHH------------HCCCCCCCCCHHHCCC--------CCCCCHHHHHHHHHHHHHCCCCHHHHHHHHCCHHHHHHC Q ss_conf 65421------------0013876620100036--------775326889999999998088523677751698883202 Q gi|254780956|r 82 EWADK------------DGELGPIYGVQWRSWP--------DYDGNVIDQISSIVQSLRADPYSRRHIVSAWNVALIDKM 141 (264) Q Consensus 82 ~~a~~------------~g~lg~~YG~qwR~~~--------~~~~~~~DQl~~vi~~Lk~np~sRR~i~~~wnp~~~~~~ 141 (264) +|+++ +|||||+||+|||+|+ +|+|+++|||++||++||+||+|||+|+++|||+++++| T Consensus 310 ~n~~~e~ld~~gl~~~~~~dlGpvYG~qWR~f~~~y~~~~~~y~g~g~DQl~~vI~~ik~nP~sRR~i~saWNp~~l~~m 389 (512) T PTZ00164 310 GNGSREFLDSRGLTHREEMDLGPVYGFQWRHFGAEYKDMHADYDGKGVDQLKNIIETIKNNPDDRRLILTAWNPSDLSQM 389 (512) T ss_pred CCCCHHHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCEEEEEECCHHHHCCC T ss_conf 87555367536874322466676322777516875667554789967329999999986496665179985485674467 Q ss_pred CCCCCCCCEEEEECCCEEEEEEEECCCCHHHHCCHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCEEEHHHHHHHHHHHC Q ss_conf 42113540246676880599997262440222021679999999999976224211113522730210036899999833 Q gi|254780956|r 142 ALPPCHCLFQFYVDNGKLSCQLYQRSGDVFLGIPFNIASYSLLTMMLASVIGFQYGEFIHTLGDVHLYNNHFEQADLQLS 221 (264) Q Consensus 142 ~lpPC~~~~qf~v~~g~L~~~~~qRS~D~~lG~pfNi~~y~lL~~~iA~~~gl~~G~~~~~~~d~HIY~nH~~~vk~ql~ 221 (264) ||||||+++||+|.+|+|+|++||||+|+|||+|||||+||+|++|||++|||+||+|+|++||||||+||+||+++||+ T Consensus 390 aLpPCH~~~Qf~V~~g~Ls~~~yQRS~D~~LGvPFNIasyalL~~miA~~~gl~pge~i~~~gD~HIY~nHi~~~~~Ql~ 469 (512) T PTZ00164 390 ALPPCHLLAQFYVSDGELSCMMYQRSCDMGLGVPFNIASYALLTIMIAKVCGLRPGEFVHTLGDAHVYNNHVDALKEQLE 469 (512) T ss_pred CCCCCCEEEEEEECCCEEEEEEEEECHHHCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEEEEEEECHHHHHHHHHHHC T ss_conf 67888758999981998999998421121248647899999999999998499566599980688851878999999867 Q ss_pred CCCCCCCEEEECCCCCCCCCCCCCCEEEECCCCCCCCCCCCCC Q ss_conf 6987687799648868711262033799768768887875569 Q gi|254780956|r 222 RSPRTLPQMIINPNIVDLLSFRYEDFTLRSYEPHAAILAKVSV 264 (264) Q Consensus 222 R~p~~~P~L~i~~~~~~i~d~~~eDf~L~~Y~~hp~Ik~~mAV 264 (264) |+|+++|+|+|++++++|+||++|||+|+||+|||+||||||| T Consensus 470 R~p~~~P~L~i~~~~~~i~d~~~~Df~l~~Y~~hp~Ik~~mAV 512 (512) T PTZ00164 470 RVPYPFPTLKFKREIENIEDFTISDIEVIDYVPHPPIKMEMAV 512 (512) T ss_pred CCCCCCCEEEECCCCCCHHCCCHHHEEEECCCCCCCCCCCCCC T ss_conf 8998899678689878551398207799588889877756119 No 2 >TIGR03284 thym_sym thymidylate synthase. Members of this protein family are thymidylate synthase, an enzyme that produces dTMP from dUMP. In prokaryotes, its gene usually is found close to that for dihydrofolate reductase, and in some systems the two enzymes are found as a fusion protein. This model excludes a set of related proteins (TIGR03283) that appears to replace this family in archaeal methanogens, where tetrahydrofolate is replaced by tetrahydromethanopterin. Probab=100.00 E-value=0 Score=797.17 Aligned_cols=263 Identities=62% Similarity=1.129 Sum_probs=259.4 Q ss_pred HHHHHHHHHHHHCCCEECCCCCCCEEEEEECEEEEECCCCCCEEEEEEHHHHHHHHHHHHHHCCCCCHHHHHHHCCCHHH Q ss_conf 57999999998639260379985347776120464423698104642022688999999985366646888640032135 Q gi|254780956|r 2 HQYLDLLRHVIKFGSDRRDRTGVGTRSTFGYQMRFDLSKGFPLLTTKKVHWKSVVHELLWFLRGDSNVSYLHRHGVSIWD 81 (264) Q Consensus 2 ~qYl~l~~~il~~G~~~~dRtG~~t~s~fg~~~~~dl~~gfPllTtkk~~~k~~i~EllW~l~G~tn~~~l~~~~~~iWd 81 (264) |||++|+++||++|+.++||||++|+|+||.+|||||++|||||||||||||++++||+|||+|+||++.|+++|||||| T Consensus 1 kQYl~l~~~Il~~G~~~~dRTG~gt~svFg~~~rfdl~~gFPlLTTKk~~~k~v~~ELlWfl~G~tn~~~L~~~~v~IWd 80 (296) T TIGR03284 1 KQYLDLLRDILENGHQKGDRTGTGTISVFGYQMRFDLSKGFPLLTTKKVPFRLIASELLWFLKGDTNIRYLLDHNVNIWD 80 (296) T ss_pred CHHHHHHHHHHHHCCCCCCCCCCCEEEECCCEEEEECCCCCCEEEECHHHHHHHHHHHHHHHHCCCCHHHHHHCCCCCCC T ss_conf 92899999999829887898999747850345898666798514522045888999999999679638999976994578 Q ss_pred HHHHH---------------------------------HCCCCCCCCCHHHCCCCCCCCHHHHHHHHHHHHHCCCCHHHH Q ss_conf 65421---------------------------------001387662010003677532688999999999808852367 Q gi|254780956|r 82 EWADK---------------------------------DGELGPIYGVQWRSWPDYDGNVIDQISSIVQSLRADPYSRRH 128 (264) Q Consensus 82 ~~a~~---------------------------------~g~lg~~YG~qwR~~~~~~~~~~DQl~~vi~~Lk~np~sRR~ 128 (264) +|+++ +|+|||+||+|||+|+...|+++|||++||++||+||+|||+ T Consensus 81 ~wa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~GdlGpiYG~qwR~~~~~~~~~~DQl~~vI~~lk~nP~sRR~ 160 (296) T TIGR03284 81 EWAFERWVKSDDYQGPDMTDFGLRAQDDEEEDDEFADKYGDLGPVYGKQWRSWATPDGETIDQIKNVIEMIKTNPDSRRL 160 (296) T ss_pred CCCCHHHHHCCCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCCHHE T ss_conf 74223232023335631011101001243333100135786576525664013378998051999999987649654121 Q ss_pred HHHHCCHHHHHHCCCCCCCCCEEEEECCCEEEEEEEECCCCHHHHCCHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCEE Q ss_conf 77516988832024211354024667688059999726244022202167999999999997622421111352273021 Q gi|254780956|r 129 IVSAWNVALIDKMALPPCHCLFQFYVDNGKLSCQLYQRSGDVFLGIPFNIASYSLLTMMLASVIGFQYGEFIHTLGDVHL 208 (264) Q Consensus 129 i~~~wnp~~~~~~~lpPC~~~~qf~v~~g~L~~~~~qRS~D~~lG~pfNi~~y~lL~~~iA~~~gl~~G~~~~~~~d~HI 208 (264) |+++|||+++++|||||||+.+||+|.+|+|||++||||+|+|||+|||||+||+|++|||++||++||+++|++||||| T Consensus 161 ivsaWNp~~l~~maLPPCH~~~Qf~V~~g~L~l~l~QRS~D~fLG~PFNIasYalL~~miA~~~gl~pg~l~~~~gDaHI 240 (296) T TIGR03284 161 IVSAWNPEDVPSMALPPCHTLFQFYVADGKLSCQLYQRSADVFLGVPFNIASYALLTHMIAQETGLEVGEFVHTLGDAHL 240 (296) T ss_pred EEECCCHHHHHCCCCCCCEEEEEEEEECCEEEEEEEEECHHHCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEECCEEEE T ss_conf 65226966862778888814799998199999999953223214750799999999999999848757779999136576 Q ss_pred EHHHHHHHHHHHCCCCCCCCEEEECCCCCCCCCCCCCCEEEECCCCCCCCCCCCCC Q ss_conf 00368999998336987687799648868711262033799768768887875569 Q gi|254780956|r 209 YNNHFEQADLQLSRSPRTLPQMIINPNIVDLLSFRYEDFTLRSYEPHAAILAKVSV 264 (264) Q Consensus 209 Y~nH~~~vk~ql~R~p~~~P~L~i~~~~~~i~d~~~eDf~L~~Y~~hp~Ik~~mAV 264 (264) |+||+|||+|||+|+|+++|+|+|++++++|+||++|||+|+||+|||+||||||| T Consensus 241 Y~nHie~v~eql~R~P~~~P~L~i~~~~~~i~d~~~~Df~L~~Y~~hp~Ik~~mAV 296 (296) T TIGR03284 241 YSNHLEQAKLQLTREPRPLPKLKLNPEKKDIFDFEYEDIELEGYDPHPAIKAPVAV 296 (296) T ss_pred HHHHHHHHHHHHCCCCCCCCEEEECCCCCCCCCCCCHHEEEECCCCCCCCCCCCCC T ss_conf 78689999999668999999889679988630497127799798889877766239 No 3 >PRK01827 thyA thymidylate synthase; Reviewed Probab=100.00 E-value=0 Score=796.43 Aligned_cols=264 Identities=66% Similarity=1.195 Sum_probs=263.1 Q ss_pred CHHHHHHHHHHHHCCCEECCCCCCCEEEEEECEEEEECCCCCCEEEEEEHHHHHHHHHHHHHHCCCCCHHHHHHHCCCHH Q ss_conf 95799999999863926037998534777612046442369810464202268899999998536664688864003213 Q gi|254780956|r 1 MHQYLDLLRHVIKFGSDRRDRTGVGTRSTFGYQMRFDLSKGFPLLTTKKVHWKSVVHELLWFLRGDSNVSYLHRHGVSIW 80 (264) Q Consensus 1 m~qYl~l~~~il~~G~~~~dRtG~~t~s~fg~~~~~dl~~gfPllTtkk~~~k~~i~EllW~l~G~tn~~~l~~~~~~iW 80 (264) ||||++++++||++|+.++||||++|+|+||++|||||++|||||||||+|||++++||+|||+|+||+++|+++||||| T Consensus 1 mkqYl~l~~~Il~~G~~~~~RtG~~t~s~fg~~~rfdl~~gfPllTtKk~~~k~~v~EllWfl~G~tn~~~L~~~~v~iW 80 (264) T PRK01827 1 MKQYLDLLQKILDEGTWKNDRTGTGTLSVFGAQMRFDLSKGFPLLTTKKVHLKSIIHELLWFLRGDTNIAYLQENGVHIW 80 (264) T ss_pred CHHHHHHHHHHHHHCCCCCCCCCCCEEEECCCEEEECCCCCCCEEEECHHHHHHHHHHHHHHHHCCCHHHHHHHCCCCCC T ss_conf 96899999999980986689999865785354589725569853664233588789999999848831557987598755 Q ss_pred HHHHHHHCCCCCCCCCHHHCCCCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHCCHHHHHHCCCCCCCCCEEEEECCCEEE Q ss_conf 56542100138766201000367753268899999999980885236777516988832024211354024667688059 Q gi|254780956|r 81 DEWADKDGELGPIYGVQWRSWPDYDGNVIDQISSIVQSLRADPYSRRHIVSAWNVALIDKMALPPCHCLFQFYVDNGKLS 160 (264) Q Consensus 81 d~~a~~~g~lg~~YG~qwR~~~~~~~~~~DQl~~vi~~Lk~np~sRR~i~~~wnp~~~~~~~lpPC~~~~qf~v~~g~L~ 160 (264) |+|++++||+||+||+|||+|+.++|+++|||++||++||+||+|||+|+++|||+++++|||||||+.+||+|++|+|+ T Consensus 81 d~~~~~~GdlG~~YG~qwR~~~~~~~~~~DQl~~vi~~lk~nP~sRR~iis~WNp~~l~~maLpPCh~~~QF~V~~g~L~ 160 (264) T PRK01827 81 DEWADENGDLGPVYGKQWRSWPTPDGGHIDQISKVIEQLKTNPDSRRLIVSAWNPGELDKMALPPCHALFQFYVADGKLS 160 (264) T ss_pred CCCCCCCCCCCCCCCHHHHCCCCCCCCCCHHHHHHHHHHHHCCCCCEEEEECCCHHHHCCCCCCCCEEEEEEEEECCEEE T ss_conf 66557588375532268760768899841299999999764984201564216977860677999826899998289899 Q ss_pred EEEEECCCCHHHHCCHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCEEEHHHHHHHHHHHCCCCCCCCEEEECCCCCCCC Q ss_conf 99972624402220216799999999999762242111135227302100368999998336987687799648868711 Q gi|254780956|r 161 CQLYQRSGDVFLGIPFNIASYSLLTMMLASVIGFQYGEFIHTLGDVHLYNNHFEQADLQLSRSPRTLPQMIINPNIVDLL 240 (264) Q Consensus 161 ~~~~qRS~D~~lG~pfNi~~y~lL~~~iA~~~gl~~G~~~~~~~d~HIY~nH~~~vk~ql~R~p~~~P~L~i~~~~~~i~ 240 (264) |++||||+|+|||+|||||+||+|++|||++||++||+++|++||||||+||+|++++||+|+|+++|+|+|++++++|+ T Consensus 161 ~~~yQRS~D~~LG~PFNiasyalL~~~iA~~~gl~pg~l~~~~gD~HIY~nHie~vkeql~R~P~~~P~L~i~~~~~~i~ 240 (264) T PRK01827 161 CQLYQRSADVFLGVPFNIASYALLTHMIAQQTGLKVGEFVHTIGDAHIYSNHLEQARLQLSREPRPLPKLVINPDIKSIF 240 (264) T ss_pred EEEECCCCCHHHCCCCCHHHHHHHHHHHHHHHCCCCCEEEEEEEEEEEHHHHHHHHHHHHCCCCCCCCEEEECCCCCCCC T ss_conf 99982254322057300999999999999985874778999920678657689999998567999999689689999712 Q ss_pred CCCCCCEEEECCCCCCCCCCCCCC Q ss_conf 262033799768768887875569 Q gi|254780956|r 241 SFRYEDFTLRSYEPHAAILAKVSV 264 (264) Q Consensus 241 d~~~eDf~L~~Y~~hp~Ik~~mAV 264 (264) ||++|||+|+||+|||+||||||| T Consensus 241 d~~~~Df~L~~Y~~hp~Ik~~mAV 264 (264) T PRK01827 241 DFEFEDFELEGYDPHPAIKAPVAV 264 (264) T ss_pred CCCHHHEEEECCCCCCCCCCCCCC T ss_conf 288005799798779888706319 No 4 >PRK13821 thyA thymidylate synthase; Provisional Probab=100.00 E-value=0 Score=790.71 Aligned_cols=264 Identities=40% Similarity=0.753 Sum_probs=252.8 Q ss_pred CHHHHHHHHHHHHCCCEECCCCCCCEEEEEECEEEEECCCCCCEEEEEEHHHHHHHHHHHHHHCCCCCHHHHHHHCCCHH Q ss_conf 95799999999863926037998534777612046442369810464202268899999998536664688864003213 Q gi|254780956|r 1 MHQYLDLLRHVIKFGSDRRDRTGVGTRSTFGYQMRFDLSKGFPLLTTKKVHWKSVVHELLWFLRGDSNVSYLHRHGVSIW 80 (264) Q Consensus 1 m~qYl~l~~~il~~G~~~~dRtG~~t~s~fg~~~~~dl~~gfPllTtkk~~~k~~i~EllW~l~G~tn~~~l~~~~~~iW 80 (264) |||||+|+++||++|+.++||||++|+|+||++|||||++|||||||||||||++++||||||+|+||++.|++.||||| T Consensus 1 MkqYl~l~~~IL~~G~~~~dRTG~gT~S~FG~~~rfdL~~gFPlLTTKkv~~k~vv~ELLWFl~G~tn~~~L~~~gv~IW 80 (323) T PRK13821 1 MKQYLDLVRTILDTGTWQENRTGIRTISMPGAMLRFDLQQGFPAVTTKKLAFKSAIGELVGFLRASRSAADFRALGCKVW 80 (323) T ss_pred CHHHHHHHHHHHHHCCCCCCCCCCCEEEECCCEEEEECCCCCCEEEECHHHHHHHHHHHHHHHHCCCHHHHHHHCCCEEC T ss_conf 96899999999980988789899974774566179756679843562135699999999999967643667886496531 Q ss_pred HHHHHHH------------CCCCCCCCCHHHCCCCC---------------------------CC-------CHHHHHHH Q ss_conf 5654210------------01387662010003677---------------------------53-------26889999 Q gi|254780956|r 81 DEWADKD------------GELGPIYGVQWRSWPDY---------------------------DG-------NVIDQISS 114 (264) Q Consensus 81 d~~a~~~------------g~lg~~YG~qwR~~~~~---------------------------~~-------~~~DQl~~ 114 (264) |+||+++ |||||+||+|||+|+.+ +| +++|||++ T Consensus 81 d~na~~~~~~~~~~~~~~~~dlGpiYG~QWR~~~~~~~~~~~~~~~~~d~~~~g~~~~~~~~~~G~~~~~~~~~iDQl~~ 160 (323) T PRK13821 81 DQNANENAQWLANPYRQGVDDLGDVYGVQWRQWPGYKVLDASADAQIADATSRGFRQVARFDEDGAPKVLLYKAIDQLRQ 160 (323) T ss_pred CCCCCCCHHHHCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHH T ss_conf 45555001432364335678674623575503877445553333221121234533333446666533345565249999 Q ss_pred HHHHHHCCCCHHHHHHHHCCHHHHHHCCCCCCCCCEEEE--ECCCEEEEEEEECCCCHHHHCCHHHHHHHHHHHHHHHHH Q ss_conf 999998088523677751698883202421135402466--768805999972624402220216799999999999762 Q gi|254780956|r 115 IVQSLRADPYSRRHIVSAWNVALIDKMALPPCHCLFQFY--VDNGKLSCQLYQRSGDVFLGIPFNIASYSLLTMMLASVI 192 (264) Q Consensus 115 vi~~Lk~np~sRR~i~~~wnp~~~~~~~lpPC~~~~qf~--v~~g~L~~~~~qRS~D~~lG~pfNi~~y~lL~~~iA~~~ 192 (264) ||++||+||+|||+|+++|||+++++|||||||+++||+ |.+|+|+|++||||+|+|||+||||||||+|++|||++| T Consensus 161 vI~~iK~nP~sRR~ivsaWNp~~l~~maLPPCH~~~QF~v~v~~~~L~l~lyQRS~D~~LGvPFNIasYalL~~miA~~~ 240 (323) T PRK13821 161 CLDTIMNDPGSRRILFHGWNPAVLDEIALPACHLLYQFLPNVETREISLCLYIRSNDVGLGTPFNLTEGAALLSLVGRLT 240 (323) T ss_pred HHHHHHCCCCCCEEEEECCCHHHHCCCCCCCCCEEEEEEEECCCCEEEEEEEECHHHHCCCCCHHHHHHHHHHHHHHHHH T ss_conf 99997429754015875268567526778888148998896589958899995403424485379999999999999985 Q ss_pred CCCCEEEEEECCCCEEEHHHHHHHHHHHCCCCCCCCEEEECCCCCC-----------CCCCCCCCEEEECCCCCCCCCCC Q ss_conf 2421111352273021003689999983369876877996488687-----------11262033799768768887875 Q gi|254780956|r 193 GFQYGEFIHTLGDVHLYNNHFEQADLQLSRSPRTLPQMIINPNIVD-----------LLSFRYEDFTLRSYEPHAAILAK 261 (264) Q Consensus 193 gl~~G~~~~~~~d~HIY~nH~~~vk~ql~R~p~~~P~L~i~~~~~~-----------i~d~~~eDf~L~~Y~~hp~Ik~~ 261 (264) |++||+|+|++||||||+||+||++|||+|+|+++|+|+|++++.+ |++|+++||+|+||+|||+|||| T Consensus 241 gl~pG~~~~~igDaHIY~nHi~~v~eQl~R~P~~~P~L~i~~~i~~~~~~~~~~~~~i~~~~~dDf~l~~Y~~Hp~Ik~~ 320 (323) T PRK13821 241 GYTPRWFTYFIGDAHIYENHLDMLNEQLTREPYESPRLVISDRVPEYAKTGVYEPEWLEKIEPSDFSLVGYRHHEPLTAP 320 (323) T ss_pred CCCCCEEEEECCEEEEHHHHHHHHHHHHCCCCCCCCEEEECCCCCCHHHCCCCCCHHHCCCCCCCEEEECCCCCCCCCCC T ss_conf 88277799987277874768999999966899889978967888731220212210230487466699698899887756 Q ss_pred CCC Q ss_conf 569 Q gi|254780956|r 262 VSV 264 (264) Q Consensus 262 mAV 264 (264) ||| T Consensus 321 mAV 323 (323) T PRK13821 321 MAV 323 (323) T ss_pred CCC T ss_conf 239 No 5 >KOG0673 consensus Probab=100.00 E-value=0 Score=727.86 Aligned_cols=263 Identities=53% Similarity=1.005 Sum_probs=258.0 Q ss_pred HHHHHHHHHHHHCCCEECCCCCCCEEEEEECEEEEECCCCCCEEEEEEHHHHHHHHHHHHHHCCCCCHHHHHHHCCCHHH Q ss_conf 57999999998639260379985347776120464423698104642022688999999985366646888640032135 Q gi|254780956|r 2 HQYLDLLRHVIKFGSDRRDRTGVGTRSTFGYQMRFDLSKGFPLLTTKKVHWKSVVHELLWFLRGDSNVSYLHRHGVSIWD 81 (264) Q Consensus 2 ~qYl~l~~~il~~G~~~~dRtG~~t~s~fg~~~~~dl~~gfPllTtkk~~~k~~i~EllW~l~G~tn~~~l~~~~~~iWd 81 (264) .|||+|+++|++||+.+.||||++|+|+||+||||+|++.|||||||++||+++++|||||++|+||.+.|+++|+|||| T Consensus 11 ~~YL~lv~~Ii~~G~~r~DrtgtgTlSvFg~qmrf~Lr~tfPLLTTkrVfwrgV~EElLwfi~G~t~ak~l~~kgv~IWd 90 (293) T KOG0673 11 FQYLNLVKHIISNGARRMDRTGTGTLSVFGCQMRFSLRDTFPLLTTKRVFWRGVVEELLWFISGSTDAKVLEEKGVHIWD 90 (293) T ss_pred HHHHHHHHHHHHCCCCCCCCCCCCEEEEEEEEEEEEHHCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHCCCEEEC T ss_conf 89999999999647746776787203322011442011156520123577888999999997287763136545846514 Q ss_pred HHHH------------HHCCCCCCCCCHHHCCC--------CCCCCHHHHHHHHHHHHHCCCCHHHHHHHHCCHHHHHHC Q ss_conf 6542------------10013876620100036--------775326889999999998088523677751698883202 Q gi|254780956|r 82 EWAD------------KDGELGPIYGVQWRSWP--------DYDGNVIDQISSIVQSLRADPYSRRHIVSAWNVALIDKM 141 (264) Q Consensus 82 ~~a~------------~~g~lg~~YG~qwR~~~--------~~~~~~~DQl~~vi~~Lk~np~sRR~i~~~wnp~~~~~~ 141 (264) ++.+ ++||+||+||+|||||. +|+|+++|||..||++||+||++||+|+++|||.|+.+| T Consensus 91 ~ngsrefLds~G~~~re~GDlgpvyGfqWrHfgA~Y~~~~~dy~gqgvdQL~~vI~~ik~nP~drRIimsAwNP~dl~~m 170 (293) T KOG0673 91 GNGSREFLDSVGFTAREEGDLGPVYGFQWRHFGARYEDCDSDYTGQGVDQLADVINKIKNNPDDRRIIMSAWNPLDLGKM 170 (293) T ss_pred CCCHHHHHHHCCCCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCEEEEECCCCCCCCCC T ss_conf 87618889853987000377666323032116852023444434436799999999985597542255532583435423 Q ss_pred CCCCCCCCEEEEECCCEEEEEEEECCCCHHHHCCHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCEEEHHHHHHHHHHHC Q ss_conf 42113540246676880599997262440222021679999999999976224211113522730210036899999833 Q gi|254780956|r 142 ALPPCHCLFQFYVDNGKLSCQLYQRSGDVFLGIPFNIASYSLLTMMLASVIGFQYGEFIHTLGDVHLYNNHFEQADLQLS 221 (264) Q Consensus 142 ~lpPC~~~~qf~v~~g~L~~~~~qRS~D~~lG~pfNi~~y~lL~~~iA~~~gl~~G~~~~~~~d~HIY~nH~~~vk~ql~ 221 (264) ||||||.++||||.+|+|||++||||+||.||+|||||+|+||++|||++|||+||+|+|++||||||.||++++++||+ T Consensus 171 alpPCH~~~QFyV~nGelScq~YQrS~dmglGVPFnIASYsLLT~miAhv~gl~pgdfiH~lGdahvy~~Hv~al~~Ql~ 250 (293) T KOG0673 171 ALPPCHTFCQFYVANGELSCQMYQRSGDMGLGVPFNIASYSLLTCMIAHVCGLKPGDFIHVLGDAHVYKDHVDALQEQLQ 250 (293) T ss_pred CCCCCCEEEEEEECCCEEEEHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEECCHHHHHHHHHHHHHHHHH T ss_conf 68952145788860786632433303565668643247889999999998188877547862203416878899999971 Q ss_pred CCCCCCCEEEECCCCCCCCCCCCCCEEEECCCCCCCCCCCCCC Q ss_conf 6987687799648868711262033799768768887875569 Q gi|254780956|r 222 RSPRTLPQMIINPNIVDLLSFRYEDFTLRSYEPHAAILAKVSV 264 (264) Q Consensus 222 R~p~~~P~L~i~~~~~~i~d~~~eDf~L~~Y~~hp~Ik~~mAV 264 (264) |+|+|||+|+|+|++++|++|++|||+|++|+|||+|||+||| T Consensus 251 r~PrpFPkL~i~~~~~~id~F~~~dfel~~y~php~I~M~Mav 293 (293) T KOG0673 251 RPPRPFPKLKINPEVKDIDDFQAEDFELVGYDPHPKIKMKMAV 293 (293) T ss_pred CCCCCCCCEEECCCCCCCCCCCCCCEEEECCCCCCCCCCCCCC T ss_conf 4998897367356645312342010256136899875512469 No 6 >pfam00303 Thymidylat_synt Thymidylate synthase. Probab=100.00 E-value=0 Score=705.09 Aligned_cols=261 Identities=59% Similarity=1.096 Sum_probs=257.3 Q ss_pred HHHHHHHHHHHHCCCEECCCCCCCEEEEEECEEEEECCCCCCEEEEEEHHHHHHHHHHHHHHCCCCCHHHHHHHCCCHHH Q ss_conf 57999999998639260379985347776120464423698104642022688999999985366646888640032135 Q gi|254780956|r 2 HQYLDLLRHVIKFGSDRRDRTGVGTRSTFGYQMRFDLSKGFPLLTTKKVHWKSVVHELLWFLRGDSNVSYLHRHGVSIWD 81 (264) Q Consensus 2 ~qYl~l~~~il~~G~~~~dRtG~~t~s~fg~~~~~dl~~gfPllTtkk~~~k~~i~EllW~l~G~tn~~~l~~~~~~iWd 81 (264) +||++++++||++|+.++||||++|+|+||++++||+++|||++||||++||++++|++|||+|+||+++|+++|++||| T Consensus 2 ~~Yl~ll~~Il~~G~~~~drtG~~t~sifg~~l~~dl~~~fPllTtkk~~~k~~~~EllWfL~G~~~~~~l~~~~~~iWd 81 (262) T pfam00303 2 EQYLDLLRDILENGEFKEDRTGTGTKSLFGVQLRFDLRKGFPLLTTKKVFWKSAIEELLWFLSGSTNVKFLSEYGVHIWD 81 (262) T ss_pred HHHHHHHHHHHHCCCCCCCCCCCCEEEECCCEEEECCCCCCCEEEECHHHHHHHHHHHHHHHHCCCHHHHHHHHCCCCCC T ss_conf 68999999999729775699998679870453797178898466632366999999999998488478888870898120 Q ss_pred HHHHHHCCCCCCCCCHHHCCCCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHCCHHHHHHCCCCCCCCCEEEEECCCEEEE Q ss_conf 65421001387662010003677532688999999999808852367775169888320242113540246676880599 Q gi|254780956|r 82 EWADKDGELGPIYGVQWRSWPDYDGNVIDQISSIVQSLRADPYSRRHIVSAWNVALIDKMALPPCHCLFQFYVDNGKLSC 161 (264) Q Consensus 82 ~~a~~~g~lg~~YG~qwR~~~~~~~~~~DQl~~vi~~Lk~np~sRR~i~~~wnp~~~~~~~lpPC~~~~qf~v~~g~L~~ 161 (264) +|++++|++||+||+|||+|... +++|||++||++||+||+|||+|+++|||.++++|+|||||+++||+|++|+|+| T Consensus 82 ~~~~~~g~lG~~YG~q~R~~~~~--~~~DQl~~vi~~Lk~~P~sRR~ii~~wnp~~l~~~~lpPCh~~~qF~v~~g~L~~ 159 (262) T pfam00303 82 EWASEDGDLGPVYGFQWRHWGAE--QGVDQLARVIEKLKNNPDSRRIIISAWNPADLPKMALPPCHTLFQFYVRDGKLSC 159 (262) T ss_pred HHCCCCCCCCCCCCHHHHCCCCC--CCCCHHHHHHHHHHHCCCCCEEEEECCCHHHHHCCCCCCCCEEEEEEEECCEEEE T ss_conf 00465886745514676407777--7877999999998769722003531259778726889998346999997898999 Q ss_pred EEEECCCCHHHHCCHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCEEEHHHHHHHHHHHCCCCCCCCEEEECCCCCCCCC Q ss_conf 99726244022202167999999999997622421111352273021003689999983369876877996488687112 Q gi|254780956|r 162 QLYQRSGDVFLGIPFNIASYSLLTMMLASVIGFQYGEFIHTLGDVHLYNNHFEQADLQLSRSPRTLPQMIINPNIVDLLS 241 (264) Q Consensus 162 ~~~qRS~D~~lG~pfNi~~y~lL~~~iA~~~gl~~G~~~~~~~d~HIY~nH~~~vk~ql~R~p~~~P~L~i~~~~~~i~d 241 (264) ++||||+|+|||+||||++||+|++|||++||++||+|+|++||+|||+||+|+|++||+|+|+++|+|+|++++++|++ T Consensus 160 ~~~qRS~D~~lG~pfNi~~yalL~~~iA~~~gl~~G~~~~~~gd~HIY~~h~~~v~~ql~r~p~~~P~l~i~~~~~~i~~ 239 (262) T pfam00303 160 QLYQRSNDMFLGVPFNIASYALLTEMIAQVTGLEPGEFIHTIGDAHIYENHVDQVKEQLSREPRPFPKLKINPEVKSIDD 239 (262) T ss_pred EEEECCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEEEEEEEHHHHHHHHHHHHCCCCCCCCEEEECCCCCCCCC T ss_conf 99920332101572389999999999999857867889999335564786999999995679999997896799886246 Q ss_pred CCCCCEEEECCCCCCCCCCCCCC Q ss_conf 62033799768768887875569 Q gi|254780956|r 242 FRYEDFTLRSYEPHAAILAKVSV 264 (264) Q Consensus 242 ~~~eDf~L~~Y~~hp~Ik~~mAV 264 (264) ++++||+|+||+|||+||+|||| T Consensus 240 ~~~~Df~l~~Y~~~~~ik~~~av 262 (262) T pfam00303 240 FTIEDFELEGYDPHPKIKMPVAV 262 (262) T ss_pred CCHHHEEEECCCCCCCCCCCCCC T ss_conf 98537799798779887716229 No 7 >COG0207 ThyA Thymidylate synthase [Nucleotide transport and metabolism] Probab=100.00 E-value=0 Score=700.99 Aligned_cols=264 Identities=64% Similarity=1.180 Sum_probs=260.9 Q ss_pred CHHHHHHHHHHHHCCCEECCCCCCCEEEEEECEEEEECCC-CCCEEEEEEHHHHHHHHHHHHHHCCCCCHHHHHHHCCCH Q ss_conf 9579999999986392603799853477761204644236-981046420226889999999853666468886400321 Q gi|254780956|r 1 MHQYLDLLRHVIKFGSDRRDRTGVGTRSTFGYQMRFDLSK-GFPLLTTKKVHWKSVVHELLWFLRGDSNVSYLHRHGVSI 79 (264) Q Consensus 1 m~qYl~l~~~il~~G~~~~dRtG~~t~s~fg~~~~~dl~~-gfPllTtkk~~~k~~i~EllW~l~G~tn~~~l~~~~~~i 79 (264) |+||++++++|+++|..+.||||++|+|+||.+||||+++ +||++|||++++|++++|++||++|+||++.|.++|++| T Consensus 3 ~~~yl~l~~~ile~G~~~~drt~~~t~sv~g~q~~~Dl~~~~fpllttk~v~~ks~i~Ellwf~~g~tn~~~l~~~gv~i 82 (268) T COG0207 3 MKQYLDLIEHILENGTSRSDRTGTGTLSVFGYQMRFDLSKSEFPLLTTKKVHLKSIIHELLWFLRGDTNIAYLKENGVKI 82 (268) T ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCEEEEECCEEEEECCCCCCCEEEEEEEEHHHHHHHHHHHHCCCCHHHHHHHCCCCC T ss_conf 46899999999874861335536653888554788840158875167544631331676654213774177887559632 Q ss_pred HHHHHHHH-CCCCCCCCCHHHCCCCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHCCHHHHHHCCCCCCCCCEEEEECCCE Q ss_conf 35654210-01387662010003677532688999999999808852367775169888320242113540246676880 Q gi|254780956|r 80 WDEWADKD-GELGPIYGVQWRSWPDYDGNVIDQISSIVQSLRADPYSRRHIVSAWNVALIDKMALPPCHCLFQFYVDNGK 158 (264) Q Consensus 80 Wd~~a~~~-g~lg~~YG~qwR~~~~~~~~~~DQl~~vi~~Lk~np~sRR~i~~~wnp~~~~~~~lpPC~~~~qf~v~~g~ 158 (264) ||+|++++ |++|++||+|||+|...+|.++|||.+||++||+||+|||+|+++|||.++++|+|||||..+||+|++|+ T Consensus 83 Wdew~~e~~Gdlg~vYg~q~r~~~~~~g~~iDQi~~vIe~Lk~np~sRR~ivs~WnP~e~~~m~l~PC~~l~qF~V~~gk 162 (268) T COG0207 83 WDEWADENYGDLGPVYGYQWRKWLRPDGEKIDQIDKVIEQLKENPDSRRAIVSAWNPGELDEMALPPCHILFQFYVRDGK 162 (268) T ss_pred CCCCCCCCCCCCCCEEEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCCCCCEEEEEEEEECCE T ss_conf 32003345675465025751214476887427999999998419865323899518320125467883799999988999 Q ss_pred EEEEEEECCCCHHHHCCHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCEEEHHHHHHHHHHHCCCCCCCCEEEECCCCCC Q ss_conf 59999726244022202167999999999997622421111352273021003689999983369876877996488687 Q gi|254780956|r 159 LSCQLYQRSGDVFLGIPFNIASYSLLTMMLASVIGFQYGEFIHTLGDVHLYNNHFEQADLQLSRSPRTLPQMIINPNIVD 238 (264) Q Consensus 159 L~~~~~qRS~D~~lG~pfNi~~y~lL~~~iA~~~gl~~G~~~~~~~d~HIY~nH~~~vk~ql~R~p~~~P~L~i~~~~~~ 238 (264) |+|++|||||||+||+||||+||++|++|||++||++||+|+|+++|||||+||+|++++||+|+|+++|+|+||++.++ T Consensus 163 L~~~lyqRS~Di~lg~pfNi~syalL~~mvA~~~Gle~G~f~h~~~daHIY~nh~e~~~~ql~R~p~~~pkl~in~~~~~ 242 (268) T COG0207 163 LSCTLYQRSNDVFLGVPFNIASYALLTHMVAQVTGLEPGEFVHTIGDAHIYDNHFDQAKEQLKREPRPLPKLVINPKDKD 242 (268) T ss_pred EEEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEEEEEEEEEHHHHHHHHHHHCCCCCCCCEEEECCCCCC T ss_conf 99999871046345642779999999999999859862489998323688764689999984459987975796688665 Q ss_pred CCCCCCCCEEEECCCCCCCCCCCCCC Q ss_conf 11262033799768768887875569 Q gi|254780956|r 239 LLSFRYEDFTLRSYEPHAAILAKVSV 264 (264) Q Consensus 239 i~d~~~eDf~L~~Y~~hp~Ik~~mAV 264 (264) ++||++|||+|+||+|||+||+|||| T Consensus 243 ~f~~~~~Df~l~~Y~~~~~ik~~~av 268 (268) T COG0207 243 LFDFTFEDFELVGYDPHPAIKAPVAV 268 (268) T ss_pred CCCCCCCCEEEECCCCCCCCCCCCCC T ss_conf 45310021589777788643403449 No 8 >cd00351 TS_Pyrimidine_HMase Thymidylate synthase and pyrimidine hydroxymethylase: Thymidylate synthase (TS) and deoxycytidylate hydroxymethylase (dCMP-HMase) are homologs that catalyze analogous alkylation of C5 of pyrimidine nucleotides. Both enzymes are involved in the biosynthesis of DNA precursors and are active as homodimers. However, they exhibit distinct pyrimidine base specificities and differ in the details of their catalyzed reactions. TS is biologically ubiquitous and catalyzes the conversion of dUMP and methylene-tetrahydrofolate (CH2THF) to dTMP and dihydrofolate (DHF). It also acts as a regulator of its own expression by binding and inactivating its own RNA. Due to its key role in the de novo pathway for thymidylate synthesis and, hence, DNA synthesis, it is one of the most conserved enzymes across species and phyla. TS is a well-recognized target for anticancer chemotherapy, as well as a valuable new target against infectious diseases. Interestingly, in several protozoa, Probab=100.00 E-value=0 Score=556.70 Aligned_cols=214 Identities=64% Similarity=1.121 Sum_probs=208.9 Q ss_pred HHHHHHHHHHHCCCEE-CCCCCCCEEEEEECEEEEECCCCCCEEEEEEHHHHHHHHHHHHHHCCCCCHHHHHHHCCCHHH Q ss_conf 7999999998639260-379985347776120464423698104642022688999999985366646888640032135 Q gi|254780956|r 3 QYLDLLRHVIKFGSDR-RDRTGVGTRSTFGYQMRFDLSKGFPLLTTKKVHWKSVVHELLWFLRGDSNVSYLHRHGVSIWD 81 (264) Q Consensus 3 qYl~l~~~il~~G~~~-~dRtG~~t~s~fg~~~~~dl~~gfPllTtkk~~~k~~i~EllW~l~G~tn~~~l~~~~~~iWd 81 (264) ||++++++||++|..+ .+|||++|+|+||++++|||++|||+|||||++||++++|++||++|+|+++.|.+++++||| T Consensus 1 qy~~l~~~il~~g~~~~~~Rtg~~t~s~fg~~~~~dl~~~fPllTtkk~~~k~~i~Ellw~l~G~t~~~~l~~~~~~iWd 80 (215) T cd00351 1 QYLDLWRKILEEGYRKTDDRTGTGTRSLFGAQLRFDLSEGFPLLTTKKVPWKSAIEELLWFLRGDTNAERLKEYGVSIWD 80 (215) T ss_pred CHHHHHHHHHHCCCEECCCCCCCCEEEEECCEEEEECCCCCCCCEECCCCHHHHHHHHHHHHHCCCHHHHHHHCCCCHHH T ss_conf 96899999998098867789998779960244898254588742223433798999999998489366778762867324 Q ss_pred HHHHHHCCCCCCCCCHHHCCCCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHCCHHHHHHCCCCCCCCCEEEEECCCEEEE Q ss_conf 65421001387662010003677532688999999999808852367775169888320242113540246676880599 Q gi|254780956|r 82 EWADKDGELGPIYGVQWRSWPDYDGNVIDQISSIVQSLRADPYSRRHIVSAWNVALIDKMALPPCHCLFQFYVDNGKLSC 161 (264) Q Consensus 82 ~~a~~~g~lg~~YG~qwR~~~~~~~~~~DQl~~vi~~Lk~np~sRR~i~~~wnp~~~~~~~lpPC~~~~qf~v~~g~L~~ 161 (264) +|++++|++|++||+|||+|... ++++||+++||++||+||+|||+|+++|||.++++|+|||||..+||+|++|+|+| T Consensus 81 ~~a~~~g~~g~~YG~~~r~~~~~-~~~~DQl~~vi~~Lk~~p~sRR~i~~~wnp~~~~~~~l~PC~~~~qf~v~~~~L~~ 159 (215) T cd00351 81 EWASKEGDLGYTYGFQWRHWGAP-GQGVDQIEKVIEKLKNNPDSRRAIISAWNPADLDLMALPPCHTLIQFYVRNGKLSL 159 (215) T ss_pred HHCCCCCCCCCCHHHHHHCCCCC-CCCHHHHHHHHHHHHHCCHHHEEEEEECCHHHHCCCCCCCCCEEEEEEECCCEEEE T ss_conf 54254897754476775117788-87616999999998869441205520048656457988898518999942998999 Q ss_pred EEEECCCCHHHHCCHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCEEEHHHHHHHH Q ss_conf 99726244022202167999999999997622421111352273021003689999 Q gi|254780956|r 162 QLYQRSGDVFLGIPFNIASYSLLTMMLASVIGFQYGEFIHTLGDVHLYNNHFEQAD 217 (264) Q Consensus 162 ~~~qRS~D~~lG~pfNi~~y~lL~~~iA~~~gl~~G~~~~~~~d~HIY~nH~~~vk 217 (264) +++|||+|+++|+|||+++|++|++|||++||++||+|+|+++|+|||+||+|+|+ T Consensus 160 ~~~qRS~D~~lG~p~Ni~~~~~L~~~iA~~~g~~~G~~~~~~~d~HIYe~h~~~vk 215 (215) T cd00351 160 TLYQRSNDAFLGVPFNIASYALLTEMIARVTGLEPGEFIHTIGDAHIYENHLEQVK 215 (215) T ss_pred EEEEECCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEEEEEEECHHHHHHHC T ss_conf 99960001113462379999999999999849935379999877898466676539 No 9 >PRK00956 thyA thymidylate synthase; Provisional Probab=100.00 E-value=0 Score=388.90 Aligned_cols=196 Identities=22% Similarity=0.408 Sum_probs=171.7 Q ss_pred HHHHHHHHHHHCCCEECCCCCCCEEEEEECEEEEECCCCCCEEEEEEHHHHHHHHHHHHHHCCCCCHHHHHHHCCCHHHH Q ss_conf 79999999986392603799853477761204644236981046420226889999999853666468886400321356 Q gi|254780956|r 3 QYLDLLRHVIKFGSDRRDRTGVGTRSTFGYQMRFDLSKGFPLLTTKKVHWKSVVHELLWFLRGDSNVSYLHRHGVSIWDE 82 (264) Q Consensus 3 qYl~l~~~il~~G~~~~dRtG~~t~s~fg~~~~~dl~~gfPllTtkk~~~k~~i~EllW~l~G~tn~~~l~~~~~~iWd~ 82 (264) -|++++++|+++|+.+.||+|.+|+++||.. +|+++-..|++..|..| +...+ .++...++ T Consensus 12 Ay~~~l~~Il~~G~~~~~~~G~~tkel~~~~-----------~~i~~p~~~~i~~~~~~---~~~~l---~~y~~~~~-- 72 (208) T PRK00956 12 AWERLVPKIMNHGEEITDERGSKTKEIMNLI-----------IEITNPSDKSIPEKYPW---GEERL---EEYTKQLL-- 72 (208) T ss_pred HHHHHHHHHHHHCCCCCCCCCCEEEEECCEE-----------EEECCCCCCCCCCCCCC---CHHHH---HHHHHHCC-- T ss_conf 9999999999839643489886878814638-----------99878642564654676---77899---99998655-- Q ss_pred HHHHHCCCCCCCCCHHHCCCCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHCCHHHHHHCCCCCCCCCEEEEECCCEEEEE Q ss_conf 54210013876620100036775326889999999998088523677751698883202421135402466768805999 Q gi|254780956|r 83 WADKDGELGPIYGVQWRSWPDYDGNVIDQISSIVQSLRADPYSRRHIVSAWNVALIDKMALPPCHCLFQFYVDNGKLSCQ 162 (264) Q Consensus 83 ~a~~~g~lg~~YG~qwR~~~~~~~~~~DQl~~vi~~Lk~np~sRR~i~~~wnp~~~~~~~lpPC~~~~qf~v~~g~L~~~ 162 (264) +.++++.++.||.|||+|++ .+|||++||++||+||+|||+|+++|||.++.++++||||..+||+|++|+|+|+ T Consensus 73 -~~~~~~~~y~YG~r~r~~~g----~~DQi~~vi~~Lk~~p~sRR~i~~~wnp~~~~~~~~~PC~~~~qf~v~~~kL~~~ 147 (208) T PRK00956 73 -SGENQGFVYTYGERLREYPG----DVDQIDYIVEKLKEQPNSRRATAVTWDPPIDTKVDEVPCLQLVQFLIRDNKLYLT 147 (208) T ss_pred -CCCCCCCCCCHHHHHHHCCC----CCHHHHHHHHHHHCCCCCCEEEEEEECCCHHCCCCCCCCCCEEEEEEECCEEEEE T ss_conf -77679727746679971899----8419999999970599962699985881222157898996189999949928999 Q ss_pred EEECCCCHHHHCCHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCEEEHHHHHHHHHHHCC Q ss_conf 972624402220216799999999999762242111135227302100368999998336 Q gi|254780956|r 163 LYQRSGDVFLGIPFNIASYSLLTMMLASVIGFQYGEFIHTLGDVHLYNNHFEQADLQLSR 222 (264) Q Consensus 163 ~~qRS~D~~lG~pfNi~~y~lL~~~iA~~~gl~~G~~~~~~~d~HIY~nH~~~vk~ql~R 222 (264) ++|||+|+|+|+|||+++|++|+++||+.+|++||+|+|+++|+|||++|+|++++||+- T Consensus 148 ~~~RS~D~~lg~p~Ni~~~~~L~~~iA~~~gl~~G~~~~~~~~~HIY~~h~~~v~~~lk~ 207 (208) T PRK00956 148 VLFRSNDAGGAFPANAIGLIKLGEYVAEKVGVEPGTYTHHSVSAHIYEHDWDKLEKIIKG 207 (208) T ss_pred EEEECCCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEEEEEEEECHHHHHHHHHHHCC T ss_conf 985624331455516999999999999883994059999975379848789999998746 No 10 >TIGR03283 thy_syn_methano thymidylate synthase, methanogen type. Thymidylate synthase makes dTMP for DNA synthesis, and is among the most widely distributed of all enzymes. Members of this protein family are encoded within a completed genome sequence if and only if that species is one of the methanogenenic archaea. In these species, tetrahydromethanopterin replaces tetrahydrofolate, The member from Methanobacterium thermoautotrophicum was shown to behave as a thymidylate synthase based on similar side reactions (the exchange of a characteristic proton with water), although the full reaction was not reconstituted. Partial sequence data showed no similarity to known thymidylate synthases simply because the region sequenced was from a distinctive N-terminal region not found in other thymidylate synthases. Members of this protein family appear, therefore, to a novel, tetrahydromethanopterin-dependent thymidylate synthase. Probab=100.00 E-value=0 Score=389.31 Aligned_cols=190 Identities=22% Similarity=0.282 Sum_probs=171.6 Q ss_pred HHHHHHHHHHHHCCCEECCCCCCCEEEEEECEEEEECCCCCCEEEEEEHHHHHHHHHHHHHHCCCCCHHHHHHHCCCHHH Q ss_conf 57999999998639260379985347776120464423698104642022688999999985366646888640032135 Q gi|254780956|r 2 HQYLDLLRHVIKFGSDRRDRTGVGTRSTFGYQMRFDLSKGFPLLTTKKVHWKSVVHELLWFLRGDSNVSYLHRHGVSIWD 81 (264) Q Consensus 2 ~qYl~l~~~il~~G~~~~dRtG~~t~s~fg~~~~~dl~~gfPllTtkk~~~k~~i~EllW~l~G~tn~~~l~~~~~~iWd 81 (264) .-|++++++|+++|+.++||+|++|+++|| |++|+++...+.+..|+.|+- +.+.++...+|+ T Consensus 10 ~A~~~~l~~Il~~G~~~~d~~G~~tke~~~-----------~~i~i~~p~~~~i~~~~~~~~------~~~~~y~~~~l~ 72 (199) T TIGR03283 10 QAWRELVTKIMVEGEVVEDERGSLTKEVRN-----------LIIHIENPGPEKIPEGYPLDE------EKLEEYEKQLLD 72 (199) T ss_pred HHHHHHHHHHHHCCCEEECCCCCEEEEECC-----------EEEEECCCCCCCCCCCCCCCH------HHHHHHHHHHCC T ss_conf 999999999996095530588978778010-----------189984787775455677557------799999997067 Q ss_pred HHHHHHCCCCCCCCCHHHCCCCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHCCHHHHHHCCCCCCCCCEEEEECCCEEEE Q ss_conf 65421001387662010003677532688999999999808852367775169888320242113540246676880599 Q gi|254780956|r 82 EWADKDGELGPIYGVQWRSWPDYDGNVIDQISSIVQSLRADPYSRRHIVSAWNVALIDKMALPPCHCLFQFYVDNGKLSC 161 (264) Q Consensus 82 ~~a~~~g~lg~~YG~qwR~~~~~~~~~~DQl~~vi~~Lk~np~sRR~i~~~wnp~~~~~~~lpPC~~~~qf~v~~g~L~~ 161 (264) .+++++++.||.|||+|. ++|||++||++||+||+|||+|+++|||.++.+|++||||..+||+|++|+|+| T Consensus 73 ---~~~~g~~y~YG~r~r~~~-----g~DQi~~vi~~Lk~~p~SRR~i~~~wnp~~~~~~~~pPC~~~~qf~v~~~kL~~ 144 (199) T TIGR03283 73 ---PERQGFVYTYGNRLRRYF-----GIDQIDYIIERLNQSPNSRRAIAITWDPPQDIKVDEVPCLQLVQFLIRDNKLYL 144 (199) T ss_pred ---CCCCCCCCCCHHHHHCCC-----CCCHHHHHHHHHHCCCCCCEEEEEEECCHHHCCCCCCCCCEEEEEEECCCEEEE T ss_conf ---434980857268873379-----963999999997159996169998188315515788688259999990992899 Q ss_pred EEEECCCCHHHHCCHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCEEEHHHHHHH Q ss_conf 9972624402220216799999999999762242111135227302100368999 Q gi|254780956|r 162 QLYQRSGDVFLGIPFNIASYSLLTMMLASVIGFQYGEFIHTLGDVHLYNNHFEQA 216 (264) Q Consensus 162 ~~~qRS~D~~lG~pfNi~~y~lL~~~iA~~~gl~~G~~~~~~~d~HIY~nH~~~v 216 (264) +++|||+|+|+|+|||+++|++|+++||+.+|+++|+|+|+++|+|||++|+|+| T Consensus 145 ~~~~RS~D~~lg~p~Ni~~~~~L~~~iA~~~gl~~G~~~~~~~~~HIYe~~~d~v 199 (199) T TIGR03283 145 TAFFRSNDVGGAWVANAIGLRRLQEYVAEKVGVEPGTLTTHAISAHIYERDFDEL 199 (199) T ss_pred EEEECCCHHHCCCCHHHHHHHHHHHHHHHHHCCEEEEEEEEEEEEEEECCHHHHC T ss_conf 9997410112544537999999999999983990059999976289800417219 No 11 >KOG0217 consensus Probab=45.57 E-value=25 Score=16.51 Aligned_cols=87 Identities=22% Similarity=0.370 Sum_probs=51.0 Q ss_pred EEEHHHHHHHHHHHHHHCCCCCHHHHHH-HCCCHHHHHH--------------------HHHCCCCCCC----------C Q ss_conf 4202268899999998536664688864-0032135654--------------------2100138766----------2 Q gi|254780956|r 47 TKKVHWKSVVHELLWFLRGDSNVSYLHR-HGVSIWDEWA--------------------DKDGELGPIY----------G 95 (264) Q Consensus 47 tkk~~~k~~i~EllW~l~G~tn~~~l~~-~~~~iWd~~a--------------------~~~g~lg~~Y----------G 95 (264) +++++ -+|++-++|||++.---+.|.. .+....+..+ ...|+-|..| | T Consensus 497 ~~~la-~safg~~~~Ylk~~~id~~llsm~n~~ey~~~~~s~mvlD~~tL~NleIf~Ns~~G~~~gtL~~~~~~csTpfG 575 (1125) T KOG0217 497 DKELA-LSAFGGLFYYLKKLLIDEELLSMKNFEEYDSLDQSEMVLDGATLENLEIFSNSRNGGDEGTLFYAVNRCSTPFG 575 (1125) T ss_pred CCHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHCCCCEEECCHHHHHHHHHCCCCCCCCCHHHHHHHHHCCCHHH T ss_conf 42035-98778899999987667988621324422201121201043333545540157789973208998762247177 Q ss_pred CH-HHCC---CCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHCC Q ss_conf 01-0003---677532688999999999808852367775169 Q gi|254780956|r 96 VQ-WRSW---PDYDGNVIDQISSIVQSLRADPYSRRHIVSAWN 134 (264) Q Consensus 96 ~q-wR~~---~~~~~~~~DQl~~vi~~Lk~np~sRR~i~~~wn 134 (264) .+ .|.| +..+...+.|.+..++.|..+|.+|--+...|- T Consensus 576 KRllk~Wl~~Pl~~~~~I~~R~dav~~l~~~~~~~~~~~e~l~ 618 (1125) T KOG0217 576 KRLLKTWLMAPLCDKEDIKQRQDAVDSLGKAPMDRTKVGETLK 618 (1125) T ss_pred HHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHH T ss_conf 7889998527677888899989999987417406999999986 No 12 >cd00988 PDZ_CTP_protease PDZ domain of C-terminal processing-, tail-specific-, and tricorn proteases, which function in posttranslational protein processing, maturation, and disassembly or degradation, in Bacteria, Archaea, and plant chloroplasts. May be responsible for substrate recognition and/or binding, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of protease-associated PDZ domains a C-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in Eumetazoan signaling proteins. Probab=41.99 E-value=29 Score=16.18 Aligned_cols=47 Identities=15% Similarity=0.176 Sum_probs=39.4 Q ss_pred HHHHCCCCEEEEEECCCCEEEHHHHHHHHHHHCCCCCCCCEEEECCC Q ss_conf 97622421111352273021003689999983369876877996488 Q gi|254780956|r 189 ASVIGFQYGEFIHTLGDVHLYNNHFEQADLQLSRSPRTLPQMIINPN 235 (264) Q Consensus 189 A~~~gl~~G~~~~~~~d~HIY~nH~~~vk~ql~R~p~~~P~L~i~~~ 235 (264) |...|+++|+.+..+...-+-....+.+.+.|..++=..=+|.|.+. T Consensus 26 A~~aGl~~GD~I~~Vng~~v~~~~~~~~~~~lrg~~Gt~V~l~v~R~ 72 (85) T cd00988 26 AAKAGIKAGDIIVAIDGEPVDGLSLEDVVKLLRGKAGTKVRLTLKRG 72 (85) T ss_pred HHHHCCCCCCEEEEECCEECCCCCHHHHHHHHCCCCCCEEEEEEECC T ss_conf 89808999999999999997899999999986599998899999909 No 13 >KOG3690 consensus Probab=41.72 E-value=29 Score=16.16 Aligned_cols=57 Identities=25% Similarity=0.385 Sum_probs=22.6 Q ss_pred CCCCHHHCCCCCCCCHHHHHHHHHHHHH-----CCCCHHHHHHHHCCHHHHHHCCCCCCCCC Q ss_conf 6620100036775326889999999998-----08852367775169888320242113540 Q gi|254780956|r 93 IYGVQWRSWPDYDGNVIDQISSIVQSLR-----ADPYSRRHIVSAWNVALIDKMALPPCHCL 149 (264) Q Consensus 93 ~YG~qwR~~~~~~~~~~DQl~~vi~~Lk-----~np~sRR~i~~~wnp~~~~~~~lpPC~~~ 149 (264) .||--||...+-.|.-..|++.+.++|+ =.-.-||.+..++-|..+..-.-.|-|.+ T Consensus 208 ~~gdyW~~~yE~~~~~~~~ld~i~~ei~PlY~~LHAYvRr~L~~~Ygp~~i~~~gpiPahLl 269 (646) T KOG3690 208 NGGDYWRLAYETPGDFEQDLDAIFEEIRPLYRQLHAYVRRKLRGAYGPDGISRDGPIPAHLL 269 (646) T ss_pred CHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHH T ss_conf 78888999851725699999999999889999999999999987528333798898488887 No 14 >pfam08617 CGI-121 Kinase binding protein CGI-121. CGI-121 has been shown to bind to the p53-related protein kinase (PRPK). PRPK is a novel protein kinase which binds to and induces phosphorylation of the tumour suppressor protein p53. CGI-121 is part of a conserved protein complex, KEOPS. The KEOPS complex is involved in telomere uncapping and telomere elongation. Interestingly this family also include archaeal homologues, formerly in the DUF509 family. A structure for these proteins has been solved by structural genomics. Probab=41.64 E-value=15 Score=17.83 Aligned_cols=32 Identities=31% Similarity=0.479 Sum_probs=25.9 Q ss_pred EEEHHHHHHHHHHHHHHCCCCCHHH-HHHHCCC Q ss_conf 4202268899999998536664688-8640032 Q gi|254780956|r 47 TKKVHWKSVVHELLWFLRGDSNVSY-LHRHGVS 78 (264) Q Consensus 47 tkk~~~k~~i~EllW~l~G~tn~~~-l~~~~~~ 78 (264) .+++-.|++-.|++.+|||++|+.+ |+.+|++ T Consensus 48 ~~~~~tr~~~~Eil~~lS~~~nI~~al~~fGi~ 80 (155) T pfam08617 48 RGRNKTRTLHSEILLRLSGSNNISEALKRFGIS 80 (155) T ss_pred CCCCCCCCCHHHHHHHHCCCCCHHHHHHHHCCC T ss_conf 388666861889988747524178999984978 No 15 >pfam09450 DUF2019 Domain of unknown function (DUF2019). Protein of unknown function found in bacteria. Probab=41.24 E-value=26 Score=16.47 Aligned_cols=28 Identities=18% Similarity=0.455 Sum_probs=22.5 Q ss_pred CHHHHHHHHHHHHHCCCC-HHHHHHHHCC Q ss_conf 268899999999980885-2367775169 Q gi|254780956|r 107 NVIDQISSIVQSLRADPY-SRRHIVSAWN 134 (264) Q Consensus 107 ~~~DQl~~vi~~Lk~np~-sRR~i~~~wn 134 (264) .-+||+..|-++||..|. -||+++.++. T Consensus 29 RLf~~m~~i~~ELK~R~GDqRraLl~L~~ 57 (106) T pfam09450 29 RLFDEMARISDELKSRDGDQRRALLALYS 57 (106) T ss_pred HHHHHHHHHHHHHHCCCCHHHHHHHHHHH T ss_conf 99999999999985299607899999982 No 16 >pfam03410 Peptidase_M44 Protein G1. Protein G1, named after the vaccinia virus protein, is a glycoprotein expressed by many Poxviridae. Probab=34.96 E-value=33 Score=15.83 Aligned_cols=114 Identities=18% Similarity=0.213 Sum_probs=47.2 Q ss_pred HHHHHHHHHHHHHCCCCHHHH-----HH--HHCCHHHH-----HHCCCCCCCCCE---EEEECCCEEEEEE-------EE Q ss_conf 688999999999808852367-----77--51698883-----202421135402---4667688059999-------72 Q gi|254780956|r 108 VIDQISSIVQSLRADPYSRRH-----IV--SAWNVALI-----DKMALPPCHCLF---QFYVDNGKLSCQL-------YQ 165 (264) Q Consensus 108 ~~DQl~~vi~~Lk~np~sRR~-----i~--~~wnp~~~-----~~~~lpPC~~~~---qf~v~~g~L~~~~-------~q 165 (264) ..+++..|-+.|.+.- ||+ |+ ---|+.-+ .=+.||.|...+ ++...+||.-|+- .| T Consensus 144 ML~~l~~i~~~L~~RM--~~I~GpniVIFVk~l~~~~l~ll~~TFG~LP~~P~~Ip~~~~~~i~gKivMmPsPFYTvmv~ 221 (590) T pfam03410 144 MLDQLDAVRELLGKRM--RRIAGPNVVIFVRELSPAALALLERTFGTLPACPSTIPATRLGSIHGKAVMVPSPFYTVMVQ 221 (590) T ss_pred HHHHHHHHHHHHHHHH--HHHCCCCEEEEEECCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCEEEEECCCCEEEEEE T ss_conf 8762578999999988--86108968999840788999999976278878962036766456687189805873599997 Q ss_pred CCCCHHHHCCHHHHHHHHHHHHHHHHHCCC-CEE---EEEECCCCEEEHHHHHHHHHHHCCCCCCC-CEEEEC Q ss_conf 624402220216799999999999762242-111---13522730210036899999833698768-779964 Q gi|254780956|r 166 RSGDVFLGIPFNIASYSLLTMMLASVIGFQ-YGE---FIHTLGDVHLYNNHFEQADLQLSRSPRTL-PQMIIN 233 (264) Q Consensus 166 RS~D~~lG~pfNi~~y~lL~~~iA~~~gl~-~G~---~~~~~~d~HIY~nH~~~vk~ql~R~p~~~-P~L~i~ 233 (264) =++- | -|+.+--.|-+ +-+..++| .|+ .+.+..+-+=||+-+..++. .+++- |...++ T Consensus 222 V~~t--l---~NiLai~cL~E-~YhLiDYETvg~~LYvtiSFv~E~dYE~fl~gi~~----l~f~~~~~i~l~ 284 (590) T pfam03410 222 VDPT--L---ENILAILCLYE-SYHLVDYETLGNRLYVTFSFIHEQDYEAFLRGVGE----LRFEPAPRIELN 284 (590) T ss_pred CCCC--H---HHHHHHHHHHH-HHHCCCCEEECCEEEEEEEEECHHHHHHHHCCCCC----CCCCCCCCEEEE T ss_conf 4730--8---88999999998-87200030006707999998355778998646442----344556745763 No 17 >pfam01024 Colicin Colicin pore forming domain. Probab=34.59 E-value=22 Score=16.85 Aligned_cols=40 Identities=25% Similarity=0.271 Sum_probs=26.8 Q ss_pred CCHHHCCCCCCCCHHHHHHHHHHHHHCCCC------HHHHHHHHCCHHHHHHCC Q ss_conf 201000367753268899999999980885------236777516988832024 Q gi|254780956|r 95 GVQWRSWPDYDGNVIDQISSIVQSLRADPY------SRRHIVSAWNVALIDKMA 142 (264) Q Consensus 95 G~qwR~~~~~~~~~~DQl~~vi~~Lk~np~------sRR~i~~~wnp~~~~~~~ 142 (264) |.+.|+. |+=-...+++++||+ +|-+|..+|..-+.++|+ T Consensus 47 GKkIRn~--------~dAl~s~eK~~~n~nkK~~~~Dr~AI~nAles~d~~~~A 92 (187) T pfam01024 47 GKKIRSV--------DEALASFEKYKANLNKKINAADRDAIANALESVNAQDMA 92 (187) T ss_pred CCCCCCH--------HHHHHHHHHHHHCCCHHCCHHHHHHHHHHHHHCCHHHHH T ss_conf 7713789--------999999999872820214561199999999976899999 No 18 >TIGR00824 EIIA-man PTS system, mannose/fructose/sorbose family, IIA component; InterPro: IPR013789 The phosphoenolpyruvate-dependent sugar phosphotransferase system (PTS) , is a major carbohydrate transport system in bacteria. The PTS catalyses the phosphorylation of incoming sugar substrates and coupled with translocation across the cell membrane, makes the PTS a link between the uptake and metabolism of sugars. The general mechanism of the PTS is the following: a phosphoryl group from phosphoenolpyruvate (PEP) is transferred via a signal transduction pathway, to enzyme I (EI) which in turn transfers it to a phosphoryl carrier, the histidine protein (HPr). Phospho-HPr then transfers the phosphoryl group to a sugar-specific permease, a membrane-bound complex known as enzyme 2 (EII), which transports the sugar to the cell. EII consists of at least three structurally distinct domains IIA, IIB and IIC . These can either be fused together in a single polypeptide chain or exist as two or three interactive chains, formerly called enzymes II (EII) and III (EIII). The first domain (IIA or EIIA) carries the first permease-specific phosphorylation site, a histidine which is phosphorylated by phospho-HPr. The second domain (IIB or EIIB) is phosphorylated by phospho-IIA on a cysteinyl or histidyl residue, depending on the sugar transported. Finally, the phosphoryl group is transferred from the IIB domain to the sugar substrate concomitantly with the sugar uptake processed by the IIC domain. This third domain (IIC or EIIC) forms the translocation channel and the specific substrate-binding site. An additional transmembrane domain IID, homologous to IIC, can be found in some PTSs, e.g. for mannose , , , . The Man family is unique in several respects among PTS permease families. It is the only PTS family in which members possess a IID protein. It is the only PTS family in which the IIB constituent is phosphorylated on a histidyl rather than a cysteyl residue. Its permease members exhibit broad specificity for a range of sugars, rather than being specific for just one or a few sugars. The mannose permease of Escherichia coli, for example, can transport and phosphorylate glucose, mannose, fructose, glucosamine, N-acetylglucosamine, and other sugars. Other members of this can transport sorbose, fructose and N-acetylglucosamine. This family is specific for IIA and IIB components.. Probab=30.25 E-value=18 Score=17.38 Aligned_cols=14 Identities=43% Similarity=0.698 Sum_probs=10.4 Q ss_pred CCHHHHCCHHHHHH Q ss_conf 44022202167999 Q gi|254780956|r 168 GDVFLGIPFNIASY 181 (264) Q Consensus 168 ~D~~lG~pfNi~~y 181 (264) .|+|-|-|||.|+- T Consensus 65 VD~~GGsPyNAAa~ 78 (117) T TIGR00824 65 VDIFGGSPYNAAAR 78 (117) T ss_pred EECCCCCCHHHHHH T ss_conf 87689982888989 No 19 >pfam09687 P_fal_TIGR01639 Protein of unknown function (P_fal_TIGR01639). These sequences contain a conserved sequence region of about 60 amino acids found in over 40 predicted proteins of Plasmodium falciparum. It is not currently found elsewhere, even in closely related Plasmodium species. No member of this family has been functionally characterized. Probab=27.97 E-value=43 Score=15.17 Aligned_cols=25 Identities=20% Similarity=0.510 Sum_probs=22.1 Q ss_pred HHHHHHHHHHHCCCCHHHHHHHHCCH Q ss_conf 89999999998088523677751698 Q gi|254780956|r 110 DQISSIVQSLRADPYSRRHIVSAWNV 135 (264) Q Consensus 110 DQl~~vi~~Lk~np~sRR~i~~~wnp 135 (264) .++..|++.|++.|. |.-+.++||- T Consensus 13 eeL~~VL~sL~E~p~-~edl~nIWnh 37 (61) T pfam09687 13 EELYDVLNSLEECPP-NEDLYNIWNH 37 (61) T ss_pred HHHHHHHHHHHCCCC-HHHHHHHHHH T ss_conf 999999988862993-7789999999 No 20 >pfam05613 Herpes_U15 Human herpesvirus U15 protein. Probab=27.56 E-value=34 Score=15.77 Aligned_cols=34 Identities=26% Similarity=0.320 Sum_probs=26.3 Q ss_pred CCCEEEEEEHHHHHHHHHHHHHHCCCCCHHHHHH Q ss_conf 9810464202268899999998536664688864 Q gi|254780956|r 41 GFPLLTTKKVHWKSVVHELLWFLRGDSNVSYLHR 74 (264) Q Consensus 41 gfPllTtkk~~~k~~i~EllW~l~G~tn~~~l~~ 74 (264) -||.||.--.-.||.+-|+++++..=.+++.+-. T Consensus 51 i~paltlsmtvtks~vie~lfi~krwe~i~qff~ 84 (110) T pfam05613 51 IFPALTLSMTVTKSLVIEALFIIKRWEDIDQFFN 84 (110) T ss_pred HHHHCEEEEEEEHHHHHHHHHHHHHHHHHHHHHH T ss_conf 8630034565408999999999999999999871 No 21 >pfam08359 TetR_C_4 YsiA-like protein, C-terminal region. The members of this family are thought to be TetR-type transcriptional regulators that bear particular similarity to YsiA, a hypothetical protein expressed by B. subtilis. Probab=27.48 E-value=25 Score=16.56 Aligned_cols=49 Identities=16% Similarity=0.462 Sum_probs=26.5 Q ss_pred CHHHHHHHHHHHHCCCEECCCCCCCEEEEEECEEEEECCCCCCEEEEEEHHHHHHHHHH--HHHHCCCC Q ss_conf 95799999999863926037998534777612046442369810464202268899999--99853666 Q gi|254780956|r 1 MHQYLDLLRHVIKFGSDRRDRTGVGTRSTFGYQMRFDLSKGFPLLTTKKVHWKSVVHEL--LWFLRGDS 67 (264) Q Consensus 1 m~qYl~l~~~il~~G~~~~dRtG~~t~s~fg~~~~~dl~~gfPllTtkk~~~k~~i~El--lW~l~G~t 67 (264) ++.|++++..|+.+|..... +|-| .+.-.+|.+++ +.+.|+ -|.++|.. T Consensus 66 l~~Y~~li~~ii~eG~~~G~-------------fr~d----ld~~l~r~~i~-G~id~~v~~Wv~~~~k 116 (133) T pfam08359 66 LKGYLTLIDEIIAEGKESGE-------------FREE----LDVRLARQMIF-GTLDETVTNWVMNDQK 116 (133) T ss_pred HHHHHHHHHHHHHHHHHCCC-------------CCCC----CCHHHHHHHHH-HHHHHHHHHHHHCCCC T ss_conf 99999999999999998397-------------8887----89999999999-9899999999976998 No 22 >TIGR02217 chp_TIGR02217 conserved hypothetical protein TIGR02217; InterPro: IPR011740 This model represents a family of conserved hypothetical proteins. It is usually (but not always) found in apparent phage-derived regions of bacterial chromosomes.. Probab=25.95 E-value=23 Score=16.78 Aligned_cols=20 Identities=20% Similarity=0.612 Sum_probs=16.2 Q ss_pred HCCCCCCCCCHHHCCCCCCC Q ss_conf 00138766201000367753 Q gi|254780956|r 87 DGELGPIYGVQWRSWPDYDG 106 (264) Q Consensus 87 ~g~lg~~YG~qwR~~~~~~~ 106 (264) +---|+.||++||.|.|+.+ T Consensus 66 ~ARrG~~~gFRFRD~~Df~s 85 (220) T TIGR02217 66 EARRGSLFGFRFRDPLDFSS 85 (220) T ss_pred HHCCCCCEEEEECCCCCCCC T ss_conf 54177854322057001123 No 23 >cd00989 PDZ_metalloprotease PDZ domain of bacterial and plant zinc metalloprotases, presumably membrane-associated or integral membrane proteases, which may be involved in signalling and regulatory mechanisms. May be responsible for substrate recognition and/or binding, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of protease-associated PDZ domains a C-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in Eumetazoan signaling proteins. Probab=25.35 E-value=55 Score=14.50 Aligned_cols=47 Identities=15% Similarity=0.223 Sum_probs=34.9 Q ss_pred HHHHHHCCCCEEEEEECCCCEEEHHHHHHHHHHHCCCCCCCCEEEECCC Q ss_conf 9997622421111352273021003689999983369876877996488 Q gi|254780956|r 187 MLASVIGFQYGEFIHTLGDVHLYNNHFEQADLQLSRSPRTLPQMIINPN 235 (264) Q Consensus 187 ~iA~~~gl~~G~~~~~~~d~HIY~nH~~~vk~ql~R~p~~~P~L~i~~~ 235 (264) --|...|+++|+.+..+++--|. .++.+.+++...|-.-=++.+.++ T Consensus 23 spA~~AGl~~GD~I~~ing~~v~--~~~~~~~~i~~~~~~~i~l~v~R~ 69 (79) T cd00989 23 SPAAKAGLKAGDRILAINGQKIK--SWEDLVDAVQENPGKPLTLTVERN 69 (79) T ss_pred CHHHHCCCCCCCEEEEECCEECC--CHHHHHHHHHHCCCCEEEEEEEEC T ss_conf 98998599999999999999958--999999999858998899999999 No 24 >pfam09343 DUF2460 Conserved hypothetical protein 2217 (DUF2460). This model represents a family of conserved hypothetical proteins. It is usually (but not always) found in apparent phage-derived regions of bacterial chromosomes. Probab=24.40 E-value=20 Score=17.19 Aligned_cols=18 Identities=33% Similarity=1.062 Sum_probs=14.1 Q ss_pred CCCCCCCCHHHCCCCCCC Q ss_conf 138766201000367753 Q gi|254780956|r 89 ELGPIYGVQWRSWPDYDG 106 (264) Q Consensus 89 ~lg~~YG~qwR~~~~~~~ 106 (264) --|.+||++||.|.+|.. T Consensus 66 RrG~~~gFRfkD~~D~~s 83 (197) T pfam09343 66 RRGRLYGFRFKDWADYKS 83 (197) T ss_pred HCCCCCCEEECCCCCCCC T ss_conf 478634234148242001 No 25 >pfam07353 Uroplakin_II Uroplakin II. This family contains uroplakin II, which is approximately 180 residues long and seems to be restricted to mammals. Uroplakin II is an integral membrane protein, and is one of the components of the apical plaques of mammalian urothelium formed by the asymmetric unit membrane - this is believed to play a role in strengthening the urothelial apical surface to prevent the cells from rupturing during bladder distension. Probab=23.07 E-value=25 Score=16.61 Aligned_cols=11 Identities=55% Similarity=0.960 Sum_probs=5.8 Q ss_pred CCCCCCCCCEE Q ss_conf 24211354024 Q gi|254780956|r 141 MALPPCHCLFQ 151 (264) Q Consensus 141 ~~lpPC~~~~q 151 (264) .+|||||..-| T Consensus 46 vaLPPChLtgg 56 (184) T pfam07353 46 VALPPCHLTGG 56 (184) T ss_pred HCCCCCEECCC T ss_conf 30895102288 No 26 >TIGR01011 rpsB_bact ribosomal protein S2; InterPro: IPR005706 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites , . About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many of ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome , . This family describes the bacterial, mitochondrial and chloroplast forms of ribosomal protein S2.; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0015935 small ribosomal subunit. Probab=22.94 E-value=34 Score=15.76 Aligned_cols=10 Identities=30% Similarity=0.328 Sum_probs=4.0 Q ss_pred HHHHHHCCCH Q ss_conf 8886400321 Q gi|254780956|r 70 SYLHRHGVSI 79 (264) Q Consensus 70 ~~l~~~~~~i 79 (264) +.|.|-|||+ T Consensus 5 k~LleaGvHF 14 (227) T TIGR01011 5 KDLLEAGVHF 14 (227) T ss_pred HHHHHCCCCC T ss_conf 7686388543 No 27 >TIGR02790 nickel_nikC nickel ABC transporter, permease subunit NikC; InterPro: IPR014157 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This family consists of the NikC family of nickel ABC transporter permeases. Operons that contain this protein also contain a homologous permease subunit NikB. Nickel is used in cells as part of urease or certain hydrogenases or superoxide dismutases.. Probab=22.82 E-value=37 Score=15.52 Aligned_cols=28 Identities=25% Similarity=0.497 Sum_probs=22.0 Q ss_pred CCCEEEEEEEECCCCHHHHCCHHHHHHHH Q ss_conf 68805999972624402220216799999 Q gi|254780956|r 155 DNGKLSCQLYQRSGDVFLGIPFNIASYSL 183 (264) Q Consensus 155 ~~g~L~~~~~qRS~D~~lG~pfNi~~y~l 183 (264) ++|.++ ++.||-||+++..|--+.+.++ T Consensus 89 ~GG~vD-~~~MR~~D~~lsfPt~~L~L~i 116 (258) T TIGR02790 89 IGGRVD-EVIMRVCDVFLSFPTIVLALAI 116 (258) T ss_pred CCCHHH-EEHHHHHHHHHHHHHHHHHHHH T ss_conf 154020-0025547588755789999999 No 28 >cd06570 GH20_chitobiase-like_1 A functionally uncharacterized subgroup of the Glycosyl hydrolase family 20 (GH20) catalytic domain found in proteins similar to the chitobiase of Serratia marcescens, a beta-N-1,4-acetylhexosaminidase that hydrolyzes the beta-1,4-glycosidic linkages in oligomers derived from chitin. Chitin is degraded by a two step process: i) a chitinase hydrolyzes the chitin to oligosaccharides and disaccharides such as di-N-acetyl-D-glucosamine and chitobiose, ii) chitobiase then further degrades these oligomers into monomers. This subgroup lacks the C-terminal PKD (polycystic kidney disease I)-like domain found in the chitobiases. The GH20 hexosaminidases are thought to act via a catalytic mechanism in which the catalytic nucleophile is not provided by solvent or the enzyme, but by the substrate itself. Probab=22.73 E-value=62 Score=14.19 Aligned_cols=32 Identities=25% Similarity=0.450 Sum_probs=22.2 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCEEEEEECCCC Q ss_conf 21679999999999976224211113522730 Q gi|254780956|r 175 PFNIASYSLLTMMLASVIGFQYGEFIHTLGDV 206 (264) Q Consensus 175 pfNi~~y~lL~~~iA~~~gl~~G~~~~~~~d~ 206 (264) |.|=..|.+|..++.+.+.+-|++++|..||- T Consensus 126 p~~~~ty~fl~~vl~Ev~~lFp~~y~HiGGDE 157 (311) T cd06570 126 PTNEETYTFLDNLFGEMAELFPDEYFHIGGDE 157 (311) T ss_pred CCCHHHHHHHHHHHHHHHHHCCCCEEEECCCC T ss_conf 89889999999999999985786415645754 No 29 >pfam07958 DUF1688 Protein of unknown function (DUF1688). A family of uncharacterized proteins. Probab=21.83 E-value=60 Score=14.25 Aligned_cols=14 Identities=14% Similarity=0.349 Sum_probs=5.9 Q ss_pred CHHHHHHHHHHHHH Q ss_conf 26889999999998 Q gi|254780956|r 107 NVIDQISSIVQSLR 120 (264) Q Consensus 107 ~~~DQl~~vi~~Lk 120 (264) +++|-+..++.+.+ T Consensus 69 Ggv~R~~~L~~~~~ 82 (413) T pfam07958 69 GGVDRIAELLARWA 82 (413) T ss_pred CCCCHHHHHHHHCC T ss_conf 89649999998543 No 30 >TIGR02303 HpaG-C-term 4-hydroxyphenylacetate degradation bifunctional isomerase/decarboxylase, C-terminal subunit; InterPro: IPR012684 This entry represents one of two subunits/domains of the bifunctional isomerase/decarboxylase involved in 4-hydroxyphenylacetate degradation . In Escherichia coli and some other species, this enzyme is encoded by a single polypeptide containing both this domain and the closely related N-terminal domain (IPR012686 from INTERPRO). In other species such as Pasteurella multocida, these domains are found as two separate proteins (usually as tandem genes). Together, these domains carry out the decarboxylation of 5-oxopent-3-ene-1,2,5-tricarboxylic acid (OPET) to 2-hydroxy-2,4-diene-1,7-dioate (HHDD) and the subsequent isomerization to 2-oxohept-3-ene-1,7-dioate (OHED).; GO: 0008704 5-carboxymethyl-2-hydroxymuconate delta-isomerase activity, 0018800 5-oxopent-3-ene-125-tricarboxylate decarboxylase activity. Probab=21.72 E-value=54 Score=14.53 Aligned_cols=39 Identities=21% Similarity=0.375 Sum_probs=29.3 Q ss_pred EEEEEEEE--CCCCHHHHCCHHHHHHHHHHHHHHHHHCCCCEEEEEEC Q ss_conf 05999972--62440222021679999999999976224211113522 Q gi|254780956|r 158 KLSCQLYQ--RSGDVFLGIPFNIASYSLLTMMLASVIGFQYGEFIHTL 203 (264) Q Consensus 158 ~L~~~~~q--RS~D~~lG~pfNi~~y~lL~~~iA~~~gl~~G~~~~~~ 203 (264) ..|.-+.| --+||..-+|| |.+.+++..=|+||+++-+. T Consensus 178 yVNGEL~Q~G~T~dMiF~va~-------LiEylS~~mTL~PgD~ilTG 218 (249) T TIGR02303 178 YVNGELTQEGNTSDMIFSVAY-------LIEYLSEFMTLEPGDVILTG 218 (249) T ss_pred EEEEEEEECCCCCHHHHCHHH-------HHHHHHHCCCCCCCCEEEEC T ss_conf 771378646874211204678-------99986310247856488705 No 31 >cd04483 hOBFC1_like hOBFC1_like: A subfamily of OB folds similar to that found in human OB fold containing protein 1 (hOBFC1). Members of this group belong to the Replication protein A subunit 2 (RPA2) family of OB folds. RPA is a nuclear ssDNA binding protein (SSB) which appears to be involved in all aspects of DNA metabolism including replication, recombination, and repair. RPA also mediates specific interactions of various nuclear proteins. In animals, plants, and fungi, RPA is a heterotrimer with subunits of 70KDa (RPA1), 32kDa (RPA2), and 14 KDa (RPA3). The OB fold domain of RPA2 has dual roles in ssDNA binding and trimerization. Probab=21.63 E-value=65 Score=14.05 Aligned_cols=65 Identities=17% Similarity=0.338 Sum_probs=46.1 Q ss_pred CCEEEEECCC--EEEEEEEECCCCHHHHCCHHHHH-HHHHHHHHHHHHCCCCEEEEEECCCCEEEHHH Q ss_conf 4024667688--05999972624402220216799-99999999976224211113522730210036 Q gi|254780956|r 148 CLFQFYVDNG--KLSCQLYQRSGDVFLGIPFNIAS-YSLLTMMLASVIGFQYGEFIHTLGDVHLYNNH 212 (264) Q Consensus 148 ~~~qf~v~~g--~L~~~~~qRS~D~~lG~pfNi~~-y~lL~~~iA~~~gl~~G~~~~~~~d~HIY~nH 212 (264) .++-|-|+|| -++|.+..-.++...+-+.+.+. -..++.-+++...++.|++.+.=|-..-|..+ T Consensus 14 ~f~~y~vDDgTGcI~Cv~W~~~~~~~~~s~~~~~~~~~~~~~a~~~~~~l~iG~llRVRG~I~~yRg~ 81 (92) T cd04483 14 TFYSFGVDDGTGVVNCVCWKNLSYAEVSSRSDAARILKSALMALKQAKVLEIGDLLRVRGSIRTYRGE 81 (92) T ss_pred CEEEEEEECCCCEEEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCEEEEEEEEEEEECE T ss_conf 71699860898669989960457677666678445444457888761157752389994268887044 Done!