Query gi|254780658|ref|YP_003065071.1| 5-formyltetrahydrofolate cyclo-ligase [Candidatus Liberibacter asiaticus str. psy62] Match_columns 113 No_of_seqs 112 out of 1985 Neff 6.0 Searched_HMMs 39220 Date Mon May 30 00:07:05 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780658.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK10333 putative ligase; Prov 100.0 2.6E-41 0 240.5 7.8 103 1-105 92-198 (198) 2 TIGR02727 MTHFS_bact 5,10-meth 100.0 7.1E-41 1.4E-45 238.1 6.3 98 1-99 82-183 (183) 3 pfam01812 5-FTHF_cyc-lig 5-for 100.0 1.3E-37 3.4E-42 220.7 7.6 95 3-99 84-182 (182) 4 COG0212 5-formyltetrahydrofola 100.0 3.5E-36 8.8E-41 213.2 6.9 104 1-106 83-191 (191) 5 KOG3093 consensus 99.9 3.4E-28 8.8E-33 170.7 4.3 91 10-100 101-200 (200) 6 KOG4410 consensus 99.4 9.9E-14 2.5E-18 93.9 4.8 80 29-108 156-241 (396) 7 TIGR01899 cas_TM1807_csm5 CRIS 57.1 2.8 7.1E-05 22.5 -0.2 19 44-62 376-394 (465) 8 PRK12420 histidyl-tRNA synthet 55.4 8.6 0.00022 19.9 2.1 31 51-81 295-325 (421) 9 PRK13176 consensus 47.2 25 0.00063 17.4 3.4 54 28-83 42-96 (216) 10 PRK01175 phosphoribosylformylg 43.3 26 0.00066 17.3 3.0 60 29-90 47-114 (255) 11 PRK00037 hisS histidyl-tRNA sy 41.6 12 0.0003 19.1 1.0 30 50-81 283-312 (417) 12 TIGR02821 fghA_ester_D S-formy 41.1 15 0.00037 18.6 1.4 58 2-59 14-109 (279) 13 PRK12295 hisZ ATP phosphoribos 39.3 23 0.0006 17.5 2.3 36 47-83 334-369 (373) 14 PRK12292 hisZ ATP phosphoribos 39.0 22 0.00055 17.7 2.1 30 50-82 286-315 (388) 15 COG1332 CRISPR system related 38.2 5.1 0.00013 21.1 -1.3 15 47-61 304-318 (369) 16 CHL00201 syh histidine-tRNA sy 38.1 24 0.00061 17.5 2.2 29 50-80 282-310 (424) 17 cd00773 HisRS-like_core Class 34.1 31 0.00079 16.9 2.2 49 28-78 194-253 (261) 18 cd01748 GATase1_IGP_Synthase T 29.6 54 0.0014 15.6 3.2 51 28-82 34-85 (198) 19 pfam11959 DUF3473 Domain of un 29.2 44 0.0011 16.1 2.3 51 11-61 5-57 (133) 20 KOG0967 consensus 28.6 21 0.00054 17.8 0.6 72 32-110 560-635 (714) 21 pfam01686 Adeno_Penton_B Adeno 27.0 26 0.00066 17.3 0.8 22 27-48 196-217 (454) 22 PRK13175 consensus 24.3 58 0.0015 15.5 2.2 51 28-82 37-88 (206) 23 pfam10001 DUF2242 Uncharacteri 23.7 37 0.00093 16.5 1.1 17 83-99 86-102 (121) 24 PRK13178 consensus 23.6 70 0.0018 15.0 3.2 52 29-83 36-87 (213) 25 PRK13177 consensus 21.5 77 0.002 14.8 2.8 55 28-84 39-94 (207) No 1 >PRK10333 putative ligase; Provisional Probab=100.00 E-value=2.6e-41 Score=240.48 Aligned_cols=103 Identities=29% Similarity=0.501 Sum_probs=92.5 Q ss_pred CEEEEECCCCCCEECCCCCCCCCCCCC----CCCCCEEEEHHHHHCCCHHHHCCCCEEHHHHHHHHHHCCCCEEEEEEEC Q ss_conf 934653488852464888708899986----1146677401422111001112486037899988874499837999832 Q gi|254780658|r 1 MIFRQYENPTDLVKSAVGTLSPPSYAK----EMSPDLILMPLIAFDLKGNRIGYGRGYYDRAIADIRLEGKNPYLLGIAF 76 (113) Q Consensus 1 l~F~~~~~~~~l~~~~~gI~EP~~~~~----~~~~DlilVP~lafD~~G~RLG~GgGyYDR~l~~~~~~~~~~~~igla~ 76 (113) |.|+.|++.++|.+|+|||+||..+.. ...+|++||||||||++|+|||||||||||+|++++.. ++..||+|| T Consensus 92 l~f~~~~~~~~l~~~~~gI~EP~~~~~~~~~~~~iDlilVP~laFD~~G~RLGyGgGyYDR~L~~~~~~--~~~~Igla~ 169 (198) T PRK10333 92 LLFLNYHPQSELVMNRLKIHEPKLDVRDVLPLSRLDVLITPLVAFDEYGQRLGMGGGFYDRTLQNWQHY--KTQPVGYAH 169 (198) T ss_pred EEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCEEECCCCCEEECCCCCHHHHHHHCCCC--CCCEEEEEE T ss_conf 799980899845528889999998755544644488499440466588816366874251575644203--871899971 Q ss_pred HHHEECCCCCCCCCCCCCEEECCCCEEEE Q ss_conf 23242478998447511589989855970 Q gi|254780658|r 77 DMQETSCIQAEATDIRLHAILTESRFSQF 105 (113) Q Consensus 77 ~~q~~~~ip~e~hD~~ld~iiTe~~~~~f 105 (113) ++|+++++|.|+||++||+||||+++|+| T Consensus 170 ~~Q~~~~iP~e~hD~~lD~iiTe~~iy~f 198 (198) T PRK10333 170 DCQLVEKLPVEEWDIPLPAVVTPSKVWEW 198 (198) T ss_pred CCEEECCCCCCCCCCCCCEEECCCCEEEC T ss_conf 22760657997325547999888811519 No 2 >TIGR02727 MTHFS_bact 5,10-methenyltetrahydrofolate synthetase; InterPro: IPR002698 5-formyltetrahydrofolate cyclo-ligase or methenyl-THF synthetase 6.3.3.2 from EC catalyses the interchange of 5-formyltetrahydrofolate (5-FTHF) to 5-10-methenyltetrahydrofolate, this requires ATP and Mg^2+ . 5-FTHF is used in chemotherapy where it is clinically known as Leucovorin .; GO: 0005524 ATP binding, 0030272 5-formyltetrahydrofolate cyclo-ligase activity, 0009396 folic acid and derivative biosynthetic process. Probab=100.00 E-value=7.1e-41 Score=238.13 Aligned_cols=98 Identities=39% Similarity=0.691 Sum_probs=88.7 Q ss_pred CEEEEECCCCCCEECCCCCCCCCCCCC-C---CCCCEEEEHHHHHCCCHHHHCCCCEEHHHHHHHHHHCCCCEEEEEEEC Q ss_conf 934653488852464888708899986-1---146677401422111001112486037899988874499837999832 Q gi|254780658|r 1 MIFRQYENPTDLVKSAVGTLSPPSYAK-E---MSPDLILMPLIAFDLKGNRIGYGRGYYDRAIADIRLEGKNPYLLGIAF 76 (113) Q Consensus 1 l~F~~~~~~~~l~~~~~gI~EP~~~~~-~---~~~DlilVP~lafD~~G~RLG~GgGyYDR~l~~~~~~~~~~~~igla~ 76 (113) |.|..|+..+.|.+|+|||+||..+.. . .++|+||||+||||++|+|||||||||||||++++.....++.||+|| T Consensus 82 ~~~~~~~~~~~l~~~~~Gi~eP~~~~~~~~~~~~~Dl~~VP~vAfD~~G~RlGyGgGyYDR~L~~~~~~~~~~~~ig~~f 161 (183) T TIGR02727 82 LFFRIWSPEDPLVKGPFGILEPPEDDTEPVPPDEIDLIIVPGVAFDRRGYRLGYGGGYYDRFLARLKGKTVPAIKIGLAF 161 (183) T ss_pred EEEEEECCCCCCCCCCCCCCCCCCCCCCCCCHHHCCEEEECCEEECCCCCEEECCCCHHHHHHHCCCCCCCCEEEEEEEE T ss_conf 56775168764002667885888755556551011768826376889886740568746787550678887606888510 Q ss_pred HHHEECCCCCCCCCCCCCEEECC Q ss_conf 23242478998447511589989 Q gi|254780658|r 77 DMQETSCIQAEATDIRLHAILTE 99 (113) Q Consensus 77 ~~q~~~~ip~e~hD~~ld~iiTe 99 (113) ++|+++ +|.|+||+|||.|||| T Consensus 162 ~~q~~~-~P~e~hD~p~d~~~Te 183 (183) T TIGR02727 162 DFQLVD-LPREPHDVPVDAIITE 183 (183) T ss_pred HHHHHC-CCCCCCCCCCCEEECC T ss_conf 001220-6887355225714709 No 3 >pfam01812 5-FTHF_cyc-lig 5-formyltetrahydrofolate cyclo-ligase family. 5-formyltetrahydrofolate cyclo-ligase or methenyl-THF synthetase EC:6.3.3.2 catalyses the interchange of 5-formyltetrahydrofolate (5-FTHF) to 5-10-methenyltetrahydrofolate, this requires ATP and Mg2+. 5-FTHF is used in chemotherapy where it is clinically known as Leucovorin. Probab=100.00 E-value=1.3e-37 Score=220.72 Aligned_cols=95 Identities=34% Similarity=0.550 Sum_probs=84.2 Q ss_pred EEEECCCCCCEECCCCCCCCCCCCC----CCCCCEEEEHHHHHCCCHHHHCCCCEEHHHHHHHHHHCCCCEEEEEEECHH Q ss_conf 4653488852464888708899986----114667740142211100111248603789998887449983799983223 Q gi|254780658|r 3 FRQYENPTDLVKSAVGTLSPPSYAK----EMSPDLILMPLIAFDLKGNRIGYGRGYYDRAIADIRLEGKNPYLLGIAFDM 78 (113) Q Consensus 3 F~~~~~~~~l~~~~~gI~EP~~~~~----~~~~DlilVP~lafD~~G~RLG~GgGyYDR~l~~~~~~~~~~~~igla~~~ 78 (113) ++.|.+.+.|++|+|||+||..... ...+|++||||||||++|+|||||||||||||+.++. .++++|||||++ T Consensus 84 ~~~~~~~~~l~~~~~gI~EP~~~~~~~~~~~~lDlilVP~vafD~~G~RLG~GgGyYDR~L~~~~~--~~~~~Igva~~~ 161 (182) T pfam01812 84 ITPYYPETGLPSGPYGLPEPIEEEQRELALNQIDLVLVPGVAFDRQGYRLGRGGGYYDRYLARLQG--HGPLTVGLAYDE 161 (182) T ss_pred EEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCEEECCCCCEEECCCCCHHHHHHHCCC--CCCEEEEEEEHH T ss_conf 872698773642788987898765544675668889955135758994700687407789885545--698399998202 Q ss_pred HEECCCCCCCCCCCCCEEECC Q ss_conf 242478998447511589989 Q gi|254780658|r 79 QETSCIQAEATDIRLHAILTE 99 (113) Q Consensus 79 q~~~~ip~e~hD~~ld~iiTe 99 (113) |+++++|.|+||++||+|||| T Consensus 162 Q~~~~ip~e~hD~~ld~iiTE 182 (182) T pfam01812 162 QLVERLPQEPHDVPVDEIVTE 182 (182) T ss_pred EEECCCCCCCCCCCCCEEECC T ss_conf 251768997215438989709 No 4 >COG0212 5-formyltetrahydrofolate cyclo-ligase [Coenzyme metabolism] Probab=100.00 E-value=3.5e-36 Score=213.22 Aligned_cols=104 Identities=39% Similarity=0.745 Sum_probs=94.1 Q ss_pred CEEEEECCC--CCCEECCCCCCCCCCCCCCC---CCCEEEEHHHHHCCCHHHHCCCCEEHHHHHHHHHHCCCCEEEEEEE Q ss_conf 934653488--85246488870889998611---4667740142211100111248603789998887449983799983 Q gi|254780658|r 1 MIFRQYENP--TDLVKSAVGTLSPPSYAKEM---SPDLILMPLIAFDLKGNRIGYGRGYYDRAIADIRLEGKNPYLLGIA 75 (113) Q Consensus 1 l~F~~~~~~--~~l~~~~~gI~EP~~~~~~~---~~DlilVP~lafD~~G~RLG~GgGyYDR~l~~~~~~~~~~~~igla 75 (113) |.|..|.+. ..+.+++|||.||......+ .+|+++||+||||++|+|||||||||||+|+.+. +.++.+||+| T Consensus 83 ~~f~~~~~~~~~~~~~~~~gi~eP~~~~~~~~~~~iDlvlvP~Vafd~~G~RLG~GgGyYDR~la~~~--~~~~~~ig~~ 160 (191) T COG0212 83 LLFLRYIPDPLQPLIKNRFGILEPGEYGRKIPPPEIDLVLVPLVAFDKQGYRLGYGGGYYDRYLANLR--GRKTPTVGIA 160 (191) T ss_pred CEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEECEEEECCCCCEEECCCCHHHHHHHHHC--CCCCCEEEEE T ss_conf 31389426876411233213448764567688667889995927798999432668657889987633--4688778987 Q ss_pred CHHHEECCCCCCCCCCCCCEEECCCCEEEEC Q ss_conf 2232424789984475115899898559700 Q gi|254780658|r 76 FDMQETSCIQAEATDIRLHAILTESRFSQFS 106 (113) Q Consensus 76 ~~~q~~~~ip~e~hD~~ld~iiTe~~~~~f~ 106 (113) |++|+++.+|.|+||++||.|+||++++++. T Consensus 161 ~~~Q~v~~lP~e~hDv~ld~ivt~~~~~~~~ 191 (191) T COG0212 161 YDCQLVDHLPREPHDVPLDAIVTEEGVIRCL 191 (191) T ss_pred EEEHHHCCCCCCCCCCCCCEEEECCCEEECC T ss_conf 7001222378850026257899687327549 No 5 >KOG3093 consensus Probab=99.94 E-value=3.4e-28 Score=170.73 Aligned_cols=91 Identities=34% Similarity=0.594 Sum_probs=80.1 Q ss_pred CCCEECCCCCCCCCCCCCC-----C--CCCEEEEHHHHHCCCHHHHCCCCEEHHHHHHHHHHC--CCCEEEEEEECHHHE Q ss_conf 8524648887088999861-----1--466774014221110011124860378999888744--998379998322324 Q gi|254780658|r 10 TDLVKSAVGTLSPPSYAKE-----M--SPDLILMPLIAFDLKGNRIGYGRGYYDRAIADIRLE--GKNPYLLGIAFDMQE 80 (113) Q Consensus 10 ~~l~~~~~gI~EP~~~~~~-----~--~~DlilVP~lafD~~G~RLG~GgGyYDR~l~~~~~~--~~~~~~igla~~~q~ 80 (113) +.|..++|||++|...... . -+|++|||+||||+.|.|||+|+||||+||.++... ..+|+++|||+.+|+ T Consensus 101 ~~l~~t~w~i~qp~~~~~re~~~~t~~~lDLiivPGvAFd~~g~RlGhGkGYYD~flkry~~~~~~~kp~~vgL~l~EQI 180 (200) T KOG3093 101 ESLPMTSWGIRQPKPGGPREDALETGHPLDLIIVPGVAFDRKGARLGHGKGYYDDFLKRYQIHAPEQKPLLVGLCLKEQI 180 (200) T ss_pred HHCCCCCCCCCCCCCCCCHHHHCCCCCCCEEEEECCCCCCHHHHHCCCCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHH T ss_conf 52752002787899887234323579860289836622133202126886518899999998655568314544546765 Q ss_pred ECCCCCCCCCCCCCEEECCC Q ss_conf 24789984475115899898 Q gi|254780658|r 81 TSCIQAEATDIRLHAILTES 100 (113) Q Consensus 81 ~~~ip~e~hD~~ld~iiTe~ 100 (113) +.++|+++||++||.+|||+ T Consensus 181 ~~~IPv~~~Dv~lD~vvt~~ 200 (200) T KOG3093 181 LSEIPVEEHDVKLDAVVTED 200 (200) T ss_pred CCCCCCCCCCEEEEEEECCC T ss_conf 36678885650356763388 No 6 >KOG4410 consensus Probab=99.44 E-value=9.9e-14 Score=93.92 Aligned_cols=80 Identities=24% Similarity=0.348 Sum_probs=68.4 Q ss_pred CCCCEEEEHHHHHCCCHHHHCCCCEEHHHHHHHHHHCC---CCEEEEEEECHHHEECCCCC---CCCCCCCCEEECCCCE Q ss_conf 14667740142211100111248603789998887449---98379998322324247899---8447511589989855 Q gi|254780658|r 29 MSPDLILMPLIAFDLKGNRIGYGRGYYDRAIADIRLEG---KNPYLLGIAFDMQETSCIQA---EATDIRLHAILTESRF 102 (113) Q Consensus 29 ~~~DlilVP~lafD~~G~RLG~GgGyYDR~l~~~~~~~---~~~~~igla~~~q~~~~ip~---e~hD~~ld~iiTe~~~ 102 (113) +..|+++|-.||+++.|+|||.|.||-|--++-+..-+ +..+.+-+.++||+++++|. +.||+|+|+|+||+++ T Consensus 156 lkvDlvViGSVavs~~G~RiGkGeGfAdLeygmli~mGAi~~~TpvVTiVHDcQvVD~iP~el~~~hD~pVDiI~TPTrv 235 (396) T KOG4410 156 LKVDLVVIGSVAVSREGYRIGKGEGFADLEYGMLIEMGAITPKTPVVTIVHDCQVVDSIPPELFQKHDTPVDIIATPTRV 235 (396) T ss_pred CEEEEEEEEEEEECCCCEEECCCCCHHHHHHHHHHHHCCCCCCCCEEEEEECCEEECCCCHHHHHHCCCCCCEEECCCEE T ss_conf 26779998457751333010367760344667888752668888559997413541447988986427872078635047 Q ss_pred EEECHH Q ss_conf 970033 Q gi|254780658|r 103 SQFSTE 108 (113) Q Consensus 103 ~~f~~~ 108 (113) +..+.- T Consensus 236 I~t~~~ 241 (396) T KOG4410 236 IRTPKR 241 (396) T ss_pred EECCCC T ss_conf 754887 No 7 >TIGR01899 cas_TM1807_csm5 CRISPR-associated RAMP protein, Csm5 family; InterPro: IPR010173 CRISPR is a term for Clustered Regularly Interspaced Short Palidromic Repeats. A number of protein families appear only in association with these repeats and are designated Cas (CRISPR associated) proteins. This family, represented by TM1807 of Thermotoga maritima, is found also in Mycobacterium tuberculosis and in a number of archaeal species.. Probab=57.14 E-value=2.8 Score=22.46 Aligned_cols=19 Identities=21% Similarity=0.525 Sum_probs=15.2 Q ss_pred CHHHHCCCCEEHHHHHHHH Q ss_conf 0011124860378999888 Q gi|254780658|r 44 KGNRIGYGRGYYDRAIADI 62 (113) Q Consensus 44 ~G~RLG~GgGyYDR~l~~~ 62 (113) .--||||||||+-.|+-.+ T Consensus 376 ~~~~~G~GgG~~~~T~~~~ 394 (465) T TIGR01899 376 EVLRLGWGGGLRSKTVGIL 394 (465) T ss_pred EEEEEECCCCHHHCCCEEE T ss_conf 5788405611344101002 No 8 >PRK12420 histidyl-tRNA synthetase; Provisional Probab=55.36 E-value=8.6 Score=19.86 Aligned_cols=31 Identities=29% Similarity=0.447 Sum_probs=23.1 Q ss_pred CCEEHHHHHHHHHHCCCCEEEEEEECHHHEE Q ss_conf 8603789998887449983799983223242 Q gi|254780658|r 51 GRGYYDRAIADIRLEGKNPYLLGIAFDMQET 81 (113) Q Consensus 51 GgGyYDR~l~~~~~~~~~~~~igla~~~q~~ 81 (113) |||=||..++.+.+.....+.||.|+-...+ T Consensus 295 gGGRYD~Li~~fgg~~~~~PavGfa~Gierl 325 (421) T PRK12420 295 SGGRYDNIIGAFRGDNMNYPTVGISFGLDVI 325 (421) T ss_pred CCCCHHHHHHHHCCCCCCCCEEEEEEHHHHH T ss_conf 5665054898818998999747998519999 No 9 >PRK13176 consensus Probab=47.17 E-value=25 Score=17.44 Aligned_cols=54 Identities=19% Similarity=0.228 Sum_probs=36.9 Q ss_pred CCCCCEEEEHHHH-HCCCHHHHCCCCEEHHHHHHHHHHCCCCEEEEEEECHHHEECC Q ss_conf 1146677401422-1110011124860378999888744998379998322324247 Q gi|254780658|r 28 EMSPDLILMPLIA-FDLKGNRIGYGRGYYDRAIADIRLEGKNPYLLGIAFDMQETSC 83 (113) Q Consensus 28 ~~~~DlilVP~la-fD~~G~RLG~GgGyYDR~l~~~~~~~~~~~~igla~~~q~~~~ 83 (113) ....|.+|.|+|+ |+..-.+|=.=.|+.|..+..+...+ -+.+|+|.-.|+.-+ T Consensus 42 ~~~~~~lILPGVGsf~~~m~~L~~~~g~~~~i~~~~~~~~--~piLGIClGmQlL~~ 96 (216) T PRK13176 42 VARADRIVLPGVGAFADCRRGLDAVDGMVEALTEAVEKKA--RPFLGICVGMQLMAT 96 (216) T ss_pred HHCCCEEEECCCCCHHHHHHHHHHCCCHHHHHHHHHHHCC--CCEEEEEHHHEEHHH T ss_conf 8379979989987789999987762772899999999769--957998531212653 No 10 >PRK01175 phosphoribosylformylglycinamidine synthase subunit I; Provisional Probab=43.33 E-value=26 Score=17.33 Aligned_cols=60 Identities=23% Similarity=0.258 Sum_probs=39.5 Q ss_pred CCCCEEEEHHHHHCCCHHHHCCCCEEHHHHHHHHHH-----CCCCEEEEEEECHHHEECC---CCCCCCC Q ss_conf 146677401422111001112486037899988874-----4998379998322324247---8998447 Q gi|254780658|r 29 MSPDLILMPLIAFDLKGNRIGYGRGYYDRAIADIRL-----EGKNPYLLGIAFDMQETSC---IQAEATD 90 (113) Q Consensus 29 ~~~DlilVP~lafD~~G~RLG~GgGyYDR~l~~~~~-----~~~~~~~igla~~~q~~~~---ip~e~hD 90 (113) .+.|++++|+ +|+- |-+||=|+++.-|+.+.+.. ..+..+.+|+|--||+.-+ +|.+.+. T Consensus 47 ~~~~~lv~pG-GFSy-GD~l~aG~~~a~~~~~~~~~~i~~f~~~g~~vLGICNGfQiL~e~gLLPg~~~~ 114 (255) T PRK01175 47 SDYDCLIIPG-GFSA-GDYIRAGAIFAARLKAVLGKDIREFIDSGRPIIGICNGFQVLMELGLIPAERPE 114 (255) T ss_pred CCCCEEEECC-CCCC-CCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEEECCHHHHHHHCCCCCCCCCC T ss_conf 5554899737-7676-433575188999867999999999996799679980168999975877899988 No 11 >PRK00037 hisS histidyl-tRNA synthetase; Reviewed Probab=41.62 E-value=12 Score=19.14 Aligned_cols=30 Identities=23% Similarity=0.460 Sum_probs=21.8 Q ss_pred CCCEEHHHHHHHHHHCCCCEEEEEEECHHHEE Q ss_conf 48603789998887449983799983223242 Q gi|254780658|r 50 YGRGYYDRAIADIRLEGKNPYLLGIAFDMQET 81 (113) Q Consensus 50 ~GgGyYDR~l~~~~~~~~~~~~igla~~~q~~ 81 (113) -|||=||..++.+. ++..+.+|.|+....+ T Consensus 283 ~gGGRYD~L~~~fg--g~~~PAvGfa~gieRi 312 (417) T PRK00037 283 CGGGRYDGLVEQFG--GPPTPAVGFAIGIERL 312 (417) T ss_pred ECCCCHHHHHHHHC--CCCCCEEEEEECHHHH T ss_conf 25873157899848--9978747898639999 No 12 >TIGR02821 fghA_ester_D S-formylglutathione hydrolase; InterPro: IPR014186 This entry describes proteins from bacteria, yeast, and human, with a conserved critical role in formaldehyde detoxification as S-formylglutathione hydrolase (3.1.2.12 from EC). Eukaryotic members, particularly the human protein, are better known as esterase D (3.1.1.1 from EC), an enzyme with broad specificity, although S-formylglutathione hydrolase has now been demonstrated as well.. Probab=41.05 E-value=15 Score=18.65 Aligned_cols=58 Identities=22% Similarity=0.256 Sum_probs=37.0 Q ss_pred EEEEECCCCCCEECCCCCCCCCCCCCCC-CCCEEEEHHHHHC------------------------------------CC Q ss_conf 3465348885246488870889998611-4667740142211------------------------------------10 Q gi|254780658|r 2 IFRQYENPTDLVKSAVGTLSPPSYAKEM-SPDLILMPLIAFD------------------------------------LK 44 (113) Q Consensus 2 ~F~~~~~~~~l~~~~~gI~EP~~~~~~~-~~DlilVP~lafD------------------------------------~~ 44 (113) .|++-++.+.=..-+|++.-|+.-.... -|=|.+.-+|.+. .+ T Consensus 14 ~fYRH~S~~c~~~m~F~vf~PPqA~~g~kVPvL~yLsGLTCT~en~m~Kag~~R~AAe~G~aLVaPDTSPR~~gi~gE~D 93 (279) T TIGR02821 14 GFYRHKSETCGVPMTFAVFLPPQAAAGPKVPVLYYLSGLTCTEENFMIKAGAQRFAAEYGLALVAPDTSPRGTGIAGEDD 93 (279) T ss_pred EEEEECCEEECCCCEECCCCCHHHHCCCCCCEEEECCCCCHHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCC T ss_conf 56860355636862231468857733899723310000131157899877789999768907972678987777767865 Q ss_pred HHHHCCCCEEH-HHHH Q ss_conf 01112486037-8999 Q gi|254780658|r 45 GNRIGYGRGYY-DRAI 59 (113) Q Consensus 45 G~RLG~GgGyY-DR~l 59 (113) -+-+|.|.||| |-+= T Consensus 94 ~wDfG~GAGFYvDAt~ 109 (279) T TIGR02821 94 EWDFGKGAGFYVDATE 109 (279) T ss_pred CCCCCCCCCCHHHHHH T ss_conf 4214798752110220 No 13 >PRK12295 hisZ ATP phosphoribosyltransferase regulatory subunit; Provisional Probab=39.27 E-value=23 Score=17.55 Aligned_cols=36 Identities=17% Similarity=0.233 Sum_probs=24.3 Q ss_pred HHCCCCEEHHHHHHHHHHCCCCEEEEEEECHHHEECC Q ss_conf 1124860378999888744998379998322324247 Q gi|254780658|r 47 RIGYGRGYYDRAIADIRLEGKNPYLLGIAFDMQETSC 83 (113) Q Consensus 47 RLG~GgGyYDR~l~~~~~~~~~~~~igla~~~q~~~~ 83 (113) +-=-|||=||..++.+.. +...+.+|.|+....+-. T Consensus 334 ~~i~gGGRYD~L~~~fG~-~~~~PAvGfaigidRl~~ 369 (373) T PRK12295 334 PPLAGGGRYDGLLTRLGA-GEPIPAVGFSIWLDRLAA 369 (373) T ss_pred CEEEECCCHHHHHHHHCC-CCCCCEEEEEEEHHHHHH T ss_conf 706511653179998589-999863699970999853 No 14 >PRK12292 hisZ ATP phosphoribosyltransferase regulatory subunit; Provisional Probab=39.02 E-value=22 Score=17.73 Aligned_cols=30 Identities=20% Similarity=0.321 Sum_probs=22.2 Q ss_pred CCCEEHHHHHHHHHHCCCCEEEEEEECHHHEEC Q ss_conf 486037899988874499837999832232424 Q gi|254780658|r 50 YGRGYYDRAIADIRLEGKNPYLLGIAFDMQETS 82 (113) Q Consensus 50 ~GgGyYDR~l~~~~~~~~~~~~igla~~~q~~~ 82 (113) -|||=||..++.+ +...+.+|.|.....+- T Consensus 286 ~gGGRYD~L~~~f---G~~~PAvGfa~g~eRl~ 315 (388) T PRK12292 286 ARGGRYDDLGGRF---GRARPATGFSLDLDRLR 315 (388) T ss_pred CCCCCHHHHHHHH---CCCCCEEEEEECHHHHH T ss_conf 0575664799983---89997158996399999 No 15 >COG1332 CRISPR system related protein, RAMP superfamily [Defense mechanisms] Probab=38.19 E-value=5.1 Score=21.06 Aligned_cols=15 Identities=27% Similarity=0.758 Sum_probs=10.5 Q ss_pred HHCCCCEEHHHHHHH Q ss_conf 112486037899988 Q gi|254780658|r 47 RIGYGRGYYDRAIAD 61 (113) Q Consensus 47 RLG~GgGyYDR~l~~ 61 (113) +||+||||+-.|+-. T Consensus 304 ~lG~GgG~~skTi~~ 318 (369) T COG1332 304 HLGFGGGWRSKTIGI 318 (369) T ss_pred EEECCCCEEEEEEEE T ss_conf 984256525305665 No 16 >CHL00201 syh histidine-tRNA synthetase; Provisional Probab=38.10 E-value=24 Score=17.48 Aligned_cols=29 Identities=24% Similarity=0.508 Sum_probs=21.3 Q ss_pred CCCEEHHHHHHHHHHCCCCEEEEEEECHHHE Q ss_conf 4860378999888744998379998322324 Q gi|254780658|r 50 YGRGYYDRAIADIRLEGKNPYLLGIAFDMQE 80 (113) Q Consensus 50 ~GgGyYDR~l~~~~~~~~~~~~igla~~~q~ 80 (113) =|||=||..++.+. ++..+.+|.|+-... T Consensus 282 ~gGGRYD~Lv~~fg--g~~~PavGfaiGieR 310 (424) T CHL00201 282 CGGGRYDSLVEQLG--GPKTPAVGWAIGIER 310 (424) T ss_pred CCCCCCHHHHHHCC--CCCCCEEEEEECHHH T ss_conf 37877077899729--999985778843999 No 17 >cd00773 HisRS-like_core Class II Histidinyl-tRNA synthetase (HisRS)-like catalytic core domain. HisRS is a homodimer. It is responsible for the attachment of histidine to the 3' OH group of ribose of the appropriate tRNA. This domain is primarily responsible for ATP-dependent formation of the enzyme bound aminoacyl-adenylate. Class II assignment is based upon its structure and the presence of three characteristic sequence motifs. This domain is also found at the C-terminus of eukaryotic GCN2 protein kinase and at the N-terminus of the ATP phosphoribosyltransferase accessory subunit, HisZ. HisZ along with HisG catalyze the first reaction in histidine biosynthesis. HisZ is found only in a subset of bacteria and differs from HisRS in lacking a C-terminal anti-codon binding domain. Probab=34.06 E-value=31 Score=16.90 Aligned_cols=49 Identities=22% Similarity=0.257 Sum_probs=29.0 Q ss_pred CCCCCEEEEHHHH---------HCCC-H-HHHCCCCEEHHHHHHHHHHCCCCEEEEEEECHH Q ss_conf 1146677401422---------1110-0-111248603789998887449983799983223 Q gi|254780658|r 28 EMSPDLILMPLIA---------FDLK-G-NRIGYGRGYYDRAIADIRLEGKNPYLLGIAFDM 78 (113) Q Consensus 28 ~~~~DlilVP~la---------fD~~-G-~RLG~GgGyYDR~l~~~~~~~~~~~~igla~~~ 78 (113) .+..|+-+++.+. |..+ + .+-=-|||=||..++.+.+ ...+.+|.|+.. T Consensus 194 ~i~~D~~~~r~~~YYtG~vFe~~~~~~~~~~~i~~GGRYD~L~~~~~~--~~~pAvGfa~~l 253 (261) T cd00773 194 KYSIDLSLVRGLDYYTGIVFEAVADGLGAQGSIAGGGRYDGLLEEFGG--EDVPAVGFAIGL 253 (261) T ss_pred EEEECCCCCCCCCCCCCEEEEEEECCCCCCCCEEECCCHHHHHHHHCC--CCCCEEEEEEEH T ss_conf 399875126687576871999997898865726754430588997089--999758999609 No 18 >cd01748 GATase1_IGP_Synthase Type 1 glutamine amidotransferase (GATase1) domain found in imidazole glycerol phosphate synthase (IGPS). Type 1 glutamine amidotransferase (GATase1) domain found in imidazole glycerol phosphate synthase (IGPS). IGPS incorporates ammonia derived from glutamine into N1-[(5'-phosphoribulosyl)-formimino]-5-aminoimidazole-4-carboxamide ribonucleotide (PRFAR) to form 5'-(5-aminoimidazole-4-carboxamide) ribonucleotide (AICAR) and imidazole glycerol phosphate (IGP). The glutamine amidotransferase domain generates the ammonia nucleophile which is channeled from the glutaminase active site to the PRFAR active site. IGPS belong to the triad family of amidotransferases having a conserved Cys-His-Glu catalytic triad in the glutaminase active site. Probab=29.60 E-value=54 Score=15.63 Aligned_cols=51 Identities=24% Similarity=0.294 Sum_probs=33.4 Q ss_pred CCCCCEEEEHHHH-HCCCHHHHCCCCEEHHHHHHHHHHCCCCEEEEEEECHHHEEC Q ss_conf 1146677401422-111001112486037899988874499837999832232424 Q gi|254780658|r 28 EMSPDLILMPLIA-FDLKGNRIGYGRGYYDRAIADIRLEGKNPYLLGIAFDMQETS 82 (113) Q Consensus 28 ~~~~DlilVP~la-fD~~G~RLG~GgGyYDR~l~~~~~~~~~~~~igla~~~q~~~ 82 (113) ....|.+++|+|+ |+.....| .-.|++|-....++. . .+.+|+|+-.|+.- T Consensus 34 l~~~d~iIlPGvG~f~~~~~~l-~~~~~~~~i~~~~~~--~-~PiLGIClG~QlL~ 85 (198) T cd01748 34 ILSADKLILPGVGAFGDAMANL-RERGLIEALKEAIAS--G-KPFLGICLGMQLLF 85 (198) T ss_pred HHHCCEEEECCCCCCHHHHHHH-HHCCHHHHHHHHHHC--C-CCEEEEEEEEEEEE T ss_conf 8308989987887403444441-003819999999876--9-96899981145466 No 19 >pfam11959 DUF3473 Domain of unknown function (DUF3473). This presumed domain is functionally uncharacterized. This domain is found in bacteria and archaea. This domain is about 130 amino acids in length. This domain is found associated with pfam01522. This domain has two completely conserved residues (P and H) that may be functionally important. Probab=29.21 E-value=44 Score=16.08 Aligned_cols=51 Identities=20% Similarity=0.312 Sum_probs=34.6 Q ss_pred CCEECCCCCCCCCCCCCCCCC--CEEEEHHHHHCCCHHHHCCCCEEHHHHHHH Q ss_conf 524648887088999861146--677401422111001112486037899988 Q gi|254780658|r 11 DLVKSAVGTLSPPSYAKEMSP--DLILMPLIAFDLKGNRIGYGRGYYDRAIAD 61 (113) Q Consensus 11 ~l~~~~~gI~EP~~~~~~~~~--DlilVP~lafD~~G~RLG~GgGyYDR~l~~ 61 (113) +...+.||+++-+..--.... .++=.|.....--|.+|-.|||.|=|+|.- T Consensus 5 P~~hd~yG~p~~p~~P~~~~~~~~i~EfP~~t~~~~g~~lP~~GGgyfRl~P~ 57 (133) T pfam11959 5 PVRHDHYGMPDAPRFPFRPETGGGLVEFPVTTTPLGGRNLPAGGGGYFRLLPY 57 (133) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCEEEECCEEHHCCCCCEEECCCEEHHHCCH T ss_conf 22458789999988884407999658944440000680031407511403849 No 20 >KOG0967 consensus Probab=28.58 E-value=21 Score=17.76 Aligned_cols=72 Identities=17% Similarity=0.284 Sum_probs=39.6 Q ss_pred CEEEEHHHHHCCCHHHHCCCCEE----HHHHHHHHHHCCCCEEEEEEECHHHEECCCCCCCCCCCCCEEECCCCEEEECH Q ss_conf 67740142211100111248603----78999888744998379998322324247899844751158998985597003 Q gi|254780658|r 32 DLILMPLIAFDLKGNRIGYGRGY----YDRAIADIRLEGKNPYLLGIAFDMQETSCIQAEATDIRLHAILTESRFSQFST 107 (113) Q Consensus 32 DlilVP~lafD~~G~RLG~GgGy----YDR~l~~~~~~~~~~~~igla~~~q~~~~ip~e~hD~~ld~iiTe~~~~~f~~ 107 (113) .+-+||.-|+-.+|.|.|.+||| ||--=..+.. +-.||-.|+-+.+.++-. +.....+.++...|+|.. T Consensus 560 slDLv~iga~~G~GrrtG~yg~fLlacyn~dteefqs----iCKigtGFsD~~l~e~~~---~l~~~~~~~~~~~y~~d~ 632 (714) T KOG0967 560 SLDLVVIGAYYGRGRRTGWYGGFLLACYNPDTEEFQS----ICKIGTGFSDEFLQELHE---SLSSTVIDSPKPYYRFDE 632 (714) T ss_pred CEEEEEEEEEECCCCCCCCCCCEEEEEECCCHHHHHH----HHHHCCCCCHHHHHHHHH---HHHHCCCCCCHHHCCCCC T ss_conf 2223553000035522443442567764586288888----876167777789999878---753213568576502276 Q ss_pred HHC Q ss_conf 330 Q gi|254780658|r 108 EHI 110 (113) Q Consensus 108 ~~~ 110 (113) +.. T Consensus 633 s~k 635 (714) T KOG0967 633 SLK 635 (714) T ss_pred CCC T ss_conf 679 No 21 >pfam01686 Adeno_Penton_B Adenovirus penton base protein. This family consists of various adenovirus penton base proteins, from both the Mastadenoviradae having mammalian hosts and the Aviadenoviradae having avian hosts. The penton base is a major structural protein forming part of the penton which consists of a base and a fibre, the pentons hold a morphologically prominent position at the vertex capsomer in the adenovirus particle. In mammalian adenovirus there is only one tail on each base where as in avian adenovirus there are two. Probab=27.02 E-value=26 Score=17.31 Aligned_cols=22 Identities=36% Similarity=0.688 Sum_probs=17.2 Q ss_pred CCCCCCEEEEHHHHHCCCHHHH Q ss_conf 6114667740142211100111 Q gi|254780658|r 27 KEMSPDLILMPLIAFDLKGNRI 48 (113) Q Consensus 27 ~~~~~DlilVP~lafD~~G~RL 48 (113) +.-.||++|.|+.|+|=.-.|| T Consensus 196 kafHpDIvLlPgCaVDFT~SRL 217 (454) T pfam01686 196 KAFHPDIVLLPGCAVDFTYSRL 217 (454) T ss_pred CCCCCCEEECCCCEEECCHHHH T ss_conf 3447877962884020517665 No 22 >PRK13175 consensus Probab=24.33 E-value=58 Score=15.46 Aligned_cols=51 Identities=22% Similarity=0.324 Sum_probs=34.9 Q ss_pred CCCCCEEEEHHHH-HCCCHHHHCCCCEEHHHHHHHHHHCCCCEEEEEEECHHHEEC Q ss_conf 1146677401422-111001112486037899988874499837999832232424 Q gi|254780658|r 28 EMSPDLILMPLIA-FDLKGNRIGYGRGYYDRAIADIRLEGKNPYLLGIAFDMQETS 82 (113) Q Consensus 28 ~~~~DlilVP~la-fD~~G~RLG~GgGyYDR~l~~~~~~~~~~~~igla~~~q~~~ 82 (113) ....|.+++|+++ |+..-.+| .-.|++|-....++. ..+.+|+|+-.|+.- T Consensus 37 i~~~d~lILPGvGsf~~~~~~l-~~~~l~~~i~~~~~~---g~PiLGIClGmQlL~ 88 (206) T PRK13175 37 LLAADALILPGVGAFDPAMQNL-RSRGLIPPIKDAIAS---GKPFLGICLGLQLLF 88 (206) T ss_pred HHCCCEEEECCCCCCHHHHHHH-HHCCCHHHHHHHHHC---CCCEEEEEHHHHHHH T ss_conf 8328928968998526788755-531871689999856---997799867866761 No 23 >pfam10001 DUF2242 Uncharacterized protein conserved in bacteria (DUF2242). This domain, found in various hypothetical bacterial proteins, has no known function. Probab=23.68 E-value=37 Score=16.51 Aligned_cols=17 Identities=18% Similarity=0.184 Sum_probs=8.6 Q ss_pred CCCCCCCCCCCCEEECC Q ss_conf 78998447511589989 Q gi|254780658|r 83 CIQAEATDIRLHAILTE 99 (113) Q Consensus 83 ~ip~e~hD~~ld~iiTe 99 (113) .+|....|-.|=-|-.| T Consensus 86 SlP~~ss~DslVKV~sE 102 (121) T pfam10001 86 SLPFSSSDDSLVKVGSE 102 (121) T ss_pred EECCCCCCCCEEEEEEE T ss_conf 60146788637887102 No 24 >PRK13178 consensus Probab=23.56 E-value=70 Score=15.02 Aligned_cols=52 Identities=17% Similarity=0.078 Sum_probs=31.8 Q ss_pred CCCCEEEEHHHHHCCCHHHHCCCCEEHHHHHHHHHHCCCCEEEEEEECHHHEECC Q ss_conf 1466774014221110011124860378999888744998379998322324247 Q gi|254780658|r 29 MSPDLILMPLIAFDLKGNRIGYGRGYYDRAIADIRLEGKNPYLLGIAFDMQETSC 83 (113) Q Consensus 29 ~~~DlilVP~lafD~~G~RLG~GgGyYDR~l~~~~~~~~~~~~igla~~~q~~~~ 83 (113) .+.|.+|+|+|+==.++.+--.=.|+.+...... ..+ -+.+|+|.-.|+.-+ T Consensus 36 ~~~d~lIlPGVGsf~~~m~~L~~~~l~~~i~~~~-~~g--kpiLGIClGmQlLf~ 87 (213) T PRK13178 36 QGAAGIIFPGVGAAGQAMNELRTTGLDEVLREQV-QAG--KPLLGICVGCQIMLD 87 (213) T ss_pred HHCCEEEECCCCCHHHHHHHHHHCCHHHHHHHHH-HCC--CCEEEEEHHHEEEEC T ss_conf 4399899899885699999875006289999874-369--957998424555541 No 25 >PRK13177 consensus Probab=21.52 E-value=77 Score=14.79 Aligned_cols=55 Identities=18% Similarity=0.164 Sum_probs=35.7 Q ss_pred CCCCCEEEEHHHH-HCCCHHHHCCCCEEHHHHHHHHHHCCCCEEEEEEECHHHEECCC Q ss_conf 1146677401422-11100111248603789998887449983799983223242478 Q gi|254780658|r 28 EMSPDLILMPLIA-FDLKGNRIGYGRGYYDRAIADIRLEGKNPYLLGIAFDMQETSCI 84 (113) Q Consensus 28 ~~~~DlilVP~la-fD~~G~RLG~GgGyYDR~l~~~~~~~~~~~~igla~~~q~~~~i 84 (113) ...-|.+++|+|+ |+..-.+|=-=.|++|.........+ .+.+|+|.-.|+.-+- T Consensus 39 i~~~d~lIlPGVGsF~~~m~~Lk~~~~~~~~i~~~~~~~~--~piLGICLGmQlLf~~ 94 (207) T PRK13177 39 VRRADRVVLPGVGAFAACMGGLRAIPGMVEAMEERVLEGG--APFLGICVGMQLLADR 94 (207) T ss_pred HHCCCEEEECCCCCHHHHHHHHHHCCCHHHHHHHHHHHCC--CCEEEECHHHHHHHHC T ss_conf 8359979989978789999776410444899999999769--9569981878857657 Done!