Query gi|254781074|ref|YP_003065487.1| uracil-DNA glycosylase [Candidatus Liberibacter asiaticus str. psy62] Match_columns 227 No_of_seqs 140 out of 1475 Neff 5.8 Searched_HMMs 39220 Date Mon May 30 04:06:05 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254781074.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 TIGR00628 ung uracil-DNA glyco 100.0 0 0 658.1 17.1 211 7-217 1-225 (225) 2 PRK05254 uracil-DNA glycosylas 100.0 0 0 630.7 23.7 222 3-224 2-223 (223) 3 COG0692 Ung Uracil DNA glycosy 100.0 0 0 632.7 21.2 221 5-225 2-223 (223) 4 KOG2994 consensus 100.0 0 0 593.6 18.7 223 1-224 73-296 (297) 5 pfam03167 UDG Uracil DNA glyco 99.4 1.7E-12 4.3E-17 98.1 9.4 132 57-213 6-147 (147) 6 pfam03585 Herpes_ICP4_C Herpes 93.5 0.64 1.6E-05 26.5 8.0 184 2-215 35-269 (425) 7 COG1573 Uracil-DNA glycosylase 92.9 0.24 6.2E-06 29.1 5.0 140 56-213 38-200 (202) 8 pfam08749 DUF1790 Family of un 81.9 1.5 3.8E-05 24.2 3.1 92 76-170 44-135 (145) 9 cd03131 GATase1_HTS Type 1 glu 71.1 3 7.6E-05 22.4 2.1 101 45-175 15-115 (175) 10 KOG0121 consensus 70.1 5.2 0.00013 20.9 3.2 53 39-97 45-100 (153) 11 COG5465 Uncharacterized conser 68.5 7.5 0.00019 19.9 3.7 85 84-173 67-154 (166) 12 PRK05368 homoserine O-succinyl 62.5 5.7 0.00015 20.6 2.2 97 46-173 53-150 (302) 13 pfam04204 HTS Homoserine O-suc 59.4 6.9 0.00017 20.1 2.2 103 40-173 42-149 (298) 14 cd01840 SGNH_hydrolase_yrhL_li 55.7 11 0.00027 18.9 2.6 59 120-195 81-139 (150) 15 TIGR02061 aprA adenylylsulfate 48.3 16 0.00041 17.9 2.6 76 67-154 299-378 (651) 16 TIGR01001 metA homoserine O-su 46.1 7.5 0.00019 19.9 0.6 122 44-201 51-185 (305) 17 TIGR01977 am_tr_V_EF2568 cyste 46.0 17 0.00042 17.8 2.3 77 118-197 258-351 (384) 18 COG1922 WecG Teichoic acid bio 43.8 24 0.0006 16.8 2.9 41 134-174 137-177 (253) 19 pfam02514 CobN-Mg_chel CobN/Ma 41.2 30 0.00077 16.1 5.6 78 48-130 232-311 (1064) 20 TIGR02966 phoR_proteo phosphat 39.2 32 0.00082 16.0 3.1 40 44-91 241-282 (339) 21 COG3663 Mug G:T/U mismatch-spe 38.5 25 0.00063 16.7 2.3 143 57-211 8-160 (169) 22 pfam08558 TRF Telomere repeat 35.8 14 0.00035 18.2 0.6 30 139-171 205-234 (235) 23 pfam06972 DUF1296 Protein of u 34.7 19 0.00048 17.4 1.2 29 43-71 20-50 (60) 24 PRK03692 putative UDP-N-acetyl 33.8 39 0.001 15.4 3.0 40 134-173 136-175 (246) 25 PRK12493 magnesium chelatase; 29.1 47 0.0012 14.9 5.6 74 39-117 411-495 (1318) 26 PRK13405 bchH magnesium chelat 28.4 49 0.0012 14.9 5.5 77 39-120 408-495 (1207) 27 PRK02102 ornithine carbamoyltr 26.5 52 0.0013 14.7 4.3 65 46-113 142-218 (331) 28 pfam03040 CemA CemA family. Me 26.5 27 0.00068 16.5 0.8 33 135-167 163-196 (230) 29 CHL00181 cbbX CbbX; Provisiona 26.3 53 0.0014 14.6 5.5 152 14-177 5-177 (287) 30 KOG3325 consensus 26.3 23 0.00059 16.8 0.5 63 57-126 70-132 (183) 31 pfam00121 TIM Triosephosphate 25.9 30 0.00077 16.1 1.0 67 9-77 6-73 (243) 32 TIGR02884 spore_pdaA delta-lac 25.3 55 0.0014 14.5 2.4 58 127-192 40-102 (225) 33 PRK01816 hypothetical protein; 24.8 56 0.0014 14.5 2.2 23 81-103 85-107 (145) 34 cd03186 GST_C_SspA GST_N famil 24.7 25 0.00065 16.6 0.4 49 167-226 30-78 (107) 35 PRK00042 tpiA triosephosphate 23.9 35 0.00088 15.8 1.0 70 6-77 6-76 (251) 36 CHL00043 cemA envelope membran 23.4 30 0.00076 16.2 0.6 33 135-167 166-199 (233) 37 pfam00551 Formyl_trans_N Formy 22.9 61 0.0016 14.2 4.2 54 144-199 65-118 (181) 38 KOG2165 consensus 22.2 40 0.001 15.4 1.0 47 87-133 602-663 (765) 39 KOG4706 consensus 22.0 58 0.0015 14.4 1.8 35 74-108 54-88 (196) 40 COG3219 Uncharacterized protei 21.9 64 0.0016 14.1 2.1 34 89-124 193-232 (237) 41 cd00311 TIM Triosephosphate is 21.2 43 0.0011 15.2 1.1 62 9-72 6-67 (242) No 1 >TIGR00628 ung uracil-DNA glycosylase; InterPro: IPR002043 Uracil-DNA glycosylase 3.2.2 from EC (UNG) is a DNA repair enzyme that excises uracil residues from DNA by cleaving the N-glycosylic bond. Uracil in DNA can arise as a result of mis-incorporation of dUMP residues by DNA polymerase or deamination of cytosine. The sequence of uracil-DNA glycosylase is extremely well conserved in bacteria and eukaryotes as well as in herpes viruses. More distantly related uracil-DNA glycosylases are also found in poxviruses . In eukaryotic cells, UNG activity is found in both the nucleus and the mitochondria. Human UNG1 protein is transported to both the mitochondria and the nucleus . The N-terminal 77 amino acids of UNG1 seem to be required for mitochondrial localisation , but the presence of a mitochondrial transit peptide has not been directly demonstrated. The most N-terminal conserved region contains an aspartic acid residue which has been proposed, based on X-ray structures , to act as a general base in the catalytic mechanism. ; GO: 0004844 uracil DNA N-glycosylase activity, 0006284 base-excision repair. Probab=100.00 E-value=0 Score=658.06 Aligned_cols=211 Identities=55% Similarity=0.978 Sum_probs=206.3 Q ss_pred CHHHHHHHHH-HHCCHHHHHHHHHHHHHHH--CCCEECCCHHHHHHHHHCCCHHHCEEEEEECCCCCCCCCCCCCEEECC Q ss_conf 7338999888-7265899999999999997--798085887699999842998617799981188877520007100034 Q gi|254781074|r 7 HESWKSLLEN-HFQSEHMRNLKEFLLSEKR--KGKRIFPRGSHYFRAFDITPFNKVKVVILGQDPYHNYGQAHGLCFSVP 83 (227) Q Consensus 7 ~~sW~~~L~~-e~~~~~~~~i~~~l~~e~~--~~~~i~P~~~~IF~af~~~p~~~vKVVIlGQDPY~~~g~A~GLaFsv~ 83 (227) +++|+++|+. |+.++|+.+|..++.+++. +..+|||++++||+|+++|+|+|||||||||||||++|||+||||||. T Consensus 1 ~~~W~~~L~~Se~~~~y~~~ll~~~~~~~~~~~~~~vyPp~~~vF~w~~~~~p~dVKVVIlGQDPYH~pGqA~GLAFSV~ 80 (225) T TIGR00628 1 SPSWRDFLQLSEFKKPYFQELLAFVKRERATPKQETVYPPKEDVFAWTRLCPPEDVKVVILGQDPYHGPGQAHGLAFSVK 80 (225) T ss_pred CHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCEECCCHHHHHHHHHCCCCCCEEEEEECCCCCCCCCCCCCCCCCCC T ss_conf 92227764288110089999999999997278983314965678889864882231578840358454677344432558 Q ss_pred CCCCCHHHHHHHHHHHHHHCC--CCCCCCCHHHHHHCCCEEEECCHHCCCCCCCHHHHHCCHHHHHHHHHHHHHCCCCC- Q ss_conf 798630469999999997418--88877401243220896886010112368604454325999999999999617898- Q gi|254781074|r 84 LGIRIPPSLVNVYKELQEDVN--FIPPTHGFLEHWGHEGVLLLNAVLTVEEGRAASHRGRGWEQFTDSVIDLISNNHQN- 160 (227) Q Consensus 84 ~~~~~p~SLrNI~Kel~~d~~--~~~~~~g~L~~Wa~QGVLLLNt~LTv~~g~~~SH~~~gW~~ft~~ii~~l~~~~~~- 160 (227) +++++|||||||||||++||+ +..|+||||++||+||||||||+|||++|+|+||+++|||.||++||+.|+++.++ T Consensus 81 ~~~~~PPSL~NIf~EL~~~~p~~~~~p~hG~L~~WA~QGVLLLNt~LTV~~g~p~SH~~~GWe~Ft~~vi~~Ls~~~~~l 160 (225) T TIGR00628 81 RGVPIPPSLKNIFKELEADYPDFFSPPKHGCLESWARQGVLLLNTVLTVRRGQPGSHSGLGWERFTDAVISRLSEKRDGL 160 (225) T ss_pred CCCCCCCCHHHHHHHHHHCCHHHCCCCCCCCCHHHHHHCHHHHCCCCCCCCCCCCCHHHCCHHHHHHHHHHHHHHHCCCC T ss_conf 64677824678999997210211268888985557851413542201125678622122147899999999999727998 Q ss_pred EEEEEECCHHHHHHHCCCCCCCEEEEECCCCHHHCCCC-------CCC-CCHHHHHHHHHHHCCC Q ss_conf 79999753233344205766607998478882225788-------627-8708999999998788 Q gi|254781074|r 161 IVFMLWGASAQKKQDVLDHKRHLVLKAAHPSPLSASHG-------FFG-CRHFSKANRYLQEHGK 217 (227) Q Consensus 161 vVfiLwG~~A~~~~~~i~~~~~~il~~~HPSPls~~~g-------F~~-~~~F~~~N~~L~~~~~ 217 (227) +||||||++||++..+||.++|+||+++||||||+++| |+| |+||++||+||+++|+ T Consensus 161 lVFmLWG~~A~~~~~~i~~~~HlvL~~~HPSPLS~~~G~~s~~~~F~GPC~HF~~AN~yL~~~g~ 225 (225) T TIGR00628 161 LVFMLWGAHAQKKKSLIDPKKHLVLKSPHPSPLSARRGRKSAQPKFFGPCNHFSKANEYLEKHGK 225 (225) T ss_pred EEEEECCHHHHHHHCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCC T ss_conf 89981406777763057887607874088870015887887747688987558999999987069 No 2 >PRK05254 uracil-DNA glycosylase; Provisional Probab=100.00 E-value=0 Score=630.70 Aligned_cols=222 Identities=55% Similarity=1.044 Sum_probs=218.3 Q ss_pred CCCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCEECCCHHHHHHHHHCCCHHHCEEEEEECCCCCCCCCCCCCEEEC Q ss_conf 68687338999888726589999999999999779808588769999984299861779998118887752000710003 Q gi|254781074|r 3 GVKIHESWKSLLENHFQSEHMRNLKEFLLSEKRKGKRIFPRGSHYFRAFDITPFNKVKVVILGQDPYHNYGQAHGLCFSV 82 (227) Q Consensus 3 ~l~i~~sW~~~L~~e~~~~~~~~i~~~l~~e~~~~~~i~P~~~~IF~af~~~p~~~vKVVIlGQDPY~~~g~A~GLaFsv 82 (227) +..|+++|+++|++|+.++||.+|.++|.+|+.++++|||+.++|||||++||+++||||||||||||++|||+|||||| T Consensus 2 ~~~i~~~W~~~l~~e~~k~yf~~l~~~l~~e~~~~~~i~P~~~~iF~af~~~~~~~vKVVIlGQDPYh~~gqA~GLaFSv 81 (223) T PRK05254 2 KMALEPSWKEVLKPEFKKPYFQELLEFLRAERAAGKTIYPPGEDIFRAFNLTPFDDVKVVILGQDPYHGPGQAHGLSFSV 81 (223) T ss_pred CCCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCEEECCHHHHHHHHHCCCHHHCEEEEEECCCCCCCCCCCCEEEEC T ss_conf 88518789999999967699999999999999769977588589999984389878379998478999976558846567 Q ss_pred CCCCCCHHHHHHHHHHHHHHCCCCCCCCCHHHHHHCCCEEEECCHHCCCCCCCHHHHHCCHHHHHHHHHHHHHCCCCCEE Q ss_conf 47986304699999999974188887740124322089688601011236860445432599999999999961789879 Q gi|254781074|r 83 PLGIRIPPSLVNVYKELQEDVNFIPPTHGFLEHWGHEGVLLLNAVLTVEEGRAASHRGRGWEQFTDSVIDLISNNHQNIV 162 (227) Q Consensus 83 ~~~~~~p~SLrNI~Kel~~d~~~~~~~~g~L~~Wa~QGVLLLNt~LTv~~g~~~SH~~~gW~~ft~~ii~~l~~~~~~vV 162 (227) ++++++|||||||||||++|+++..|.+|||+.||+||||||||+|||++|+|+||+++||+.||+.+|++|+++++++| T Consensus 82 ~~~~~~PpSL~NI~kEl~~d~~~~~~~~gdL~~Wa~QGVLLLNt~LTv~~g~~~SH~~~GWe~ft~~vi~~l~~~~~~~V 161 (223) T PRK05254 82 PPGVPLPPSLRNIFKELEDDLGFPIPNHGDLTSWAEQGVLLLNTVLTVEAGQANSHAGKGWETFTDAVIKALNERREPVV 161 (223) T ss_pred CCCCCCCHHHHHHHHHHHHHHCCCCCCCCCHHHHHHCCEEEEECCEEEECCCCCCCCCCCHHHHHHHHHHHHHCCCCCEE T ss_conf 99999983599999999986089888888689998579788835324557999986689979999999999972789809 Q ss_pred EEEECCHHHHHHHCCCCCCCEEEEECCCCHHHCCCCCCCCCHHHHHHHHHHHCCCCCCCCCC Q ss_conf 99975323334420576660799847888222578862787089999999987889746508 Q gi|254781074|r 163 FMLWGASAQKKQDVLDHKRHLVLKAAHPSPLSASHGFFGCRHFSKANRYLQEHGKTTVNWQI 224 (227) Q Consensus 163 fiLwG~~A~~~~~~i~~~~~~il~~~HPSPls~~~gF~~~~~F~~~N~~L~~~~~~~IdW~l 224 (227) |||||++||+++.+|+.++|+||+++|||||||+||||||+||++||++|+++|++||||+| T Consensus 162 f~LWG~~Aq~~~~~i~~~~h~il~s~HPSPlSa~rgF~g~~~F~k~N~~L~~~~~~pIdW~l 223 (223) T PRK05254 162 FILWGSHAQKKKALIDTSKHLILESPHPSPLSAHRGFFGSKHFSKANEYLKQHGKTPIDWQL 223 (223) T ss_pred EEECCHHHHHHHHCCCCCCCEEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCC T ss_conf 99816689998742698787799617999523468998887799999999987999835709 No 3 >COG0692 Ung Uracil DNA glycosylase [DNA replication, recombination, and repair] Probab=100.00 E-value=0 Score=632.66 Aligned_cols=221 Identities=56% Similarity=1.048 Sum_probs=217.4 Q ss_pred CCCHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCEECCCHHHHHHHHHCCCHHHCEEEEEECCCCCCCCCCCCCEEECCC Q ss_conf 68733899988872658999999999999977980858876999998429986177999811888775200071000347 Q gi|254781074|r 5 KIHESWKSLLENHFQSEHMRNLKEFLLSEKRKGKRIFPRGSHYFRAFDITPFNKVKVVILGQDPYHNYGQAHGLCFSVPL 84 (227) Q Consensus 5 ~i~~sW~~~L~~e~~~~~~~~i~~~l~~e~~~~~~i~P~~~~IF~af~~~p~~~vKVVIlGQDPY~~~g~A~GLaFsv~~ 84 (227) .++.+|+++|++|+.++||.+|.+||.+|+..|.+|||+.++||+||++||+++|||||+||||||++|||+||||||+. T Consensus 2 ~~~~~W~~~l~~e~~k~yf~~l~~fl~~e~~~g~~i~P~~~~iF~af~~tp~~~vKVVIlGQDPYh~p~qAhGLsFSV~~ 81 (223) T COG0692 2 KLELSWKEFLKPEFKKPYFQELLEFLKQEYASGKTIYPPKEDIFNAFRLTPFDDVKVVILGQDPYHGPGQAHGLSFSVPP 81 (223) T ss_pred CCCHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCEECCCHHHHHHHHHCCCHHHEEEEEEECCCCCCCCCCCEEEEECCC T ss_conf 73467999987763038999999999999974984579889875787368824538999826999998841126662699 Q ss_pred CCCCHHHHHHHHHHHHHHCC-CCCCCCCHHHHHHCCCEEEECCHHCCCCCCCHHHHHCCHHHHHHHHHHHHHCCCCCEEE Q ss_conf 98630469999999997418-88877401243220896886010112368604454325999999999999617898799 Q gi|254781074|r 85 GIRIPPSLVNVYKELQEDVN-FIPPTHGFLEHWGHEGVLLLNAVLTVEEGRAASHRGRGWEQFTDSVIDLISNNHQNIVF 163 (227) Q Consensus 85 ~~~~p~SLrNI~Kel~~d~~-~~~~~~g~L~~Wa~QGVLLLNt~LTv~~g~~~SH~~~gW~~ft~~ii~~l~~~~~~vVf 163 (227) ++++||||+||||||+.|++ +..+++|||+.||+||||||||+|||++|+|+||+++|||.||++||+.|+++++++|| T Consensus 82 ~v~~PpSL~NI~kEL~~dl~~~~~~~~g~L~~WA~QGVLLLNtvLTV~~g~a~SH~~~GWE~FTd~vi~~l~~~~~~vVf 161 (223) T COG0692 82 GVKIPPSLKNIYKELEADLGGFPIPNHGDLTSWAEQGVLLLNTVLTVRAGQANSHAGKGWETFTDAVIKALNERREPVVF 161 (223) T ss_pred CCCCCCHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHCHHHEEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHCCCCEEE T ss_conf 99999029999999974407777898886277997364120123575488756444668389999999999845786399 Q ss_pred EEECCHHHHHHHCCCCCCCEEEEECCCCHHHCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCC Q ss_conf 99753233344205766607998478882225788627870899999999878897465088 Q gi|254781074|r 164 MLWGASAQKKQDVLDHKRHLVLKAAHPSPLSASHGFFGCRHFSKANRYLQEHGKTTVNWQIP 225 (227) Q Consensus 164 iLwG~~A~~~~~~i~~~~~~il~~~HPSPls~~~gF~~~~~F~~~N~~L~~~~~~~IdW~lp 225 (227) ||||++||++.++|+.++|+||+++|||||||+||||||+||++||+||+++|++||||++| T Consensus 162 iLWG~~Aq~k~~~I~~~~h~il~s~HPSPLSa~rGFfG~~hFsk~N~~L~~~g~~pIdW~~~ 223 (223) T COG0692 162 VLWGSHAQKKKKLIDNKKHLILEAPHPSPLSAHRGFFGCKHFSKANEWLEKHGKKPIDWSLP 223 (223) T ss_pred EEECHHHHHHHHCCCCCCEEEEECCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCCCC T ss_conf 99688899987146888708985799996422368668970679999999839998787789 No 4 >KOG2994 consensus Probab=100.00 E-value=0 Score=593.65 Aligned_cols=223 Identities=52% Similarity=0.914 Sum_probs=216.2 Q ss_pred CCCCCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCEECCCHHHHHHHHHCCCHHHCEEEEEECCCCCCCCCCCCCEE Q ss_conf 98686873389998887265899999999999997798085887699999842998617799981188877520007100 Q gi|254781074|r 1 MEGVKIHESWKSLLENHFQSEHMRNLKEFLLSEKRKGKRIFPRGSHYFRAFDITPFNKVKVVILGQDPYHNYGQAHGLCF 80 (227) Q Consensus 1 l~~l~i~~sW~~~L~~e~~~~~~~~i~~~l~~e~~~~~~i~P~~~~IF~af~~~p~~~vKVVIlGQDPY~~~g~A~GLaF 80 (227) ||+..+.++|...|..||.++||..|.+||.+|+. +.+||||+++||.|+++||+++||||||||||||++|||||||| T Consensus 73 Le~~~~~esW~~~L~~EF~KpYf~~lk~Fl~~E~~-~~~vfPP~~~Ift~~~~~P~~kvKVVIiGQDPYHn~nQAhGLsF 151 (297) T KOG2994 73 LEERTIGESWFKHLKGEFGKPYFIKLKGFLAEERN-SYTVFPPPHQIFTWTRLTPFDKVKVVIIGQDPYHNPNQAHGLSF 151 (297) T ss_pred HHHHHCCHHHHHHHHHHHCCHHHHHHHHHHHHHHC-CCCCCCCHHHHHHHHHCCCCCCEEEEEECCCCCCCCCCCCEEEE T ss_conf 66752276888862565467799999999998644-78526988885255505765422699962688889751200566 Q ss_pred ECCCCCCCHHHHHHHHHHHHHHCC-CCCCCCCHHHHHHCCCEEEECCHHCCCCCCCHHHHHCCHHHHHHHHHHHHHCCCC Q ss_conf 034798630469999999997418-8887740124322089688601011236860445432599999999999961789 Q gi|254781074|r 81 SVPLGIRIPPSLVNVYKELQEDVN-FIPPTHGFLEHWGHEGVLLLNAVLTVEEGRAASHRGRGWEQFTDSVIDLISNNHQ 159 (227) Q Consensus 81 sv~~~~~~p~SLrNI~Kel~~d~~-~~~~~~g~L~~Wa~QGVLLLNt~LTv~~g~~~SH~~~gW~~ft~~ii~~l~~~~~ 159 (227) ||+.+++.||||+||||||+.|++ +..|.+|||..||+|||||||++|||++.+|+||+++|||.||++||+.|+++++ T Consensus 152 SV~~gvp~PPSLkNIYkEl~~Di~df~~P~hGdL~~WA~QGVLlLNavLTVrahqaNSHa~~GWetFT~aVvq~i~q~~k 231 (297) T KOG2994 152 SVQPGVPPPPSLKNIYKELSSDIEDFVIPKHGDLSGWAKQGVLLLNAVLTVRAHQANSHAKIGWETFTDAVVQVINQQSK 231 (297) T ss_pred ECCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHCCEEEEEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHCCC T ss_conf 05899998924999999985014665257977720333435687630578750466643333689999999999982367 Q ss_pred CEEEEEECCHHHHHHHCCCCCCCEEEEECCCCHHHCCCCCCCCCHHHHHHHHHHHCCCCCCCCCC Q ss_conf 87999975323334420576660799847888222578862787089999999987889746508 Q gi|254781074|r 160 NIVFMLWGASAQKKQDVLDHKRHLVLKAAHPSPLSASHGFFGCRHFSKANRYLQEHGKTTVNWQI 224 (227) Q Consensus 160 ~vVfiLwG~~A~~~~~~i~~~~~~il~~~HPSPls~~~gF~~~~~F~~~N~~L~~~~~~~IdW~l 224 (227) ++||+|||++||++.+.|+..+|.||+++|||||||+||||||+||++||++|+++|++||||++ T Consensus 232 ~lVFLLWG~~Aqkk~~~i~~~kH~Vl~~~HPSpLSa~rgFFgC~HFsk~N~~Le~~g~kpIdW~~ 296 (297) T KOG2994 232 GLVFLLWGRYAQKKGKLIDRKKHHVLTTAHPSPLSAYRGFFGCRHFSKTNELLEESGKKPIDWQV 296 (297) T ss_pred CEEEEEECCHHHHHHCCCCCCCCEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCCC T ss_conf 55999817268774313475552688436898642015711432277888999983899867523 No 5 >pfam03167 UDG Uracil DNA glycosylase superfamily. Probab=99.41 E-value=1.7e-12 Score=98.13 Aligned_cols=132 Identities=29% Similarity=0.365 Sum_probs=101.4 Q ss_pred HHCEEEEEECCCCCCCCCCCCCEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCHHHHHHCCCEEEECCH---HCCCCC Q ss_conf 61779998118887752000710003479863046999999999741888877401243220896886010---112368 Q gi|254781074|r 57 NKVKVVILGQDPYHNYGQAHGLCFSVPLGIRIPPSLVNVYKELQEDVNFIPPTHGFLEHWGHEGVLLLNAV---LTVEEG 133 (227) Q Consensus 57 ~~vKVVIlGQDPY~~~g~A~GLaFsv~~~~~~p~SLrNI~Kel~~d~~~~~~~~g~L~~Wa~QGVLLLNt~---LTv~~g 133 (227) .+.||+|+||+|++. ..++|..|+.+. ...|++++.++ .|++|+|.+.|++ .+.... T Consensus 6 ~~a~i~ivg~~Pg~~-~~~~G~~f~g~~----g~~l~~~l~~~---------------gl~~~~~~itn~~~c~~~~~~~ 65 (147) T pfam03167 6 PNAKVLIVGEAPGPG-EDATGLPFAGRA----GNLLWRILAEA---------------GLAREGVYLTNVVKCPRPVGRK 65 (147) T ss_pred CCCCEEEEECCCCHH-HHHCCCCCCCCH----HHHHHHHHHHH---------------CCCCCCEEEEEEECCCCCCCCC T ss_conf 999999996798756-763699675756----99999999982---------------8875577998532256876799 Q ss_pred CCHHHHHCCHHHHHHHHHHHHHCCCCCEEEEEECCHHHHHHHCCC-------CCCCEEEEECCCCHHHCCCCCCCCCHHH Q ss_conf 604454325999999999999617898799997532333442057-------6660799847888222578862787089 Q gi|254781074|r 134 RAASHRGRGWEQFTDSVIDLISNNHQNIVFMLWGASAQKKQDVLD-------HKRHLVLKAAHPSPLSASHGFFGCRHFS 206 (227) Q Consensus 134 ~~~SH~~~gW~~ft~~ii~~l~~~~~~vVfiLwG~~A~~~~~~i~-------~~~~~il~~~HPSPls~~~gF~~~~~F~ 206 (227) -..+|...+| ++ +.+.|..-++.+|++| |+.|.+....+. ...+.|+.+.|||||+.+..+.....|. T Consensus 66 ~~~~ei~~~~-~~---L~~~i~~i~p~viv~l-G~~a~~~~~~~~~~~~~~~~~~~~v~~~~HPS~l~~~~~~~~~~~~~ 140 (147) T pfam03167 66 PTRSEIAACW-PY---LLEELALLRPKVVVLL-GKTAAKAFLGLGKKESLFEGGGIPVLPLPHPSPLNRNPGFKRPEAFR 140 (147) T ss_pred CCHHHHHHHH-HH---HHHHHHHCCCCEEEEE-CHHHHHHHHCCCCCCCEECCCCCEEEEECCCHHHHCCHHHCCHHHHH T ss_conf 8989999989-99---9999985596689998-89999998477876522426991799955907974197641747899 Q ss_pred HHHHHHH Q ss_conf 9999999 Q gi|254781074|r 207 KANRYLQ 213 (227) Q Consensus 207 ~~N~~L~ 213 (227) ++++.|+ T Consensus 141 ~~~~~L~ 147 (147) T pfam03167 141 EALEDLK 147 (147) T ss_pred HHHHHHC T ss_conf 9999729 No 6 >pfam03585 Herpes_ICP4_C Herpesvirus ICP4-like protein C-terminal region. The immediate-early protein ICP4 (infected-cell polypeptide 4) is required for efficient transcription of early and late viral genes and is thus essential for productive infection. ICP4 is a large phosphoprotein that binds DNA in a sequence specific manner as a homodimer. ICP4 represses transcription from LAT, ICP4 and ORF-P that have high-affinity a ICP4 binding site that spans the transcription initiation site. ICP4 proteins have two highly conserved regions, this family contains the C-terminal region that probably acts as an enhancer for the N-terminal region. Probab=93.53 E-value=0.64 Score=26.49 Aligned_cols=184 Identities=16% Similarity=0.219 Sum_probs=98.0 Q ss_pred CCCCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCC-----------EECCCHHHHHHHHHCC-CHHHCEEEEEECCCC Q ss_conf 8686873389998887265899999999999997798-----------0858876999998429-986177999811888 Q gi|254781074|r 2 EGVKIHESWKSLLENHFQSEHMRNLKEFLLSEKRKGK-----------RIFPRGSHYFRAFDIT-PFNKVKVVILGQDPY 69 (227) Q Consensus 2 ~~l~i~~sW~~~L~~e~~~~~~~~i~~~l~~e~~~~~-----------~i~P~~~~IF~af~~~-p~~~vKVVIlGQDPY 69 (227) |.=.+++.|++.|. |.-.-+.+|.. +..+- ...=+...--+|.+.. +++|||||||= +|- T Consensus 35 d~plfP~~WRpaL~--fdP~ALA~IAa-----r~~~~~~~~~~~~g~l~as~pLRrr~AWMrQi~dPEDVrvVvLY-~PL 106 (425) T pfam03585 35 DHPLFPEPWRPALA--FDPEALAEIAA-----RCNGPPPRSDRGFGPLTASGPLRRRTAWMRQIADPEDVRVVVLY-SPL 106 (425) T ss_pred CCCCCCCCCCHHHC--CCHHHHHHHHH-----HCCCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCHHHEEEEEEE-CCC T ss_conf 78767754322220--49899999986-----43799985555437644436788888887518984333799983-699 Q ss_pred CCCCCCCC---------CEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCC----------CHHHHHHCCCEEEECCHHCC Q ss_conf 77520007---------100034798630469999999997418888774----------01243220896886010112 Q gi|254781074|r 70 HNYGQAHG---------LCFSVPLGIRIPPSLVNVYKELQEDVNFIPPTH----------GFLEHWGHEGVLLLNAVLTV 130 (227) Q Consensus 70 ~~~g~A~G---------LaFsv~~~~~~p~SLrNI~Kel~~d~~~~~~~~----------g~L~~Wa~QGVLLLNt~LTv 130 (227) |++.-+.+ ..||..++ -|.-...+|.+-+= .|.+ -|.+..-.||||||.| T Consensus 107 pgE~L~~~~~~~~~~~~P~w~~~rG-----GLS~LLAAl~NRLc--~p~s~AWAGnWtgaPDiSaLnaqGVLlLSt---- 175 (425) T pfam03585 107 PGEHLFGVPPPGDPPGGPRWSPTRG-----GLSFLLAALSNRLC--LPDSHAWAGNWTGAPDISALNAQGVLLLST---- 175 (425) T ss_pred CCHHHCCCCCCCCCCCCCCCCCCCC-----CHHHHHHHHHCCCC--CCCCHHHCCCCCCCCCCCHHHCCCEEEEEC---- T ss_conf 9525317798888889987786777-----49999999856555--865303206767898723120071599752---- Q ss_pred CCCCCHHHHHCCHHHHHHHHHHHHHCCCCCEEEEEECCHHHHHH---HCCCCCCCEEEEECCCCHHH------------- Q ss_conf 36860445432599999999999961789879999753233344---20576660799847888222------------- Q gi|254781074|r 131 EEGRAASHRGRGWEQFTDSVIDLISNNHQNIVFMLWGASAQKKQ---DVLDHKRHLVLKAAHPSPLS------------- 194 (227) Q Consensus 131 ~~g~~~SH~~~gW~~ft~~ii~~l~~~~~~vVfiLwG~~A~~~~---~~i~~~~~~il~~~HPSPls------------- 194 (227) .++||.--.+.+=..+.+....+|++= --+....- .-|. +.|.-+++. =+|.+ T Consensus 176 --------~DL~FaGaVEyL~l~~~~a~rRLIV~D-tV~~~dwP~dGPavs-~~h~Y~r~~-v~P~aqc~vRWP~~~~L~ 244 (425) T pfam03585 176 --------RDLAFAGAVEYLCLRLGSARRRLIVLD-TVDPEDWPEDGPAVS-QYHVYVRCA-VSPRAQCAVRWPGARDLA 244 (425) T ss_pred --------CHHHHHHHHHHHHHHHHHCCCEEEEEE-CCCHHHCCCCCCCCE-EEEEEEECC-CCCHHHCEEECCCCCCHH T ss_conf --------302567699999998752464179972-457311788897111-105788544-472122133378852110 Q ss_pred ----CCCCCCCCCHHHHHHHHHHHC Q ss_conf ----578862787089999999987 Q gi|254781074|r 195 ----ASHGFFGCRHFSKANRYLQEH 215 (227) Q Consensus 195 ----~~~gF~~~~~F~~~N~~L~~~ 215 (227) +.+.-||-..|.++....... T Consensus 245 raVltS~~v~GP~~farvEaafarl 269 (425) T pfam03585 245 RAVLTSSRVFGPGSFARVEAAFARL 269 (425) T ss_pred HHHEECCCCCCCHHHHHHHHHHHHH T ss_conf 3421035544717999999999864 No 7 >COG1573 Uracil-DNA glycosylase [DNA replication, recombination, and repair] Probab=92.89 E-value=0.24 Score=29.09 Aligned_cols=140 Identities=21% Similarity=0.214 Sum_probs=73.7 Q ss_pred HHHCEEEEEECCCCCCCCCCCCCEEECCCCCCCHHHHHHHHHHHHHHC-CCCCCCCCHHHHHHCCCEEEECCHHCCCCCC Q ss_conf 861779998118887752000710003479863046999999999741-8888774012432208968860101123686 Q gi|254781074|r 56 FNKVKVVILGQDPYHNYGQAHGLCFSVPLGIRIPPSLVNVYKELQEDV-NFIPPTHGFLEHWGHEGVLLLNAVLTVEEGR 134 (227) Q Consensus 56 ~~~vKVVIlGQDPY~~~g~A~GLaFsv~~~~~~p~SLrNI~Kel~~d~-~~~~~~~g~L~~Wa~QGVLLLNt~LTv~~g~ 134 (227) ..+.+++|+||.|-- ..+.+|+-|--..+. =|+.+..++.... ...... .+..|.. ..=....|+ T Consensus 38 ~~~a~i~iVGeaPG~-~~~~tG~pF~g~aG~----~L~~~l~~~~~~~~~~~~it--~~~~~~~-------~p~~~~~~~ 103 (202) T COG1573 38 NPTARILIVGEAPGA-GEDRTGRPFVGKAGD----LLDRILGAIGGLRYEDVYIT--NVVKCRY-------PPGNRPPGD 103 (202) T ss_pred CCCCCEEEEECCCCC-CHHHCCCCCCCCHHH----HHHHHHHHHHCCCCCCEEEE--ECCCCCC-------CCCCCCCCC T ss_conf 887788998379971-311059815580489----99999998516656766775--2435436-------899999999 Q ss_pred CHHHHHCCHHHHHHHHHHHHHCCCCCEEEEEECCHHHHHHHCCC-----C--CC-----------CEEEEECCCCHHHCC Q ss_conf 04454325999999999999617898799997532333442057-----6--66-----------079984788822257 Q gi|254781074|r 135 AASHRGRGWEQFTDSVIDLISNNHQNIVFMLWGASAQKKQDVLD-----H--KR-----------HLVLKAAHPSPLSAS 196 (227) Q Consensus 135 ~~SH~~~gW~~ft~~ii~~l~~~~~~vVfiLwG~~A~~~~~~i~-----~--~~-----------~~il~~~HPSPls~~ 196 (227) +....-.-=.+|-..-|+.+. + -|.++.|+.|++..-... . .+ ..++-+.|||++... T Consensus 104 p~~~E~~~c~p~l~~~i~l~~---p-kviv~LG~~A~~~~l~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~HPs~~~~~ 179 (202) T COG1573 104 PTPEEIKACRPFLEAEIALIR---P-KVILLLGEYAAKSLLGLKGEPITELRGRWGAYRPKGGGGIRVFPTYHPSPLNRN 179 (202) T ss_pred CCHHHHHHHHHHHHHHHHCCC---C-CEEEECCHHHHHHHHCCCCCCCCCCCCEEEEECCCCCCCEEEEEECCCCHHHHC T ss_conf 698898640567999984279---8-889980789999986315665100254244332567876289860587098738 Q ss_pred CC----CCCCCHHHHHHHHHH Q ss_conf 88----627870899999999 Q gi|254781074|r 197 HG----FFGCRHFSKANRYLQ 213 (227) Q Consensus 197 ~g----F~~~~~F~~~N~~L~ 213 (227) -+ .+--..|.++-.++. T Consensus 180 ~~~~~~~~~~~~~~~~k~~~~ 200 (202) T COG1573 180 PGKLNKWFFWEDLKKLKALLR 200 (202) T ss_pred HHHHCHHHHHHHHHHHHHHHH T ss_conf 212075999999999999864 No 8 >pfam08749 DUF1790 Family of unknown function (DUF1790). Family of uncharacterized proteins ranging in length from 150 to 210 amino acids. Probab=81.93 E-value=1.5 Score=24.22 Aligned_cols=92 Identities=20% Similarity=0.339 Sum_probs=65.4 Q ss_pred CCCEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCHHHHHHCCCEEEECCHHCCCCCCCHHHHHCCHHHHHHHHHHHHH Q ss_conf 07100034798630469999999997418888774012432208968860101123686044543259999999999996 Q gi|254781074|r 76 HGLCFSVPLGIRIPPSLVNVYKELQEDVNFIPPTHGFLEHWGHEGVLLLNAVLTVEEGRAASHRGRGWEQFTDSVIDLIS 155 (227) Q Consensus 76 ~GLaFsv~~~~~~p~SLrNI~Kel~~d~~~~~~~~g~L~~Wa~QGVLLLNt~LTv~~g~~~SH~~~gW~~ft~~ii~~l~ 155 (227) ..|-|++--+.++|..=+.=.-+|-.-.+... --|..+-|..+|++....+|.++.+..-|.. . -+.+.+..+..+- T Consensus 44 ~~L~~~~~~d~~vp~~k~~~~~~ll~~iN~~l-wlGhF~~~~~~g~i~fR~~l~l~g~~~~s~~-q-ie~lv~~av~~ce 120 (145) T pfam08749 44 EALQLICYFDMKVPEGKRPEVAELLNLINEQL-WIGHFELWEEDGLIRYRYGLLLAGGQEASPE-Q-LENMLEAAVDACE 120 (145) T ss_pred CEEEEEEECCCCCCCCHHHHHHHHHHHHHHCC-CCCCEEECCCCCEEEEEEEEECCCCCCCCHH-H-HHHHHHHHHHHHH T ss_conf 86999998477689400899999999997447-6177375189996999998771898889999-9-9999999999999 Q ss_pred CCCCCEEEEEECCHH Q ss_conf 178987999975323 Q gi|254781074|r 156 NNHQNIVFMLWGASA 170 (227) Q Consensus 156 ~~~~~vVfiLwG~~A 170 (227) +--+..-|++||+.. T Consensus 121 r~yPafq~v~~g~~s 135 (145) T pfam08749 121 RFYPAFQFVVWGGKS 135 (145) T ss_pred HHHHHHHHHHCCCCC T ss_conf 887787876318989 No 9 >cd03131 GATase1_HTS Type 1 glutamine amidotransferase (GATase1)-like domain found in homoserine trans-succinylase (HTS). Type 1 glutamine amidotransferase (GATase1)-like domain found in homoserine trans-succinylase (HTS). HTS, the first enzyme in methionine biosynthesis in Escherichia coli, transfers a succinyl group from succinyl-CoA to homoserine forming succinyl homoserine. It has been suggested that the succinyl group of succinyl-CoA is initially transferred to an enzyme nucleophile before subsequent transfer to homoserine. The catalytic triad typical of GATase1 domains is not conserved in this GATase1-like domain. However, in common with GATase1 domains a reactive cys residue is found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow. It has been proposed that this cys is in the active site of the molecule. However, as succinyl has been found bound to a conserved lysine residue, this conserved cys may play a role in dimer formation. HTS acti Probab=71.06 E-value=3 Score=22.37 Aligned_cols=101 Identities=16% Similarity=0.303 Sum_probs=58.6 Q ss_pred HHHHHHHHCCCHHHCEEEEEECCCCCCCCCCCCCEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCHHHHHHCCCEEEE Q ss_conf 69999984299861779998118887752000710003479863046999999999741888877401243220896886 Q gi|254781074|r 45 SHYFRAFDITPFNKVKVVILGQDPYHNYGQAHGLCFSVPLGIRIPPSLVNVYKELQEDVNFIPPTHGFLEHWGHEGVLLL 124 (227) Q Consensus 45 ~~IF~af~~~p~~~vKVVIlGQDPY~~~g~A~GLaFsv~~~~~~p~SLrNI~Kel~~d~~~~~~~~g~L~~Wa~QGVLLL 124 (227) .+++|-+..+| -+|.|..+=-+-|+.++ ..+.=|...|+.+.+ +.. ..-.|- + T Consensus 15 ~qf~rll~~sp-lqv~~~~~~~~sh~~k~-------------~~~~~l~~~Y~~~~~-i~~-~~~Dgl-----------I 67 (175) T cd03131 15 RQFLRLLGNTP-LQVEITFIRPSSHSSKN-------------TPPEHVNRFYETFDD-IRD-AKFDGL-----------I 67 (175) T ss_pred HHHHHHHCCCC-CCEEEEEEECCCCCCCC-------------CCHHHHHHHHHHHHH-HHH-CCCCEE-----------E T ss_conf 99999856898-71699999612568899-------------889999999884998-741-777668-----------9 Q ss_pred CCHHCCCCCCCHHHHHCCHHHHHHHHHHHHHCCCCCEEEEEECCHHHHHHH Q ss_conf 010112368604454325999999999999617898799997532333442 Q gi|254781074|r 125 NAVLTVEEGRAASHRGRGWEQFTDSVIDLISNNHQNIVFMLWGASAQKKQD 175 (227) Q Consensus 125 Nt~LTv~~g~~~SH~~~gW~~ft~~ii~~l~~~~~~vVfiLwG~~A~~~~~ 175 (227) =|.=- .+..+-....+ |+++++ |++....+.....|++||.+|.-+.. T Consensus 68 ITGAP-ve~l~fe~v~Y-W~El~~-i~dwa~~~v~stl~iCWaA~Aal~~~ 115 (175) T cd03131 68 VTGAP-VEHLPFEQVDY-WEELTE-ILDWAKTHVTSTLFSCWAAMAALYYF 115 (175) T ss_pred EECCC-CCCCCCCCCCC-HHHHHH-HHHHHHHHCCCEEEEHHHHHHHHHHH T ss_conf 71899-88788343877-999999-99999873663345649999999998 No 10 >KOG0121 consensus Probab=70.07 E-value=5.2 Score=20.88 Aligned_cols=53 Identities=15% Similarity=0.397 Sum_probs=38.8 Q ss_pred EECCCHHHHHHHHHCCCHHHCEEEEEECCCCCCCCCCCCCEE---ECCCCCCCHHHHHHHHH Q ss_conf 085887699999842998617799981188877520007100---03479863046999999 Q gi|254781074|r 39 RIFPRGSHYFRAFDITPFNKVKVVILGQDPYHNYGQAHGLCF---SVPLGIRIPPSLVNVYK 97 (227) Q Consensus 39 ~i~P~~~~IF~af~~~p~~~vKVVIlGQDPY~~~g~A~GLaF---sv~~~~~~p~SLrNI~K 97 (227) ..|-..++|+.-|..| .+||.||+|-|-+.. .++|.|| .++.+ .--+||+|-- T Consensus 45 SfyttEEqiyELFs~c--G~irriiMGLdr~kk--tpCGFCFVeyy~~~d--A~~Alryisg 100 (153) T KOG0121 45 SFYTTEEQIYELFSKC--GDIRRIIMGLDRFKK--TPCGFCFVEYYSRDD--AEDALRYISG 100 (153) T ss_pred EEEECHHHHHHHHHHC--CCHHEEEECCCCCCC--CCCCEEEEEEECCHH--HHHHHHHHCC T ss_conf 3553099999999834--562036761344776--755158999963355--8999987416 No 11 >COG5465 Uncharacterized conserved protein [Function unknown] Probab=68.50 E-value=7.5 Score=19.90 Aligned_cols=85 Identities=22% Similarity=0.398 Sum_probs=62.3 Q ss_pred CCCCCHHHHHHHHHHHHHHCCCC-CCCCCHHHHHHCCCEEEECCHHCCCCC-CCHHHHHCCHHHHHHHHHHHHHCCCCCE Q ss_conf 79863046999999999741888-877401243220896886010112368-6044543259999999999996178987 Q gi|254781074|r 84 LGIRIPPSLVNVYKELQEDVNFI-PPTHGFLEHWGHEGVLLLNAVLTVEEG-RAASHRGRGWEQFTDSVIDLISNNHQNI 161 (227) Q Consensus 84 ~~~~~p~SLrNI~Kel~~d~~~~-~~~~g~L~~Wa~QGVLLLNt~LTv~~g-~~~SH~~~gW~~ft~~ii~~l~~~~~~v 161 (227) -++++|.|-||=.-+|-.-++-. ...|. .-|-+||+...--+|-+-.| +|.+-. =+...+.++..+-.--+-. T Consensus 67 fd~kvpes~~~~~~rLL~liN~~v~~ghF--~lweeeg~I~fR~~L~l~~G~ep~~~q---~~v~idta~eacdsff~Af 141 (166) T COG5465 67 FDIKVPESRRAEVARLLSLINRDVLIGHF--DLWEEEGAIFFRQSLLLAGGVEPTSQQ---VEVLIDTALEACDSFFQAF 141 (166) T ss_pred CCCCCCHHHHHHHHHHHHHHHCCCCCCCC--CCCCCCCEEEEEEEHHCCCCCCCCHHH---HHHHHHHHHHHHHHHHHHH T ss_conf 43327525788999999985115212776--732246826642301126999941899---9999999999999888877 Q ss_pred EEEEECC-HHHHH Q ss_conf 9999753-23334 Q gi|254781074|r 162 VFMLWGA-SAQKK 173 (227) Q Consensus 162 VfiLwG~-~A~~~ 173 (227) -|+.||+ +|+.. T Consensus 142 ~fv~~g~~~~dea 154 (166) T COG5465 142 QFVVWGGMDADEA 154 (166) T ss_pred HHHHCCCCCHHHH T ss_conf 8763479999999 No 12 >PRK05368 homoserine O-succinyltransferase; Provisional Probab=62.50 E-value=5.7 Score=20.60 Aligned_cols=97 Identities=19% Similarity=0.368 Sum_probs=56.0 Q ss_pred HHHHHHHCCCHHHCEEEEEECCCCCCCCCCCCCEEECCCCCCCHHH-HHHHHHHHHHHCCCCCCCCCHHHHHHCCCEEEE Q ss_conf 9999984299861779998118887752000710003479863046-999999999741888877401243220896886 Q gi|254781074|r 46 HYFRAFDITPFNKVKVVILGQDPYHNYGQAHGLCFSVPLGIRIPPS-LVNVYKELQEDVNFIPPTHGFLEHWGHEGVLLL 124 (227) Q Consensus 46 ~IF~af~~~p~~~vKVVIlGQDPY~~~g~A~GLaFsv~~~~~~p~S-LrNI~Kel~~d~~~~~~~~g~L~~Wa~QGVLLL 124 (227) +++|-+..+| -+|.|..+--+-|+. ...+++ |.+.|+.+.+=-. ..-.|-. T Consensus 53 Q~lRlL~~tp-lqv~~~~~~~~sh~~--------------k~t~~~hl~~fY~~f~~ik~--~~~DGlI----------- 104 (302) T PRK05368 53 QFLRLLGNTP-LQVDLHLLRIDSHES--------------KNTPAEHLNNFYRTFEDIKD--QKFDGLI----------- 104 (302) T ss_pred HHHHHHCCCC-CEEEEEEEEECCCCC--------------CCCCHHHHHHHHHHHHHHHH--CCCCEEE----------- T ss_conf 9999837987-358999987344468--------------99879999999764999965--8776699----------- Q ss_pred CCHHCCCCCCCHHHHHCCHHHHHHHHHHHHHCCCCCEEEEEECCHHHHH Q ss_conf 0101123686044543259999999999996178987999975323334 Q gi|254781074|r 125 NAVLTVEEGRAASHRGRGWEQFTDSVIDLISNNHQNIVFMLWGASAQKK 173 (227) Q Consensus 125 Nt~LTv~~g~~~SH~~~gW~~ft~~ii~~l~~~~~~vVfiLwG~~A~~~ 173 (227) =|.=- .+..+-....+ |+++++ |++....+.....|+|||.+|.-. T Consensus 105 ITGAP-vE~l~Fe~V~Y-W~El~~-I~dws~~~v~stl~iCWaA~AaL~ 150 (302) T PRK05368 105 VTGAP-VELLPFEDVDY-WDELRE-ILDWAKTHVTSTLFICWAAQAALY 150 (302) T ss_pred ECCCC-CCCCCCCCCCC-HHHHHH-HHHHHHHCCCCEEEECHHHHHHHH T ss_conf 72898-77788553741-999999-999998626522567499999999 No 13 >pfam04204 HTS Homoserine O-succinyltransferase. Probab=59.42 E-value=6.9 Score=20.12 Aligned_cols=103 Identities=21% Similarity=0.382 Sum_probs=58.4 Q ss_pred ECCCHH----HHHHHHHCCCHHHCEEEEEECCCCCCCCCCCCCEEECCCCCCCHHH-HHHHHHHHHHHCCCCCCCCCHHH Q ss_conf 858876----9999984299861779998118887752000710003479863046-99999999974188887740124 Q gi|254781074|r 40 IFPRGS----HYFRAFDITPFNKVKVVILGQDPYHNYGQAHGLCFSVPLGIRIPPS-LVNVYKELQEDVNFIPPTHGFLE 114 (227) Q Consensus 40 i~P~~~----~IF~af~~~p~~~vKVVIlGQDPY~~~g~A~GLaFsv~~~~~~p~S-LrNI~Kel~~d~~~~~~~~g~L~ 114 (227) .-|.+. +++|-+..+| -+|.|..+--+-|+.. ..|++ |...|+.+.+=.. ..-.|- T Consensus 42 LMP~K~~TE~Q~lRlL~ntp-lqv~v~~~~~~sh~~k--------------nt~~~hl~~fY~~f~~ik~--~~~DGl-- 102 (298) T pfam04204 42 LMPKKIETETQLLRLLGNTP-LQVEVTLLRMSSHESK--------------NTPAEHLESFYRTFDEIKD--EKFDGL-- 102 (298) T ss_pred CCCCCHHHHHHHHHHHCCCC-CEEEEEEEEECCCCCC--------------CCCHHHHHHHHHHHHHHHH--CCCCEE-- T ss_conf 68983899999999857887-4489999873444689--------------9878999999871999965--887569-- Q ss_pred HHHCCCEEEECCHHCCCCCCCHHHHHCCHHHHHHHHHHHHHCCCCCEEEEEECCHHHHH Q ss_conf 32208968860101123686044543259999999999996178987999975323334 Q gi|254781074|r 115 HWGHEGVLLLNAVLTVEEGRAASHRGRGWEQFTDSVIDLISNNHQNIVFMLWGASAQKK 173 (227) Q Consensus 115 ~Wa~QGVLLLNt~LTv~~g~~~SH~~~gW~~ft~~ii~~l~~~~~~vVfiLwG~~A~~~ 173 (227) +=|.=- .+..+-....+ |+++++ |++..-.+.....|+|||.+|.-. T Consensus 103 ---------IITGAP-vE~l~FeeV~Y-W~El~~-I~dWs~~~v~Stl~iCWaAqAaL~ 149 (298) T pfam04204 103 ---------IITGAP-VEHLPFEDVDY-WEELTE-IMDWSKTNVTSTLHICWGAQAALY 149 (298) T ss_pred ---------EECCCC-CCCCCCCCCCC-HHHHHH-HHHHHHHCCCCEEEECHHHHHHHH T ss_conf ---------972898-77788553741-999999-999998625440457499999999 No 14 >cd01840 SGNH_hydrolase_yrhL_like yrhL-like subfamily of SGNH-hydrolases, a diverse family of lipases and esterases. The tertiary fold of the enzyme is substantially different from that of the alpha/beta hydrolase family and unique among all known hydrolases; its active site closely resembles the Ser-His-Asp(Glu) triad found in other serine hydrolases. Most members of this sub-family appear to co-occur with N-terminal acyltransferase domains. Might be involved in lipid metabolism. Probab=55.68 E-value=11 Score=18.94 Aligned_cols=59 Identities=24% Similarity=0.376 Sum_probs=44.5 Q ss_pred CEEEECCHHCCCCCCCHHHHHCCHHHHHHHHHHHHHCCCCCEEEEEECCHHHHHHHCCCCCCCEEEEECCCCHHHC Q ss_conf 9688601011236860445432599999999999961789879999753233344205766607998478882225 Q gi|254781074|r 120 GVLLLNAVLTVEEGRAASHRGRGWEQFTDSVIDLISNNHQNIVFMLWGASAQKKQDVLDHKRHLVLKAAHPSPLSA 195 (227) Q Consensus 120 GVLLLNt~LTv~~g~~~SH~~~gW~~ft~~ii~~l~~~~~~vVfiLwG~~A~~~~~~i~~~~~~il~~~HPSPls~ 195 (227) -|.|.|+. ....|+.-++..|+.++++.+++..+=|.+.|+.-...+..+ ..||.|-++ T Consensus 81 ~v~lVn~~-----------~~~~w~~~~N~~l~~~a~~y~Nv~viDW~~~a~~~~~~f~~D------gvH~~~~G~ 139 (150) T cd01840 81 QVYLVNPH-----------VPRPWEPDVNAYLLDAAKKYKNVTIIDWYKAAKGHPDWFYGD------GVHPNPAGA 139 (150) T ss_pred EEEEEECC-----------CCCCHHHHHHHHHHHHHHHCCCEEEEEHHHHHHCCCCEECCC------CCCCCCCCH T ss_conf 79999588-----------995135899999999998789879971698860095516389------861185509 No 15 >TIGR02061 aprA adenylylsulfate reductase, alpha subunit; InterPro: IPR011803 During dissimilatory sulphate reduction or sulphur oxidation, adenylylsulphate (APS) reductase catalyzes reversibly the two-electron reduction of APS to sulphite and AMP. Found in several bacterial lineages and in Archaeoglobales, APS reductase is a heterodimer composed of an alpha subunit containing a noncovalently bound FAD, and a beta subunit containing two [4Fe-4S] clusters. This entry describes the alpha subunit of APS reductase, which shares a common evolutionary origin with fumarate reductase/succinate dehydrogenase flavoproteins.. Probab=48.27 E-value=16 Score=17.86 Aligned_cols=76 Identities=22% Similarity=0.347 Sum_probs=48.8 Q ss_pred CCCCCCCCCCC-CEEECCCCCCCHHHHHH--HHHHHHHHC-CCCCCCCCHHHHHHCCCEEEECCHHCCCCCCCHHHHHCC Q ss_conf 88877520007-10003479863046999--999999741-888877401243220896886010112368604454325 Q gi|254781074|r 67 DPYHNYGQAHG-LCFSVPLGIRIPPSLVN--VYKELQEDV-NFIPPTHGFLEHWGHEGVLLLNAVLTVEEGRAASHRGRG 142 (227) Q Consensus 67 DPY~~~g~A~G-LaFsv~~~~~~p~SLrN--I~Kel~~d~-~~~~~~~g~L~~Wa~QGVLLLNt~LTv~~g~~~SH~~~g 142 (227) .+|-+.|.|.| ++.-+.+ +||..||| +..|+.+-- ++-+-+.--| -|+.=|..+.+-.-=...+ T Consensus 299 ~~y~~~gya~p~~shrthr--PIPTcLRNH~~~~E~~~GrgPIyM~T~eAl----------q~~~~~l~~~~~Khlee~a 366 (651) T TIGR02061 299 KEYKDKGYAKPELSHRTHR--PIPTCLRNHALLREVREGRGPIYMDTKEAL----------QDLAATLDKKKLKHLEEEA 366 (651) T ss_pred CCCCCCCCCCCCCCCCCCC--CCCCCCHHHHHHHHHHCCCCCCEECHHHHH----------HHHHHCCCHHHHHHHHHHH T ss_conf 3557753458620245578--887841446999999608798110538887----------5253010335543367654 Q ss_pred HHHHHHHHHHHH Q ss_conf 999999999999 Q gi|254781074|r 143 WEQFTDSVIDLI 154 (227) Q Consensus 143 W~~ft~~ii~~l 154 (227) |+.|-+=.+.+- T Consensus 367 wedFLdMtVgQA 378 (651) T TIGR02061 367 WEDFLDMTVGQA 378 (651) T ss_pred HHHHHCCHHHHH T ss_conf 454510128888 No 16 >TIGR01001 metA homoserine O-succinyltransferase; InterPro: IPR005697 This family of enzymes, homoserine O-succinyltransferase (2.3.1.46 from EC), catalyses the first step in the biosynthesis of methionine.Succinyl-CoA + L-homoserine = CoA + O-succinyl-L-homoserine. ; GO: 0008415 acyltransferase activity, 0019281 methionine biosynthetic process from homoserine via O-succinyl-L-homoserine and cystathionine, 0005737 cytoplasm. Probab=46.11 E-value=7.5 Score=19.87 Aligned_cols=122 Identities=20% Similarity=0.342 Sum_probs=73.4 Q ss_pred HHHHHHHHHCCCHHHCEEEEEECCCCCCCCCCCCCEEECCCCCCCHHHHHHHHHHHHH--HCCCCCCCCCHHHHHHCCCE Q ss_conf 7699999842998617799981188877520007100034798630469999999997--41888877401243220896 Q gi|254781074|r 44 GSHYFRAFDITPFNKVKVVILGQDPYHNYGQAHGLCFSVPLGIRIPPSLVNVYKELQE--DVNFIPPTHGFLEHWGHEGV 121 (227) Q Consensus 44 ~~~IF~af~~~p~~~vKVVIlGQDPY~~~g~A~GLaFsv~~~~~~p~SLrNI~Kel~~--d~~~~~~~~g~L~~Wa~QGV 121 (227) -.|++|-+.- +|=+|.|-+|--|-|..++ ++. .=|.+.|+.++. |-.+ .|-.-.=|= T Consensus 51 E~Q~LrLL~N-sPLQV~i~LL~~~~~~~kn------------TP~-eHl~~FY~~fe~vk~~~F----DG~IvTGAP--- 109 (305) T TIGR01001 51 ENQLLRLLSN-SPLQVNITLLRIDSRKSKN------------TPI-EHLNKFYKTFEEVKDRKF----DGLIVTGAP--- 109 (305) T ss_pred HHHHHHHHCC-CCCEEEEEEEEECCCCCCC------------CCH-HHHHHHHHHHHHHHHCCC----CEEEECCCC--- T ss_conf 9999998618-9840456875432779887------------717-899998654788851278----815884776--- Q ss_pred EEECCHHCCCCCCCHHHHHCCHHHHHHHHHHHHHCCCCCEEEEEECCHHHHH-HHCCCC----------CCCEEEEECCC Q ss_conf 8860101123686044543259999999999996178987999975323334-420576----------66079984788 Q gi|254781074|r 122 LLLNAVLTVEEGRAASHRGRGWEQFTDSVIDLISNNHQNIVFMLWGASAQKK-QDVLDH----------KRHLVLKAAHP 190 (227) Q Consensus 122 LLLNt~LTv~~g~~~SH~~~gW~~ft~~ii~~l~~~~~~vVfiLwG~~A~~~-~~~i~~----------~~~~il~~~HP 190 (227) +-. ..=....+ |+.++ .|++..-.+.....|||||.+|.=+ ---|+. =+|.|++..|+ T Consensus 110 ------~el---~~Fe~VaY-W~~l~-~i~~w~k~~VtStlyiCWaaqAaL~~lYg~pK~~~~~KlsGVY~H~~~~~~~~ 178 (305) T TIGR01001 110 ------VEL---VDFEDVAY-WEELK-EIMEWSKQNVTSTLYICWAAQAALKYLYGIPKITLDEKLSGVYKHDIVKPDDL 178 (305) T ss_pred ------CCC---CCCCCCHH-HHHHH-HHHHHHHHCCEEEHHHHHHHHHHHHHHHCCCCCCCCCCCCCEEEEEEECCCCC T ss_conf ------244---66443223-68999-99998750670520477899999998717873222322000244322267343 Q ss_pred CHHHCCCCCCC Q ss_conf 82225788627 Q gi|254781074|r 191 SPLSASHGFFG 201 (227) Q Consensus 191 SPls~~~gF~~ 201 (227) |- |||-+ T Consensus 179 --L~--rg~Dd 185 (305) T TIGR01001 179 --LL--RGFDD 185 (305) T ss_pred --CC--CCCCC T ss_conf --32--26871 No 17 >TIGR01977 am_tr_V_EF2568 cysteine desulfurase family protein; InterPro: IPR010969 This entry describes a subfamily of probable pyridoxal phosphate-dependent enzymes in the aminotransferase class V family. Related families contain members active as cysteine desulphurases, selenocysteine lyases, or both. The members of this family form a distinct clade and all are shorter at the N terminus. The function of members of this family are unknown.. Probab=45.97 E-value=17 Score=17.76 Aligned_cols=77 Identities=23% Similarity=0.381 Sum_probs=48.7 Q ss_pred CCCEEEECCHHCCCC--CCCHHHHHCCHHHHHHHHHHHHHCCC-------------CCEE-EEEECCHHHHHHHCCCCCC Q ss_conf 089688601011236--86044543259999999999996178-------------9879-9997532333442057666 Q gi|254781074|r 118 HEGVLLLNAVLTVEE--GRAASHRGRGWEQFTDSVIDLISNNH-------------QNIV-FMLWGASAQKKQDVLDHKR 181 (227) Q Consensus 118 ~QGVLLLNt~LTv~~--g~~~SH~~~gW~~ft~~ii~~l~~~~-------------~~vV-fiLwG~~A~~~~~~i~~~~ 181 (227) -+|+.=||+++=-.+ |-..-.+ . =...++.+++-|.+.+ -+|| |.+||=+.......++.+- T Consensus 258 ~pGI~gL~agik~~~~~gi~~i~~-k-E~~l~~~l~~~L~~~~~V~iyG~~d~~~~~~vvSf~~~g~d~~~va~~L~~~~ 335 (384) T TIGR01977 258 TPGIAGLNAGIKFIEKIGIANIAK-K-ELKLTEKLLNGLKEIKKVKIYGPKDPAKRVGVVSFTVEGIDSEEVADILDEKF 335 (384) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH-H-HHHHHHHHHHHCCCCCCEEEECCCCHHHCCCEEEEEECCCCHHHHHHHHHHCC T ss_conf 799999988878875644778999-9-99999999874016780676277034332675899834888778999998618 Q ss_pred CEEEE-ECCCCHHHCCC Q ss_conf 07998-47888222578 Q gi|254781074|r 182 HLVLK-AAHPSPLSASH 197 (227) Q Consensus 182 ~~il~-~~HPSPls~~~ 197 (227) ..+.. .=|..||+ |+ T Consensus 336 ~I~~RtGlHCAPLA-H~ 351 (384) T TIGR01977 336 DIAVRTGLHCAPLA-HK 351 (384) T ss_pred CCEEECCCCCCHHH-HH T ss_conf 91897676664123-03 No 18 >COG1922 WecG Teichoic acid biosynthesis proteins [Cell envelope biogenesis, outer membrane] Probab=43.76 E-value=24 Score=16.80 Aligned_cols=41 Identities=17% Similarity=0.198 Sum_probs=34.1 Q ss_pred CCHHHHHCCHHHHHHHHHHHHHCCCCCEEEEEECCHHHHHH Q ss_conf 60445432599999999999961789879999753233344 Q gi|254781074|r 134 RAASHRGRGWEQFTDSVIDLISNNHQNIVFMLWGASAQKKQ 174 (227) Q Consensus 134 ~~~SH~~~gW~~ft~~ii~~l~~~~~~vVfiLwG~~A~~~~ 174 (227) -.++|..+.+..=-++|++.|+..+..++|+-.|..-|..= T Consensus 137 ivg~h~GYf~~~e~~~i~~~I~~s~pdil~VgmG~P~QE~w 177 (253) T COG1922 137 IVGSHDGYFDPEEEEAIVERIAASGPDILLVGMGVPRQEIW 177 (253) T ss_pred EEEECCCCCCHHHHHHHHHHHHHCCCCEEEEECCCCHHHHH T ss_conf 99951787883568999999985599989995699622799 No 19 >pfam02514 CobN-Mg_chel CobN/Magnesium Chelatase. This family contains a domain common to the cobN protein and to magnesium protoporphyrin chelatase. CobN is implicated in the conversion of hydrogenobyrinic acid a,c-diamide to cobyrinic acid. Magnesium protoporphyrin chelatase is involved in chlorophyll biosynthesis. Probab=41.21 E-value=30 Score=16.15 Aligned_cols=78 Identities=21% Similarity=0.203 Sum_probs=48.4 Q ss_pred HHHHHCCCHHHCEEEEEECCCCCCCCCCCCCEEECCCCCCCHHHHHHHHHHHHHH-CCCCCCCCCH-HHHHHCCCEEEEC Q ss_conf 9998429986177999811888775200071000347986304699999999974-1888877401-2432208968860 Q gi|254781074|r 48 FRAFDITPFNKVKVVILGQDPYHNYGQAHGLCFSVPLGIRIPPSLVNVYKELQED-VNFIPPTHGF-LEHWGHEGVLLLN 125 (227) Q Consensus 48 F~af~~~p~~~vKVVIlGQDPY~~~g~A~GLaFsv~~~~~~p~SLrNI~Kel~~d-~~~~~~~~g~-L~~Wa~QGVLLLN 125 (227) +..++.+|-.+-||.|+=-+ ||......|-|. +-..|+||.||.+.|+++ |.+..|.+++ |...-.||..-=+ T Consensus 232 w~~Lr~k~n~eKkVAIi~~n-yPpg~~~iG~A~----~LDv~~Sl~~iL~~L~~~GY~v~~P~~~~eL~~~i~~~~~~~~ 306 (1064) T pfam02514 232 WARLRRKPNAEKRVAIVLYN-YPPGKGNIGTAA----GLDVPASLVNLLRALKAEGYDVGLPESGDELIELLLQGRNNGS 306 (1064) T ss_pred HHHHHCCCCCCCEEEEEEEC-CCCCCCCEEEEC----CCCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHCCCCCC T ss_conf 99970587302669999857-998776610224----5373999999999999779977999999999999983766883 Q ss_pred CHHCC Q ss_conf 10112 Q gi|254781074|r 126 AVLTV 130 (227) Q Consensus 126 t~LTv 130 (227) ...+. T Consensus 307 ~~~~~ 311 (1064) T pfam02514 307 WAPGE 311 (1064) T ss_pred CCCCC T ss_conf 34571 No 20 >TIGR02966 phoR_proteo phosphate regulon sensor kinase PhoR; InterPro: IPR014310 Members of this protein are the regulatory histidine kinase PhoR associated with the phosphate ABC transporter found in most Proteobacteria, and related proteins from Gram-positive organisms are excluded. The phoR gene usually is adjacent to the response regulator phoB gene (IPR011879 from INTERPRO).. Probab=39.25 E-value=32 Score=15.96 Aligned_cols=40 Identities=28% Similarity=0.251 Sum_probs=22.0 Q ss_pred HHHHHHHHHCCCH-HHCEEEEEECCCCCCCCCCCCCEEECCCC-CCCHHH Q ss_conf 7699999842998-61779998118887752000710003479-863046 Q gi|254781074|r 44 GSHYFRAFDITPF-NKVKVVILGQDPYHNYGQAHGLCFSVPLG-IRIPPS 91 (227) Q Consensus 44 ~~~IF~af~~~p~-~~vKVVIlGQDPY~~~g~A~GLaFsv~~~-~~~p~S 91 (227) .|.|.||-+|||. ..|.|=.-.+ +.|..|||.++ .-|||. T Consensus 241 ~NLv~NAikYTp~gg~I~v~W~~~--------~~ga~fsV~DtG~GI~~e 282 (339) T TIGR02966 241 SNLVSNAIKYTPEGGTITVRWRRD--------EGGAEFSVTDTGIGIAPE 282 (339) T ss_pred HHHHHHHHHCCCCCCEEEEEEEEE--------CCEEEEEEEECCCCCCHH T ss_conf 999887530089988799999985--------780399998779898731 No 21 >COG3663 Mug G:T/U mismatch-specific DNA glycosylase [DNA replication, recombination, and repair] Probab=38.47 E-value=25 Score=16.69 Aligned_cols=143 Identities=18% Similarity=0.160 Sum_probs=69.8 Q ss_pred HHCEEEEEECCCCCCCCCCCCCEEECCCCC--C-------CHHHHHHHHHHHHHHCCCCCCCCCHHHHHHCCCEEEECCH Q ss_conf 617799981188877520007100034798--6-------3046999999999741888877401243220896886010 Q gi|254781074|r 57 NKVKVVILGQDPYHNYGQAHGLCFSVPLGI--R-------IPPSLVNVYKELQEDVNFIPPTHGFLEHWGHEGVLLLNAV 127 (227) Q Consensus 57 ~~vKVVIlGQDPY~~~g~A~GLaFsv~~~~--~-------~p~SLrNI~Kel~~d~~~~~~~~g~L~~Wa~QGVLLLNt~ 127 (227) .+.+|||+|--|= ..-.++|..|.-+.+. + .+.-|.++.-|= -..+.+|+| ...|+=+.-+. T Consensus 8 ~~~~vli~Gs~Pg-~~S~~~gfyya~P~NrFWr~l~~v~~~d~~l~~~~~e~-----~~~~~~g~l---~~~~i~~~~~~ 78 (169) T COG3663 8 PGLRVLILGSFPG-EDSRAAGFYYAHPRNRFWRVLGLVFFTDAQLEPQEYEK-----LLDYAKGFL---KLRGIGLWDVL 78 (169) T ss_pred CCCEEEEEECCCC-HHHHHCCCCCCCCHHHHHHHHHHHHCCCCCCCCHHHHH-----HHCCCCCHH---HHCCCCCHHHH T ss_conf 9876999836986-65675068778821249999988706831244446665-----232000156---64265403577 Q ss_pred HCCCCCCCHHHHHCCHHHHHHHHHHHHHCCCCCEEEE-EECCHHHHHHHCCCCCCCEEEEECCCCHHHCCCCCCCCCHHH Q ss_conf 1123686044543259999999999996178987999-975323334420576660799847888222578862787089 Q gi|254781074|r 128 LTVEEGRAASHRGRGWEQFTDSVIDLISNNHQNIVFM-LWGASAQKKQDVLDHKRHLVLKAAHPSPLSASHGFFGCRHFS 206 (227) Q Consensus 128 LTv~~g~~~SH~~~gW~~ft~~ii~~l~~~~~~vVfi-LwG~~A~~~~~~i~~~~~~il~~~HPSPls~~~gF~~~~~F~ 206 (227) .+......+|-.+. -+..-.+-++.+....+.+--+ .=|..|....+.+......+...+-+|| +.|++.--+-=+ T Consensus 79 ~~~~re~~~~~d~~-ie~~e~~d~aall~~~p~~~~i~~~g~~a~~~~~~l~~~~~~~~vlpssSp--a~R~~s~e~l~~ 155 (169) T COG3663 79 EVAKRELHNSLDKL-IEKVEANDLAALLIRGPQLREIGFTGGKAGELPQTLAGGGTETWVLPSSSP--AARGLSLEKLPS 155 (169) T ss_pred HHHHHHCCCHHHHH-HHHCCCCCHHHHHHHHHHHHHHHCCCCCCEECCCCCCCCCEEEEECCCCCH--HHHCCCHHHHHH T ss_conf 88777534515555-552342038899987468886651697650445323346434675577870--210333888899 Q ss_pred HHHHH Q ss_conf 99999 Q gi|254781074|r 207 KANRY 211 (227) Q Consensus 207 ~~N~~ 211 (227) .-+++ T Consensus 156 ~~r~l 160 (169) T COG3663 156 AWRAL 160 (169) T ss_pred HHHHH T ss_conf 99999 No 22 >pfam08558 TRF Telomere repeat binding factor (TRF). Telomere repeat binding factor (TRF) family proteins are important for the regulation of telomere stability. The two related human TRF proteins hTRF1 and hTRF2 form homodimers and bind directly to telomeric TTAGGG repeats via the myb DNA binding domain pfam00249 at the carboxy terminus. TRF1 is implicated in telomere length regulation and TRF2 in telomere protection. Other telomere complex associated proteins are recruited through their interaction with either TRF1 or TRF2. The fission yeast protein Taz1p (telomere-associated in Schizosaccharomyces pombe) has similarity to both hTRF1 and hTRF2 and may perform the dual functions of TRF1 and TRF2 at fission yeast telomeres. This domain is composed of multiple alpha helices arranged in a solenoid conformation similar to TPR repeats. The fungal members have now also been found to carry two double strand telomeric repeat binding factors. Probab=35.81 E-value=14 Score=18.24 Aligned_cols=30 Identities=30% Similarity=0.780 Sum_probs=23.5 Q ss_pred HHCCHHHHHHHHHHHHHCCCCCEEEEEECCHHH Q ss_conf 432599999999999961789879999753233 Q gi|254781074|r 139 RGRGWEQFTDSVIDLISNNHQNIVFMLWGASAQ 171 (227) Q Consensus 139 ~~~gW~~ft~~ii~~l~~~~~~vVfiLwG~~A~ 171 (227) .++-|+.|.+.+.+++..+-. ++.||+..+ T Consensus 205 ~~y~w~~fl~~l~~yv~kn~~---~l~~~~~~~ 234 (235) T pfam08558 205 QNYSWESFLKELLSYVSKNLG---LLLWGAKKR 234 (235) T ss_pred HHCCHHHHHHHHHHHHHHCCH---HHEECCCCC T ss_conf 757899999999999981634---503314557 No 23 >pfam06972 DUF1296 Protein of unknown function (DUF1296). This family represents a conserved region approximately 60 residues long within a number of plant proteins of unknown function. Probab=34.69 E-value=19 Score=17.41 Aligned_cols=29 Identities=17% Similarity=0.302 Sum_probs=23.7 Q ss_pred CHHHHHHHHHCC--CHHHCEEEEEECCCCCC Q ss_conf 876999998429--98617799981188877 Q gi|254781074|r 43 RGSHYFRAFDIT--PFNKVKVVILGQDPYHN 71 (227) Q Consensus 43 ~~~~IF~af~~~--p~~~vKVVIlGQDPY~~ 71 (227) +-.+||+++..| +|+++--=+|-|||+|. T Consensus 20 Sd~eIYa~L~ecnMDPnEtvqkLL~qD~FhE 50 (60) T pfam06972 20 SDADIYAMLKECNMDPNETVQKLLSQDTFHE 50 (60) T ss_pred CHHHHHHHHHHHCCCHHHHHHHHHCCCCHHH T ss_conf 7899999999967986799999882272689 No 24 >PRK03692 putative UDP-N-acetyl-D-mannosaminuronic acid transferase; Provisional Probab=33.81 E-value=39 Score=15.42 Aligned_cols=40 Identities=18% Similarity=0.159 Sum_probs=29.8 Q ss_pred CCHHHHHCCHHHHHHHHHHHHHCCCCCEEEEEECCHHHHH Q ss_conf 6044543259999999999996178987999975323334 Q gi|254781074|r 134 RAASHRGRGWEQFTDSVIDLISNNHQNIVFMLWGASAQKK 173 (227) Q Consensus 134 ~~~SH~~~gW~~ft~~ii~~l~~~~~~vVfiLwG~~A~~~ 173 (227) -+|+|..+.=..=.+.+++.|.+.+..+||+..|..-|.. T Consensus 136 IvG~~~Gyf~~~e~~~ii~~I~~s~pdil~VglG~PkQE~ 175 (246) T PRK03692 136 IVGSQDGYFTPEQRQALFERIHASGAKIVTVAMGSPKQEI 175 (246) T ss_pred EEEECCCCCCHHHHHHHHHHHHHCCCCEEEEECCCCHHHH T ss_conf 9984699998377799999998569999999569818999 No 25 >PRK12493 magnesium chelatase; Provisional Probab=29.07 E-value=47 Score=14.93 Aligned_cols=74 Identities=14% Similarity=0.076 Sum_probs=47.5 Q ss_pred EECCCHHHH---------HHHHHCCCHHHCEEEEEECCCCCCCCCCCCCEEECCCCCCCHHHHHHHHHHHHHH-CCCCCC Q ss_conf 085887699---------9998429986177999811888775200071000347986304699999999974-188887 Q gi|254781074|r 39 RIFPRGSHY---------FRAFDITPFNKVKVVILGQDPYHNYGQAHGLCFSVPLGIRIPPSLVNVYKELQED-VNFIPP 108 (227) Q Consensus 39 ~i~P~~~~I---------F~af~~~p~~~vKVVIlGQDPY~~~g~A~GLaFsv~~~~~~p~SLrNI~Kel~~d-~~~~~~ 108 (227) ...|-.++| +..++.+|-.+-||.|+=-+ ||......|-|. .-..|.||.||.++|+.+ |.+..| T Consensus 411 ~~~Pi~eRve~la~Ra~~W~~LR~k~n~eKKVAIIl~N-yPp~~g~iGtAa----~LDvp~Sl~~iL~~Lk~~GY~v~lP 485 (1318) T PRK12493 411 KAIPLQDRIEAIAERAIRWVRLRRKPRAEKKLAITLFS-FPPDKGNVGTAA----YLDVFGSIHRLLQELKAAGYDVDLP 485 (1318) T ss_pred CCCCCHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEEC-CCCCCCCCCCCC----CCCCHHHHHHHHHHHHHCCCCCCCC T ss_conf 61237999999999999999963587312759999417-999888734032----3574899999999999779657899 Q ss_pred CCCH-HHHHH Q ss_conf 7401-24322 Q gi|254781074|r 109 THGF-LEHWG 117 (227) Q Consensus 109 ~~g~-L~~Wa 117 (227) .+++ |...- T Consensus 486 ~~~~eL~~~l 495 (1318) T PRK12493 486 ESPEELMEAV 495 (1318) T ss_pred CCHHHHHHHH T ss_conf 9889999999 No 26 >PRK13405 bchH magnesium chelatase subunit H; Provisional Probab=28.40 E-value=49 Score=14.86 Aligned_cols=77 Identities=14% Similarity=0.077 Sum_probs=48.9 Q ss_pred EECCCHHHH---------HHHHHCCCHHHCEEEEEECCCCCCCCCCCCCEEECCCCCCCHHHHHHHHHHHHHH-CCCCCC Q ss_conf 085887699---------9998429986177999811888775200071000347986304699999999974-188887 Q gi|254781074|r 39 RIFPRGSHY---------FRAFDITPFNKVKVVILGQDPYHNYGQAHGLCFSVPLGIRIPPSLVNVYKELQED-VNFIPP 108 (227) Q Consensus 39 ~i~P~~~~I---------F~af~~~p~~~vKVVIlGQDPY~~~g~A~GLaFsv~~~~~~p~SLrNI~Kel~~d-~~~~~~ 108 (227) .+.|-++.| +..++.+|-.+-||.|+=-+ ||....+.|-|. .-..|.||.||.++|+.+ |.+..| T Consensus 408 ~~~pipeRv~~la~r~~~w~~Lr~k~n~eKKVAivl~n-yPp~~g~iGtAa----~LDv~~Sl~~iL~~Lk~~GY~v~~P 482 (1207) T PRK13405 408 DMAVCPERAEMLAARTARLVALRRSERAERKVAVVLFN-FPPNAGATGTAA----YLSVFESLFNTLRAMKAEGYTVEVP 482 (1207) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEEC-CCCCCCCCCCCC----CCCCHHHHHHHHHHHHHCCCCCCCC T ss_conf 32218999999999999999970488544558999836-998777701100----1265889999999999769987899 Q ss_pred CCCH-HHHHHCCC Q ss_conf 7401-24322089 Q gi|254781074|r 109 THGF-LEHWGHEG 120 (227) Q Consensus 109 ~~g~-L~~Wa~QG 120 (227) .+++ |...-.+| T Consensus 483 ~~~~~L~~~i~~~ 495 (1207) T PRK13405 483 ESVDALREAILGG 495 (1207) T ss_pred CCHHHHHHHHHCC T ss_conf 9999999999748 No 27 >PRK02102 ornithine carbamoyltransferase; Validated Probab=26.54 E-value=52 Score=14.65 Aligned_cols=65 Identities=18% Similarity=0.236 Sum_probs=39.2 Q ss_pred HHHHHHHCC-CHHHCEEEEEECCCCCCCCCCCCC-----------EEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCHH Q ss_conf 999998429-986177999811888775200071-----------00034798630469999999997418888774012 Q gi|254781074|r 46 HYFRAFDIT-PFNKVKVVILGQDPYHNYGQAHGL-----------CFSVPLGIRIPPSLVNVYKELQEDVNFIPPTHGFL 113 (227) Q Consensus 46 ~IF~af~~~-p~~~vKVVIlGQDPY~~~g~A~GL-----------aFsv~~~~~~p~SLrNI~Kel~~d~~~~~~~~g~L 113 (227) ++|...+.. +++.+||+|+| |..+ +.|+-+ .+..|++...+..+..+.+++..+.|....-..|+ T Consensus 142 Dl~Ti~e~~g~l~gl~i~~vG-D~~n--nVa~S~~~~~~~lG~~v~~~~P~~~~p~~~~~~~~~~~~~~~g~~i~~~~d~ 218 (331) T PRK02102 142 DFMTMKEHFGPLKGLKLAYLG-DGRN--NMANSLLVGGAKLGMDVRICAPKSLWPEEELVAQAKEIAKETGAKITITEDV 218 (331) T ss_pred HHHHHHHHHCCCCCCEEEEEC-CCCC--CHHHHHHHHHHHCCCEEEEECCCCCCCCHHHHHHHHHHHHHHCCEEEEEECH T ss_conf 999999983875677389978-8764--2466899999855985999758644889799999999999829938999566 No 28 >pfam03040 CemA CemA family. Members of this family are probable integral membrane proteins. Their molecular function is unknown. CemA proteins are found in the inner envelope membrane of chloroplasts but not in the thylakoid membrane. A cyanobacterial member of this family has been implicated in CO2 transport, but is probably not a CO2 transporter itself. They are predicted to be haem-binding however this has not been proven experimentally. Probab=26.51 E-value=27 Score=16.47 Aligned_cols=33 Identities=15% Similarity=0.481 Sum_probs=25.6 Q ss_pred CHHHHHCCHHHHHHHHHHHHH-CCCCCEEEEEEC Q ss_conf 044543259999999999996-178987999975 Q gi|254781074|r 135 AASHRGRGWEQFTDSVIDLIS-NNHQNIVFMLWG 167 (227) Q Consensus 135 ~~SH~~~gW~~ft~~ii~~l~-~~~~~vVfiLwG 167 (227) -|-|+..|||-+.+.+.++.- ..+++.|+++-. T Consensus 163 VGFHSp~GWEvil~~i~~hfGl~~n~~~I~lFVa 196 (230) T pfam03040 163 VGFHSPHGWEVLIESILEHFGLPENEQFISLFVA 196 (230) T ss_pred HCCCCCCHHHHHHHHHHHHCCCCCCCCCHHHHHH T ss_conf 4178960299999999986599866351364344 No 29 >CHL00181 cbbX CbbX; Provisional Probab=26.26 E-value=53 Score=14.62 Aligned_cols=152 Identities=12% Similarity=0.220 Sum_probs=80.0 Q ss_pred HHHHHCCHHHHHHHHHHHHHHHCCCEECCCHHHHHHHHHCCCHHHCEEEEEECCCCCCCCCCCCCEEECCCCCCCHHHHH Q ss_conf 88872658999999999999977980858876999998429986177999811888775200071000347986304699 Q gi|254781074|r 14 LENHFQSEHMRNLKEFLLSEKRKGKRIFPRGSHYFRAFDITPFNKVKVVILGQDPYHNYGQAHGLCFSVPLGIRIPPSLV 93 (227) Q Consensus 14 L~~e~~~~~~~~i~~~l~~e~~~~~~i~P~~~~IF~af~~~p~~~vKVVIlGQDPY~~~g~A~GLaFsv~~~~~~p~SLr 93 (227) |+.|+.+..+.++.+.|.++.- | .=.-+..|-.--.....+..|- -.|..+- .+..-+.|.=|+|+- ..+.- T Consensus 5 ~~~~~~~~~lee~L~eLd~eli-G--L~~VK~~v~~l~~~~~~~~~R~-~~Gl~~~---~~s~h~vF~GnPGTG-KTTVA 76 (287) T CHL00181 5 LQEEYNKTQIQEVLDELDEELI-G--LVPVKTRIREIAALLLVDRLRK-NLGLVSS---SPGLHMSFTGSPGTG-KTTVA 76 (287) T ss_pred HHHHHHHCCHHHHHHHHHHHCC-C--HHHHHHHHHHHHHHHHHHHHHH-HCCCCCC---CCCCEEEEECCCCCC-HHHHH T ss_conf 6888645349999999988646-9--6999999999999999999999-8799988---876538887899867-99999 Q ss_pred HHHHHHHHHCCCCCCC---------------------CCHHHHHHCCCEEEECCHHCCCCCCCHHHHHCCHHHHHHHHHH Q ss_conf 9999999741888877---------------------4012432208968860101123686044543259999999999 Q gi|254781074|r 94 NVYKELQEDVNFIPPT---------------------HGFLEHWGHEGVLLLNAVLTVEEGRAASHRGRGWEQFTDSVID 152 (227) Q Consensus 94 NI~Kel~~d~~~~~~~---------------------~g~L~~Wa~QGVLLLNt~LTv~~g~~~SH~~~gW~~ft~~ii~ 152 (227) .++-.+-.++|+.... .-..-.=|.-|||++.-+-|...+ +++..+| +.-.+.+++ T Consensus 77 Rl~a~il~~lG~L~~g~vve~~r~dLvg~yvG~Ta~kt~~~i~~a~GGVLfIDEAY~L~~~--~~~~dfg-~eaidtLl~ 153 (287) T CHL00181 77 LKMADILYRLGYIKKGHLITVTRDDLVGQYIGHTAPKTKEVLKKAMGGVLFIDEAYYLYKP--DNERDYG-AEAIEILLQ 153 (287) T ss_pred HHHHHHHHHCCCCCCCEEEEECHHHHCCCCCCCCHHHHHHHHHHCCCCEEEEECHHHHCCC--CCCCCCH-HHHHHHHHH T ss_conf 9999999986995589589953588416353521699999999645987998244653578--8999837-999999999 Q ss_pred HHHCCCCCEEEEEECCHHHHHHHCC Q ss_conf 9961789879999753233344205 Q gi|254781074|r 153 LISNNHQNIVFMLWGASAQKKQDVL 177 (227) Q Consensus 153 ~l~~~~~~vVfiLwG~~A~~~~~~i 177 (227) .+......+|+|+=|...+ +..++ T Consensus 154 ~me~~~~~lvvI~AGY~~e-M~~fl 177 (287) T CHL00181 154 VMENQRDDLVVIFAGYKDR-MDKFY 177 (287) T ss_pred HHHHCCCCEEEEEECCHHH-HHHHH T ss_conf 9870799889998467899-99999 No 30 >KOG3325 consensus Probab=26.25 E-value=23 Score=16.85 Aligned_cols=63 Identities=35% Similarity=0.533 Sum_probs=29.2 Q ss_pred HHCEEEEEECCCCCCCCCCCCCEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCHHHHHHCCCEEEECC Q ss_conf 6177999811888775200071000347986304699999999974188887740124322089688601 Q gi|254781074|r 57 NKVKVVILGQDPYHNYGQAHGLCFSVPLGIRIPPSLVNVYKELQEDVNFIPPTHGFLEHWGHEGVLLLNA 126 (227) Q Consensus 57 ~~vKVVIlGQDPY~~~g~A~GLaFsv~~~~~~p~SLrNI~Kel~~d~~~~~~~~g~L~~Wa~QGVLLLNt 126 (227) -+.|||-+||= ..|.-+|-- -+|.+ -|.||--+-.+|-.|+=...-++. .+.....|-+..|- T Consensus 70 P~~kvvtvGqf---kIG~chGhq-ViP~g--d~~sL~~LaRqldvDILl~G~Th~-f~Aye~eg~ffvnP 132 (183) T KOG3325 70 PENKVVTVGQF---KIGLCHGHQ-VIPWG--DPESLALLARQLDVDILLTGHTHK-FEAYEHEGKFFVNP 132 (183) T ss_pred CCCCEEEECCE---EEEEECCCE-EECCC--CHHHHHHHHHHCCCCEEEECCCEE-EEEEEECCCEEECC T ss_conf 76435885417---887654757-62389--989999999754975999678135-89998478099678 No 31 >pfam00121 TIM Triosephosphate isomerase. Probab=25.93 E-value=30 Score=16.15 Aligned_cols=67 Identities=13% Similarity=0.024 Sum_probs=36.1 Q ss_pred HHHHHHHHHHCCHHHHHHHHHHHHHHHCCCEECCCHHHHHHHHHCCCHHHCEEEEEECCCCCC-CCCCCC Q ss_conf 389998887265899999999999997798085887699999842998617799981188877-520007 Q gi|254781074|r 9 SWKSLLENHFQSEHMRNLKEFLLSEKRKGKRIFPRGSHYFRAFDITPFNKVKVVILGQDPYHN-YGQAHG 77 (227) Q Consensus 9 sW~~~L~~e~~~~~~~~i~~~l~~e~~~~~~i~P~~~~IF~af~~~p~~~vKVVIlGQDPY~~-~g~A~G 77 (227) .|+..........+++++...+.........|+|+.-.+-.+-+... +.++-+-+||-|.. .|--+| T Consensus 6 NWKMn~~~~~~~~~~~~~~~~~~~~~~~~v~i~Pp~~~L~~~~~~~~--~~~i~igaQn~~~~~~Ga~TG 73 (243) T pfam00121 6 NWKMNKTLAEALELAKALLAAALKNDGVEVVVAPPFTYLSPVSQALK--GSNIKVGAQNVSAEDSGAFTG 73 (243) T ss_pred ECCCCCCHHHHHHHHHHHHHHCCCCCCCEEEEECCHHHHHHHHHHHC--CCCCEEEECCCCCCCCCCCCC T ss_conf 41658999999999999986346778956999799999999999845--899679832354567888747 No 32 >TIGR02884 spore_pdaA delta-lactam-biosynthetic de-N-acetylase; InterPro: IPR014235 Muramic delta-lactam is an unusual constituent of peptidoglycan, found only in bacterial spores, in the peptidoglycan wall or spore cortex. This entry represents a subset of the larger polysaccharide deacetylase family that is specifically involved in delta-lactam biosynthesis. PdaA from Bacillus subtilis acts immediately after CwlD, an N-acetylmuramoyl-L-alanine amidase and performs a de-N-acetylation. PdaA may also perform the subsequent transpeptidation that leads to lactam ring formation, as heterologous expression in Escherichia coli of CwlD and PdaA together is sufficient for delta-lactam production.. Probab=25.35 E-value=55 Score=14.52 Aligned_cols=58 Identities=24% Similarity=0.453 Sum_probs=40.9 Q ss_pred HHCCCCCCCHHHHHCCHHHHHHHHHHHHHCCCCCEEEEEEC---CHHHHHHHCCCCCCCEE--EEECCCCH Q ss_conf 01123686044543259999999999996178987999975---32333442057666079--98478882 Q gi|254781074|r 127 VLTVEEGRAASHRGRGWEQFTDSVIDLISNNHQNIVFMLWG---ASAQKKQDVLDHKRHLV--LKAAHPSP 192 (227) Q Consensus 127 ~LTv~~g~~~SH~~~gW~~ft~~ii~~l~~~~~~vVfiLwG---~~A~~~~~~i~~~~~~i--l~~~HPSP 192 (227) .||..+|== .+ .|..||+.|=+++-+-.|.-=| +.+..+.+....+-|+| -...|||= T Consensus 40 YLTFDnGYE---nG-----YT~kILDVLKe~~V~AaFFVTghY~K~~pdLvKRM~~EGHivGNHS~hHPsl 102 (225) T TIGR02884 40 YLTFDNGYE---NG-----YTPKILDVLKENKVPAAFFVTGHYIKTQPDLVKRMVDEGHIVGNHSVHHPSL 102 (225) T ss_pred EEECCCCCC---CC-----CCHHHHHHHHCCCCCEEEEEECCCCCCCHHHHHHHHHCCCEECCCCCCCCCH T ss_conf 851435533---57-----7033423320378831146407731476676665554583432565577555 No 33 >PRK01816 hypothetical protein; Provisional Probab=24.82 E-value=56 Score=14.47 Aligned_cols=23 Identities=26% Similarity=0.530 Sum_probs=18.5 Q ss_pred ECCCCCCCHHHHHHHHHHHHHHC Q ss_conf 03479863046999999999741 Q gi|254781074|r 81 SVPLGIRIPPSLVNVYKELQEDV 103 (227) Q Consensus 81 sv~~~~~~p~SLrNI~Kel~~d~ 103 (227) +....+++||||.+.|.||..-+ T Consensus 85 GkRa~t~LPp~l~~Wy~ei~~kl 107 (145) T PRK01816 85 GKRSNTPLPPALLNWFYEVREKL 107 (145) T ss_pred HHHCCCCCCHHHHHHHHHHHHHH T ss_conf 11118979879999999999999 No 34 >cd03186 GST_C_SspA GST_N family, Stringent starvation protein A (SspA) subfamily; SspA is a RNA polymerase (RNAP)-associated protein required for the lytic development of phage P1 and for stationary phase-induced acid tolerance of E. coli. It is implicated in survival during nutrient starvation. SspA adopts the GST fold with an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, but it does not bind glutathione (GSH) and lacks GST activity. SspA is highly conserved among gram-negative bacteria. Related proteins found in Neisseria (called RegF), Francisella and Vibrio regulate the expression of virulence factors necessary for pathogenesis. Probab=24.66 E-value=25 Score=16.60 Aligned_cols=49 Identities=27% Similarity=0.434 Sum_probs=32.4 Q ss_pred CCHHHHHHHCCCCCCCEEEEECCCCHHHCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCC Q ss_conf 532333442057666079984788822257886278708999999998788974650889 Q gi|254781074|r 167 GASAQKKQDVLDHKRHLVLKAAHPSPLSASHGFFGCRHFSKANRYLQEHGKTTVNWQIPL 226 (227) Q Consensus 167 G~~A~~~~~~i~~~~~~il~~~HPSPls~~~gF~~~~~F~~~N~~L~~~~~~~IdW~lp~ 226 (227) ++.|.+.++.+.. . |.+. +|+=+.+.||.+.-|+-+.-+|. ||=|+||. T Consensus 30 ~k~~~~ark~L~e--~--L~~l--~p~F~~~pyFmsdEFslvDc~la-----PlLWRL~~ 78 (107) T cd03186 30 KKEAEKARKELRE--S--LLAL--APVFAHKPYFMSEEFSLVDCALA-----PLLWRLPA 78 (107) T ss_pred HHHHHHHHHHHHH--H--HHHH--HHHHCCCCCCCCCCCHHHHHHHH-----HHHHHHHH T ss_conf 6789999999999--9--9999--99980484402265208899998-----99997012 No 35 >PRK00042 tpiA triosephosphate isomerase; Provisional Probab=23.87 E-value=35 Score=15.77 Aligned_cols=70 Identities=14% Similarity=0.089 Sum_probs=40.9 Q ss_pred CCHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCEECCCHHHHHHHHHCCCHHHCEEEEEECCCCCC-CCCCCC Q ss_conf 873389998887265899999999999997798085887699999842998617799981188877-520007 Q gi|254781074|r 6 IHESWKSLLENHFQSEHMRNLKEFLLSEKRKGKRIFPRGSHYFRAFDITPFNKVKVVILGQDPYHN-YGQAHG 77 (227) Q Consensus 6 i~~sW~~~L~~e~~~~~~~~i~~~l~~e~~~~~~i~P~~~~IF~af~~~p~~~vKVVIlGQDPY~~-~g~A~G 77 (227) |=-.|+.-........+++++...+.........|+||.-.+-.+.... .+.++-+-.||-++. .|--+| T Consensus 6 IiaNWKmn~~~~~~~~~~~~l~~~~~~~~~~~v~i~Pp~~~L~~~~~~~--~~~~i~igaQn~~~~~~Ga~TG 76 (251) T PRK00042 6 IAGNWKMNKTLAEAKELVEELAALLPDADGVEVAVAPPFTYLSSVKEAL--EGSNLKLGAQNVHFEDSGAFTG 76 (251) T ss_pred EEEECCCCCCHHHHHHHHHHHHHHCCCCCCCEEEEECCHHHHHHHHHHH--CCCCCEEEEEECCCCCCCCCCC T ss_conf 9997354889999999999999746666898799979999999999982--7999679972578777767757 No 36 >CHL00043 cemA envelope membrane protein Probab=23.38 E-value=30 Score=16.17 Aligned_cols=33 Identities=21% Similarity=0.431 Sum_probs=25.3 Q ss_pred CHHHHHCCHHHHHHHHHHHHH-CCCCCEEEEEEC Q ss_conf 044543259999999999996-178987999975 Q gi|254781074|r 135 AASHRGRGWEQFTDSVIDLIS-NNHQNIVFMLWG 167 (227) Q Consensus 135 ~~SH~~~gW~~ft~~ii~~l~-~~~~~vVfiLwG 167 (227) -|-|+..|||-+.+.+.++.- ..+++.|+++-+ T Consensus 166 vGFHSphGWEvli~~i~~hfGl~~n~~~I~~fVa 199 (233) T CHL00043 166 IGFHSPHGWELLIGSIYKHFGFPHNDQIISLLVS 199 (233) T ss_pred HCCCCCCHHHHHHHHHHHHCCCCCCCCCHHHHHH T ss_conf 5178960199999999986599866350466666 No 37 >pfam00551 Formyl_trans_N Formyl transferase. This family includes the following members. Glycinamide ribonucleotide transformylase catalyses the third step in de novo purine biosynthesis, the transfer of a formyl group to 5'-phosphoribosylglycinamide. Formyltetrahydrofolate deformylase produces formate from formyl- tetrahydrofolate. Methionyl-tRNA formyltransferase transfers a formyl group onto the amino terminus of the acyl moiety of the methionyl aminoacyl-tRNA. Inclusion of the following members is supported by PSI-blast. HOXX_BRAJA (P31907) contains a related domain of unknown function. PRTH_PORGI (P46071) contains a related domain of unknown function. Y09P_MYCTU (Q50721) contains a related domain of unknown function. Probab=22.92 E-value=61 Score=14.23 Aligned_cols=54 Identities=22% Similarity=0.277 Sum_probs=30.4 Q ss_pred HHHHHHHHHHHHCCCCCEEEEEECCHHHHHHHCCCCCCCEEEEECCCCHHHCCCCC Q ss_conf 99999999999617898799997532333442057666079984788822257886 Q gi|254781074|r 144 EQFTDSVIDLISNNHQNIVFMLWGASAQKKQDVLDHKRHLVLKAAHPSPLSASHGF 199 (227) Q Consensus 144 ~~ft~~ii~~l~~~~~~vVfiLwG~~A~~~~~~i~~~~~~il~~~HPSPls~~~gF 199 (227) +.+-..+++.|.+.+..+|+ +-|-.-.--..+++.-++.++ --|||+|-.+||. T Consensus 65 ~~~~~~~~~~l~~~~~Dliv-~~g~~~il~~~~~~~~~~~~i-N~HpslLP~~~G~ 118 (181) T pfam00551 65 SQFDSELADSLAALAPDLIV-LAGYMRILPPEFLQAFPGKIL-NIHPSLLPRFRGA 118 (181) T ss_pred HHHHHHHHHHHHHHCCCEEE-EECHHHHCCHHHHHHCCCCEE-EECCCCCCCCCCC T ss_conf 46189999999974999999-801633569789700668879-9585405288890 No 38 >KOG2165 consensus Probab=22.15 E-value=40 Score=15.38 Aligned_cols=47 Identities=30% Similarity=0.487 Sum_probs=30.3 Q ss_pred CCHHHHHHHHH--------HHHHHCCCCCCC-CCHHHHHHCCCEEEECC------HHCCCCC Q ss_conf 63046999999--------999741888877-40124322089688601------0112368 Q gi|254781074|r 87 RIPPSLVNVYK--------ELQEDVNFIPPT-HGFLEHWGHEGVLLLNA------VLTVEEG 133 (227) Q Consensus 87 ~~p~SLrNI~K--------el~~d~~~~~~~-~g~L~~Wa~QGVLLLNt------~LTv~~g 133 (227) +..+++-+-|. ++..-++...+. .--|+-|-+||||.-++ ++|+.+. T Consensus 602 p~qA~iI~~Fqek~twt~eelse~l~ip~~~lrrrL~fWi~~GvL~e~~~~s~tgt~T~iEs 663 (765) T KOG2165 602 PEQAAIINLFQEKNTWTLEELSESLGIPVPALRRRLSFWIQKGVLREEPIISDTGTLTVIES 663 (765) T ss_pred HHHHHHHHHHCCCCCCCHHHHHHHHCCCHHHHHHHHHHHHHCCEEECCCCCCCCCEEEECCC T ss_conf 89999999964745243999998868997999999999997482214788788761463322 No 39 >KOG4706 consensus Probab=21.97 E-value=58 Score=14.37 Aligned_cols=35 Identities=17% Similarity=0.219 Sum_probs=24.0 Q ss_pred CCCCCEEECCCCCCCHHHHHHHHHHHHHHCCCCCC Q ss_conf 00071000347986304699999999974188887 Q gi|254781074|r 74 QAHGLCFSVPLGIRIPPSLVNVYKELQEDVNFIPP 108 (227) Q Consensus 74 ~A~GLaFsv~~~~~~p~SLrNI~Kel~~d~~~~~~ 108 (227) .-+||||.+|+.+++|.--+.|+-+-.-|.++..+ T Consensus 54 ~eMGlA~dpN~Avpi~~~kk~~~~a~~i~~~v~~~ 88 (196) T KOG4706 54 REMGLAFDPNDAVPIAEHKKEIIDAEPIDIVVPKP 88 (196) T ss_pred HHHCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCC T ss_conf 98432258654445156666552000231056651 No 40 >COG3219 Uncharacterized protein conserved in bacteria [Function unknown] Probab=21.86 E-value=64 Score=14.10 Aligned_cols=34 Identities=24% Similarity=0.414 Sum_probs=19.2 Q ss_pred HHHHHHHHHHHHHHCCCCCCCC------CHHHHHHCCCEEEE Q ss_conf 0469999999997418888774------01243220896886 Q gi|254781074|r 89 PPSLVNVYKELQEDVNFIPPTH------GFLEHWGHEGVLLL 124 (227) Q Consensus 89 p~SLrNI~Kel~~d~~~~~~~~------g~L~~Wa~QGVLLL 124 (227) |-||++...+|..-. ..+.+ +-++.|..|||++= T Consensus 193 p~Sl~~ll~aLa~~~--s~~~~~~~~l~~~w~~w~~q~Vi~p 232 (237) T COG3219 193 PLSLDTLLQALAEFM--SADDSLKPILLQLWSGWTEQGVIIP 232 (237) T ss_pred CCCHHHHHHHHHHHC--CCCHHHHHHHHHHHHHHHHCCEECC T ss_conf 454689999999754--5724577999999986676372034 No 41 >cd00311 TIM Triosephosphate isomerase (TIM) is a glycolytic enzyme that catalyzes the interconversion of dihydroxyacetone phosphate and D-glyceraldehyde-3-phosphate. The reaction is very efficient and requires neither cofactors nor metal ions. TIM, usually homodimeric, but in some organisms tetrameric, is ubiqitous and conserved in function across eukaryotes, bacteria and archaea. Probab=21.25 E-value=43 Score=15.16 Aligned_cols=62 Identities=15% Similarity=0.027 Sum_probs=36.6 Q ss_pred HHHHHHHHHHCCHHHHHHHHHHHHHHHCCCEECCCHHHHHHHHHCCCHHHCEEEEEECCCCCCC Q ss_conf 3899988872658999999999999977980858876999998429986177999811888775 Q gi|254781074|r 9 SWKSLLENHFQSEHMRNLKEFLLSEKRKGKRIFPRGSHYFRAFDITPFNKVKVVILGQDPYHNY 72 (227) Q Consensus 9 sW~~~L~~e~~~~~~~~i~~~l~~e~~~~~~i~P~~~~IF~af~~~p~~~vKVVIlGQDPY~~~ 72 (227) .|+..........+++.+...+.........|+|+.-.+-.+.+.. .+.++-+-.||-++.. T Consensus 6 NWKmn~~~~~~~~~~~~~~~~~~~~~~~~v~i~Pp~~~L~~~~~~~--~~~~i~vgAQn~~~~~ 67 (242) T cd00311 6 NWKMNGTLAEALELAKALNAVLKDESGVEVVVAPPFTYLAAVAEAL--EGSKIKVGAQNVSPED 67 (242) T ss_pred ECCCCCCHHHHHHHHHHHHHHCCCCCCCEEEEECCHHHHHHHHHHH--CCCCCEEEECCCCCCC T ss_conf 4144899999999999998625676897699989989999999983--5898469840143456 Done!