Query gi|254780995|ref|YP_003065408.1| Endonuclease/exonuclease/phosphatase [Candidatus Liberibacter asiaticus str. psy62] Match_columns 125 No_of_seqs 109 out of 919 Neff 9.4 Searched_HMMs 39220 Date Mon May 30 04:09:29 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780995.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 TIGR00633 xth exodeoxyribonucl 99.8 5.1E-21 1.3E-25 132.6 6.4 113 5-119 136-279 (279) 2 TIGR00195 exoDNase_III exodeox 99.8 3.4E-21 8.7E-26 133.5 5.4 111 6-118 141-281 (281) 3 COG0708 XthA Exonuclease III [ 99.7 1E-18 2.7E-23 120.2 5.2 113 6-120 126-261 (261) 4 PRK11756 exonuclease III; Prov 99.7 6.6E-18 1.7E-22 115.9 5.8 113 6-120 129-268 (268) 5 PRK13911 hypothetical protein; 99.6 2.6E-15 6.5E-20 102.1 6.5 106 6-118 123-249 (250) 6 COG3568 ElsH Metal-dependent h 99.5 1.5E-13 3.9E-18 92.6 6.5 109 3-121 149-258 (259) 7 pfam03372 Exo_endo_phos Endonu 99.4 1.9E-12 4.9E-17 86.7 7.1 113 4-118 133-255 (255) 8 TIGR03395 sphingomy sphingomye 99.3 6.6E-12 1.7E-16 83.9 7.6 115 2-117 158-282 (283) 9 COG2374 Predicted extracellula 99.2 7.3E-11 1.9E-15 78.3 7.1 116 3-123 655-791 (798) 10 smart00476 DNaseIc deoxyribonu 99.1 3.6E-10 9.2E-15 74.6 6.7 113 4-119 158-276 (276) 11 PRK05421 hypothetical protein; 98.9 2.4E-09 6.1E-14 70.2 6.3 97 2-123 157-254 (254) 12 COG3021 Uncharacterized protei 98.8 1.2E-08 3.1E-13 66.4 5.6 100 7-121 209-308 (309) 13 KOG3873 consensus 98.0 1.2E-05 3.1E-10 50.3 5.8 115 5-122 155-294 (422) 14 KOG0620 consensus 97.9 1.1E-05 2.7E-10 50.7 4.2 104 19-123 220-354 (361) 15 KOG2338 consensus 97.7 5.5E-05 1.4E-09 46.8 4.8 38 4-41 273-313 (495) 16 KOG0566 consensus 97.6 0.00013 3.4E-09 44.8 5.2 19 103-121 814-832 (1080) 17 COG5239 CCR4 mRNA deadenylase, 97.4 0.00015 3.7E-09 44.5 3.7 102 22-125 244-370 (378) 18 smart00128 IPPc Inositol polyp 97.4 0.0004 1E-08 42.2 5.7 21 103-123 287-307 (310) 19 PTZ00297 pantothenate kinase; 97.1 0.0019 4.8E-08 38.6 6.2 119 4-123 170-317 (1452) 20 KOG1294 consensus 96.7 0.0035 9E-08 37.2 5.1 112 4-120 193-335 (335) 21 KOG2756 consensus 96.0 0.0065 1.7E-07 35.7 3.2 107 3-119 228-348 (349) 22 COG5411 Phosphatidylinositol 5 94.9 0.045 1.1E-06 31.3 4.4 17 106-122 312-328 (460) 23 KOG1976 consensus 72.5 4 0.0001 20.8 3.1 36 3-38 200-235 (391) 24 PTZ00312 inositol-1,4,5-tripho 65.1 8.8 0.00023 19.0 3.6 21 16-36 124-144 (356) 25 COG3459 Cellobiose phosphoryla 50.1 12 0.0003 18.3 2.1 30 6-35 718-747 (1056) 26 TIGR00619 sbcd nuclease SbcCD, 40.4 31 0.00079 16.1 3.3 39 1-39 29-68 (275) 27 TIGR02339 thermosome_arch ther 39.5 32 0.00082 16.0 3.5 25 2-26 264-288 (522) 28 TIGR01759 MalateDH-SF1 malate 35.5 34 0.00087 15.9 2.6 23 1-27 107-129 (329) 29 pfam10230 DUF2305 Uncharacteri 34.2 39 0.001 15.5 3.8 26 2-27 58-85 (260) 30 cd02855 Glycogen_branching_enz 33.3 27 0.0007 16.4 1.8 15 21-35 33-47 (106) 31 PRK05340 UDP-2,3-diacylglucosa 30.9 45 0.0011 15.2 3.0 30 4-33 17-46 (240) 32 cd02854 Glycogen_branching_enz 29.5 31 0.00079 16.1 1.6 16 19-34 15-30 (99) 33 pfam11288 DUF3089 Protein of u 26.9 53 0.0014 14.8 3.0 20 8-28 83-102 (201) 34 COG2908 Uncharacterized protei 26.7 54 0.0014 14.8 2.7 36 4-39 14-49 (237) 35 pfam05193 Peptidase_M16_C Pept 26.0 55 0.0014 14.7 2.5 26 6-31 4-30 (182) 36 TIGR02815 agaS_fam putative su 25.4 57 0.0014 14.7 3.0 27 1-27 22-51 (374) 37 TIGR01931 cysJ sulfite reducta 24.8 58 0.0015 14.6 2.8 28 4-32 83-110 (628) 38 cd02859 AMPKbeta_GBD_like AMP- 24.2 40 0.001 15.5 1.4 15 19-33 11-25 (79) 39 COG1640 MalQ 4-alpha-glucanotr 22.9 64 0.0016 14.4 3.3 39 2-45 208-246 (520) 40 cd02861 E_set_proteins_like E 22.1 52 0.0013 14.9 1.6 15 19-33 12-26 (82) 41 TIGR01734 D-ala-DACP-lig D-ala 22.0 67 0.0017 14.3 3.4 29 2-30 31-61 (513) 42 TIGR01515 branching_enzym 1,4- 21.3 69 0.0018 14.2 2.4 38 19-60 38-75 (608) No 1 >TIGR00633 xth exodeoxyribonuclease III (xth); InterPro: IPR004808 All proteins in this family for which functions are known are 5' AP endonucleases that function in base excision repair and the repair of abasic sites in DNA.; GO: 0004518 nuclease activity, 0006281 DNA repair. Probab=99.83 E-value=5.1e-21 Score=132.58 Aligned_cols=113 Identities=19% Similarity=0.232 Sum_probs=84.7 Q ss_pred HHHHHHHHHHHHHCC--CCCEEEEEECCCCCCCCCCCCHH---------------------HHHCCCC----CCCHHHCC Q ss_conf 999999999985116--99889996147776883100278---------------------7632655----41000025 Q gi|254780995|r 5 QGEWLKKWADQKIKT--GIPFVIAGDFNRKINSIGDTDDF---------------------WQKMDPD----GLLIRFPQ 57 (125) Q Consensus 5 Q~~~L~~~i~~~~~~--~~pvIi~GDFN~~~~~~~~~~~~---------------------~~~l~~~----~~~~~~~~ 57 (125) .-+.+.+......+. +.|||||||||++|.+.+++++. +..|... ..++.+.+ T Consensus 136 ~w~~~~~~~~~~l~~~~~~pv~~cGD~NvAh~eiDl~~P~~NRKr~l~~~~~gF~~~ER~~~~~ll~~ggf~DtFR~~~p 215 (279) T TIGR00633 136 FWEALLQYYLKELDASDGKPVIICGDLNVAHTEIDLANPKENRKRWLRKGNAGFTPEEREAFDELLEAGGFVDTFRHFNP 215 (279) T ss_pred HHHHHHHHHHHHHHHCCCCCEEEECCCCCCCCCCCCCCCCCHHHHHHHHCCCCCCHHHHHHHHHHHHCCCCEEEECCCCC T ss_conf 99999999999986179997999847788977112457750247998818898788999999999861796487036588 Q ss_pred CC-CCCCCCCCCC-CCCCCCCCCCCHHHHCCCCCCC--EEEEECCCCCCCCCCCCCCCCCEEEEEE Q ss_conf 76-7687103887-5543335521012100257321--1543024741125867887564999996 Q gi|254780995|r 58 EK-ESTCNVIKRN-KSSLDYFVIDRDNKNFLIDNSF--SIVSYDQSDLDTRRSKLSTHCPLTIEYD 119 (125) Q Consensus 58 ~~-~~~~~~~~~~-~~~~d~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~SDH~PV~~~l~ 119 (125) +. ..+|||+++. .++.+.+ |.|||+++++..+ ++....+.....+...|||||||+++|+ T Consensus 216 ~~~~~YTwW~YR~K~ar~~N~--GWRiDy~l~s~~l~~~v~d~~I~~~~rg~ekPSDHcPi~l~~~ 279 (279) T TIGR00633 216 DTEGAYTWWDYRSKRARDRNR--GWRIDYFLVSEPLAARVVDSYIDSEVRGMEKPSDHCPIVLELD 279 (279) T ss_pred CCCCCEEEEECCCCCCCCCCC--CCEEEEEECCHHHHHHHHHHHCCCHHHCCCCCCCCCCEEEEEC T ss_conf 879731254002367724673--4145326538689986510001303452788887355477759 No 2 >TIGR00195 exoDNase_III exodeoxyribonuclease III; InterPro: IPR000097 DNA damaging agents such as the antitumor drugs bleomycin and neocarzinostatin or those that generate oxygen radicals produce a variety of lesions in DNA. Amongst these is base-loss which forms apurinic/apyrimidinic (AP) sites or strand breaks with atypical 3' termini. DNA repair at the AP sites is initiated by specific endonuclease cleavage of the phosphodiester backbone. Such endonucleases are also generally capable of removing blocking groups from the 3' terminus of DNA strand breaks. AP endonucleases can be classified into two families based on sequence similarity. This family contains members of AP endonuclease family 1. Except for Rrp1 and arp, these enzymes are proteins of about 300 amino-acid residues. Rrp1 and arp both contain additional and unrelated sequences in their N-terminal section (about 400 residues for Rrp1 and 270 for arp). The proteins contain glutamate which has been shown , in the Escherichia coli enzyme to bind a divalent metal ion such as magnesium or manganese.; GO: 0003677 DNA binding, 0004519 endonuclease activity, 0006281 DNA repair, 0005622 intracellular. Probab=99.83 E-value=3.4e-21 Score=133.51 Aligned_cols=111 Identities=19% Similarity=0.220 Sum_probs=87.1 Q ss_pred HHHHHHHHHHHHCCCCC-EEEEEECCCCCCCCCCCCHH---------------------HHHCCCCC----CCHHHCCC- Q ss_conf 99999999985116998-89996147776883100278---------------------76326554----10000257- Q gi|254780995|r 6 GEWLKKWADQKIKTGIP-FVIAGDFNRKINSIGDTDDF---------------------WQKMDPDG----LLIRFPQE- 58 (125) Q Consensus 6 ~~~L~~~i~~~~~~~~p-vIi~GDFN~~~~~~~~~~~~---------------------~~~l~~~~----~~~~~~~~- 58 (125) -+.|.+|+++....+.| ||||||||++|.+.+.+.+. ++.+++.+ .++.+.++ T Consensus 141 ~~~l~~yl~~~~~~~~p~v~~CGD~NiAp~eiDl~~P~~nrnRWL~~GKCSF~~~ER~~~~~~l~~Gn~vDtFR~~~p~~ 220 (281) T TIGR00195 141 LEALQNYLEKLRDKDKPEVIICGDFNIAPEEIDLHSPDENRNRWLHAGKCSFLPEEREWLDRLLEAGNFVDTFRKFNPDK 220 (281) T ss_pred HHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCEEEECCCCCCC T ss_conf 99999999998636898899976778898613567885111331333321388899999999986006133200005688 Q ss_pred CCCCCCCCCCCC-CCCCCCCCCCHHHHCCCCCCC--EEEEECCCCCCCCCCCCCCCCCEEEEE Q ss_conf 676871038875-543335521012100257321--154302474112586788756499999 Q gi|254780995|r 59 KESTCNVIKRNK-SSLDYFVIDRDNKNFLIDNSF--SIVSYDQSDLDTRRSKLSTHCPLTIEY 118 (125) Q Consensus 59 ~~~~~~~~~~~~-~~~d~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~SDH~PV~~~l 118 (125) ...+|||+++.. ++-+.+ |.|||++++++.+ ++..+.+.....+.++|||||||+++| T Consensus 221 ~~~YtWwdYR~~Ga~drN~--GWRID~~l~s~~L~~~~~~~~I~~d~rg~EKPSDH~Pv~l~~ 281 (281) T TIGR00195 221 EGAYTWWDYRTRGARDRNR--GWRIDYFLVSEPLKERCVDCLIDKDVRGSEKPSDHCPVVLEI 281 (281) T ss_pred CCCCCEEECCCCCCCCCCC--HHHHHHHHHCHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEC T ss_conf 8940033113368502462--243355330848988898764121300168888756526658 No 3 >COG0708 XthA Exonuclease III [DNA replication, recombination, and repair] Probab=99.75 E-value=1e-18 Score=120.22 Aligned_cols=113 Identities=19% Similarity=0.240 Sum_probs=85.0 Q ss_pred HHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCCCHH------------------HHHCCCCC---CCHHHCCCCCCCCC Q ss_conf 9999999998511699889996147776883100278------------------76326554---10000257676871 Q gi|254780995|r 6 GEWLKKWADQKIKTGIPFVIAGDFNRKINSIGDTDDF------------------WQKMDPDG---LLIRFPQEKESTCN 64 (125) Q Consensus 6 ~~~L~~~i~~~~~~~~pvIi~GDFN~~~~~~~~~~~~------------------~~~l~~~~---~~~~~~~~~~~~~~ 64 (125) .+.|++|+++....+.|+|+|||||+.|.+.+.+++. +..+...+ .++...++...++| T Consensus 126 ~~~l~~~l~~l~~~~~~~vl~GD~NIap~~iDv~~~~~~~~n~~~~~f~~eeR~~~~~ll~~G~~D~~R~~~p~~~~YTw 205 (261) T COG0708 126 LDALRNYLEELLKKGKPVVLCGDFNIAPEEIDVANPKKRWLNEGNSGFLPEERAWFRRLLNAGFVDTFRLFHPEPEKYTW 205 (261) T ss_pred HHHHHHHHHHHHHCCCCEEEECCCCCCCCHHCCCCCHHHHHCCCCCCCCHHHHHHHHHHHHCCHHHHHHHHCCCCCCCCC T ss_conf 99999999998745998799666311785321257255540168889898999999999875306456764899996351 Q ss_pred CCCCCCCCCCCCCCCCHHHHCCCCCCC--EEEEECCCCCCCCCCCCCCCCCEEEEEEC Q ss_conf 038875543335521012100257321--15430247411258678875649999961 Q gi|254780995|r 65 VIKRNKSSLDYFVIDRDNKNFLIDNSF--SIVSYDQSDLDTRRSKLSTHCPLTIEYDF 120 (125) Q Consensus 65 ~~~~~~~~~d~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~SDH~PV~~~l~f 120 (125) |+++.+..... .+.|||++++++.+ ++..+.+.....+..+|||||||+++|++ T Consensus 206 W~YR~~~~~~n--~G~RID~~l~S~~L~~~~~~a~I~~~~rg~e~pSDHaPV~~e~~~ 261 (261) T COG0708 206 WDYRANAARRN--RGWRIDYILVSPALADRLKDAGIDREVRGWEKPSDHAPVWVELDL 261 (261) T ss_pred CCCCCCHHHHC--CCEEEEEEEECHHHHHHHHHCCCCHHHHCCCCCCCCCCEEEEECC T ss_conf 25621305663--860688897189999888666863788558888885757999629 No 4 >PRK11756 exonuclease III; Provisional Probab=99.72 E-value=6.6e-18 Score=115.93 Aligned_cols=113 Identities=14% Similarity=0.115 Sum_probs=77.9 Q ss_pred HHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCCCH---------------------HHHHCCCCC---CCHHHC-CCCC Q ss_conf 999999999851169988999614777688310027---------------------876326554---100002-5767 Q gi|254780995|r 6 GEWLKKWADQKIKTGIPFVIAGDFNRKINSIGDTDD---------------------FWQKMDPDG---LLIRFP-QEKE 60 (125) Q Consensus 6 ~~~L~~~i~~~~~~~~pvIi~GDFN~~~~~~~~~~~---------------------~~~~l~~~~---~~~~~~-~~~~ 60 (125) .+.|.++++...+.+.|+|+|||||+.+.+.+.+++ .+..+...+ .++... .... T Consensus 129 ~~~l~~~l~~~~~~~k~vIi~GD~NVA~~~iDv~~~~~n~k~~~~~g~~~f~~eER~~~~~ll~~G~vD~fR~~~p~~~~ 208 (268) T PRK11756 129 YQDLQNYLETELSPDNPLLIMGDMNISPTDLDIGIGEENRKRWLRTGKCSFLPEEREWLERLMDWGLVDTFRQLNPDVND 208 (268) T ss_pred HHHHHHHHHHHHCCCCCEEEECCCCCCCCHHHCCCCCCCHHHHCCCCCCCCCHHHHHHHHHHCCCCCEEEEEEECCCCCC T ss_conf 99999999997647995899564343583431357400335534368656888999998622038968877230589987 Q ss_pred CCCCCCCCCCCCCCCCCCCCHHHHCCCCCCC--EEEEECCCCCCCCCCCCCCCCCEEEEEEC Q ss_conf 6871038875543335521012100257321--15430247411258678875649999961 Q gi|254780995|r 61 STCNVIKRNKSSLDYFVIDRDNKNFLIDNSF--SIVSYDQSDLDTRRSKLSTHCPLTIEYDF 120 (125) Q Consensus 61 ~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~SDH~PV~~~l~f 120 (125) .++||+++...+.. ..+.|||++++++++ ++..+.+.....+..++||||||+++|++ T Consensus 209 ~yTwW~yr~~a~~~--n~G~RIDy~l~S~~l~~~v~~~~I~~~~rg~~~~SDH~PV~~~l~l 268 (268) T PRK11756 209 RFSWFDYRSKGFDD--NRGLRIDLILATQPLAERCVETGIDYDIRGMEKPSDHAPIWATFKL 268 (268) T ss_pred CEEEECCCCCCCCC--CCEEEEEEEEECHHHHHHHEEEEECHHHHCCCCCCCEEEEEEEEEC T ss_conf 36873655686314--8717689998599998453064851888279999980018999969 No 5 >PRK13911 hypothetical protein; Provisional Probab=99.59 E-value=2.6e-15 Score=102.09 Aligned_cols=106 Identities=19% Similarity=0.242 Sum_probs=69.0 Q ss_pred HHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCCCHH---------------HHHCCCCCC---CHHH-CCCCCCCCCCC Q ss_conf 9999999998511699889996147776883100278---------------763265541---0000-25767687103 Q gi|254780995|r 6 GEWLKKWADQKIKTGIPFVIAGDFNRKINSIGDTDDF---------------WQKMDPDGL---LIRF-PQEKESTCNVI 66 (125) Q Consensus 6 ~~~L~~~i~~~~~~~~pvIi~GDFN~~~~~~~~~~~~---------------~~~l~~~~~---~~~~-~~~~~~~~~~~ 66 (125) ...|++++++ ...+.|+|+|||||+.|.+.+.+++. +..+...+. ++.. +.....+|||+ T Consensus 123 ~~~~~~~~~~-l~~~k~vI~~GD~NIa~~~~Dv~~~~~~~~~~gf~~~ER~~~~~ll~~G~vD~fR~~~p~~~~~yTwW~ 201 (250) T PRK13911 123 EVEFKKFLKA-LELKKPVIVCGDLNVAHNEIDLENPKTNRKNAGFSDEERGKFSELLNAGFIDTFRYFYPNKEKAYTWWS 201 (250) T ss_pred HHHHHHHHHH-HCCCCCEEEECCCCCCCCHHCCCCHHHHCCCCCCCHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCC T ss_conf 9999999998-514898899633354687201218587567977698999999998607971765210789998645307 Q ss_pred CCCCCCCCCCCCCCHHHHCCCCCCCE--EEEECCCCCCCCCCCCCCCCCEEEEE Q ss_conf 88755433355210121002573211--54302474112586788756499999 Q gi|254780995|r 67 KRNKSSLDYFVIDRDNKNFLIDNSFS--IVSYDQSDLDTRRSKLSTHCPLTIEY 118 (125) Q Consensus 67 ~~~~~~~d~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~SDH~PV~~~l 118 (125) ++...+. ...|.|||++++++.+. +..+.+. ...+.||||||+++| T Consensus 202 yr~~a~~--~N~G~RID~~l~S~~l~~~v~~~~I~----~~~~~SDH~PV~leL 249 (250) T PRK13911 202 YMQQARD--KNIGWRIDYFLCSNPLKTRLKDALIY----KDILGSDHCPVGLEL 249 (250) T ss_pred CCCCCCC--CCCEEEEEEEEECHHHHHHCEEEEEC----CCCCCCCCCCEEEEE T ss_conf 0016676--89887888999888999550161673----899999975769995 No 6 >COG3568 ElsH Metal-dependent hydrolase [General function prediction only] Probab=99.45 E-value=1.5e-13 Score=92.59 Aligned_cols=109 Identities=19% Similarity=0.148 Sum_probs=64.0 Q ss_pred HHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCCCHHHHHCCCC-CCCHHHCCCCCCCCCCCCCCCCCCCCCCCCCH Q ss_conf 89999999999985116998899961477768831002787632655-41000025767687103887554333552101 Q gi|254780995|r 3 SQQGEWLKKWADQKIKTGIPFVIAGDFNRKINSIGDTDDFWQKMDPD-GLLIRFPQEKESTCNVIKRNKSSLDYFVIDRD 81 (125) Q Consensus 3 ~~Q~~~L~~~i~~~~~~~~pvIi~GDFN~~~~~~~~~~~~~~~l~~~-~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~ 81 (125) ++|++.|.+++.- ....|+|+|||||..++++++.......+... +........ ....-..+++...- .+ T Consensus 149 ~~Q~~~L~~~~~l--~~~~p~vl~GDFN~~p~s~~yr~~~~~~~~~~~~~~~~~~~a---~~~~~~tfps~~p~----lr 219 (259) T COG3568 149 LRQAAALLALAGL--PALNPTVLMGDFNNEPGSAEYRLAARSPLNAQAALTGAFAPA---VGRTIRTFPSNTPL----LR 219 (259) T ss_pred HHHHHHHHHHCCC--CCCCCEEEECCCCCCCCCCCCEECCCCCHHHCCCCCCCCCCC---CCCCCCCCCCCCCC----CC T ss_conf 9999999863567--566863897137899887240310488211103223565766---67556777777854----22 Q ss_pred HHHCCCCCCCEEEEECCCCCCCCCCCCCCCCCEEEEEECC Q ss_conf 2100257321154302474112586788756499999616 Q gi|254780995|r 82 NKNFLIDNSFSIVSYDQSDLDTRRSKLSTHCPLTIEYDFE 121 (125) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~~~~~SDH~PV~~~l~f~ 121 (125) +|++++++.+.+....+..... ....|||+||.++|+++ T Consensus 220 iD~Ifvs~~~~i~~~~v~~~~~-a~~aSDHlPl~aeL~~~ 258 (259) T COG3568 220 LDRIFVSKELAIRSVHVLTDRL-ARVASDHLPLLAELRLK 258 (259) T ss_pred CCEEEECCCCCEEEEEEECCCC-CCCCCCCCCEEEEEECC T ss_conf 6689954744178777523777-64214665568897427 No 7 >pfam03372 Exo_endo_phos Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNase I proteins EC:3.1.21.1, Synaptojanin an inositol-1,4,5-trisphosphate phosphatase EC:3.1.3.56, Sphingomyelinase EC:3.1.4.12 and Nocturnin. Probab=99.36 E-value=1.9e-12 Score=86.73 Aligned_cols=113 Identities=18% Similarity=-0.044 Sum_probs=58.8 Q ss_pred HHHHHHHHHHHHH-HCCCCCEEEEEECCCCCCCCCCCCHHHHHCCCCCC---------CHHHCCCCCCCCCCCCCCCCCC Q ss_conf 9999999999985-11699889996147776883100278763265541---------0000257676871038875543 Q gi|254780995|r 4 QQGEWLKKWADQK-IKTGIPFVIAGDFNRKINSIGDTDDFWQKMDPDGL---------LIRFPQEKESTCNVIKRNKSSL 73 (125) Q Consensus 4 ~Q~~~L~~~i~~~-~~~~~pvIi~GDFN~~~~~~~~~~~~~~~l~~~~~---------~~~~~~~~~~~~~~~~~~~~~~ 73 (125) +|...+.+.+... ...+.|+|++||||..+.+.+.............. ..........+++|.......- T Consensus 133 ~~~~~l~~~l~~~~~~~~~~~ii~GDfN~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~T~~~~~~~~~~ 212 (255) T pfam03372 133 AQLAELLDFLSDLRIPKSDPVILCGDFNARPDSWDSALLKSIGKSTLFLLLERDLVFDGFDELPIGFPPTWWSYRNSSEK 212 (255) T ss_pred HHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCHHHHCCHHHHHCCCCCCHHHHHHEEEEEECCCCCCCCCCCCCCCCCCC T ss_conf 99999999999865435980899720243684220117787724546885665333722554899998787077467563 Q ss_pred CCCCCCCHHHHCCCCCCCEEEEECCCCCCCCCCCCCCCCCEEEEE Q ss_conf 335521012100257321154302474112586788756499999 Q gi|254780995|r 74 DYFVIDRDNKNFLIDNSFSIVSYDQSDLDTRRSKLSTHCPLTIEY 118 (125) Q Consensus 74 d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~SDH~PV~~~l 118 (125) ...+.++|+++++++.................+|||+||.++| T Consensus 213 --~~~~~riD~il~s~~~~~~~~~~~~~~~~~~~~SDH~Pv~~~l 255 (255) T pfam03372 213 --KNTGSRLDRILVSGSLLRRVVILSLLLLVIFTGSDHRPVLATL 255 (255) T ss_pred --CCCCEEEEEEEECHHHHHHCEEEEEEECCCCCCCCCCCEEEEC T ss_conf --7987823899968899822338987406799999974869979 No 8 >TIGR03395 sphingomy sphingomyelin phosphodiesterase. Members of this family are bacterial proteins that act as sphingomyelin phosphodiesterase (EC 3.1.4.12), also called sphingomyelinase. Some members of this family have been shown to act as hemolysins. Probab=99.31 E-value=6.6e-12 Score=83.86 Aligned_cols=115 Identities=15% Similarity=0.125 Sum_probs=64.4 Q ss_pred HHHHHHHHHHHHHHH-HCCCCCEEEEEECCCCCCCCCCCCHHHHHCCCCCCCHHH---CCCCCCCCCC----CCCCCCCC Q ss_conf 789999999999985-116998899961477768831002787632655410000---2576768710----38875543 Q gi|254780995|r 2 LSQQGEWLKKWADQK-IKTGIPFVIAGDFNRKINSIGDTDDFWQKMDPDGLLIRF---PQEKESTCNV----IKRNKSSL 73 (125) Q Consensus 2 l~~Q~~~L~~~i~~~-~~~~~pvIi~GDFN~~~~~~~~~~~~~~~l~~~~~~~~~---~~~~~~~~~~----~~~~~~~~ 73 (125) -++|...|++||+++ ++.++|||++||||+.-.+.+ ...++..|......... +-+....... ......++ T Consensus 158 R~~Q~~ei~~fi~~~~ip~~e~vi~aGDfNV~k~s~E-Y~~Ml~~Ln~~~p~~~g~~~T~D~~~N~~a~~~y~~~~~e~L 236 (283) T TIGR03395 158 RANQLNEIQDFIDSKNIPKDETVLIGGDLNVNKGSNE-YHDMFKTLNVSEPRYVGVPATWDATTNSIAKYYYPKEEPEYL 236 (283) T ss_pred HHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCHH-HHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHCCCCCCCCEE T ss_conf 9999999999998559998971899401442689878-999999728999764798653087643345523699985306 Q ss_pred CCCCCCCHHHHCCCCCCCEEEEECCC-C-CCCCCCCCCCCCCEEEE Q ss_conf 33552101210025732115430247-4-11258678875649999 Q gi|254780995|r 74 DYFVIDRDNKNFLIDNSFSIVSYDQS-D-LDTRRSKLSTHCPLTIE 117 (125) Q Consensus 74 d~~~~~~~~~~~~~~~~~~~~~~~~~-~-~~~~~~~~SDH~PV~~~ 117 (125) ||+++++...-.....+......... . .......+||||||.+- T Consensus 237 DYIl~s~~h~qp~~~~n~v~~p~s~~~w~~~~~~~dlSDHyPV~~~ 282 (283) T TIGR03395 237 DYIFVSKSHAQPPVWQNKVLDPKSVTSWFKKYTYDDFSDHYPVYGF 282 (283) T ss_pred EEEEEECCCCCCCCCCCCEECCCCCCHHHCCCCCCCCCCCCCEEEE T ss_conf 8999978866875030317512114122126456776777853644 No 9 >COG2374 Predicted extracellular nuclease [General function prediction only] Probab=99.18 E-value=7.3e-11 Score=78.27 Aligned_cols=116 Identities=16% Similarity=0.138 Sum_probs=76.9 Q ss_pred HHHHHHHHHHHH--HHHCCCCCEEEEEECCCCCCCCCCCCHHHHHCCCCCCCHHH---CCCCCCCCCCCCCCCCCCCCCC Q ss_conf 899999999999--85116998899961477768831002787632655410000---2576768710388755433355 Q gi|254780995|r 3 SQQGEWLKKWAD--QKIKTGIPFVIAGDFNRKINSIGDTDDFWQKMDPDGLLIRF---PQEKESTCNVIKRNKSSLDYFV 77 (125) Q Consensus 3 ~~Q~~~L~~~i~--~~~~~~~pvIi~GDFN~~~~~~~~~~~~~~~l~~~~~~~~~---~~~~~~~~~~~~~~~~~~d~~~ 77 (125) -+||++|.+|+. +......|++|+||||+...+.+.. .+...+...+. ......++|.+..+...+||++ T Consensus 655 ~~~AqaL~~~la~~~~~~~d~~~viLGD~N~y~~edpI~-----~l~~aGy~~l~~~~~~~~~~YSY~f~G~~gtLDhaL 729 (798) T COG2374 655 VRAAQALAAFLATNPTGKADADIVILGDFNDYAFEDPIQ-----ALEGAGYMNLAARFHDAGDRYSYVFNGQSGTLDHAL 729 (798) T ss_pred HHHHHHHHHHHHHCCCCCCCCCEEEEECCCHHHHCCHHH-----HHHHCCCHHHHHHCCCCCCCEEEEECCCCCHHHHHH T ss_conf 999999999986285655678879972432131013789-----986248301100105778734899878264076665 Q ss_pred CCCHHHHCCCCC--------CCEEEEEC-------CCCCC-CCCCCCCCCCCEEEEEECCCC Q ss_conf 210121002573--------21154302-------47411-258678875649999961646 Q gi|254780995|r 78 IDRDNKNFLIDN--------SFSIVSYD-------QSDLD-TRRSKLSTHCPLTIEYDFEKG 123 (125) Q Consensus 78 ~~~~~~~~~~~~--------~~~~~~~~-------~~~~~-~~~~~~SDH~PV~~~l~f~~~ 123 (125) .+..+.+..... ...+..|. +.... ....++|||-||++.|++-.. T Consensus 730 as~sl~~~v~~a~ewHINAdE~~~ldYn~~Fk~q~~~~~~~~~~fR~SDHDPvvvglnL~~~ 791 (798) T COG2374 730 ASASLAAQVSGATEWHINADEPDALDYNLEFKGQNVSLYKTTNPFRASDHDPVVVGLNLLGT 791 (798) T ss_pred HHHHHHHHCCCCEEEEECCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCCCCEEEEEEECCC T ss_conf 22445521567426530456662322102103565533356775535789976999870266 No 10 >smart00476 DNaseIc deoxyribonuclease I. Deoxyribonuclease I catalyzes the endonucleolytic cleavage of double-stranded DNA. The enzyme is secreted outside the cell and also involved in apoptosis in the nucleus. Probab=99.07 E-value=3.6e-10 Score=74.55 Aligned_cols=113 Identities=12% Similarity=0.129 Sum_probs=66.4 Q ss_pred HHHHHHHHH---HHHHHCCCCCEEEEEECCCCCCCCCCCCHHHHHCCCCCCCHHHCCCCCCCCCCCCCCCCCCCCCCC-C Q ss_conf 999999999---998511699889996147776883100278763265541000025767687103887554333552-1 Q gi|254780995|r 4 QQGEWLKKW---ADQKIKTGIPFVIAGDFNRKINSIGDTDDFWQKMDPDGLLIRFPQEKESTCNVIKRNKSSLDYFVI-D 79 (125) Q Consensus 4 ~Q~~~L~~~---i~~~~~~~~pvIi~GDFN~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~-~ 79 (125) ++.++|-+. +.++ -...++|+|||||.+.+.-....+..-.+..+..+.|+..+....+. ..+ .+..|.+.+ + T Consensus 158 ~Eid~L~dV~~dv~~~-~~~~diiilGDFNAdcsyv~~~~~~~irl~~~~~f~WlI~d~~dTTv-~~t-~caYDriv~~g 234 (276) T smart00476 158 AEIDALYDVYLDVRQK-WGTEDVIFMGDFNAGCSYVTKKQWSSIRLRTSPTFHWLIPDSADTTV-TST-HCAYDRIVVAG 234 (276) T ss_pred HHHHHHHHHHHHHHHH-CCCCCEEEECCCCCCCCCCCHHHHHHCCCCCCCCCEEECCCCCCCCC-CCC-CCCCCEEEEEC T ss_conf 8999999999999874-47666799624568876557556532201026861374478877643-688-87761499975 Q ss_pred CHHHHCCCCCCCEEEEECCCCCC--CCCCCCCCCCCEEEEEE Q ss_conf 01210025732115430247411--25867887564999996 Q gi|254780995|r 80 RDNKNFLIDNSFSIVSYDQSDLD--TRRSKLSTHCPLTIEYD 119 (125) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~SDH~PV~~~l~ 119 (125) ........+++..+..|...... ......|||+||.++|+ T Consensus 235 ~~~~~~~~~gsa~v~~F~~~~~Lt~~~Al~VSDH~PVev~Lk 276 (276) T smart00476 235 ERLRSSVVPGSAAVFDFQTAYGLTEEEALAISDHFPVEVTLK 276 (276) T ss_pred HHHHHCCCCCCCEEEECHHHHCCCHHHHHHCCCCCCEEEEEC T ss_conf 455423136874267437864959999974677776699949 No 11 >PRK05421 hypothetical protein; Provisional Probab=98.93 E-value=2.4e-09 Score=70.16 Aligned_cols=97 Identities=20% Similarity=0.225 Sum_probs=49.9 Q ss_pred HHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCC-CCCHHHHHCCCCCCCHHHCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 789999999999985116998899961477768831-0027876326554100002576768710388755433355210 Q gi|254780995|r 2 LSQQGEWLKKWADQKIKTGIPFVIAGDFNRKINSIG-DTDDFWQKMDPDGLLIRFPQEKESTCNVIKRNKSSLDYFVIDR 80 (125) Q Consensus 2 l~~Q~~~L~~~i~~~~~~~~pvIi~GDFN~~~~~~~-~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~ 80 (125) +++|.+.|.+-|.. ..=|+|++||||.+-...- .-..+.+.+.-.+.. +.+..+. ..+...+||+++++ T Consensus 157 y~~QL~~l~~~i~~---H~GPvI~AGDFNtWs~~R~~~l~~~~~~~gL~ev~-f~~d~R~------~~~g~pLDhifyRg 226 (254) T PRK05421 157 YSKQLEPIGDQIAH---HSGPIILAGDFNTWSRKRMNALKRFARELGLKEVR-FTDDQRR------KAFGRPLDHVFYRG 226 (254) T ss_pred HHHHHHHHHHHHHH---CCCCEEEECCCCCCCHHHHHHHHHHHHHCCCCEEE-CCCCCCE------ECCCCCCCEEEECC T ss_conf 99999999999984---89987984453301399999999999976980875-3767734------52589865699878 Q ss_pred HHHHCCCCCCCEEEEECCCCCCCCCCCCCCCCCEEEEEECCCC Q ss_conf 1210025732115430247411258678875649999961646 Q gi|254780995|r 81 DNKNFLIDNSFSIVSYDQSDLDTRRSKLSTHCPLTIEYDFEKG 123 (125) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~SDH~PV~~~l~f~~~ 123 (125) +.+....+. ....|||.|++++|.+++. T Consensus 227 ----------l~~~~a~~~-----~t~~SDHnPl~~~f~L~~~ 254 (254) T PRK05421 227 ----------LNVEKASVL-----VTRASDHNPLLVEFSLKKP 254 (254) T ss_pred ----------CEEEEEECC-----CCCCCCCCCEEEEEEECCC T ss_conf ----------523310556-----5677877738899994289 No 12 >COG3021 Uncharacterized protein conserved in bacteria [Function unknown] Probab=98.77 E-value=1.2e-08 Score=66.38 Aligned_cols=100 Identities=19% Similarity=0.219 Sum_probs=46.7 Q ss_pred HHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCCCHHHHHCCCCCCCHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHHHCC Q ss_conf 99999999851169988999614777688310027876326554100002576768710388755433355210121002 Q gi|254780995|r 7 EWLKKWADQKIKTGIPFVIAGDFNRKINSIGDTDDFWQKMDPDGLLIRFPQEKESTCNVIKRNKSSLDYFVIDRDNKNFL 86 (125) Q Consensus 7 ~~L~~~i~~~~~~~~pvIi~GDFN~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~ 86 (125) +++.+..++-.....|+|++||||+.|=+... +.+..-+.....+... .+-.+.++.... ...+.+||+++ T Consensus 209 ~ql~~l~~~i~~~~gpvIlaGDfNa~pWS~~~-----~R~~~l~~~~~~~~aG---~~~~~~~p~~~~-r~~g~PIDhvf 279 (309) T COG3021 209 AQLLELGDQIAGHSGPVILAGDFNAPPWSRTA-----KRMAALGGLRAAPRAG---LWEVRFTPDERR-RAFGLPIDHVF 279 (309) T ss_pred HHHHHHHHHHHCCCCCEEEEECCCCCCHHHHH-----HHHHHHCCCCCCHHCC---CCCCCCCHHHHH-HCCCCCCCEEE T ss_conf 99999999997178875886137885055778-----9999861643222205---776642777874-12589852123 Q ss_pred CCCCCEEEEECCCCCCCCCCCCCCCCCEEEEEECC Q ss_conf 57321154302474112586788756499999616 Q gi|254780995|r 87 IDNSFSIVSYDQSDLDTRRSKLSTHCPLTIEYDFE 121 (125) Q Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~SDH~PV~~~l~f~ 121 (125) ..+ ..+... .......|||+||++||..- T Consensus 280 ~rg-l~~~ka-----~rl~~~gSDH~PLLveF~~~ 308 (309) T COG3021 280 YRG-LTVMKA-----RRLPDRGSDHRPLLVEFSYG 308 (309) T ss_pred ECC-CCHHHH-----HHCCCCCCCCCCEEEEEEEC T ss_conf 358-604433-----20545577777168998856 No 13 >KOG3873 consensus Probab=98.02 E-value=1.2e-05 Score=50.33 Aligned_cols=115 Identities=12% Similarity=0.093 Sum_probs=63.4 Q ss_pred HHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCCCHHHHHCC-----------CCCCCHHH---------CCCCCCCCC Q ss_conf 9999999999851169988999614777688310027876326-----------55410000---------257676871 Q gi|254780995|r 5 QGEWLKKWADQKIKTGIPFVIAGDFNRKINSIGDTDDFWQKMD-----------PDGLLIRF---------PQEKESTCN 64 (125) Q Consensus 5 Q~~~L~~~i~~~~~~~~pvIi~GDFN~~~~~~~~~~~~~~~l~-----------~~~~~~~~---------~~~~~~~~~ 64 (125) ||=.+.++|+.....+.-||++||||..|.+-+..-.+-..+. +...+++. +++...++| T Consensus 155 QAwdlaqfi~~t~q~~~vVI~~GDLN~~P~dl~~~ll~~a~l~daw~~~h~~q~e~~~~r~s~~~~l~~g~tcd~~~N~y 234 (422) T KOG3873 155 QAWDLAQFIRATRQNADVVILAGDLNMQPQDLGHKLLLSAGLVDAWTSLHLDQCESDSFRLSEDKELVEGNTCDSPLNCY 234 (422) T ss_pred HHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCEEEEECCCHHHHHHHHCHHHHCCCCCCCCHHHHHHCCCCCCCCCHHH T ss_conf 89999999999744786899946778886666612444030045676533243127522445244553287656751036 Q ss_pred CCCCCCCCCCCCCCCCHHHHCCCCCCC-E--EEEEC--CCCCCCCCCCCCCCCCEEEEEECCC Q ss_conf 038875543335521012100257321-1--54302--4741125867887564999996164 Q gi|254780995|r 65 VIKRNKSSLDYFVIDRDNKNFLIDNSF-S--IVSYD--QSDLDTRRSKLSTHCPLTIEYDFEK 122 (125) Q Consensus 65 ~~~~~~~~~d~~~~~~~~~~~~~~~~~-~--~~~~~--~~~~~~~~~~~SDH~PV~~~l~f~~ 122 (125) .+ ......-.++.|||+.+..++. . ...++ ....+..+-..|||..+.++|.... T Consensus 235 ~~---aqk~~ddp~~~RiDYvl~k~~~~~~~~a~~~~t~~rvP~~d~s~SDH~Al~a~L~I~~ 294 (422) T KOG3873 235 TS---AQKREDDPLGKRIDYVLVKPGDCNAKIAEVEFTEPRVPGEDCSYSDHEALMATLKIFK 294 (422) T ss_pred HH---HHHCCCCCCCEEEEEEEECCCCEEEEEEEEEECCCCCCCCCCCCCCHHHHEEEEEEEC T ss_conf 67---8747887543156689973761378730588547878999998420243436898404 No 14 >KOG0620 consensus Probab=97.94 E-value=1.1e-05 Score=50.67 Aligned_cols=104 Identities=17% Similarity=0.181 Sum_probs=53.9 Q ss_pred CCCCEEEEEECCCCCCCCCCCCHH----------HHHCCCCCCC------------------HHHCCCCCCCCCCCCCCC Q ss_conf 699889996147776883100278----------7632655410------------------000257676871038875 Q gi|254780995|r 19 TGIPFVIAGDFNRKINSIGDTDDF----------WQKMDPDGLL------------------IRFPQEKESTCNVIKRNK 70 (125) Q Consensus 19 ~~~pvIi~GDFN~~~~~~~~~~~~----------~~~l~~~~~~------------------~~~~~~~~~~~~~~~~~~ 70 (125) ...|++++||||..+.++.+.-.. ++.+...... ...+.....++-...++. T Consensus 220 ~~~p~l~~gdfNs~p~~~v~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ftn~t~~F~ 299 (361) T KOG0620 220 ASFPLLLCGDFNSTPLSPVYALLASGILSPEDADNKDLPLMSALGSKVNKSYEEMSHDQRRKLEYTTGEPRFTNYTPGFK 299 (361) T ss_pred CCCCEEEECCCCCCCCCCCEEECCCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCCCCCCCCCC T ss_conf 23432212144678886501331467889536667644444400012323200045113215445567655324657755 Q ss_pred CCCCCCCCCCHHHHCCCCCCCE---EEEECCCCCCCCCCCCCCCCCEEEEEECCCC Q ss_conf 5433355210121002573211---5430247411258678875649999961646 Q gi|254780995|r 71 SSLDYFVIDRDNKNFLIDNSFS---IVSYDQSDLDTRRSKLSTHCPLTIEYDFEKG 123 (125) Q Consensus 71 ~~~d~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~SDH~PV~~~l~f~~~ 123 (125) +.+|||+++......-..-... ....++... .....||||.|+.++|++... T Consensus 300 g~~DYIfys~~~~~~~~~L~~~~e~~~~~~~~~l-Ps~~~pSDHi~L~~ef~~~~~ 354 (361) T KOG0620 300 GTLDYIFYSKAALIVESALELLDEDELVIKVTGL-PSPHHPSDHIPLLAEFEIAPP 354 (361) T ss_pred CEEEEEEECCCCCCCCCCCCCCCHHHHHCCCCCC-CCCCCCCCCCHHHCCCCCCCC T ss_conf 5156899716533002135677746642354448-888787753023212444675 No 15 >KOG2338 consensus Probab=97.71 E-value=5.5e-05 Score=46.85 Aligned_cols=38 Identities=21% Similarity=0.108 Sum_probs=26.2 Q ss_pred HHHHHHHHHHHHHHC---CCCCEEEEEECCCCCCCCCCCCH Q ss_conf 999999999998511---69988999614777688310027 Q gi|254780995|r 4 QQGEWLKKWADQKIK---TGIPFVIAGDFNRKINSIGDTDD 41 (125) Q Consensus 4 ~Q~~~L~~~i~~~~~---~~~pvIi~GDFN~~~~~~~~~~~ 41 (125) +|...|-+.+++... ..-|+++|||||..|+++.+... T Consensus 273 ~Q~~iiL~~~~~~~~~~~~~~pi~l~GDfNt~p~~~~y~fl 313 (495) T KOG2338 273 AQVYIILAELEKMSKSSKSHWPIFLCGDFNTEPDSPPYLFL 313 (495) T ss_pred HHHHHHHHHHHHHHHHCCCCCCEEEECCCCCCCCCCCCHHH T ss_conf 78999999999997534357980770577789988731544 No 16 >KOG0566 consensus Probab=97.58 E-value=0.00013 Score=44.81 Aligned_cols=19 Identities=26% Similarity=0.273 Sum_probs=14.9 Q ss_pred CCCCCCCCCCCEEEEEECC Q ss_conf 2586788756499999616 Q gi|254780995|r 103 TRRSKLSTHCPLTIEYDFE 121 (125) Q Consensus 103 ~~~~~~SDH~PV~~~l~f~ 121 (125) ...-+.|||+||.+.|+.+ T Consensus 814 ~~el~~SDHRPV~A~~~a~ 832 (1080) T KOG0566 814 RAELKTSDHRPVYAIFRAE 832 (1080) T ss_pred CCCCCCCCCCCEEEEEEEE T ss_conf 5530214787357999989 No 17 >COG5239 CCR4 mRNA deadenylase, exonuclease subunit and related nucleases [RNA processing and modification] Probab=97.42 E-value=0.00015 Score=44.55 Aligned_cols=102 Identities=20% Similarity=0.145 Sum_probs=58.7 Q ss_pred CEEEEEECCCCCCCCCCCCHHHHHC-------------CCCC--------CCHHHCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 8899961477768831002787632-------------6554--------100002576768710388755433355210 Q gi|254780995|r 22 PFVIAGDFNRKINSIGDTDDFWQKM-------------DPDG--------LLIRFPQEKESTCNVIKRNKSSLDYFVIDR 80 (125) Q Consensus 22 pvIi~GDFN~~~~~~~~~~~~~~~l-------------~~~~--------~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~ 80 (125) ++.++||||+...+..+.....+.. .+.+ ....-+.+....+-|.+.+.+-+|||++.. T Consensus 244 ~~l~~gd~ns~~~s~vy~~l~~~~~q~H~~~~~~~~~lysvg~~~~h~~n~~~~~~~~~~~fTN~t~~~kG~iDYIfy~~ 323 (378) T COG5239 244 DILITGDFNSLRASLVYKFLVTSQIQLHESLNGRDFSLYSVGYKFVHPENLKSDNSKGELGFTNWTPGFKGVIDYIFYHG 323 (378) T ss_pred CCCCCCCCCCEECCEEHHHHHHHHHHHHHCCCCCCHHHHCCCCCCCCHHHCCCCCCCCCCCCCCCCCCCCCEEEEEEEEC T ss_conf 42357871000033001454407988641035443222015545558423155787687650135665454257999846 Q ss_pred HHHHCCCCCCCEEEE----ECCCCCCCCCCCCCCCCCEEEEEECCCCCC Q ss_conf 121002573211543----024741125867887564999996164689 Q gi|254780995|r 81 DNKNFLIDNSFSIVS----YDQSDLDTRRSKLSTHCPLTIEYDFEKGNV 125 (125) Q Consensus 81 ~~~~~~~~~~~~~~~----~~~~~~~~~~~~~SDH~PV~~~l~f~~~~~ 125 (125) ..... +.+...+.+ .++.+.+ ....||||-|..++|.++.-|. T Consensus 324 ~~~~~-~~~~l~~ve~e~~~k~~G~p-n~~~pSdhipl~~ef~~~~~~~ 370 (378) T COG5239 324 GLLTR-QTGLLGVVEGEYASKVIGLP-NMPFPSDHIPLLAEFASDHKNL 370 (378) T ss_pred CCCEE-EECCCCCCCCCHHHHHCCCC-CCCCCCCCCCCHHCCCCCCCCC T ss_conf 86104-40455554510233110567-8999512440010134432013 No 18 >smart00128 IPPc Inositol polyphosphate phosphatase, catalytic domain homologues. Mg(2+)-dependent/Li(+)-sensitive enzymes. Probab=97.40 E-value=0.0004 Score=42.20 Aligned_cols=21 Identities=14% Similarity=0.180 Sum_probs=16.5 Q ss_pred CCCCCCCCCCCEEEEEECCCC Q ss_conf 258678875649999961646 Q gi|254780995|r 103 TRRSKLSTHCPLTIEYDFEKG 123 (125) Q Consensus 103 ~~~~~~SDH~PV~~~l~f~~~ 123 (125) ......|||.||.+.|+++-. T Consensus 287 ~~~~~~SDHkPV~a~f~v~v~ 307 (310) T smart00128 287 GMELTTSDHKPVFATFRLKVT 307 (310) T ss_pred CCCCCCCCCCCEEEEEEEEEE T ss_conf 587667785147889999989 No 19 >PTZ00297 pantothenate kinase; Provisional Probab=97.06 E-value=0.0019 Score=38.60 Aligned_cols=119 Identities=16% Similarity=0.200 Sum_probs=51.1 Q ss_pred HHHHHHHHHHHHHH---------CCCCCEEEEEECCCCCCCCCC-C------CHHHHHCCCCCC-CHHHCCCCC---CCC Q ss_conf 99999999999851---------169988999614777688310-0------278763265541-000025767---687 Q gi|254780995|r 4 QQGEWLKKWADQKI---------KTGIPFVIAGDFNRKINSIGD-T------DDFWQKMDPDGL-LIRFPQEKE---STC 63 (125) Q Consensus 4 ~Q~~~L~~~i~~~~---------~~~~pvIi~GDFN~~~~~~~~-~------~~~~~~l~~~~~-~~~~~~~~~---~~~ 63 (125) .|.+..+++++..+ ....|+||+||||+.--++-- . ..+++.+.+-+. .+.+..+.. ..+ T Consensus 170 sqv~etrrFv~svi~n~~~~~~d~a~iPfviAGDFninGIdp~n~G~ptK~f~ell~elQ~lGsgvrevi~d~~g~hPpT 249 (1452) T PTZ00297 170 SQVQETRRFVESVIANVYEQNNDGAEIPFVIAGDFNINGIDPHNGGHPTKRFQELLNELQDLGSGVREVIYDETGQHPPT 249 (1452) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHCCCCCCC T ss_conf 99999999999999998737787531547996354556758666898628999999988764100665515421789899 Q ss_pred ----CCCCCCCCCCCCCCCCCHHHHCCCCCCCEEEEE-----CCCCCCCCCCCCCCCCCEEEEEECCCC Q ss_conf ----103887554333552101210025732115430-----247411258678875649999961646 Q gi|254780995|r 64 ----NVIKRNKSSLDYFVIDRDNKNFLIDNSFSIVSY-----DQSDLDTRRSKLSTHCPLTIEYDFEKG 123 (125) Q Consensus 64 ----~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~SDH~PV~~~l~f~~~ 123 (125) ...+.....-.+..-..|-|+++++..+++..- .+.....+ .++|||+-|.+.+.+... T Consensus 250 RP~~LFFP~~SKL~R~s~tPQRQDyFfvt~~v~V~~p~i~Kfv~ssr~PY-tYLSDHFGvs~~l~l~~~ 317 (1452) T PTZ00297 250 RPPILFFPEQSKLERYSSTPQRQDYFFVTPCVQVEKPRIEKFVVSSRRPY-TYLSDHFGVSARLTLPLN 317 (1452) T ss_pred CCCEEECCCHHHHHHCCCCCCCCCEEEEEEEEEECCCCHHHHHHCCCCCC-EEECCCCCCEEEEEECCC T ss_conf 99764554255553045883424338752001103642676631367874-442012571489994478 No 20 >KOG1294 consensus Probab=96.69 E-value=0.0035 Score=37.16 Aligned_cols=112 Identities=16% Similarity=0.187 Sum_probs=57.5 Q ss_pred HHHHHHHHHHHHHHC---CCCCEEEEEECCCCCCCCCC----CCH-----------------HHHHCCCCC-----CCHH Q ss_conf 999999999998511---69988999614777688310----027-----------------876326554-----1000 Q gi|254780995|r 4 QQGEWLKKWADQKIK---TGIPFVIAGDFNRKINSIGD----TDD-----------------FWQKMDPDG-----LLIR 54 (125) Q Consensus 4 ~Q~~~L~~~i~~~~~---~~~pvIi~GDFN~~~~~~~~----~~~-----------------~~~~l~~~~-----~~~~ 54 (125) +=...++..+.++-. ...|++.+||.|..+..... ... +...+.+.+ .... T Consensus 193 ~~~~~~~~~~~~~~~~k~~~~~~v~~gd~nvs~~~i~~~~~~~~~~~~~~~~~~~t~e~R~~~~~~~~~~~~~iDt~r~~ 272 (335) T KOG1294 193 RWDKEIEEKRKKQSSSKNLKAPVVICGDLNVSHEEIDPSKPLVSPAGNTLSNAGFTPEERDSFFAELLEKGPLIDTYREL 272 (335) T ss_pred HHHHHHHHHHHHCCCCCCCCCCCEECCCCCCCHHHCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCEEEHHHH T ss_conf 66777777764036444445761341556544222166655114456776778988788656777643577502320431 Q ss_pred HCCCCCCCCCCCCCCCCCCCCCCCCCHHHHCCCCCCCE--EEEECCCCCCCCCCCCCCCCCEEEEEEC Q ss_conf 02576768710388755433355210121002573211--5430247411258678875649999961 Q gi|254780995|r 55 FPQEKESTCNVIKRNKSSLDYFVIDRDNKNFLIDNSFS--IVSYDQSDLDTRRSKLSTHCPLTIEYDF 120 (125) Q Consensus 55 ~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~SDH~PV~~~l~f 120 (125) .......++.|.+-.+.... .++.+++++.++.... ..++.+. ......||||||...|.+ T Consensus 273 ~~~~~~~~t~Wk~~~~~r~~--~~~~r~dy~~Vsk~~~n~~r~~~Ic---~r~~~gsdh~pi~~~~~~ 335 (335) T KOG1294 273 HKDQKKAYTFWKYMPNGRQR--GHGERCDYILVSKPGPNNGRRFYIC---SRPIHGSDHCPITLEFFL 335 (335) T ss_pred CCCCCCCCCCHHHCCCCCCC--CCCCCEEEEEECCCCCCCCCEEEEE---CCCCCCCCCCCEEEEECC T ss_conf 67753345524220023368--9887136898347677877423242---376578899981466429 No 21 >KOG2756 consensus Probab=95.98 E-value=0.0065 Score=35.73 Aligned_cols=107 Identities=12% Similarity=0.035 Sum_probs=47.7 Q ss_pred HHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCC------CCCCCHHHHHCCCCCCCHHHCCCCCCCCCCCCCCCCCCCCC Q ss_conf 899999999999851169988999614777688------31002787632655410000257676871038875543335 Q gi|254780995|r 3 SQQGEWLKKWADQKIKTGIPFVIAGDFNRKINS------IGDTDDFWQKMDPDGLLIRFPQEKESTCNVIKRNKSSLDYF 76 (125) Q Consensus 3 ~~Q~~~L~~~i~~~~~~~~pvIi~GDFN~~~~~------~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 76 (125) +.-.+.+++-|++. ++..||..||.|-.-.. ++-+-..|..+... +....+|-+..++..--.. T Consensus 228 ~~~~~k~~EaIe~l--PnA~ViFGGD~NlrD~ev~r~~lPD~~vDvWE~lg~p--------~~~~FTwDT~~N~nl~G~~ 297 (349) T KOG2756 228 KMVLKKMQEAIESL--PNATVIFGGDTNLRDREVTRCGLPDNIVDVWEFLGKP--------KHCQFTWDTQMNSNLGGTA 297 (349) T ss_pred HHHHHHHHHHHHHC--CCCEEEECCCCCCHHHHCCCCCCCCHHHHHHHHHCCC--------CCCCEEEECCCCCCCCCHH T ss_conf 99999999999859--9976996586640133313678991378999981898--------7674144434486432148 Q ss_pred CCCCHHHHCCCC-----CCCEEEEECC---CCCCCCCCCCCCCCCEEEEEE Q ss_conf 521012100257-----3211543024---741125867887564999996 Q gi|254780995|r 77 VIDRDNKNFLID-----NSFSIVSYDQ---SDLDTRRSKLSTHCPLTIEYD 119 (125) Q Consensus 77 ~~~~~~~~~~~~-----~~~~~~~~~~---~~~~~~~~~~SDH~PV~~~l~ 119 (125) ....|+|.++.. +......+.. .........||||.-+.++|. T Consensus 298 a~k~RfDRi~~r~~~~~G~~~~~~l~l~g~~kiRgc~~fPSDHwgll~Tl~ 348 (349) T KOG2756 298 ACKLRFDRIFFRAAAEEGHIIPRSLDLLGLEKLRGCGRFPSDHWGLLCTLD 348 (349) T ss_pred HHHHHHHHHHEECHHHCCCCCCCCCCHHHHHHHCCCCCCCCCCCEEEEEEC T ss_conf 888877767300012218856544452425664578867854301466615 No 22 >COG5411 Phosphatidylinositol 5-phosphate phosphatase [Signal transduction mechanisms] Probab=94.90 E-value=0.045 Score=31.26 Aligned_cols=17 Identities=18% Similarity=0.266 Sum_probs=14.2 Q ss_pred CCCCCCCCEEEEEECCC Q ss_conf 67887564999996164 Q gi|254780995|r 106 SKLSTHCPLTIEYDFEK 122 (125) Q Consensus 106 ~~~SDH~PV~~~l~f~~ 122 (125) .+.|||+||++.+..+- T Consensus 312 l~~SDHrPV~a~~~~~i 328 (460) T COG5411 312 LMISDHRPVYATFRAKI 328 (460) T ss_pred EEECCCCEEEEEEECCE T ss_conf 46407870799996225 No 23 >KOG1976 consensus Probab=72.51 E-value=4 Score=20.85 Aligned_cols=36 Identities=25% Similarity=0.210 Sum_probs=24.2 Q ss_pred HHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCC Q ss_conf 899999999999851169988999614777688310 Q gi|254780995|r 3 SQQGEWLKKWADQKIKTGIPFVIAGDFNRKINSIGD 38 (125) Q Consensus 3 ~~Q~~~L~~~i~~~~~~~~pvIi~GDFN~~~~~~~~ 38 (125) +||.+.+-+-++++.-....+.|.||||..+++... T Consensus 200 ~~al~~vL~el~~~~~~~~~~fVfGdfNfrLds~s~ 235 (391) T KOG1976 200 EQALEMVLKELDEEGLRNDAIFVFGDFNFRLDSTSL 235 (391) T ss_pred HHHHHHHHHHHHHHCCCCEEEEEECCCCCCCCHHHH T ss_conf 889999999987630475289996134421052889 No 24 >PTZ00312 inositol-1,4,5-triphosphate 5-phosphatase; Provisional Probab=65.09 E-value=8.8 Score=18.98 Aligned_cols=21 Identities=33% Similarity=0.555 Sum_probs=16.8 Q ss_pred HHCCCCCEEEEEECCCCCCCC Q ss_conf 511699889996147776883 Q gi|254780995|r 16 KIKTGIPFVIAGDFNRKINSI 36 (125) Q Consensus 16 ~~~~~~pvIi~GDFN~~~~~~ 36 (125) -.....|+.|-||||...+.. T Consensus 124 ~v~p~~PlFIFGDfN~RlD~~ 144 (356) T PTZ00312 124 FISPSDPLFIFGDFNVRLDGH 144 (356) T ss_pred CCCCCCCEEEECCCCEEECHH T ss_conf 048899758952441344417 No 25 >COG3459 Cellobiose phosphorylase [Carbohydrate transport and metabolism] Probab=50.14 E-value=12 Score=18.31 Aligned_cols=30 Identities=23% Similarity=0.413 Sum_probs=20.5 Q ss_pred HHHHHHHHHHHHCCCCCEEEEEECCCCCCC Q ss_conf 999999999851169988999614777688 Q gi|254780995|r 6 GEWLKKWADQKIKTGIPFVIAGDFNRKINS 35 (125) Q Consensus 6 ~~~L~~~i~~~~~~~~pvIi~GDFN~~~~~ 35 (125) +.+|.--+...-+.|.|.|+.||+|+..+. T Consensus 718 ~rAld~av~r~g~~glpli~gGDWND~ln~ 747 (1056) T COG3459 718 ARALDLAVARLGPHGLPLILGGDWNDGLNL 747 (1056) T ss_pred HHHHHHHHHHCCCCCCCCCCCCCCCCCCCC T ss_conf 999999998608777643157761100350 No 26 >TIGR00619 sbcd nuclease SbcCD, D subunit; InterPro: IPR004593 All proteins in this family for which functions are known are double-stranded DNA exonuclease (as part of a complex with SbcC homologs). This complex functions in the initiation of recombination and recombinational repair and is particularly important in regulating the stability of DNA sections that can form secondary structures. This family is likely to be homologous to the MRE11 family.; GO: 0004527 exonuclease activity, 0006259 DNA metabolic process. Probab=40.45 E-value=31 Score=16.07 Aligned_cols=39 Identities=18% Similarity=0.213 Sum_probs=27.4 Q ss_pred CHHHHHHHHHHHHHHHHCC-CCCEEEEEECCCCCCCCCCC Q ss_conf 9789999999999985116-99889996147776883100 Q gi|254780995|r 1 MLSQQGEWLKKWADQKIKT-GIPFVIAGDFNRKINSIGDT 39 (125) Q Consensus 1 ~l~~Q~~~L~~~i~~~~~~-~~pvIi~GDFN~~~~~~~~~ 39 (125) .+++|.+.|.+.++--.+. -.-+||+||==+...-+.+- T Consensus 29 r~~e~~~~L~~l~~~~~~e~~Da~lvAGDvFD~~~P~~ea 68 (275) T TIGR00619 29 RLAEQKAFLDDLLEILKAEQVDALLVAGDVFDTANPPAEA 68 (275) T ss_pred HHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCHHH T ss_conf 8999999999999999738998899700007889887999 No 27 >TIGR02339 thermosome_arch thermosome, various subunits, archaeal; InterPro: IPR012714 Thermosome is the name given to the archaeal rather than eukaryotic form of the group II chaperonin (counterpart to the group I chaperonin, GroEL/GroES, in bacteria), a toroidal, ATP-dependent molecular chaperone that assists in the folding or refolding of nascent or denatured proteins. Various homologous subunits, one to five per archaeal genome, may be designated alpha, beta, etc., but phylogenetic analysis does not show distinct alpha subunit and beta subunit lineages traceable to ancient paralogs.; GO: 0005524 ATP binding, 0051082 unfolded protein binding, 0006457 protein folding. Probab=39.53 E-value=32 Score=15.99 Aligned_cols=25 Identities=28% Similarity=0.387 Sum_probs=23.2 Q ss_pred HHHHHHHHHHHHHHHHCCCCCEEEE Q ss_conf 7899999999999851169988999 Q gi|254780995|r 2 LSQQGEWLKKWADQKIKTGIPFVIA 26 (125) Q Consensus 2 l~~Q~~~L~~~i~~~~~~~~pvIi~ 26 (125) |.|+...|+++++.-.+.|.+||+| T Consensus 264 ldqEe~~Lke~VD~i~~~GAnVVfC 288 (522) T TIGR02339 264 LDQEEAMLKEMVDKIADAGANVVFC 288 (522) T ss_pred HHHHHHHHHHHHHHHHHCCCCEEEE T ss_conf 9999999999988999828987982 No 28 >TIGR01759 MalateDH-SF1 malate dehydrogenase; InterPro: IPR010945 Malate dehydrogenases catalyse the interconversion of malate and oxaloacetate using dinucleotide cofactors . The enzymes in this entry are found in archaea, bacteria and eukaryotes and fall into two distinct groups. The first group are cytoplamsic, NAD-dependent enzymes which participate in the citric acid cycle (1.1.1.37 from EC). The second group are found in plant chloroplasts, use NADP as cofactor, and participate in the C4 cycle (1.1.1.82 from EC). Structural studies indicate that these enzymes are homodimers with very sinmilar overall topology, though the chloroplast enzymes also have N- and C-terminal extensions, and all contain the classical Rossman fold for NAD(P)H binding , , , . Substrate specificity is determined by a mobile loop at the active site which uses charge balancing to discriminate between the correct substrates (malate and oxaloacetate) and other potential oxo/hydroxyacid substrates the enzyme may encounter within the cell .; GO: 0016615 malate dehydrogenase activity, 0006108 malate metabolic process. Probab=35.50 E-value=34 Score=15.85 Aligned_cols=23 Identities=30% Similarity=0.560 Sum_probs=19.3 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCEEEEE Q ss_conf 978999999999998511699889996 Q gi|254780995|r 1 MLSQQGEWLKKWADQKIKTGIPFVIAG 27 (125) Q Consensus 1 ~l~~Q~~~L~~~i~~~~~~~~pvIi~G 27 (125) ||+.|+++|+++. ++...|+|.| T Consensus 107 IF~~QG~aLn~~A----k~~vKVLVVG 129 (329) T TIGR01759 107 IFKEQGKALNKVA----KKDVKVLVVG 129 (329) T ss_pred HHHHHHHHHHHHC----CCCCEEEEEC T ss_conf 4687999999861----8985699852 No 29 >pfam10230 DUF2305 Uncharacterized conserved protein (DUF2305). This family of proteins is conserved from plants to humans. The function is unknown. Probab=34.18 E-value=39 Score=15.52 Aligned_cols=26 Identities=15% Similarity=0.253 Sum_probs=21.7 Q ss_pred HHHHHHHHHHHHHHHHCC--CCCEEEEE Q ss_conf 789999999999985116--99889996 Q gi|254780995|r 2 LSQQGEWLKKWADQKIKT--GIPFVIAG 27 (125) Q Consensus 2 l~~Q~~~L~~~i~~~~~~--~~pvIi~G 27 (125) |+.|.+-..+++++.... +.++|++| T Consensus 58 L~~QI~hk~~fi~~~~~~~~~~k~iLiG 85 (260) T pfam10230 58 LQDQIEHKIDFLRAFLPKKTDVKLILIG 85 (260) T ss_pred HHHHHHHHHHHHHHHHHCCCCCCEEEEE T ss_conf 7999999999999985456898579983 No 30 >cd02855 Glycogen_branching_enzyme_N_term Glycogen branching enzyme N-terminus domain. Glycogen branching enzyme (AKA 1,4 alpha glucan branching enzyme) catalyzes the formation of alpha-1,6 branch points in either glycogen or starch by cleavage of the alpha-1,4 glucosidic linkage yielding a non-reducing end oligosaccharide chain and subsequent attachment to the alpha-1,6 position. By increasing the number of non-reducing ends glycogen is more reactive to synthesis and digestion as well as being more soluble. The N-terminus of the 1,4 alpha glucan branching enzyme may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitobiase, and chitina Probab=33.27 E-value=27 Score=16.37 Aligned_cols=15 Identities=27% Similarity=0.291 Sum_probs=10.9 Q ss_pred CCEEEEEECCCCCCC Q ss_conf 988999614777688 Q gi|254780995|r 21 IPFVIAGDFNRKINS 35 (125) Q Consensus 21 ~pvIi~GDFN~~~~~ 35 (125) ..|-|+||||.+-.. T Consensus 33 ~~V~vvGdFN~W~~~ 47 (106) T cd02855 33 RRVSVVGDFNGWDGR 47 (106) T ss_pred CEEEEEEECCCCCCC T ss_conf 299999888789976 No 31 >PRK05340 UDP-2,3-diacylglucosamine hydrolase; Provisional Probab=30.93 E-value=45 Score=15.21 Aligned_cols=30 Identities=10% Similarity=0.103 Sum_probs=22.1 Q ss_pred HHHHHHHHHHHHHHCCCCCEEEEEECCCCC Q ss_conf 999999999998511699889996147776 Q gi|254780995|r 4 QQGEWLKKWADQKIKTGIPFVIAGDFNRKI 33 (125) Q Consensus 4 ~Q~~~L~~~i~~~~~~~~pvIi~GDFN~~~ 33 (125) +.++.|.+|++.......-+.|+||+=+.. T Consensus 17 ~~~~~l~~fL~~~~~~~~~LyllGDifD~W 46 (240) T PRK05340 17 AITAAFLRFLRGEARQADALYILGDLFEAW 46 (240) T ss_pred HHHHHHHHHHHHHCCCCCEEEEECCEEEEE T ss_conf 799999999985315588899934577576 No 32 >cd02854 Glycogen_branching_enzyme_like_N_term Glycogen branching enzyme-like N-terminus domain. Glycogen branching enzyme (AKA 1,4 alpha glucan branching enzyme) catalyzes the formation of alpha-1,6 branch points in either glycogen or starch by cleavage of the alpha-1,4 glucosidic linkage yielding a non-reducing end oligosaccharide chain and subsequent attachment to the alpha-1,6 position. By increasing the number of non-reducing ends glycogen is more reactive to synthesis and digestion as well as being more soluble. The N-terminus of the glycogen branching enzyme-like proteins may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitobi Probab=29.51 E-value=31 Score=16.07 Aligned_cols=16 Identities=25% Similarity=0.298 Sum_probs=11.8 Q ss_pred CCCCEEEEEECCCCCC Q ss_conf 6998899961477768 Q gi|254780995|r 19 TGIPFVIAGDFNRKIN 34 (125) Q Consensus 19 ~~~pvIi~GDFN~~~~ 34 (125) +-..|-|.||||..-. T Consensus 15 nA~~v~liGdFN~W~~ 30 (99) T cd02854 15 NAEEVYLIGDFNNWDR 30 (99) T ss_pred CCCEEEEEEECCCCCC T ss_conf 8849999812778688 No 33 >pfam11288 DUF3089 Protein of unknown function (DUF3089). This family of proteins has no known function. Probab=26.93 E-value=53 Score=14.82 Aligned_cols=20 Identities=25% Similarity=0.534 Sum_probs=14.4 Q ss_pred HHHHHHHHHHCCCCCEEEEEE Q ss_conf 999999985116998899961 Q gi|254780995|r 8 WLKKWADQKIKTGIPFVIAGD 28 (125) Q Consensus 8 ~L~~~i~~~~~~~~pvIi~GD 28 (125) +...|+ ++.++|.|+||+|- T Consensus 83 AF~~yl-~~~n~grPfILagH 102 (201) T pfam11288 83 AFDAYL-ANDNGGRPFILAGH 102 (201) T ss_pred HHHHHH-HHCCCCCCEEEEEE T ss_conf 999999-96599996799983 No 34 >COG2908 Uncharacterized protein conserved in bacteria [Function unknown] Probab=26.74 E-value=54 Score=14.80 Aligned_cols=36 Identities=8% Similarity=0.079 Sum_probs=26.1 Q ss_pred HHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCC Q ss_conf 999999999998511699889996147776883100 Q gi|254780995|r 4 QQGEWLKKWADQKIKTGIPFVIAGDFNRKINSIGDT 39 (125) Q Consensus 4 ~Q~~~L~~~i~~~~~~~~pvIi~GDFN~~~~~~~~~ 39 (125) -+++.+.++++.+.+....+.++||+=+..-..+++ T Consensus 14 ~~t~~fl~Fl~~~a~~ad~lyilGDifd~w~g~~~~ 49 (237) T COG2908 14 ALTAFFLDFLREEAAQADALYILGDIFDGWIGDDEP 49 (237) T ss_pred HHHHHHHHHHHHCCCCCCEEEEECHHHHHHHCCCCC T ss_conf 789999999983535675799950016666268760 No 35 >pfam05193 Peptidase_M16_C Peptidase M16 inactive domain. Peptidase M16 consists of two structurally related domains. One is the active peptidase, whereas the other is inactive. The two domains hold the substrate like a clamp. Probab=25.96 E-value=55 Score=14.72 Aligned_cols=26 Identities=23% Similarity=0.381 Sum_probs=20.2 Q ss_pred HHHHHHHHHHHHC-CCCCEEEEEECCC Q ss_conf 9999999998511-6998899961477 Q gi|254780995|r 6 GEWLKKWADQKIK-TGIPFVIAGDFNR 31 (125) Q Consensus 6 ~~~L~~~i~~~~~-~~~pvIi~GDFN~ 31 (125) .+.|++|.+..+. .+.-++|+|||+. T Consensus 4 ~e~l~~fy~~~Y~p~n~~l~i~Gdi~~ 30 (182) T pfam05193 4 REDLKDFYKKHYSPDNMVLVVVGDVDL 30 (182) T ss_pred HHHHHHHHHHHCCCCCEEEEEECCCCH T ss_conf 899999999847820679999839999 No 36 >TIGR02815 agaS_fam putative sugar isomerase, AgaS family; InterPro: IPR014180 Some members of this protein family are found in genic regions associated with N-acetyl-galactosamine and galactosamine utilisation and are suggested to be isomerases.. Probab=25.40 E-value=57 Score=14.66 Aligned_cols=27 Identities=11% Similarity=0.388 Sum_probs=19.7 Q ss_pred CHHHHHHHHHHHHHHHH-C--CCCCEEEEE Q ss_conf 97899999999999851-1--699889996 Q gi|254780995|r 1 MLSQQGEWLKKWADQKI-K--TGIPFVIAG 27 (125) Q Consensus 1 ~l~~Q~~~L~~~i~~~~-~--~~~pvIi~G 27 (125) +++++...|+.|++... + .+..|||+| T Consensus 22 i~q~~~~a~~aFL~PLLa~Pe~~LRIvLTG 51 (374) T TIGR02815 22 IIQAERPALEAFLEPLLARPEEDLRIVLTG 51 (374) T ss_pred HHHHHHHHHHHHHHHHHCCCCCCCEEEEEC T ss_conf 888899998887677760856477589856 No 37 >TIGR01931 cysJ sulfite reductase [NADPH] flavoprotein, alpha-component; InterPro: IPR010199 This entry describes an NADPH-dependent sulphite reductase flavoprotein subunit. Most members of the proteins of this entry are found in Cys biosynthesis gene clusters. The closest homologues are designated as subunits nitrate reductase.; GO: 0004783 sulfite reductase (NADPH) activity, 0000103 sulfate assimilation, 0019344 cysteine biosynthetic process. Probab=24.85 E-value=58 Score=14.60 Aligned_cols=28 Identities=18% Similarity=0.206 Sum_probs=18.9 Q ss_pred HHHHHHHHHHHHHHCCCCCEEEEEECCCC Q ss_conf 99999999999851169988999614777 Q gi|254780995|r 4 QQGEWLKKWADQKIKTGIPFVIAGDFNRK 32 (125) Q Consensus 4 ~Q~~~L~~~i~~~~~~~~pvIi~GDFN~~ 32 (125) +-|+.|.+.+++. .-...+|=|+|||.. T Consensus 83 ~~A~~l~~~l~~~-g~~V~l~~~~dYk~k 110 (628) T TIGR01931 83 RLAKALAEKLEAA-GLSVRLVSADDYKFK 110 (628) T ss_pred HHHHHHHHHHHHC-CCEEEEEECCCCCCC T ss_conf 9999999999865-983889610268832 No 38 >cd02859 AMPKbeta_GBD_like AMP-activated protein kinase (AMPK) beta subunit glycogen binding domain (GBD). AMPK is a metabolic stress sensing protein that senses AMP/ATP and has recently been found to act as a glycogen sensor as well. The protein functions as a alpha-beta-gamma heterotrimer. This domain is the glycogen binding domain of the beta subunit. Probab=24.18 E-value=40 Score=15.47 Aligned_cols=15 Identities=20% Similarity=0.370 Sum_probs=12.0 Q ss_pred CCCCEEEEEECCCCC Q ss_conf 699889996147776 Q gi|254780995|r 19 TGIPFVIAGDFNRKI 33 (125) Q Consensus 19 ~~~pvIi~GDFN~~~ 33 (125) .+.-|.|+|+||.+- T Consensus 11 ~g~~V~v~Gsf~~W~ 25 (79) T cd02859 11 GGKEVYVTGSFDNWK 25 (79) T ss_pred CCEEEEEEEEECCCC T ss_conf 998999999878999 No 39 >COG1640 MalQ 4-alpha-glucanotransferase [Carbohydrate transport and metabolism] Probab=22.94 E-value=64 Score=14.40 Aligned_cols=39 Identities=18% Similarity=0.107 Sum_probs=24.1 Q ss_pred HHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCCCHHHHH Q ss_conf 78999999999998511699889996147776883100278763 Q gi|254780995|r 2 LSQQGEWLKKWADQKIKTGIPFVIAGDFNRKINSIGDTDDFWQK 45 (125) Q Consensus 2 l~~Q~~~L~~~i~~~~~~~~pvIi~GDFN~~~~~~~~~~~~~~~ 45 (125) ..+|.+.++.++.+ ..+.|.||.=+.+......-+..+. T Consensus 208 ~~~Q~~~~k~~A~~-----~~I~i~gDLpv~va~~saDvW~~~~ 246 (520) T COG1640 208 FFRQLAALKRYAND-----MGIGIIGDLPVGVAQDSADVWANPE 246 (520) T ss_pred HHHHHHHHHHHHHH-----CCCEEEECCCCEECCCCHHHHCCCC T ss_conf 99999999999984-----6924751244325677252421731 No 40 >cd02861 E_set_proteins_like E or "early" set-like proteins. These alpha amylase-like sugar utilizing enzymes which may be related to the immunoglobulin and/or fibronectin type III superfamilies are associated with different types of catalytic domains at either the N-terminal or C-terminal end. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitobiase, and chitinase. Probab=22.06 E-value=52 Score=14.85 Aligned_cols=15 Identities=27% Similarity=0.293 Sum_probs=11.5 Q ss_pred CCCCEEEEEECCCCC Q ss_conf 699889996147776 Q gi|254780995|r 19 TGIPFVIAGDFNRKI 33 (125) Q Consensus 19 ~~~pvIi~GDFN~~~ 33 (125) ...-|-|+|+||.+- T Consensus 12 ~a~~V~v~GsFn~W~ 26 (82) T cd02861 12 EADSVYLAGSFNNWN 26 (82) T ss_pred CCCEEEEEEECCCCC T ss_conf 999999984258606 No 41 >TIGR01734 D-ala-DACP-lig D-alanine-activating enzyme; InterPro: IPR010072 This entry represents the enzyme (also called D-alanine-D-alanyl carrier protein ligase) which activates D-alanine as an adenylate via the reaction D-ala + ATP to D-ala-AMP + PPi, and further catalyses the condensation of the amino acid adenylate with the D-alanyl carrier protein (D-ala-ACP). The D-alanine is then further transferred to teichoic acid in the biosynthesis of lipoteichoic acid (LTA) and wall teichoic acid (WTA) in Gram-positive bacteria, both polysacchatides .; GO: 0016208 AMP binding, 0047473 D-alanine-poly(phosphoribitol) ligase activity, 0019350 teichoic acid biosynthetic process. Probab=21.97 E-value=67 Score=14.29 Aligned_cols=29 Identities=17% Similarity=0.479 Sum_probs=24.3 Q ss_pred HHHHHHHHHHHHHHH-HCCCC-CEEEEEECC Q ss_conf 789999999999985-11699-889996147 Q gi|254780995|r 2 LSQQGEWLKKWADQK-IKTGI-PFVIAGDFN 30 (125) Q Consensus 2 l~~Q~~~L~~~i~~~-~~~~~-pvIi~GDFN 30 (125) |++|.+.|...|.++ +.... |+||=|--- T Consensus 31 L~~~Sd~la~~i~~~~l~~k~kPiivfG~~~ 61 (513) T TIGR01734 31 LKEQSDRLAAFIQERLLPEKEKPIIVFGHME 61 (513) T ss_pred HHHHHHHHHHHHHHCCCCCCCCCEEEECCCC T ss_conf 9999999999998605766677578864886 No 42 >TIGR01515 branching_enzym 1,4-alpha-glucan branching enzyme; InterPro: IPR006407 This entry represents the core region of the glycogen branching enzymes, which are responsible for the transfer of chains of approximately seven alpha(1,4)-linked glucosyl residues to other similar chains (in new alpha-(1,6) linkages) in the biosynthesis of glycogen . The branching enzyme is responsible for the degree of alpha(1,6) branch linkages found in polysaccharides .; GO: 0003844 14-alpha-glucan branching enzyme activity, 0005978 glycogen biosynthetic process. Probab=21.34 E-value=69 Score=14.21 Aligned_cols=38 Identities=18% Similarity=0.153 Sum_probs=20.6 Q ss_pred CCCCEEEEEECCCCCCCCCCCCHHHHHCCCCCCCHHHCCCCC Q ss_conf 699889996147776883100278763265541000025767 Q gi|254780995|r 19 TGIPFVIAGDFNRKINSIGDTDDFWQKMDPDGLLIRFPQEKE 60 (125) Q Consensus 19 ~~~pvIi~GDFN~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~ 60 (125) +-..|=|+||||.+-+..-. ++.+.+.+...+..++.. T Consensus 38 nA~~V~v~GdFn~WdG~~h~----Mr~~~~~GvWeLFiPg~~ 75 (608) T TIGR01515 38 NAREVRVVGDFNYWDGREHP----MRKRNDNGVWELFIPGVG 75 (608) T ss_pred CCCCCEEEEECCCCCCCCCC----CCCCCCCCEEEECCCCCC T ss_conf 64711287645763221155----420278722551177654 Done!