Query gi|254780797|ref|YP_003065210.1| hypothetical protein CLIBASIA_03440 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 231 No_of_seqs 122 out of 2316 Neff 10.3 Searched_HMMs 39220 Date Sun May 29 19:32:32 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780797.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 TIGR00195 exoDNase_III exodeox 99.9 2.1E-23 5.3E-28 143.2 12.1 160 7-190 1-168 (281) 2 TIGR00633 xth exodeoxyribonucl 99.9 1.4E-22 3.6E-27 138.7 13.4 157 7-190 1-165 (279) 3 TIGR03395 sphingomy sphingomye 99.9 3.5E-22 8.9E-27 136.7 14.3 190 7-210 1-206 (283) 4 PRK11756 exonuclease III; Prov 99.9 6.5E-21 1.7E-25 129.9 16.0 153 7-190 1-155 (268) 5 COG3568 ElsH Metal-dependent h 99.9 1.3E-20 3.3E-25 128.3 12.8 160 6-196 9-182 (259) 6 COG0708 XthA Exonuclease III [ 99.8 4.9E-19 1.3E-23 119.9 15.5 151 7-191 1-153 (261) 7 pfam03372 Exo_endo_phos Endonu 99.8 8E-19 2E-23 118.8 13.0 156 7-193 1-165 (255) 8 KOG3873 consensus 99.8 1.6E-18 4.2E-23 117.1 10.8 209 2-226 4-236 (422) 9 KOG2756 consensus 99.8 2E-17 5E-22 111.4 15.8 190 3-222 96-290 (349) 10 PRK13911 hypothetical protein; 99.8 5.8E-18 1.5E-22 114.2 12.2 147 7-190 1-148 (250) 11 KOG2338 consensus 99.8 3.7E-17 9.4E-22 109.9 14.6 175 6-193 116-307 (495) 12 smart00476 DNaseIc deoxyribonu 99.8 1.5E-16 3.8E-21 106.7 17.6 193 1-225 12-221 (276) 13 COG2374 Predicted extracellula 99.6 1.1E-14 2.9E-19 96.7 14.0 187 3-189 460-685 (798) 14 PRK05421 hypothetical protein; 99.6 2.1E-14 5.3E-19 95.3 11.2 157 3-193 30-187 (254) 15 smart00128 IPPc Inositol polyp 99.5 4.5E-13 1.1E-17 88.1 12.4 165 6-193 4-196 (310) 16 PTZ00297 pantothenate kinase; 99.2 5.3E-10 1.4E-14 71.8 10.8 174 4-191 8-208 (1452) 17 KOG0566 consensus 99.1 1.3E-09 3.3E-14 69.7 12.2 160 6-191 538-727 (1080) 18 COG5239 CCR4 mRNA deadenylase, 99.1 8.7E-10 2.2E-14 70.7 9.3 75 6-81 30-109 (378) 19 KOG0620 consensus 98.9 3.7E-09 9.5E-14 67.3 6.6 74 6-79 19-95 (361) 20 COG3021 Uncharacterized protei 98.8 1.2E-08 3.2E-13 64.5 7.2 133 38-194 104-238 (309) 21 COG5411 Phosphatidylinositol 5 98.2 1.3E-05 3.4E-10 48.4 8.3 98 84-191 118-219 (460) 22 KOG1976 consensus 97.7 0.00042 1.1E-08 40.4 9.4 74 117-194 155-233 (391) 23 PTZ00312 inositol-1,4,5-tripho 97.0 0.0066 1.7E-07 34.0 8.3 72 116-191 65-141 (356) 24 KOG0565 consensus 95.3 0.046 1.2E-06 29.5 5.3 77 109-194 37-124 (145) 25 KOG1294 consensus 90.8 0.4 1E-05 24.5 4.3 56 129-189 165-223 (335) 26 KOG1857 consensus 81.0 1.5 3.9E-05 21.4 2.9 59 130-191 434-492 (623) 27 cd07587 ML_beta-AS mammalian-l 58.6 12 0.0003 16.7 3.3 23 36-58 91-113 (363) 28 pfam00952 Bunya_nucleocap Buny 49.6 17 0.00044 15.8 5.4 28 123-150 55-82 (228) 29 COG3459 Cellobiose phosphoryla 47.2 15 0.00039 16.1 2.3 43 162-204 716-759 (1056) 30 TIGR02104 pulA_typeI pullulana 45.2 20 0.00052 15.4 6.2 26 31-56 344-369 (655) 31 cd02861 E_set_proteins_like E 38.9 14 0.00037 16.2 1.2 13 179-191 14-26 (82) 32 KOG0808 consensus 34.5 31 0.00078 14.5 3.1 23 36-58 101-123 (387) 33 cd02854 Glycogen_branching_enz 34.4 24 0.00061 15.1 1.7 13 180-192 18-30 (99) 34 pfam02562 PhoH PhoH-like prote 32.7 30 0.00076 14.5 2.0 62 166-229 135-197 (205) 35 cd02859 AMPKbeta_GBD_like AMP- 32.4 24 0.00061 15.1 1.4 13 178-190 12-24 (79) 36 pfam01963 TraB TraB family. pA 32.1 34 0.00086 14.3 3.2 27 33-60 15-41 (225) 37 TIGR00216 ispH_lytB 4-hydroxy- 32.1 34 0.00086 14.3 4.9 12 178-189 264-275 (354) 38 cd02855 Glycogen_branching_enz 27.7 40 0.001 13.8 3.0 28 181-208 35-65 (106) 39 TIGR01485 SPP_plant-cyano sucr 26.6 42 0.0011 13.7 2.9 50 121-188 149-199 (257) 40 PRK10438 hypothetical protein; 25.4 45 0.0011 13.6 12.6 28 31-58 16-44 (256) 41 PRK05340 UDP-2,3-diacylglucosa 22.1 52 0.0013 13.2 3.8 16 176-191 185-200 (240) 42 PRK11340 phosphodiesterase Yae 21.4 54 0.0014 13.2 3.3 24 33-56 64-87 (270) 43 TIGR01420 pilT_fam twitching m 20.6 56 0.0014 13.1 4.6 32 39-70 94-126 (350) No 1 >TIGR00195 exoDNase_III exodeoxyribonuclease III; InterPro: IPR000097 DNA damaging agents such as the antitumor drugs bleomycin and neocarzinostatin or those that generate oxygen radicals produce a variety of lesions in DNA. Amongst these is base-loss which forms apurinic/apyrimidinic (AP) sites or strand breaks with atypical 3' termini. DNA repair at the AP sites is initiated by specific endonuclease cleavage of the phosphodiester backbone. Such endonucleases are also generally capable of removing blocking groups from the 3' terminus of DNA strand breaks. AP endonucleases can be classified into two families based on sequence similarity. This family contains members of AP endonuclease family 1. Except for Rrp1 and arp, these enzymes are proteins of about 300 amino-acid residues. Rrp1 and arp both contain additional and unrelated sequences in their N-terminal section (about 400 residues for Rrp1 and 270 for arp). The proteins contain glutamate which has been shown , in the Escherichia coli enzyme to bind a divalent metal ion such as magnesium or manganese.; GO: 0003677 DNA binding, 0004519 endonuclease activity, 0006281 DNA repair, 0005622 intracellular. Probab=99.90 E-value=2.1e-23 Score=143.20 Aligned_cols=160 Identities=17% Similarity=0.227 Sum_probs=111.0 Q ss_pred EEEEEEEHHCCCCCCCCCCCCCCCCCHHHHHHHH--HHHHHHCCCCEEEEECCC--CHHHHHHHHCCCCCEEEEEECCCC Q ss_conf 8998873100767775566676666879999999--999975189899997279--988999872623221677403578 Q gi|254780797|r 7 IRIASWNINNLSEKSGVALFKNSVIREDNDYALL--QKYAEQLDADIVCLQEIG--SYEAIKRVFPNDKWDILYSGSNTD 82 (231) Q Consensus 7 lrv~twNi~~~~~~~~~~~~~~~~~~~~~~~~~i--~~~i~~~~~DVi~LQEv~--~~~~~~~~~~~~~~~~~~~~~~~~ 82 (231) |+|+||||+.+. ++++.+ .+++++.+|||+||||.. +.++-.......+|..++.+...+ T Consensus 1 mK~iSwNVNGlR----------------Ar~~~~dFl~~~~~~~pDilClQEtKv~~e~~P~~~~~~~GYh~f~~~~~~K 64 (281) T TIGR00195 1 MKLISWNVNGLR----------------ARLHKGDFLDWLKENKPDILCLQETKVADEQFPRELFEEEGYHVFFSGAKKK 64 (281) T ss_pred CCEEEECCCHHH----------------HHCCHHHHHHHHCCCCCCEEECCCCCCCCCCCCHHHHHHCCCEEEECCCEEE T ss_conf 955786022355----------------3300435888842389958732010145667787899764981787364233 Q ss_pred CCCCEEEEEEECCCCEEE-EEECCCCCCC--CCCCCCCCCEEEEEEEEECCEEEEEEEEEECCCCCCCCCCCHHHHHHHH Q ss_conf 765035688614641010-0001221467--5655445750499999849978999997511688887543002545788 Q gi|254780797|r 83 KHAMHTAIVIRKGAIHLL-QKSYLPMDTE--GLDSKAGKRRAVEILFEVDGRKIWLLDIHLKSFCFLDSIEDSYISSCYM 159 (231) Q Consensus 83 ~~~~~~g~~~~~~~~~~~-~~~~~~~~~~--~~~~~~~~r~~~~~~~~~~g~~~~v~n~HL~s~~~~~~~~~~~~~~~~~ 159 (231) + |.|+++.++..+.. ....++.+.. ........+..+.+.++.--+.++|+|++++.|-+ ......... T Consensus 65 G---y~GVA~~sk~EP~~dV~~G~~~e~~PaltGdlD~e~R~I~a~~~~~~G~~Tv~N~Y~PnGES-----~~~~~k~~Y 136 (281) T TIGR00195 65 G---YSGVAIFSKEEPLSDVRYGFGVEEEPALTGDLDAEGRVITAEFDGFLGLLTVVNGYFPNGES-----RLDSEKLPY 136 (281) T ss_pred C---CCCEEEEECCCCHHHHHCCCCCCCCCCCCCCCCCCCCEEEEEECCCEEEEEEEEEEECCCCC-----CCHHHHHHH T ss_conf 6---43079986268858860367887765332761216717999757830248999866559843-----001455788 Q ss_pred HHHHHHHHHHHHHHHCCCCCC-EEEEEECCCC Q ss_conf 754899999999974048878-9999136888 Q gi|254780797|r 160 LNLQATWLKQWVDQKNNLNMP-FIIAGDFNRK 190 (231) Q Consensus 160 ~~~q~~~l~~~i~~~~~~~~~-vii~GDFN~~ 190 (231) +..-.+.|.+++++....+.| ||||||||.. T Consensus 137 Kl~w~~~l~~yl~~~~~~~~p~v~~CGD~NiA 168 (281) T TIGR00195 137 KLQWLEALQNYLEKLRDKDKPEVIICGDFNIA 168 (281) T ss_pred HHHHHHHHHHHHHHHHHCCCCEEEEECCCCCC T ss_conf 99999999999999863689889997677889 No 2 >TIGR00633 xth exodeoxyribonuclease III (xth); InterPro: IPR004808 All proteins in this family for which functions are known are 5' AP endonucleases that function in base excision repair and the repair of abasic sites in DNA.; GO: 0004518 nuclease activity, 0006281 DNA repair. Probab=99.90 E-value=1.4e-22 Score=138.74 Aligned_cols=157 Identities=17% Similarity=0.238 Sum_probs=105.6 Q ss_pred EEEEEEEHHCCCCCCCCCCCCCCCCCHHHHHHHH--HHHHHHCCCCEEEEECCC--CHHHHHHHHCCCCCEEEEEECCCC Q ss_conf 8998873100767775566676666879999999--999975189899997279--988999872623221677403578 Q gi|254780797|r 7 IRIASWNINNLSEKSGVALFKNSVIREDNDYALL--QKYAEQLDADIVCLQEIG--SYEAIKRVFPNDKWDILYSGSNTD 82 (231) Q Consensus 7 lrv~twNi~~~~~~~~~~~~~~~~~~~~~~~~~i--~~~i~~~~~DVi~LQEv~--~~~~~~~~~~~~~~~~~~~~~~~~ 82 (231) |||+||||+.+. +.+..+ .+.|..++|||+||||.. +...........+|..++.....+ T Consensus 1 mKiiSWNVNGlR----------------A~~~~~sfL~~l~~~~pDvLCLQETK~~~~~~p~~~~~~~GY~~~~~~~~~k 64 (279) T TIGR00633 1 MKIISWNVNGLR----------------ARLKKLSFLDWLKEENPDVLCLQETKVADEQFPKELLEELGYHVFFHGAKKK 64 (279) T ss_pred CCEEEEECCHHH----------------HHCCHHHHHHHHHHCCCCEEEEEEECCCCCCCCHHHHHCCCCCEEEEECCCC T ss_conf 916887112344----------------2135112888986538978762200278776634666407981588414665 Q ss_pred CCCCEEEEEEECCCCEEEEEECCCC-CCCCCCCCCCCCEEEEEEEEECCEEEEEEEEEECCCCCCCCCCCHHHH-HHHHH Q ss_conf 7650356886146410100001221-467565544575049999984997899999751168888754300254-57887 Q gi|254780797|r 83 KHAMHTAIVIRKGAIHLLQKSYLPM-DTEGLDSKAGKRRAVEILFEVDGRKIWLLDIHLKSFCFLDSIEDSYIS-SCYML 160 (231) Q Consensus 83 ~~~~~~g~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~r~~~~~~~~~~g~~~~v~n~HL~s~~~~~~~~~~~~~-~~~~~ 160 (231) . ..|.|+++.++..+......++. +.. ....|....+.+...=..|.|+|++++.+...+ .. ....| T Consensus 65 k-~Gy~GVA~~Sk~~P~~V~~G~g~~~~~----D~egR~~i~~~i~a~f~~f~v~n~YvPNg~~~~------~~~rl~Yk 133 (279) T TIGR00633 65 K-KGYSGVAILSKVEPLSVRYGFGVLEEH----DEEGRVIIMRVITAEFDGFYVVNVYVPNGGARG------LETRLEYK 133 (279) T ss_pred C-CCEEEEEEEECCCCCEEEECCCCCCCC----CCCCCEEEEEEEEEECCCEEEEEEEECCCCCCC------CCCCHHHH T ss_conf 8-851368998601632022068788721----037746533310101054799987766861136------54462788 Q ss_pred HHHHHHHHHHHHHHCCC--CCCEEEEEECCCC Q ss_conf 54899999999974048--8789999136888 Q gi|254780797|r 161 NLQATWLKQWVDQKNNL--NMPFIIAGDFNRK 190 (231) Q Consensus 161 ~~q~~~l~~~i~~~~~~--~~~vii~GDFN~~ 190 (231) ....+.+.+...+.... +.|||||||||.- T Consensus 134 ~~~w~~~~~~~~~~l~~~~~~pv~~cGD~NvA 165 (279) T TIGR00633 134 LDFWEALLQYYLKELDASDGKPVIICGDLNVA 165 (279) T ss_pred HHHHHHHHHHHHHHHHHCCCCCEEEECCCCCC T ss_conf 89999999999999861799979998477889 No 3 >TIGR03395 sphingomy sphingomyelin phosphodiesterase. Members of this family are bacterial proteins that act as sphingomyelin phosphodiesterase (EC 3.1.4.12), also called sphingomyelinase. Some members of this family have been shown to act as hemolysins. Probab=99.89 E-value=3.5e-22 Score=136.67 Aligned_cols=190 Identities=16% Similarity=0.157 Sum_probs=115.4 Q ss_pred EEEEEEEHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHCCCC--CEEEEEECC--CC Q ss_conf 89988731007677755666766668799999999999751898999972799889998726232--216774035--78 Q gi|254780797|r 7 IRIASWNINNLSEKSGVALFKNSVIREDNDYALLQKYAEQLDADIVCLQEIGSYEAIKRVFPNDK--WDILYSGSN--TD 82 (231) Q Consensus 7 lrv~twNi~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~DVi~LQEv~~~~~~~~~~~~~~--~~~~~~~~~--~~ 82 (231) ||||||||..+.... +++ ..+.++.+.|++..--.+.|||+|||+++......++.... |.|....-+ .. T Consensus 1 lKIlsyNv~~lp~~l----~~n--~~q~~Ra~~I~~~~~i~~~DVvvlqEvFd~~a~~~l~~~L~~~YpY~T~vvg~~~~ 74 (283) T TIGR03395 1 IKILSHNVYMLSTNL----YPN--WGQMERADLIASADYIKNQDVVILNEAFDTSASKRLLDNLREEYPYQTDVIGRSKK 74 (283) T ss_pred CEEEEEEEEECCHHH----CCC--CCHHHHHHHHCCHHHCCCCCEEEEEHHCCHHHHHHHHHHHHHHCCCCCCCCCCCCC T ss_conf 927887703257101----378--77789998744142136898899961228689999999788638841542476666 Q ss_pred C-----------CCCEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCEEEEEEEEECCEEEEEEEEEECCCCCCCCCCC Q ss_conf 7-----------65035688614641010000122146756554457504999998499789999975116888875430 Q gi|254780797|r 83 K-----------HAMHTAIVIRKGAIHLLQKSYLPMDTEGLDSKAGKRRAVEILFEVDGRKIWLLDIHLKSFCFLDSIED 151 (231) Q Consensus 83 ~-----------~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~g~~~~v~n~HL~s~~~~~~~~~ 151 (231) + ....+|+++.++........ ..............++++++.+..+|+.++|++|||.|....- T Consensus 75 gwd~t~g~~~~~~~~~gGv~I~Sk~PI~~~~~-~if~~~~g~D~~a~KG~~y~kI~k~gk~~hv~~THlqa~d~~~---- 149 (283) T TIGR03395 75 GWDKTLGNYSSSALEDGGVAIVSKWPIEEKIQ-YIFNKGCGADNLSNKGFAYVKINKNGKKFHVIGTHLQAQDSMC---- 149 (283) T ss_pred CCCCCCCCCCCCCCCCCEEEEEECCCCEEEEE-EEECCCCCCHHHHCCCEEEEEEEECCEEEEEEECCCCCCCCCC---- T ss_conf 76456676556613177489996787103468-9865774731201157089999879979999981446578765---- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH-CCCCCCEEEEEECCCCCCCCCCCCCCHHHHHCCCCC Q ss_conf 02545788754899999999974-048878999913688846222222767786303468 Q gi|254780797|r 152 SYISSCYMLNLQATWLKQWVDQK-NNLNMPFIIAGDFNRKINHSHSGIKDELWQKINQDN 210 (231) Q Consensus 152 ~~~~~~~~~~~q~~~l~~~i~~~-~~~~~~vii~GDFN~~~~~~~~~~~~~~~~~~~~~~ 210 (231) ........|..|++++++++.+. ...+++||++||||.+-.+. .+..+...++.+. T Consensus 150 ~~~~~~~~R~~Q~~ei~~fi~~~~ip~~e~vi~aGDfNV~k~s~---EY~~Ml~~Ln~~~ 206 (283) T TIGR03395 150 SKLGPASIRANQLNEIQDFIDSKNIPKDETVLIGGDLNVNKGSN---EYHDMFKTLNVSE 206 (283) T ss_pred CCCHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCH---HHHHHHHHCCCCC T ss_conf 54115899999999999999855999897189940144268987---8999999728999 No 4 >PRK11756 exonuclease III; Provisional Probab=99.87 E-value=6.5e-21 Score=129.89 Aligned_cols=153 Identities=17% Similarity=0.200 Sum_probs=97.7 Q ss_pred EEEEEEEHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCH--HHHHHHHCCCCCEEEEEECCCCCC Q ss_conf 899887310076777556667666687999999999997518989999727998--899987262322167740357876 Q gi|254780797|r 7 IRIASWNINNLSEKSGVALFKNSVIREDNDYALLQKYAEQLDADIVCLQEIGSY--EAIKRVFPNDKWDILYSGSNTDKH 84 (231) Q Consensus 7 lrv~twNi~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~DVi~LQEv~~~--~~~~~~~~~~~~~~~~~~~~~~~~ 84 (231) |||+||||+.+ | +++..+..++...+|||+||||+... .+....+...+|..++.. . T Consensus 1 MkI~SwNVNgi--------------R--ar~~~l~~~l~~~~pDIlcLQEtK~~~~~~p~~~~~~~gY~~~~~~-----~ 59 (268) T PRK11756 1 MKFVSFNINGL--------------R--ARPHQLEAIIEKHQPDVIGLQETKVHDEMFPLEEVEALGYHVFYHG-----Q 59 (268) T ss_pred CEEEEEEEHHH--------------H--HHHHHHHHHHHHCCCCEEEEECCCCCHHHCCHHHHHHCCCEEEEEC-----C T ss_conf 98999986668--------------6--7588899999777998999987738811289889852695899961-----6 Q ss_pred CCEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCEEEEEEEEECCEEEEEEEEEECCCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 50356886146410100001221467565544575049999984997899999751168888754300254578875489 Q gi|254780797|r 85 AMHTAIVIRKGAIHLLQKSYLPMDTEGLDSKAGKRRAVEILFEVDGRKIWLLDIHLKSFCFLDSIEDSYISSCYMLNLQA 164 (231) Q Consensus 85 ~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~g~~~~v~n~HL~s~~~~~~~~~~~~~~~~~~~~q~ 164 (231) ..+.|+++.++.........++ ........| .+.+++..+.+.+.|+|+++++|...+. ......+..-. T Consensus 60 KgysGVAi~sk~~p~~v~~g~~----~~~~d~egR-~i~~~~~~~~g~~~lin~Y~PnG~~~~~-----~~k~~~K~~f~ 129 (268) T PRK11756 60 KGHYGVALLSKQTPIAVRKGFP----TDDEEAQRR-IIMATIPTPNGNLTVINGYFPQGESRDH-----PTKFPAKRQFY 129 (268) T ss_pred CCCCCEEEEECCCCHHEEECCC----CCCCCCCCC-EEEEEEECCCCCEEEEEEEECCCCCCCC-----HHHHHHHHHHH T ss_conf 7734446872367244586488----885444562-7999996377629999985048887772-----56778899999 Q ss_pred HHHHHHHHHHCCCCCCEEEEEECCCC Q ss_conf 99999999740488789999136888 Q gi|254780797|r 165 TWLKQWVDQKNNLNMPFIIAGDFNRK 190 (231) Q Consensus 165 ~~l~~~i~~~~~~~~~vii~GDFN~~ 190 (231) +++.+++++..+.+.|+|+|||||.- T Consensus 130 ~~l~~~l~~~~~~~k~vIi~GD~NVA 155 (268) T PRK11756 130 QDLQNYLETELSPDNPLLIMGDMNIS 155 (268) T ss_pred HHHHHHHHHHHCCCCCEEEECCCCCC T ss_conf 99999999976479958995643435 No 5 >COG3568 ElsH Metal-dependent hydrolase [General function prediction only] Probab=99.85 E-value=1.3e-20 Score=128.29 Aligned_cols=160 Identities=20% Similarity=0.199 Sum_probs=94.6 Q ss_pred EEEEEEEEHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCHHH--------HHHHHC-CCCCEEEE Q ss_conf 489988731007677755666766668799999999999751898999972799889--------998726-23221677 Q gi|254780797|r 6 RIRIASWNINNLSEKSGVALFKNSVIREDNDYALLQKYAEQLDADIVCLQEIGSYEA--------IKRVFP-NDKWDILY 76 (231) Q Consensus 6 ~lrv~twNi~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~DVi~LQEv~~~~~--------~~~~~~-~~~~~~~~ 76 (231) +++|+||||.++.+.. ...-+++.+++.+.+.++||+||||+..... ...... .....+.+ T Consensus 9 ~~~v~TyNih~~~~~~----------d~r~~~~r~~~~i~~~~~Div~LQEv~~~~~~~r~~~~~~~~~~~~~~~~~~~~ 78 (259) T COG3568 9 RFKVLTYNIHKGFGAF----------DRRFDLPRIAEVIREVGADIVALQEVDGAFGRHRDGLLDLPHLLGRLGLAPYWW 78 (259) T ss_pred EEEEEEEEEEECCCCC----------CCEECHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCHHHHHHHHHCCCCCCC T ss_conf 2378999988766766----------726447999999976436701464144455542224213678888750875210 Q ss_pred EECCC---CC-CCCEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCEEEEEEEEEC-CEEEEEEEEEECCCCCCCCCCC Q ss_conf 40357---87-6503568861464101000012214675655445750499999849-9789999975116888875430 Q gi|254780797|r 77 SGSNT---DK-HAMHTAIVIRKGAIHLLQKSYLPMDTEGLDSKAGKRRAVEILFEVD-GRKIWLLDIHLKSFCFLDSIED 151 (231) Q Consensus 77 ~~~~~---~~-~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~-g~~~~v~n~HL~s~~~~~~~~~ 151 (231) +.... .. ....++++++.. ........+... ....+|+++.+.+..+ |++++|+|+||.- T Consensus 79 ~~~~~~a~~~~~~~GnaiLS~~p---i~~v~~~~lp~~---~~~~~Rgal~a~~~~~~g~~l~V~~~HL~l--------- 143 (259) T COG3568 79 SGAAFGAVYGEGQHGNAILSRLP---IRDVENLALPDP---TGLEPRGALLAEIELPGGKPLRVINAHLGL--------- 143 (259) T ss_pred CHHHHHHHCCCCEEEEEEEECCC---CCCHHHCCCCCC---CCCCCCEEEEEEEECCCCCEEEEEEEECCC--------- T ss_conf 11344420010011159985376---422110369987---787775269999976899779999974163--------- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCCCCCCC Q ss_conf 025457887548999999999740488789999136888462222 Q gi|254780797|r 152 SYISSCYMLNLQATWLKQWVDQKNNLNMPFIIAGDFNRKINHSHS 196 (231) Q Consensus 152 ~~~~~~~~~~~q~~~l~~~i~~~~~~~~~vii~GDFN~~~~~~~~ 196 (231) ..+.|.+|++.|.+.+.- ....|+|+|||||+.+.++.+ T Consensus 144 ----~~~~R~~Q~~~L~~~~~l--~~~~p~vl~GDFN~~p~s~~y 182 (259) T COG3568 144 ----SEESRLRQAAALLALAGL--PALNPTVLMGDFNNEPGSAEY 182 (259) T ss_pred ----CHHHHHHHHHHHHHHCCC--CCCCCEEEECCCCCCCCCCCC T ss_conf ----387899999999863567--566863897137899887240 No 6 >COG0708 XthA Exonuclease III [DNA replication, recombination, and repair] Probab=99.83 E-value=4.9e-19 Score=119.89 Aligned_cols=151 Identities=18% Similarity=0.289 Sum_probs=96.2 Q ss_pred EEEEEEEHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCH--HHHHHHHCCCCCEEEEEECCCCCC Q ss_conf 899887310076777556667666687999999999997518989999727998--899987262322167740357876 Q gi|254780797|r 7 IRIASWNINNLSEKSGVALFKNSVIREDNDYALLQKYAEQLDADIVCLQEIGSY--EAIKRVFPNDKWDILYSGSNTDKH 84 (231) Q Consensus 7 lrv~twNi~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~DVi~LQEv~~~--~~~~~~~~~~~~~~~~~~~~~~~~ 84 (231) |||+||||+.+ .+++..+.+++.+.+|||+||||+... .+....+...+|..++..-. T Consensus 1 mkI~SwNVNgi----------------Rar~~~~~~~l~~~~pDVlclQEtK~~~~~fp~~~~~~~GY~~~~~~gq---- 60 (261) T COG0708 1 MKIASWNVNGL----------------RARLKKLLDWLEEEQPDVLCLQETKAQDEQFPREELEALGYHHVFNHGQ---- 60 (261) T ss_pred CEEEEEEHHHH----------------HHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCHHHHHHCCCEEEEECCC---- T ss_conf 90689830428----------------9899999999987299989997414676668976775279518995276---- Q ss_pred CCEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCEEEEEEEEECCEEEEEEEEEECCCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 50356886146410100001221467565544575049999984997899999751168888754300254578875489 Q gi|254780797|r 85 AMHTAIVIRKGAIHLLQKSYLPMDTEGLDSKAGKRRAVEILFEVDGRKIWLLDIHLKSFCFLDSIEDSYISSCYMLNLQA 164 (231) Q Consensus 85 ~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~g~~~~v~n~HL~s~~~~~~~~~~~~~~~~~~~~q~ 164 (231) ..++|+++.++.........++ ...........+.+++. + +.|+|+++++|..-...+ ...+..-. T Consensus 61 KgysGVailsr~~~~~v~~g~~----~~~~~d~e~R~I~a~~~--~--~~v~~~Y~PnG~~~~~~k------~~yKl~f~ 126 (261) T COG0708 61 KGYSGVAILSKKPPDDVRRGFP----GEEEDDEEGRVIEAEFD--G--FRVINLYFPNGSSIGLEK------FDYKLRFL 126 (261) T ss_pred CCCCEEEEEECCCCHHHHCCCC----CCCCCCCCCCEEEEEEC--C--EEEEEEECCCCCCCCCCC------HHHHHHHH T ss_conf 7734489998468302102799----87665434757999979--8--999999727999888710------67899999 Q ss_pred HHHHHHHHHHCCCCCCEEEEEECCCCC Q ss_conf 999999997404887899991368884 Q gi|254780797|r 165 TWLKQWVDQKNNLNMPFIIAGDFNRKI 191 (231) Q Consensus 165 ~~l~~~i~~~~~~~~~vii~GDFN~~~ 191 (231) +.+.+++.+....+.|+|+|||||.-+ T Consensus 127 ~~l~~~l~~l~~~~~~~vl~GD~NIap 153 (261) T COG0708 127 DALRNYLEELLKKGKPVVLCGDFNIAP 153 (261) T ss_pred HHHHHHHHHHHHCCCCEEEECCCCCCC T ss_conf 999999999874599879966631178 No 7 >pfam03372 Exo_endo_phos Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNase I proteins EC:3.1.21.1, Synaptojanin an inositol-1,4,5-trisphosphate phosphatase EC:3.1.3.56, Sphingomyelinase EC:3.1.4.12 and Nocturnin. Probab=99.81 E-value=8e-19 Score=118.76 Aligned_cols=156 Identities=17% Similarity=0.191 Sum_probs=87.6 Q ss_pred EEEEEEEHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHH--HCCCCCEE------EEEE Q ss_conf 899887310076777556667666687999999999997518989999727998899987--26232216------7740 Q gi|254780797|r 7 IRIASWNINNLSEKSGVALFKNSVIREDNDYALLQKYAEQLDADIVCLQEIGSYEAIKRV--FPNDKWDI------LYSG 78 (231) Q Consensus 7 lrv~twNi~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~DVi~LQEv~~~~~~~~~--~~~~~~~~------~~~~ 78 (231) |||+||||+.+....... ....+...+...+|||+||||+......... .....+.. .... T Consensus 1 l~i~t~Nv~g~~~~~~~~-----------~~~~l~~~~~~~~~Dii~LQE~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 69 (255) T pfam03372 1 LKILTWNVNGLSAALKLL-----------WARLLLELLKLEDPDVICLQETKLSPESLILLLLLLLGYVSPLTSSVYTGF 69 (255) T ss_pred CEEEEEEECCHHHHHHCC-----------CHHHHHHHHHHCCCCEEEEECCCCCCHHCCHHHHHHCCCCCCCCCCCEEEE T ss_conf 989999946721441046-----------678999999768998899981217901279757753587410034210355 Q ss_pred CCCCCCCCEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCEEEEEEEEECCEEEEEEEEEECCCCCCCCCCCHHHHHHH Q ss_conf 35787650356886146410100001221467565544575049999984997899999751168888754300254578 Q gi|254780797|r 79 SNTDKHAMHTAIVIRKGAIHLLQKSYLPMDTEGLDSKAGKRRAVEILFEVDGRKIWLLDIHLKSFCFLDSIEDSYISSCY 158 (231) Q Consensus 79 ~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~g~~~~v~n~HL~s~~~~~~~~~~~~~~~~ 158 (231) ..........+++++............ ..........+..++..+.|+|+|++++.. ... T Consensus 70 ~~~~~~~~Gvai~~r~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~~~~v~~~y~~~~~~----------~~~ 129 (255) T pfam03372 70 GGGGGGKGGVAILSKEPLVEVILGIFS----------ENEKDFIRRRILANGKSFVVVNTHLPAGGE----------NLD 129 (255) T ss_pred ECCCCCCCCEEEEECCCCCEEEECCCC----------CCCCCEEEEEEEEECCCEEEEEEECCCCCC----------CHH T ss_conf 314576252599964786224415787----------655727899999744989999997469997----------533 Q ss_pred HHHHHHHHHHHHHHHH-CCCCCCEEEEEECCCCCCC Q ss_conf 8754899999999974-0488789999136888462 Q gi|254780797|r 159 MLNLQATWLKQWVDQK-NNLNMPFIIAGDFNRKINH 193 (231) Q Consensus 159 ~~~~q~~~l~~~i~~~-~~~~~~vii~GDFN~~~~~ 193 (231) .|..|+..+.+.+... ...+.++|+|||||.++.. T Consensus 130 ~r~~~~~~l~~~l~~~~~~~~~~~ii~GDfN~~~~~ 165 (255) T pfam03372 130 ERLAQLAELLDFLSDLRIPKSDPVILCGDFNARPDS 165 (255) T ss_pred HHHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCH T ss_conf 799999999999998654359808997202436842 No 8 >KOG3873 consensus Probab=99.78 E-value=1.6e-18 Score=117.12 Aligned_cols=209 Identities=17% Similarity=0.082 Sum_probs=135.3 Q ss_pred CCCCEEEEEEEEHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHCCCC--CEEEEEEC Q ss_conf 8665489988731007677755666766668799999999999751898999972799889998726232--21677403 Q gi|254780797|r 2 ILAQRIRIASWNINNLSEKSGVALFKNSVIREDNDYALLQKYAEQLDADIVCLQEIGSYEAIKRVFPNDK--WDILYSGS 79 (231) Q Consensus 2 ~~~~~lrv~twNi~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~DVi~LQEv~~~~~~~~~~~~~~--~~~~~~~~ 79 (231) +.+-+|||+|.|+ |+-+ -.......++..+.+.+.++..||+.||||.+++....+..... |.|..... T Consensus 4 ~~a~~lriltlN~-Wgip--------~~Sk~R~~Rm~~~g~~l~~E~yDiv~LQEvWs~eD~~~L~~~~ss~yPysh~FH 74 (422) T KOG3873 4 MLALELRILTLNI-WGIP--------YVSKDRRHRMDAIGDELASEKYDIVSLQEVWSQEDFEYLQSGCSSVYPYSHYFH 74 (422) T ss_pred CHHHEEEEEEEEC-CCCC--------CCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHH T ss_conf 1322013467631-2564--------343014788987767776355211428988878789999974364574577640 Q ss_pred CCCCCCCEEEEEEECCCCEEEEEECCCCCCCCCCC----CCCCCEEEEEEEEECCEEEEEEEEEECCCCCCCCCCCHHHH Q ss_conf 57876503568861464101000012214675655----44575049999984997899999751168888754300254 Q gi|254780797|r 80 NTDKHAMHTAIVIRKGAIHLLQKSYLPMDTEGLDS----KAGKRRAVEILFEVDGRKIWLLDIHLKSFCFLDSIEDSYIS 155 (231) Q Consensus 80 ~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~r~~~~~~~~~~g~~~~v~n~HL~s~~~~~~~~~~~~~ 155 (231) +. ..+...+++++.++ .......+++...+... ....++.-...+..+|..+.++|+||++.+..... . T Consensus 75 SG-imGaGL~vfSK~PI-~~t~~~~y~lNG~p~~i~rGDWf~GK~Vgl~~l~~~g~~v~~yntHLHAeY~rq~D-----~ 147 (422) T KOG3873 75 SG-IMGAGLCVFSKHPI-LETLFHRYSLNGYPHAIHRGDWFGGKGVGLTVLLVGGRMVNLYNTHLHAEYDRQND-----E 147 (422) T ss_pred CC-CCCCCEEEEECCCH-HHHHHHCCCCCCCCCEEEECCCCCCCEEEEEEEEECCEEEEEEEHHCCCCCCCCCC-----H T ss_conf 24-55674587653750-65542001417861024305633563268999953988855120002562235676-----1 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCCCCC-CCCCC----CHHHHHCCCCC---CCEE----------CEE Q ss_conf 578875489999999997404887899991368884622-22227----67786303468---7102----------102 Q gi|254780797|r 156 SCYMLNLQATWLKQWVDQKNNLNMPFIIAGDFNRKINHS-HSGIK----DELWQKINQDN---TLMR----------LPH 217 (231) Q Consensus 156 ~~~~~~~q~~~l~~~i~~~~~~~~~vii~GDFN~~~~~~-~~~~~----~~~~~~~~~~~---~~~~----------~~~ 217 (231) -..+|..|+-.+.++|......+.-||++||||+.|.+- +..+. .+-|...+.+- ...+ -|. T Consensus 148 YL~HR~~QAwdlaqfi~~t~q~~~vVI~~GDLN~~P~dl~~~ll~~a~l~daw~~~h~~q~e~~~~r~s~~~~l~~g~tc 227 (422) T KOG3873 148 YLCHRVAQAWDLAQFIRATRQNADVVILAGDLNMQPQDLGHKLLLSAGLVDAWTSLHLDQCESDSFRLSEDKELVEGNTC 227 (422) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCEEEEECCCHHHHHHHHCHHHHCCCCCCCCHHHHHHCCCCC T ss_conf 14678898999999999974478689994677888666661244403004567653324312752244524455328765 Q ss_pred ECCCCEEEE Q ss_conf 035863522 Q gi|254780797|r 218 KKNHNAIRT 226 (231) Q Consensus 218 ~~~~n~~~~ 226 (231) +.+.||.++ T Consensus 228 d~~~N~y~~ 236 (422) T KOG3873 228 DSPLNCYTS 236 (422) T ss_pred CCCCHHHHH T ss_conf 675103667 No 9 >KOG2756 consensus Probab=99.78 E-value=2e-17 Score=111.37 Aligned_cols=190 Identities=15% Similarity=0.223 Sum_probs=118.4 Q ss_pred CCCEEEEEEEEHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHC-CCCCEEEEEECCC Q ss_conf 665489988731007677755666766668799999999999751898999972799889998726-2322167740357 Q gi|254780797|r 3 LAQRIRIASWNINNLSEKSGVALFKNSVIREDNDYALLQKYAEQLDADIVCLQEIGSYEAIKRVFP-NDKWDILYSGSNT 81 (231) Q Consensus 3 ~~~~lrv~twNi~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~DVi~LQEv~~~~~~~~~~~-~~~~~~~~~~~~~ 81 (231) .+..++++||||+.+.. ..-..++.+++++|+...|||+|||||-... .+.+-. ...|..++ . T Consensus 96 ~g~~~S~~~WnidgLdl-----------n~l~~RMrAv~H~i~l~sPdiiflQEV~p~~-y~~~~K~~s~y~i~~----~ 159 (349) T KOG2756 96 QGSMFSLITWNIDGLDL-----------NNLSERMRAVCHYLALYSPDVIFLQEVIPPY-YSYLKKRSSNYEIIT----G 159 (349) T ss_pred CCCEEEEEEEECCCCCC-----------CHHHHHHHHHHHHHHHCCCCEEEEEECCCHH-HHHHHHHHHHEEEEE----E T ss_conf 04378889861156544-----------3189999999999985298779875337356-678987654306899----5 Q ss_pred CCCCCEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCEEEEEEEEECCEEEEEEEEEECCCCCCCCCCCHHHHHHHHHH Q ss_conf 87650356886146410100001221467565544575049999984997899999751168888754300254578875 Q gi|254780797|r 82 DKHAMHTAIVIRKGAIHLLQKSYLPMDTEGLDSKAGKRRAVEILFEVDGRKIWLLDIHLKSFCFLDSIEDSYISSCYMLN 161 (231) Q Consensus 82 ~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~g~~~~v~n~HL~s~~~~~~~~~~~~~~~~~~~ 161 (231) +...+++..........+..... .......+.|....+++..+|..+++.++||+|.++..+.... .+ . T Consensus 160 ~~~~~~~~~~l~~s~~~Vks~~~-----i~F~NS~M~R~L~I~Ev~v~G~Kl~l~tsHLEStr~h~P~r~~---qF---~ 228 (349) T KOG2756 160 HEEGYFTAIMLKKSRVKVKSQEI-----IPFPNSKMMRNLLIVEVNVSGNKLCLMTSHLESTRGHAPERMN---QF---K 228 (349) T ss_pred CCCEEEEEEEEEHHHCCCCCCCE-----EECCCCHHHHEEEEEEEEECCEEEEEEECCCCCCCCCCCHHHH---HH---H T ss_conf 14515652211024428520121-----1057521210467999851770699984133578899817889---99---9 Q ss_pred HHHHHHHHHHHHHCCCCCCEEEEEECCCCCC--CC--CCCCCCHHHHHCCCCCCCEECEEECCCC Q ss_conf 4899999999974048878999913688846--22--2222767786303468710210203586 Q gi|254780797|r 162 LQATWLKQWVDQKNNLNMPFIIAGDFNRKIN--HS--HSGIKDELWQKINQDNTLMRLPHKKNHN 222 (231) Q Consensus 162 ~q~~~l~~~i~~~~~~~~~vii~GDFN~~~~--~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~n 222 (231) ..+++.++.|++. ++..||++||+|-+-. .. ......++|+.+...+ ...||.+..-| T Consensus 229 ~~~~k~~EaIe~l--PnA~ViFGGD~NlrD~ev~r~~lPD~~vDvWE~lg~p~-~~~FTwDT~~N 290 (349) T KOG2756 229 MVLKKMQEAIESL--PNATVIFGGDTNLRDREVTRCGLPDNIVDVWEFLGKPK-HCQFTWDTQMN 290 (349) T ss_pred HHHHHHHHHHHHC--CCCEEEECCCCCCHHHHCCCCCCCCHHHHHHHHHCCCC-CCCEEEECCCC T ss_conf 9999999999859--99769965866401333136789913789999818987-67414443448 No 10 >PRK13911 hypothetical protein; Provisional Probab=99.77 E-value=5.8e-18 Score=114.19 Aligned_cols=147 Identities=18% Similarity=0.232 Sum_probs=84.8 Q ss_pred EEEEEEEHHCCCCCCCCCCCCCCCCCHHHHH-HHHHHHHHHCCCCEEEEECCCCHHHHHHHHCCCCCEEEEEECCCCCCC Q ss_conf 8998873100767775566676666879999-999999975189899997279988999872623221677403578765 Q gi|254780797|r 7 IRIASWNINNLSEKSGVALFKNSVIREDNDY-ALLQKYAEQLDADIVCLQEIGSYEAIKRVFPNDKWDILYSGSNTDKHA 85 (231) Q Consensus 7 lrv~twNi~~~~~~~~~~~~~~~~~~~~~~~-~~i~~~i~~~~~DVi~LQEv~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 85 (231) |||+||||+... +++ +.+.+++...+|||+||||+..... ...+...+|..++.... .. T Consensus 1 MkI~swNVNgiR----------------a~~~~~~~~~l~~~~pDIlcLQEtK~~~~-~~~~~~~gy~~~~~~~~---~k 60 (250) T PRK13911 1 MKLISWNVNGLR----------------ACMTKGFMDFFNSVDADVFCIQESKMQQE-QNTFEFKGYFDFWNCAI---KK 60 (250) T ss_pred CEEEEEEEHHHH----------------HHHHHHHHHHHHHCCCCEEEEEEEECCCC-CCCCCCCCCCEEEEEEE---EC T ss_conf 989999737489----------------87770399999667998999973445876-57723058500562115---54 Q ss_pred CEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCEEEEEEEEECCEEEEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 03568861464101000012214675655445750499999849978999997511688887543002545788754899 Q gi|254780797|r 86 MHTAIVIRKGAIHLLQKSYLPMDTEGLDSKAGKRRAVEILFEVDGRKIWLLDIHLKSFCFLDSIEDSYISSCYMLNLQAT 165 (231) Q Consensus 86 ~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~g~~~~v~n~HL~s~~~~~~~~~~~~~~~~~~~~q~~ 165 (231) .+.|+++..+..+....... .........| .+.+++ + .+.|+|++.+++..... ....+..... T Consensus 61 gysGVai~sk~~p~~v~~g~----~~~~~d~egR-~i~~~~--~--~~~l~n~Y~Png~~~~~-------~~~~K~~~~~ 124 (250) T PRK13911 61 GYSGVVTFTKKEPLSVSYGI----NIEEHDKEGR-VVTCEF--E--SFYLVNVYTPNSQQALS-------RLSYRMSWEV 124 (250) T ss_pred CCCCEEEEECCCCCEEEECC----CCCCCCCCCC-EEEEEE--C--CEEEEEEECCCCCCCCC-------HHHHHHHHHH T ss_conf 41634888648872067168----8643220263-799998--9--99998520489866550-------1668999999 Q ss_pred HHHHHHHHHCCCCCCEEEEEECCCC Q ss_conf 9999999740488789999136888 Q gi|254780797|r 166 WLKQWVDQKNNLNMPFIIAGDFNRK 190 (231) Q Consensus 166 ~l~~~i~~~~~~~~~vii~GDFN~~ 190 (231) .+.++++.. +.+.|+|+|||||.- T Consensus 125 ~~~~~~~~l-~~~k~vI~~GD~NIa 148 (250) T PRK13911 125 EFKKFLKAL-ELKKPVIVCGDLNVA 148 (250) T ss_pred HHHHHHHHH-CCCCCEEEECCCCCC T ss_conf 999999985-148988996333546 No 11 >KOG2338 consensus Probab=99.76 E-value=3.7e-17 Score=109.91 Aligned_cols=175 Identities=16% Similarity=0.197 Sum_probs=99.1 Q ss_pred EEEEEEEEHHC--CCCCCCCCCCC--CCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCHH---HHHHHHCCCCCEEEEEE Q ss_conf 48998873100--76777556667--6666879999999999975189899997279988---99987262322167740 Q gi|254780797|r 6 RIRIASWNINN--LSEKSGVALFK--NSVIREDNDYALLQKYAEQLDADIVCLQEIGSYE---AIKRVFPNDKWDILYSG 78 (231) Q Consensus 6 ~lrv~twNi~~--~~~~~~~~~~~--~~~~~~~~~~~~i~~~i~~~~~DVi~LQEv~~~~---~~~~~~~~~~~~~~~~~ 78 (231) .++||||||.- +..+.....++ ...-.+..+-+.|...+...+|||+|||||+-.. .....+...+|..++.. T Consensus 116 ~ftvmSYNILAQ~l~~~~~r~~~~~s~~~L~W~~R~~~Ll~EL~~~dpDIlCLQEVq~d~~~~~~~~~~~~lGy~~~~~r 195 (495) T KOG2338 116 DFTVMSYNILAQDLLRDIRRLYFPESGPALKWLRRSQNLLNELKHYDPDVLCLQEVQEDHYPEFWQPLLGKLGYTGFFKR 195 (495) T ss_pred CEEEEEEHHHHHHHHHHHHHHHCCCCHHHCCHHHHHHHHHHHHHHCCCCEEEEHHHHHHHHHHHHHHHHHHCCCEEEEEE T ss_conf 44898715747888887677615698032355677689999875139874330023134548999998862683278984 Q ss_pred CCCCCCCCEEEEEEECCCCEEEEEECCCCCCCCCCCCCCC--CEEEEEEEEE-----CCEEEEEEEEEECCCCCCCCCCC Q ss_conf 3578765035688614641010000122146756554457--5049999984-----99789999975116888875430 Q gi|254780797|r 79 SNTDKHAMHTAIVIRKGAIHLLQKSYLPMDTEGLDSKAGK--RRAVEILFEV-----DGRKIWLLDIHLKSFCFLDSIED 151 (231) Q Consensus 79 ~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--r~~~~~~~~~-----~g~~~~v~n~HL~s~~~~~~~~~ 151 (231) . ........++++.......... .++........... ..++.+.+.. .++.+.|.||||-.. +... T Consensus 196 ~-t~~KthG~ai~w~~~~F~lv~~--~~l~y~~~~~~l~n~~NV~lvv~l~f~~~~~~sq~ilVanTHLl~n---p~~~- 268 (495) T KOG2338 196 R-TGTKTHGVAILWHSAKFKLVNH--SELNYFDSGSALANRDNVGLVVSLEFRLVDESSQGILVANTHLLFN---PSRS- 268 (495) T ss_pred C-CCCCCCEEEEEEECCCCEECCC--CHHHCCCCCCHHHCCCCEEEEEEEEECCCCCCCCCEEEEEEEEEEC---CCCC- T ss_conf 1-5777765899970330203254--0012146420231325522799877524674457337875205655---7666- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCC---CCCEEEEEECCCCCCC Q ss_conf 02545788754899999999974048---8789999136888462 Q gi|254780797|r 152 SYISSCYMLNLQATWLKQWVDQKNNL---NMPFIIAGDFNRKINH 193 (231) Q Consensus 152 ~~~~~~~~~~~q~~~l~~~i~~~~~~---~~~vii~GDFN~~~~~ 193 (231) ..|.+|...+.+-+++..+. .-|+++|||||.-+.+ T Consensus 269 ------~vrL~Q~~iiL~~~~~~~~~~~~~~pi~l~GDfNt~p~~ 307 (495) T KOG2338 269 ------DVRLAQVYIILAELEKMSKSSKSHWPIFLCGDFNTEPDS 307 (495) T ss_pred ------CHHHHHHHHHHHHHHHHHHHCCCCCCEEEECCCCCCCCC T ss_conf ------111578999999999997534357980770577789988 No 12 >smart00476 DNaseIc deoxyribonuclease I. Deoxyribonuclease I catalyzes the endonucleolytic cleavage of double-stranded DNA. The enzyme is secreted outside the cell and also involved in apoptosis in the nucleus. Probab=99.76 E-value=1.5e-16 Score=106.69 Aligned_cols=193 Identities=16% Similarity=0.207 Sum_probs=112.3 Q ss_pred CCCCCEEEEEEEEHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCH--HHHH----HHHC--CCCC Q ss_conf 986654899887310076777556667666687999999999997518989999727998--8999----8726--2322 Q gi|254780797|r 1 MILAQRIRIASWNINNLSEKSGVALFKNSVIREDNDYALLQKYAEQLDADIVCLQEIGSY--EAIK----RVFP--NDKW 72 (231) Q Consensus 1 ~~~~~~lrv~twNi~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~DVi~LQEv~~~--~~~~----~~~~--~~~~ 72 (231) +..+++|||+||||+.|+.. +.....-++.|+++++.. ||+++|||.+. .++. .+.. ...| T Consensus 12 ~~~~~~lrIasFNIq~FG~s---------K~~~~~v~~~l~kI~~ry--Div~IQEV~d~~~~av~~L~~~LN~~~~~~y 80 (276) T smart00476 12 LHGAASLRICAFNIQSFGDS---------KMSNATLMSIIVKILSRY--DIALVQEVRDSDLSAVPKLMDQLNSDSPNTY 80 (276) T ss_pred HCCCCCEEEEEEECHHHCCC---------CCCCHHHHHHHHHHHHHC--CEEEEEEEECCCCCHHHHHHHHHHCCCCCCC T ss_conf 46666259999860323734---------013758999999999739--8899999984887789999999963688886 Q ss_pred EEEEEECCC-CCCCCEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCEEEEEEEEECCE---EEEEEEEEECCCCCCCC Q ss_conf 167740357-87650356886146410100001221467565544575049999984997---89999975116888875 Q gi|254780797|r 73 DILYSGSNT-DKHAMHTAIVIRKGAIHLLQKSYLPMDTEGLDSKAGKRRAVEILFEVDGR---KIWLLDIHLKSFCFLDS 148 (231) Q Consensus 73 ~~~~~~~~~-~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~g~---~~~v~n~HL~s~~~~~~ 148 (231) .+..+.+.+ ..-.+..+++++...........++-... .....+.|.+..+.|..++. .+.++++|-... +. T Consensus 81 ~~~iS~rlGr~sykEqYafiYR~d~v~v~~~y~y~D~~~-~~~D~F~REPfv~~F~~~~t~~~~F~lv~~Ht~P~---~a 156 (276) T smart00476 81 SYVSSEPLGRNSYKEQYLFLYRSDLVSVLDSYLYDDGCE-CGNDVFSREPFVVKFSSPSTAVKEFVIVPLHTTPE---AA 156 (276) T ss_pred CEEECCCCCCCCCCEEEEEEEECCEEEECCCEEECCCCC-CCCCCEECCCEEEEEECCCCCCCCEEEEEEECCCC---CC T ss_conf 078567788876426799998255688726358247778-98872713997999975896666269999744823---02 Q ss_pred CCCHHHHHHHHHHHHHHHHHH---HHHHHCCCCCCEEEEEECCCCCCCCCCCCCCHHHHHC--CCCCCCEECEEECCCCE Q ss_conf 430025457887548999999---9997404887899991368884622222276778630--34687102102035863 Q gi|254780797|r 149 IEDSYISSCYMLNLQATWLKQ---WVDQKNNLNMPFIIAGDFNRKINHSHSGIKDELWQKI--NQDNTLMRLPHKKNHNA 223 (231) Q Consensus 149 ~~~~~~~~~~~~~~q~~~l~~---~i~~~~~~~~~vii~GDFN~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~n~ 223 (231) ..++++|.+ .+.++. ...++|++||||..-++.. ..-|..+ ..+....|+..+..++. T Consensus 157 ------------~~Eid~L~dV~~dv~~~~-~~~diiilGDFNAdcsyv~----~~~~~~irl~~~~~f~WlI~d~~dTT 219 (276) T smart00476 157 ------------VAEIDALYDVYLDVRQKW-GTEDVIFMGDFNAGCSYVT----KKQWSSIRLRTSPTFHWLIPDSADTT 219 (276) T ss_pred ------------HHHHHHHHHHHHHHHHHC-CCCCEEEECCCCCCCCCCC----HHHHHHCCCCCCCCCEEECCCCCCCC T ss_conf ------------789999999999998744-7666799624568876557----55653220102686137447887764 Q ss_pred EE Q ss_conf 52 Q gi|254780797|r 224 IR 225 (231) Q Consensus 224 ~~ 225 (231) ++ T Consensus 220 v~ 221 (276) T smart00476 220 VT 221 (276) T ss_pred CC T ss_conf 36 No 13 >COG2374 Predicted extracellular nuclease [General function prediction only] Probab=99.64 E-value=1.1e-14 Score=96.66 Aligned_cols=187 Identities=20% Similarity=0.199 Sum_probs=110.4 Q ss_pred CCCEEEEEEEEHHCCCC--CCCCCCCCC----CCCCHH----HHHHHHHHHHHHCCCCEEEEECCCCH-----H--HHHH Q ss_conf 66548998873100767--775566676----666879----99999999997518989999727998-----8--9998 Q gi|254780797|r 3 LAQRIRIASWNINNLSE--KSGVALFKN----SVIRED----NDYALLQKYAEQLDADIVCLQEIGSY-----E--AIKR 65 (231) Q Consensus 3 ~~~~lrv~twNi~~~~~--~~~~~~~~~----~~~~~~----~~~~~i~~~i~~~~~DVi~LQEv~~~-----~--~~~~ 65 (231) +.-.|||++|||.+++. +........ ....+. .....+...|..++|||++|.|+++. . +++. T Consensus 460 v~G~LkiasfNVlNyf~~~~~~g~~~~~~~d~rga~~~~e~~~Q~~k~v~ai~~ldaDV~GL~Eien~~~~~~s~~ai~~ 539 (798) T COG2374 460 VGGSLKIASFNVLNYFNKLSGDGGGASTFSDCRGANTTEEFARQRAKIVTAILTLDADVLGLMEIENNGYGTDSGDAIAQ 539 (798) T ss_pred CCCEEEEEEEEHHHHHCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCHHHHEEEEEECCCCCCCCHHHHHH T ss_conf 57634431332122420377777775553102476635677787788888886124012102355206777774778999 Q ss_pred H-------HCCC-CCEEEEEECC--CCCCCCEEEEEEECCCCEEEEEECCCC-CCCC---CCCCCCCCEEEEEEEE--EC Q ss_conf 7-------2623-2216774035--787650356886146410100001221-4675---6554457504999998--49 Q gi|254780797|r 66 V-------FPND-KWDILYSGSN--TDKHAMHTAIVIRKGAIHLLQKSYLPM-DTEG---LDSKAGKRRAVEILFE--VD 129 (231) Q Consensus 66 ~-------~~~~-~~~~~~~~~~--~~~~~~~~g~~~~~~~~~~~~~~~~~~-~~~~---~~~~~~~r~~~~~~~~--~~ 129 (231) + .+.. .|......+. .......+|++|+.............- +... .....-.|..+..+|. .. T Consensus 540 lV~~lna~~G~~~~y~~v~p~~~~~lgtD~I~vg~iY~~~~V~~~g~~~~~~~~a~~~v~~~~~~n~R~~laqtF~~~~~ 619 (798) T COG2374 540 LVNALNAKAGAGTRYAFVAPGRNGGLGTDAIRVGFIYKPSAVSLVGKAAIVDNDAFLGVGEIDDNNARPPLAQTFQDLSG 619 (798) T ss_pred HHHHHHHHCCCCCEEEEEECCCCCCCCCCCEEEEEEECCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHCCC T ss_conf 99999863279952899846755776777256788961536776365321155333554556654457405666532057 Q ss_pred CEEEEEEEEEECCCCCCCC---CC-CHHHHHHHHHHHHHHHHHHHHHH--HCCCCCCEEEEEECCC Q ss_conf 9789999975116888875---43-00254578875489999999997--4048878999913688 Q gi|254780797|r 130 GRKIWLLDIHLKSFCFLDS---IE-DSYISSCYMLNLQATWLKQWVDQ--KNNLNMPFIIAGDFNR 189 (231) Q Consensus 130 g~~~~v~n~HL~s~~~~~~---~~-~~~~~~~~~~~~q~~~l~~~i~~--~~~~~~~vii~GDFN~ 189 (231) +..|.++..||+|+.+... .. +....+...|.+|+++|..|++. ......+++|+||||+ T Consensus 620 ~ekfvvVvNHfkSKgs~~p~~gd~~dgQg~~~~~R~~~AqaL~~~la~~~~~~~d~~~viLGD~N~ 685 (798) T COG2374 620 GEKFVVVVNHFKSKGSDCPVDGDTQDGQGNSNQTRVRAAQALAAFLATNPTGKADADIVILGDFND 685 (798) T ss_pred CCEEEEEEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEECCCH T ss_conf 717999983012668888775656664441046889999999999862856556788799724321 No 14 >PRK05421 hypothetical protein; Provisional Probab=99.59 E-value=2.1e-14 Score=95.25 Aligned_cols=157 Identities=18% Similarity=0.170 Sum_probs=92.8 Q ss_pred CCCEEEEEEEEHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHCCCCCEEEEEECCCC Q ss_conf 66548998873100767775566676666879999999999975189899997279988999872623221677403578 Q gi|254780797|r 3 LAQRIRIASWNINNLSEKSGVALFKNSVIREDNDYALLQKYAEQLDADIVCLQEIGSYEAIKRVFPNDKWDILYSGSNTD 82 (231) Q Consensus 3 ~~~~lrv~twNi~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~DVi~LQEv~~~~~~~~~~~~~~~~~~~~~~~~~ 82 (231) .+++++|++|||. ......+.+.+..+.+ ++|+++|||+.....+........+...+ .+.-. T Consensus 30 ~~~~~~llvWNI~-----------Kq~~~~W~~~L~~l~~-----~~dLvlLQEA~~~~~l~~~~~~~~~~~~~-a~af~ 92 (254) T PRK05421 30 GSERFRLLVWNIY-----------KQQRAGWLSVLKNLGK-----DADLVLLQEAQTTPELVQFATANYLAADQ-APAFV 92 (254) T ss_pred CCCCEEEEEEEEE-----------CCCCHHHHHHHHHHCC-----CCCEEEEEECCCCHHHHHHHHCCCCHHHH-HHHHH T ss_conf 8764069999942-----------1665307999998606-----68789996136776579988525517886-12220 Q ss_pred CCCCEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCEEEEEEEEE-CCEEEEEEEEEECCCCCCCCCCCHHHHHHHHHH Q ss_conf 7650356886146410100001221467565544575049999984-997899999751168888754300254578875 Q gi|254780797|r 83 KHAMHTAIVIRKGAIHLLQKSYLPMDTEGLDSKAGKRRAVEILFEV-DGRKIWLLDIHLKSFCFLDSIEDSYISSCYMLN 161 (231) Q Consensus 83 ~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~-~g~~~~v~n~HL~s~~~~~~~~~~~~~~~~~~~ 161 (231) ..+...|++..+.............+.+ ...++..+...... +|..+.|+|+|.=.+..+ ...-. T Consensus 93 ~~~~~~GVmtlS~~~p~~~c~~~~~EP~----lr~pKs~Lit~YpL~~~q~LlVVNiHaINF~~g----------~~~y~ 158 (254) T PRK05421 93 LPQHPSGVMTLSKAHPVYCCPLREREPW----LRLPKSALITEYPLPNGQTLLVVNIHAINFSVG----------VDVYS 158 (254) T ss_pred CCCCCCEEEECCCCCHHHHCCCCCCCCC----CCCCHHHHEEEECCCCCCEEEEEEEEEEEEECC----------HHHHH T ss_conf 4798634787541112544235666722----036312304772058997799999406766515----------69999 Q ss_pred HHHHHHHHHHHHHCCCCCCEEEEEECCCCCCC Q ss_conf 48999999999740488789999136888462 Q gi|254780797|r 162 LQATWLKQWVDQKNNLNMPFIIAGDFNRKINH 193 (231) Q Consensus 162 ~q~~~l~~~i~~~~~~~~~vii~GDFN~~~~~ 193 (231) .|++.+.+.|.. ..-|+|++||||.|... T Consensus 159 ~QL~~l~~~i~~---H~GPvI~AGDFNtWs~~ 187 (254) T PRK05421 159 KQLEPIGDQIAH---HSGPIILAGDFNTWSRK 187 (254) T ss_pred HHHHHHHHHHHH---CCCCEEEECCCCCCCHH T ss_conf 999999999984---89987984453301399 No 15 >smart00128 IPPc Inositol polyphosphate phosphatase, catalytic domain homologues. Mg(2+)-dependent/Li(+)-sensitive enzymes. Probab=99.51 E-value=4.5e-13 Score=88.15 Aligned_cols=165 Identities=13% Similarity=0.171 Sum_probs=85.4 Q ss_pred EEEEEEEEHHCCCCCCCCCCCCCCCCCHHHHHH-HHH---HHHHHCCCC--EEEEECCCCHH--------------HHHH Q ss_conf 489988731007677755666766668799999-999---999751898--99997279988--------------9998 Q gi|254780797|r 6 RIRIASWNINNLSEKSGVALFKNSVIREDNDYA-LLQ---KYAEQLDAD--IVCLQEIGSYE--------------AIKR 65 (231) Q Consensus 6 ~lrv~twNi~~~~~~~~~~~~~~~~~~~~~~~~-~i~---~~i~~~~~D--Vi~LQEv~~~~--------------~~~~ 65 (231) ++-|.||||....+++ ...+. .+. +......|| ||||||+.... +... T Consensus 4 ~V~v~TWNVg~~~~~~------------~~~l~~wl~~~~~~~~~~~~Di~viglQE~~~~~~~~~~~~~~~~~~~w~~~ 71 (310) T smart00128 4 KVLVGTWNVGGLKADP------------KVDVTSWLFQKIDVKQSEKPDIYVIGLQEVVDLENGVLLETIAGKERLWSKL 71 (310) T ss_pred EEEEEEEECCCCCCCC------------HHHHHHHHCCCCCCCCCCCCCEEEEEEEECCCCHHHHHHHCCCHHHHHHHHH T ss_conf 9999997169857988------------7788876474433467899999999747677353466643252235449999 Q ss_pred HHCC--CCCEEEEEECCCCCCCCEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCEEEEEEEEECCEEEEEEEEEECCC Q ss_conf 7262--32216774035787650356886146410100001221467565544575049999984997899999751168 Q gi|254780797|r 66 VFPN--DKWDILYSGSNTDKHAMHTAIVIRKGAIHLLQKSYLPMDTEGLDSKAGKRRAVEILFEVDGRKIWLLDIHLKSF 143 (231) Q Consensus 66 ~~~~--~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~g~~~~v~n~HL~s~ 143 (231) +... ....|... ......+....++++..............-..+.....+.++++.+.+...+..++++|+||.++ T Consensus 72 i~~~l~~~~~Y~~v-~~~~l~gi~l~Vf~k~~~~~~i~~v~~~~v~~G~~g~~gnKGaV~vr~~i~~t~~~Fvn~HL~a~ 150 (310) T smart00128 72 IESSLNGDGQYNVL-AKVRLVGILVLVFVKANHLVYIKDVETFTVKTGMGGLWGNKGAVAVRFKLSDTSFCFVNSHLAAG 150 (310) T ss_pred HHHHHCCCCCEEEE-EEEECCCEEEEEEECHHHCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEECCEEEEEEEECCCCC T ss_conf 99861778877899-88861444999998012256466641011567877622557518999999786888876237777 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHHHHHH------HCCCCCCEEEEEECCCCCCC Q ss_conf 888754300254578875489999999997------40488789999136888462 Q gi|254780797|r 144 CFLDSIEDSYISSCYMLNLQATWLKQWVDQ------KNNLNMPFIIAGDFNRKINH 193 (231) Q Consensus 144 ~~~~~~~~~~~~~~~~~~~q~~~l~~~i~~------~~~~~~~vii~GDFN~~~~~ 193 (231) .. +...|..+...+.+.+.- .......++++||||-++.. T Consensus 151 ~~----------~~~~Rn~~~~~I~~~l~f~~~~~~~~~~~d~vf~~GDlNyRi~~ 196 (310) T smart00128 151 AS----------NVEQRNQDYKTILRALSFPERAELSQFDHDVVFWFGDLNFRLDS 196 (310) T ss_pred CC----------HHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEECCCCCCCCC T ss_conf 43----------07777889999998605787666666677505983476310168 No 16 >PTZ00297 pantothenate kinase; Provisional Probab=99.16 E-value=5.3e-10 Score=71.78 Aligned_cols=174 Identities=17% Similarity=0.213 Sum_probs=82.3 Q ss_pred CCEEEEEEEEHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCHHH--------------HHHHHCC Q ss_conf 65489988731007677755666766668799999999999751898999972799889--------------9987262 Q gi|254780797|r 4 AQRIRIASWNINNLSEKSGVALFKNSVIREDNDYALLQKYAEQLDADIVCLQEIGSYEA--------------IKRVFPN 69 (231) Q Consensus 4 ~~~lrv~twNi~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~DVi~LQEv~~~~~--------------~~~~~~~ 69 (231) ....||+|||-+-+. .+ . +.-+.++++....-+.+ .||+.||||-.... +-+.+.. T Consensus 8 ~aq~RvLSYNFNILP-RG-~------GGfQ~ERI~tFL~~vd~--YDVvllQEVYAaS~lPY~lQk~lCfQKmLvD~Lk~ 77 (1452) T PTZ00297 8 CAQARVLSYNFNILP-RG-C------GGFQHERIETFLASVDA--YDVVLLQEVYAASVLPYFLQKQLCFQKMLVDELKA 77 (1452) T ss_pred HHHHHHHHCCCCCCC-CC-C------CCCHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 044411120354244-77-7------75138789999964434--55457699878644079999889999998989986 Q ss_pred CCCEEEEEECCCCCC-------CCEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCEEEEEEEEEC-----CEEEEEEE Q ss_conf 322167740357876-------503568861464101000012214675655445750499999849-----97899999 Q gi|254780797|r 70 DKWDILYSGSNTDKH-------AMHTAIVIRKGAIHLLQKSYLPMDTEGLDSKAGKRRAVEILFEVD-----GRKIWLLD 137 (231) Q Consensus 70 ~~~~~~~~~~~~~~~-------~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~-----g~~~~v~n 137 (231) .++.++......... -..+|.++.++.. +-..-.+.............|++++++++.+ ..++.++| T Consensus 78 rGf~hYvISkQPSY~TmlR~NV~SDNGLIIASRFP-i~qrGSYTFr~heRg~qsVrrGCLFAEV~VPl~~GGs~~IvFFN 156 (1452) T PTZ00297 78 RGFHHYVISKQPSYLTMLRYNVCSDNGLIIASRFP-IWQRGSYTFRNHERGEQSVRRGCLFAEVEVPLAEGGSQRIVFFN 156 (1452) T ss_pred CCCEEEEEECCCCHHHHHHHCCCCCCCEEEEECCC-CCCCCCEEECCCHHHHHHHHCCCEEEEEEEECCCCCCEEEEEEE T ss_conf 67657897458852667654123678669973264-32346335115322345552361689986120248942489998 Q ss_pred EEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHC-CCCCCEEEEEECCCCC Q ss_conf 75116888875430025457887548999999999740-4887899991368884 Q gi|254780797|r 138 IHLKSFCFLDSIEDSYISSCYMLNLQATWLKQWVDQKN-NLNMPFIIAGDFNRKI 191 (231) Q Consensus 138 ~HL~s~~~~~~~~~~~~~~~~~~~~q~~~l~~~i~~~~-~~~~~vii~GDFN~~~ 191 (231) +||---...+... ..-.++++.....+.+.-.+.. ...-|+|++||||.+- T Consensus 157 VHLR~eds~~~ts---sqv~etrrFv~svi~n~~~~~~d~a~iPfviAGDFninG 208 (1452) T PTZ00297 157 VHLRQEDSLPSTS---SQVQETRRFVESVIANVYEQNNDGAEIPFVIAGDFNING 208 (1452) T ss_pred EEECCCCCCCCHH---HHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEECCCCCCC T ss_conf 7623434783258---999999999999999987377875315479963545567 No 17 >KOG0566 consensus Probab=99.14 E-value=1.3e-09 Score=69.75 Aligned_cols=160 Identities=17% Similarity=0.148 Sum_probs=87.2 Q ss_pred EEEEEEEEHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHH-------CCCC--EEEEECCCCH---------------- Q ss_conf 48998873100767775566676666879999999999975-------1898--9999727998---------------- Q gi|254780797|r 6 RIRIASWNINNLSEKSGVALFKNSVIREDNDYALLQKYAEQ-------LDAD--IVCLQEIGSY---------------- 60 (231) Q Consensus 6 ~lrv~twNi~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~-------~~~D--Vi~LQEv~~~---------------- 60 (231) +|=|.||||+....-. + +.+.+++-. ..|| ||++|||-.- T Consensus 538 ~IfvgTfNvNG~s~~~------------k---~~L~~WLfp~s~~~~~~~aDIyviG~eEvVeLnag~iv~As~tk~~~W 602 (1080) T KOG0566 538 SIFVGTFNVNGRSAAF------------K---DDLSDWLFPISRGKEFSPADIYVIGFEEVVELNAGNIVSASTTKRRFW 602 (1080) T ss_pred EEEEEEEECCCCCCCC------------H---HHHHHHCCCCCCCCCCCCCCEEEEEEHHHHHCCCCCEECCCHHHHHHH T ss_conf 8999866426855643------------5---668764364346774776767998604321127121322681788999 Q ss_pred -HHHHHHHCCCCCEEEEEECCCCCCCCEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCEEEEEEEEECCEEEEEEEEE Q ss_conf -8999872623221677403578765035688614641010000122146756554457504999998499789999975 Q gi|254780797|r 61 -EAIKRVFPNDKWDILYSGSNTDKHAMHTAIVIRKGAIHLLQKSYLPMDTEGLDSKAGKRRAVEILFEVDGRKIWLLDIH 139 (231) Q Consensus 61 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~g~~~~v~n~H 139 (231) +.+.+.+......|... ...-..+.-..++++.................+..+..+.++++.+++...+..+|+++.| T Consensus 603 ee~i~~~Ln~~~~kYvlL-~s~QlvGv~L~iF~r~~~~p~Ik~V~~~tkKTGfGG~tgNKGAVAIrf~~~~TsfCFv~SH 681 (1080) T KOG0566 603 EEKILKTLNRYKNKYVLL-RSEQLVGVCLLLFIRPDHAPYIKDVAGDTKKTGFGGATGNKGAVAIRFVYHATSFCFVCSH 681 (1080) T ss_pred HHHHHHHHCCCCCCEEEE-EHHHHHEEEEEEEECCCCCCHHHHCCCCEEECCCCCCCCCCCEEEEEEEECCCCEEEEECC T ss_conf 999999865778865998-6255001258999753336223310230354155663577751899999646527998500 Q ss_pred ECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHH----HHCCCCCCEEEEEECCCCC Q ss_conf 116888875430025457887548999999999----7404887899991368884 Q gi|254780797|r 140 LKSFCFLDSIEDSYISSCYMLNLQATWLKQWVD----QKNNLNMPFIIAGDFNRKI 191 (231) Q Consensus 140 L~s~~~~~~~~~~~~~~~~~~~~q~~~l~~~i~----~~~~~~~~vii~GDFN~~~ 191 (231) |.+|...-. .|..-...+.+-|. +.......++.|||||=++ T Consensus 682 lAAG~snv~----------ERn~DY~tI~r~l~Fp~Gr~I~~HD~ifW~GDFNYRI 727 (1080) T KOG0566 682 LAAGQSNVE----------ERNEDYKTIARKLRFPRGRMIFSHDYIFWLGDFNYRI 727 (1080) T ss_pred CCCCCCHHH----------HHHHHHHHHHHHCCCCCCCCCCCCCEEEEECCCCEEE T ss_conf 015544076----------6420499998756065775135776289833544043 No 18 >COG5239 CCR4 mRNA deadenylase, exonuclease subunit and related nucleases [RNA processing and modification] Probab=99.09 E-value=8.7e-10 Score=70.66 Aligned_cols=75 Identities=17% Similarity=0.232 Sum_probs=50.7 Q ss_pred EEEEEEEEHHCCCCCCCCCCCCC--CCCCHHHHHHHHHHHHHHCCCCEEEEECCCCH---HHHHHHHCCCCCEEEEEECC Q ss_conf 48998873100767775566676--66687999999999997518989999727998---89998726232216774035 Q gi|254780797|r 6 RIRIASWNINNLSEKSGVALFKN--SVIREDNDYALLQKYAEQLDADIVCLQEIGSY---EAIKRVFPNDKWDILYSGSN 80 (231) Q Consensus 6 ~lrv~twNi~~~~~~~~~~~~~~--~~~~~~~~~~~i~~~i~~~~~DVi~LQEv~~~---~~~~~~~~~~~~~~~~~~~~ 80 (231) +++|||||+.--.. .....++. ....+..+-..|.+.+..++|||+|||||+.. ++.+..++..+|...+.... T Consensus 30 ~ftimTYN~Laq~y-~~r~~y~~s~~aL~W~~R~~~L~~EL~~Yn~Di~CLQEvd~~~~~~fw~~~l~~~gY~~if~~k~ 108 (378) T COG5239 30 DFTIMTYNVLAQTY-ATRKMYPYSGWALKWSYRSRLLLQELLYYNADILCLQEVDAEDFEDFWKDQLGKLGYDGIFIPKE 108 (378) T ss_pred EEEEEEHHHHHHHH-CCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCEEEEEHHHHHHHHHHHHHHHCCCCCCEEEECCC T ss_conf 36998613534665-33312477555522378889999987403786566211336678888888733556432882589 Q ss_pred C Q ss_conf 7 Q gi|254780797|r 81 T 81 (231) Q Consensus 81 ~ 81 (231) + T Consensus 109 ~ 109 (378) T COG5239 109 R 109 (378) T ss_pred C T ss_conf 8 No 19 >KOG0620 consensus Probab=98.90 E-value=3.7e-09 Score=67.28 Aligned_cols=74 Identities=18% Similarity=0.228 Sum_probs=49.4 Q ss_pred EEEEEEEEHHCCCC--CCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCH-HHHHHHHCCCCCEEEEEEC Q ss_conf 48998873100767--77556667666687999999999997518989999727998-8999872623221677403 Q gi|254780797|r 6 RIRIASWNINNLSE--KSGVALFKNSVIREDNDYALLQKYAEQLDADIVCLQEIGSY-EAIKRVFPNDKWDILYSGS 79 (231) Q Consensus 6 ~lrv~twNi~~~~~--~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~DVi~LQEv~~~-~~~~~~~~~~~~~~~~~~~ 79 (231) +++|+|||+..... .......+....++..+...+.+.|...+||++|||||+.. +++...+...+|...+... T Consensus 19 ~~~v~s~n~l~~~~~~~~~~~~~~~~~~~w~~r~~~~~~ei~~~~ad~icLqev~~~~~~~~p~l~~~gY~g~~~~k 95 (361) T KOG0620 19 TFTVLSYNILSQLYATSKLYMYCPSWALSWEYRRQLLLEEILNYNADILCLQEVDRYHDFFSPELEASGYSGIFIEK 95 (361) T ss_pred EEEECCCCHHHHHHCCCCCCCCCCHHHCCHHHHHHHHHHHHHCCCCCEEECCHHHHHHHHCCCHHHHCCCCCEEECC T ss_conf 35421111101655046777535144316788889999987477855131512667987756415435874124214 No 20 >COG3021 Uncharacterized protein conserved in bacteria [Function unknown] Probab=98.83 E-value=1.2e-08 Score=64.49 Aligned_cols=133 Identities=23% Similarity=0.138 Sum_probs=73.5 Q ss_pred HHHHHHHHHCCCCEEEEECCCCHHHHHHH-HCCCCCEEEEEECCCCCCCCEEEEEEECCCCEEEEEECCCCCCCCCCCCC Q ss_conf 99999997518989999727998899987-26232216774035787650356886146410100001221467565544 Q gi|254780797|r 38 ALLQKYAEQLDADIVCLQEIGSYEAIKRV-FPNDKWDILYSGSNTDKHAMHTAIVIRKGAIHLLQKSYLPMDTEGLDSKA 116 (231) Q Consensus 38 ~~i~~~i~~~~~DVi~LQEv~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 116 (231) ..+...+...++|++.+||... ...... .....|.++..+.... .......+++..... ....-...+ T Consensus 104 ~k~Lsl~~~~~~D~v~~~E~~~-~~~~~~~~l~~~yP~~~~~~~~~-~~~~~a~~sr~~~~~---------~~~~e~~~~ 172 (309) T COG3021 104 AKLLSLIQQLDADAVTTPEGVQ-LWTAKVGALAAQYPAFILCQHPT-GVFTLAILSRRPCCP---------LTEAEPWLR 172 (309) T ss_pred HHHHHHHHHHCCCHHHHHHHHH-HHHHHHHHHHHHCCCEEECCCCC-CEEEEEECCCCCCCC---------CCCCCCCCC T ss_conf 9999887664753222188887-76767889987497424068998-722110003465555---------223576401 Q ss_pred CCCEEEEE-EEEECCEEEEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCCCCC Q ss_conf 57504999-9984997899999751168888754300254578875489999999997404887899991368884622 Q gi|254780797|r 117 GKRRAVEI-LFEVDGRKIWLLDIHLKSFCFLDSIEDSYISSCYMLNLQATWLKQWVDQKNNLNMPFIIAGDFNRKINHS 194 (231) Q Consensus 117 ~~r~~~~~-~~~~~g~~~~v~n~HL~s~~~~~~~~~~~~~~~~~~~~q~~~l~~~i~~~~~~~~~vii~GDFN~~~~~~ 194 (231) .++..+.. ....+|.++.+++.|........ + ..+ .|..++.+.|.. -..|+|++||||..+-++ T Consensus 173 ~pk~~~~t~~~~~~g~~l~v~~lh~~~~~~~~---~------~~~-~ql~~l~~~i~~---~~gpvIlaGDfNa~pWS~ 238 (309) T COG3021 173 LPKSALATAYPLPDGTELTVVALHAVNFPVGT---D------PQR-AQLLELGDQIAG---HSGPVILAGDFNAPPWSR 238 (309) T ss_pred CCCCCEEEEEECCCCCEEEEEEECCCCCCCCC---C------HHH-HHHHHHHHHHHC---CCCCEEEEECCCCCCHHH T ss_conf 77651268998689988999862254566775---1------889-999999999971---788758861378850557 No 21 >COG5411 Phosphatidylinositol 5-phosphate phosphatase [Signal transduction mechanisms] Probab=98.17 E-value=1.3e-05 Score=48.36 Aligned_cols=98 Identities=10% Similarity=0.087 Sum_probs=60.7 Q ss_pred CCCEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCEEEEEEEEECCEEEEEEEEEECCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 65035688614641010000122146756554457504999998499789999975116888875430025457887548 Q gi|254780797|r 84 HAMHTAIVIRKGAIHLLQKSYLPMDTEGLDSKAGKRRAVEILFEVDGRKIWLLDIHLKSFCFLDSIEDSYISSCYMLNLQ 163 (231) Q Consensus 84 ~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~g~~~~v~n~HL~s~~~~~~~~~~~~~~~~~~~~q 163 (231) .+....++++.................+.......++++.+++......++++|.||.++...- ..|... T Consensus 118 ~~~~~~vf~~~~~~~v~~~V~~~~~KtG~gg~s~nKGav~i~~~~~~t~~cFv~shlaag~~N~----------eeR~~D 187 (460) T COG5411 118 GGILLRVFSLATNLPVVKPVSGTVKKTGFGGSSSNKGAVAIRFNYERTSFCFVNSHLAAGVNNI----------EERIFD 187 (460) T ss_pred CCCCEEEEEECCCCCEECCCCCCCCCCCCCEECCCCCCCCEEEEEECCCCEEEECCHHCCCCCH----------HHHHHH T ss_conf 6743377650346611113335312443210015555321268752277289963010024668----------887778 Q ss_pred HHHHHHHHH-H---HCCCCCCEEEEEECCCCC Q ss_conf 999999999-7---404887899991368884 Q gi|254780797|r 164 ATWLKQWVD-Q---KNNLNMPFIIAGDFNRKI 191 (231) Q Consensus 164 ~~~l~~~i~-~---~~~~~~~vii~GDFN~~~ 191 (231) .+.+..-|. . .......++.+||||-+. T Consensus 188 y~~I~~~i~f~~g~~I~~hdti~w~GDlNyRV 219 (460) T COG5411 188 YRSIASNICFSRGLRIYDHDTIFWLGDLNYRV 219 (460) T ss_pred HHHHHHHEECCCCCEECCCCEEEEECCCCCEE T ss_conf 88788760057774523652399951567445 No 22 >KOG1976 consensus Probab=97.72 E-value=0.00042 Score=40.38 Aligned_cols=74 Identities=16% Similarity=0.239 Sum_probs=48.5 Q ss_pred CCCEEEEEEEEECCEEEEEEEEEECCCCCCCCC-----CCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCC Q ss_conf 575049999984997899999751168888754-----300254578875489999999997404887899991368884 Q gi|254780797|r 117 GKRRAVEILFEVDGRKIWLLDIHLKSFCFLDSI-----EDSYISSCYMLNLQATWLKQWVDQKNNLNMPFIIAGDFNRKI 191 (231) Q Consensus 117 ~~r~~~~~~~~~~g~~~~v~n~HL~s~~~~~~~-----~~~~~~~~~~~~~q~~~l~~~i~~~~~~~~~vii~GDFN~~~ 191 (231) ..++.+..++...|+.+.++|.||= .+.. ...-......|.+++..+.+-+....-....++|.||||.++ T Consensus 155 ~rkg~~~~r~~I~~k~fdfVN~hLF----hD~snla~~~sspt~ys~~R~~al~~vL~el~~~~~~~~~~fVfGdfNfrL 230 (391) T KOG1976 155 QRKGFLLARFRIHGKEFDFVNLHLF----HDVSNLATKNSSPTKYSSKREQALEMVLKELDEEGLRNDAIFVFGDFNFRL 230 (391) T ss_pred HHCCCCCEEEEECCCEEEEEEHHHH----CCHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCC T ss_conf 5416540048983735556530122----235666651378146555578899999999876304752899961344210 Q ss_pred CCC Q ss_conf 622 Q gi|254780797|r 192 NHS 194 (231) Q Consensus 192 ~~~ 194 (231) .+. T Consensus 231 ds~ 233 (391) T KOG1976 231 DST 233 (391) T ss_pred CHH T ss_conf 528 No 23 >PTZ00312 inositol-1,4,5-triphosphate 5-phosphatase; Provisional Probab=96.96 E-value=0.0066 Score=34.00 Aligned_cols=72 Identities=13% Similarity=0.104 Sum_probs=41.8 Q ss_pred CCCCEEEEEEEEECCEEEEEEEEEECCCCCCCCCCCHHH-----HHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCC Q ss_conf 457504999998499789999975116888875430025-----457887548999999999740488789999136888 Q gi|254780797|r 116 AGKRRAVEILFEVDGRKIWLLDIHLKSFCFLDSIEDSYI-----SSCYMLNLQATWLKQWVDQKNNLNMPFIIAGDFNRK 190 (231) Q Consensus 116 ~~~r~~~~~~~~~~g~~~~v~n~HL~s~~~~~~~~~~~~-----~~~~~~~~q~~~l~~~i~~~~~~~~~vii~GDFN~~ 190 (231) ...++.+...+..++..+.++|+||-.-. ....+.. .....+++-.+.+.+... ...+..|++|-||||.+ T Consensus 65 ~sRKGfLllslrlgtvv~n~~NVHLfhD~---~N~~A~~~sPS~y~~~R~rAl~e~laec~a-~v~p~~PlFIFGDfN~R 140 (356) T PTZ00312 65 RSRKGFLLLSLRLGTVVVNVLNVHLYNDD---DNRVAAASSPSLYTGQRQEALLEAIAECSA-FISPSDPLFIFGDFNVR 140 (356) T ss_pred CCCCCEEEEEEEECCEEEEEEEEEEECCC---CCHHHHHHCCCCCHHHHHHHHHHHHHHHHH-CCCCCCCEEEECCCCEE T ss_conf 66653489999847689988888874266---406788619972027799999999998652-04889975895244134 Q ss_pred C Q ss_conf 4 Q gi|254780797|r 191 I 191 (231) Q Consensus 191 ~ 191 (231) + T Consensus 141 l 141 (356) T PTZ00312 141 L 141 (356) T ss_pred E T ss_conf 4 No 24 >KOG0565 consensus Probab=95.27 E-value=0.046 Score=29.53 Aligned_cols=77 Identities=17% Similarity=0.152 Sum_probs=45.6 Q ss_pred CCCCCCCCCCCEEEEEEEEECCEEEEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHH---HHHHH--CCC------ Q ss_conf 6756554457504999998499789999975116888875430025457887548999999---99974--048------ Q gi|254780797|r 109 TEGLDSKAGKRRAVEILFEVDGRKIWLLDIHLKSFCFLDSIEDSYISSCYMLNLQATWLKQ---WVDQK--NNL------ 177 (231) Q Consensus 109 ~~~~~~~~~~r~~~~~~~~~~g~~~~v~n~HL~s~~~~~~~~~~~~~~~~~~~~q~~~l~~---~i~~~--~~~------ 177 (231) ..........++.+.+.+...+..+++++.|+.++....... |..-.+...+ +.... ..+ T Consensus 37 ~~g~~~~~~nkg~v~is~~~~~~~~~~v~~hl~~~~~~~~~~---------r~~d~~~i~~~~~~~~~~~~~~~~~~~~~ 107 (145) T KOG0565 37 GTGIMGYLGNKGGVAISFVLSQTSFCFVISHLTSGVHKVYER---------RNEDYQEILNGLRFPSVSPASEPVISDGE 107 (145) T ss_pred CCEEEEEEECCCCEEEEEEECCCEEEEEEECCCCCCCCCCEE---------ECCCHHHHHHCCCCCCCCCCCCCCCCCCC T ss_conf 760899972798399999964870699995436655331113---------24778876520366533566666566655 Q ss_pred CCCEEEEEECCCCCCCC Q ss_conf 87899991368884622 Q gi|254780797|r 178 NMPFIIAGDFNRKINHS 194 (231) Q Consensus 178 ~~~vii~GDFN~~~~~~ 194 (231) .+-++..||+|.++... T Consensus 108 ~D~v~w~GDlN~Rl~~~ 124 (145) T KOG0565 108 HDTVIWLGDLNYRLSGP 124 (145) T ss_pred CCEEEEECCCCCEECCC T ss_conf 66799922134201586 No 25 >KOG1294 consensus Probab=90.85 E-value=0.4 Score=24.50 Aligned_cols=56 Identities=13% Similarity=0.062 Sum_probs=30.4 Q ss_pred CCEEEEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCC---CCCEEEEEECCC Q ss_conf 9978999997511688887543002545788754899999999974048---878999913688 Q gi|254780797|r 129 DGRKIWLLDIHLKSFCFLDSIEDSYISSCYMLNLQATWLKQWVDQKNNL---NMPFIIAGDFNR 189 (231) Q Consensus 129 ~g~~~~v~n~HL~s~~~~~~~~~~~~~~~~~~~~q~~~l~~~i~~~~~~---~~~vii~GDFN~ 189 (231) +-..+.+++.|+++......... ...+..-...+...+.+.... ..+++.|||.|- T Consensus 165 e~e~~~l~~~y~p~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~k~~~~~~v~~gd~nv 223 (335) T KOG1294 165 EFEIFILINTYVPNIGGGLVNLV-----YRILDRWDKEIEEKRKKQSSSKNLKAPVVICGDLNV 223 (335) T ss_pred EECCEEECCCCCCCCCCCCCHHH-----HHHHHHHHHHHHHHHHHCCCCCCCCCCCEECCCCCC T ss_conf 63120311433753343200024-----444446677777776403644444576134155654 No 26 >KOG1857 consensus Probab=81.01 E-value=1.5 Score=21.43 Aligned_cols=59 Identities=5% Similarity=-0.202 Sum_probs=31.9 Q ss_pred CEEEEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCC Q ss_conf 97899999751168888754300254578875489999999997404887899991368884 Q gi|254780797|r 130 GRKIWLLDIHLKSFCFLDSIEDSYISSCYMLNLQATWLKQWVDQKNNLNMPFIIAGDFNRKI 191 (231) Q Consensus 130 g~~~~v~n~HL~s~~~~~~~~~~~~~~~~~~~~q~~~l~~~i~~~~~~~~~vii~GDFN~~~ 191 (231) .+..+.+|+||++..-.....+..-....+|.... ......-......++..|||-... T Consensus 434 S~~~t~vN~~lk~L~l~~~~~~~~N~~~~~RL~~~---~~~g~lT~~g~~~~~~~~~~~~~~ 492 (623) T KOG1857 434 SKSKTAVNVLLKPLPLDQTCIHPENYDIAMRLFLS---SIGGTLTEVGKPEMQQKINSFLEK 492 (623) T ss_pred CCCHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHH---HCCCEEEECCCCCEEEECCCCCCC T ss_conf 55305577653001000236885453167788887---434300222676637853777677 No 27 >cd07587 ML_beta-AS mammalian-like beta-alanine synthase (beta-AS) and similar proteins (class 5 nitrilases). This subgroup includes mammalian-like beta-AS (EC 3.5.1.6, also known as beta-ureidopropionase or N-carbamoyl-beta-alanine amidohydrolase). This enzyme catalyzes the third and final step in the catabolic pyrimidine catabolic pathway responsible for the degradation of uracil and thymine, the hydrolysis of N-carbamyl-beta-alanine and N-carbamyl-beta-aminoisobutyrate to the beta-amino acids, beta-alanine and beta-aminoisobutyrate respectively. This subgroup belongs to a larger nitrilase superfamily comprised of nitrile- or amide-hydrolyzing enzymes and amide-condensing enzymes, which depend on a Glu-Lys-Cys catalytic triad. This superfamily has been classified in the literature based on global and structure based sequence analysis into thirteen different enzyme classes (referred to as 1-13), this subgroup corresponds to class 5. Members of this superfamily generally form homomeric Probab=58.61 E-value=12 Score=16.69 Aligned_cols=23 Identities=22% Similarity=0.307 Sum_probs=15.7 Q ss_pred HHHHHHHHHHHCCCCEEEEECCC Q ss_conf 99999999975189899997279 Q gi|254780797|r 36 DYALLQKYAEQLDADIVCLQEIG 58 (231) Q Consensus 36 ~~~~i~~~i~~~~~DVi~LQEv~ 58 (231) +++.+.+.-..-+..|+||||.. T Consensus 91 ~i~~~i~~Aa~~gvni~c~qe~w 113 (363) T cd07587 91 RIKKIIEAAAMAGVNIICFQEAW 113 (363) T ss_pred HHHHHHHHHHHCCCCEEEEEHHH T ss_conf 99999999997598799840010 No 28 >pfam00952 Bunya_nucleocap Bunyavirus nucleocapsid (N) protein. The bunyaviruses are enveloped viruses with a genome consisting of 3 ssRNA segments (called L, M and S). The nucleocapsid protein is encode on the small (S) genomic RNA. The N protein is the major component of the nucleocapsids. This protein is thought to interact with the L protein, virus RNA and/or other N proteins. Probab=49.57 E-value=17 Score=15.81 Aligned_cols=28 Identities=11% Similarity=0.071 Sum_probs=21.8 Q ss_pred EEEEEECCEEEEEEEEEECCCCCCCCCC Q ss_conf 9999849978999997511688887543 Q gi|254780797|r 123 EILFEVDGRKIWLLDIHLKSFCFLDSIE 150 (231) Q Consensus 123 ~~~~~~~g~~~~v~n~HL~s~~~~~~~~ 150 (231) .++++.+|-.+.|+|+|++.....+-.. T Consensus 55 ~v~~~fG~~~~~vvNthfPgnr~npv~d 82 (228) T pfam00952 55 TVTLTFGGWKFPVVNNHFPGNRNNPVPD 82 (228) T ss_pred EEEEEECCEEEEEEECCCCCCCCCCCCC T ss_conf 4513326679998725785545798998 No 29 >COG3459 Cellobiose phosphorylase [Carbohydrate transport and metabolism] Probab=47.23 E-value=15 Score=16.12 Aligned_cols=43 Identities=26% Similarity=0.455 Sum_probs=27.3 Q ss_pred HHHHHHHHHHHHHCCCCCCEEEEEECCCCCCC-CCCCCCCHHHH Q ss_conf 48999999999740488789999136888462-22222767786 Q gi|254780797|r 162 LQATWLKQWVDQKNNLNMPFIIAGDFNRKINH-SHSGIKDELWQ 204 (231) Q Consensus 162 ~q~~~l~~~i~~~~~~~~~vii~GDFN~~~~~-~~~~~~~~~~~ 204 (231) ....+|.-.+.+.-..+.|.+++||+|+-++. ...+....+|. T Consensus 716 H~~rAld~av~r~g~~glpli~gGDWND~ln~vg~~g~geSv~l 759 (1056) T COG3459 716 HCARALDLAVARLGPHGLPLILGGDWNDGLNLVGNGGKGESVWL 759 (1056) T ss_pred HHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEH T ss_conf 99999999998608777643157761100350014775626630 No 30 >TIGR02104 pulA_typeI pullulanase, type I; InterPro: IPR011840 Pullulan is an unusual, industrially important polysaccharide in which short alpha-1,4 chains (maltotriose) are connected in alpha-1,6 linkages. Enzymes that cleave alpha-1,6 linkages in pullulan and release maltotriose are called pullulanases although pullulan itself may not be the natural substrate. This family consists of pullulanases related to the entries in IPR011838 from INTERPRO and IPR011839 from INTERPRO but having a different domain architecture with shorter sequences. Members are called type I pullulanases .. Probab=45.20 E-value=20 Score=15.43 Aligned_cols=26 Identities=19% Similarity=0.294 Sum_probs=14.1 Q ss_pred CCHHHHHHHHHHHHHHCCCCEEEEEC Q ss_conf 68799999999999751898999972 Q gi|254780797|r 31 IREDNDYALLQKYAEQLDADIVCLQE 56 (231) Q Consensus 31 ~~~~~~~~~i~~~i~~~~~DVi~LQE 56 (231) .--...+..|.+.+.+++|.|+.+=| T Consensus 344 ihD~~TMn~ir~~l~~Idp~IlLyGE 369 (655) T TIGR02104 344 IHDIETMNEIRKALNKIDPSILLYGE 369 (655) T ss_pred HHHHHHHHHHHHHHHHCCCCEEEECC T ss_conf 43699999999998533983799745 No 31 >cd02861 E_set_proteins_like E or "early" set-like proteins. These alpha amylase-like sugar utilizing enzymes which may be related to the immunoglobulin and/or fibronectin type III superfamilies are associated with different types of catalytic domains at either the N-terminal or C-terminal end. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitobiase, and chitinase. Probab=38.90 E-value=14 Score=16.22 Aligned_cols=13 Identities=31% Similarity=0.358 Sum_probs=9.0 Q ss_pred CCEEEEEECCCCC Q ss_conf 7899991368884 Q gi|254780797|r 179 MPFIIAGDFNRKI 191 (231) Q Consensus 179 ~~vii~GDFN~~~ 191 (231) ..|-|+||||.|- T Consensus 14 ~~V~v~GsFn~W~ 26 (82) T cd02861 14 DSVYLAGSFNNWN 26 (82) T ss_pred CEEEEEEECCCCC T ss_conf 9999984258606 No 32 >KOG0808 consensus Probab=34.51 E-value=31 Score=14.49 Aligned_cols=23 Identities=26% Similarity=0.352 Sum_probs=15.8 Q ss_pred HHHHHHHHHHHCCCCEEEEECCC Q ss_conf 99999999975189899997279 Q gi|254780797|r 36 DYALLQKYAEQLDADIVCLQEIG 58 (231) Q Consensus 36 ~~~~i~~~i~~~~~DVi~LQEv~ 58 (231) ++..|.+.-.--++.|+||||.. T Consensus 101 r~kaiieaaa~agvniiclqeaw 123 (387) T KOG0808 101 RLKAIIEAAAVAGVNIICLQEAW 123 (387) T ss_pred HHHHHHHHHHHCCCCEEEEEHHH T ss_conf 99999999875486579740122 No 33 >cd02854 Glycogen_branching_enzyme_like_N_term Glycogen branching enzyme-like N-terminus domain. Glycogen branching enzyme (AKA 1,4 alpha glucan branching enzyme) catalyzes the formation of alpha-1,6 branch points in either glycogen or starch by cleavage of the alpha-1,4 glucosidic linkage yielding a non-reducing end oligosaccharide chain and subsequent attachment to the alpha-1,6 position. By increasing the number of non-reducing ends glycogen is more reactive to synthesis and digestion as well as being more soluble. The N-terminus of the glycogen branching enzyme-like proteins may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitobi Probab=34.39 E-value=24 Score=15.05 Aligned_cols=13 Identities=31% Similarity=0.419 Sum_probs=9.3 Q ss_pred CEEEEEECCCCCC Q ss_conf 8999913688846 Q gi|254780797|r 180 PFIIAGDFNRKIN 192 (231) Q Consensus 180 ~vii~GDFN~~~~ 192 (231) .|-|.||||.+-. T Consensus 18 ~v~liGdFN~W~~ 30 (99) T cd02854 18 EVYLIGDFNNWDR 30 (99) T ss_pred EEEEEEECCCCCC T ss_conf 9999812778688 No 34 >pfam02562 PhoH PhoH-like protein. PhoH is a cytoplasmic protein and predicted ATPase that is induced by phosphate starvation. Probab=32.74 E-value=30 Score=14.55 Aligned_cols=62 Identities=8% Similarity=0.163 Sum_probs=31.5 Q ss_pred HHHHHHHHHCCCCCCEEEEEECCCCCC-CCCCCCCCHHHHHCCCCCCCEECEEECCCCEEEEECC Q ss_conf 999999974048878999913688846-2222227677863034687102102035863522012 Q gi|254780797|r 166 WLKQWVDQKNNLNMPFIIAGDFNRKIN-HSHSGIKDELWQKINQDNTLMRLPHKKNHNAIRTKIL 229 (231) Q Consensus 166 ~l~~~i~~~~~~~~~vii~GDFN~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~ 229 (231) ++...+.+ ...+..+|++||....-. .........+.+.+........+.- ....++++.+. T Consensus 135 ~lk~ilTR-iG~~SK~vi~GD~~Q~D~~~~~~nGl~~~~~~l~~~~~~~~i~f-~~~DIvRs~iv 197 (205) T pfam02562 135 QMKMFLTR-IGFNSKMVVTGDITQIDLPKGQKSGLADALEILKGVEGIGFIEF-TLKDVVRHPLV 197 (205) T ss_pred HHHHHHHH-CCCCCEEEEECCHHHCCCCCCCCCCHHHHHHHHCCCCCEEEEEE-CCCCEECCHHH T ss_conf 99999842-17996899947866517899998729999999669998599993-58623607799 No 35 >cd02859 AMPKbeta_GBD_like AMP-activated protein kinase (AMPK) beta subunit glycogen binding domain (GBD). AMPK is a metabolic stress sensing protein that senses AMP/ATP and has recently been found to act as a glycogen sensor as well. The protein functions as a alpha-beta-gamma heterotrimer. This domain is the glycogen binding domain of the beta subunit. Probab=32.42 E-value=24 Score=15.06 Aligned_cols=13 Identities=15% Similarity=0.378 Sum_probs=6.9 Q ss_pred CCCEEEEEECCCC Q ss_conf 8789999136888 Q gi|254780797|r 178 NMPFIIAGDFNRK 190 (231) Q Consensus 178 ~~~vii~GDFN~~ 190 (231) +..|.|+|+||.| T Consensus 12 g~~V~v~Gsf~~W 24 (79) T cd02859 12 GKEVYVTGSFDNW 24 (79) T ss_pred CEEEEEEEEECCC T ss_conf 9899999987899 No 36 >pfam01963 TraB TraB family. pAD1 is a hemolysin/bacteriocin plasmid originally identified in Enterococcus faecalis DS16. It encodes a mating response to a peptide sex pheromone, cAD1, secreted by recipient bacteria. Once the plasmid pAD1 is acquired, production of the pheromone ceases--a trait related in part to a determinant designated traB. However a related protein is found in C. elegans, suggesting that members of the TraB family have some more general function. Probab=32.09 E-value=34 Score=14.26 Aligned_cols=27 Identities=15% Similarity=0.239 Sum_probs=20.6 Q ss_pred HHHHHHHHHHHHHHCCCCEEEEECCCCH Q ss_conf 7999999999997518989999727998 Q gi|254780797|r 33 EDNDYALLQKYAEQLDADIVCLQEIGSY 60 (231) Q Consensus 33 ~~~~~~~i~~~i~~~~~DVi~LQEv~~~ 60 (231) +++..+.+.+.|...+||+||+ |.+.. T Consensus 15 S~~S~~~V~~~I~~~~PD~V~V-ELc~~ 41 (225) T pfam01963 15 SKESVEEVRRVIEEERPDAVAV-ELDES 41 (225) T ss_pred CHHHHHHHHHHHHHCCCCEEEE-EECHH T ss_conf 9999999999998619997997-56888 No 37 >TIGR00216 ispH_lytB 4-hydroxy-3-methylbut-2-enyl diphosphate reductase; InterPro: IPR003451 Terpenes are among the largest groups of natural products and include compounds such as vitamins, cholesterol and carotenoids. The biosynthesis of all terpenoids begins with one or both of the two C5 precursors of the pathway: isopentenyl diphosphate (IPP) and dimethylallyl diphosphate (DMAPP). In animals, fungi, and certain bacteria, the synthesis of IPP and DMAPP occurs via the well-known mevalonate pathway, however, a second, nonmevalonate terpenoid pathway has been identified in many eubacteria, algae and the chloroplasts of higher plants . LytB(IspH) catalyses the conversion of 1-hydroy-2-methyl-2-(E)-butenyl 4-diphosphate into IPP and DMAPP in this second pathway The enzyme appears to be responsible for a branch-step in the nonmevalonate pathway, in that IPP and DMAPP are produced in parallel from a single precursor although the exact mechanism of this is not currently fully understood . Escherichia coli LytB protein had been found to regulate the activity of RelA (guanosine 3',5'-bispyrophosphate synthetase I), which in turn controls the level of a regulatory metabolite. It is involved in penicillin tolerance and the stringent response .; GO: 0019288 isopentenyl diphosphate biosynthetic process mevalonate-independent pathway. Probab=32.06 E-value=34 Score=14.26 Aligned_cols=12 Identities=17% Similarity=0.174 Sum_probs=7.5 Q ss_pred CCCEEEEEECCC Q ss_conf 878999913688 Q gi|254780797|r 178 NMPFIIAGDFNR 189 (231) Q Consensus 178 ~~~vii~GDFN~ 189 (231) -+-+||-||+|+ T Consensus 264 ~Dl~~ViGg~nS 275 (354) T TIGR00216 264 VDLMLVIGGKNS 275 (354) T ss_pred CEEEEEECCCCC T ss_conf 148999757777 No 38 >cd02855 Glycogen_branching_enzyme_N_term Glycogen branching enzyme N-terminus domain. Glycogen branching enzyme (AKA 1,4 alpha glucan branching enzyme) catalyzes the formation of alpha-1,6 branch points in either glycogen or starch by cleavage of the alpha-1,4 glucosidic linkage yielding a non-reducing end oligosaccharide chain and subsequent attachment to the alpha-1,6 position. By increasing the number of non-reducing ends glycogen is more reactive to synthesis and digestion as well as being more soluble. The N-terminus of the 1,4 alpha glucan branching enzyme may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitobiase, and chitina Probab=27.75 E-value=40 Score=13.85 Aligned_cols=28 Identities=21% Similarity=0.338 Sum_probs=14.8 Q ss_pred EEEEEECCCCCCCCCCC---CCCHHHHHCCC Q ss_conf 99991368884622222---27677863034 Q gi|254780797|r 181 FIIAGDFNRKINHSHSG---IKDELWQKINQ 208 (231) Q Consensus 181 vii~GDFN~~~~~~~~~---~~~~~~~~~~~ 208 (231) |-|.||||.|-...+.. ...-+|+..-+ T Consensus 35 V~vvGdFN~W~~~~~~m~~~~~~GiW~~fip 65 (106) T cd02855 35 VSVVGDFNGWDGRRHPMRRRGDSGVWELFIP 65 (106) T ss_pred EEEEEECCCCCCCEEEEEECCCCCCEEEEEC T ss_conf 9999888789976143289789972899949 No 39 >TIGR01485 SPP_plant-cyano sucrose phosphatase; InterPro: IPR012847 This entry describes the sucrose phosphate phosphohydrolase from plants and cyanobacteria (SPP). However, a closely related group of sequences from bacteria and archaea may prove to catalyze the same reaction. SPP is a member of the Class IIB subfamily of the Haloacid Dehalogenase (HAD) superfamily of aspartate-nucleophile hydrolases. SPP catalyzes the final step in the biosynthesis of sucrose, a critically important molecule for plants. Sucrose phosphate synthase (SPS), the prior step in the biosynthesis of sucrose contains a domain, which exhibits considerable similarity to SPP albeit without conservation of the catalytic residues. The catalytic machinery of the synthase resides in another domain. It seems likely that the phosphatase-like domain is involved in substrate binding, possibly binding both substrates in a "product-like" orientation prior to ligation by the synthase catalytic domain.. Probab=26.61 E-value=42 Score=13.73 Aligned_cols=50 Identities=12% Similarity=0.147 Sum_probs=28.1 Q ss_pred EEEEEEE-ECCEEEEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEECC Q ss_conf 4999998-4997899999751168888754300254578875489999999997404887899991368 Q gi|254780797|r 121 AVEILFE-VDGRKIWLLDIHLKSFCFLDSIEDSYISSCYMLNLQATWLKQWVDQKNNLNMPFIIAGDFN 188 (231) Q Consensus 121 ~~~~~~~-~~g~~~~v~n~HL~s~~~~~~~~~~~~~~~~~~~~q~~~l~~~i~~~~~~~~~vii~GDFN 188 (231) .+-|.+. .+|.-+-|+- -+. .+ -..++.|.+-+.-.-++....+||||== T Consensus 149 g~~vk~iYSsG~dlDilP----~~s--------~K------G~A~~YL~~kL~~~GK~p~~TlVCGDSG 199 (257) T TIGR01485 149 GLDVKLIYSSGKDLDILP----QGS--------GK------GQALQYLLQKLKIEGKQPSQTLVCGDSG 199 (257) T ss_pred CCCEEEEEECCEEEEEEC----CCC--------CH------HHHHHHHHHHHHHCCCCCCCCEEECCCC T ss_conf 897589975882377502----778--------85------6799999999984388977717858842 No 40 >PRK10438 hypothetical protein; Provisional Probab=25.40 E-value=45 Score=13.61 Aligned_cols=28 Identities=14% Similarity=0.230 Sum_probs=21.3 Q ss_pred CCHHHHHHHHHHHHHH-CCCCEEEEECCC Q ss_conf 6879999999999975-189899997279 Q gi|254780797|r 31 IREDNDYALLQKYAEQ-LDADIVCLQEIG 58 (231) Q Consensus 31 ~~~~~~~~~i~~~i~~-~~~DVi~LQEv~ 58 (231) ...+++++.+.++|++ .++|+|+|-|.. T Consensus 16 ~D~~~Nl~~~~~~i~~a~gadlivlPElf 44 (256) T PRK10438 16 MDGPANLRHFDRQLEGITGRDVIVLPEMF 44 (256) T ss_pred CCHHHHHHHHHHHHHHCCCCCEEECCCCC T ss_conf 79999999999999745499999908755 No 41 >PRK05340 UDP-2,3-diacylglucosamine hydrolase; Provisional Probab=22.09 E-value=52 Score=13.25 Aligned_cols=16 Identities=19% Similarity=0.289 Sum_probs=10.9 Q ss_pred CCCCCEEEEEECCCCC Q ss_conf 4887899991368884 Q gi|254780797|r 176 NLNMPFIIAGDFNRKI 191 (231) Q Consensus 176 ~~~~~vii~GDFN~~~ 191 (231) +.+..++|||=.=.+. T Consensus 185 ~~~~d~vI~GH~H~P~ 200 (240) T PRK05340 185 KHGVQTLIHGHTHRPA 200 (240) T ss_pred HCCCCEEEECCCCCCC T ss_conf 8399979978867733 No 42 >PRK11340 phosphodiesterase YaeI; Provisional Probab=21.40 E-value=54 Score=13.17 Aligned_cols=24 Identities=8% Similarity=-0.063 Sum_probs=19.6 Q ss_pred HHHHHHHHHHHHHHCCCCEEEEEC Q ss_conf 799999999999751898999972 Q gi|254780797|r 33 EDNDYALLQKYAEQLDADIVCLQE 56 (231) Q Consensus 33 ~~~~~~~i~~~i~~~~~DVi~LQE 56 (231) ..+.++.+.+.+.+.+||+|++-= T Consensus 64 ~~~~l~~~v~~~n~~~PDlVv~tG 87 (270) T PRK11340 64 PLSLISDAIALGIEQKPDLILLGG 87 (270) T ss_pred CHHHHHHHHHHHHHCCCCEEEECC T ss_conf 989999999998712999999867 No 43 >TIGR01420 pilT_fam twitching motility protein; InterPro: IPR006321 These represent the PilT subfamily of proteins related to GspE, a protein involved in type II secretion (also called the General Secretion Pathway). PilT is an apparent cytosolic ATPase associated with type IV pilus systems. It is not required for pilin biogenesis, but is required for twitching motility and social gliding behaviors, shown in some species, powered by pilus retraction . Members of this family may be found in some species that do not have type IV pili but have related structures for DNA uptake and natural transformation. ; GO: 0005524 ATP binding, 0006810 transport. Probab=20.59 E-value=56 Score=13.07 Aligned_cols=32 Identities=9% Similarity=0.133 Sum_probs=17.9 Q ss_pred HHHHHHHHCCCCEEEEECCCCH-HHHHHHHCCC Q ss_conf 9999997518989999727998-8999872623 Q gi|254780797|r 39 LLQKYAEQLDADIVCLQEIGSY-EAIKRVFPND 70 (231) Q Consensus 39 ~i~~~i~~~~~DVi~LQEv~~~-~~~~~~~~~~ 70 (231) .++-+++.+...|--|+|..-+ ..+..+.... T Consensus 94 ~~a~vlR~ip~~Ip~fe~LGLP~~v~~~~a~~~ 126 (350) T TIGR01420 94 GVALVLRLIPSKIPTFEELGLPRPVLRELAERP 126 (350) T ss_pred HHHHHHHHCCCCCCCHHHCCCCHHHHHHHHHCC T ss_conf 064232311534621666379878999998366 Done!