Query gi|254781003|ref|YP_003065416.1| hypothetical protein CLIBASIA_04520 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 304 No_of_seqs 132 out of 2576 Neff 11.2 Searched_HMMs 39220 Date Mon May 30 05:23:37 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254781003.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 TIGR00633 xth exodeoxyribonucl 100.0 2.2E-33 5.6E-38 200.5 17.3 238 24-303 1-279 (279) 2 TIGR00195 exoDNase_III exodeox 100.0 5.3E-32 1.3E-36 192.9 11.9 238 24-302 1-281 (281) 3 TIGR03395 sphingomy sphingomye 100.0 4.5E-29 1.1E-33 176.9 22.6 265 24-302 1-283 (283) 4 PRK11756 exonuclease III; Prov 100.0 3.2E-29 8.2E-34 177.7 20.4 232 24-303 1-267 (268) 5 COG0708 XthA Exonuclease III [ 100.0 5.9E-28 1.5E-32 170.8 18.0 230 24-304 1-261 (261) 6 COG3568 ElsH Metal-dependent h 100.0 4.5E-28 1.1E-32 171.4 16.7 238 23-304 9-257 (259) 7 pfam03372 Exo_endo_phos Endonu 100.0 4.8E-27 1.2E-31 165.8 18.9 238 24-302 1-255 (255) 8 PRK13911 hypothetical protein; 100.0 4.2E-26 1.1E-30 160.7 19.0 221 24-302 1-249 (250) 9 smart00476 DNaseIc deoxyribonu 99.9 1.1E-24 2.8E-29 152.9 20.7 249 17-303 11-276 (276) 10 KOG2338 consensus 99.9 5.3E-22 1.3E-26 138.3 17.7 180 22-220 115-310 (495) 11 COG5239 CCR4 mRNA deadenylase, 99.9 9.2E-21 2.3E-25 131.5 14.0 275 23-304 30-365 (378) 12 KOG3873 consensus 99.9 1.4E-20 3.5E-25 130.6 13.6 260 20-303 5-291 (422) 13 COG2374 Predicted extracellula 99.9 5.8E-20 1.5E-24 127.1 16.4 275 20-304 460-788 (798) 14 PRK05421 hypothetical protein; 99.9 5.6E-20 1.4E-24 127.2 15.7 220 20-303 30-250 (254) 15 KOG0620 consensus 99.8 2E-18 5.1E-23 118.7 9.5 282 18-304 14-351 (361) 16 KOG2756 consensus 99.7 1.1E-16 2.8E-21 109.2 14.5 242 22-303 98-348 (349) 17 smart00128 IPPc Inositol polyp 99.7 2.9E-15 7.5E-20 101.4 18.6 146 139-303 122-303 (310) 18 COG3021 Uncharacterized protei 99.6 1E-14 2.6E-19 98.5 12.6 201 56-304 105-307 (309) 19 KOG0566 consensus 99.6 1.2E-14 3E-19 98.1 10.9 146 135-303 653-830 (1080) 20 PTZ00297 pantothenate kinase; 99.6 7.5E-14 1.9E-18 93.7 13.7 265 16-303 3-313 (1452) 21 COG5411 Phosphatidylinositol 5 98.6 2.2E-07 5.6E-12 58.4 8.2 65 143-217 153-221 (460) 22 KOG1294 consensus 98.2 0.00017 4.3E-09 42.6 13.8 138 152-304 164-335 (335) 23 KOG1976 consensus 97.4 0.0012 3.2E-08 37.9 8.6 72 147-218 161-233 (391) 24 PTZ00312 inositol-1,4,5-tripho 96.7 0.015 3.9E-07 31.9 8.4 72 144-216 69-142 (356) 25 KOG1857 consensus 96.1 0.011 2.8E-07 32.7 5.0 106 154-268 434-539 (623) 26 PRK13861 type IV secretion sys 86.7 0.57 1.4E-05 23.4 2.6 32 1-32 1-42 (293) 27 KOG0565 consensus 85.0 2.2 5.7E-05 20.1 5.4 71 142-218 46-124 (145) 28 PRK11627 hypothetical protein; 84.1 1.1 2.8E-05 21.8 3.1 26 1-26 1-27 (192) 29 PRK12789 flgI flagellar basal 74.7 3.3 8.3E-05 19.2 3.1 61 1-68 1-76 (367) 30 PRK04792 tolB translocation pr 72.6 2.9 7.3E-05 19.5 2.4 20 1-20 1-20 (450) 31 cd07587 ML_beta-AS mammalian-l 66.5 7.5 0.00019 17.2 3.4 24 53-76 91-114 (363) 32 PRK10002 outer membrane protei 63.1 5.2 0.00013 18.1 2.1 20 1-20 1-20 (362) 33 cd05785 DNA_polB_like2_exo A s 54.2 13 0.00034 15.9 5.9 20 53-72 61-80 (207) 34 PRK10780 periplasmic chaperone 53.2 6.9 0.00018 17.4 1.3 32 2-34 1-32 (161) 35 cd07577 Ph0642_like Pyrococcus 49.9 16 0.0004 15.5 3.9 27 49-75 13-39 (259) 36 TIGR03431 PhnD phosphonate ABC 48.5 17 0.00042 15.3 8.4 80 1-94 1-86 (288) 37 PRK10752 sulfate transporter s 46.4 6.3 0.00016 17.6 0.3 81 2-82 1-93 (329) 38 KOG0808 consensus 45.5 19 0.00047 15.1 3.2 25 52-76 100-124 (387) 39 cd05780 DNA_polB_Kod1_like_exo 40.0 23 0.00058 14.6 5.7 23 53-75 59-81 (195) 40 PRK11067 outer membrane protei 37.7 25 0.00063 14.4 3.5 33 1-34 2-34 (801) 41 PRK09501 potD spermidine/putre 36.6 25 0.00065 14.3 2.1 29 2-30 1-35 (348) 42 pfam07424 TrbM TrbM. This fami 33.3 20 0.00051 14.9 1.1 27 2-28 1-27 (189) 43 pfam11153 DUF2931 Protein of u 33.0 30 0.00076 14.0 2.4 33 1-33 1-33 (202) 44 cd01129 PulE-GspE PulE/GspE Th 32.3 31 0.00078 13.9 3.0 14 200-213 147-160 (264) 45 PRK09495 glnH glutamine ABC tr 31.5 32 0.00081 13.8 3.2 85 1-93 1-88 (247) 46 pfam06838 Alum_res Aluminium r 30.8 33 0.00083 13.7 3.9 20 52-71 176-195 (405) 47 cd05781 DNA_polB_B3_exo The 3' 30.4 33 0.00084 13.7 5.6 25 52-76 50-74 (188) 48 CHL00091 apcE phycobillisome l 29.3 11 0.00027 16.4 -0.8 33 2-34 146-179 (886) 49 TIGR02622 CDP_4_6_dhtase CDP-g 29.1 35 0.00089 13.6 2.4 20 53-72 69-88 (361) 50 pfam01963 TraB TraB family. pA 28.6 36 0.00091 13.5 3.3 26 50-76 15-40 (225) 51 PRK10954 periplasmic protein d 28.6 29 0.00074 14.0 1.3 20 2-21 1-20 (207) 52 PRK12698 flgH flagellar basal 26.8 38 0.00098 13.4 2.6 18 2-19 1-18 (224) 53 pfam10913 DUF2706 Protein of u 26.5 39 0.00099 13.3 2.6 25 1-25 1-29 (60) 54 PRK12696 flgH flagellar basal 25.6 41 0.001 13.2 2.8 19 1-19 1-21 (238) 55 COG3459 Cellobiose phosphoryla 25.1 42 0.0011 13.2 2.9 33 185-217 715-747 (1056) 56 COG5510 Predicted small secret 24.7 40 0.001 13.2 1.5 17 1-17 1-17 (44) 57 PRK00178 tolB translocation pr 24.0 44 0.0011 13.1 2.1 23 1-23 1-23 (433) 58 pfam06649 DUF1161 Protein of u 23.6 44 0.0011 13.0 1.6 18 2-19 1-18 (75) 59 PRK13883 conjugal transfer pro 23.6 45 0.0011 13.0 2.5 78 1-94 1-84 (156) 60 PRK11443 hypothetical protein; 22.1 48 0.0012 12.8 2.4 20 2-21 1-20 (120) 61 TIGR02855 spore_yabG sporulati 21.7 49 0.0012 12.8 2.6 12 255-266 241-252 (292) 62 PRK01742 tolB translocation pr 21.2 50 0.0013 12.7 2.2 18 1-18 3-20 (430) 63 TIGR01420 pilT_fam twitching m 20.9 51 0.0013 12.7 3.7 32 56-87 94-126 (350) 64 pfam02446 Glyco_hydro_77 4-alp 20.5 52 0.0013 12.7 3.4 17 202-218 204-220 (494) 65 pfam03104 DNA_pol_B_exo DNA po 20.2 52 0.0013 12.6 5.5 23 189-213 153-175 (254) 66 cd02854 Glycogen_branching_enz 20.1 53 0.0013 12.6 1.8 14 200-213 14-27 (99) No 1 >TIGR00633 xth exodeoxyribonuclease III (xth); InterPro: IPR004808 All proteins in this family for which functions are known are 5' AP endonucleases that function in base excision repair and the repair of abasic sites in DNA.; GO: 0004518 nuclease activity, 0006281 DNA repair. Probab=100.00 E-value=2.2e-33 Score=200.45 Aligned_cols=238 Identities=16% Similarity=0.245 Sum_probs=138.6 Q ss_pred EEEEEEECCCCCCCCCCCCCCCCCCCHHHHHHHH--HHHHHHHCCCEEEEEEC--CCHHHHHHHHCCCCCEEEEECCCCC Q ss_conf 6999983014788667666776531018899999--99998608988999950--7857999860333430786214556 Q gi|254781003|r 24 VRLVSWNINTLSEQEGVSLWKNSVKRTTSDYTLL--RQYAKNLDADIVFLQEM--GSYNAVAKVFPKNTWCIFYSTERLI 99 (304) Q Consensus 24 lrV~twNi~~~~~~~~~~~~~~~~~~~~~~~~~l--~~~i~~~~~DIi~lQEv--~~~~~~~~~~~~~~~~~~~~~~~~~ 99 (304) +||+||||+++.. .+..+ .++|.+.+||||||||. .+...-.+.....+|+.++...... T Consensus 1 mKiiSWNVNGlRA----------------~~~~~sfL~~l~~~~pDvLCLQETK~~~~~~p~~~~~~~GY~~~~~~~~~k 64 (279) T TIGR00633 1 MKIISWNVNGLRA----------------RLKKLSFLDWLKEENPDVLCLQETKVADEQFPKELLEELGYHVFFHGAKKK 64 (279) T ss_pred CCEEEEECCHHHH----------------HCCHHHHHHHHHHCCCCEEEEEEECCCCCCCCHHHHHCCCCCEEEEECCCC T ss_conf 9168871123442----------------135112888986538978762200278776634666407981588414665 Q ss_pred CCCCCCCCCCCEEEEEEECCCE-EEEEEECCCCCCCCCCCCCCCCEEEEEEEEECCEEEEEEEECCCCCCCCCCC-HHHH Q ss_conf 7543223446315899825852-6765301224676555555653369999982994999999406788877520-0002 Q gi|254781003|r 100 NHSKRDSNNDIHTAIAVRKKNV-RVLQQSYPLLGAKDSFSRAGNRRAVELLVEINGKKIWVLDIHLKSFCFLDSL-ENTY 177 (304) Q Consensus 100 ~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~v~n~Hl~~~~~~~~~-~~~~ 177 (304) ...++|+++.++.. ......++.....+... |......+...-..|+|+|++.+.+...+.. .. T Consensus 65 --------k~Gy~GVA~~Sk~~P~~V~~G~g~~~~~D~eg----R~~i~~~i~a~f~~f~v~n~YvPNg~~~~~~~rl-- 130 (279) T TIGR00633 65 --------KKGYSGVAILSKVEPLSVRYGFGVLEEHDEEG----RVIIMRVITAEFDGFYVVNVYVPNGGARGLETRL-- 130 (279) T ss_pred --------CCCEEEEEEEECCCCCEEEECCCCCCCCCCCC----CEEEEEEEEEECCCEEEEEEEECCCCCCCCCCCH-- T ss_conf --------88513689986016320220687887210377----4653331010105479998776686113654462-- Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCC--CCCEEEEECCCCCCCCCC----------------------HHHHHHHHHCCCCC Q ss_conf 577899999999999998630368--980999802688877662----------------------04689875225575 Q gi|254781003|r 178 SPSCSLLSQQAQWLKDWITQKKES--LVPFVIAGDFNRKINYLG----------------------NNDDFWKTIDPNDS 233 (304) Q Consensus 178 ~~~~~~~~~q~~~l~~~~~~~~~~--~~~vIi~GDfN~~~~~~~----------------------~~~~~~~~~~~~~~ 233 (304) ..+.+..+.+.+.+.+.... +.|||||||||..+..-. ++.++.+.+... + T Consensus 131 ----~Yk~~~w~~~~~~~~~~l~~~~~~pv~~cGD~NvAh~eiDl~~P~~NRKr~l~~~~~gF~~~ER~~~~~ll~~g-g 205 (279) T TIGR00633 131 ----EYKLDFWEALLQYYLKELDASDGKPVIICGDLNVAHTEIDLANPKENRKRWLRKGNAGFTPEEREAFDELLEAG-G 205 (279) T ss_pred ----HHHHHHHHHHHHHHHHHHHHCCCCCEEEECCCCCCCCCCCCCCCCCHHHHHHHHCCCCCCHHHHHHHHHHHHCC-C T ss_conf ----78889999999999999861799979998477889771124577502479988188987889999999998617-9 Q ss_pred CCCCCCCCCCCC-CCCCCCCC----------CCCEEEEEECCCHHHCCCCCCEEEEEECCCCCCCCCCCCCCCCCEEEEE Q ss_conf 311144567876-56466888----------2014799986204310024721786411345544455557634389998 Q gi|254781003|r 234 LIRFPKEKDSRC-NANKNLRN----------KIPIDYFVMDQNAYKFLIQESFSEILYNEDDIKSRGKRLSDHCPISIDY 302 (304) Q Consensus 234 ~~~~~~~~~~~~-~~~~~~~~----------~~~iD~I~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~SDH~PV~~~l 302 (304) +.+..+....+. ..++++.+ ..||||+++|+.+........+... ..-...|||||||+++| T Consensus 206 f~DtFR~~~p~~~~~YTwW~YR~K~ar~~N~GWRiDy~l~s~~l~~~v~d~~I~~~-------~rg~ekPSDHcPi~l~~ 278 (279) T TIGR00633 206 FVDTFRHFNPDTEGAYTWWDYRSKRARDRNRGWRIDYFLVSEPLAARVVDSYIDSE-------VRGMEKPSDHCPIVLEL 278 (279) T ss_pred CEEEECCCCCCCCCCEEEEECCCCCCCCCCCCCEEEEEECCHHHHHHHHHHHCCCH-------HHCCCCCCCCCCEEEEE T ss_conf 64870365888797312540023677246734145326538689986510001303-------45278888735547775 Q ss_pred E Q ss_conf 4 Q gi|254781003|r 303 D 303 (304) Q Consensus 303 ~ 303 (304) + T Consensus 279 ~ 279 (279) T TIGR00633 279 D 279 (279) T ss_pred C T ss_conf 9 No 2 >TIGR00195 exoDNase_III exodeoxyribonuclease III; InterPro: IPR000097 DNA damaging agents such as the antitumor drugs bleomycin and neocarzinostatin or those that generate oxygen radicals produce a variety of lesions in DNA. Amongst these is base-loss which forms apurinic/apyrimidinic (AP) sites or strand breaks with atypical 3' termini. DNA repair at the AP sites is initiated by specific endonuclease cleavage of the phosphodiester backbone. Such endonucleases are also generally capable of removing blocking groups from the 3' terminus of DNA strand breaks. AP endonucleases can be classified into two families based on sequence similarity. This family contains members of AP endonuclease family 1. Except for Rrp1 and arp, these enzymes are proteins of about 300 amino-acid residues. Rrp1 and arp both contain additional and unrelated sequences in their N-terminal section (about 400 residues for Rrp1 and 270 for arp). The proteins contain glutamate which has been shown , in the Escherichia coli enzyme to bind a divalent metal ion such as magnesium or manganese.; GO: 0003677 DNA binding, 0004519 endonuclease activity, 0006281 DNA repair, 0005622 intracellular. Probab=99.98 E-value=5.3e-32 Score=192.93 Aligned_cols=238 Identities=17% Similarity=0.245 Sum_probs=142.2 Q ss_pred EEEEEEECCCCCCCCCCCCCCCCCCCHHHHHHHH--HHHHHHHCCCEEEEEE--CCCHHHHHHHHCCCCCEEEEECCCCC Q ss_conf 6999983014788667666776531018899999--9999860898899995--07857999860333430786214556 Q gi|254781003|r 24 VRLVSWNINTLSEQEGVSLWKNSVKRTTSDYTLL--RQYAKNLDADIVFLQE--MGSYNAVAKVFPKNTWCIFYSTERLI 99 (304) Q Consensus 24 lrV~twNi~~~~~~~~~~~~~~~~~~~~~~~~~l--~~~i~~~~~DIi~lQE--v~~~~~~~~~~~~~~~~~~~~~~~~~ 99 (304) ++++||||+++. +....+ ..++.+.+|||||||| |.+.+.-.+.+...+|+.+++...- T Consensus 1 mK~iSwNVNGlR----------------Ar~~~~dFl~~~~~~~pDilClQEtKv~~e~~P~~~~~~~GYh~f~~~~~~- 63 (281) T TIGR00195 1 MKLISWNVNGLR----------------ARLHKGDFLDWLKENKPDILCLQETKVADEQFPRELFEEEGYHVFFSGAKK- 63 (281) T ss_pred CCEEEECCCHHH----------------HHCCHHHHHHHHCCCCCCEEECCCCCCCCCCCCHHHHHHCCCEEEECCCEE- T ss_conf 955786022355----------------330043588884238995873201014566778789976498178736423- Q ss_pred CCCCCCCCCCCEEEEEEECC-CEEE-EEEECCCCCCCC--CCCCCCCCEEEEEEEEECCEEEEEEEECCCCC-CCCCCCH Q ss_conf 75432234463158998258-5267-653012246765--55555653369999982994999999406788-8775200 Q gi|254781003|r 100 NHSKRDSNNDIHTAIAVRKK-NVRV-LQQSYPLLGAKD--SFSRAGNRRAVELLVEINGKKIWVLDIHLKSF-CFLDSLE 174 (304) Q Consensus 100 ~~~~~~~~~~~~~~~~~~~~-~~~~-~~~~~~~~~~~~--~~~~~~~r~~~~~~~~~~~~~~~v~n~Hl~~~-~~~~~~~ 174 (304) +.++|+++.++ -|.. ....++....+. .......|. +.+.+..--..++|+|++.+.| +..+. T Consensus 64 ---------KGy~GVA~~sk~EP~~dV~~G~~~e~~PaltGdlD~e~R~-I~a~~~~~~G~~Tv~N~Y~PnGES~~~~-- 131 (281) T TIGR00195 64 ---------KGYSGVAIFSKEEPLSDVRYGFGVEEEPALTGDLDAEGRV-ITAEFDGFLGLLTVVNGYFPNGESRLDS-- 131 (281) T ss_pred ---------ECCCCEEEEECCCCHHHHHCCCCCCCCCCCCCCCCCCCCE-EEEEECCCEEEEEEEEEEECCCCCCCHH-- T ss_conf ---------3643079986268858860367887765332761216717-9997578302489998665598430014-- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCCCC-EEEEECCCCCCCCC----------------------CHHHHHHHHHCCC Q ss_conf 002577899999999999998630368980-99980268887766----------------------2046898752255 Q gi|254781003|r 175 NTYSPSCSLLSQQAQWLKDWITQKKESLVP-FVIAGDFNRKINYL----------------------GNNDDFWKTIDPN 231 (304) Q Consensus 175 ~~~~~~~~~~~~q~~~l~~~~~~~~~~~~~-vIi~GDfN~~~~~~----------------------~~~~~~~~~~~~~ 231 (304) +....+.+-.+.|..++++....+.| ||||||||..|..- .++.++.+.++.. T Consensus 132 ----~k~~YKl~w~~~l~~yl~~~~~~~~p~v~~CGD~NiAp~eiDl~~P~~nrnRWL~~GKCSF~~~ER~~~~~~l~~G 207 (281) T TIGR00195 132 ----EKLPYKLQWLEALQNYLEKLRDKDKPEVIICGDFNIAPEEIDLHSPDENRNRWLHAGKCSFLPEEREWLDRLLEAG 207 (281) T ss_pred ----HHHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHC T ss_conf ----5578899999999999999863689889997677889861356788511133133332138889999999998600 Q ss_pred CCCCCCCCCCCCC-CCCCCCCCCC----------CCEEEEEECCCHHHCCCCCCEEEEEECCCCCCCCCCCCCCCCCEEE Q ss_conf 7531114456787-6564668882----------0147999862043100247217864113455444555576343899 Q gi|254781003|r 232 DSLIRFPKEKDSR-CNANKNLRNK----------IPIDYFVMDQNAYKFLIQESFSEILYNEDDIKSRGKRLSDHCPISI 300 (304) Q Consensus 232 ~~~~~~~~~~~~~-~~~~~~~~~~----------~~iD~I~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~SDH~PV~~ 300 (304) .+.+..+..... ...++++.++ .||||+++|+.+..+.....+-..... .-.|||||||++ T Consensus 208 -n~vDtFR~~~p~~~~~YtWwdYR~~Ga~drN~GWRID~~l~s~~L~~~~~~~~I~~d~rg-------~EKPSDH~Pv~l 279 (281) T TIGR00195 208 -NFVDTFRKFNPDKEGAYTWWDYRTRGARDRNRGWRIDYFLVSEPLKERCVDCLIDKDVRG-------SEKPSDHCPVVL 279 (281) T ss_pred -CCEEEECCCCCCCCCCCCEEECCCCCCCCCCCHHHHHHHHHCHHHHHHHHHHHHCCCCCC-------CCCCCCCCCEEE T ss_conf -613320000568889400331133685024622433553308489888987641213001-------688887565266 Q ss_pred EE Q ss_conf 98 Q gi|254781003|r 301 DY 302 (304) Q Consensus 301 ~l 302 (304) +| T Consensus 280 ~~ 281 (281) T TIGR00195 280 EI 281 (281) T ss_pred EC T ss_conf 58 No 3 >TIGR03395 sphingomy sphingomyelin phosphodiesterase. Members of this family are bacterial proteins that act as sphingomyelin phosphodiesterase (EC 3.1.4.12), also called sphingomyelinase. Some members of this family have been shown to act as hemolysins. Probab=99.97 E-value=4.5e-29 Score=176.92 Aligned_cols=265 Identities=17% Similarity=0.105 Sum_probs=147.5 Q ss_pred EEEEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEEECCCHHHHHHHHCCCCCEEEEECCCCC---- Q ss_conf 6999983014788667666776531018899999999986089889999507857999860333430786214556---- Q gi|254781003|r 24 VRLVSWNINTLSEQEGVSLWKNSVKRTTSDYTLLRQYAKNLDADIVFLQEMGSYNAVAKVFPKNTWCIFYSTERLI---- 99 (304) Q Consensus 24 lrV~twNi~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~DIi~lQEv~~~~~~~~~~~~~~~~~~~~~~~~~---- 99 (304) |+||||||+.++..- . +......+.+.|++-.--.+.|||+||||.+..+..+++......+.......+ T Consensus 1 lKIlsyNv~~lp~~l----~--~n~~q~~Ra~~I~~~~~i~~~DVvvlqEvFd~~a~~~l~~~L~~~YpY~T~vvg~~~~ 74 (283) T TIGR03395 1 IKILSHNVYMLSTNL----Y--PNWGQMERADLIASADYIKNQDVVILNEAFDTSASKRLLDNLREEYPYQTDVIGRSKK 74 (283) T ss_pred CEEEEEEEEECCHHH----C--CCCCHHHHHHHHCCHHHCCCCCEEEEEHHCCHHHHHHHHHHHHHHCCCCCCCCCCCCC T ss_conf 927887703257101----3--7877789998744142136898899961228689999999788638841542476666 Q ss_pred CCCCC----CCCCCCEEEEEEECCCEEEEEEE--CCCCCCCCCCCCCCCCEEEEEEEEECCEEEEEEEECCCCCCCCCCC Q ss_conf 75432----23446315899825852676530--1224676555555653369999982994999999406788877520 Q gi|254781003|r 100 NHSKR----DSNNDIHTAIAVRKKNVRVLQQS--YPLLGAKDSFSRAGNRRAVELLVEINGKKIWVLDIHLKSFCFLDSL 173 (304) Q Consensus 100 ~~~~~----~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~v~n~Hl~~~~~~~~~ 173 (304) +.... ........|+.+.++.|++.... +......+. ...++++.+++..+|+.++|++|||.+....- T Consensus 75 gwd~t~g~~~~~~~~~gGv~I~Sk~PI~~~~~~if~~~~g~D~---~a~KG~~y~kI~k~gk~~hv~~THlqa~d~~~-- 149 (283) T TIGR03395 75 GWDKTLGNYSSSALEDGGVAIVSKWPIEEKIQYIFNKGCGADN---LSNKGFAYVKINKNGKKFHVIGTHLQAQDSMC-- 149 (283) T ss_pred CCCCCCCCCCCCCCCCCEEEEEECCCCEEEEEEEECCCCCCHH---HHCCCEEEEEEEECCEEEEEEECCCCCCCCCC-- T ss_conf 7645667655661317748999678710346898657747312---01157089999879979999981446578765-- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHC-CCCCCCEEEEECCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCC----- Q ss_conf 0002577899999999999998630-3689809998026888776620468987522557531114456787656----- Q gi|254781003|r 174 ENTYSPSCSLLSQQAQWLKDWITQK-KESLVPFVIAGDFNRKINYLGNNDDFWKTIDPNDSLIRFPKEKDSRCNA----- 247 (304) Q Consensus 174 ~~~~~~~~~~~~~q~~~l~~~~~~~-~~~~~~vIi~GDfN~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----- 247 (304) ........|..|+++++++++.. .+.+.+||++||||..-.+.++ ..+...+....+.......+..+... T Consensus 150 --~~~~~~~~R~~Q~~ei~~fi~~~~ip~~e~vi~aGDfNV~k~s~EY-~~Ml~~Ln~~~p~~~g~~~T~D~~~N~~a~~ 226 (283) T TIGR03395 150 --SKLGPASIRANQLNEIQDFIDSKNIPKDETVLIGGDLNVNKGSNEY-HDMFKTLNVSEPRYVGVPATWDATTNSIAKY 226 (283) T ss_pred --CCCHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCHHH-HHHHHHCCCCCCCCCCCCCCCCCCCCHHHHH T ss_conf --5411589999999999999985599989718994014426898789-9999972899976479865308764334552 Q ss_pred CCCCCCCCCEEEEEECCCHHHCCCCCCEEEEEECC-CCC-CCCCCCCCCCCCEEEEE Q ss_conf 46688820147999862043100247217864113-455-44455557634389998 Q gi|254781003|r 248 NKNLRNKIPIDYFVMDQNAYKFLIQESFSEILYNE-DDI-KSRGKRLSDHCPISIDY 302 (304) Q Consensus 248 ~~~~~~~~~iD~I~~s~~~~~~~~~~~~~~~~~~~-~~~-~~~~~~~SDH~PV~~~l 302 (304) ......++.|||||++++..+-............. .+. ......+|||+||...+ T Consensus 227 ~y~~~~~e~LDYIl~s~~h~qp~~~~n~v~~p~s~~~w~~~~~~~dlSDHyPV~~~i 283 (283) T TIGR03395 227 YYPKEEPEYLDYIFVSKSHAQPPVWQNKVLDPKSVTSWFKKYTYDDFSDHYPVYGFI 283 (283) T ss_pred CCCCCCCCEEEEEEEECCCCCCCCCCCCEECCCCCCHHHCCCCCCCCCCCCCEEEEC T ss_conf 369998530689999788668750303175121141221264567767778536449 No 4 >PRK11756 exonuclease III; Provisional Probab=99.97 E-value=3.2e-29 Score=177.71 Aligned_cols=232 Identities=17% Similarity=0.184 Sum_probs=127.9 Q ss_pred EEEEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEEECCCHH--HHHHHHCCCCCEEEEECCCCCCC Q ss_conf 6999983014788667666776531018899999999986089889999507857--99986033343078621455675 Q gi|254781003|r 24 VRLVSWNINTLSEQEGVSLWKNSVKRTTSDYTLLRQYAKNLDADIVFLQEMGSYN--AVAKVFPKNTWCIFYSTERLINH 101 (304) Q Consensus 24 lrV~twNi~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~DIi~lQEv~~~~--~~~~~~~~~~~~~~~~~~~~~~~ 101 (304) +||+||||+++. +++..+..++.+.+||||||||+-... .-...+...+|..++.. T Consensus 1 MkI~SwNVNgiR----------------ar~~~l~~~l~~~~pDIlcLQEtK~~~~~~p~~~~~~~gY~~~~~~------ 58 (268) T PRK11756 1 MKFVSFNINGLR----------------ARPHQLEAIIEKHQPDVIGLQETKVHDEMFPLEEVEALGYHVFYHG------ 58 (268) T ss_pred CEEEEEEEHHHH----------------HHHHHHHHHHHHCCCCEEEEECCCCCHHHCCHHHHHHCCCEEEEEC------ T ss_conf 989999866686----------------7588899999777998999987738811289889852695899961------ Q ss_pred CCCCCCCCCEEEEEEECCCE-EEEEEECCCCCCCCCCCCCCCCEEEEEEEEECCEEEEEEEECCCCCCCCCCCHHHHHHH Q ss_conf 43223446315899825852-67653012246765555556533699999829949999994067888775200002577 Q gi|254781003|r 102 SKRDSNNDIHTAIAVRKKNV-RVLQQSYPLLGAKDSFSRAGNRRAVELLVEINGKKIWVLDIHLKSFCFLDSLENTYSPS 180 (304) Q Consensus 102 ~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~v~n~Hl~~~~~~~~~~~~~~~~ 180 (304) .+.++|+++.++.. ......++.. . .....| .+.+.+......+.|+|++++++...+... . T Consensus 59 ------~KgysGVAi~sk~~p~~v~~g~~~~----~-~d~egR-~i~~~~~~~~g~~~lin~Y~PnG~~~~~~~-----k 121 (268) T PRK11756 59 ------QKGHYGVALLSKQTPIAVRKGFPTD----D-EEAQRR-IIMATIPTPNGNLTVINGYFPQGESRDHPT-----K 121 (268) T ss_pred ------CCCCCCEEEEECCCCHHEEECCCCC----C-CCCCCC-EEEEEEECCCCCEEEEEEEECCCCCCCCHH-----H T ss_conf ------6773444687236724458648888----5-444562-799999637762999998504888777256-----7 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCC----------------------HHHHHHHHHCCCCCCCCCC Q ss_conf 899999999999998630368980999802688877662----------------------0468987522557531114 Q gi|254781003|r 181 CSLLSQQAQWLKDWITQKKESLVPFVIAGDFNRKINYLG----------------------NNDDFWKTIDPNDSLIRFP 238 (304) Q Consensus 181 ~~~~~~q~~~l~~~~~~~~~~~~~vIi~GDfN~~~~~~~----------------------~~~~~~~~~~~~~~~~~~~ 238 (304) ...+....+.+.++++.....+.++|+|||||..+.... ++.++...+. ..+.+.. T Consensus 122 ~~~K~~f~~~l~~~l~~~~~~~k~vIi~GD~NVA~~~iDv~~~~~n~k~~~~~g~~~f~~eER~~~~~ll~--~G~vD~f 199 (268) T PRK11756 122 FPAKRQFYQDLQNYLETELSPDNPLLIMGDMNISPTDLDIGIGEENRKRWLRTGKCSFLPEEREWLERLMD--WGLVDTF 199 (268) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCEEEECCCCCCCCHHHCCCCCCCHHHHCCCCCCCCCHHHHHHHHHHCC--CCCEEEE T ss_conf 78899999999999999764799589956434358343135740033553436865688899999862203--8968877 Q ss_pred CCCCCC-CCCCCCCC---------CCCCEEEEEECCCHHHCCCCCCEEEEEECCCCCCCCCCCCCCCCCEEEEEE Q ss_conf 456787-65646688---------820147999862043100247217864113455444555576343899984 Q gi|254781003|r 239 KEKDSR-CNANKNLR---------NKIPIDYFVMDQNAYKFLIQESFSEILYNEDDIKSRGKRLSDHCPISIDYD 303 (304) Q Consensus 239 ~~~~~~-~~~~~~~~---------~~~~iD~I~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~SDH~PV~~~l~ 303 (304) +...+. ...++++. ...||||+++|+++........+.. -......+|||+||+++|+ T Consensus 200 R~~~p~~~~~yTwW~yr~~a~~~n~G~RIDy~l~S~~l~~~v~~~~I~~-------~~rg~~~~SDH~PV~~~l~ 267 (268) T PRK11756 200 RQLNPDVNDRFSWFDYRSKGFDDNRGLRIDLILATQPLAERCVETGIDY-------DIRGMEKPSDHAPIWATFK 267 (268) T ss_pred EEECCCCCCCEEEECCCCCCCCCCCEEEEEEEEECHHHHHHHEEEEECH-------HHHCCCCCCCEEEEEEEEE T ss_conf 2305899873687365568631487176899985999984530648518-------8827999998001899996 No 5 >COG0708 XthA Exonuclease III [DNA replication, recombination, and repair] Probab=99.96 E-value=5.9e-28 Score=170.81 Aligned_cols=230 Identities=15% Similarity=0.203 Sum_probs=131.1 Q ss_pred EEEEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEEECCC--HHHHHHHHCCCCCEEEEECCCCCCC Q ss_conf 69999830147886676667765310188999999999860898899995078--5799986033343078621455675 Q gi|254781003|r 24 VRLVSWNINTLSEQEGVSLWKNSVKRTTSDYTLLRQYAKNLDADIVFLQEMGS--YNAVAKVFPKNTWCIFYSTERLINH 101 (304) Q Consensus 24 lrV~twNi~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~DIi~lQEv~~--~~~~~~~~~~~~~~~~~~~~~~~~~ 101 (304) +||+||||+++ ..++..+.+++.+.+||||||||+-. ...-...+...+|+.++.. T Consensus 1 mkI~SwNVNgi----------------Rar~~~~~~~l~~~~pDVlclQEtK~~~~~fp~~~~~~~GY~~~~~~------ 58 (261) T COG0708 1 MKIASWNVNGL----------------RARLKKLLDWLEEEQPDVLCLQETKAQDEQFPREELEALGYHHVFNH------ 58 (261) T ss_pred CEEEEEEHHHH----------------HHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCHHHHHHCCCEEEEEC------ T ss_conf 90689830428----------------98999999999872999899974146766689767752795189952------ Q ss_pred CCCCCCCCCEEEEEEECCCEEEEE-EECCCCCCCCCCCCCCCCEEEEEEEEECCEEEEEEEECCCCCCCCCCCHHHHHHH Q ss_conf 432234463158998258526765-3012246765555556533699999829949999994067888775200002577 Q gi|254781003|r 102 SKRDSNNDIHTAIAVRKKNVRVLQ-QSYPLLGAKDSFSRAGNRRAVELLVEINGKKIWVLDIHLKSFCFLDSLENTYSPS 180 (304) Q Consensus 102 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~v~n~Hl~~~~~~~~~~~~~~~~ 180 (304) +.+.++|+++.++.+.... ..++.....+.. +-.+....++ +.|+|++++++...+... T Consensus 59 -----gqKgysGVailsr~~~~~v~~g~~~~~~~d~e-------~R~I~a~~~~--~~v~~~Y~PnG~~~~~~k------ 118 (261) T COG0708 59 -----GQKGYSGVAILSKKPPDDVRRGFPGEEEDDEE-------GRVIEAEFDG--FRVINLYFPNGSSIGLEK------ 118 (261) T ss_pred -----CCCCCCEEEEEECCCCHHHHCCCCCCCCCCCC-------CCEEEEEECC--EEEEEEECCCCCCCCCCC------ T ss_conf -----76773448999846830210279987665434-------7579999798--999999727999888710------ Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCH-------------------HHHHHHHHCCCCCCCCCCCCC Q ss_conf 8999999999999986303689809998026888776620-------------------468987522557531114456 Q gi|254781003|r 181 CSLLSQQAQWLKDWITQKKESLVPFVIAGDFNRKINYLGN-------------------NDDFWKTIDPNDSLIRFPKEK 241 (304) Q Consensus 181 ~~~~~~q~~~l~~~~~~~~~~~~~vIi~GDfN~~~~~~~~-------------------~~~~~~~~~~~~~~~~~~~~~ 241 (304) ...+..-++.+..++.+....+.|+|+|||||..+..-.. +.++...+... +.+..... T Consensus 119 ~~yKl~f~~~l~~~l~~l~~~~~~~vl~GD~NIap~~iDv~~~~~~~~n~~~~~f~~eeR~~~~~ll~~G--~~D~~R~~ 196 (261) T COG0708 119 FDYKLRFLDALRNYLEELLKKGKPVVLCGDFNIAPEEIDVANPKKRWLNEGNSGFLPEERAWFRRLLNAG--FVDTFRLF 196 (261) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCHHCCCCCHHHHHCCCCCCCCHHHHHHHHHHHHCC--HHHHHHHH T ss_conf 6789999999999999987459987996663117853212572555401688898989999999998753--06456764 Q ss_pred CCCCCCCCCCC---------CCCCEEEEEECCCHHHCCCCCCEEEEEECCCCCCCCCCCCCCCCCEEEEEEC Q ss_conf 78765646688---------8201479998620431002472178641134554445555763438999849 Q gi|254781003|r 242 DSRCNANKNLR---------NKIPIDYFVMDQNAYKFLIQESFSEILYNEDDIKSRGKRLSDHCPISIDYDF 304 (304) Q Consensus 242 ~~~~~~~~~~~---------~~~~iD~I~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~SDH~PV~~~l~f 304 (304) ......++++. ...|||||++|+.+..+.....+-. -......+|||+||+++|++ T Consensus 197 ~p~~~~YTwW~YR~~~~~~n~G~RID~~l~S~~L~~~~~~a~I~~-------~~rg~e~pSDHaPV~~e~~~ 261 (261) T COG0708 197 HPEPEKYTWWDYRANAARRNRGWRIDYILVSPALADRLKDAGIDR-------EVRGWEKPSDHAPVWVELDL 261 (261) T ss_pred CCCCCCCCCCCCCCCHHHHCCCEEEEEEEECHHHHHHHHHCCCCH-------HHHCCCCCCCCCCEEEEECC T ss_conf 899996351256213056638606888971899998886668637-------88558888885757999629 No 6 >COG3568 ElsH Metal-dependent hydrolase [General function prediction only] Probab=99.96 E-value=4.5e-28 Score=171.44 Aligned_cols=238 Identities=16% Similarity=0.108 Sum_probs=131.1 Q ss_pred CEEEEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEEECCCHHHH--------HHHHC-CCCCEEEE Q ss_conf 2699998301478866766677653101889999999998608988999950785799--------98603-33430786 Q gi|254781003|r 23 KVRLVSWNINTLSEQEGVSLWKNSVKRTTSDYTLLRQYAKNLDADIVFLQEMGSYNAV--------AKVFP-KNTWCIFY 93 (304) Q Consensus 23 ~lrV~twNi~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~DIi~lQEv~~~~~~--------~~~~~-~~~~~~~~ 93 (304) .++|+||||+++....+ ...+.+.+++.+++.++||||||||...... ..... ..+....+ T Consensus 9 ~~~v~TyNih~~~~~~d----------~r~~~~r~~~~i~~~~~Div~LQEv~~~~~~~r~~~~~~~~~~~~~~~~~~~~ 78 (259) T COG3568 9 RFKVLTYNIHKGFGAFD----------RRFDLPRIAEVIREVGADIVALQEVDGAFGRHRDGLLDLPHLLGRLGLAPYWW 78 (259) T ss_pred EEEEEEEEEEECCCCCC----------CEECHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCHHHHHHHHHCCCCCCC T ss_conf 23789999887667667----------26447999999976436701464144455542224213678888750875210 Q ss_pred ECCCCCCCCCCCCCCCCEEEEEEECCCEEEEEEECCCCCCCCCCCCCCCCEEEEEEEEEC-CEEEEEEEECCCCCCCCCC Q ss_conf 214556754322344631589982585267653012246765555556533699999829-9499999940678887752 Q gi|254781003|r 94 STERLINHSKRDSNNDIHTAIAVRKKNVRVLQQSYPLLGAKDSFSRAGNRRAVELLVEIN-GKKIWVLDIHLKSFCFLDS 172 (304) Q Consensus 94 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~-~~~~~v~n~Hl~~~~~~~~ 172 (304) +... ..........|.+++++.+............ .....|+++.+++..+ |+++.|+|+||... T Consensus 79 ~~~~-----~~a~~~~~~~GnaiLS~~pi~~v~~~~lp~~----~~~~~Rgal~a~~~~~~g~~l~V~~~HL~l~----- 144 (259) T COG3568 79 SGAA-----FGAVYGEGQHGNAILSRLPIRDVENLALPDP----TGLEPRGALLAEIELPGGKPLRVINAHLGLS----- 144 (259) T ss_pred CHHH-----HHHHCCCCEEEEEEEECCCCCCHHHCCCCCC----CCCCCCEEEEEEEECCCCCEEEEEEEECCCC----- T ss_conf 1134-----4420010011159985376422110369987----7877752699999768997799999741633----- Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCC-CCCCCCCC Q ss_conf 00002577899999999999998630368980999802688877662046898752255753111445678-76564668 Q gi|254781003|r 173 LENTYSPSCSLLSQQAQWLKDWITQKKESLVPFVIAGDFNRKINYLGNNDDFWKTIDPNDSLIRFPKEKDS-RCNANKNL 251 (304) Q Consensus 173 ~~~~~~~~~~~~~~q~~~l~~~~~~~~~~~~~vIi~GDfN~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~ 251 (304) .+.+..|++.|.+.+. . ....|+|+|||||..+.+..++......+..............+ +..+.... T Consensus 145 --------~~~R~~Q~~~L~~~~~-l-~~~~p~vl~GDFN~~p~s~~yr~~~~~~~~~~~~~~~~~~~a~~~~~~tfps~ 214 (259) T COG3568 145 --------EESRLRQAAALLALAG-L-PALNPTVLMGDFNNEPGSAEYRLAARSPLNAQAALTGAFAPAVGRTIRTFPSN 214 (259) T ss_pred --------HHHHHHHHHHHHHHCC-C-CCCCCEEEECCCCCCCCCCCCEECCCCCHHHCCCCCCCCCCCCCCCCCCCCCC T ss_conf --------8789999999986356-7-56686389713789988724031048821110322356576667556777777 Q ss_pred CCCCCEEEEEECCCHHHCCCCCCEEEEEECCCCCCCCCCCCCCCCCEEEEEEC Q ss_conf 88201479998620431002472178641134554445555763438999849 Q gi|254781003|r 252 RNKIPIDYFVMDQNAYKFLIQESFSEILYNEDDIKSRGKRLSDHCPISIDYDF 304 (304) Q Consensus 252 ~~~~~iD~I~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~SDH~PV~~~l~f 304 (304) ....+|||||+++.+.... ..+. .. ......|||+||.+||++ T Consensus 215 ~p~lriD~Ifvs~~~~i~~----~~v~-----~~-~~a~~aSDHlPl~aeL~~ 257 (259) T COG3568 215 TPLLRLDRIFVSKELAIRS----VHVL-----TD-RLARVASDHLPLLAELRL 257 (259) T ss_pred CCCCCCCEEEECCCCCEEE----EEEE-----CC-CCCCCCCCCCCEEEEEEC T ss_conf 8542266899547441787----7752-----37-776421466556889742 No 7 >pfam03372 Exo_endo_phos Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNase I proteins EC:3.1.21.1, Synaptojanin an inositol-1,4,5-trisphosphate phosphatase EC:3.1.3.56, Sphingomyelinase EC:3.1.4.12 and Nocturnin. Probab=99.96 E-value=4.8e-27 Score=165.85 Aligned_cols=238 Identities=16% Similarity=0.068 Sum_probs=120.6 Q ss_pred EEEEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEEECCCHHHHHHH--HCCCCCEEEEECCCCCCC Q ss_conf 699998301478866766677653101889999999998608988999950785799986--033343078621455675 Q gi|254781003|r 24 VRLVSWNINTLSEQEGVSLWKNSVKRTTSDYTLLRQYAKNLDADIVFLQEMGSYNAVAKV--FPKNTWCIFYSTERLINH 101 (304) Q Consensus 24 lrV~twNi~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~DIi~lQEv~~~~~~~~~--~~~~~~~~~~~~~~~~~~ 101 (304) |||+||||+++....... ....+.+++...+|||+||||+......... .....+.... ....... T Consensus 1 l~i~t~Nv~g~~~~~~~~-----------~~~~l~~~~~~~~~Dii~LQE~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~ 68 (255) T pfam03372 1 LKILTWNVNGLSAALKLL-----------WARLLLELLKLEDPDVICLQETKLSPESLILLLLLLLGYVSPL-TSSVYTG 68 (255) T ss_pred CEEEEEEECCHHHHHHCC-----------CHHHHHHHHHHCCCCEEEEECCCCCCHHCCHHHHHHCCCCCCC-CCCCEEE T ss_conf 989999946721441046-----------6789999997689988999812179012797577535874100-3421035 Q ss_pred CCCCCCCCCEEEEEEECCCEEEEEEECCCCCCCCCCCCCCCCEEEEEEEEECCEEEEEEEECCCCCCCCCCCHHHHHHHH Q ss_conf 43223446315899825852676530122467655555565336999998299499999940678887752000025778 Q gi|254781003|r 102 SKRDSNNDIHTAIAVRKKNVRVLQQSYPLLGAKDSFSRAGNRRAVELLVEINGKKIWVLDIHLKSFCFLDSLENTYSPSC 181 (304) Q Consensus 102 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~v~n~Hl~~~~~~~~~~~~~~~~~ 181 (304) .........++++.++.+............ ........+..++..+.++++|++++... . T Consensus 69 ~~~~~~~~~Gvai~~r~~~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~~v~~~y~~~~~~~----------~ 128 (255) T pfam03372 69 FGGGGGGKGGVAILSKEPLVEVILGIFSEN----------EKDFIRRRILANGKSFVVVNTHLPAGGEN----------L 128 (255) T ss_pred EECCCCCCCCEEEEECCCCCEEEECCCCCC----------CCCEEEEEEEEECCCEEEEEEECCCCCCC----------H T ss_conf 531457625259996478622441578765----------57278999997449899999974699975----------3 Q ss_pred HHHHHHHHHHHHHHHHC-CCCCCCEEEEECCCCCCCCCCHHHHHHHHHCCCCCC---------CCCCCCCCCCCCCCC-- Q ss_conf 99999999999998630-368980999802688877662046898752255753---------111445678765646-- Q gi|254781003|r 182 SLLSQQAQWLKDWITQK-KESLVPFVIAGDFNRKINYLGNNDDFWKTIDPNDSL---------IRFPKEKDSRCNANK-- 249 (304) Q Consensus 182 ~~~~~q~~~l~~~~~~~-~~~~~~vIi~GDfN~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~-- 249 (304) ..+..|...+.+.+... ...+.++|+|||||..+................... ..........++... T Consensus 129 ~~r~~~~~~l~~~l~~~~~~~~~~~ii~GDfN~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~T~~~~~~ 208 (255) T pfam03372 129 DERLAQLAELLDFLSDLRIPKSDPVILCGDFNARPDSWDSALLKSIGKSTLFLLLERDLVFDGFDELPIGFPPTWWSYRN 208 (255) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCHHHHCCHHHHHCCCCCCHHHHHHEEEEEECCCCCCCCCCCCCCC T ss_conf 37999999999999986543598089972024368422011778772454688566533372255489999878707746 Q ss_pred ---CCCCCCCEEEEEECCCHHHCCCCCCEEEEEECCCCCCCCCCCCCCCCCEEEEE Q ss_conf ---68882014799986204310024721786411345544455557634389998 Q gi|254781003|r 250 ---NLRNKIPIDYFVMDQNAYKFLIQESFSEILYNEDDIKSRGKRLSDHCPISIDY 302 (304) Q Consensus 250 ---~~~~~~~iD~I~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~SDH~PV~~~l 302 (304) ....+.||||||++++............ .....+|||+||.++| T Consensus 209 ~~~~~~~~~riD~il~s~~~~~~~~~~~~~~---------~~~~~~SDH~Pv~~~l 255 (255) T pfam03372 209 SSEKKNTGSRLDRILVSGSLLRRVVILSLLL---------LVIFTGSDHRPVLATL 255 (255) T ss_pred CCCCCCCCEEEEEEEECHHHHHHCEEEEEEE---------CCCCCCCCCCCEEEEC T ss_conf 7563798782389996889982233898740---------6799999974869979 No 8 >PRK13911 hypothetical protein; Provisional Probab=99.95 E-value=4.2e-26 Score=160.70 Aligned_cols=221 Identities=17% Similarity=0.217 Sum_probs=116.9 Q ss_pred EEEEEEECCCCCCCCCCCCCCCCCCCHHHHH-HHHHHHHHHHCCCEEEEEECCCHHHHHHHHCCCCCEEEEECCCCCCCC Q ss_conf 6999983014788667666776531018899-999999986089889999507857999860333430786214556754 Q gi|254781003|r 24 VRLVSWNINTLSEQEGVSLWKNSVKRTTSDY-TLLRQYAKNLDADIVFLQEMGSYNAVAKVFPKNTWCIFYSTERLINHS 102 (304) Q Consensus 24 lrV~twNi~~~~~~~~~~~~~~~~~~~~~~~-~~l~~~i~~~~~DIi~lQEv~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 102 (304) +||+||||+++. +++ +.+.+++...+||||||||+-.... ...+...+|..++... T Consensus 1 MkI~swNVNgiR----------------a~~~~~~~~~l~~~~pDIlcLQEtK~~~~-~~~~~~~gy~~~~~~~------ 57 (250) T PRK13911 1 MKLISWNVNGLR----------------ACMTKGFMDFFNSVDADVFCIQESKMQQE-QNTFEFKGYFDFWNCA------ 57 (250) T ss_pred CEEEEEEEHHHH----------------HHHHHHHHHHHHHCCCCEEEEEEEECCCC-CCCCCCCCCCEEEEEE------ T ss_conf 989999737489----------------87770399999667998999973445876-5772305850056211------ Q ss_pred CCCCCCCCEEEEEEECCCE-EEEEEECCCCCCCCCCCCCCCCEEEEEEEEECCEEEEEEEECCCCCCCCCCCHHHHHHHH Q ss_conf 3223446315899825852-676530122467655555565336999998299499999940678887752000025778 Q gi|254781003|r 103 KRDSNNDIHTAIAVRKKNV-RVLQQSYPLLGAKDSFSRAGNRRAVELLVEINGKKIWVLDIHLKSFCFLDSLENTYSPSC 181 (304) Q Consensus 103 ~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~v~n~Hl~~~~~~~~~~~~~~~~~ 181 (304) ..+.++|+++.++.. .......... ......| .+.... ..+.++|++.+++...... . T Consensus 58 ----~~kgysGVai~sk~~p~~v~~g~~~~-----~~d~egR---~i~~~~--~~~~l~n~Y~Png~~~~~~-------~ 116 (250) T PRK13911 58 ----IKKGYSGVVTFTKKEPLSVSYGINIE-----EHDKEGR---VVTCEF--ESFYLVNVYTPNSQQALSR-------L 116 (250) T ss_pred ----EECCCCCEEEEECCCCCEEEECCCCC-----CCCCCCC---EEEEEE--CCEEEEEEECCCCCCCCCH-------H T ss_conf ----55441634888648872067168864-----3220263---799998--9999985204898665501-------6 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCC----------------HHHHHHHHHCCCCCCCCCCCCCCCC- Q ss_conf 99999999999998630368980999802688877662----------------0468987522557531114456787- Q gi|254781003|r 182 SLLSQQAQWLKDWITQKKESLVPFVIAGDFNRKINYLG----------------NNDDFWKTIDPNDSLIRFPKEKDSR- 244 (304) Q Consensus 182 ~~~~~q~~~l~~~~~~~~~~~~~vIi~GDfN~~~~~~~----------------~~~~~~~~~~~~~~~~~~~~~~~~~- 244 (304) ..+......+.++++.. ..+.++|+|||||..+.... ++.++...+.. .+.+..+...+. T Consensus 117 ~~K~~~~~~~~~~~~~l-~~~k~vI~~GD~NIa~~~~Dv~~~~~~~~~~gf~~~ER~~~~~ll~~--G~vD~fR~~~p~~ 193 (250) T PRK13911 117 SYRMSWEVEFKKFLKAL-ELKKPVIVCGDLNVAHNEIDLENPKTNRKNAGFSDEERGKFSELLNA--GFIDTFRYFYPNK 193 (250) T ss_pred HHHHHHHHHHHHHHHHH-CCCCCEEEECCCCCCCCHHCCCCHHHHCCCCCCCHHHHHHHHHHHHC--CCEEEEEECCCCC T ss_conf 68999999999999985-14898899633354687201218587567977698999999998607--9717652107899 Q ss_pred CCCCCCCC---------CCCCEEEEEECCCHHHCCCCCCEEEEEECCCCCCCCCCCCCCCCCEEEEE Q ss_conf 65646688---------82014799986204310024721786411345544455557634389998 Q gi|254781003|r 245 CNANKNLR---------NKIPIDYFVMDQNAYKFLIQESFSEILYNEDDIKSRGKRLSDHCPISIDY 302 (304) Q Consensus 245 ~~~~~~~~---------~~~~iD~I~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~SDH~PV~~~l 302 (304) ...++++. ...||||+++|+.+........+ ......|||+||++|| T Consensus 194 ~~~yTwW~yr~~a~~~N~G~RID~~l~S~~l~~~v~~~~I-----------~~~~~~SDH~PV~leL 249 (250) T PRK13911 194 EKAYTWWSYMQQARDKNIGWRIDYFLCSNPLKTRLKDALI-----------YKDILGSDHCPVGLEL 249 (250) T ss_pred CCCCCCCCCCCCCCCCCCEEEEEEEEECHHHHHHCEEEEE-----------CCCCCCCCCCCEEEEE T ss_conf 9864530700166768988788899988899955016167-----------3899999975769995 No 9 >smart00476 DNaseIc deoxyribonuclease I. Deoxyribonuclease I catalyzes the endonucleolytic cleavage of double-stranded DNA. The enzyme is secreted outside the cell and also involved in apoptosis in the nucleus. Probab=99.94 E-value=1.1e-24 Score=152.90 Aligned_cols=249 Identities=16% Similarity=0.186 Sum_probs=142.9 Q ss_pred CCCCCCCEEEEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEEECCCH--HHHHHHHCC------CC Q ss_conf 8767882699998301478866766677653101889999999998608988999950785--799986033------34 Q gi|254781003|r 17 TASVAQKVRLVSWNINTLSEQEGVSLWKNSVKRTTSDYTLLRQYAKNLDADIVFLQEMGSY--NAVAKVFPK------NT 88 (304) Q Consensus 17 ~~~~~~~lrV~twNi~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~DIi~lQEv~~~--~~~~~~~~~------~~ 88 (304) ....+.+|||+||||+.|+.. +...+.-++.|++++++. ||+++|||.+. .++.+++.. .. T Consensus 11 ~~~~~~~lrIasFNIq~FG~s---------K~~~~~v~~~l~kI~~ry--Div~IQEV~d~~~~av~~L~~~LN~~~~~~ 79 (276) T smart00476 11 LLHGAASLRICAFNIQSFGDS---------KMSNATLMSIIVKILSRY--DIALVQEVRDSDLSAVPKLMDQLNSDSPNT 79 (276) T ss_pred HHCCCCCEEEEEEECHHHCCC---------CCCCHHHHHHHHHHHHHC--CEEEEEEEECCCCCHHHHHHHHHHCCCCCC T ss_conf 846666259999860323734---------013758999999999739--889999998488778999999996368888 Q ss_pred CEEEEECCCCCCCCCCCCCCCCEEEEEEECCCEEEEEEECCCCCCCCCCCCCCCCEEEEEEEEECCE---EEEEEEECCC Q ss_conf 3078621455675432234463158998258526765301224676555555653369999982994---9999994067 Q gi|254781003|r 89 WCIFYSTERLINHSKRDSNNDIHTAIAVRKKNVRVLQQSYPLLGAKDSFSRAGNRRAVELLVEINGK---KIWVLDIHLK 165 (304) Q Consensus 89 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~---~~~v~n~Hl~ 165 (304) |....+.+ .+ .. ..+...++.+|...+.+... .............+.|.+..+++..++. .|.++++|.. T Consensus 80 y~~~iS~r-lG----r~-sykEqYafiYR~d~v~v~~~-y~y~D~~~~~~D~F~REPfv~~F~~~~t~~~~F~lv~~Ht~ 152 (276) T smart00476 80 YSYVSSEP-LG----RN-SYKEQYLFLYRSDLVSVLDS-YLYDDGCECGNDVFSREPFVVKFSSPSTAVKEFVIVPLHTT 152 (276) T ss_pred CCEEECCC-CC----CC-CCCEEEEEEEECCEEEECCC-EEECCCCCCCCCCEECCCEEEEEECCCCCCCCEEEEEEECC T ss_conf 60785677-88----87-64267999982556887263-58247778988727139979999758966662699997448 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHHHHH---HHHCCCCCCCEEEEECCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 88877520000257789999999999999---863036898099980268887766204689875225575311144567 Q gi|254781003|r 166 SFCFLDSLENTYSPSCSLLSQQAQWLKDW---ITQKKESLVPFVIAGDFNRKINYLGNNDDFWKTIDPNDSLIRFPKEKD 242 (304) Q Consensus 166 ~~~~~~~~~~~~~~~~~~~~~q~~~l~~~---~~~~~~~~~~vIi~GDfN~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 242 (304) +.... + ++++|.+. +.+. -...++|++||||..++......+..-.+.....+.++-.... T Consensus 153 P~~a~-----------~----Eid~L~dV~~dv~~~-~~~~diiilGDFNAdcsyv~~~~~~~irl~~~~~f~WlI~d~~ 216 (276) T smart00476 153 PEAAV-----------A----EIDALYDVYLDVRQK-WGTEDVIFMGDFNAGCSYVTKKQWSSIRLRTSPTFHWLIPDSA 216 (276) T ss_pred CCCCH-----------H----HHHHHHHHHHHHHHH-CCCCCEEEECCCCCCCCCCCHHHHHHCCCCCCCCCEEECCCCC T ss_conf 23027-----------8----999999999999874-4766679962456887655755653220102686137447887 Q ss_pred CCCCCCCCCCCCCCEEEEEECC-CHHHCCCCCCEEEEEECCCCCC--CCCCCCCCCCCEEEEEE Q ss_conf 8765646688820147999862-0431002472178641134554--44555576343899984 Q gi|254781003|r 243 SRCNANKNLRNKIPIDYFVMDQ-NAYKFLIQESFSEILYNEDDIK--SRGKRLSDHCPISIDYD 303 (304) Q Consensus 243 ~~~~~~~~~~~~~~iD~I~~s~-~~~~~~~~~~~~~~~~~~~~~~--~~~~~~SDH~PV~~~l~ 303 (304) .++- . .....+|.|++.. .+.....+++..+..|...... ......|||+||.++|. T Consensus 217 dTTv---~-~t~caYDriv~~g~~~~~~~~~gsa~v~~F~~~~~Lt~~~Al~VSDH~PVev~Lk 276 (276) T smart00476 217 DTTV---T-STHCAYDRIVVAGERLRSSVVPGSAAVFDFQTAYGLTEEEALAISDHFPVEVTLK 276 (276) T ss_pred CCCC---C-CCCCCCCEEEEECHHHHHCCCCCCCEEEECHHHHCCCHHHHHHCCCCCCEEEEEC T ss_conf 7643---6-8887761499975455423136874267437864959999974677776699949 No 10 >KOG2338 consensus Probab=99.90 E-value=5.3e-22 Score=138.28 Aligned_cols=180 Identities=16% Similarity=0.151 Sum_probs=102.8 Q ss_pred CCEEEEEEEC--CCCCC-CCCCC-CCCCCCCCHHHHHHHHHHHHHHHCCCEEEEEECCCHHH---HHHHHCCCCCEEEEE Q ss_conf 8269999830--14788-66766-67765310188999999999860898899995078579---998603334307862 Q gi|254781003|r 22 QKVRLVSWNI--NTLSE-QEGVS-LWKNSVKRTTSDYTLLRQYAKNLDADIVFLQEMGSYNA---VAKVFPKNTWCIFYS 94 (304) Q Consensus 22 ~~lrV~twNi--~~~~~-~~~~~-~~~~~~~~~~~~~~~l~~~i~~~~~DIi~lQEv~~~~~---~~~~~~~~~~~~~~~ 94 (304) ..++|||||| +.+.. +.... ...+..-.|..+.+.|.+.+...+|||+|||||+.... ..+.+...||..+.. T Consensus 115 f~ftvmSYNILAQ~l~~~~~r~~~~~s~~~L~W~~R~~~Ll~EL~~~dpDIlCLQEVq~d~~~~~~~~~~~~lGy~~~~~ 194 (495) T KOG2338 115 FDFTVMSYNILAQDLLRDIRRLYFPESGPALKWLRRSQNLLNELKHYDPDVLCLQEVQEDHYPEFWQPLLGKLGYTGFFK 194 (495) T ss_pred CCEEEEEEHHHHHHHHHHHHHHHCCCCHHHCCHHHHHHHHHHHHHHCCCCEEEEHHHHHHHHHHHHHHHHHHCCCEEEEE T ss_conf 64489871574788888767761569803235567768999987513987433002313454899999886268327898 Q ss_pred CCCCCCCCCCCCCCCCEEEEEEECCCEEEEEEECC-CCCCCCCCCCCCCCEEEEEEEEE-----CCEEEEEEEECCCCCC Q ss_conf 14556754322344631589982585267653012-24676555555653369999982-----9949999994067888 Q gi|254781003|r 95 TERLINHSKRDSNNDIHTAIAVRKKNVRVLQQSYP-LLGAKDSFSRAGNRRAVELLVEI-----NGKKIWVLDIHLKSFC 168 (304) Q Consensus 95 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~r~~~~~~~~~-----~~~~~~v~n~Hl~~~~ 168 (304) .. .....-+++++++...+.+....-. +........+. ...+..+.+.. -++.+.|.||||-+.+ T Consensus 195 r~--------t~~KthG~ai~w~~~~F~lv~~~~l~y~~~~~~l~n~-~NV~lvv~l~f~~~~~~sq~ilVanTHLl~np 265 (495) T KOG2338 195 RR--------TGTKTHGVAILWHSAKFKLVNHSELNYFDSGSALANR-DNVGLVVSLEFRLVDESSQGILVANTHLLFNP 265 (495) T ss_pred EC--------CCCCCCEEEEEEECCCCEECCCCHHHCCCCCCHHHCC-CCEEEEEEEEECCCCCCCCCEEEEEEEEEECC T ss_conf 41--------5777765899970330203254001214642023132-55227998775246744573378752056557 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCC---CCCCEEEEECCCCCCCCCCH Q ss_conf 77520000257789999999999999863036---89809998026888776620 Q gi|254781003|r 169 FLDSLENTYSPSCSLLSQQAQWLKDWITQKKE---SLVPFVIAGDFNRKINYLGN 220 (304) Q Consensus 169 ~~~~~~~~~~~~~~~~~~q~~~l~~~~~~~~~---~~~~vIi~GDfN~~~~~~~~ 220 (304) ..... +..|...|.+.+++... ..-|+++|||||..|.++.+ T Consensus 266 ~~~~v----------rL~Q~~iiL~~~~~~~~~~~~~~pi~l~GDfNt~p~~~~y 310 (495) T KOG2338 266 SRSDV----------RLAQVYIILAELEKMSKSSKSHWPIFLCGDFNTEPDSPPY 310 (495) T ss_pred CCCCH----------HHHHHHHHHHHHHHHHHHCCCCCCEEEECCCCCCCCCCCC T ss_conf 66611----------1578999999999997534357980770577789988731 No 11 >COG5239 CCR4 mRNA deadenylase, exonuclease subunit and related nucleases [RNA processing and modification] Probab=99.86 E-value=9.2e-21 Score=131.50 Aligned_cols=275 Identities=18% Similarity=0.116 Sum_probs=143.6 Q ss_pred CEEEEEEEC--CCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEEECCCHH---HHHHHHCCCCCEEEEECCC Q ss_conf 269999830--14788667666776531018899999999986089889999507857---9998603334307862145 Q gi|254781003|r 23 KVRLVSWNI--NTLSEQEGVSLWKNSVKRTTSDYTLLRQYAKNLDADIVFLQEMGSYN---AVAKVFPKNTWCIFYSTER 97 (304) Q Consensus 23 ~lrV~twNi--~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~DIi~lQEv~~~~---~~~~~~~~~~~~~~~~~~~ 97 (304) +++|||||+ +.+....-.+.. ++.-.|..+...|.+.+...+|||+|||||+... .+.+.+...+|...+-... T Consensus 30 ~ftimTYN~Laq~y~~r~~y~~s-~~aL~W~~R~~~L~~EL~~Yn~Di~CLQEvd~~~~~~fw~~~l~~~gY~~if~~k~ 108 (378) T COG5239 30 DFTIMTYNVLAQTYATRKMYPYS-GWALKWSYRSRLLLQELLYYNADILCLQEVDAEDFEDFWKDQLGKLGYDGIFIPKE 108 (378) T ss_pred EEEEEEHHHHHHHHCCCCCCCCC-HHHHHHHHHHHHHHHHHHCCCCCEEEEEHHHHHHHHHHHHHHHCCCCCCEEEECCC T ss_conf 36998613534665333124775-55522378889999987403786566211336678888888733556432882589 Q ss_pred CCCCCCCC--CCCCCEEEEEEEC----CCEEEEEEECCCCCCCCCC--------------CCCCCCE----EEEEEEEEC Q ss_conf 56754322--3446315899825----8526765301224676555--------------5556533----699999829 Q gi|254781003|r 98 LINHSKRD--SNNDIHTAIAVRK----KNVRVLQQSYPLLGAKDSF--------------SRAGNRR----AVELLVEIN 153 (304) Q Consensus 98 ~~~~~~~~--~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~--------------~~~~~r~----~~~~~~~~~ 153 (304) .+...-.. .....++++..+. ++..++.........+... ....... .....-..+ T Consensus 109 ~k~~~m~~~d~t~~dGc~if~k~~~~~sk~~li~~~~~~f~~p~~~~er~r~t~~~lnri~e~~~~~w~~l~~~l~n~e~ 188 (378) T COG5239 109 RKVKWMIDYDTTKVDGCAIFLKRFIDSSKLGLILAVTHLFWHPYGYYERFRQTYILLNRIGEKDNIAWVCLFVGLFNKEP 188 (378) T ss_pred CCCCCCCCCCCCCCCEEEEEEEECCCCCHHHHHHHHHHHHCCCCEEEHHHHHHHHHHHHHHHHHHCCHHHEEEEECCCCC T ss_conf 84442443334323038999875266101124324167640620010134677999999865431031320333215799 Q ss_pred CEEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCC-----------CCCCEEEEECCCCCCCCCCHHH Q ss_conf 94999999406788877520000257789999999999999863036-----------8980999802688877662046 Q gi|254781003|r 154 GKKIWVLDIHLKSFCFLDSLENTYSPSCSLLSQQAQWLKDWITQKKE-----------SLVPFVIAGDFNRKINYLGNND 222 (304) Q Consensus 154 ~~~~~v~n~Hl~~~~~~~~~~~~~~~~~~~~~~q~~~l~~~~~~~~~-----------~~~~vIi~GDfN~~~~~~~~~~ 222 (304) |.++.+.|||+++.....+... ..+..+... +...+.+..+ +..++.++||||+...+..+.. T Consensus 189 gd~~~va~Th~~w~~~~~dvk~---iq~s~l~~~---~k~~~~e~~~~d~~~~d~k~~~~~~~l~~gd~ns~~~s~vy~~ 262 (378) T COG5239 189 GDTPYVANTHLPWDPKYRDVKL---IQCSLLYRE---LKKVLKEELNDDKEEGDIKSYPEVDILITGDFNSLRASLVYKF 262 (378) T ss_pred CCCEEEEECCCCCCCCCCCHHE---EHHHHHHHH---HHHHHHHCCCCCHHCCCCCCCCCCCCCCCCCCCCEECCEEHHH T ss_conf 9830677011245677771210---024679999---9998642077410014545676554235787100003300145 Q ss_pred HHHHHHCCC--------------CCCCC---C--CCCCCCCCCCCCCCCCCCCEEEEEECCCHHHC-CCCCCEEEEEECC Q ss_conf 898752255--------------75311---1--44567876564668882014799986204310-0247217864113 Q gi|254781003|r 223 DFWKTIDPN--------------DSLIR---F--PKEKDSRCNANKNLRNKIPIDYFVMDQNAYKF-LIQESFSEILYNE 282 (304) Q Consensus 223 ~~~~~~~~~--------------~~~~~---~--~~~~~~~~~~~~~~~~~~~iD~I~~s~~~~~~-~~~~~~~~~~~~~ 282 (304) ........+ ..... + .........+++++.+.+.|||||+......+ ..........+.+ T Consensus 263 l~~~~~q~H~~~~~~~~~lysvg~~~~h~~n~~~~~~~~~~~fTN~t~~~kG~iDYIfy~~~~~~~~~~~l~~ve~e~~~ 342 (378) T COG5239 263 LVTSQIQLHESLNGRDFSLYSVGYKFVHPENLKSDNSKGELGFTNWTPGFKGVIDYIFYHGGLLTRQTGLLGVVEGEYAS 342 (378) T ss_pred HHHHHHHHHHCCCCCCHHHHCCCCCCCCHHHCCCCCCCCCCCCCCCCCCCCCEEEEEEEECCCCEEEECCCCCCCCCHHH T ss_conf 44079886410354432220155455584231557876876501356654542579998468610440455554510233 Q ss_pred C-CCCCCCCCCCCCCCEEEEEEC Q ss_conf 4-554445555763438999849 Q gi|254781003|r 283 D-DIKSRGKRLSDHCPISIDYDF 304 (304) Q Consensus 283 ~-~~~~~~~~~SDH~PV~~~l~f 304 (304) . ...++...+|||.|+.+++.| T Consensus 343 k~~G~pn~~~pSdhipl~~ef~~ 365 (378) T COG5239 343 KVIGLPNMPFPSDHIPLLAEFAS 365 (378) T ss_pred HHCCCCCCCCCCCCCCCHHCCCC T ss_conf 11056789995124400101344 No 12 >KOG3873 consensus Probab=99.86 E-value=1.4e-20 Score=130.56 Aligned_cols=260 Identities=16% Similarity=0.146 Sum_probs=151.9 Q ss_pred CCCCEEEEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEEECCCHHHHHHHHCCCCCEEEEECCCCC Q ss_conf 78826999983014788667666776531018899999999986089889999507857999860333430786214556 Q gi|254781003|r 20 VAQKVRLVSWNINTLSEQEGVSLWKNSVKRTTSDYTLLRQYAKNLDADIVFLQEMGSYNAVAKVFPKNTWCIFYSTERLI 99 (304) Q Consensus 20 ~~~~lrV~twNi~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~DIi~lQEv~~~~~~~~~~~~~~~~~~~~~~~~~ 99 (304) .+-.|||+|.|+.+..- -.+. ...++.++.+.+.+...||+.||||++..+.+.+.....-.+. . T Consensus 5 ~a~~lriltlN~Wgip~--------~Sk~-R~~Rm~~~g~~l~~E~yDiv~LQEvWs~eD~~~L~~~~ss~yP------y 69 (422) T KOG3873 5 LALELRILTLNIWGIPY--------VSKD-RRHRMDAIGDELASEKYDIVSLQEVWSQEDFEYLQSGCSSVYP------Y 69 (422) T ss_pred HHHEEEEEEEECCCCCC--------CCCH-HHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCCCCC------H T ss_conf 32201346763125643--------4301-4788987767776355211428988878789999974364574------5 Q ss_pred CCCCCCCCCCCEEEEEEECCCEEEEEEECCCCCCCCCCCCCCCC---EEEEEEEEECCEEEEEEEECCCCCCCCCCCHHH Q ss_conf 75432234463158998258526765301224676555555653---369999982994999999406788877520000 Q gi|254781003|r 100 NHSKRDSNNDIHTAIAVRKKNVRVLQQSYPLLGAKDSFSRAGNR---RAVELLVEINGKKIWVLDIHLKSFCFLDSLENT 176 (304) Q Consensus 100 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r---~~~~~~~~~~~~~~~v~n~Hl~~~~~~~~~~~~ 176 (304) .+..+....+.+.++.++.++.......+++.+.+....+...- +.-..++..+|+.+.++|+||++....... T Consensus 70 sh~FHSGimGaGL~vfSK~PI~~t~~~~y~lNG~p~~i~rGDWf~GK~Vgl~~l~~~g~~v~~yntHLHAeY~rq~D--- 146 (422) T KOG3873 70 SHYFHSGIMGAGLCVFSKHPILETLFHRYSLNGYPHAIHRGDWFGGKGVGLTVLLVGGRMVNLYNTHLHAEYDRQND--- 146 (422) T ss_pred HHHHHCCCCCCCEEEEECCCHHHHHHHCCCCCCCCCEEEECCCCCCCEEEEEEEEECCEEEEEEEHHCCCCCCCCCC--- T ss_conf 77640245567458765375065542001417861024305633563268999953988855120002562235676--- Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCHHH-------HHHHHHCCCC----CCCCC-------- Q ss_conf 2577899999999999998630368980999802688877662046-------8987522557----53111-------- Q gi|254781003|r 177 YSPSCSLLSQQAQWLKDWITQKKESLVPFVIAGDFNRKINYLGNND-------DFWKTIDPND----SLIRF-------- 237 (304) Q Consensus 177 ~~~~~~~~~~q~~~l~~~~~~~~~~~~~vIi~GDfN~~~~~~~~~~-------~~~~~~~~~~----~~~~~-------- 237 (304) .-...|..|+-.+.++++.....+.-+|++||||..|.+.+..- ..|..+.... ..... T Consensus 147 --~YL~HR~~QAwdlaqfi~~t~q~~~vVI~~GDLN~~P~dl~~~ll~~a~l~daw~~~h~~q~e~~~~r~s~~~~l~~g 224 (422) T KOG3873 147 --EYLCHRVAQAWDLAQFIRATRQNADVVILAGDLNMQPQDLGHKLLLSAGLVDAWTSLHLDQCESDSFRLSEDKELVEG 224 (422) T ss_pred --HHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCEEEEECCCHHHHHHHHCHHHHCCCCCCCCHHHHHHCC T ss_conf --114678898999999999974478689994677888666661244403004567653324312752244524455328 Q ss_pred -CCCCCCCCCCCCCC--C--CCCCEEEEEECCCHHHCCCCCCEEEEEECCCCCCCCCCCCCCCCCEEEEEE Q ss_conf -44567876564668--8--820147999862043100247217864113455444555576343899984 Q gi|254781003|r 238 -PKEKDSRCNANKNL--R--NKIPIDYFVMDQNAYKFLIQESFSEILYNEDDIKSRGKRLSDHCPISIDYD 303 (304) Q Consensus 238 -~~~~~~~~~~~~~~--~--~~~~iD~I~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~SDH~PV~~~l~ 303 (304) +.+...+++..... . ..-||||+|++++. ...+..+ ..+.....|......|||..+.+.|. T Consensus 225 ~tcd~~~N~y~~aqk~~ddp~~~RiDYvl~k~~~-~~~~~a~---~~~t~~rvP~~d~s~SDH~Al~a~L~ 291 (422) T KOG3873 225 NTCDSPLNCYTSAQKREDDPLGKRIDYVLVKPGD-CNAKIAE---VEFTEPRVPGEDCSYSDHEALMATLK 291 (422) T ss_pred CCCCCCCHHHHHHHHCCCCCCCEEEEEEEECCCC-EEEEEEE---EEECCCCCCCCCCCCCCHHHHEEEEE T ss_conf 7656751036678747887543156689973761-3787305---88547878999998420243436898 No 13 >COG2374 Predicted extracellular nuclease [General function prediction only] Probab=99.86 E-value=5.8e-20 Score=127.15 Aligned_cols=275 Identities=15% Similarity=0.094 Sum_probs=151.6 Q ss_pred CCCCEEEEEEECCCCCC---CCCCCCCCCC-------CCCHHHHHHHHHHHHHHHCCCEEEEEECCCH-------HHHHH Q ss_conf 78826999983014788---6676667765-------3101889999999998608988999950785-------79998 Q gi|254781003|r 20 VAQKVRLVSWNINTLSE---QEGVSLWKNS-------VKRTTSDYTLLRQYAKNLDADIVFLQEMGSY-------NAVAK 82 (304) Q Consensus 20 ~~~~lrV~twNi~~~~~---~~~~~~~~~~-------~~~~~~~~~~l~~~i~~~~~DIi~lQEv~~~-------~~~~~ 82 (304) .+..|||+++|+.+++. .......... ....+.+..++...+..+++||++|-|+++. .++.. T Consensus 460 v~G~LkiasfNVlNyf~~~~~~g~~~~~~~d~rga~~~~e~~~Q~~k~v~ai~~ldaDV~GL~Eien~~~~~~s~~ai~~ 539 (798) T COG2374 460 VGGSLKIASFNVLNYFNKLSGDGGGASTFSDCRGANTTEEFARQRAKIVTAILTLDADVLGLMEIENNGYGTDSGDAIAQ 539 (798) T ss_pred CCCEEEEEEEEHHHHHCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCHHHHEEEEEECCCCCCCCHHHHHH T ss_conf 57634431332122420377777775553102476635677787788888886124012102355206777774778999 Q ss_pred HH-------CCC-CCEEEEECCCCCCCCCCCCCCCCEEEEEEECCCEEEEEEECCC----CC--CCCCCCCCCCCEEEEE Q ss_conf 60-------333-4307862145567543223446315899825852676530122----46--7655555565336999 Q gi|254781003|r 83 VF-------PKN-TWCIFYSTERLINHSKRDSNNDIHTAIAVRKKNVRVLQQSYPL----LG--AKDSFSRAGNRRAVEL 148 (304) Q Consensus 83 ~~-------~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~--~~~~~~~~~~r~~~~~ 148 (304) ++ ... .|.+..... ..........+|+.++...+......... .. ......+.....+... T Consensus 540 lV~~lna~~G~~~~y~~v~p~~-----~~~lgtD~I~vg~iY~~~~V~~~g~~~~~~~~a~~~v~~~~~~n~R~~laqtF 614 (798) T COG2374 540 LVNALNAKAGAGTRYAFVAPGR-----NGGLGTDAIRVGFIYKPSAVSLVGKAAIVDNDAFLGVGEIDDNNARPPLAQTF 614 (798) T ss_pred HHHHHHHHCCCCCEEEEEECCC-----CCCCCCCCEEEEEEECCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCHHHHH T ss_conf 9999986327995289984675-----57767772567889615367763653211553335545566544574056665 Q ss_pred EEEECCEEEEEEEECCCCCCCCC----CCHHHHHHHHHHHHHHHHHHHHHHHH--CCCCCCCEEEEECCCCCCCCCCHHH Q ss_conf 99829949999994067888775----20000257789999999999999863--0368980999802688877662046 Q gi|254781003|r 149 LVEINGKKIWVLDIHLKSFCFLD----SLENTYSPSCSLLSQQAQWLKDWITQ--KKESLVPFVIAGDFNRKINYLGNND 222 (304) Q Consensus 149 ~~~~~~~~~~v~n~Hl~~~~~~~----~~~~~~~~~~~~~~~q~~~l~~~~~~--~~~~~~~vIi~GDfN~~~~~~~~~~ 222 (304) .....+..|.++..|++++.... +...........|..|+++|.+|+.. ......+++|+||||+-..+.. T Consensus 615 ~~~~~~ekfvvVvNHfkSKgs~~p~~gd~~dgQg~~~~~R~~~AqaL~~~la~~~~~~~d~~~viLGD~N~y~~edp--- 691 (798) T COG2374 615 QDLSGGEKFVVVVNHFKSKGSDCPVDGDTQDGQGNSNQTRVRAAQALAAFLATNPTGKADADIVILGDFNDYAFEDP--- 691 (798) T ss_pred HHCCCCCEEEEEEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEECCCHHHHCCH--- T ss_conf 32057717999983012668888775656664441046889999999999862856556788799724321310137--- Q ss_pred HHHHHHCCCCCCCCCC-CCCCCCCCCCCCCCCCCCEEEEEECCCHHHCCCCCCEEEEEECCCCCC--------------- Q ss_conf 8987522557531114-456787656466888201479998620431002472178641134554--------------- Q gi|254781003|r 223 DFWKTIDPNDSLIRFP-KEKDSRCNANKNLRNKIPIDYFVMDQNAYKFLIQESFSEILYNEDDIK--------------- 286 (304) Q Consensus 223 ~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~iD~I~~s~~~~~~~~~~~~~~~~~~~~~~~--------------- 286 (304) ...+.......... .......+++........|||+|+|.++......-..+......++.. T Consensus 692 --I~~l~~aGy~~l~~~~~~~~~~YSY~f~G~~gtLDhaLas~sl~~~v~~a~ewHINAdE~~~ldYn~~Fk~q~~~~~~ 769 (798) T COG2374 692 --IQALEGAGYMNLAARFHDAGDRYSYVFNGQSGTLDHALASASLAAQVSGATEWHINADEPDALDYNLEFKGQNVSLYK 769 (798) T ss_pred --HHHHHHCCCHHHHHHCCCCCCCEEEEECCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHCCCCCCCCC T ss_conf --899862483011001057787348998782640766652244552156742653045666232210210356553335 Q ss_pred -CCCCCCCCCCCEEEEEEC Q ss_conf -445555763438999849 Q gi|254781003|r 287 -SRGKRLSDHCPISIDYDF 304 (304) Q Consensus 287 -~~~~~~SDH~PV~~~l~f 304 (304) ..-++.|||-||++.|++ T Consensus 770 ~~~~fR~SDHDPvvvglnL 788 (798) T COG2374 770 TTNPFRASDHDPVVVGLNL 788 (798) T ss_pred CCCCCCCCCCCCEEEEEEE T ss_conf 6775535789976999870 No 14 >PRK05421 hypothetical protein; Provisional Probab=99.85 E-value=5.6e-20 Score=127.21 Aligned_cols=220 Identities=20% Similarity=0.175 Sum_probs=115.6 Q ss_pred CCCCEEEEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEEECCCHHHHHHHHCCCCCEEEEECCCCC Q ss_conf 78826999983014788667666776531018899999999986089889999507857999860333430786214556 Q gi|254781003|r 20 VAQKVRLVSWNINTLSEQEGVSLWKNSVKRTTSDYTLLRQYAKNLDADIVFLQEMGSYNAVAKVFPKNTWCIFYSTERLI 99 (304) Q Consensus 20 ~~~~lrV~twNi~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~DIi~lQEv~~~~~~~~~~~~~~~~~~~~~~~~~ 99 (304) .+++|+|++|||.-. ....|...+..+++ ++|+++|||+.-...+.+.+....|...... T Consensus 30 ~~~~~~llvWNI~Kq-----------~~~~W~~~L~~l~~-----~~dLvlLQEA~~~~~l~~~~~~~~~~~~~a~---- 89 (254) T PRK05421 30 GSERFRLLVWNIYKQ-----------QRAGWLSVLKNLGK-----DADLVLLQEAQTTPELVQFATANYLAADQAP---- 89 (254) T ss_pred CCCCEEEEEEEEECC-----------CCHHHHHHHHHHCC-----CCCEEEEEECCCCHHHHHHHHCCCCHHHHHH---- T ss_conf 876406999994216-----------65307999998606-----6878999613677657998852551788612---- Q ss_pred CCCCCCCCCCCEEEEEEECCCEEEEEEECCCCCCCCCCCCCCCCEEEEEEEE-ECCEEEEEEEECCCCCCCCCCCHHHHH Q ss_conf 7543223446315899825852676530122467655555565336999998-299499999940678887752000025 Q gi|254781003|r 100 NHSKRDSNNDIHTAIAVRKKNVRVLQQSYPLLGAKDSFSRAGNRRAVELLVE-INGKKIWVLDIHLKSFCFLDSLENTYS 178 (304) Q Consensus 100 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~-~~~~~~~v~n~Hl~~~~~~~~~~~~~~ 178 (304) .+......+|++..++..-........ .+.+-..++.++..... .+++.+.|+|+|.-+.+.+- T Consensus 90 ----af~~~~~~~GVmtlS~~~p~~~c~~~~----~EP~lr~pKs~Lit~YpL~~~q~LlVVNiHaINF~~g~------- 154 (254) T PRK05421 90 ----AFVLPQHPSGVMTLSKAHPVYCCPLRE----REPWLRLPKSALITEYPLPNGQTLLVVNIHAINFSVGV------- 154 (254) T ss_pred ----HHHCCCCCCEEEECCCCCHHHHCCCCC----CCCCCCCCHHHHEEEECCCCCCEEEEEEEEEEEEECCH------- T ss_conf ----220479863478754111254423566----67220363123047720589977999994067665156------- Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEE Q ss_conf 77899999999999998630368980999802688877662046898752255753111445678765646688820147 Q gi|254781003|r 179 PSCSLLSQQAQWLKDWITQKKESLVPFVIAGDFNRKINYLGNNDDFWKTIDPNDSLIRFPKEKDSRCNANKNLRNKIPID 258 (304) Q Consensus 179 ~~~~~~~~q~~~l~~~~~~~~~~~~~vIi~GDfN~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iD 258 (304) .....|++.+.+.+. ....|+|++||||+ ....+......+.....+.-..-. .. .+.. .+...+| T Consensus 155 ---~~y~~QL~~l~~~i~---~H~GPvI~AGDFNt---Ws~~R~~~l~~~~~~~gL~ev~f~--~d--~R~~-~~g~pLD 220 (254) T PRK05421 155 ---DVYSKQLEPIGDQIA---HHSGPIILAGDFNT---WSRKRMNALKRFARELGLKEVRFT--DD--QRRK-AFGRPLD 220 (254) T ss_pred ---HHHHHHHHHHHHHHH---HCCCCEEEECCCCC---CCHHHHHHHHHHHHHCCCCEEECC--CC--CCEE-CCCCCCC T ss_conf ---999999999999998---48998798445330---139999999999997698087537--67--7345-2589865 Q ss_pred EEEECCCHHHCCCCCCEEEEEECCCCCCCCCCCCCCCCCEEEEEE Q ss_conf 999862043100247217864113455444555576343899984 Q gi|254781003|r 259 YFVMDQNAYKFLIQESFSEILYNEDDIKSRGKRLSDHCPISIDYD 303 (304) Q Consensus 259 ~I~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~SDH~PV~~~l~ 303 (304) |||+.+=. ..... .+ ....|||.|+.++|. T Consensus 221 hifyRgl~----------~~~a~---~~--~t~~SDHnPl~~~f~ 250 (254) T PRK05421 221 HVFYRGLN----------VEKAS---VL--VTRASDHNPLLVEFS 250 (254) T ss_pred EEEECCCE----------EEEEE---CC--CCCCCCCCCEEEEEE T ss_conf 69987852----------33105---56--567787773889999 No 15 >KOG0620 consensus Probab=99.77 E-value=2e-18 Score=118.73 Aligned_cols=282 Identities=15% Similarity=0.108 Sum_probs=132.7 Q ss_pred CCCCCCEEEEEEECCC--CCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEEECCCH-HHHHHHHCCCCCEEEEE Q ss_conf 7678826999983014--78866766677653101889999999998608988999950785-79998603334307862 Q gi|254781003|r 18 ASVAQKVRLVSWNINT--LSEQEGVSLWKNSVKRTTSDYTLLRQYAKNLDADIVFLQEMGSY-NAVAKVFPKNTWCIFYS 94 (304) Q Consensus 18 ~~~~~~lrV~twNi~~--~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~DIi~lQEv~~~-~~~~~~~~~~~~~~~~~ 94 (304) .++-.+++|+|||+-. ..........+.+..+++.+...+.+.+...+||++|||||+.. ......+...+|...+. T Consensus 14 ~~~~~~~~v~s~n~l~~~~~~~~~~~~~~~~~~~w~~r~~~~~~ei~~~~ad~icLqev~~~~~~~~p~l~~~gY~g~~~ 93 (361) T KOG0620 14 VPQGETFTVLSYNILSQLYATSKLYMYCPSWALSWEYRRQLLLEEILNYNADILCLQEVDRYHDFFSPELEASGYSGIFI 93 (361) T ss_pred CCCCCEEEECCCCHHHHHHCCCCCCCCCCHHHCCHHHHHHHHHHHHHCCCCCEEECCHHHHHHHHCCCHHHHCCCCCEEE T ss_conf 66773354211111016550467775351443167888899999874778551315126679877564154358741242 Q ss_pred CCCCCCCCCCCCCCCCEEEEEEECCCEEEEEEECC-C--CCC-----C-CCCCCCCC--CEEE-EEEEE-ECCEEEE--- Q ss_conf 14556754322344631589982585267653012-2--467-----6-55555565--3369-99998-2994999--- Q gi|254781003|r 95 TERLINHSKRDSNNDIHTAIAVRKKNVRVLQQSYP-L--LGA-----K-DSFSRAGN--RRAV-ELLVE-INGKKIW--- 158 (304) Q Consensus 95 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~--~~~-----~-~~~~~~~~--r~~~-~~~~~-~~~~~~~--- 158 (304) .... ..........++++..+.+...+..+... + ... . ....+... ..+. .+.+. ..+..+. T Consensus 94 ~k~~--~~~~~~~~~dGcaiffk~s~f~li~~~~i~~n~~~~~~~~~~~~~~~~~~~~d~~~~~l~~L~~~~~~~~~~~~ 171 (361) T KOG0620 94 EKTR--MGEVELEKIDGCAIFFKPSLFQLIHSSAIELNQLALSRLPPNIAQLNRLTTLDNSGNKLVSLKAELGNMVSLPH 171 (361) T ss_pred CCCC--CCHHHCCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCCCCCCHHHHHHHHCCCCEEECC T ss_conf 1465--21110366763666644158766565556667778742344289998662113201122301522278413110 Q ss_pred -EEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHH-HHHH-------HCCCCCCCEEEEECCCCCCCCCCHHH------- Q ss_conf -9994067888775200002577899999999999-9986-------30368980999802688877662046------- Q gi|254781003|r 159 -VLDIHLKSFCFLDSLENTYSPSCSLLSQQAQWLK-DWIT-------QKKESLVPFVIAGDFNRKINYLGNND------- 222 (304) Q Consensus 159 -v~n~Hl~~~~~~~~~~~~~~~~~~~~~~q~~~l~-~~~~-------~~~~~~~~vIi~GDfN~~~~~~~~~~------- 222 (304) ++|+|+.......... ..+.. .+..+.+.+. +.+. .......|+++|||||+.+..+.+.- T Consensus 172 ~~~n~h~~~~~~~~~vk--l~qv~-~l~~~~~~~~~~~~~~~~~~~~~~~~~~~p~l~~gdfNs~p~~~v~~~~~~~~~~ 248 (361) T KOG0620 172 LLLNTHILWDLPYEDVK--LFQVL-TLLLELAPLSPEILKIYREGNLATRIASFPLLLCGDFNSTPLSPVYALLASGILS 248 (361) T ss_pred CCCCCEECCCCCCCCHH--HHHHH-HHHHCCCCCCHHHHHHHCCCCCCCCCCCCCEEEECCCCCCCCCCCEEECCCCCCC T ss_conf 00243121477765017--99999-9762023234555001103553233423432212144678886501331467889 Q ss_pred ---HHHHHHCCCCCCC----------------CCCCCCCCCCCCCCCCCCCCCEEEEEECCCHHHCCCCCCEEE--EEEC Q ss_conf ---8987522557531----------------114456787656466888201479998620431002472178--6411 Q gi|254781003|r 223 ---DFWKTIDPNDSLI----------------RFPKEKDSRCNANKNLRNKIPIDYFVMDQNAYKFLIQESFSE--ILYN 281 (304) Q Consensus 223 ---~~~~~~~~~~~~~----------------~~~~~~~~~~~~~~~~~~~~~iD~I~~s~~~~~~~~~~~~~~--~~~~ 281 (304) ..+..+....... ...........+...+++.+++||||++++.......-.... .... T Consensus 249 ~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ftn~t~~F~g~~DYIfys~~~~~~~~~L~~~~e~~~~~ 328 (361) T KOG0620 249 PEDADNKDLPLMSALGSKVNKSYEEMSHDQRRKLEYTTGEPRFTNYTPGFKGTLDYIFYSKAALIVESALELLDEDELVI 328 (361) T ss_pred CCHHHHHHCCCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCCCCCCCCCCCCHHHHHC T ss_conf 53666764444440001232320004511321544556765532465775551568997165330021356777466423 Q ss_pred CCCCCCCCCCCCCCCCEEEEEEC Q ss_conf 34554445555763438999849 Q gi|254781003|r 282 EDDIKSRGKRLSDHCPISIDYDF 304 (304) Q Consensus 282 ~~~~~~~~~~~SDH~PV~~~l~f 304 (304) .....++-.++|||.|+.++|++ T Consensus 329 ~~~~lPs~~~pSDHi~L~~ef~~ 351 (361) T KOG0620 329 KVTGLPSPHHPSDHIPLLAEFEI 351 (361) T ss_pred CCCCCCCCCCCCCCCHHHCCCCC T ss_conf 54448888787753023212444 No 16 >KOG2756 consensus Probab=99.74 E-value=1.1e-16 Score=109.25 Aligned_cols=242 Identities=19% Similarity=0.236 Sum_probs=124.5 Q ss_pred CCEEEEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEEECCCH--HHHHHHHCCCCCEEEEECCCCC Q ss_conf 82699998301478866766677653101889999999998608988999950785--7999860333430786214556 Q gi|254781003|r 22 QKVRLVSWNINTLSEQEGVSLWKNSVKRTTSDYTLLRQYAKNLDADIVFLQEMGSY--NAVAKVFPKNTWCIFYSTERLI 99 (304) Q Consensus 22 ~~lrV~twNi~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~DIi~lQEv~~~--~~~~~~~~~~~~~~~~~~~~~~ 99 (304) ..++++||||+++.-. ....++.+++.+++-++|||||||||-.. +.+.+. ...|..+.+.... T Consensus 98 ~~~S~~~WnidgLdln-----------~l~~RMrAv~H~i~l~sPdiiflQEV~p~~y~~~~K~--~s~y~i~~~~~~~- 163 (349) T KOG2756 98 SMFSLITWNIDGLDLN-----------NLSERMRAVCHYLALYSPDVIFLQEVIPPYYSYLKKR--SSNYEIITGHEEG- 163 (349) T ss_pred CEEEEEEEECCCCCCC-----------HHHHHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHH--HHHEEEEEECCCE- T ss_conf 3788898611565443-----------1899999999999852987798753373566789876--5430689951451- Q ss_pred CCCCCCCCCCCEEEEEEECCCEEEEEEECCCCCCCCCCCCCCCCEEEEEEEEECCEEEEEEEECCCCCCCCCCCHHHHHH Q ss_conf 75432234463158998258526765301224676555555653369999982994999999406788877520000257 Q gi|254781003|r 100 NHSKRDSNNDIHTAIAVRKKNVRVLQQSYPLLGAKDSFSRAGNRRAVELLVEINGKKIWVLDIHLKSFCFLDSLENTYSP 179 (304) Q Consensus 100 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~v~n~Hl~~~~~~~~~~~~~~~ 179 (304) .+.......+...+...+. ..-.+....|....++++.+|..+.+.++||.+.....- ++. T Consensus 164 ----------~~~~~~l~~s~~~Vks~~~-----i~F~NS~M~R~L~I~Ev~v~G~Kl~l~tsHLEStr~h~P---~r~- 224 (349) T KOG2756 164 ----------YFTAIMLKKSRVKVKSQEI-----IPFPNSKMMRNLLIVEVNVSGNKLCLMTSHLESTRGHAP---ERM- 224 (349) T ss_pred ----------EEEEEEEEHHHCCCCCCCE-----EECCCCHHHHEEEEEEEEECCEEEEEEECCCCCCCCCCC---HHH- T ss_conf ----------5652211024428520121-----105752121046799985177069998413357889981---788- Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCC------CCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 7899999999999998630368980999802688877------6620468987522557531114456787656466888 Q gi|254781003|r 180 SCSLLSQQAQWLKDWITQKKESLVPFVIAGDFNRKIN------YLGNNDDFWKTIDPNDSLIRFPKEKDSRCNANKNLRN 253 (304) Q Consensus 180 ~~~~~~~q~~~l~~~~~~~~~~~~~vIi~GDfN~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 253 (304) ......++..++.+... ++..||.+||+|-.-. .+..-...|+.+....... .+.++..+.+-.-...- T Consensus 225 --~qF~~~~~k~~EaIe~l--PnA~ViFGGD~NlrD~ev~r~~lPD~~vDvWE~lg~p~~~~-FTwDT~~N~nl~G~~a~ 299 (349) T KOG2756 225 --NQFKMVLKKMQEAIESL--PNATVIFGGDTNLRDREVTRCGLPDNIVDVWEFLGKPKHCQ-FTWDTQMNSNLGGTAAC 299 (349) T ss_pred --HHHHHHHHHHHHHHHHC--CCCEEEECCCCCCHHHHCCCCCCCCHHHHHHHHHCCCCCCC-EEEECCCCCCCCCHHHH T ss_conf --99999999999999859--99769965866401333136789913789999818987674-14443448643214888 Q ss_pred CCCEEEEEECCCHHHC-CCCCCEEEEEECCCCCCCCCCCCCCCCCEEEEEE Q ss_conf 2014799986204310-0247217864113455444555576343899984 Q gi|254781003|r 254 KIPIDYFVMDQNAYKF-LIQESFSEILYNEDDIKSRGKRLSDHCPISIDYD 303 (304) Q Consensus 254 ~~~iD~I~~s~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~SDH~PV~~~l~ 303 (304) .-|+|.|++.-+.... ....++..... .....-...||||..+.++|+ T Consensus 300 k~RfDRi~~r~~~~~G~~~~~~l~l~g~--~kiRgc~~fPSDHwgll~Tl~ 348 (349) T KOG2756 300 KLRFDRIFFRAAAEEGHIIPRSLDLLGL--EKLRGCGRFPSDHWGLLCTLD 348 (349) T ss_pred HHHHHHHHEECHHHCCCCCCCCCCHHHH--HHHCCCCCCCCCCCEEEEEEC T ss_conf 8877767300012218856544452425--664578867854301466615 No 17 >smart00128 IPPc Inositol polyphosphate phosphatase, catalytic domain homologues. Mg(2+)-dependent/Li(+)-sensitive enzymes. Probab=99.71 E-value=2.9e-15 Score=101.43 Aligned_cols=146 Identities=14% Similarity=0.105 Sum_probs=77.4 Q ss_pred CCCCCEEEEEEEEECCEEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHH------CCCCCCCEEEEECCC Q ss_conf 55653369999982994999999406788877520000257789999999999999863------036898099980268 Q gi|254781003|r 139 RAGNRRAVELLVEINGKKIWVLDIHLKSFCFLDSLENTYSPSCSLLSQQAQWLKDWITQ------KKESLVPFVIAGDFN 212 (304) Q Consensus 139 ~~~~r~~~~~~~~~~~~~~~v~n~Hl~~~~~~~~~~~~~~~~~~~~~~q~~~l~~~~~~------~~~~~~~vIi~GDfN 212 (304) ..+..+++.+.+...+..+.++|+||.++... ...|..+.+.+.+.+.- .......++++|||| T Consensus 122 ~~gnKGaV~vr~~i~~t~~~Fvn~HL~a~~~~----------~~~Rn~~~~~I~~~l~f~~~~~~~~~~~d~vf~~GDlN 191 (310) T smart00128 122 LWGNKGAVAVRFKLSDTSFCFVNSHLAAGASN----------VEQRNQDYKTILRALSFPERAELSQFDHDVVFWFGDLN 191 (310) T ss_pred CCCCCCCEEEEEEECCEEEEEEEECCCCCCCH----------HHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEECCCC T ss_conf 22557518999999786888876237777430----------77778899999986057876666666775059834763 Q ss_pred CCCCCCCHHH-----------------HHHHHHCC--------CCCCCCCCCCCCC--CCCCCC---CCCCCCCEEEEEE Q ss_conf 8877662046-----------------89875225--------5753111445678--765646---6888201479998 Q gi|254781003|r 213 RKINYLGNND-----------------DFWKTIDP--------NDSLIRFPKEKDS--RCNANK---NLRNKIPIDYFVM 262 (304) Q Consensus 213 ~~~~~~~~~~-----------------~~~~~~~~--------~~~~~~~~~~~~~--~~~~~~---~~~~~~~iD~I~~ 262 (304) ...+...... .+...... +.+..+.+..+.. ++..+. ..+-|+.-|.|++ T Consensus 192 yRi~~~~~~~v~~~i~~~~~~~L~~~DQL~~~~~~~~~f~~f~E~~I~F~PTYK~d~~~t~~Ydts~k~R~PsWcDRIL~ 271 (310) T smart00128 192 FRLDSPSYEEVRRKISKKEFDDLLEKDQLNRQKEAGKVFKGFQEGPITFPPTYKYDSVGTETYDTSEKKRVPAWCDRILY 271 (310) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCCCCEEEE T ss_conf 10168977999999976129988677889998753864587287886889980467788764377777055677765865 Q ss_pred CCCHHHCCCCCCEEEEEECCCCCCCCCCCCCCCCCEEEEEE Q ss_conf 62043100247217864113455444555576343899984 Q gi|254781003|r 263 DQNAYKFLIQESFSEILYNEDDIKSRGKRLSDHCPISIDYD 303 (304) Q Consensus 263 s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~SDH~PV~~~l~ 303 (304) ...... .+....|.. ......|||.||.+.|+ T Consensus 272 ~~~~~~-----~~~~~~Y~s----~~~~~~SDHkPV~a~f~ 303 (310) T smart00128 272 RSNGPN-----LIQLSEYHS----GMELTTSDHKPVFATFR 303 (310) T ss_pred CCCCCC-----CEEEEEEEC----CCCCCCCCCCCEEEEEE T ss_conf 268887-----146764222----58766778514788999 No 18 >COG3021 Uncharacterized protein conserved in bacteria [Function unknown] Probab=99.62 E-value=1e-14 Score=98.46 Aligned_cols=201 Identities=17% Similarity=0.088 Sum_probs=92.6 Q ss_pred HHHHHHHHHCCCEEEEEECCCHHHHHHHHCCCCCEEEEECCCCCCCCCCCCCCCCEEEEEEECCCEEEEEEECCCCCCCC Q ss_conf 99999986089889999507857999860333430786214556754322344631589982585267653012246765 Q gi|254781003|r 56 LLRQYAKNLDADIVFLQEMGSYNAVAKVFPKNTWCIFYSTERLINHSKRDSNNDIHTAIAVRKKNVRVLQQSYPLLGAKD 135 (304) Q Consensus 56 ~l~~~i~~~~~DIi~lQEv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 135 (304) ++...+.+.++|++-+||+.......-.....+|..+...... ...++.+..++.+.. ..... T Consensus 105 k~Lsl~~~~~~D~v~~~E~~~~~~~~~~~l~~~yP~~~~~~~~----------~~~~~~a~~sr~~~~-------~~~~~ 167 (309) T COG3021 105 KLLSLIQQLDADAVTTPEGVQLWTAKVGALAAQYPAFILCQHP----------TGVFTLAILSRRPCC-------PLTEA 167 (309) T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCEEECCCC----------CCEEEEEECCCCCCC-------CCCCC T ss_conf 9998876647532221888877676788998749742406899----------872211000346555-------52235 Q ss_pred CCCCCCCCEEEE-EEEEECCEEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCC Q ss_conf 555556533699-9998299499999940678887752000025778999999999999986303689809998026888 Q gi|254781003|r 136 SFSRAGNRRAVE-LLVEINGKKIWVLDIHLKSFCFLDSLENTYSPSCSLLSQQAQWLKDWITQKKESLVPFVIAGDFNRK 214 (304) Q Consensus 136 ~~~~~~~r~~~~-~~~~~~~~~~~v~n~Hl~~~~~~~~~~~~~~~~~~~~~~q~~~l~~~~~~~~~~~~~vIi~GDfN~~ 214 (304) ...-...+.+.. ......|+.+.+++.|........+.. + .|...+.+.+ .....|+|++||||++ T Consensus 168 e~~~~~pk~~~~t~~~~~~g~~l~v~~lh~~~~~~~~~~~---------~-~ql~~l~~~i---~~~~gpvIlaGDfNa~ 234 (309) T COG3021 168 EPWLRLPKSALATAYPLPDGTELTVVALHAVNFPVGTDPQ---------R-AQLLELGDQI---AGHSGPVILAGDFNAP 234 (309) T ss_pred CCCCCCCCCCEEEEEECCCCCEEEEEEECCCCCCCCCCHH---------H-HHHHHHHHHH---HCCCCCEEEEECCCCC T ss_conf 7640177651268998689988999862254566775188---------9-9999999999---7178875886137885 Q ss_pred CCCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCC-CCCCCEEEEEECCCHHHCCCCCCEEEEEECCCCCCCCCCCCC Q ss_conf 7766204689875225575311144567876564668-882014799986204310024721786411345544455557 Q gi|254781003|r 215 INYLGNNDDFWKTIDPNDSLIRFPKEKDSRCNANKNL-RNKIPIDYFVMDQNAYKFLIQESFSEILYNEDDIKSRGKRLS 293 (304) Q Consensus 215 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~iD~I~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~S 293 (304) |=+...+. ...+.......... .....+.+.... .+..+|||||++.-... +. ..... ..| T Consensus 235 pWS~~~~R--~~~l~~~~~~~~aG-~~~~~~~p~~~~r~~g~PIDhvf~rgl~~~--ka-----------~rl~~--~gS 296 (309) T COG3021 235 PWSRTAKR--MAALGGLRAAPRAG-LWEVRFTPDERRRAFGLPIDHVFYRGLTVM--KA-----------RRLPD--RGS 296 (309) T ss_pred CHHHHHHH--HHHHCCCCCCHHCC-CCCCCCCHHHHHHCCCCCCCEEEECCCCHH--HH-----------HHCCC--CCC T ss_conf 05577899--99861643222205-776642777874125898521233586044--33-----------20545--577 Q ss_pred CCCCEEEEEEC Q ss_conf 63438999849 Q gi|254781003|r 294 DHCPISIDYDF 304 (304) Q Consensus 294 DH~PV~~~l~f 304 (304) ||+|+.+||.. T Consensus 297 DH~PLLveF~~ 307 (309) T COG3021 297 DHRPLLVEFSY 307 (309) T ss_pred CCCCEEEEEEE T ss_conf 77716899885 No 19 >KOG0566 consensus Probab=99.60 E-value=1.2e-14 Score=98.15 Aligned_cols=146 Identities=16% Similarity=0.128 Sum_probs=77.6 Q ss_pred CCCCCCCCCEEEEEEEEECCEEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH----HCCCCCCCEEEEEC Q ss_conf 55555565336999998299499999940678887752000025778999999999999986----30368980999802 Q gi|254781003|r 135 DSFSRAGNRRAVELLVEINGKKIWVLDIHLKSFCFLDSLENTYSPSCSLLSQQAQWLKDWIT----QKKESLVPFVIAGD 210 (304) Q Consensus 135 ~~~~~~~~r~~~~~~~~~~~~~~~v~n~Hl~~~~~~~~~~~~~~~~~~~~~~q~~~l~~~~~----~~~~~~~~vIi~GD 210 (304) ....-.+..+++.+++...+..+++++.|+.+|...-+ +|+.-.+.+.+-+. .....-..|++||| T Consensus 653 GfGG~tgNKGAVAIrf~~~~TsfCFv~SHlAAG~snv~----------ERn~DY~tI~r~l~Fp~Gr~I~~HD~ifW~GD 722 (1080) T KOG0566 653 GFGGATGNKGAVAIRFVYHATSFCFVCSHLAAGQSNVE----------ERNEDYKTIARKLRFPRGRMIFSHDYIFWLGD 722 (1080) T ss_pred CCCCCCCCCCEEEEEEEECCCCEEEEECCCCCCCCHHH----------HHHHHHHHHHHHCCCCCCCCCCCCCEEEEECC T ss_conf 55663577751899999646527998500015544076----------64204999987560657751357762898335 Q ss_pred CCCCCCCCCHH----------------HHHHHHHCCCC--------CCCCCCCCCCCCCC----CCCCCCCCCCEEEEEE Q ss_conf 68887766204----------------68987522557--------53111445678765----6466888201479998 Q gi|254781003|r 211 FNRKINYLGNN----------------DDFWKTIDPND--------SLIRFPKEKDSRCN----ANKNLRNKIPIDYFVM 262 (304) Q Consensus 211 fN~~~~~~~~~----------------~~~~~~~~~~~--------~~~~~~~~~~~~~~----~~~~~~~~~~iD~I~~ 262 (304) ||...+...+. |.+-....... ...+.+..+..... +.-..+-|++-|.|++ T Consensus 723 FNYRI~l~nEEVr~~v~~~d~~kL~e~DQL~~q~~~G~vF~gF~E~~ltF~PTYKyD~gTd~YDTSeK~R~PAWTDRIL~ 802 (1080) T KOG0566 723 FNYRIDLSNEEVRRLVRNQDLDKLLEYDQLTQQMNAGQVFPGFHEGQLTFPPTYKYDPGTDDYDTSEKCRTPAWTDRILW 802 (1080) T ss_pred CCEEECCCHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCHHHHEE T ss_conf 44043388799999997221988866778888885376146643452366875013689876564011047630455222 Q ss_pred CCCHHHCCCCCCEEEEEECCCCCCCCCCCCCCCCCEEEEEE Q ss_conf 62043100247217864113455444555576343899984 Q gi|254781003|r 263 DQNAYKFLIQESFSEILYNEDDIKSRGKRLSDHCPISIDYD 303 (304) Q Consensus 263 s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~SDH~PV~~~l~ 303 (304) ...... ...|... ....|||+||.+.++ T Consensus 803 r~e~~~--------~l~Y~~~-----el~~SDHRPV~A~~~ 830 (1080) T KOG0566 803 RGEKLE--------LLSYKRA-----ELKTSDHRPVYAIFR 830 (1080) T ss_pred CCCCCC--------CCCCCCC-----CCCCCCCCCEEEEEE T ss_conf 453234--------4302555-----302147873579999 No 20 >PTZ00297 pantothenate kinase; Provisional Probab=99.58 E-value=7.5e-14 Score=93.74 Aligned_cols=265 Identities=16% Similarity=0.165 Sum_probs=119.7 Q ss_pred CCCCCCCCEEEEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHC-CCEEEEEECCCHHHHHH-----------H Q ss_conf 58767882699998301478866766677653101889999999998608-98899995078579998-----------6 Q gi|254781003|r 16 CTASVAQKVRLVSWNINTLSEQEGVSLWKNSVKRTTSDYTLLRQYAKNLD-ADIVFLQEMGSYNAVAK-----------V 83 (304) Q Consensus 16 ~~~~~~~~lrV~twNi~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~-~DIi~lQEv~~~~~~~~-----------~ 83 (304) .+....-.+||+|||.+-++...++. .-++|...++..| .||+.||||...+.+.. + T Consensus 3 ~~~ag~aq~RvLSYNFNILPRG~GGf-----------Q~ERI~tFL~~vd~YDVvllQEVYAaS~lPY~lQk~lCfQKmL 71 (1452) T PTZ00297 3 ASDAGCAQARVLSYNFNILPRGCGGF-----------QHERIETFLASVDAYDVVLLQEVYAASVLPYFLQKQLCFQKML 71 (1452) T ss_pred CCCCCHHHHHHHHCCCCCCCCCCCCC-----------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 75532044411120354244777751-----------3878999996443455457699878644079999889999998 Q ss_pred ---HCCCCCE-EEEECCCCCCCCC-CCCCCCCEEEEEEECCCEEEEEEECCCCCCCCCCCCCCCCEEEEEEEEE-----C Q ss_conf ---0333430-7862145567543-2234463158998258526765301224676555555653369999982-----9 Q gi|254781003|r 84 ---FPKNTWC-IFYSTERLINHSK-RDSNNDIHTAIAVRKKNVRVLQQSYPLLGAKDSFSRAGNRRAVELLVEI-----N 153 (304) Q Consensus 84 ---~~~~~~~-~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~-----~ 153 (304) +...++. +..+..+...... ..... .+|+.+-++.++.-...+.+. ........-.|+++.++++. + T Consensus 72 vD~Lk~rGf~hYvISkQPSY~TmlR~NV~S--DNGLIIASRFPi~qrGSYTFr-~heRg~qsVrrGCLFAEV~VPl~~GG 148 (1452) T PTZ00297 72 VDELKARGFHHYVISKQPSYLTMLRYNVCS--DNGLIIASRFPIWQRGSYTFR-NHERGEQSVRRGCLFAEVEVPLAEGG 148 (1452) T ss_pred HHHHHHCCCEEEEEECCCCHHHHHHHCCCC--CCCEEEEECCCCCCCCCEEEC-CCHHHHHHHHCCCEEEEEEEECCCCC T ss_conf 989986676578974588526676541236--786699732643234633511-53223455523616899861202489 Q ss_pred CEEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCC--CCCCCEEEEECCCCCCCCCCH-------HHHH Q ss_conf 9499999940678887752000025778999999999999986303--689809998026888776620-------4689 Q gi|254781003|r 154 GKKIWVLDIHLKSFCFLDSLENTYSPSCSLLSQQAQWLKDWITQKK--ESLVPFVIAGDFNRKINYLGN-------NDDF 224 (304) Q Consensus 154 ~~~~~v~n~Hl~~~~~~~~~~~~~~~~~~~~~~q~~~l~~~~~~~~--~~~~~vIi~GDfN~~~~~~~~-------~~~~ 224 (304) .+++.++|+||......+.. . ...++-++.++....-+.++. ....|+|++||||...-++-. ...+ T Consensus 149 s~~IvFFNVHLR~eds~~~t---s-sqv~etrrFv~svi~n~~~~~~d~a~iPfviAGDFninGIdp~n~G~ptK~f~el 224 (1452) T PTZ00297 149 SQRIVFFNVHLRQEDSLPST---S-SQVQETRRFVESVIANVYEQNNDGAEIPFVIAGDFNINGIDPHNGGHPTKRFQEL 224 (1452) T ss_pred CEEEEEEEEEECCCCCCCCH---H-HHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEECCCCCCCCCCCCCCCCCHHHHHH T ss_conf 42489998762343478325---8-9999999999999999873778753154799635455675866689862899999 Q ss_pred HHHHCC-CC---CCCCCCCCCCCCCCCC-----------CCCCCCCCEEEEEECCCHHHCCCCCCEEEEEECCCCCCCCC Q ss_conf 875225-57---5311144567876564-----------66888201479998620431002472178641134554445 Q gi|254781003|r 225 WKTIDP-ND---SLIRFPKEKDSRCNAN-----------KNLRNKIPIDYFVMDQNAYKFLIQESFSEILYNEDDIKSRG 289 (304) Q Consensus 225 ~~~~~~-~~---~~~~~~~~~~~~~~~~-----------~~~~~~~~iD~I~~s~~~~~~~~~~~~~~~~~~~~~~~~~~ 289 (304) .+.+.+ .. ...........++.+. .+...++|-||.|+++....+.- ....|......++ T Consensus 225 l~elQ~lGsgvrevi~d~~g~hPpTRP~~LFFP~~SKL~R~s~tPQRQDyFfvt~~v~V~~p----~i~Kfv~ssr~PY- 299 (1452) T PTZ00297 225 LNELQDLGSGVREVIYDETGQHPPTRPPILFFPEQSKLERYSSTPQRQDYFFVTPCVQVEKP----RIEKFVVSSRRPY- 299 (1452) T ss_pred HHHHHHHCCCHHHHHHHHCCCCCCCCCCEEECCCHHHHHHCCCCCCCCCEEEEEEEEEECCC----CHHHHHHCCCCCC- T ss_conf 99887641006655154217898999976455425555304588342433875200110364----2676631367874- Q ss_pred CCCCCCCCEEEEEE Q ss_conf 55576343899984 Q gi|254781003|r 290 KRLSDHCPISIDYD 303 (304) Q Consensus 290 ~~~SDH~PV~~~l~ 303 (304) -++|||+.|.+.+. T Consensus 300 tYLSDHFGvs~~l~ 313 (1452) T PTZ00297 300 TYLSDHFGVSARLT 313 (1452) T ss_pred EEECCCCCCEEEEE T ss_conf 44201257148999 No 21 >COG5411 Phosphatidylinositol 5-phosphate phosphatase [Signal transduction mechanisms] Probab=98.62 E-value=2.2e-07 Score=58.41 Aligned_cols=65 Identities=12% Similarity=0.032 Sum_probs=38.6 Q ss_pred CEEEEEEEEECCEEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHH----CCCCCCCEEEEECCCCCCCC Q ss_conf 3369999982994999999406788877520000257789999999999999863----03689809998026888776 Q gi|254781003|r 143 RRAVELLVEINGKKIWVLDIHLKSFCFLDSLENTYSPSCSLLSQQAQWLKDWITQ----KKESLVPFVIAGDFNRKINY 217 (304) Q Consensus 143 r~~~~~~~~~~~~~~~v~n~Hl~~~~~~~~~~~~~~~~~~~~~~q~~~l~~~~~~----~~~~~~~vIi~GDfN~~~~~ 217 (304) .+++.+++......+.++|.|+.++...- .+|....+.+..-+.- .......++++||||...+. T Consensus 153 KGav~i~~~~~~t~~cFv~shlaag~~N~----------eeR~~Dy~~I~~~i~f~~g~~I~~hdti~w~GDlNyRVts 221 (460) T COG5411 153 KGAVAIRFNYERTSFCFVNSHLAAGVNNI----------EERIFDYRSIASNICFSRGLRIYDHDTIFWLGDLNYRVTS 221 (460) T ss_pred CCCCCEEEEEECCCCEEEECCHHCCCCCH----------HHHHHHHHHHHHHEECCCCCEECCCCEEEEECCCCCEEEC T ss_conf 55321268752277289963010024668----------8877788878876005777452365239995156744525 No 22 >KOG1294 consensus Probab=98.16 E-value=0.00017 Score=42.64 Aligned_cols=138 Identities=15% Similarity=0.138 Sum_probs=64.9 Q ss_pred ECCEEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCC---CCCCCEEEEECCCCCCCCCC--------- Q ss_conf 299499999940678887752000025778999999999999986303---68980999802688877662--------- Q gi|254781003|r 152 INGKKIWVLDIHLKSFCFLDSLENTYSPSCSLLSQQAQWLKDWITQKK---ESLVPFVIAGDFNRKINYLG--------- 219 (304) Q Consensus 152 ~~~~~~~v~n~Hl~~~~~~~~~~~~~~~~~~~~~~q~~~l~~~~~~~~---~~~~~vIi~GDfN~~~~~~~--------- 219 (304) .....+.+++.|.+............. +..-...+...+.+.. ....+++.+||.|..+.... T Consensus 164 ~e~e~~~l~~~y~p~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~k~~~~~~v~~gd~nvs~~~i~~~~~~~~~~ 238 (335) T KOG1294 164 AEFEIFILINTYVPNIGGGLVNLVYRI-----LDRWDKEIEEKRKKQSSSKNLKAPVVICGDLNVSHEEIDPSKPLVSPA 238 (335) T ss_pred EEECCEEECCCCCCCCCCCCCHHHHHH-----HHHHHHHHHHHHHHCCCCCCCCCCCEECCCCCCCHHHCCCCCCCCCCC T ss_conf 863120311433753343200024444-----446677777776403644444576134155654422216665511445 Q ss_pred ------------HHHHHHHHHCCCCCCCCC-----CCCC-CCCCCCCC----CCCCCCCEEEEEECCCHHHCCCCCCEEE Q ss_conf ------------046898752255753111-----4456-78765646----6888201479998620431002472178 Q gi|254781003|r 220 ------------NNDDFWKTIDPNDSLIRF-----PKEK-DSRCNANK----NLRNKIPIDYFVMDQNAYKFLIQESFSE 277 (304) Q Consensus 220 ------------~~~~~~~~~~~~~~~~~~-----~~~~-~~~~~~~~----~~~~~~~iD~I~~s~~~~~~~~~~~~~~ 277 (304) .+..+...+....++... .... ...+|.+. ......++||+++++.........-+.- T Consensus 239 ~~~~~~~~~t~e~R~~~~~~~~~~~~~iDt~r~~~~~~~~~~t~Wk~~~~~r~~~~~~r~dy~~Vsk~~~n~~r~~~Ic~ 318 (335) T KOG1294 239 GNTLSNAGFTPEERDSFFAELLEKGPLIDTYRELHKDQKKAYTFWKYMPNGRQRGHGERCDYILVSKPGPNNGRRFYICS 318 (335) T ss_pred CCCCCCCCCCHHHHHHHHHHHCCCCCCEEEHHHHCCCCCCCCCCHHHCCCCCCCCCCCCEEEEEECCCCCCCCCEEEEEC T ss_conf 67767789887886567776435775023204316775334552422002336898871368983476778774232423 Q ss_pred EEECCCCCCCCCCCCCCCCCEEEEEEC Q ss_conf 641134554445555763438999849 Q gi|254781003|r 278 ILYNEDDIKSRGKRLSDHCPISIDYDF 304 (304) Q Consensus 278 ~~~~~~~~~~~~~~~SDH~PV~~~l~f 304 (304) .....|||+||...|.+ T Consensus 319 ----------r~~~gsdh~pi~~~~~~ 335 (335) T KOG1294 319 ----------RPIHGSDHCPITLEFFL 335 (335) T ss_pred ----------CCCCCCCCCCEEEEECC T ss_conf ----------76578899981466429 No 23 >KOG1976 consensus Probab=97.44 E-value=0.0012 Score=37.89 Aligned_cols=72 Identities=15% Similarity=0.103 Sum_probs=47.7 Q ss_pred EEEEEECCEEEEEEEECCCCCCCCCCC-HHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCC Q ss_conf 999982994999999406788877520-000257789999999999999863036898099980268887766 Q gi|254781003|r 147 ELLVEINGKKIWVLDIHLKSFCFLDSL-ENTYSPSCSLLSQQAQWLKDWITQKKESLVPFVIAGDFNRKINYL 218 (304) Q Consensus 147 ~~~~~~~~~~~~v~n~Hl~~~~~~~~~-~~~~~~~~~~~~~q~~~l~~~~~~~~~~~~~vIi~GDfN~~~~~~ 218 (304) ..++.+.+..+.++|.||-.....-.. ..........|.+++..+.+.+.++...+..++|.||||...++. T Consensus 161 ~~r~~I~~k~fdfVN~hLFhD~snla~~~sspt~ys~~R~~al~~vL~el~~~~~~~~~~fVfGdfNfrLds~ 233 (391) T KOG1976 161 LARFRIHGKEFDFVNLHLFHDVSNLATKNSSPTKYSSKREQALEMVLKELDEEGLRNDAIFVFGDFNFRLDST 233 (391) T ss_pred CEEEEECCCEEEEEEHHHHCCHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCCCHH T ss_conf 0048983735556530122235666651378146555578899999999876304752899961344210528 No 24 >PTZ00312 inositol-1,4,5-triphosphate 5-phosphatase; Provisional Probab=96.68 E-value=0.015 Score=31.91 Aligned_cols=72 Identities=17% Similarity=0.128 Sum_probs=39.0 Q ss_pred EEEEEEEEECCEEEEEEEECCCCCCCCCCCHHHH-HHHHHHHHH-HHHHHHHHHHHCCCCCCCEEEEECCCCCCC Q ss_conf 3699999829949999994067888775200002-577899999-999999998630368980999802688877 Q gi|254781003|r 144 RAVELLVEINGKKIWVLDIHLKSFCFLDSLENTY-SPSCSLLSQ-QAQWLKDWITQKKESLVPFVIAGDFNRKIN 216 (304) Q Consensus 144 ~~~~~~~~~~~~~~~v~n~Hl~~~~~~~~~~~~~-~~~~~~~~~-q~~~l~~~~~~~~~~~~~vIi~GDfN~~~~ 216 (304) +-+.+.+..++.-+.++|+||......-....+. ..-...++. -.+.|.+. ........|+.|-||||...+ T Consensus 69 GfLllslrlgtvv~n~~NVHLfhD~~N~~A~~~sPS~y~~~R~rAl~e~laec-~a~v~p~~PlFIFGDfN~RlD 142 (356) T PTZ00312 69 GFLLLSLRLGTVVVNVLNVHLYNDDDNRVAAASSPSLYTGQRQEALLEAIAEC-SAFISPSDPLFIFGDFNVRLD 142 (356) T ss_pred CEEEEEEEECCEEEEEEEEEEECCCCCHHHHHHCCCCCHHHHHHHHHHHHHHH-HHCCCCCCCEEEECCCCEEEC T ss_conf 34899998476899888888742664067886199720277999999999986-520488997589524413444 No 25 >KOG1857 consensus Probab=96.15 E-value=0.011 Score=32.71 Aligned_cols=106 Identities=7% Similarity=-0.137 Sum_probs=48.0 Q ss_pred CEEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCHHHHHHHHHCCCCC Q ss_conf 94999999406788877520000257789999999999999863036898099980268887766204689875225575 Q gi|254781003|r 154 GKKIWVLDIHLKSFCFLDSLENTYSPSCSLLSQQAQWLKDWITQKKESLVPFVIAGDFNRKINYLGNNDDFWKTIDPNDS 233 (304) Q Consensus 154 ~~~~~v~n~Hl~~~~~~~~~~~~~~~~~~~~~~q~~~l~~~~~~~~~~~~~vIi~GDfN~~~~~~~~~~~~~~~~~~~~~ 233 (304) ....+.+|+|+++-.-.........-....| +............+...++..|||-..++................+ T Consensus 434 S~~~t~vN~~lk~L~l~~~~~~~~N~~~~~R---L~~~~~~g~lT~~g~~~~~~~~~~~~~~Dg~~k~~~rK~~t~h~~~ 510 (623) T KOG1857 434 SKSKTAVNVLLKPLPLDQTCIHPENYDIAMR---LFLSSIGGTLTEVGKPEMQQKINSFLEKDGMEKIAERKQTTVHTLQ 510 (623) T ss_pred CCCHHHHHHHHHHHHHCCCCCCCCCCCHHHH---HHHHHCCCEEEECCCCCEEEECCCCCCCCCCHHHHHHHHCCCCCCC T ss_conf 5530557765300100023688545316778---8887434300222676637853777677774677665531100036 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCEEEEEECCCHHH Q ss_conf 31114456787656466888201479998620431 Q gi|254781003|r 234 LIRFPKEKDSRCNANKNLRNKIPIDYFVMDQNAYK 268 (304) Q Consensus 234 ~~~~~~~~~~~~~~~~~~~~~~~iD~I~~s~~~~~ 268 (304) ....+-... ..++....+|-++.++++.. T Consensus 511 ~~~~T~~~p------~~~~~~~S~D~~~~~KS~~~ 539 (623) T KOG1857 511 VIIDTLSQP------ESFDFRTSFDKPDFKKSIVC 539 (623) T ss_pred CCCCCCCCC------CCCCCCCCCCCCCHHHEEEE T ss_conf 300133478------88875644444131330442 No 26 >PRK13861 type IV secretion system protein VirB9; Provisional Probab=86.65 E-value=0.57 Score=23.36 Aligned_cols=32 Identities=25% Similarity=0.530 Sum_probs=21.8 Q ss_pred CHHHHHHHHHHHHHHCCCCC----------CCCEEEEEEECC Q ss_conf 91789999999997158767----------882699998301 Q gi|254781003|r 1 MIRKYVLALVFFLVPCTASV----------AQKVRLVSWNIN 32 (304) Q Consensus 1 ~~~~~~l~~~~~~~~~~~~~----------~~~lrV~twNi~ 32 (304) |||+.+|++.++++.++++. -..++.++||=+ T Consensus 1 mmk~l~~~~~~~l~~~~~a~A~~~P~~s~~D~Riq~v~Ynp~ 42 (293) T PRK13861 1 MIKKLFLTLACLLFAAIGALAEDTPAAGKLDPRMRYLAYNPD 42 (293) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCEEEEECCC T ss_conf 908999999999986126678756988888977179880888 No 27 >KOG0565 consensus Probab=85.05 E-value=2.2 Score=20.10 Aligned_cols=71 Identities=11% Similarity=0.058 Sum_probs=38.0 Q ss_pred CCEEEEEEEEECCEEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCC--C------CCCEEEEECCCC Q ss_conf 53369999982994999999406788877520000257789999999999999863036--8------980999802688 Q gi|254781003|r 142 NRRAVELLVEINGKKIWVLDIHLKSFCFLDSLENTYSPSCSLLSQQAQWLKDWITQKKE--S------LVPFVIAGDFNR 213 (304) Q Consensus 142 ~r~~~~~~~~~~~~~~~v~n~Hl~~~~~~~~~~~~~~~~~~~~~~q~~~l~~~~~~~~~--~------~~~vIi~GDfN~ 213 (304) ..+++.+.....+..+.+++.|+.++....... +........... .+...... . ..-++..||+|. T Consensus 46 nkg~v~is~~~~~~~~~~v~~hl~~~~~~~~~~--r~~d~~~i~~~~----~~~~~~~~~~~~~~~~~~D~v~w~GDlN~ 119 (145) T KOG0565 46 NKGGVAISFVLSQTSFCFVISHLTSGVHKVYER--RNEDYQEILNGL----RFPSVSPASEPVISDGEHDTVIWLGDLNY 119 (145) T ss_pred CCCCEEEEEEECCCEEEEEEECCCCCCCCCCEE--ECCCHHHHHHCC----CCCCCCCCCCCCCCCCCCCEEEEECCCCC T ss_conf 798399999964870699995436655331113--247788765203----66533566666566655667999221342 Q ss_pred CCCCC Q ss_conf 87766 Q gi|254781003|r 214 KINYL 218 (304) Q Consensus 214 ~~~~~ 218 (304) ..... T Consensus 120 Rl~~~ 124 (145) T KOG0565 120 RLSGP 124 (145) T ss_pred EECCC T ss_conf 01586 No 28 >PRK11627 hypothetical protein; Provisional Probab=84.13 E-value=1.1 Score=21.79 Aligned_cols=26 Identities=15% Similarity=0.465 Sum_probs=16.1 Q ss_pred CHHHHHHH-HHHHHHHCCCCCCCCEEE Q ss_conf 91789999-999997158767882699 Q gi|254781003|r 1 MIRKYVLA-LVFFLVPCTASVAQKVRL 26 (304) Q Consensus 1 ~~~~~~l~-~~~~~~~~~~~~~~~lrV 26 (304) ||||.+++ +..++|+.+++.+..|.+ T Consensus 1 ~mkk~~~~l~a~~lL~gCas~p~~l~l 27 (192) T PRK11627 1 MLKKILFPLVALFMLAGCATPPTTLEV 27 (192) T ss_pred CHHHHHHHHHHHHHHHHHCCCCCEEEE T ss_conf 928799999999999860699763786 No 29 >PRK12789 flgI flagellar basal body P-ring protein; Reviewed Probab=74.70 E-value=3.3 Score=19.20 Aligned_cols=61 Identities=16% Similarity=0.324 Sum_probs=35.2 Q ss_pred CHHHHHHHHHHHHHHCCCCCC-CCEEEEEEECCC--------------CCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHC Q ss_conf 917899999999971587678-826999983014--------------78866766677653101889999999998608 Q gi|254781003|r 1 MIRKYVLALVFFLVPCTASVA-QKVRLVSWNINT--------------LSEQEGVSLWKNSVKRTTSDYTLLRQYAKNLD 65 (304) Q Consensus 1 ~~~~~~l~~~~~~~~~~~~~~-~~lrV~twNi~~--------------~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~ 65 (304) |||+++|.+++.++..+.+.+ .+||=++ +|++ |...++.. ....-..+.+++++++.+ T Consensus 1 m~~~~~~~~~~~l~~~~~~~~a~RIkDi~-~v~GvR~NqLiGYGLVVGL~GTGD~~------~~~~fT~qsl~nmL~~~G 73 (367) T PRK12789 1 MIRRLLLAVLLALLAGPAAAAATRIKDIA-TLQGVRDNQLVGYGLVTGLQGTGDTL------RNSQFTEQSLQSMLDRMG 73 (367) T ss_pred CHHHHHHHHHHHHHHCCCHHHHHHHHHHE-EEEEEEECCEEEEEEEEECCCCCCCC------CCCCCHHHHHHHHHHHCC T ss_conf 90899999999998553066666564020-04002443556778998138878887------778528999999999769 Q ss_pred CCE Q ss_conf 988 Q gi|254781003|r 66 ADI 68 (304) Q Consensus 66 ~DI 68 (304) ..+ T Consensus 74 v~v 76 (367) T PRK12789 74 INV 76 (367) T ss_pred CCC T ss_conf 664 No 30 >PRK04792 tolB translocation protein TolB; Provisional Probab=72.60 E-value=2.9 Score=19.51 Aligned_cols=20 Identities=20% Similarity=0.536 Sum_probs=14.9 Q ss_pred CHHHHHHHHHHHHHHCCCCC Q ss_conf 91789999999997158767 Q gi|254781003|r 1 MIRKYVLALVFFLVPCTASV 20 (304) Q Consensus 1 ~~~~~~l~~~~~~~~~~~~~ 20 (304) |||+++++++++++..+..+ T Consensus 1 M~k~~~~~l~lll~~~s~~A 20 (450) T PRK04792 1 MLKRLILGLFVLLLSFSQVA 20 (450) T ss_pred CCHHHHHHHHHHHHHHHHHH T ss_conf 93899999999999865752 No 31 >cd07587 ML_beta-AS mammalian-like beta-alanine synthase (beta-AS) and similar proteins (class 5 nitrilases). This subgroup includes mammalian-like beta-AS (EC 3.5.1.6, also known as beta-ureidopropionase or N-carbamoyl-beta-alanine amidohydrolase). This enzyme catalyzes the third and final step in the catabolic pyrimidine catabolic pathway responsible for the degradation of uracil and thymine, the hydrolysis of N-carbamyl-beta-alanine and N-carbamyl-beta-aminoisobutyrate to the beta-amino acids, beta-alanine and beta-aminoisobutyrate respectively. This subgroup belongs to a larger nitrilase superfamily comprised of nitrile- or amide-hydrolyzing enzymes and amide-condensing enzymes, which depend on a Glu-Lys-Cys catalytic triad. This superfamily has been classified in the literature based on global and structure based sequence analysis into thirteen different enzyme classes (referred to as 1-13), this subgroup corresponds to class 5. Members of this superfamily generally form homomeric Probab=66.49 E-value=7.5 Score=17.23 Aligned_cols=24 Identities=17% Similarity=0.131 Sum_probs=16.6 Q ss_pred HHHHHHHHHHHHCCCEEEEEECCC Q ss_conf 999999999860898899995078 Q gi|254781003|r 53 DYTLLRQYAKNLDADIVFLQEMGS 76 (304) Q Consensus 53 ~~~~l~~~i~~~~~DIi~lQEv~~ 76 (304) +.+.+.+........|+||||++. T Consensus 91 ~i~~~i~~Aa~~gvni~c~qe~w~ 114 (363) T cd07587 91 RIKKIIEAAAMAGVNIICFQEAWT 114 (363) T ss_pred HHHHHHHHHHHCCCCEEEEEHHHC T ss_conf 999999999975987998400105 No 32 >PRK10002 outer membrane protein F; Provisional Probab=63.13 E-value=5.2 Score=18.11 Aligned_cols=20 Identities=15% Similarity=0.376 Sum_probs=16.4 Q ss_pred CHHHHHHHHHHHHHHCCCCC Q ss_conf 91789999999997158767 Q gi|254781003|r 1 MIRKYVLALVFFLVPCTASV 20 (304) Q Consensus 1 ~~~~~~l~~~~~~~~~~~~~ 20 (304) ||||-+|++...++..++++ T Consensus 1 mMKK~~LA~ai~all~ag~a 20 (362) T PRK10002 1 MMKRNILAVIVPALLVAGTA 20 (362) T ss_pred CCHHHHHHHHHHHHHHHCCC T ss_conf 93087999999999860644 No 33 >cd05785 DNA_polB_like2_exo A subfamily of the 3'-5' exonuclease domain of family-B DNA polymerases. This subfamily is composed of uncharacterized bacterial family-B DNA polymerases. Family-B DNA polymerases contain an N-terminal DEDDy DnaQ-like exonuclease domain in the same polypeptide chain as the polymerase domain, similar to family-A DNA polymerases. This exonuclease domain contains three sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are involved in metal binding and catalysis. The exonuclease domain of family-B DNA polymerases has a fundamental role in proofreading activity. It contains a beta hairpin structure that plays an important role in active site switching in the event of a nucleotide misincorporation. Family-B DNA polymerases are predominantly involved in DNA replication and DNA repair. Probab=54.16 E-value=13 Score=15.86 Aligned_cols=20 Identities=15% Similarity=0.062 Sum_probs=14.7 Q ss_pred HHHHHHHHHHHHCCCEEEEE Q ss_conf 99999999986089889999 Q gi|254781003|r 53 DYTLLRQYAKNLDADIVFLQ 72 (304) Q Consensus 53 ~~~~l~~~i~~~~~DIi~lQ 72 (304) -+.+..+++.+.+||||.== T Consensus 61 lL~~F~~~i~~~dPDIItGy 80 (207) T cd05785 61 LLEELVAIIRERDPDVIEGH 80 (207) T ss_pred HHHHHHHHHHHCCCCEEEEC T ss_conf 99999999987399999867 No 34 >PRK10780 periplasmic chaperone; Provisional Probab=53.25 E-value=6.9 Score=17.44 Aligned_cols=32 Identities=25% Similarity=0.362 Sum_probs=23.7 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCEEEEEEECCCC Q ss_conf 178999999999715876788269999830147 Q gi|254781003|r 2 IRKYVLALVFFLVPCTASVAQKVRLVSWNINTL 34 (304) Q Consensus 2 ~~~~~l~~~~~~~~~~~~~~~~lrV~twNi~~~ 34 (304) ||||++++.+++..++++.+.. +|+-=|++.. T Consensus 1 mKk~l~~~~l~l~l~~~a~Aa~-KIgvVd~~~i 32 (161) T PRK10780 1 MKKWLLAAGLGLALATSAQAAD-KIAIVNMGSL 32 (161) T ss_pred CHHHHHHHHHHHHHHHHHHHHH-CEEEEEHHHH T ss_conf 9589999999998588887753-5455329999 No 35 >cd07577 Ph0642_like Pyrococcus horikoshii Ph0642 and related proteins, members of the nitrilase superfamily (putative class 13 nitrilases). Uncharacterized subgroup of the nitrilase superfamily. This superfamily is comprised of nitrile- or amide-hydrolyzing enzymes and amide-condensing enzymes, which depend on a Glu-Lys-Cys catalytic triad. Pyrococcus horikoshii Ph0642 is a hypothetical protein belonging to this subgroup. This superfamily has been classified in the literature based on global and structure based sequence analysis into thirteen different enzyme classes (referred to as 1-13). This subgroup was classified as belonging to class 13, which represents proteins that at the time were difficult to place in a distinct similarity group. Members of this superfamily generally form homomeric complexes, the basic building block of which is a homodimer. Probab=49.85 E-value=16 Score=15.47 Aligned_cols=27 Identities=19% Similarity=0.352 Sum_probs=22.6 Q ss_pred CHHHHHHHHHHHHHHHCCCEEEEEECC Q ss_conf 018899999999986089889999507 Q gi|254781003|r 49 RTTSDYTLLRQYAKNLDADIVFLQEMG 75 (304) Q Consensus 49 ~~~~~~~~l~~~i~~~~~DIi~lQEv~ 75 (304) ..+.+++++.+.+++.++|+|+|.|.. T Consensus 13 d~~~Nl~k~~~~i~~a~adlvvfPEl~ 39 (259) T cd07577 13 EVEKNLKKVESLIKGVEADLIVLPELF 39 (259) T ss_pred CHHHHHHHHHHHHHHCCCCEEECCCCC T ss_conf 999999999999997599899978861 No 36 >TIGR03431 PhnD phosphonate ABC transporter, periplasmic phosphonate binding protein. Note that this model does not identify all phnD-subfamily genes with evident phosphonate context, but all sequences above the trusted context may be inferred to bind phosphonate compounds even in the absence of such context. Furthermore, there is ample evidence to suggest that many other members of the TIGR01098 subfamily have a different primary function. Probab=48.47 E-value=17 Score=15.35 Aligned_cols=80 Identities=15% Similarity=0.181 Sum_probs=43.9 Q ss_pred CHHHHHHHHHHHHHHCCCC-CC----CCEEEEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHH-CCCEEEEEEC Q ss_conf 9178999999999715876-78----8269999830147886676667765310188999999999860-8988999950 Q gi|254781003|r 1 MIRKYVLALVFFLVPCTAS-VA----QKVRLVSWNINTLSEQEGVSLWKNSVKRTTSDYTLLRQYAKNL-DADIVFLQEM 74 (304) Q Consensus 1 ~~~~~~l~~~~~~~~~~~~-~~----~~lrV~twNi~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~-~~DIi~lQEv 74 (304) |+|+++|+++++++..+++ .+ ++||+.---.+ .......+++-|++++.+. +..|- +.=+ T Consensus 1 m~~r~l~~~~~~~~~~~~~a~a~~~p~~l~~gv~P~~-------------~~~~~~~~~~Pl~~yL~~~lg~~Ve-~~~~ 66 (288) T TIGR03431 1 MLRRLILSLVAAFMLISSNAQAEDWPKELNFGIIPTE-------------NASDLKQRWEPLADYLSKKLGVKVK-LFFA 66 (288) T ss_pred CCHHHHHHHHHHHHHHCCHHHHCCCCCEEEEEEECCC-------------CHHHHHHHHHHHHHHHHHHHCCCEE-EEEC T ss_conf 9088999999999997432243048836899980699-------------9999999999999999998789789-9968 Q ss_pred CCHHHHHHHHCCCCCEEEEE Q ss_conf 78579998603334307862 Q gi|254781003|r 75 GSYNAVAKVFPKNTWCIFYS 94 (304) Q Consensus 75 ~~~~~~~~~~~~~~~~~~~~ 94 (304) .+...+.+.+.....+..+. T Consensus 67 ~~y~~~~eal~~g~~Dia~~ 86 (288) T TIGR03431 67 TDYAGVIEGMRFGKVDIAWY 86 (288) T ss_pred CCHHHHHHHHHCCCCEEEEE T ss_conf 99999999998598309998 No 37 >PRK10752 sulfate transporter subunit; Provisional Probab=46.42 E-value=6.3 Score=17.63 Aligned_cols=81 Identities=19% Similarity=0.213 Sum_probs=44.6 Q ss_pred HHHHHHHHHHHHHHCCCCCCC-CEEEEEEECCC-CCCCCCCCCCCCCCCCH----------HHHHHHHHHHHHHHCCCEE Q ss_conf 178999999999715876788-26999983014-78866766677653101----------8899999999986089889 Q gi|254781003|r 2 IRKYVLALVFFLVPCTASVAQ-KVRLVSWNINT-LSEQEGVSLWKNSVKRT----------TSDYTLLRQYAKNLDADIV 69 (304) Q Consensus 2 ~~~~~l~~~~~~~~~~~~~~~-~lrV~twNi~~-~~~~~~~~~~~~~~~~~----------~~~~~~l~~~i~~~~~DIi 69 (304) |++|.+++++++-.++..+++ +|.-+||-+-. +-..-...+...+..++ ..--.+-+.++..+++||+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~llnvSYd~tRely~~~n~~F~~~wk~~tG~~V~i~qShGGSg~QARaVi~Gl~ADVV 80 (329) T PRK10752 1 MNKWGVGLTLLLAATSVMAKDIQLLNVSYDPTRELYEQYNKAFSAHWKQQTGDNVVIRQSHGGSGKQATSVINGIEADVV 80 (329) T ss_pred CCHHHHHHHHHHHHHHHHHCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHCCCCCCEE T ss_conf 92579999999985465405526996140758999999989999999883299038985578762789998478756778 Q ss_pred EEEECCCHHHHHH Q ss_conf 9995078579998 Q gi|254781003|r 70 FLQEMGSYNAVAK 82 (304) Q Consensus 70 ~lQEv~~~~~~~~ 82 (304) .|-=..+.+.+.+ T Consensus 81 tlal~~Did~l~~ 93 (329) T PRK10752 81 TLALAYDVDAIAE 93 (329) T ss_pred EECCHHHHHHHHH T ss_conf 7514213689986 No 38 >KOG0808 consensus Probab=45.52 E-value=19 Score=15.09 Aligned_cols=25 Identities=20% Similarity=0.181 Sum_probs=17.1 Q ss_pred HHHHHHHHHHHHHCCCEEEEEECCC Q ss_conf 8999999999860898899995078 Q gi|254781003|r 52 SDYTLLRQYAKNLDADIVFLQEMGS 76 (304) Q Consensus 52 ~~~~~l~~~i~~~~~DIi~lQEv~~ 76 (304) .++.+|.+...-....||||||.+. T Consensus 100 ~r~kaiieaaa~agvniiclqeawt 124 (387) T KOG0808 100 DRLKAIIEAAAVAGVNIICLQEAWT 124 (387) T ss_pred HHHHHHHHHHHHCCCCEEEEEHHHC T ss_conf 9999999998754865797401224 No 39 >cd05780 DNA_polB_Kod1_like_exo The 3'-5' exonuclease domain of archaeal family-B DNA polymerases with similarity to Pyrococcus kodakaraensis Kod1, including polymerases from Desulfurococcus (D. Tok Pol) and Thermococcus gorgonarius (Tgo Pol). Kod1, D. Tok Pol, and Tgo Pol are thermostable enzymes that exhibit both polymerase and 3'-5' exonuclease activities. They are family-B DNA polymerases. Their amino termini harbor a DEDDy-type DnaQ-like 3'-5' exonuclease domain that contains three sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and are involved in metal binding and catalysis. The exonuclease domain of family B polymerases contains a beta hairpin structure that plays an important role in active site switching in the event of nucleotide misincorporation. Members of this subfamily show similarity to eukaryotic DNA polymerases involved in DNA replication. Some archaea possess multiple family-B D Probab=40.03 E-value=23 Score=14.60 Aligned_cols=23 Identities=13% Similarity=0.182 Sum_probs=14.5 Q ss_pred HHHHHHHHHHHHCCCEEEEEECC Q ss_conf 99999999986089889999507 Q gi|254781003|r 53 DYTLLRQYAKNLDADIVFLQEMG 75 (304) Q Consensus 53 ~~~~l~~~i~~~~~DIi~lQEv~ 75 (304) -+.+..+++...+||||.==.+. T Consensus 59 LL~~F~~~i~~~dPDii~GyN~~ 81 (195) T cd05780 59 MIKRFIEIVKEKDPDVIYTYNGD 81 (195) T ss_pred HHHHHHHHHHHCCCCEEEECCCC T ss_conf 99999999987299999957987 No 40 >PRK11067 outer membrane protein assembly factor YaeT; Provisional Probab=37.66 E-value=25 Score=14.39 Aligned_cols=33 Identities=15% Similarity=0.212 Sum_probs=22.1 Q ss_pred CHHHHHHHHHHHHHHCCCCCCCCEEEEEEECCCC Q ss_conf 9178999999999715876788269999830147 Q gi|254781003|r 1 MIRKYVLALVFFLVPCTASVAQKVRLVSWNINTL 34 (304) Q Consensus 1 ~~~~~~l~~~~~~~~~~~~~~~~lrV~twNi~~~ 34 (304) ||||++|++++|....+ .+++...|-.=.|+++ T Consensus 2 ~~k~~~~~~~l~~~~~~-~~a~~~~I~~I~v~G~ 34 (801) T PRK11067 2 AMKKLLIASLLFSSATV-YGAEGFVVKDIHFEGL 34 (801) T ss_pred CHHHHHHHHHHHHHHHH-HHCCCEEEEEEEEECC T ss_conf 75899999999999998-7558729979999897 No 41 >PRK09501 potD spermidine/putrescine ABC transporter periplasmic substrate-binding protein; Reviewed Probab=36.59 E-value=25 Score=14.34 Aligned_cols=29 Identities=17% Similarity=0.320 Sum_probs=16.9 Q ss_pred HHHHHH---HHHHHHHHCCCCCC---CCEEEEEEE Q ss_conf 178999---99999971587678---826999983 Q gi|254781003|r 2 IRKYVL---ALVFFLVPCTASVA---QKVRLVSWN 30 (304) Q Consensus 2 ~~~~~l---~~~~~~~~~~~~~~---~~lrV~twN 30 (304) ||||.+ ++-.+.+..++..+ .+|+|.+|. T Consensus 1 ~~~~~~~ll~~~a~~l~~~~~~a~~~~~L~i~~W~ 35 (348) T PRK09501 1 MKKWSRHLLAAGALALGMSAAHADDNNTLYFYNWT 35 (348) T ss_pred CHHHHHHHHHHHHHHHHCCHHHHCCCCEEEEECCC T ss_conf 93689999999999973503354338979998710 No 42 >pfam07424 TrbM TrbM. This family contains the bacterial protein TrbM (approximately 180 residues long). In Comamonas testosteroni T-2, TrbM is derived from the IncP1beta plasmid pTSA, which encodes the widespread genes for p-toluenesulfonate (TSA) degradation. Probab=33.33 E-value=20 Score=14.91 Aligned_cols=27 Identities=22% Similarity=0.347 Sum_probs=16.8 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCEEEEE Q ss_conf 178999999999715876788269999 Q gi|254781003|r 2 IRKYVLALVFFLVPCTASVAQKVRLVS 28 (304) Q Consensus 2 ~~~~~l~~~~~~~~~~~~~~~~lrV~t 28 (304) |||.+|++.+.+....+..+..+-++| T Consensus 1 MKK~ll~~al~~~~~~s~~A~~~d~lt 27 (189) T pfam07424 1 MKKILLAVALAFGMFGSASAQKLDVLT 27 (189) T ss_pred CCHHHHHHHHHHHHHHCCCCCHHHHHC T ss_conf 920179999999997147642003414 No 43 >pfam11153 DUF2931 Protein of unknown function (DUF2931). Some members in this family of proteins are annotated as lipoproteins however this cannot be confirmed. Currently, there is no known function. Probab=32.97 E-value=30 Score=13.96 Aligned_cols=33 Identities=21% Similarity=0.553 Sum_probs=25.8 Q ss_pred CHHHHHHHHHHHHHHCCCCCCCCEEEEEEECCC Q ss_conf 917899999999971587678826999983014 Q gi|254781003|r 1 MIRKYVLALVFFLVPCTASVAQKVRLVSWNINT 33 (304) Q Consensus 1 ~~~~~~l~~~~~~~~~~~~~~~~lrV~twNi~~ 33 (304) |++..+|.+.++...|+.+++.+..--.|-+.. T Consensus 1 m~~i~~ll~~lll~aCs~s~~~~~~~~~W~~~v 33 (202) T pfam11153 1 MKKILLLLLLLLLTACSTSAPTTLPPFEWRVGV 33 (202) T ss_pred CHHHHHHHHHHHHHHHCCCCCCCCCCCCEEEEE T ss_conf 901489999999876025788777788617931 No 44 >cd01129 PulE-GspE PulE/GspE The type II secretory pathway is the main terminal branch of the general secretory pathway (GSP). It is responsible for the export the majority of Gram-negative bacterial exoenzymes and toxins. PulE is a cytoplasmic protein of the GSP, which contains an ATP binding site and a tetracysteine motif. This subgroup also includes PillB and HofB. Probab=32.33 E-value=31 Score=13.90 Aligned_cols=14 Identities=7% Similarity=0.266 Sum_probs=7.8 Q ss_pred CCCCCEEEEECCCC Q ss_conf 68980999802688 Q gi|254781003|r 200 ESLVPFVIAGDFNR 213 (304) Q Consensus 200 ~~~~~vIi~GDfN~ 213 (304) ..+..+|+.|+.-+ T Consensus 147 R~dPDvi~igEiRD 160 (264) T cd01129 147 RQDPDIIMVGEIRD 160 (264) T ss_pred CCCCCEEEECCCCC T ss_conf 56999887468899 No 45 >PRK09495 glnH glutamine ABC transporter periplasmic protein; Reviewed Probab=31.46 E-value=32 Score=13.82 Aligned_cols=85 Identities=12% Similarity=0.128 Sum_probs=36.5 Q ss_pred CHHHHHHHHHHHHHHC--C-CCCCCCEEEEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEEECCCH Q ss_conf 9178999999999715--8-767882699998301478866766677653101889999999998608988999950785 Q gi|254781003|r 1 MIRKYVLALVFFLVPC--T-ASVAQKVRLVSWNINTLSEQEGVSLWKNSVKRTTSDYTLLRQYAKNLDADIVFLQEMGSY 77 (304) Q Consensus 1 ~~~~~~l~~~~~~~~~--~-~~~~~~lrV~twNi~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~DIi~lQEv~~~ 77 (304) |+|.+.++++.+++.. + .++..+|+|+| +-. +++-+. ...+....... +.+..+.++.+..+- ++.+ +. T Consensus 1 mk~~~~~~~~~~~~~~~~~~~a~~~~l~Vg~-~~~-~pPf~f--~~~g~~~Gfdv--Dl~~~ia~~lg~~~e-~~~~-~~ 72 (247) T PRK09495 1 MKSVLKVSLAALTLAFAVSSHAADKELVVAT-DTA-FVPFEF--KQGDKYVGFDV--DLWAAIAKELKLDYT-LKPM-DF 72 (247) T ss_pred CHHHHHHHHHHHHHHHHHHHHHCCCEEEEEE-CCC-CCCEEE--CCCCEEEEEHH--HHHHHHHHHHCCCEE-EEEC-CH T ss_conf 9779999999999999876651499499998-998-798057--67992788299--999999999699469-9977-88 Q ss_pred HHHHHHHCCCCCEEEE Q ss_conf 7999860333430786 Q gi|254781003|r 78 NAVAKVFPKNTWCIFY 93 (304) Q Consensus 78 ~~~~~~~~~~~~~~~~ 93 (304) +.+-..+.....+... T Consensus 73 ~~~i~~l~~g~~D~~~ 88 (247) T PRK09495 73 SGIIPALQTKNIDLAL 88 (247) T ss_pred HHHHHHHHCCCCCEEE T ss_conf 9999999769956364 No 46 >pfam06838 Alum_res Aluminium resistance protein. This family represents the aluminium resistance protein, which confers resistance to aluminium in bacteria. Probab=30.77 E-value=33 Score=13.75 Aligned_cols=20 Identities=5% Similarity=0.458 Sum_probs=10.8 Q ss_pred HHHHHHHHHHHHHCCCEEEE Q ss_conf 89999999998608988999 Q gi|254781003|r 52 SDYTLLRQYAKNLDADIVFL 71 (304) Q Consensus 52 ~~~~~l~~~i~~~~~DIi~l 71 (304) .+.+.+.+.++..+||+||| T Consensus 176 ~~I~~~i~~vk~~~pd~ivf 195 (405) T pfam06838 176 AEIKEMIKFVKEINPNVIVF 195 (405) T ss_pred HHHHHHHHHHHHHCCCEEEE T ss_conf 99999999999768993999 No 47 >cd05781 DNA_polB_B3_exo The 3'-5' exonuclease domain of archaeal proteins with similarity to Sulfurisphaera ohwakuensis DNA polymerase B3. B3 is a family-B DNA polymerase. Family-B DNA polymerases contain an N-terminal DEDDy DnaQ-like exonuclease domain in the same polypeptide chain as the polymerase domain, similar to family-A DNA polymerases. B3 exhibits both polymerase and 3'-5' exonuclease activities. This exonuclease domain contains three sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and are involved in metal binding and catalysis. The exonuclease domain of family B polymerases also contains a beta hairpin structure that plays an important role in active site switching in the event of nucleotide misincorporation. Archaeal proteins that are involved in DNA replication are similar to those from eukaryotes. Some archaea possess multiple family-B DNA polymerases. B3 is mainly found in crenarch Probab=30.42 E-value=33 Score=13.72 Aligned_cols=25 Identities=20% Similarity=0.240 Sum_probs=17.1 Q ss_pred HHHHHHHHHHHHHCCCEEEEEECCC Q ss_conf 8999999999860898899995078 Q gi|254781003|r 52 SDYTLLRQYAKNLDADIVFLQEMGS 76 (304) Q Consensus 52 ~~~~~l~~~i~~~~~DIi~lQEv~~ 76 (304) .-+.+..+++.+.+||||.==.+.. T Consensus 50 ~ll~~f~~~i~~~~Pdii~gyN~~~ 74 (188) T cd05781 50 KIIREFVKYVKEYDPDIIVGYNSNA 74 (188) T ss_pred HHHHHHHHHHHHCCCCEEEECCCCC T ss_conf 9999999999872999999669867 No 48 >CHL00091 apcE phycobillisome linker protein Probab=29.33 E-value=11 Score=16.40 Aligned_cols=33 Identities=36% Similarity=0.359 Sum_probs=19.1 Q ss_pred HHHHHHHHHHHHHHCC-CCCCCCEEEEEEECCCC Q ss_conf 1789999999997158-76788269999830147 Q gi|254781003|r 2 IRKYVLALVFFLVPCT-ASVAQKVRLVSWNINTL 34 (304) Q Consensus 2 ~~~~~l~~~~~~~~~~-~~~~~~lrV~twNi~~~ 34 (304) |+|.+.-|-||+-=.+ +-.+..=.|++=|++++ T Consensus 146 M~KSlRDl~WFLRY~tYAiVAGDp~Il~VNtrGL 179 (886) T CHL00091 146 MRKSLRDLDWFLRYVTYAIVAGDPSILSVNTRGL 179 (886) T ss_pred HHHHHHHHHHHHHHHHHEEECCCCCEEEEECCHH T ss_conf 7777765789998765303517984688844108 No 49 >TIGR02622 CDP_4_6_dhtase CDP-glucose 4,6-dehydratase; InterPro: IPR013445 This entry contains CDP-glucose 4,6-dehydratases from a variety of Gram-negative and Gram-positive bacteria. Members are typically encoded next to a gene that encodes a glucose-1-phosphate cytidylyltransferase, which produces the substrate CDP-D-glucose. This is used by the proteins in this entry to produce CDP-4-keto-6-deoxyglucose.. Probab=29.11 E-value=35 Score=13.59 Aligned_cols=20 Identities=25% Similarity=0.348 Sum_probs=17.2 Q ss_pred HHHHHHHHHHHHCCCEEEEE Q ss_conf 99999999986089889999 Q gi|254781003|r 53 DYTLLRQYAKNLDADIVFLQ 72 (304) Q Consensus 53 ~~~~l~~~i~~~~~DIi~lQ 72 (304) |.++|.+.|.+.+||||+-- T Consensus 69 D~~~L~~~~~~~~PeIvFHl 88 (361) T TIGR02622 69 DAAKLEKAIAEFKPEIVFHL 88 (361) T ss_pred CHHHHHHHHHHCCCCEEEHH T ss_conf 27899999997289898333 No 50 >pfam01963 TraB TraB family. pAD1 is a hemolysin/bacteriocin plasmid originally identified in Enterococcus faecalis DS16. It encodes a mating response to a peptide sex pheromone, cAD1, secreted by recipient bacteria. Once the plasmid pAD1 is acquired, production of the pheromone ceases--a trait related in part to a determinant designated traB. However a related protein is found in C. elegans, suggesting that members of the TraB family have some more general function. Probab=28.58 E-value=36 Score=13.53 Aligned_cols=26 Identities=15% Similarity=0.222 Sum_probs=20.3 Q ss_pred HHHHHHHHHHHHHHHCCCEEEEEECCC Q ss_conf 188999999999860898899995078 Q gi|254781003|r 50 TTSDYTLLRQYAKNLDADIVFLQEMGS 76 (304) Q Consensus 50 ~~~~~~~l~~~i~~~~~DIi~lQEv~~ 76 (304) ++...+.+.+.|...+||+||+ |.+. T Consensus 15 S~~S~~~V~~~I~~~~PD~V~V-ELc~ 40 (225) T pfam01963 15 SKESVEEVRRVIEEERPDAVAV-ELDE 40 (225) T ss_pred CHHHHHHHHHHHHHCCCCEEEE-EECH T ss_conf 9999999999998619997997-5688 No 51 >PRK10954 periplasmic protein disulfide isomerase I; Provisional Probab=28.57 E-value=29 Score=14.02 Aligned_cols=20 Identities=35% Similarity=0.396 Sum_probs=13.2 Q ss_pred HHHHHHHHHHHHHHCCCCCC Q ss_conf 17899999999971587678 Q gi|254781003|r 2 IRKYVLALVFFLVPCTASVA 21 (304) Q Consensus 2 ~~~~~l~~~~~~~~~~~~~~ 21 (304) ||++++.+..+++++++.++ T Consensus 1 mkk~~~~~~~llls~~a~Aa 20 (207) T PRK10954 1 MKKIWLALAGMVLAFSASAA 20 (207) T ss_pred CCHHHHHHHHHHHHHHHHHH T ss_conf 91599999999999772250 No 52 >PRK12698 flgH flagellar basal body L-ring protein; Reviewed Probab=26.82 E-value=38 Score=13.36 Aligned_cols=18 Identities=22% Similarity=0.473 Sum_probs=8.8 Q ss_pred HHHHHHHHHHHHHHCCCC Q ss_conf 178999999999715876 Q gi|254781003|r 2 IRKYVLALVFFLVPCTAS 19 (304) Q Consensus 2 ~~~~~l~~~~~~~~~~~~ 19 (304) ||+++|.++.++|..+++ T Consensus 1 M~r~~l~~~~l~L~gCa~ 18 (224) T PRK12698 1 MARYILLALALLLAGCSS 18 (224) T ss_pred CHHHHHHHHHHHHHCCCC T ss_conf 977999999999836568 No 53 >pfam10913 DUF2706 Protein of unknown function (DUF2706). This family of proteins with unknown function appears to be restricted to Rickettsia spp. Probab=26.51 E-value=39 Score=13.32 Aligned_cols=25 Identities=20% Similarity=0.661 Sum_probs=16.0 Q ss_pred CHHHHHHHHHHHH----HHCCCCCCCCEE Q ss_conf 9178999999999----715876788269 Q gi|254781003|r 1 MIRKYVLALVFFL----VPCTASVAQKVR 25 (304) Q Consensus 1 ~~~~~~l~~~~~~----~~~~~~~~~~lr 25 (304) |.|.+.+.+++++ ++|++.++-.++ T Consensus 1 mlk~~kf~l~~imlaqllsctpsapyeik 29 (60) T pfam10913 1 MLKLFKFGLLLIMLAQLLSCTPSAPYEIK 29 (60) T ss_pred CHHHHHHHHHHHHHHHHHCCCCCCCCCCC T ss_conf 91588999999999999757999974035 No 54 >PRK12696 flgH flagellar basal body L-ring protein; Reviewed Probab=25.64 E-value=41 Score=13.23 Aligned_cols=19 Identities=42% Similarity=0.683 Sum_probs=10.7 Q ss_pred CHHHHHHHHH-HHHH-HCCCC Q ss_conf 9178999999-9997-15876 Q gi|254781003|r 1 MIRKYVLALV-FFLV-PCTAS 19 (304) Q Consensus 1 ~~~~~~l~~~-~~~~-~~~~~ 19 (304) |||++++++. .++| .|+++ T Consensus 1 mm~~~l~~~~~~~~L~GC~~~ 21 (238) T PRK12696 1 MIRKLLAASCAVLLLSGCNAA 21 (238) T ss_pred CHHHHHHHHHHHHHHHCCCCC T ss_conf 958999999999995165688 No 55 >COG3459 Cellobiose phosphorylase [Carbohydrate transport and metabolism] Probab=25.07 E-value=42 Score=13.17 Aligned_cols=33 Identities=21% Similarity=0.353 Sum_probs=21.8 Q ss_pred HHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCC Q ss_conf 999999999986303689809998026888776 Q gi|254781003|r 185 SQQAQWLKDWITQKKESLVPFVIAGDFNRKINY 217 (304) Q Consensus 185 ~~q~~~l~~~~~~~~~~~~~vIi~GDfN~~~~~ 217 (304) ...+.+|.-.+......+.|.|+.||+|+..+. T Consensus 715 EH~~rAld~av~r~g~~glpli~gGDWND~ln~ 747 (1056) T COG3459 715 EHCARALDLAVARLGPHGLPLILGGDWNDGLNL 747 (1056) T ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCC T ss_conf 999999999998608777643157761100350 No 56 >COG5510 Predicted small secreted protein [Function unknown] Probab=24.68 E-value=40 Score=13.24 Aligned_cols=17 Identities=29% Similarity=0.596 Sum_probs=11.6 Q ss_pred CHHHHHHHHHHHHHHCC Q ss_conf 91789999999997158 Q gi|254781003|r 1 MIRKYVLALVFFLVPCT 17 (304) Q Consensus 1 ~~~~~~l~~~~~~~~~~ 17 (304) |||+++|....+.+.++ T Consensus 1 mmk~t~l~i~~vll~s~ 17 (44) T COG5510 1 MMKKTILLIALVLLAST 17 (44) T ss_pred CCHHHHHHHHHHHHHHH T ss_conf 90489999999999999 No 57 >PRK00178 tolB translocation protein TolB; Provisional Probab=24.00 E-value=44 Score=13.06 Aligned_cols=23 Identities=22% Similarity=0.297 Sum_probs=13.1 Q ss_pred CHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 91789999999997158767882 Q gi|254781003|r 1 MIRKYVLALVFFLVPCTASVAQK 23 (304) Q Consensus 1 ~~~~~~l~~~~~~~~~~~~~~~~ 23 (304) |||...|.++++++.++.++... T Consensus 1 m~~l~~~~~l~~~l~~~~~a~A~ 23 (433) T PRK00178 1 MIKLKRIALLVLCLLAGGAAAAE 23 (433) T ss_pred CCCHHHHHHHHHHHHHHHHHCEE T ss_conf 90179999999999876762203 No 58 >pfam06649 DUF1161 Protein of unknown function (DUF1161). This family consists of several short, hypothetical bacterial proteins of unknown function. Probab=23.62 E-value=44 Score=13.01 Aligned_cols=18 Identities=22% Similarity=0.563 Sum_probs=13.2 Q ss_pred HHHHHHHHHHHHHHCCCC Q ss_conf 178999999999715876 Q gi|254781003|r 2 IRKYVLALVFFLVPCTAS 19 (304) Q Consensus 2 ~~~~~l~~~~~~~~~~~~ 19 (304) ||+++|+..+++++.++- T Consensus 1 Mkk~~l~~~l~lla~~al 18 (75) T pfam06649 1 MKRLLLAVALLLLAASAL 18 (75) T ss_pred CCHHHHHHHHHHHHHHHH T ss_conf 924699999999713453 No 59 >PRK13883 conjugal transfer protein TrbH; Provisional Probab=23.56 E-value=45 Score=13.01 Aligned_cols=78 Identities=18% Similarity=0.146 Sum_probs=37.5 Q ss_pred CHHHHHHHHHHHHHHCCCCCCCCEEEEEEECCCCCCCCCCCCCCCCCCCHH-HHHHHHHHHHHHHCC--CEEEEEECCCH Q ss_conf 917899999999971587678826999983014788667666776531018-899999999986089--88999950785 Q gi|254781003|r 1 MIRKYVLALVFFLVPCTASVAQKVRLVSWNINTLSEQEGVSLWKNSVKRTT-SDYTLLRQYAKNLDA--DIVFLQEMGSY 77 (304) Q Consensus 1 ~~~~~~l~~~~~~~~~~~~~~~~lrV~twNi~~~~~~~~~~~~~~~~~~~~-~~~~~l~~~i~~~~~--DIi~lQEv~~~ 77 (304) |||+.+++++.+.|..+++...= .+|. ..+....+ ..-+.+.+.++-.-| -.++||---.. T Consensus 1 MrK~~~~all~laL~gCAtt~~Y--------GNFv--------~~~~~~~q~lA~Dav~QL~~lYPPA~Trf~l~Q~~~D 64 (156) T PRK13883 1 MRKILSLALLALALGGCATTSQY--------GNFV--------QASAADQQKLATDAVQQLAALYPPAQTRLELQQPTPD 64 (156) T ss_pred CHHHHHHHHHHHHHHCCCCCCCC--------CCCC--------CCCHHHHHHHHHHHHHHHHHHCCCCCCEEEECCCCCC T ss_conf 92689999999997033135788--------7735--------8740354888999999999856875437886068996 Q ss_pred ---HHHHHHHCCCCCEEEEE Q ss_conf ---79998603334307862 Q gi|254781003|r 78 ---NAVAKVFPKNTWCIFYS 94 (304) Q Consensus 78 ---~~~~~~~~~~~~~~~~~ 94 (304) ..+...+...+|..... T Consensus 65 ~FG~aLv~~LR~kGYav~E~ 84 (156) T PRK13883 65 PFGQALVKSLRKKGYAVLEY 84 (156) T ss_pred HHHHHHHHHHHHCCEEEEEC T ss_conf 78999999998648468760 No 60 >PRK11443 hypothetical protein; Provisional Probab=22.07 E-value=48 Score=12.84 Aligned_cols=20 Identities=20% Similarity=0.366 Sum_probs=11.5 Q ss_pred HHHHHHHHHHHHHHCCCCCC Q ss_conf 17899999999971587678 Q gi|254781003|r 2 IRKYVLALVFFLVPCTASVA 21 (304) Q Consensus 2 ~~~~~l~~~~~~~~~~~~~~ 21 (304) ||+++.+++.++|......+ T Consensus 1 Mk~~~~~~~~lll~GCq~~p 20 (120) T PRK11443 1 MKKFIAPLLALLVSGCQIDP 20 (120) T ss_pred CHHHHHHHHHHHHHCCCCCC T ss_conf 91589999999996667896 No 61 >TIGR02855 spore_yabG sporulation peptidase YabG; InterPro: IPR008764 Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. The peptidases families associated with clan U- have an unknown catalytic mechanism as the protein fold of the active site domain and the active site residues have not been reported. This is a group of peptidases belong to MEROPS peptidase family U57 (clan U-). The type example is the YabG protein of Bacillus subtilis. This is a protease involved in the synthesis and maturation of the spore coat proteins SpoIVA and YrbA of Bacillus subtilis .. Probab=21.70 E-value=49 Score=12.80 Aligned_cols=12 Identities=8% Similarity=0.105 Sum_probs=7.8 Q ss_pred CCEEEEEECCCH Q ss_conf 014799986204 Q gi|254781003|r 255 IPIDYFVMDQNA 266 (304) Q Consensus 255 ~~iD~I~~s~~~ 266 (304) +.||=||+.... T Consensus 241 HaLDPVyi~~k~ 252 (292) T TIGR02855 241 HALDPVYIVEKV 252 (292) T ss_pred HCCCHHHEEEEE T ss_conf 226730501100 No 62 >PRK01742 tolB translocation protein TolB; Provisional Probab=21.17 E-value=50 Score=12.73 Aligned_cols=18 Identities=6% Similarity=0.316 Sum_probs=11.3 Q ss_pred CHHHHHHHHHHHHHHCCC Q ss_conf 917899999999971587 Q gi|254781003|r 1 MIRKYVLALVFFLVPCTA 18 (304) Q Consensus 1 ~~~~~~l~~~~~~~~~~~ 18 (304) |||++++.+++++..... T Consensus 3 m~~~~~~~~~~l~~~~~~ 20 (430) T PRK01742 3 LLKRLVSVFAVLFAVISN 20 (430) T ss_pred HHHHHHHHHHHHHHHHHH T ss_conf 699999999999999877 No 63 >TIGR01420 pilT_fam twitching motility protein; InterPro: IPR006321 These represent the PilT subfamily of proteins related to GspE, a protein involved in type II secretion (also called the General Secretion Pathway). PilT is an apparent cytosolic ATPase associated with type IV pilus systems. It is not required for pilin biogenesis, but is required for twitching motility and social gliding behaviors, shown in some species, powered by pilus retraction . Members of this family may be found in some species that do not have type IV pili but have related structures for DNA uptake and natural transformation. ; GO: 0005524 ATP binding, 0006810 transport. Probab=20.94 E-value=51 Score=12.71 Aligned_cols=32 Identities=9% Similarity=0.108 Sum_probs=22.4 Q ss_pred HHHHHHHHHCCCEEEEEECCCH-HHHHHHHCCC Q ss_conf 9999998608988999950785-7999860333 Q gi|254781003|r 56 LLRQYAKNLDADIVFLQEMGSY-NAVAKVFPKN 87 (304) Q Consensus 56 ~l~~~i~~~~~DIi~lQEv~~~-~~~~~~~~~~ 87 (304) .++-+++-+-..|--|+|..-. ..+.+++... T Consensus 94 ~~a~vlR~ip~~Ip~fe~LGLP~~v~~~~a~~~ 126 (350) T TIGR01420 94 GVALVLRLIPSKIPTFEELGLPRPVLRELAERP 126 (350) T ss_pred HHHHHHHHCCCCCCCHHHCCCCHHHHHHHHHCC T ss_conf 064232311534621666379878999998366 No 64 >pfam02446 Glyco_hydro_77 4-alpha-glucanotransferase. These enzymes EC:2.4.1.25 transfer a segment of a (1,4)-alpha-D-glucan to a new 4-position in an acceptor, which may be glucose or (1,4)-alpha-D-glucan. Probab=20.48 E-value=52 Score=12.65 Aligned_cols=17 Identities=24% Similarity=0.169 Sum_probs=11.0 Q ss_pred CCCEEEEECCCCCCCCC Q ss_conf 98099980268887766 Q gi|254781003|r 202 LVPFVIAGDFNRKINYL 218 (304) Q Consensus 202 ~~~vIi~GDfN~~~~~~ 218 (304) +..|-|.||+=.-.+.. T Consensus 204 ~~gI~L~gDlpigv~~d 220 (494) T pfam02446 204 DKGIGIKGDLPIYVAYD 220 (494) T ss_pred HCCCEEEEECCEEECCC T ss_conf 37987987432057898 No 65 >pfam03104 DNA_pol_B_exo DNA polymerase family B, exonuclease domain. This domain has 3' to 5' exonuclease activity and adopts a ribonuclease H type fold. Probab=20.24 E-value=52 Score=12.62 Aligned_cols=23 Identities=17% Similarity=0.369 Sum_probs=9.3 Q ss_pred HHHHHHHHHCCCCCCCEEEEECCCC Q ss_conf 9999998630368980999802688 Q gi|254781003|r 189 QWLKDWITQKKESLVPFVIAGDFNR 213 (304) Q Consensus 189 ~~l~~~~~~~~~~~~~vIi~GDfN~ 213 (304) +.|..|.......+.. |+.| +|. T Consensus 153 ~ll~~f~~~i~~~dpd-ii~g-yNi 175 (254) T pfam03104 153 ELLRRFFEFIRQYDPD-IITG-YNG 175 (254) T ss_pred HHHHHHHHHHHHCCCC-EEEE-CCC T ss_conf 9999999998751998-9996-678 No 66 >cd02854 Glycogen_branching_enzyme_like_N_term Glycogen branching enzyme-like N-terminus domain. Glycogen branching enzyme (AKA 1,4 alpha glucan branching enzyme) catalyzes the formation of alpha-1,6 branch points in either glycogen or starch by cleavage of the alpha-1,4 glucosidic linkage yielding a non-reducing end oligosaccharide chain and subsequent attachment to the alpha-1,6 position. By increasing the number of non-reducing ends glycogen is more reactive to synthesis and digestion as well as being more soluble. The N-terminus of the glycogen branching enzyme-like proteins may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitobi Probab=20.08 E-value=53 Score=12.60 Aligned_cols=14 Identities=29% Similarity=0.482 Sum_probs=10.6 Q ss_pred CCCCCEEEEECCCC Q ss_conf 68980999802688 Q gi|254781003|r 200 ESLVPFVIAGDFNR 213 (304) Q Consensus 200 ~~~~~vIi~GDfN~ 213 (304) ++...|-|.||||. T Consensus 14 PnA~~v~liGdFN~ 27 (99) T cd02854 14 PNAEEVYLIGDFNN 27 (99) T ss_pred CCCCEEEEEEECCC T ss_conf 98849999812778 Done!