Query gi|254780464|ref|YP_003064877.1| M16 family peptidase [Candidatus Liberibacter asiaticus str. psy62] Match_columns 424 No_of_seqs 186 out of 5740 Neff 9.6 Searched_HMMs 39220 Date Sun May 29 17:06:33 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780464.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 KOG0960 consensus 100.0 0 0 531.9 26.8 418 3-422 34-466 (467) 2 COG0612 PqqL Predicted Zn-depe 100.0 0 0 525.8 30.2 406 2-407 16-434 (438) 3 KOG2067 consensus 100.0 0 0 465.2 23.4 417 2-420 24-459 (472) 4 KOG2583 consensus 100.0 1.5E-43 0 311.1 24.4 398 3-414 23-429 (429) 5 COG1025 Ptr Secreted/periplasm 100.0 2.5E-41 0 296.1 23.1 400 2-407 23-444 (937) 6 KOG0959 consensus 100.0 4.6E-41 1.4E-45 294.3 22.8 402 2-406 27-448 (974) 7 COG1026 Predicted Zn-dependent 100.0 4.7E-37 1.2E-41 267.5 18.9 408 9-421 28-476 (978) 8 pfam00675 Peptidase_M16 Insuli 100.0 1.2E-32 3.1E-37 237.9 10.2 147 13-159 1-148 (149) 9 KOG2019 consensus 100.0 6.6E-30 1.7E-34 219.5 18.5 397 8-409 59-499 (998) 10 KOG0961 consensus 100.0 2.5E-28 6.3E-33 209.0 22.5 391 11-406 29-457 (1022) 11 pfam05193 Peptidase_M16_C Pept 99.9 4.9E-28 1.3E-32 207.0 9.0 174 165-338 1-182 (182) 12 TIGR02110 PQQ_syn_pqqF coenzym 99.9 3.6E-28 9.1E-33 207.9 -0.3 389 4-398 1-420 (737) 13 COG1026 Predicted Zn-dependent 99.9 6.3E-20 1.6E-24 152.6 22.6 388 5-403 529-957 (978) 14 COG1025 Ptr Secreted/periplasm 99.7 8.6E-15 2.2E-19 118.2 21.6 376 7-395 507-896 (937) 15 KOG0959 consensus 99.6 1.2E-13 3.1E-18 110.4 20.1 371 12-394 519-910 (974) 16 KOG2019 consensus 99.6 1.7E-13 4.3E-18 109.5 17.2 381 8-402 566-981 (998) 17 KOG0961 consensus 99.5 7.3E-12 1.9E-16 98.5 15.1 341 53-402 596-983 (1022) 18 pfam03410 Peptidase_M44 Protei 98.6 1.9E-07 4.8E-12 68.9 6.4 186 6-214 2-195 (590) 19 COG0612 PqqL Predicted Zn-depe 98.5 1.2E-05 3E-10 56.9 14.0 161 233-402 36-205 (438) 20 pfam08367 M16C_assoc Peptidase 98.2 1E-05 2.7E-10 57.3 9.2 26 296-321 161-186 (248) 21 pfam00675 Peptidase_M16 Insuli 97.2 0.0088 2.2E-07 37.6 10.3 131 228-366 5-140 (149) 22 KOG0960 consensus 96.6 0.063 1.6E-06 31.9 13.3 10 378-387 369-378 (467) 23 KOG2067 consensus 96.5 0.012 3.1E-07 36.7 6.6 12 255-266 219-230 (472) 24 TIGR02110 PQQ_syn_pqqF coenzym 95.1 0.063 1.6E-06 31.9 5.6 223 71-318 509-735 (737) 25 pfam05193 Peptidase_M16_C Pept 91.5 0.55 1.4E-05 25.6 5.4 101 16-121 70-180 (182) 26 TIGR01334 modD modD protein; I 72.0 1.7 4.4E-05 22.3 1.1 27 376-402 214-242 (277) 27 TIGR02545 ATP_syn_fliI flagell 66.2 2.1 5.4E-05 21.7 0.6 37 169-206 172-208 (439) 28 TIGR01026 fliI_yscN ATPase Fli 50.2 12 0.0003 16.6 2.2 81 252-338 364-450 (455) 29 TIGR03395 sphingomy sphingomye 48.6 14 0.00035 16.3 2.2 39 160-198 154-192 (283) 30 TIGR01574 miaB-methiolase tRNA 47.1 17 0.00044 15.6 2.6 18 113-130 179-196 (456) 31 KOG1110 consensus 46.2 14 0.00035 16.2 2.0 75 258-335 63-137 (183) 32 TIGR01481 ccpA catabolite cont 44.1 13 0.00033 16.4 1.6 63 60-122 33-106 (332) 33 PRK06220 consensus 41.5 22 0.00056 14.8 2.5 10 411-420 364-373 (384) 34 PRK07682 hypothetical protein; 40.9 21 0.00055 14.9 2.3 11 411-421 360-370 (378) 35 PRK08361 aspartate aminotransf 40.8 22 0.00055 14.9 2.3 11 410-420 372-382 (390) 36 PRK06056 consensus 40.6 21 0.00055 14.9 2.2 15 380-394 345-359 (402) 37 pfam08593 MUG2_C Meiotically u 39.2 17 0.00043 15.6 1.5 31 152-182 33-66 (94) 38 COG5023 Tubulin [Cytoskeleton] 39.2 23 0.00057 14.8 2.2 10 202-211 164-173 (443) 39 cd01568 QPRTase_NadC Quinolina 38.0 16 0.0004 15.9 1.2 11 379-389 236-246 (269) 40 PRK07309 aromatic amino acid a 37.6 25 0.00065 14.4 2.4 11 180-190 89-99 (390) 41 cd01569 PBEF_like pre-B-cell c 37.0 5.3 0.00014 19.0 -1.3 60 59-118 78-146 (407) 42 TIGR02414 pepN_proteo aminopep 35.2 27 0.0007 14.2 2.1 77 318-394 644-734 (898) 43 PRK08362 consensus 35.2 28 0.0007 14.2 2.4 10 169-178 70-79 (389) 44 PRK11174 cysteine/glutathione 33.9 21 0.00053 15.0 1.3 85 325-410 449-549 (588) 45 PRK06207 aspartate aminotransf 33.0 30 0.00076 14.0 2.4 23 168-190 88-110 (406) 46 PRK07683 aminotransferase A; V 33.0 30 0.00076 14.0 2.6 10 411-420 367-376 (387) 47 PRK06195 DNA polymerase III su 32.9 30 0.00075 14.0 2.0 30 43-74 52-81 (309) 48 PRK07337 aminotransferase; Val 32.8 29 0.00074 14.0 1.9 11 411-421 373-383 (388) 49 COG0157 NadC Nicotinate-nucleo 32.0 22 0.00056 14.9 1.2 12 378-389 242-253 (280) 50 PRK06507 consensus 32.0 31 0.00079 13.9 2.1 17 168-184 74-94 (400) 51 PRK10046 dpiA two-component re 31.6 31 0.0008 13.8 2.7 24 365-389 176-199 (225) 52 TIGR01478 STEVOR variant surfa 31.5 32 0.0008 13.8 1.9 51 140-204 48-99 (315) 53 PRK08960 hypothetical protein; 30.5 31 0.0008 13.8 1.7 11 410-420 370-380 (387) 54 PRK07778 consensus 30.1 32 0.00082 13.8 1.8 12 410-421 369-380 (386) 55 TIGR01754 flav_RNR ribonucleot 29.6 17 0.00043 15.7 0.2 57 152-208 52-118 (145) 56 TIGR02873 spore_ylxY probable 28.0 36 0.00092 13.4 1.8 26 146-177 140-165 (269) 57 PRK08385 nicotinate-nucleotide 28.0 21 0.00053 15.0 0.5 16 323-338 209-224 (279) 58 pfam01837 DUF39 Domain of unkn 27.7 22 0.00055 14.9 0.5 29 24-52 72-100 (300) 59 PRK09225 threonine synthase; V 27.6 37 0.00094 13.4 3.5 31 179-211 180-211 (465) 60 PRK09148 aminotransferase; Val 26.6 35 0.00088 13.5 1.4 12 410-421 374-385 (406) 61 PRK08636 aspartate aminotransf 26.5 38 0.00098 13.2 2.5 23 168-190 77-103 (403) 62 PRK13539 cytochrome c biogenes 26.2 14 0.00036 16.2 -0.6 62 355-416 136-197 (206) 63 PRK08072 nicotinate-nucleotide 26.0 31 0.00079 13.9 1.1 34 302-335 192-226 (277) 64 PRK09147 aminotransferase; Pro 24.6 41 0.0011 13.0 1.7 12 410-421 380-391 (397) 65 PRK06106 nicotinate-nucleotide 24.6 33 0.00083 13.7 1.0 35 303-337 197-232 (281) 66 pfam00427 PBS_linker_poly Phyc 24.6 41 0.0011 13.0 4.1 86 321-410 39-128 (131) 67 PRK09939 putative oxidoreducta 23.9 33 0.00085 13.7 0.9 12 181-192 144-155 (759) 68 pfam05838 DUF847 Predicted lys 23.9 33 0.00085 13.6 0.9 25 159-183 42-67 (87) 69 PRK05742 nicotinate-nucleotide 23.6 36 0.00093 13.4 1.0 31 305-335 196-227 (277) 70 TIGR02416 CO_dehy_Mo_lg carbon 23.4 19 0.00048 15.3 -0.4 68 308-386 364-433 (775) 71 TIGR02915 PEP_resp_reg putativ 23.2 44 0.0011 12.8 1.8 36 92-128 86-124 (451) 72 cd01572 QPRTase Quinolinate ph 23.2 36 0.00091 13.4 1.0 35 303-337 187-222 (268) 73 pfam01729 QRPTase_C Quinolinat 23.2 32 0.00081 13.8 0.7 40 163-202 106-146 (169) 74 cd01573 modD_like ModD; Quinol 22.7 38 0.00096 13.3 1.0 12 378-389 238-249 (272) 75 PRK07896 nicotinate-nucleotide 21.3 44 0.0011 12.9 1.1 33 304-336 204-237 (288) 76 TIGR00365 TIGR00365 glutaredox 21.3 48 0.0012 12.6 1.4 68 51-128 15-92 (103) 77 KOG0431 consensus 21.0 49 0.0012 12.5 1.4 18 373-390 394-411 (453) 78 COG1448 TyrB Aspartate/tyrosin 20.8 44 0.0011 12.9 1.0 161 239-413 211-390 (396) 79 PRK09265 aminotransferase AlaT 20.7 49 0.0013 12.5 2.3 12 409-420 383-394 (404) 80 PRK06543 nicotinate-nucleotide 20.6 44 0.0011 12.9 1.0 12 378-389 245-256 (281) 81 PRK09016 quinolinate phosphori 20.6 45 0.0012 12.8 1.0 12 378-389 260-271 (296) 82 cd04913 ACT_AKii-LysC-BS-like_ 20.0 49 0.0013 12.5 1.1 27 82-108 41-67 (75) No 1 >KOG0960 consensus Probab=100.00 E-value=0 Score=531.86 Aligned_cols=418 Identities=29% Similarity=0.476 Sum_probs=391.8 Q ss_pred EEEEECCCCCEEEEECCCCCEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHCCEEEEEECCC Q ss_conf 18998798719999858998799999983311448888983489999897335688898899999997087679975322 Q gi|254780464|r 3 LRISKTSSGITVITEVMPIDSAFVKVNIRAGSRNERQEEHGMAHFLEHMLFKGTTKRTAKEIVEEIEKVGGDINAYTSLE 82 (424) Q Consensus 3 ~~~~~L~NGl~v~~~~~~~~~~~i~~~~~~Gs~~e~~~~~G~ah~lehl~~~gt~~~~~~~~~~~l~~~G~~~na~t~~d 82 (424) -++|+|+||++|+++++..++++|++|+++||++|++.+.|+|||+|||+||||++++...+..++|.+|+.+||+|++| T Consensus 34 t~vttL~NGlrVaTE~~~a~TATVGVwidaGSR~EnekNNG~ahFLEhlaFKGT~~Rs~~alElEieniGahLNAytSRe 113 (467) T KOG0960 34 TEVTTLPNGLRVATEHNSASTATVGVWIDAGSRFENEKNNGTAHFLEHLAFKGTKNRSQAALELEIENIGAHLNAYTSRE 113 (467) T ss_pred CEEEECCCCCEEEECCCCCCCEEEEEEECCCCCCCCCCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCCCCCC T ss_conf 35887478728985147876337999961574234655342899999887427775326789989987777740333443 Q ss_pred EEEEEEECCCCCCCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCHHHCCCCCCCCCCCCCCCCCHHHH Q ss_conf 37999612665652100122232210356620011245567666651133211100000012222222222222210344 Q gi|254780464|r 83 HTSYHAWVLKEHVPLALEIIGDMLSNSSFNPSDIERERNVVLEEIGMSEDDSWDFLDARFSEMVWKDQIIGRPILGKPET 162 (424) Q Consensus 83 ~~~~~~~~~~~~~~~~l~ll~~~~~~p~~~~~~~~~ek~~v~~e~~~~~~~p~~~~~~~~~~~~f~~~~~~~~~~g~~~~ 162 (424) .|+|+++++++++++++++|+|++++.++.++++++||.+|+.|++..+++-....++.++..+|+++|+|++++|+.+. T Consensus 114 qT~yyakal~~dv~kavdiLaDIlqns~L~~s~IerER~vILrEmqevd~~~~eVVfdhLHatafQgtPL~~tilGp~en 193 (467) T KOG0960 114 QTVYYAKALSKDVPKAVDILADILQNSKLEESAIERERDVILREMQEVDKNHQEVVFDHLHATAFQGTPLGRTILGPSEN 193 (467) T ss_pred CEEEEHHHCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHH T ss_conf 13431232232216899999999873764266777888899999999875306799999888875589653322371555 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCHHHHCCCCCHHHHHHHHHHCCCCCCC---CCCCCCCCCCCCCCEEEEE-CCCCCCCEE Q ss_conf 200016677642011245663000002545512211221101235877---7788664223322102330-368432102 Q gi|254780464|r 163 ISSFTPEKIISFVSRNYTADRMYVVCVGAVDHEFCVSQVESYFNVCSV---AKIKESMKPAVYVGGEYIQ-KRDLAEEHM 238 (424) Q Consensus 163 i~~lt~~~l~~f~~~~y~p~n~~l~i~Gd~~~~~~~~~i~~~f~~~~~---~~~~~~~~p~~~~~~~~~~-~~~~~~~~i 238 (424) |++|+.+||++|.+++|.++||+|+.+|.++|+++.++++++||+++. +..++...|+.++|.|++. +.+.|.+++ T Consensus 194 I~si~r~DL~~yi~thY~~~RmVlaaaGgV~He~lv~la~k~fg~~~~~~~~~~~~~~~~~~FtgsEvR~rdd~lP~a~~ 273 (467) T KOG0960 194 IKSISRADLKDYINTHYKASRMVLAAAGGVKHEELVKLAEKYFGDLSKLQTGDKVPLVPPARFTGSEVRVRDDDLPLAHI 273 (467) T ss_pred HHHHHHHHHHHHHHHCCCCCCEEEEECCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCHHHE T ss_conf 66634999999998546675279980688677999999999718886324676777788764337445653788741011 Q ss_pred EECCCCCCCCCCHHHHHHHHHHHHHCC---------CCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEEEC-CCCHH Q ss_conf 301000013521135689987776248---------76532222000268621013332221012201344425-83024 Q gi|254780464|r 239 MLGFNGCAYQSRDFYLTNILASILGDG---------MSSRLFQEVREKRGLCYSISAHHENFSDNGVLYIASAT-AKENI 308 (424) Q Consensus 239 ~~~~~~~~~~~~d~~al~ll~~iLg~g---------~sSrL~~~lRe~~glaY~v~s~~~~~~~~g~~~i~~~~-~~~~~ 308 (424) .+++.+++|.+||++++++++.++|.+ .+|+|-+.+-+. .++-++.+|...|.|+|+|++|+.| .+..+ T Consensus 274 AiAVEG~~w~~pD~~~l~van~iiG~wdr~~g~g~~~~s~La~~~~~~-~l~~sfqsFnt~YkDTGLwG~y~V~~~~~~i 352 (467) T KOG0960 274 AIAVEGVSWAHPDYFALMVANTIIGNWDRTEGGGRNLSSRLAQKIQQD-QLCHSFQSFNTSYKDTGLWGIYFVTDNLTMI 352 (467) T ss_pred EEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHH-HHHHHHHHHHCCCCCCCCEEEEEEECCHHHH T ss_conf 466743776882279999999985200103577667753889998887-7888776521234345532699874681467 Q ss_pred HHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHCCCHHHHHHHH Q ss_conf 77787789999853148998999999999999998513898999999999999689847989999988727989999999 Q gi|254780464|r 309 MALTSSIVEVVQSLLENIEQREIDKECAKIHAKLIKSQERSYLRALEISKQVMFCGSILCSEKIIDTISAITCEDIVGVA 388 (424) Q Consensus 309 ~~~~~~i~~~i~~l~~~i~~~el~~ak~~~~~~~~~~~~~~~~~a~~l~~~~~~~~~~~~~~~~~~~i~~vt~~di~~~~ 388 (424) +..+..+.++|..++..+++.|++|||++++.++++.+|.+...+..++++++.+|+..++.|..++|++||+.+|++++ T Consensus 353 ddl~~~vl~eW~rL~~~vteaEV~RAKn~Lkt~Lll~ldgttpi~ediGrqlL~~Grri~l~El~~rId~vt~~~Vr~va 432 (467) T KOG0960 353 DDLIHSVLKEWMRLATSVTEAEVERAKNQLKTNLLLSLDGTTPIAEDIGRQLLTYGRRIPLAELEARIDAVTAKDVREVA 432 (467) T ss_pred HHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHCCCCCCHHHHHHHHHHCCHHHHHHHH T ss_conf 89999999999999866169999999999999999871389803778777785547868869999998541599999999 Q ss_pred HHHC-CCCCEEEEEECCCCCCCCHHHHHHHHHHHH Q ss_conf 9862-778779998166010899899999884342 Q gi|254780464|r 389 KKIF-SSTPTLAILGPPMDHVPTTSELIHALEGFR 422 (424) Q Consensus 389 ~~~l-~~~~~i~~vgp~~~~~P~~~~l~~~l~~~~ 422 (424) .+|+ +.+.+++.+| |.+.+|++..++..|.-+| T Consensus 433 ~k~iyd~~iAia~vG-~ie~lpdy~~irs~m~w~R 466 (467) T KOG0960 433 SKYIYDKDIAIAAVG-PIEGLPDYNRIRSGMSWMR 466 (467) T ss_pred HHHHHCCCCCEEEEC-CCCCCCHHHHHHCCCHHHC T ss_conf 977514783036634-4103741889961003524 No 2 >COG0612 PqqL Predicted Zn-dependent peptidases [General function prediction only] Probab=100.00 E-value=0 Score=525.79 Aligned_cols=406 Identities=35% Similarity=0.561 Sum_probs=368.7 Q ss_pred CEEEEECCCCCEEEEECCC-CCEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHCCEEEEEEC Q ss_conf 6189987987199998589-987999999833114488889834899998973356888988999999970876799753 Q gi|254780464|r 2 NLRISKTSSGITVITEVMP-IDSAFVKVNIRAGSRNERQEEHGMAHFLEHMLFKGTTKRTAKEIVEEIEKVGGDINAYTS 80 (424) Q Consensus 2 ~~~~~~L~NGl~v~~~~~~-~~~~~i~~~~~~Gs~~e~~~~~G~ah~lehl~~~gt~~~~~~~~~~~l~~~G~~~na~t~ 80 (424) .++..+|+||+++++.+++ .+.+++++||++||++|++++.|+||+||||+|+||.+++..++.+.++.+||..||+|+ T Consensus 16 ~~~~~~L~nGl~~~~~~~~~~~~vs~~~~v~~Gs~~e~~~~~G~AH~lehm~fkgt~~~~~~~i~~~~~~~G~~~na~ts 95 (438) T COG0612 16 GLQVFTLPNGLRVITYPNPTAPTVSLDVWVKAGSRAEPAGKAGIAHFLEHMAFKGTTGLPSAELAEAFEKLGGQLNAFTS 95 (438) T ss_pred CCEEEECCCCCEEEEEECCCCCEEEEEEEEECCCCCCCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHCCCCEEEEEC T ss_conf 55799847875899884588874899999943644678862339999999971799888767899999975991456752 Q ss_pred CCEEEEEEECCCCCCCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCHHHCCCCCCCCCCCCCCCCCHH Q ss_conf 22379996126656521001222322103566200112455676666511332111000000122222222222222103 Q gi|254780464|r 81 LEHTSYHAWVLKEHVPLALEIIGDMLSNSSFNPSDIERERNVVLEEIGMSEDDSWDFLDARFSEMVWKDQIIGRPILGKP 160 (424) Q Consensus 81 ~d~~~~~~~~~~~~~~~~l~ll~~~~~~p~~~~~~~~~ek~~v~~e~~~~~~~p~~~~~~~~~~~~f~~~~~~~~~~g~~ 160 (424) +|+|+|++++++++++.+|++++|++.+|.|++++|++||.++++|+++..++|...+++.+.+.+|++||||++++|+. T Consensus 96 ~d~t~y~~~~l~~~~~~~l~llad~l~~p~f~~~~~e~Ek~vil~ei~~~~d~p~~~~~~~l~~~~~~~~p~~~~~~G~~ 175 (438) T COG0612 96 FDYTVYYLSVLPDNLDKALDLLADILLNPTFDEEEVEREKGVILEEIRMRQDDPDDLAFERLLEALYGNHPLGRPILGTE 175 (438) T ss_pred CCCEEEEEECCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCCCCCCCH T ss_conf 55134552146653899999999998578999899999999999999984038788999999999748997888998897 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCCHHHHCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCC-CCCCCEEEEE-----CCCCC Q ss_conf 44200016677642011245663000002545512211221101235877778866422-3322102330-----36843 Q gi|254780464|r 161 ETISSFTPEKIISFVSRNYTADRMYVVCVGAVDHEFCVSQVESYFNVCSVAKIKESMKP-AVYVGGEYIQ-----KRDLA 234 (424) Q Consensus 161 ~~i~~lt~~~l~~f~~~~y~p~n~~l~i~Gd~~~~~~~~~i~~~f~~~~~~~~~~~~~p-~~~~~~~~~~-----~~~~~ 234 (424) ++|.++|+++|++||++||+|+||+|+||||++++++..+++++|++|+....+....+ +...+...+. .++.. T Consensus 176 e~I~~it~~dl~~f~~k~Y~p~n~~l~vvGdi~~~~v~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 255 (438) T COG0612 176 ESIEAITREDLKDFYQKWYQPDNMVLVVVGDVDAEEVVELIEKYFGDLPGAAPPPKIPPEPPLGPERVVRVNDPEQPDLE 255 (438) T ss_pred HHHHHCCHHHHHHHHHHHCCCCCEEEEEECCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCHHH T ss_conf 88734799999999998568147389997799989999999998433777788777777666788735861355563166 Q ss_pred CCEEEECCCCCCCCCC-HHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEEECCCCHHHHHHH Q ss_conf 2102301000013521-135689987776248765322220002686210133322210122013444258302477787 Q gi|254780464|r 235 EEHMMLGFNGCAYQSR-DFYLTNILASILGDGMSSRLFQEVREKRGLCYSISAHHENFSDNGVLYIASATAKENIMALTS 313 (424) Q Consensus 235 ~~~i~~~~~~~~~~~~-d~~al~ll~~iLg~g~sSrL~~~lRe~~glaY~v~s~~~~~~~~g~~~i~~~~~~~~~~~~~~ 313 (424) ++++.+++++++..++ +++++.+++.+||++++||||+++|+++||+|+++++...+.+.|.+.++..+.+++.+.+.+ T Consensus 256 ~~~~~~g~~~~~~~~~~~~~~~~l~~~llgg~~~SrLf~~~re~~glay~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 335 (438) T COG0612 256 QAWLALGYPGPDYDSPDDYAALLLLNGLLGGGFSSRLFQELREKRGLAYSVSSFSDFLSDSGLFSIYAGTAPENPEKTAE 335 (438) T ss_pred HHHHHCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHCCEEEEEECCCCCCCCCCCEEEEEEECCCCHHHHHH T ss_conf 55541121367744255699999999984686522889999996581588612543211268437999845667667999 Q ss_pred HHHHHHHHHCCC----CCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHCCCHHHHHHHHH Q ss_conf 789999853148----9989999999999999985138989999999999996898479899999887279899999999 Q gi|254780464|r 314 SIVEVVQSLLEN----IEQREIDKECAKIHAKLIKSQERSYLRALEISKQVMFCGSILCSEKIIDTISAITCEDIVGVAK 389 (424) Q Consensus 314 ~i~~~i~~l~~~----i~~~el~~ak~~~~~~~~~~~~~~~~~a~~l~~~~~~~~~~~~~~~~~~~i~~vt~~di~~~~~ 389 (424) .+.+++..+.+. +++++++++|..+...+....+++...+..+......+++..+.+++.+.|++||++||+++++ T Consensus 336 ~i~~~~~~~~~~~~~~~t~~~~~~~k~~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~vt~~dv~~~a~ 415 (438) T COG0612 336 LVEEILKALKKGLKGPFTEEELDAAKQLLIGLLLLSLDSPSSIAELLGQYLLLGGSLITLEELLERIEAVTLEDVNAVAK 415 (438) T ss_pred HHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHCCHHHHHHHHH T ss_conf 99999999986143547799999999999888887038899999999999971577679999999997259999999999 Q ss_pred HHCCCC-CEEEEEECCCCC Q ss_conf 862778-779998166010 Q gi|254780464|r 390 KIFSST-PTLAILGPPMDH 407 (424) Q Consensus 390 ~~l~~~-~~i~~vgp~~~~ 407 (424) +++.++ .+++++||.... T Consensus 416 ~~~~~~~~~~~~~~p~~~~ 434 (438) T COG0612 416 KLLAPENLTIVVLGPEKAL 434 (438) T ss_pred HHCCCCCCEEEEECCCCCC T ss_conf 8507567489996672224 No 3 >KOG2067 consensus Probab=100.00 E-value=0 Score=465.24 Aligned_cols=417 Identities=27% Similarity=0.421 Sum_probs=392.7 Q ss_pred CEEEEECCCCCEEEEECCCCCEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHCCEEEEEECC Q ss_conf 61899879871999985899879999998331144888898348999989733568889889999999708767997532 Q gi|254780464|r 2 NLRISKTSSGITVITEVMPIDSAFVKVNIRAGSRNERQEEHGMAHFLEHMLFKGTTKRTAKEIVEEIEKVGGDINAYTSL 81 (424) Q Consensus 2 ~~~~~~L~NGl~v~~~~~~~~~~~i~~~~~~Gs~~e~~~~~G~ah~lehl~~~gt~~~~~~~~~~~l~~~G~~~na~t~~ 81 (424) +.++|+|+||+||.+++.+++-+++++++++||++|.+.-.|++|+||.|+|++|.+++..++.+++|.+||...|.+++ T Consensus 24 ~~kvttL~NGlkvase~~pg~f~~vGlyIdsGsrYE~~~~~GisH~lerLAF~ST~~~~~~ei~~~LE~~GGn~~cqsSR 103 (472) T KOG2067 24 NTKVTTLPNGLKVASENTPGQFCTVGLYIDSGSRYEAKYFSGISHFLERLAFKSTERFSSKEILAELEKLGGNCDCQSSR 103 (472) T ss_pred CCEEEECCCCCEEECCCCCCCCEEEEEEEECCCCCCCCCCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCCCCCCCH T ss_conf 33035658864784058888724788897047623376753179999998600435776789999999838922244337 Q ss_pred CEEEEEEECCCCCCCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCHHHCCCCCCCCCCCCCCCCCHHH Q ss_conf 23799961266565210012223221035662001124556766665113321110000001222222222222221034 Q gi|254780464|r 82 EHTSYHAWVLKEHVPLALEIIGDMLSNSSFNPSDIERERNVVLEEIGMSEDDSWDFLDARFSEMVWKDQIIGRPILGKPE 161 (424) Q Consensus 82 d~~~~~~~~~~~~~~~~l~ll~~~~~~p~~~~~~~~~ek~~v~~e~~~~~~~p~~~~~~~~~~~~f~~~~~~~~~~g~~~ 161 (424) |.++|.+++.+++++.++.+|+|.+.+|+|.+++++.+|..|.-|+...-.+|+-.+.+.++.++|.++.+|.|.+++.+ T Consensus 104 etm~Yaas~~~~~v~sm~~lLadtV~~P~~~d~ev~~~~~~v~~E~~el~~~Pe~lL~e~iH~Aay~~ntlg~pl~cp~~ 183 (472) T KOG2067 104 ETMMYAASADSDGVDSMVELLADTVLNPKFTDQEVEEARRAVKYEIEELWMRPEPLLTEMIHSAAYSGNTLGLPLLCPEE 183 (472) T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHECCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCHH T ss_conf 66677787542143899999999873665568999999976620112002482366999999987356765555538745 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCHHHHCCCCCHHHHHHHHHHCCCCCCCCCCCCCC-CCCCCCCEEEEECCCCC------ Q ss_conf 42000166776420112456630000025455122112211012358777788664-22332210233036843------ Q gi|254780464|r 162 TISSFTPEKIISFVSRNYTADRMYVVCVGAVDHEFCVSQVESYFNVCSVAKIKESM-KPAVYVGGEYIQKRDLA------ 234 (424) Q Consensus 162 ~i~~lt~~~l~~f~~~~y~p~n~~l~i~Gd~~~~~~~~~i~~~f~~~~~~~~~~~~-~p~~~~~~~~~~~~~~~------ 234 (424) .|++|+.+.|..|-+++|+|++|++..|| ++++++++.++++|+.||+.+.++.. .++.|+|++...+.|.+ T Consensus 184 ~i~~I~~~~l~~yl~~~ytp~rmVlA~vG-V~heelv~~~~~~~~~~~s~~~p~i~~~~aQYtGG~~~~~~d~~~~~~g~ 262 (472) T KOG2067 184 NIDKINREVLEEYLKYFYTPERMVLAGVG-VEHEELVEIAEKLLGDLPSTKVPPIDESKAQYTGGELKIDTDAPQVTGGP 262 (472) T ss_pred HHHHHHHHHHHHHHHHCCCHHHEEEEECC-CCHHHHHHHHHHHHCCCCCCCCCCCCCCHHHCCCCCCCCCCCCCCCCCCC T ss_conf 53130399999999861783253764048-87799999999973548766789855325440365113577776556762 Q ss_pred -CCEEEECCCCCCCCCCHHHHHHHHHHHHHCC-----------CCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEEE Q ss_conf -2102301000013521135689987776248-----------7653222200026862101333222101220134442 Q gi|254780464|r 235 -EEHMMLGFNGCAYQSRDFYLTNILASILGDG-----------MSSRLFQEVREKRGLCYSISAHHENFSDNGVLYIASA 302 (424) Q Consensus 235 -~~~i~~~~~~~~~~~~d~~al~ll~~iLg~g-----------~sSrL~~~lRe~~glaY~v~s~~~~~~~~g~~~i~~~ 302 (424) -+|++++|.+++++++|++++.+|+.++||| |.||||.++-.+..|.|++-+++..|.|+|+|+|+++ T Consensus 263 EltHv~lg~Eg~~~~deD~v~~avLq~lmGGGGSFSAGGPGKGMySrLY~~vLNry~wv~sctAfnhsy~DtGlfgi~~s 342 (472) T KOG2067 263 ELTHVVLGFEGCSWNDEDFVALAVLQMLMGGGGSFSAGGPGKGMYSRLYLNVLNRYHWVYSCTAFNHSYSDTGLFGIYAS 342 (472) T ss_pred CEEEEEEEECCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEECC T ss_conf 10245675115887870489999999973488565789998633999999998562788876665213467754688613 Q ss_pred CCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHCCCHH Q ss_conf 58302477787789999853148998999999999999998513898999999999999689847989999988727989 Q gi|254780464|r 303 TAKENIMALTSSIVEVVQSLLENIEQREIDKECAKIHAKLIKSQERSYLRALEISKQVMFCGSILCSEKIIDTISAITCE 382 (424) Q Consensus 303 ~~~~~~~~~~~~i~~~i~~l~~~i~~~el~~ak~~~~~~~~~~~~~~~~~a~~l~~~~~~~~~~~~~~~~~~~i~~vt~~ 382 (424) ++|+.+.+++..+.+++..+..+++++||+|||+++++.+++++|+.--.+.+++++.+..|.....+++++.|+++|++ T Consensus 343 ~~P~~a~~aveli~~e~~~~~~~v~~~el~RAK~qlkS~LlMNLESR~V~~EDvGRQVL~~g~rk~p~e~~~~Ie~lt~~ 422 (472) T KOG2067 343 APPQAANDAVELIAKEMINMAGGVTQEELERAKTQLKSMLLMNLESRPVAFEDVGRQVLTTGERKPPDEFIKKIEQLTPS 422 (472) T ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHCCHH T ss_conf 78788889999999999987078788999999999999987413103115887768997526757989999999856988 Q ss_pred HHHHHHHHHCCCCCEEEEEECCCCCCCCHHHHHHHHHH Q ss_conf 99999998627787799981660108998999998843 Q gi|254780464|r 383 DIVGVAKKIFSSTPTLAILGPPMDHVPTTSELIHALEG 420 (424) Q Consensus 383 di~~~~~~~l~~~~~i~~vgp~~~~~P~~~~l~~~l~~ 420 (424) ||.+++.++|.++++++..| +..++|+...+.++++. T Consensus 423 DI~rva~kvlt~~p~va~~G-d~~~lpt~~~i~~~~~~ 459 (472) T KOG2067 423 DISRVASKVLTGKPSVAAFG-DGTGLPTYDHIGNAVSS 459 (472) T ss_pred HHHHHHHHHHCCCCEECCCC-CCCCCCCHHHHHHHCCC T ss_conf 89999999834996002577-65688503313111124 No 4 >KOG2583 consensus Probab=100.00 E-value=1.5e-43 Score=311.05 Aligned_cols=398 Identities=22% Similarity=0.339 Sum_probs=326.9 Q ss_pred EEEEECCCCCEEEEECCCCCEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHCCEEEEEECCC Q ss_conf 18998798719999858998799999983311448888983489999897335688898899999997087679975322 Q gi|254780464|r 3 LRISKTSSGITVITEVMPIDSAFVKVNIRAGSRNERQEEHGMAHFLEHMLFKGTTKRTAKEIVEEIEKVGGDINAYTSLE 82 (424) Q Consensus 3 ~~~~~L~NGl~v~~~~~~~~~~~i~~~~~~Gs~~e~~~~~G~ah~lehl~~~gt~~~~~~~~~~~l~~~G~~~na~t~~d 82 (424) ...++|.|||+|.+.+.++|+.++.+.|++|||||+..+.|++|+++...++.|+++++..|.+.++.+||.++..++|| T Consensus 23 ~~~~kl~ngL~Vas~e~~~~is~l~l~~~AGSRYe~~~~~G~sHllr~f~g~~Tq~~sal~ivr~se~~GG~Lss~~tRe 102 (429) T KOG2583 23 SKTTKLVNGLTVASREAPTAISSLSLAFRAGSRYEPADQQGLSHLLRNFVGRDTQERSALKIVRESEQLGGTLSSTATRE 102 (429) T ss_pred HHHHCCCCCEEEEECCCCCCCEEEEEEEECCCCCCCCCCCCHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCEEEEEEECC T ss_conf 23112256418985227886237999982476577654224899999851667666621556455685286555553100 Q ss_pred EEEEEEECCCCCCCCCHHHHHHHHHCCCCCCCHHHHHH-HHHHHHHHHCCCCCCCCCHHHCCCCCCCCCCCCCCCCCHHH Q ss_conf 37999612665652100122232210356620011245-56766665113321110000001222222222222221034 Q gi|254780464|r 83 HTSYHAWVLKEHVPLALEIIGDMLSNSSFNPSDIERER-NVVLEEIGMSEDDSWDFLDARFSEMVWKDQIIGRPILGKPE 161 (424) Q Consensus 83 ~~~~~~~~~~~~~~~~l~ll~~~~~~p~~~~~~~~~ek-~~v~~e~~~~~~~p~~~~~~~~~~~~f~~~~~~~~~~g~~~ 161 (424) .+.|+++|++++++..|.+|.+++..|.|.+|+++.+. ..+..++. ..+|...+.+.+++.+|.+ .+|+|++.+.- T Consensus 103 ~~~~tvt~lrd~~~~~l~~L~~V~~~paFkPwEl~D~~~~ti~~~l~--~~t~~~~a~e~lH~aAfRn-gLgnslY~p~~ 179 (429) T KOG2583 103 LIGLTVTFLRDDLEYYLSLLGDVLDAPAFKPWELEDVVLATIDADLA--YQTPYTIAIEQLHAAAFRN-GLGNSLYSPGY 179 (429) T ss_pred EEEEEEEEECCCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHH--HCCHHHHHHHHHHHHHHHC-CCCCCCCCCCC T ss_conf 18999998553279999999986256675851012221453677764--1372889999999999862-35872237731 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCHHHHCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCEEEEECCCCCCCEEEEC Q ss_conf 42000166776420112456630000025455122112211012358777788664223322102330368432102301 Q gi|254780464|r 162 TISSFTPEKIISFVSRNYTADRMYVVCVGAVDHEFCVSQVESYFNVCSVAKIKESMKPAVYVGGEYIQKRDLAEEHMMLG 241 (424) Q Consensus 162 ~i~~lt~~~l~~f~~~~y~p~n~~l~i~Gd~~~~~~~~~i~~~f~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~i~~~ 241 (424) .+.+++.++|++|-+++|+..|++++.+ ++|++.++...++++...+ ..+....|+.+.+++.+........++.+. T Consensus 180 ~vg~vss~eL~~Fa~k~fv~gn~~lvg~-nvd~~~L~~~~~~~~~~~~--~~~~k~a~a~~~gGe~Rk~~~g~~~~v~va 256 (429) T KOG2583 180 QVGSVSSSELKDFAAKHFVKGNAVLVGV-NVDHDDLKQFADEYAPIRD--GLPLKPAPAKYSGGEARKDARGNRVHVAVA 256 (429) T ss_pred CCCCCCHHHHHHHHHHHHHCCCEEEEEC-CCCHHHHHHHHHHHCCCCC--CCCCCCCCCCCCCCCCCCCCCCCEEEEEEE T ss_conf 1267658999999998731144389952-7774899999987265668--877788874444874221368963699984 Q ss_pred CCCC-CCCCCHHHHHHHHHHHHHCCC-----CCCCCCCCCCCCC-CCCHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHH Q ss_conf 0000-135211356899877762487-----6532222000268-62101333222101220134442583024777877 Q gi|254780464|r 242 FNGC-AYQSRDFYLTNILASILGDGM-----SSRLFQEVREKRG-LCYSISAHHENFSDNGVLYIASATAKENIMALTSS 314 (424) Q Consensus 242 ~~~~-~~~~~d~~al~ll~~iLg~g~-----sSrL~~~lRe~~g-laY~v~s~~~~~~~~g~~~i~~~~~~~~~~~~~~~ 314 (424) -.+. .-+.++..+..++..+||.+. .+.|- ++--..| ..-+++++...|.|.|+|+++......+..++.+. T Consensus 257 gegAAa~~~k~~~a~av~~~~Lg~~~~~k~~t~~~~-~aa~~a~~~~~s~sA~~a~ysDsGL~gv~~~~~~~~a~~~v~s 335 (429) T KOG2583 257 GEGAAAGNLKVLAAQAVLLAALGNSAPVKRGTGLLS-EAAGAAGEQGASASAFNAPYSDSGLFGVYVSAQGSQAGKVVSS 335 (429) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCHHH-HHHHHCCCCCCEEEEECCCCCCCCEEEEEEEECCCCHHHHHHH T ss_conf 375433346889999999999713465455641678-8876304567504652155567854899997137507789999 Q ss_pred HHHHHHHH-CCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHCCCHHHHHHHHHHHCC Q ss_conf 89999853-14899899999999999999851389899999999999968984798999998872798999999998627 Q gi|254780464|r 315 IVEVVQSL-LENIEQREIDKECAKIHAKLIKSQERSYLRALEISKQVMFCGSILCSEKIIDTISAITCEDIVGVAKKIFS 393 (424) Q Consensus 315 i~~~i~~l-~~~i~~~el~~ak~~~~~~~~~~~~~~~~~a~~l~~~~~~~~~~~~~~~~~~~i~~vt~~di~~~~~~~l~ 393 (424) ....++.. ..+++......+++.++..+....+. . .+.......... ..++++..|++||..||++++++.+. T Consensus 336 ~v~~lks~~~~~id~~~~~a~~~~l~~~~~ss~~a---~--~~~~~~~a~~~~-~~d~~i~~id~Vt~sdV~~a~kk~~s 409 (429) T KOG2583 336 EVKKLKSALVSDIDNAKVKAAIKALKASYLSSVEA---L--ELATGSQANLVS-EPDAFIQQIDKVTASDVQKAAKKFLS 409 (429) T ss_pred HHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHH---H--HHHHHHHHCCCC-CHHHHHHHHCCCCHHHHHHHHHHHCC T ss_conf 99999988753875127899999998876310377---8--876577751777-84889887303468899999998603 Q ss_pred CCCEEEEEECCCCCCCCHHHH Q ss_conf 787799981660108998999 Q gi|254780464|r 394 STPTLAILGPPMDHVPTTSEL 414 (424) Q Consensus 394 ~~~~i~~vgp~~~~~P~~~~l 414 (424) .+..++.+| +...+|+.+|| T Consensus 410 ~kls~aA~G-nl~~vPY~DEL 429 (429) T KOG2583 410 GKLSLAAYG-NLSNVPYLDEL 429 (429) T ss_pred CCCEEEEEC-CCCCCCCCCCC T ss_conf 752156643-54577655569 No 5 >COG1025 Ptr Secreted/periplasmic Zn-dependent peptidases, insulinase-like [Posttranslational modification, protein turnover, chaperones] Probab=100.00 E-value=2.5e-41 Score=296.11 Aligned_cols=400 Identities=21% Similarity=0.206 Sum_probs=306.2 Q ss_pred CEEEEECCCCCEEEEECCC-CCEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHCCCCCCCH-HHHHHHHHHHCCEEEEEE Q ss_conf 6189987987199998589-98799999983311448888983489999897335688898-899999997087679975 Q gi|254780464|r 2 NLRISKTSSGITVITEVMP-IDSAFVKVNIRAGSRNERQEEHGMAHFLEHMLFKGTTKRTA-KEIVEEIEKVGGDINAYT 79 (424) Q Consensus 2 ~~~~~~L~NGl~v~~~~~~-~~~~~i~~~~~~Gs~~e~~~~~G~ah~lehl~~~gt~~~~~-~~~~~~l~~~G~~~na~t 79 (424) .+|..+|+||+++++...| .+..+..+.|++||.+||.+.+|+||++|||+|.||++||. ..+...|+..||+.||+| T Consensus 23 ~y~~I~LpNGl~~LlisDP~a~ks~aAL~V~vGs~~DP~e~~GLAHflEHmlfmGseKYP~~~~f~~fLskhgGs~NA~T 102 (937) T COG1025 23 KYRAIKLPNGLRALLVSDPQADKSSAALVVPVGSFDDPEEYPGLAHFLEHMLFMGSEKYPDEGGFSEFLSKHGGSHNAST 102 (937) T ss_pred CEEEEECCCCCEEEEECCCCCCCCCEEEEEECCCCCCHHHCCCHHHHHHHHHHHCCCCCCCCCCHHHHHHHCCCCCCCCC T ss_conf 23689889996688841899876341588604887893442147999999997267668996406899997499445523 Q ss_pred CCCEEEEEEECCCCCCCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCHHHCCCCCCCCCCCCCCCCCH Q ss_conf 32237999612665652100122232210356620011245567666651133211100000012222222222222210 Q gi|254780464|r 80 SLEHTSYHAWVLKEHVPLALEIIGDMLSNSSFNPSDIERERNVVLEEIGMSEDDSWDFLDARFSEMVWKDQIIGRPILGK 159 (424) Q Consensus 80 ~~d~~~~~~~~~~~~~~~~l~ll~~~~~~p~~~~~~~~~ek~~v~~e~~~~~~~p~~~~~~~~~~~~f~~~~~~~~~~g~ 159 (424) ..+.|+|++++..+.++.+|+.+++.+.+|.|+++.+++|+..|-+|..+...+...++++.....+-++||+.+...|. T Consensus 103 ~~~~T~fyFeV~~~al~~ALDrFa~ff~~PLf~~e~~dRE~~AV~sE~~~~~~~D~~R~~~~~~~~~np~HP~srFs~GN 182 (937) T COG1025 103 AGERTAFYFEVENDALEGALDRFADFFIEPLFNKEALDRERNAVNSEFTMNLTSDGWRMYQVQALTANPGHPLSKFSTGN 182 (937) T ss_pred CCCCEEEEEEECHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCC T ss_conf 77750699983678889999999998756103867788899998799862767317899999986369899742057888 Q ss_pred HHHHHH----HHHHHHHHHHCCCCCCCCCHHHHCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCC----EEEE-EC Q ss_conf 344200----01667764201124566300000254551221122110123587777886642233221----0233-03 Q gi|254780464|r 160 PETISS----FTPEKIISFVSRNYTADRMYVVCVGAVDHEFCVSQVESYFNVCSVAKIKESMKPAVYVG----GEYI-QK 230 (424) Q Consensus 160 ~~~i~~----lt~~~l~~f~~~~y~p~n~~l~i~Gd~~~~~~~~~i~~~f~~~~~~~~~~~~~p~~~~~----~~~~-~~ 230 (424) .++|.. -..++++.||+++|.++||+++|.|+-+.+++.+++.++||.+|......+..|.+... ...+ .. T Consensus 183 ~~TL~~~p~~~v~~el~ef~~~~YSa~~M~lviyg~q~ldeL~~~a~~~F~~Ipn~~~~~p~~p~p~~~d~~t~~ii~i~ 262 (937) T COG1025 183 LETLSDKPGLVVQQELKEFHEKHYSANNMKLVIYGNQPLDELAKLAADLFGDIPNRARKIPPIPVPVVTDEQTGKIIHIV 262 (937) T ss_pred HHHHCCCCCCHHHHHHHHHHHHHCCHHHEEEEEECCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHCCEEEEC T ss_conf 65404677855899999999971686456999966899789999999984768877788899999998957847647841 Q ss_pred CCCCCCEEEECCCCCCCCCCH-HHHHHHHHHHHHC-CCCCCCCCCCCCCCCCCCHHHHHHHHHH-CCCCEEEEEECC--- Q ss_conf 684321023010000135211-3568998777624-8765322220002686210133322210-122013444258--- Q gi|254780464|r 231 RDLAEEHMMLGFNGCAYQSRD-FYLTNILASILGD-GMSSRLFQEVREKRGLCYSISAHHENFS-DNGVLYIASATA--- 304 (424) Q Consensus 231 ~~~~~~~i~~~~~~~~~~~~d-~~al~ll~~iLg~-g~sSrL~~~lRe~~glaY~v~s~~~~~~-~~g~~~i~~~~~--- 304 (424) +-.+...+.+.|+.++..+.. .-+...+.++||. +.+|.+ . ...+.||+-+++++..... +.|.|.|..... T Consensus 263 p~~~~~~L~i~f~i~~~~~~~~~~~~~~~s~Lig~es~gsL~-~-~Lk~~Glit~l~a~~~~~~~n~~~f~is~~LT~~G 340 (937) T COG1025 263 PAKPRPRLRIYFPIDDNSAKFRSKPDEYLSHLIGNESPGSLL-A-WLKKQGLITELSAGLDPISGNYGVFAISYELTDKG 340 (937) T ss_pred CCCCCCEEEEEEECCCCCCCCCCCCHHHHHHHHCCCCCCHHH-H-HHHHCCCHHHHCCCCCCCCCCCCEEEEEEEHHHCC T ss_conf 378874589998727864322338799999996468986099-9-99866435421045401357762499986342110 Q ss_pred CCHHHHHHHHHHHHHHHHCC-CCCHHHHHHHHHHHHHHHHH-HCCCHHHHHHHHHHHHHHCCCCCCHHHHH---HHHHCC Q ss_conf 30247778778999985314-89989999999999999985-13898999999999999689847989999---988727 Q gi|254780464|r 305 KENIMALTSSIVEVVQSLLE-NIEQREIDKECAKIHAKLIK-SQERSYLRALEISKQVMFCGSILCSEKII---DTISAI 379 (424) Q Consensus 305 ~~~~~~~~~~i~~~i~~l~~-~i~~~el~~ak~~~~~~~~~-~~~~~~~~a~~l~~~~~~~~~~~~~~~~~---~~i~~v 379 (424) -.+.++++..+.+-++-+++ ++....++...+.....+.. +...+......++-.+.. .+.+.+. -.+..- T Consensus 341 l~~~~~VI~~~F~yl~~l~~~~~~~~~f~Elq~v~~l~f~y~~~t~~~~~~~~l~~~m~~----~p~~~~~~~~~~~~~y 416 (937) T COG1025 341 LAHYDRVIALTFQYLNLLREKGIPKYTFDELQNVLDLDFRYPSKTRPMDYVSWLADNMER----EPVEHTLYASLVLPRY 416 (937) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHCCC----CCHHHHHCHHHCCCCC T ss_conf 032899999999999998861613667899988887652130248817789999875546----7756540221024543 Q ss_pred CHHHHHHHHHHHCCCCCEEEEEECCCCC Q ss_conf 9899999999862778779998166010 Q gi|254780464|r 380 TCEDIVGVAKKIFSSTPTLAILGPPMDH 407 (424) Q Consensus 380 t~~di~~~~~~~l~~~~~i~~vgp~~~~ 407 (424) ..++++.++.-+...+..+..+++..+. T Consensus 417 d~~~~~~~l~~~~pen~R~~lis~~~~~ 444 (937) T COG1025 417 DPKAIQERLALMTPENARLWLISKLEEH 444 (937) T ss_pred CHHHHHHHHHHHCCCCEEEEEECCCCCC T ss_conf 8799999998638002699996378775 No 6 >KOG0959 consensus Probab=100.00 E-value=4.6e-41 Score=294.31 Aligned_cols=402 Identities=19% Similarity=0.216 Sum_probs=303.7 Q ss_pred CEEEEECCCCCEEEEECCC-CCEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHCCCCCCCH-HHHHHHHHHHCCEEEEEE Q ss_conf 6189987987199998589-98799999983311448888983489999897335688898-899999997087679975 Q gi|254780464|r 2 NLRISKTSSGITVITEVMP-IDSAFVKVNIRAGSRNERQEEHGMAHFLEHMLFKGTTKRTA-KEIVEEIEKVGGDINAYT 79 (424) Q Consensus 2 ~~~~~~L~NGl~v~~~~~~-~~~~~i~~~~~~Gs~~e~~~~~G~ah~lehl~~~gt~~~~~-~~~~~~l~~~G~~~na~t 79 (424) ..+..+|+|||++++...| .+..+..+-|++||..||.+.+|+||++|||+|.||++||. .++...+...||+.||+| T Consensus 27 ~yr~~~L~Ngl~alLisDp~tD~ssaal~V~vGS~~DP~dl~GLAHF~EHMlFmGS~KYP~En~y~~~lsk~gGssNA~T 106 (974) T KOG0959 27 EYRGIELTNGLRALLISDPKTDKSSAALDVKVGSFSDPEDLQGLAHFCEHMLFMGSEKYPDENEYSKFLSKNGGSSNAYT 106 (974) T ss_pred CEEEEEECCCCEEEEECCCCCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCHHHHHHHHCCCCCCCCC T ss_conf 23579805775699802899886430245400465782112118999999986246447874136788875288654444 Q ss_pred CCCEEEEEEECCCCCCCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCHHHCCCCCCCCCCCCCCCCCH Q ss_conf 32237999612665652100122232210356620011245567666651133211100000012222222222222210 Q gi|254780464|r 80 SLEHTSYHAWVLKEHVPLALEIIGDMLSNSSFNPSDIERERNVVLEEIGMSEDDSWDFLDARFSEMVWKDQIIGRPILGK 159 (424) Q Consensus 80 ~~d~~~~~~~~~~~~~~~~l~ll~~~~~~p~~~~~~~~~ek~~v~~e~~~~~~~p~~~~~~~~~~~~f~~~~~~~~~~g~ 159 (424) +.|+|+|++.+..++++.+|+.+++.+.+|.|.++..+||+..|.+|.+....+...+.++.+....=++||+++...|. T Consensus 107 ~~e~T~y~F~V~~~~l~~ALDrFaqFf~~Plf~~~a~eREv~AVdSE~~~nl~~D~wr~~ql~~~l~~~~hp~~kF~tGN 186 (974) T KOG0959 107 DSEHTNYYFDVQHDHLEGALDRFAQFFSDPLFNKSATEREVGAVDSEHEKNLNSDGWRFDQLLRSLSNPGHPYSKFSTGN 186 (974) T ss_pred CCCCCEEEEECCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHCCCCCCCHHCCCCC T ss_conf 55442578733647778899999998627123768888899888789873457513678999997548897510114563 Q ss_pred HHHHHHHH-----HHHHHHHHCCCCCCCCCHHHHCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCCC-CC----CEEEEE Q ss_conf 34420001-----667764201124566300000254551221122110123587777886642233-22----102330 Q gi|254780464|r 160 PETISSFT-----PEKIISFVSRNYTADRMYVVCVGAVDHEFCVSQVESYFNVCSVAKIKESMKPAV-YV----GGEYIQ 229 (424) Q Consensus 160 ~~~i~~lt-----~~~l~~f~~~~y~p~n~~l~i~Gd~~~~~~~~~i~~~f~~~~~~~~~~~~~p~~-~~----~~~~~~ 229 (424) .++|.... .+.|++||++||.+++|+++|+|+.+.+.+..++.+.|+.++....+.+.-|.. +. +..+.. T Consensus 187 ~~tL~~~p~~~~~r~~L~kF~k~~Yssn~M~l~i~G~eslD~Le~lv~~~F~~i~N~~~~~p~f~~~p~~~e~~~~~~~v 266 (974) T KOG0959 187 KKTLLEGPREIDLRDELLKFYKNWYSSNIMTLVIVGKESLDVLESLVTRLFDEISNKKKPRPVFPEPPFLPEELKKLVRV 266 (974) T ss_pred HHHHHHCCCCCHHHHHHHHHHHHHCCCCCCEEEEECCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHCCEEEE T ss_conf 66663054411189999999985255554169997478724888999987266555677787656788784886758999 Q ss_pred CCCCCCCEEEECCCCCCCCC-CHHHHHHHHHHHHH-CCCCCCCCCCCCCCCCCCCHHHHHHH-HHHCCCCEEEEEECCC- Q ss_conf 36843210230100001352-11356899877762-48765322220002686210133322-2101220134442583- Q gi|254780464|r 230 KRDLAEEHMMLGFNGCAYQS-RDFYLTNILASILG-DGMSSRLFQEVREKRGLCYSISAHHE-NFSDNGVLYIASATAK- 305 (424) Q Consensus 230 ~~~~~~~~i~~~~~~~~~~~-~d~~al~ll~~iLg-~g~sSrL~~~lRe~~glaY~v~s~~~-~~~~~g~~~i~~~~~~- 305 (424) .+-.....+.+.|+.|+..+ -+.-+.+.+.+++| .|.+|.+ ..|+ .+||+-+..+... ...+.+.|.+.+.... T Consensus 267 ~pik~~~~l~is~~~p~~~~~y~~kP~~y~~hLigheg~GSL~-~~Lk-~~gw~~sl~a~~~~~as~~~~f~v~idLtd~ 344 (974) T KOG0959 267 VPIKDGRSLMISWPVPPLNHHYKSKPLRYLSHLIGHEGPGSLL-SYLK-RLGWATSLEAGIPEFASGYSFFNVSIDLTDE 344 (974) T ss_pred EECCCCCEEEEEEECCCCCCCCCCCCHHHHHHHHCCCCCCHHH-HHHH-HHHCHHEEECCCCCCCCCCCEEEEEEEECCC T ss_conf 7235562689998438855411148189999984467862199-9999-7323211222777531354048999970013 Q ss_pred --CHHHHHHHHHHHHHHHHCC-CCCHHHHHHHHHHHHH-HHHHHCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHCCCH Q ss_conf --0247778778999985314-8998999999999999-99851389899999999999968984798999998872798 Q gi|254780464|r 306 --ENIMALTSSIVEVVQSLLE-NIEQREIDKECAKIHA-KLIKSQERSYLRALEISKQVMFCGSILCSEKIIDTISAITC 381 (424) Q Consensus 306 --~~~~~~~~~i~~~i~~l~~-~i~~~el~~ak~~~~~-~~~~~~~~~~~~a~~l~~~~~~~~~~~~~~~~~~~i~~vt~ 381 (424) +++++++..+..-++.+.. +...--+++....-.. ......+.+...+..++.+..+ .+....-.....+....+ T Consensus 345 G~e~~~~ii~~~f~yi~~l~~~~~~~~i~~E~~~~~~~~Frf~~k~~p~~~~~~~~~nlq~-~P~~~il~~~~ll~~~~p 423 (974) T KOG0959 345 GLEHVDEIIGLVFNYIKLLQSAGPEKWIFKELQLISEVKFRFQDKEPPMEYASEIASNLQY-YPVEDVLTGSYLLTEFDP 423 (974) T ss_pred CCHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHEEECCCCCCHHHHHHHHHHCCC-CCHHHHHCCHHHHHHCCH T ss_conf 2011999999999999999862713479999987653334301048967899999864233-786876320555532183 Q ss_pred HHHHHHHHHHCCCCCEEEEEECCCC Q ss_conf 9999999986277877999816601 Q gi|254780464|r 382 EDIVGVAKKIFSSTPTLAILGPPMD 406 (424) Q Consensus 382 ~di~~~~~~~l~~~~~i~~vgp~~~ 406 (424) +.|+.+...+-..+..++++..... T Consensus 424 ~~i~~~~~~L~p~n~~v~~~s~~~~ 448 (974) T KOG0959 424 DLIQEVLSSLVPSNMRVILVSRSFE 448 (974) T ss_pred HHHHHHHHHCCCCCCEEEEEEECCC T ss_conf 8888898750755441466540124 No 7 >COG1026 Predicted Zn-dependent peptidases, insulinase-like [General function prediction only] Probab=100.00 E-value=4.7e-37 Score=267.48 Aligned_cols=408 Identities=17% Similarity=0.209 Sum_probs=303.2 Q ss_pred CCCCEEEEECCCCCEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHCCCCCCCHHH-HHHHH-HHHCCEEEEEECCCEEEE Q ss_conf 9871999985899879999998331144888898348999989733568889889-99999-970876799753223799 Q gi|254780464|r 9 SSGITVITEVMPIDSAFVKVNIRAGSRNERQEEHGMAHFLEHMLFKGTTKRTAKE-IVEEI-EKVGGDINAYTSLEHTSY 86 (424) Q Consensus 9 ~NGl~v~~~~~~~~~~~i~~~~~~Gs~~e~~~~~G~ah~lehl~~~gt~~~~~~~-~~~~l-~~~G~~~na~t~~d~~~~ 86 (424) ++|+++++..++-+...+++.|+ .+|.+..|+||.|||++|+|+++||-.+ |...+ .+++..+||+|..|.|+| T Consensus 28 ~TGa~l~hi~~~d~~~vFsi~F~----T~p~dstGVaHiLEHtvlcGS~kYPvkdPF~~ml~rSLntF~NA~T~~D~T~Y 103 (978) T COG1026 28 KTGAELAHIKNEDPNNVFSIAFK----TEPHDSTGVAHILEHTVLCGSKKYPVKDPFFKMLKRSLNTFLNAFTFPDKTVY 103 (978) T ss_pred CCCCEEEEECCCCCCCEEEEEEE----CCCCCCCCCCHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCEE T ss_conf 67866998447985736889961----68987778406887776517777888876999987768777763117886453 Q ss_pred EEECCC-CCCCCCHHHHHHHHHCCCCCCCHHHHH--------------HHHHHHHHHHCCCCCCCCCHHHCCCCCCCCCC Q ss_conf 961266-565210012223221035662001124--------------55676666511332111000000122222222 Q gi|254780464|r 87 HAWVLK-EHVPLALEIIGDMLSNSSFNPSDIERE--------------RNVVLEEIGMSEDDSWDFLDARFSEMVWKDQI 151 (424) Q Consensus 87 ~~~~~~-~~~~~~l~ll~~~~~~p~~~~~~~~~e--------------k~~v~~e~~~~~~~p~~~~~~~~~~~~f~~~~ 151 (424) .++..- +++-.+|..+.|.+++|.+.++.|..| +++|.+|++....++..++++.+.+.+|++.. T Consensus 104 P~sS~~~~Df~NLl~VYlDavf~PlL~~e~F~QEgwr~e~~~~~~l~~~GVVyNEMKGa~ss~~~~~~~~~~~slfp~~t 183 (978) T COG1026 104 PASSANEKDFYNLLSVYLDAVFHPLLTKESFLQEGWRIEFKDESNLKYKGVVYNEMKGAYSSGESVLSRAMQQSLFPGTT 183 (978) T ss_pred ECCCCCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHCCCCCCCCCCEEEEEEEEHHCCCCCCCHHHHHHHHHHHHCCCCC T ss_conf 54106840277888998876327311367775543201667876512301775000131058056899998876388765 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHCCCCCHHHHHHHHHHC-CCCCCCCCCCCCCC--CCCCCCEEE- Q ss_conf 22222210344200016677642011245663000002545512211221101-23587777886642--233221023- Q gi|254780464|r 152 IGRPILGKPETISSFTPEKIISFVSRNYTADRMYVVCVGAVDHEFCVSQVESY-FNVCSVAKIKESMK--PAVYVGGEY- 227 (424) Q Consensus 152 ~~~~~~g~~~~i~~lt~~~l~~f~~~~y~p~n~~l~i~Gd~~~~~~~~~i~~~-f~~~~~~~~~~~~~--p~~~~~~~~- 227 (424) ||....|.++.|..+|.+++++||+++|+|+|+.+.++||++.++++..+++. +...+......+.+ +......+. T Consensus 184 y~~~SGG~P~~I~~LtyE~~r~FHkk~Y~pSN~~i~~yGni~~~~~L~~iee~~l~~~~k~~~~~~i~~~~~~~~~~~~~ 263 (978) T COG1026 184 YGVNSGGDPKNIPDLTYEEFRAFHKKHYHPSNCKIFVYGNIPTERLLDFIEEKVLRPFGKRELDVPIPDQKAFKKPRRKV 263 (978) T ss_pred CCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCEEEEEECCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 41058999430324677999999997378454399997897879999887786512246322589988630138635540 Q ss_pred ----E--ECCCCCCCEEEECCCCCCCCC-CHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCC-CHHHHHHHHHHCCCCEE- Q ss_conf ----3--036843210230100001352-11356899877762487653222200026862-10133322210122013- Q gi|254780464|r 228 ----I--QKRDLAEEHMMLGFNGCAYQS-RDFYLTNILASILGDGMSSRLFQEVREKRGLC-YSISAHHENFSDNGVLY- 298 (424) Q Consensus 228 ----~--~~~~~~~~~i~~~~~~~~~~~-~d~~al~ll~~iLg~g~sSrL~~~lRe~~gla-Y~v~s~~~~~~~~g~~~- 298 (424) + ...|..|..+.+.|.+++..+ .+.+++.||..+|-++.+|+|.++|.|- |+. +.++.++........|. T Consensus 264 ~~ypi~~~~~de~q~~~~lsWl~~~~~d~~~~lal~vL~~iLl~~~asPl~~~lies-glg~~~~~g~~~~~~~~~~f~v 342 (978) T COG1026 264 LEYPISFDEEDEDQGLLSLSWLGGSASDAEDSLALEVLEEILLDSAASPLTQALIES-GLGFADVSGSYDSDLKETIFSV 342 (978) T ss_pred EEECCCCCCCCCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHC-CCCCCCCCCEECCCCCEEEEEE T ss_conf 250467777887626999987257733598888999999997167622778998872-7885323553425500257999 Q ss_pred EEEECCCCHHHHHHHHHHHHHHHHC-CCCCHHHHHHHHHHHHHHHHHHCCCHHHH--HHHHHHHHHHCCCCCCHH---HH Q ss_conf 4442583024777877899998531-48998999999999999998513898999--999999999689847989---99 Q gi|254780464|r 299 IASATAKENIMALTSSIVEVVQSLL-ENIEQREIDKECAKIHAKLIKSQERSYLR--ALEISKQVMFCGSILCSE---KI 372 (424) Q Consensus 299 i~~~~~~~~~~~~~~~i~~~i~~l~-~~i~~~el~~ak~~~~~~~~~~~~~~~~~--a~~l~~~~~~~~~~~~~~---~~ 372 (424) +.-+++.++..+..+.+.+.++++. +||..+.++.++.++..++......+... +......++.|+++...- .+ T Consensus 343 ~~~gv~~ek~~~~k~lV~~~L~~l~~~gi~~~~ie~~~~q~E~s~ke~~s~pfgl~l~~~~~~gw~~G~dp~~~Lr~~~~ 422 (978) T COG1026 343 GLKGVSEEKIAKLKNLVLSTLKELVKNGIDKKLIEAILHQLEFSLKEVKSYPFGLGLMFRSLYGWLNGGDPEDSLRFLDY 422 (978) T ss_pred EECCCCHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHCCCCCCCCCHHHHHHHHHH T ss_conf 96589878899999999999999998168799999999999975554037884378898741331248985665305999 Q ss_pred HHHHH-CCCHHH-HHHHHHHHCCCCC--EEEEEECCCCCC-CCHHHHHHHHHHH Q ss_conf 99887-279899-9999998627787--799981660108-9989999988434 Q gi|254780464|r 373 IDTIS-AITCED-IVGVAKKIFSSTP--TLAILGPPMDHV-PTTSELIHALEGF 421 (424) Q Consensus 373 ~~~i~-~vt~~d-i~~~~~~~l~~~~--~i~~vgp~~~~~-P~~~~l~~~l~~~ 421 (424) .+.++ .+.... ...++++||=.|+ +++++.|..+-. ...++.++.|+.. T Consensus 423 ~~~Lr~~le~~~~fe~LI~ky~l~N~h~~~v~~~Ps~~~~~~~ekee~e~L~~~ 476 (978) T COG1026 423 LQNLREKLEKGPYFEKLIRKYFLDNPHYVTVIVLPSPELEEKLEKEERELLQKR 476 (978) T ss_pred HHHHHHHHHCCHHHHHHHHHHHHCCCCCEEEEEECCHHHHHHHHHHHHHHHHHH T ss_conf 999998654281899999999602896179999067578888779999999999 No 8 >pfam00675 Peptidase_M16 Insulinase (Peptidase family M16). Probab=100.00 E-value=1.2e-32 Score=237.88 Aligned_cols=147 Identities=41% Similarity=0.650 Sum_probs=142.1 Q ss_pred EEEEECC-CCCEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHCCEEEEEECCCEEEEEEECC Q ss_conf 9999858-998799999983311448888983489999897335688898899999997087679975322379996126 Q gi|254780464|r 13 TVITEVM-PIDSAFVKVNIRAGSRNERQEEHGMAHFLEHMLFKGTTKRTAKEIVEEIEKVGGDINAYTSLEHTSYHAWVL 91 (424) Q Consensus 13 ~v~~~~~-~~~~~~i~~~~~~Gs~~e~~~~~G~ah~lehl~~~gt~~~~~~~~~~~l~~~G~~~na~t~~d~~~~~~~~~ 91 (424) ||++.++ +.|++++++||++||++|++++.|+||++|||+|+||++++..++.+.++.+|+.+||+|++|.|.|+++++ T Consensus 1 rVv~~~~~~~p~~si~l~v~~Gs~~E~~~~~Glahlle~~~f~Gt~~~~~~~i~~~~e~~G~~~na~t~~d~t~~~~~~l 80 (149) T pfam00675 1 RVALEHDPPADTSAVGLWVDAGSRYEPDDNNGLAHFLEHMAFKGTKKYPSNELEEELEKLGGSLNAYTSREHTVYYVEVL 80 (149) T ss_pred CEEEEECCCCCEEEEEEEECCCCCCCCCCCCHHHHHHHHHHCCCCCCCCCHHHHHHHHHCCCEEEEEECCCCEEEEEECC T ss_conf 98999769999799999977036888999722999999997289988982689999998496687641578569996066 Q ss_pred CCCCCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCHHHCCCCCCCCCCCCCCCCCH Q ss_conf 65652100122232210356620011245567666651133211100000012222222222222210 Q gi|254780464|r 92 KEHVPLALEIIGDMLSNSSFNPSDIERERNVVLEEIGMSEDDSWDFLDARFSEMVWKDQIIGRPILGK 159 (424) Q Consensus 92 ~~~~~~~l~ll~~~~~~p~~~~~~~~~ek~~v~~e~~~~~~~p~~~~~~~~~~~~f~~~~~~~~~~g~ 159 (424) +++++.+++++++++.+|.|++++|+++|+++++|+++..++|...+.+.+++.+|++||||+|++|+ T Consensus 81 ~~~~~~~l~ll~d~l~~p~f~~~e~e~ek~~v~~e~~~~~~~p~~~~~~~l~~~~f~~hpyg~p~~G~ 148 (149) T pfam00675 81 NDDLPKAVDRLADFFLNPLFSPSEVERERLVVEYEVEAVDAEPQEVLLDNLHAAAYRGTPLGRSLLGP 148 (149) T ss_pred HHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCCCCCCCCCC T ss_conf 88999999999998518899999999999999999998876989999999999852999899998789 No 9 >KOG2019 consensus Probab=99.97 E-value=6.6e-30 Score=219.53 Aligned_cols=397 Identities=16% Similarity=0.207 Sum_probs=294.0 Q ss_pred CCCCCEEEEECCCCCEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHCCCCCCCHHH-HHHHHH-HHCCEEEEEECCCEEE Q ss_conf 79871999985899879999998331144888898348999989733568889889-999999-7087679975322379 Q gi|254780464|r 8 TSSGITVITEVMPIDSAFVKVNIRAGSRNERQEEHGMAHFLEHMLFKGTTKRTAKE-IVEEIE-KVGGDINAYTSLEHTS 85 (424) Q Consensus 8 L~NGl~v~~~~~~~~~~~i~~~~~~Gs~~e~~~~~G~ah~lehl~~~gt~~~~~~~-~~~~l~-~~G~~~na~t~~d~~~ 85 (424) =+-|.++++.++.-+.-.+++.|+ .-|++-.|+.|.|||-..+|+.+||-.+ |.+.+. ++...+||+|..|+|. T Consensus 59 ~~Tgae~lhl~reD~N~vFsI~Fr----Tpp~dstGiPHILEHtvLCGS~KYPvrdPFfkmLnrSLatFmNAfT~pD~T~ 134 (998) T KOG2019 59 KKTGAEVLHLDREDENNVFSIVFR----TPPKDSTGIPHILEHTVLCGSRKYPVRDPFFKMLNRSLATFMNAFTAPDYTF 134 (998) T ss_pred CCCCCEEEEECCCCCCCEEEEEEE----CCCCCCCCCCHHHHHHEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCE T ss_conf 577860676135788754689961----6998667985153210000467676537599998778999986313788414 Q ss_pred EEEEC-CCCCCCCCHHHHHHHHHCCCCCCCHHHHH------------------HHHHHHHHHHCCCCCCCCCHHHCCCCC Q ss_conf 99612-66565210012223221035662001124------------------556766665113321110000001222 Q gi|254780464|r 86 YHAWV-LKEHVPLALEIIGDMLSNSSFNPSDIERE------------------RNVVLEEIGMSEDDSWDFLDARFSEMV 146 (424) Q Consensus 86 ~~~~~-~~~~~~~~l~ll~~~~~~p~~~~~~~~~e------------------k~~v~~e~~~~~~~p~~~~~~~~~~~~ 146 (424) |-+.. -++|+....+.+.|..+.|++.+.+|..| |++|..|++...++|.+.++..+...+ T Consensus 135 yPfattN~kDf~NL~dVYLDAtffPklr~~dF~QEGWr~Eh~dpsd~~SpivfkGVVfNEMKG~~S~~~~if~~~~Qq~L 214 (998) T KOG2019 135 YPFATTNTKDFYNLRDVYLDATFFPKLRKLDFQQEGWRLEHNDPSDPISPIVFKGVVFNEMKGQYSDPDYIFGMLFQQAL 214 (998) T ss_pred EECCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCEEECCCCCCCCCCCEEEEEEEECCCCCCCCHHHHHHHHHHHHH T ss_conf 21334782777777888641110447776324431232144788887664313215540025530580278999999864 Q ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCC-CCCC-C Q ss_conf 2222222222210344200016677642011245663000002545512211221101235877778866422-3322-1 Q gi|254780464|r 147 WKDQIIGRPILGKPETISSFTPEKIISFVSRNYTADRMYVVCVGAVDHEFCVSQVESYFNVCSVAKIKESMKP-AVYV-G 224 (424) Q Consensus 147 f~~~~~~~~~~g~~~~i~~lt~~~l~~f~~~~y~p~n~~l~i~Gd~~~~~~~~~i~~~f~~~~~~~~~~~~~p-~~~~-~ 224 (424) |+++.||....|.+..|..++.+++++||.++|+|+|+.+.-.||++.++..++++..|.............. ..+. + T Consensus 215 ~p~~tYgv~SGGDPl~IpdLt~eelk~FHr~~YHPSNAri~tYGn~Pl~~~l~~l~e~~~~~sk~~~s~kv~~qk~f~kp 294 (998) T KOG2019 215 FPENTYGVNSGGDPLDIPDLTYEELKEFHRQHYHPSNARIFTYGNFPLEDLLKQLEEDFSPFSKRELSSKVTFQKLFDKP 294 (998) T ss_pred CCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCEEEEECCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCC T ss_conf 86655354589996657535699999998852697764267646860899999988864203666666766554312567 Q ss_pred EEEEE-------CCCCCCCEEEECCCCCC-CCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCC--CCHHHHHHHHHHCC Q ss_conf 02330-------36843210230100001-3521135689987776248765322220002686--21013332221012 Q gi|254780464|r 225 GEYIQ-------KRDLAEEHMMLGFNGCA-YQSRDFYLTNILASILGDGMSSRLFQEVREKRGL--CYSISAHHENFSDN 294 (424) Q Consensus 225 ~~~~~-------~~~~~~~~i~~~~~~~~-~~~~d~~al~ll~~iLg~g~sSrL~~~lRe~~gl--aY~v~s~~~~~~~~ 294 (424) .+++. .....|..+.+.|..+. .+--+.+++.+|..+|-+|++|.+++.|.|- || -.++.+.+...... T Consensus 295 ~rvve~~p~d~~~~p~Kq~~~s~s~L~~~p~d~~etfaL~~L~~Ll~~gpsSp~yk~LiES-GLGtEfsvnsG~~~~t~~ 373 (998) T KOG2019 295 RRVVEKGPADPGDLPKKQTKCSNSFLSNDPLDTYETFALKVLSHLLLDGPSSPFYKALIES-GLGTEFSVNSGYEDTTLQ 373 (998) T ss_pred CEEEEECCCCCCCCCCCEEEEEEEEECCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHC-CCCCCCCCCCCCCCCCCC T ss_conf 1332335899888852303777776358843589999999999886279960889999871-778521557788850013 Q ss_pred CCEEEEEE-CCCCHHHHHHHHHHHHHHHHCC-CCCHHHHHHHHHHHHHHHHHHC-CCHHHHHHHHHHHHHHCCCCCCHHH Q ss_conf 20134442-5830247778778999985314-8998999999999999998513-8989999999999996898479899 Q gi|254780464|r 295 GVLYIASA-TAKENIMALTSSIVEVVQSLLE-NIEQREIDKECAKIHAKLIKSQ-ERSYLRALEISKQVMFCGSILCSEK 371 (424) Q Consensus 295 g~~~i~~~-~~~~~~~~~~~~i~~~i~~l~~-~i~~~el~~ak~~~~~~~~~~~-~~~~~~a~~l~~~~~~~~~~~~~~~ 371 (424) +.|.+... ++.+++..+.+.+...++++.+ +++.+.++...+++.-++..+- ..--.++..+...+.+..++..+-. T Consensus 374 ~~fsVGLqGvseediekve~lV~~t~~~lae~gfd~drieAil~qiEislk~qst~fGL~L~~~i~~~W~~d~DPfE~Lk 453 (998) T KOG2019 374 PQFSVGLQGVSEEDIEKVEELVMNTFNKLAETGFDNDRIEAILHQIEISLKHQSTGFGLSLMQSIISKWINDMDPFEPLK 453 (998) T ss_pred CEEEEEECCCCHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCCCCCCHHH T ss_conf 22432003466788999999999999999982243688999998764002204566018999876665236899541222 Q ss_pred HHHHHHCC-------CHHHHHHHHHHHCCCCC-EEEEEECCCCCCC Q ss_conf 99988727-------98999999998627787-7999816601089 Q gi|254780464|r 372 IIDTISAI-------TCEDIVGVAKKIFSSTP-TLAILGPPMDHVP 409 (424) Q Consensus 372 ~~~~i~~v-------t~~di~~~~~~~l~~~~-~i~~vgp~~~~~P 409 (424) +.+.+++. +..-.+..+++|+-.|+ .++.-+.|.+.-+ T Consensus 454 ~~~~L~~lk~~l~ek~~~lfq~lIkkYilnn~h~~t~smqpd~e~~ 499 (998) T KOG2019 454 FEEQLKKLKQRLAEKSKKLFQPLIKKYILNNPHCFTFSMQPDPEFA 499 (998) T ss_pred HHHHHHHHHHHHHHHCHHHHHHHHHHHHHCCCCEEEEEECCCCHHH T ss_conf 5549999998876533167899999997269844899956881155 No 10 >KOG0961 consensus Probab=99.97 E-value=2.5e-28 Score=208.96 Aligned_cols=391 Identities=17% Similarity=0.204 Sum_probs=279.5 Q ss_pred CCEEEEECCCCCEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHH-CCEEEEEECCCEEEEEEE Q ss_conf 7199998589987999999833114488889834899998973356888988999999970-876799753223799961 Q gi|254780464|r 11 GITVITEVMPIDSAFVKVNIRAGSRNERQEEHGMAHFLEHMLFKGTTKRTAKEIVEEIEKV-GGDINAYTSLEHTSYHAW 89 (424) Q Consensus 11 Gl~v~~~~~~~~~~~i~~~~~~Gs~~e~~~~~G~ah~lehl~~~gt~~~~~~~~~~~l~~~-G~~~na~t~~d~~~~~~~ 89 (424) |++|++-..|++.++-.+.|. .|.....|+.|-||||+|.|+++||...+.+.+... =+..||-|+-|+|.|.++ T Consensus 29 kl~va~~~~pts~vhG~f~v~----TEa~~d~G~PHTLEHL~FMGSKkYP~kGvLd~~anr~l~dtNAwTDtD~T~YtLS 104 (1022) T KOG0961 29 KLRVAIGEVPTSMVHGAFSVV----TEADSDDGLPHTLEHLVFMGSKKYPFKGVLDVIANRCLADTNAWTDTDHTAYTLS 104 (1022) T ss_pred CEEEEEEECCCCCEEEEEEEE----EEECCCCCCCHHHHHHHHHCCCCCCCCCHHHHHHCCHHCCCCCCCCCCCCEEEEE T ss_conf 247998505875123468866----5304788993067778640566677532888861102104555403674247851 Q ss_pred CCC-CCCCCCHHHHHHHHHCCCCCCCHHHHH----------HHHHHHHHHHCCCCCCCCCHHHCCCCCCC-CCCCCCCCC Q ss_conf 266-565210012223221035662001124----------55676666511332111000000122222-222222222 Q gi|254780464|r 90 VLK-EHVPLALEIIGDMLSNSSFNPSDIERE----------RNVVLEEIGMSEDDSWDFLDARFSEMVWK-DQIIGRPIL 157 (424) Q Consensus 90 ~~~-~~~~~~l~ll~~~~~~p~~~~~~~~~e----------k~~v~~e~~~~~~~p~~~~~~~~~~~~f~-~~~~~~~~~ 157 (424) ..- +-+-++|+.+.|-+++|.++.++|-.| +++|.+|++..+..-+..+.+...+..|+ .++|...-. T Consensus 105 tag~dGFlklLPvy~dHiL~P~Ltdeaf~TEVyHI~geg~d~GVVySEMq~~es~~~~im~~~~~~~~yP~~sgY~~eTG 184 (1022) T KOG0961 105 TAGSDGFLKLLPVYIDHILTPMLTDEAFATEVYHITGEGNDAGVVYSEMQDHESEMESIMDRKTKEVIYPPFSGYAVETG 184 (1022) T ss_pred CCCCCCHHHHHHHHHHHHCCCCCCHHHHHHHEEEECCCCCCCCEEEHHHHHHHCCCCHHHHHHHHEEECCCCCCCEECCC T ss_conf 14543368774888876357200246655411352687873533530543200122005434220064589887233058 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHCCCCCHHHHHHHHHHCCCCCCCCCC--CCC-CCCC-----CCCCEE--- Q ss_conf 10344200016677642011245663000002545512211221101235877778--866-4223-----322102--- Q gi|254780464|r 158 GKPETISSFTPEKIISFVSRNYTADRMYVVCVGAVDHEFCVSQVESYFNVCSVAKI--KES-MKPA-----VYVGGE--- 226 (424) Q Consensus 158 g~~~~i~~lt~~~l~~f~~~~y~p~n~~l~i~Gd~~~~~~~~~i~~~f~~~~~~~~--~~~-~~p~-----~~~~~~--- 226 (424) |.++.++.+|.+.+++||++.|+++||+++|+|.+++++++...+.+-..++.... |+. ++|- .+.-.+ T Consensus 185 G~~knLR~lt~ekIR~yHK~~Y~~sN~cviVcG~v~~d~lL~~m~~~~neile~~s~vP~~~~rPf~~tn~~~~i~e~t~ 264 (1022) T KOG0961 185 GRLKNLRELTLEKIRDYHKKFYHLSNMCVIVCGMVDHDQLLEIMNNVENEILEHMSTVPDHFPRPFSFTNALSDIKESTV 264 (1022) T ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCE T ss_conf 87056788559999999987556466599995676789999999877766654266689888988623467545774441 Q ss_pred -EE--ECCCCCCCEEEECCCCCCCCC-CHHHHHHHHHHHHHCCCCCCCCCCCCC-CCCCCCHHHHHHHHHHCCCCEEEEE Q ss_conf -33--036843210230100001352-113568998777624876532222000-2686210133322210122013444 Q gi|254780464|r 227 -YI--QKRDLAEEHMMLGFNGCAYQS-RDFYLTNILASILGDGMSSRLFQEVRE-KRGLCYSISAHHENFSDNGVLYIAS 301 (424) Q Consensus 227 -~~--~~~~~~~~~i~~~~~~~~~~~-~d~~al~ll~~iLg~g~sSrL~~~lRe-~~glaY~v~s~~~~~~~~g~~~i~~ 301 (424) .+ -..|..+..+.++|.+++..+ ....|+.+|-..|.+..-+.+.+.+.| +..++.+++.+..-.-....=.... T Consensus 265 ~tVefp~~Des~G~v~~aW~g~s~sD~~t~~a~~vL~dyls~savapf~~~fVeieDP~assv~f~~~~~vrc~i~L~f~ 344 (1022) T KOG0961 265 HTVEFPTDDESRGAVEVAWFGHSPSDLETHSALHVLFDYLSNSAVAPFQKDFVEIEDPLASSVSFHIAEGVRCDIRLNFA 344 (1022) T ss_pred EEEECCCCCCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCEEEECCCCCCCEEEEEECCCCEEEEEEEC T ss_conf 35566875555652899974888787656789999999860022340003337752765441355651366346898516 Q ss_pred ECCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCC--HHHHHHHHHHHHHHCCCCC-CH------HHH Q ss_conf 25830247778778999985314899899999999999999851389--8999999999999689847-98------999 Q gi|254780464|r 302 ATAKENIMALTSSIVEVVQSLLENIEQREIDKECAKIHAKLIKSQER--SYLRALEISKQVMFCGSIL-CS------EKI 372 (424) Q Consensus 302 ~~~~~~~~~~~~~i~~~i~~l~~~i~~~el~~ak~~~~~~~~~~~~~--~~~~a~~l~~~~~~~~~~~-~~------~~~ 372 (424) +++-+++.++..-+++.+.+- ..|+=+.......+-+-+++.++|. ++..+..+....++|..-. .+ -++ T Consensus 345 gVP~EKi~~~~~k~l~~l~et-~~iDm~Rm~~~i~~t~~~yL~nlE~n~~s~fms~ii~d~~ygnedg~~l~~~lk~l~~ 423 (1022) T KOG0961 345 GVPVEKIDECAPKFLDKLVET-ANIDMERMGYLIDQTILNYLVNLETNAPSDFMSHIIGDQLYGNEDGELLKKRLKELDF 423 (1022) T ss_pred CCCHHHHHHHHHHHHHHHHHH-CCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHH T ss_conf 773787644408999998886-0067999999998879999985402776888998766451267534678889876778 Q ss_pred HHHHHCCCHHHHHHHHHHHCCCCCEEEEEECCCC Q ss_conf 9988727989999999986277877999816601 Q gi|254780464|r 373 IDTISAITCEDIVGVAKKIFSSTPTLAILGPPMD 406 (424) Q Consensus 373 ~~~i~~vt~~di~~~~~~~l~~~~~i~~vgp~~~ 406 (424) .+.+.+....|.+.+.+|||-.++.+++++.|.. T Consensus 424 ~~~L~~w~~kdW~~Llnk~Fven~s~tVia~Ps~ 457 (1022) T KOG0961 424 LKKLKSWPAKDWVQLLNKYFVENPSATVIAVPSE 457 (1022) T ss_pred HHHHHHCCHHHHHHHHHHHHCCCCCEEEEECCCH T ss_conf 9998632277899999998305897379935829 No 11 >pfam05193 Peptidase_M16_C Peptidase M16 inactive domain. Peptidase M16 consists of two structurally related domains. One is the active peptidase, whereas the other is inactive. The two domains hold the substrate like a clamp. Probab=99.95 E-value=4.9e-28 Score=206.98 Aligned_cols=174 Identities=33% Similarity=0.540 Sum_probs=149.1 Q ss_pred HHHHHHHHHHHCCCCCCCCCHHHHCCCCCHHHHHHHHHHCCCCCCCCCCCCCC-----CCCCCCCEE-EEECCCCCCCEE Q ss_conf 00166776420112456630000025455122112211012358777788664-----223322102-330368432102 Q gi|254780464|r 165 SFTPEKIISFVSRNYTADRMYVVCVGAVDHEFCVSQVESYFNVCSVAKIKESM-----KPAVYVGGE-YIQKRDLAEEHM 238 (424) Q Consensus 165 ~lt~~~l~~f~~~~y~p~n~~l~i~Gd~~~~~~~~~i~~~f~~~~~~~~~~~~-----~p~~~~~~~-~~~~~~~~~~~i 238 (424) ++|.++|++||++||+|+||+++|+||+|+++++++++++|++|+....+... .+....... .....+..++++ T Consensus 1 ~lt~e~l~~fy~~~Y~p~n~~l~i~Gdi~~~~~~~~i~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (182) T pfam05193 1 SLTREDLKDFYKKHYSPDNMVLVVVGDVDLDELLALAEKYFGDLPASPPKPKPREPPLEPEELTGKEVVVPDKDVPQAKL 80 (182) T ss_pred CCCHHHHHHHHHHHCCCCCEEEEEECCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCCCCEEE T ss_conf 95989999999984782067999983999999999999997159988888887666766434677479992799862699 Q ss_pred EECCCCCCCC-CCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHHH Q ss_conf 3010000135-211356899877762487653222200026862101333222101220134442583024777877899 Q gi|254780464|r 239 MLGFNGCAYQ-SRDFYLTNILASILGDGMSSRLFQEVREKRGLCYSISAHHENFSDNGVLYIASATAKENIMALTSSIVE 317 (424) Q Consensus 239 ~~~~~~~~~~-~~d~~al~ll~~iLg~g~sSrL~~~lRe~~glaY~v~s~~~~~~~~g~~~i~~~~~~~~~~~~~~~i~~ 317 (424) .++|++++.. +++++++.+++.+|+++++|||++.||++.|++|++++++..+.+.|+|.+++.+.+++..++++.+.+ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~l~~lL~~~~~s~L~~~lr~~~gl~y~~~~~~~~~~~~g~~~i~~~~~~~~~~~~~~~i~~ 160 (182) T pfam05193 81 ALAFPGPPLGNDPDSAALDVLAELLGGGASSRLFQELREKEGLAYSVSAFLDSYSDTGLFGIYADLDPENLDEVIELILE 160 (182) T ss_pred EEEEECCCCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHCCCEEEEEEEEECCCCCEEEEEEEEECCCCHHHHHHHHHH T ss_conf 99998488665588999999888725874437899998754880489986555889717999999787889999999999 Q ss_pred HHHHHCC-CCCHHHHHHHHHHH Q ss_conf 9985314-89989999999999 Q gi|254780464|r 318 VVQSLLE-NIEQREIDKECAKI 338 (424) Q Consensus 318 ~i~~l~~-~i~~~el~~ak~~~ 338 (424) +++++.+ |++++||+++|+++ T Consensus 161 ~l~~l~~~g~s~~el~~ak~~l 182 (182) T pfam05193 161 ELKKLAEEGITEEELERAKAQL 182 (182) T ss_pred HHHHHHHCCCCHHHHHHHHHCC T ss_conf 9999987197999999999649 No 12 >TIGR02110 PQQ_syn_pqqF coenzyme PQQ biosynthesis protein PqqF; InterPro: IPR011844 Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site . The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This family of metallopeptidases belong to MEROPS peptidase family M16, subfamily M16A (clan ME). In the subset of species that make coenzyme PQQ (pyrrolo-quinoline-quinone), this peptidase is found in the PQQ biosynthesis region and is thought to act as a protease on PqqA (IPR011725 from INTERPRO), a probable peptide precursor of the coenzyme. PQQ is required for some glucose dehydrogenases and alcohol dehydrogenases .; GO: 0008234 cysteine-type peptidase activity, 0008270 zinc ion binding, 0018189 pyrroloquinoline quinone biosynthetic process. Probab=99.93 E-value=3.6e-28 Score=207.90 Aligned_cols=389 Identities=20% Similarity=0.215 Sum_probs=267.7 Q ss_pred EEEECCC------CCEEEE-ECCCCCEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHCCCCCCCHHH-HHHHHHHHCCEE Q ss_conf 8998798------719999-85899879999998331144888898348999989733568889889-999999708767 Q gi|254780464|r 4 RISKTSS------GITVIT-EVMPIDSAFVKVNIRAGSRNERQEEHGMAHFLEHMLFKGTTKRTAKE-IVEEIEKVGGDI 75 (424) Q Consensus 4 ~~~~L~N------Gl~v~~-~~~~~~~~~i~~~~~~Gs~~e~~~~~G~ah~lehl~~~gt~~~~~~~-~~~~l~~~G~~~ 75 (424) |..||+| ||+|-+ ++.......-.+-|-+||-+||.-=+|+|||||||+|.||+.|+.+| +--+++.-||++ T Consensus 1 R~~TL~nhWrPCqGLr~~L~H~P~a~RaAA~lRVAAGSHDeP~awPGLAHfLEHLlFlG~ErF~~d~rLM~wvQ~qGGQV 80 (737) T TIGR02110 1 RRITLANHWRPCQGLRVHLVHDPAAKRAAALLRVAAGSHDEPSAWPGLAHFLEHLLFLGGERFQGDDRLMPWVQRQGGQV 80 (737) T ss_pred CEEECCCCCCCCCCCCEECCCCCHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCHHHHHCCCCC T ss_conf 90205888864523210014670357999999975166664014665488999997303111457666208986238851 Q ss_pred EEEECCCEEEEEEECCCCCCCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCHHHCCCCCCCCCCCCCC Q ss_conf 99753223799961266565210012223221035662001124556766665113321110000001222222222222 Q gi|254780464|r 76 NAYTSLEHTSYHAWVLKEHVPLALEIIGDMLSNSSFNPSDIERERNVVLEEIGMSEDDSWDFLDARFSEMVWKDQIIGRP 155 (424) Q Consensus 76 na~t~~d~~~~~~~~~~~~~~~~l~ll~~~~~~p~~~~~~~~~ek~~v~~e~~~~~~~p~~~~~~~~~~~~f~~~~~~~~ 155 (424) ||+|-.-.|-|++.++.+.|..++.-|.||+..|.++-++..+|++|+-.|......|...+-.-.+...+-+.||+.+. T Consensus 81 NA~T~~r~t~fFFElpA~Ala~Gl~RL~~MLA~Pll~~~dq~REreVl~AEy~a~~~da~~~~~aAl~~~~~a~hplRrF 160 (737) T TIGR02110 81 NASTLERTTDFFFELPAEALAAGLARLCDMLARPLLTIEDQQREREVLEAEYIAWQADAQTRREAALLDALSAAHPLRRF 160 (737) T ss_pred CCCHHCCCCHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHCCCCCCCCCHH T ss_conf 00020134203455799999867999999864203129999887999999999985056579999998357877611011 Q ss_pred CCCHHHHHH----HHHHHHHHHHHCCCCCCCCCHHHHCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCEE---EE Q ss_conf 221034420----00166776420112456630000025455122112211012358777788664223322102---33 Q gi|254780464|r 156 ILGKPETIS----SFTPEKIISFVSRNYTADRMYVVCVGAVDHEFCVSQVESYFNVCSVAKIKESMKPAVYVGGE---YI 228 (424) Q Consensus 156 ~~g~~~~i~----~lt~~~l~~f~~~~y~p~n~~l~i~Gd~~~~~~~~~i~~~f~~~~~~~~~~~~~p~~~~~~~---~~ 228 (424) -.|+-+++. .+ ..-|++||.+||+..||+|...|.-+.|++..+.+++=+.++.+.......|+.--... .. T Consensus 161 ~aG~R~sL~~~~~Af-Q~AL~~FH~~~Y~A~~M~LwLqGPQsL~ELe~LA~R~ga~L~~G~~~~~~~P~~L~~~~~~~l~ 239 (737) T TIGR02110 161 HAGSRDSLALPNDAF-QQALRDFHRRHYQAGNMQLWLQGPQSLDELEELAARFGASLAAGEELAQAAPAPLLKFAALTLA 239 (737) T ss_pred HCCCHHHCCCCHHHH-HHHHHHHHHHHHHHHCCEEECCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHH T ss_conf 023515316984899-9999999998404430403048798778999999998201447788788878555612689987 Q ss_pred ECCCCCCCEEEECCCCCCCCCCHHHHHHHHHHHHHC-CCCCCCCCCCCCCCCCCCHHHHHH-HHHHCCCCEEEEE---EC Q ss_conf 036843210230100001352113568998777624-876532222000268621013332-2210122013444---25 Q gi|254780464|r 229 QKRDLAEEHMMLGFNGCAYQSRDFYLTNILASILGD-GMSSRLFQEVREKRGLCYSISAHH-ENFSDNGVLYIAS---AT 303 (424) Q Consensus 229 ~~~~~~~~~i~~~~~~~~~~~~d~~al~ll~~iLg~-g~sSrL~~~lRe~~glaY~v~s~~-~~~~~~g~~~i~~---~~ 303 (424) ...+.+ ..+.++|-.-.......-++.+|...|+. .++| |.-.||+ ++|+-++.+.. ......+++-+.+ .. T Consensus 240 ~~~~~P-~~l~L~fa~~~l~~~~~~~V~~L~~~L~d~~PGg-LlA~L~~-r~L~es~~a~~Lyqd~~QaLLaL~F~~~~~ 316 (737) T TIGR02110 240 ERSEGP-ERLWLLFALEALPAAAEDAVTLLCEFLQDEAPGG-LLAELRE-RGLAESLAATWLYQDAGQALLALEFEARSV 316 (737) T ss_pred HHCCCC-HHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCCHH-HHHHHHH-HHHHCCHHCCHHHHHHHHHHHHHHHHHCCC T ss_conf 414782-1467888865133117999999999863148711-6899987-652100202000125556889988752467 Q ss_pred CCCHHHHHHHHHHHHHHHHCC----------CCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCHHHHH Q ss_conf 830247778778999985314----------8998999999999999998513898999999999999689847989999 Q gi|254780464|r 304 AKENIMALTSSIVEVVQSLLE----------NIEQREIDKECAKIHAKLIKSQERSYLRALEISKQVMFCGSILCSEKII 373 (424) Q Consensus 304 ~~~~~~~~~~~i~~~i~~l~~----------~i~~~el~~ak~~~~~~~~~~~~~~~~~a~~l~~~~~~~~~~~~~~~~~ 373 (424) +++...++...+.+-+.-+.+ ++-.+.++.-..--...+... +.......-.+..-...+....+++. T Consensus 317 SA~~a~~~~~~~~~WL~al~~~a~sllaarrPlwa~~~~~Y~~L~qR~l~~~--~~~~L~~l~~R~~~~~l~~~~~~~l~ 394 (737) T TIGR02110 317 SAAAAEQVEELLTEWLGALKEQAASLLAARRPLWAELLEEYERLEQRRLQLA--ALLPLAQLDARALGFALGEALPAALK 394 (737) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHH--HCCCHHHHHHHHHHCCCCCCCHHHHH T ss_conf 8453689999999999999998999987304667436658988877776543--04876677777420266757637899 Q ss_pred HHHHCCCHHHHHHHHHHH-CCCCCEE Q ss_conf 988727989999999986-2778779 Q gi|254780464|r 374 DTISAITCEDIVGVAKKI-FSSTPTL 398 (424) Q Consensus 374 ~~i~~vt~~di~~~~~~~-l~~~~~i 398 (424) +..++++..-+..++.+. +..++.+ T Consensus 395 ~~~~~L~~~p~~~l~~q~~~pg~p~~ 420 (737) T TIGR02110 395 DFLAALQDAPSVRLATQTRLPGAPFA 420 (737) T ss_pred HHHHHCCHHHHHHHHHCCCCCCHHHH T ss_conf 99984222245565530458704778 No 13 >COG1026 Predicted Zn-dependent peptidases, insulinase-like [General function prediction only] Probab=99.88 E-value=6.3e-20 Score=152.61 Aligned_cols=388 Identities=16% Similarity=0.206 Sum_probs=247.2 Q ss_pred EEECCCCCEEEEECCC-CCEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHCCEEEE----EE Q ss_conf 9987987199998589-987999999833114488889834899998973356888988999999970876799----75 Q gi|254780464|r 5 ISKTSSGITVITEVMP-IDSAFVKVNIRAGSRNERQEEHGMAHFLEHMLFKGTTKRTAKEIVEEIEKVGGDINA----YT 79 (424) Q Consensus 5 ~~~L~NGl~v~~~~~~-~~~~~i~~~~~~Gs~~e~~~~~G~ah~lehl~~~gt~~~~~~~~~~~l~~~G~~~na----~t 79 (424) .+.-.|..+|+..+.. ...+.+.++|+. ++-.....+=+.-+-+.+.--||.+++..++.++++..-|.+++ .+ T Consensus 529 ~~~~~~~~~v~~~~~~tn~i~yl~~~~~~-~~l~~~llpyL~L~~~~l~~lgt~~~~y~e~~~~i~~~TGgis~~~~~~~ 607 (978) T COG1026 529 ETEVSNEAKVLHHDLFTNGITYLRLYFDL-DMLPSELLPYLPLFAFALTNLGTETYSYKELLNQIERHTGGISVSLSVDT 607 (978) T ss_pred EEECCCCCCEEEEECCCCCEEEEEEEEEC-CCCCHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHCCCEEEEEECC T ss_conf 63036776048853278874899999616-87876653448999999985388885799999999997178104676236 Q ss_pred C-------CCEEEEEEECCCCCCCCCHHHHHHHHHCCCCC-CCHHHHHHHHHHHHHHHCCCC-CCCCCHHHCCCCCCCCC Q ss_conf 3-------22379996126656521001222322103566-200112455676666511332-11100000012222222 Q gi|254780464|r 80 S-------LEHTSYHAWVLKEHVPLALEIIGDMLSNSSFN-PSDIERERNVVLEEIGMSEDD-SWDFLDARFSEMVWKDQ 150 (424) Q Consensus 80 ~-------~d~~~~~~~~~~~~~~~~l~ll~~~~~~p~~~-~~~~~~ek~~v~~e~~~~~~~-p~~~~~~~~~~~~f~~~ 150 (424) + +..+++.+.+++++.+++++++.+++.++.|+ .+.+....+..++.+.....+ +...+.......++.+. T Consensus 608 ~~~~~~~~~~~~~i~~K~l~~k~~~~~~~i~~~l~~~~F~D~~Rlkell~q~~~~l~~~vr~sG~~~A~~~~~s~~~~~~ 687 (978) T COG1026 608 DPGDDGEYRPSFSISGKALRSKVEKLFELIREILANTDFHDRERLKELLEQYLSDLTSSVRNSGHSIASSLANSRLSSAG 687 (978) T ss_pred CCCCCCCCCCEEEEEEEEHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCCCCH T ss_conf 87755555613799987653105579999999985077685899999999998452776642533789987532556205 Q ss_pred CCCCCCCC--HHHHHHHHH-----------HHHHHHHHCCCCCCCCCHHHHCCCCCHHHHHHHHHHCCCCCCC-----CC Q ss_conf 22222221--034420001-----------6677642011245663000002545512211221101235877-----77 Q gi|254780464|r 151 IIGRPILG--KPETISSFT-----------PEKIISFVSRNYTADRMYVVCVGAVDHEFCVSQVESYFNVCSV-----AK 212 (424) Q Consensus 151 ~~~~~~~g--~~~~i~~lt-----------~~~l~~f~~~~y~p~n~~l~i~Gd~~~~~~~~~i~~~f~~~~~-----~~ 212 (424) .+....-| ..+-|..+. .+.|++.+++.+..+|+.+++.|+.+ ...+.+++-|-++.. .. T Consensus 688 ~l~e~~~Gl~q~k~i~~l~~~~~~~~~~ei~~kL~~l~~~i~~~~n~~i~i~~~~~--~~~~~~e~~l~~~~~~~~~~~~ 765 (978) T COG1026 688 ALKELLNGLSQVKFLRELSSNFEENFEKEIADKLQALRKKIFQTNNLRIAIIGDID--KILDLLENPLLKFLEHLLPGFE 765 (978) T ss_pred HHHHHHCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCHH--HHHHHHHHHHHHHHCCCCCCCC T ss_conf 48877458069999999987540255588999999999999624765899835716--6677777776542203671002 Q ss_pred CCCCCCCC-CC-CC-EEEEECCCCCCCEEEECC--CCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHH Q ss_conf 88664223-32-21-023303684321023010--000135211356899877762487653222200026862101333 Q gi|254780464|r 213 IKESMKPA-VY-VG-GEYIQKRDLAEEHMMLGF--NGCAYQSRDFYLTNILASILGDGMSSRLFQEVREKRGLCYSISAH 287 (424) Q Consensus 213 ~~~~~~p~-~~-~~-~~~~~~~~~~~~~i~~~~--~~~~~~~~d~~al~ll~~iLg~g~sSrL~~~lRe~~glaY~v~s~ 287 (424) .+....+. .. .+ ......-+.+.++...+| -.+++.|||+.++.|+..+|+. ..|...||++ |.||+++++ T Consensus 766 ~~~~~~~~~~~~~~~~~~~~ii~~p~a~~~l~fs~~~~~y~hpd~~~l~vls~~L~~---~~lw~~IR~~-GGAYGa~as 841 (978) T COG1026 766 LPTPPKNPHLDLISSLSEATIIPSPVAYNALAFSIGGLPYTHPDYAALQVLSEYLGS---GYLWNKIREK-GGAYGASAS 841 (978) T ss_pred CCCCCCCCCHHHHCCCCCEEEECCHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHCC---CHHHHHHHHH-CCCCCCCCC T ss_conf 688988854113235654178556799888760046788888641699999999634---4037777762-355453333 Q ss_pred HHHHHCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCC-CCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCC Q ss_conf 2221012201344425830247778778999985314-899899999999999999851389899999999999968984 Q gi|254780464|r 288 HENFSDNGVLYIASATAKENIMALTSSIVEVVQSLLE-NIEQREIDKECAKIHAKLIKSQERSYLRALEISKQVMFCGSI 366 (424) Q Consensus 288 ~~~~~~~g~~~i~~~~~~~~~~~~~~~i~~~i~~l~~-~i~~~el~~ak~~~~~~~~~~~~~~~~~a~~l~~~~~~~~~~ 366 (424) ... ..|.|.++.-.+|+ ..++.+...+.++.+.. .+++.|+++++-.....+. ..+++...+..-....+.+... T Consensus 842 ~~~--~~G~f~f~sYRDPn-~~kt~~v~~~~v~~l~s~~~~~~d~~~~ilg~i~~~d-~p~sp~~~~~~s~~~~~sg~~~ 917 (978) T COG1026 842 IDA--NRGVFSFASYRDPN-ILKTYKVFRKSVKDLASGNFDERDLEEAILGIISTLD-TPESPASEGSKSFYRDLSGLTD 917 (978) T ss_pred CCC--CCCEEEEEECCCCC-HHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCC-CCCCCCCEEHHHHHHHHHCCCH T ss_conf 246--88769998248982-8899999999999987388778899888877400355-5668763203667788746988 Q ss_pred CCHHHHHHHHHCCCHHHHHHHHHHHCC---CCCEEEEEEC Q ss_conf 798999998872798999999998627---7877999816 Q gi|254780464|r 367 LCSEKIIDTISAITCEDIVGVAKKIFS---STPTLAILGP 403 (424) Q Consensus 367 ~~~~~~~~~i~~vt~~di~~~~~~~l~---~~~~i~~vgp 403 (424) ...+.+.++|-.||++||++++++|+. +..++++++. T Consensus 918 ~~~qa~re~~l~vt~~di~~~~~~yl~~~~~e~~i~~~~~ 957 (978) T COG1026 918 EERQAFRERLLDVTKEDIKEVMDKYLLNFSSENSIAVFAG 957 (978) T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCEEEEEEC T ss_conf 8999999998627599999999999762435561799832 No 14 >COG1025 Ptr Secreted/periplasmic Zn-dependent peptidases, insulinase-like [Posttranslational modification, protein turnover, chaperones] Probab=99.72 E-value=8.6e-15 Score=118.18 Aligned_cols=376 Identities=11% Similarity=0.124 Sum_probs=240.2 Q ss_pred ECCCCCEEEEECCC--C-CEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHCCEEEEEECCCE Q ss_conf 87987199998589--9-87999999833114488889834899998973356888988999999970876799753223 Q gi|254780464|r 7 KTSSGITVITEVMP--I-DSAFVKVNIRAGSRNERQEEHGMAHFLEHMLFKGTTKRTAKEIVEEIEKVGGDINAYTSLEH 83 (424) Q Consensus 7 ~L~NGl~v~~~~~~--~-~~~~i~~~~~~Gs~~e~~~~~G~ah~lehl~~~gt~~~~~~~~~~~l~~~G~~~na~t~~d~ 83 (424) -.+|+......+.+ + |...+.+.++.=-...++...=+..++..+++-...++.. ....-|.+++...+.+. T Consensus 507 ~~~~~~~wy~~~d~F~~~PK~~v~~~irsp~~~~s~r~~Vl~~l~~~la~dal~~~~y-----~A~~aG~sfs~~~~~~G 581 (937) T COG1025 507 EDPNLRLWYLKEDYFAVEPKASVSLAIRSPHASRSPRNQVLTELYAYLANDALDKLSY-----QASLAGLSFSLAANSNG 581 (937) T ss_pred CCCCCEEEEECCCCCCCCCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHHHHHHHHH-----HHHHCCEEEEEECCCCC T ss_conf 4888448984377566677623699975753455789999999999997878876302-----77755657885047886 Q ss_pred EEEEEECCCCCCCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCC-CCCCCCCHHHCCCCCCCCCCCCCCCCCHHHH Q ss_conf 799961266565210012223221035662001124556766665113-3211100000012222222222222210344 Q gi|254780464|r 84 TSYHAWVLKEHVPLALEIIGDMLSNSSFNPSDIERERNVVLEEIGMSE-DDSWDFLDARFSEMVWKDQIIGRPILGKPET 162 (424) Q Consensus 84 ~~~~~~~~~~~~~~~l~ll~~~~~~p~~~~~~~~~ek~~v~~e~~~~~-~~p~~~~~~~~~~~~f~~~~~~~~~~g~~~~ 162 (424) ..++++|.++.++.++..+-+.+..-.++++.|...|..+.++++... +.|.+++.+.+.-.+-..+. +.--..+. T Consensus 582 l~ltisGft~~lp~L~~~~l~~l~~~~~~~~~f~~~K~~~~~~~~~a~~~~p~~~~~~~l~~l~~~~~~---s~~e~~~~ 658 (937) T COG1025 582 LDLTISGFTQRLPQLLRAFLDGLFSLPVDEDRFEQAKSQLSEELKNALTGKPYRQALDGLTGLLQVPYW---SREERRNA 658 (937) T ss_pred EEEEEECCCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCC---CHHHHHHH T ss_conf 699960565124999999999873489987899999999999997665148799987875665178876---78999987 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCHHHHCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCC--CCEEEEE--CCCCCCCEE Q ss_conf 200016677642011245663000002545512211221101235877778866422332--2102330--368432102 Q gi|254780464|r 163 ISSFTPEKIISFVSRNYTADRMYVVCVGAVDHEFCVSQVESYFNVCSVAKIKESMKPAVY--VGGEYIQ--KRDLAEEHM 238 (424) Q Consensus 163 i~~lt~~~l~~f~~~~y~p~n~~l~i~Gd~~~~~~~~~i~~~f~~~~~~~~~~~~~p~~~--~~~~~~~--~~~~~~~~i 238 (424) ++.++.+++.+|....+.+.-..+.|.||+..+++.++.+.....++.........|... .++..+. ..+.+.... T Consensus 659 l~~v~~~e~~~f~~~l~~~~~lE~lv~Gn~~~~da~~l~~~~~~~l~~~~s~~~~~~~~~~~~~~~~~~e~~~~~~~~an 738 (937) T COG1025 659 LESVSVEEFAAFRDTLLNGVHLEMLVLGNLTEADATNLAETLQKKLPAIGSTWYRNPSVYLLKGGTRIFETVGGESDSAN 738 (937) T ss_pred HHHCCHHHHHHHHHHHHHCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCEECCCCCEEEEEECCCCCCCCC T ss_conf 64244999987888764103401466356038889889999986502467755678752306887167641268865554 Q ss_pred EECCCCCCCC-CCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEEECC---CCHHHHHHHH Q ss_conf 3010000135-21135689987776248765322220002686210133322210122013444258---3024777877 Q gi|254780464|r 239 MLGFNGCAYQ-SRDFYLTNILASILGDGMSSRLFQEVREKRGLCYSISAHHENFSDNGVLYIASATA---KENIMALTSS 314 (424) Q Consensus 239 ~~~~~~~~~~-~~d~~al~ll~~iLg~g~sSrL~~~lRe~~glaY~v~s~~~~~~~~g~~~i~~~~~---~~~~~~~~~~ 314 (424) .+.+.....+ .++.....++.+++ .-.+|..||+|.+|.|-|.+++....+.-.+.++..+. |+-..+-+.. T Consensus 739 ~~i~~~~~~~~~~~~a~s~Ll~~l~----~~~ff~~LRTkeQLGY~Vfs~~~~v~~~~gi~f~vqS~~~~p~~L~~r~~~ 814 (937) T COG1025 739 AAILYPQQYDEIKSSALSSLLGQLI----HPWFFDQLRTKEQLGYAVFSGPREVGRTPGIGFLVQSNSKSPSYLLERINA 814 (937) T ss_pred CEEEECCCCCHHHHHHHHHHHHHHH----HHHHHHHHHHHHHCCEEEEECCEEECCCCCEEEEEECCCCCHHHHHHHHHH T ss_conf 1067514666088799999999997----677687764566404389740234257254599995799882999999999 Q ss_pred HHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCC--CCCCHHHHHHHHHCCCHHHHHHHHHHHC Q ss_conf 89999853148998999999999999998513898999999999999689--8479899999887279899999999862 Q gi|254780464|r 315 IVEVVQSLLENIEQREIDKECAKIHAKLIKSQERSYLRALEISKQVMFCG--SILCSEKIIDTISAITCEDIVGVAKKIF 392 (424) Q Consensus 315 i~~~i~~l~~~i~~~el~~ak~~~~~~~~~~~~~~~~~a~~l~~~~~~~~--~~~~~~~~~~~i~~vt~~di~~~~~~~l 392 (424) ..+.......++++++++..|..+.+++.....+..-++..++ ..+..| +.+..+..++.++.+|.+++.++....+ T Consensus 815 F~~~~~~~l~~ms~e~Fe~~k~alin~il~~~~nl~e~a~r~~-~~~~~g~~~Fd~~ek~i~~vk~LT~~~l~~f~~~~l 893 (937) T COG1025 815 FLETAEPELREMSEEDFEQIKKALINQILQPPQNLAEEASRLW-KAFGRGNLDFDHREKKIEAVKTLTKQKLLDFFENAL 893 (937) T ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHH-HHHCCCCCCCCCHHHHHHHHHHCCHHHHHHHHHHHH T ss_conf 9998999987189999999999999998716888899999999-996368778673789999998649999999999751 Q ss_pred CCC Q ss_conf 778 Q gi|254780464|r 393 SST 395 (424) Q Consensus 393 ~~~ 395 (424) ... T Consensus 894 ~~~ 896 (937) T COG1025 894 SYE 896 (937) T ss_pred CCC T ss_conf 103 No 15 >KOG0959 consensus Probab=99.65 E-value=1.2e-13 Score=110.42 Aligned_cols=371 Identities=13% Similarity=0.162 Sum_probs=237.9 Q ss_pred CEEEEECCC---CCEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHCCEEEEEECCCEEEEEE Q ss_conf 199998589---98799999983311448888983489999897335688898899999997087679975322379996 Q gi|254780464|r 12 ITVITEVMP---IDSAFVKVNIRAGSRNERQEEHGMAHFLEHMLFKGTTKRTAKEIVEEIEKVGGDINAYTSLEHTSYHA 88 (424) Q Consensus 12 l~v~~~~~~---~~~~~i~~~~~~Gs~~e~~~~~G~ah~lehl~~~gt~~~~~~~~~~~l~~~G~~~na~t~~d~~~~~~ 88 (424) .++..++++ .|.+.+.+.+..=-....+...++++++..++.-. ++ +..-.....|-.+..+.+.......+ T Consensus 519 ~~lw~k~dd~f~~Pka~~~~~~~~p~~~~~~~~~~l~~l~~~~l~d~---l~--E~~Y~A~~aGl~~~~~~s~~G~~~~v 593 (974) T KOG0959 519 SELWYKQDDKFNVPKAYTKFDFICPGATQSPLNSVLSTLYVRLLKDQ---LN--EYLYPALLAGLTYSLSSSSKGVELRV 593 (974) T ss_pred CEEEEECCCCCCCCHHHEEEEECCCCCCCCHHHHHHHHHHHHHHHHH---HH--HHHHHHHHCCCEEEEEECCCCEEEEE T ss_conf 24677426634564022034422764326788889999999999987---75--77778886064577642588548987 Q ss_pred ECCCCCCCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHH-CCCCCCCCCHHHCCCCCCCCCCCCCCCCCHHHHHHHHH Q ss_conf 12665652100122232210356620011245567666651-13321110000001222222222222221034420001 Q gi|254780464|r 89 WVLKEHVPLALEIIGDMLSNSSFNPSDIERERNVVLEEIGM-SEDDSWDFLDARFSEMVWKDQIIGRPILGKPETISSFT 167 (424) Q Consensus 89 ~~~~~~~~~~l~ll~~~~~~p~~~~~~~~~ek~~v~~e~~~-~~~~p~~~~~~~~~~~~f~~~~~~~~~~g~~~~i~~lt 167 (424) .+..+++..+++.+.+++.+-..++..|+..++.+..+++. ..++|..++.+.+ ..+..+..+...- ..+.++.++ T Consensus 594 ~Gfnekl~~ll~~~~~~~~~f~~~~~rf~iike~~~~~~~n~~~~~p~~~a~~~~-~lll~~~~W~~~e--~~~al~~~~ 670 (974) T KOG0959 594 SGFNEKLPLLLEKVVQMMANFELDEDRFEIIKELLKRELRNHAFDNPYQLANDYL-LLLLEESIWSKEE--LLEALDDVT 670 (974) T ss_pred ECCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHH-HHHHHCCCCCHHH--HHHHHHCCC T ss_conf 4157533999999999987526668898999999999986553036789999999-9983210135699--998861152 Q ss_pred HHHHHHHHCCCCCCCCCHHHHCCCCCHHHHHHHHHHCCCCCCCCCCCCCC----------CCCCCCCEEEEECCC----C Q ss_conf 66776420112456630000025455122112211012358777788664----------223322102330368----4 Q gi|254780464|r 168 PEKIISFVSRNYTADRMYVVCVGAVDHEFCVSQVESYFNVCSVAKIKESM----------KPAVYVGGEYIQKRD----L 233 (424) Q Consensus 168 ~~~l~~f~~~~y~p~n~~l~i~Gd~~~~~~~~~i~~~f~~~~~~~~~~~~----------~p~~~~~~~~~~~~~----~ 233 (424) .+++..|-..++.+.-+...|.||++.++..++++...+.++...+...+ .-....|...+.... - T Consensus 671 le~~~~F~~~~~~~~~~e~~i~GN~te~~A~~l~~~v~d~l~~~~~~~~p~~~~~~~~~~~~~lp~G~~~~~~~~~n~~~ 750 (974) T KOG0959 671 LEDLESFISEFLQPFHLELLIHGNLTEKEALQLLKSVLDILKSAAPNSRPLFRSEHLPRREIQLPNGDYYFYRHLLNKTD 750 (974) T ss_pred HHHHHHHHHHHHHHHHEEEEEECCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCEECCCCCEEEEECCCCCCC T ss_conf 87788789987543325788736756588999999987665224788766541003765426606886589973666678 Q ss_pred CCCEEEECCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEEE--CCCCHHHHH Q ss_conf 321023010000135211356899877762487653222200026862101333222101220134442--583024777 Q gi|254780464|r 234 AEEHMMLGFNGCAYQSRDFYLTNILASILGDGMSSRLFQEVREKRGLCYSISAHHENFSDNGVLYIASA--TAKENIMAL 311 (424) Q Consensus 234 ~~~~i~~~~~~~~~~~~d~~al~ll~~iLg~g~sSrL~~~lRe~~glaY~v~s~~~~~~~~g~~~i~~~--~~~~~~~~~ 311 (424) +.+.+.+.+.+-..+..+...+.++..++.+ .+|..||++..|.|-|++......+.-.+.+... -.++-++.- T Consensus 751 ~ns~i~~~~Q~~~~~~~~~~~~~L~~~li~e----p~Fd~LRTkeqLGYiv~~~~r~~~G~~~~~i~Vqs~~~~~~le~r 826 (974) T KOG0959 751 DNSCIEVYYQIGVQDTRDNAVLGLLEQLIKE----PAFDQLRTKEQLGYIVSTGVRLNYGTVGLQITVQSEKSVDYLEER 826 (974) T ss_pred CCCEEEEEEECCCCHHHHHHHHHHHHHHHCC----CHHHHHHHHHHHCEEEEEEEEEECCCCEEEEEECCCCCCHHHHHH T ss_conf 8742799997066505788999999999604----137766667762717642235406853068997068881379999 Q ss_pred HHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCC-CCCCHHHHHHHHHCCCHHHHHHHHHH Q ss_conf 87789999853148998999999999999998513898999999999999689-84798999998872798999999998 Q gi|254780464|r 312 TSSIVEVVQSLLENIEQREIDKECAKIHAKLIKSQERSYLRALEISKQVMFCG-SILCSEKIIDTISAITCEDIVGVAKK 390 (424) Q Consensus 312 ~~~i~~~i~~l~~~i~~~el~~ak~~~~~~~~~~~~~~~~~a~~l~~~~~~~~-~~~~~~~~~~~i~~vt~~di~~~~~~ 390 (424) +....+.+.+.....++++++..+..+.........+......+.+.....+. ..+..+..++++..+|.+++-.+... T Consensus 827 Ie~fl~~~~~~i~~m~~e~Fe~~~~~lI~~~~ek~~~l~~e~~~~w~ei~~~~y~f~r~~~~v~~l~~i~k~~~i~~f~~ 906 (974) T KOG0959 827 IESFLETFLEEIVEMSDEEFEKHKSGLIASKLEKPKNLSEESSRYWDEIIIGQYNFDRDEKEVEALKKITKEDVINFFDE 906 (974) T ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999985666566565789998876547506688888899887411000056789999986128889999986 Q ss_pred HCCC Q ss_conf 6277 Q gi|254780464|r 391 IFSS 394 (424) Q Consensus 391 ~l~~ 394 (424) ++.. T Consensus 907 ~~~~ 910 (974) T KOG0959 907 YIRK 910 (974) T ss_pred HCCC T ss_conf 3501 No 16 >KOG2019 consensus Probab=99.60 E-value=1.7e-13 Score=109.50 Aligned_cols=381 Identities=15% Similarity=0.134 Sum_probs=225.7 Q ss_pred CCCCCEEEEECCCCC-EEEEEEEECCCCCCCCCCCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHCCEEEEE--ECCC-- Q ss_conf 798719999858998-79999998331144888898348999989733568889889999999708767997--5322-- Q gi|254780464|r 8 TSSGITVITEVMPID-SAFVKVNIRAGSRNERQEEHGMAHFLEHMLFKGTTKRTAKEIVEEIEKVGGDINAY--TSLE-- 82 (424) Q Consensus 8 L~NGl~v~~~~~~~~-~~~i~~~~~~Gs~~e~~~~~G~ah~lehl~~~gt~~~~~~~~~~~l~~~G~~~na~--t~~d-- 82 (424) .-||++|...+.... .+.+++.+.-|+.-+.- -+=+.-+.+.|+..||...+-.++...|...-|.++++ ++.+ T Consensus 566 dingvkv~~~dl~tngi~Y~r~~~~l~~~p~eL-~PylPlfc~sll~lGt~~lsf~el~qqI~rkTGGiS~~p~~~s~~~ 644 (998) T KOG2019 566 DINGVKVQRCDLFTNGITYTRVVFDLNSLPEEL-LPYLPLFCQSLLNLGTGDLSFVELEQQIGRKTGGISVSPLVSSDDG 644 (998) T ss_pred ECCCCEEEEEECCCCCEEEEEEEECCCCCCHHH-HCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCEEECEEECCCCC T ss_conf 026853577321568747999861124381776-5013899999986287865389999986020586551103216899 Q ss_pred ------EEEEEEECCCCCCCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHH---CCCCCCCCCHHHCCCCCCCCCCCC Q ss_conf ------37999612665652100122232210356620011245567666651---133211100000012222222222 Q gi|254780464|r 83 ------HTSYHAWVLKEHVPLALEIIGDMLSNSSFNPSDIERERNVVLEEIGM---SEDDSWDFLDARFSEMVWKDQIIG 153 (424) Q Consensus 83 ------~~~~~~~~~~~~~~~~l~ll~~~~~~p~~~~~~~~~ek~~v~~e~~~---~~~~p~~~~~~~~~~~~f~~~~~~ 153 (424) .+.+...++.++.+..++|+..++.++.|..+ ++.|..|.+...+ ...+........-..+.+.....= T Consensus 645 ~d~p~~~i~~~~~~l~rn~~dlfel~n~il~e~~f~n~--dkfkvlvk~s~s~~~n~i~dsGH~~A~~rs~a~l~~ag~i 722 (998) T KOG2019 645 MDEPELGIVFSGSMLDRNADDLFELWNKILQETCFTNQ--DKFKVLVKQSASRMTNGIADSGHGFAAARSAAMLTPAGWI 722 (998) T ss_pred CCCCCEEEEECCHHHCCCHHHHHHHHHHHHHCCCCCCH--HHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCHH T ss_conf 87610378852011037706899999999715676528--8999999999887632566555046776522246744006 Q ss_pred CCCCCHHHH---HHHH---H-------HHHHHHHHCCCCCCCCCHHHHCCCC-CHHHHHHHHHHCCCCCCCCCCCCC--- Q ss_conf 222210344---2000---1-------6677642011245663000002545-512211221101235877778866--- Q gi|254780464|r 154 RPILGKPET---ISSF---T-------PEKIISFVSRNYTADRMYVVCVGAV-DHEFCVSQVESYFNVCSVAKIKES--- 216 (424) Q Consensus 154 ~~~~g~~~~---i~~l---t-------~~~l~~f~~~~y~p~n~~l~i~Gd~-~~~~~~~~i~~~f~~~~~~~~~~~--- 216 (424) ..-+|-.+. +..+ . .+.|.+..+-....+||.+.|..|- .+..+.+.+++++..+|...+... T Consensus 723 ~EqlgGl~ql~fl~~L~~~~d~d~~~i~~kL~eIrk~ll~~ng~~~~itAd~~q~~~vEkav~kFl~~lp~e~p~g~~st 802 (998) T KOG2019 723 SEQLGGLSQLEFLHRLEEKVDNDWEPIVSKLTEIRKSLLNTNGMIVNITADPKQLTNVEKAVEKFLDSLPRENPSGSKST 802 (998) T ss_pred HHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCHHHHHHHHHHHHHCCCCCCCCCCCC T ss_conf 77736629999999999975300788988899999997357980799835951000599999999985654589988567 Q ss_pred CCCCCCCCEEEE-ECCCCCC-CEEEECCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCC Q ss_conf 422332210233-0368432-10230100001352113568998777624876532222000268621013332221012 Q gi|254780464|r 217 MKPAVYVGGEYI-QKRDLAE-EHMMLGFNGCAYQSRDFYLTNILASILGDGMSSRLFQEVREKRGLCYSISAHHENFSDN 294 (424) Q Consensus 217 ~~p~~~~~~~~~-~~~~~~~-~~i~~~~~~~~~~~~d~~al~ll~~iLg~g~sSrL~~~lRe~~glaY~v~s~~~~~~~~ 294 (424) ..+-.....+.+ +.-+.-| .++.-+..++++.++|..+++|++.+|.. --|..++||+.| ||+-++.+... . T Consensus 803 ~d~r~p~~~~~i~~~~P~fqvnyvgka~~~vpyt~~d~asl~vlS~~lt~---k~Lh~evRekGG-AYGgg~s~~sh--~ 876 (998) T KOG2019 803 WDARLPLRSEAIRVVIPTFQVNYVGKAGLGVPYTHPDGASLQVLSKLLTN---KWLHDEVREKGG-AYGGGCSYSSH--S 876 (998) T ss_pred CCCCCCCCCEEEEEECCCCEEHHHHHHCCCCCCCCCCCCHHHHHHHHHHH---HHHHHHHHHHCC-CCCCCCCCCCC--C T ss_conf 65557777604788526620001233215656778898579999998877---888889887367-65776553245--6 Q ss_pred CCEEEEEECCCCHHHHHHHHHHHHHHHHCC-CCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCHHHHH Q ss_conf 201344425830247778778999985314-8998999999999999998513898999999999999689847989999 Q gi|254780464|r 295 GVLYIASATAKENIMALTSSIVEVVQSLLE-NIEQREIDKECAKIHAKLIKSQERSYLRALEISKQVMFCGSILCSEKII 373 (424) Q Consensus 295 g~~~i~~~~~~~~~~~~~~~i~~~i~~l~~-~i~~~el~~ak~~~~~~~~~~~~~~~~~a~~l~~~~~~~~~~~~~~~~~ 373 (424) |.|.++.--+|+-++. ++.....-.-++. .+++++|++||-...+..- +...|...- + ...+++-.-+..+... T Consensus 877 GvfSf~SYRDpn~lkt-L~~f~~tgd~~~~~~~~~~dldeAkl~~f~~VD-ap~~P~~kG--~-~~fl~gvtDemkQarR 951 (998) T KOG2019 877 GVFSFYSYRDPNPLKT-LDIFDGTGDFLRGLDVDQQDLDEAKLGTFGDVD-APQLPDAKG--L-LRFLLGVTDEMKQARR 951 (998) T ss_pred CEEEEEECCCCCHHHH-HHHHCCCCHHHHCCCCCCCCHHHHHHCCCCCCC-CCCCCCCCC--H-HHHHHCCCHHHHHHHH T ss_conf 4278874269844457-786456103552587660003556630001456-776986200--6-8998538878899989 Q ss_pred HHHHCCCHHHHHHHHHHHCCC-CCEEEEEE Q ss_conf 988727989999999986277-87799981 Q gi|254780464|r 374 DTISAITCEDIVGVAKKIFSS-TPTLAILG 402 (424) Q Consensus 374 ~~i~~vt~~di~~~~~~~l~~-~~~i~~vg 402 (424) +.|-+|+..|+.+++.+|+.. +..+.+.+ T Consensus 952 Eqll~vSl~d~~~vae~yl~~~~~~~~vav 981 (998) T KOG2019 952 EQLLAVSLKDFKAVAEAYLGVGDKGVAVAV 981 (998) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCEEEEE T ss_conf 998863488999999997123666138883 No 17 >KOG0961 consensus Probab=99.45 E-value=7.3e-12 Score=98.55 Aligned_cols=341 Identities=13% Similarity=0.150 Sum_probs=210.4 Q ss_pred HCCCCCCCH----HHHHHHHHH--HCCEEEEE---ECCCEEEEEEECCCCCCCCCHHHHHHHHHCCCCCCCHHHHHHHHH Q ss_conf 335688898----899999997--08767997---532237999612665652100122232210356620011245567 Q gi|254780464|r 53 FKGTTKRTA----KEIVEEIEK--VGGDINAY---TSLEHTSYHAWVLKEHVPLALEIIGDMLSNSSFNPSDIERERNVV 123 (424) Q Consensus 53 ~~gt~~~~~----~~~~~~l~~--~G~~~na~---t~~d~~~~~~~~~~~~~~~~l~ll~~~~~~p~~~~~~~~~ek~~v 123 (424) ..||-+-+. .++.+.+-+ +..+++.. +-++...+.+.|.+++.+...++...++....|+++.+....+.. T Consensus 596 ldgtiptp~~~s~~~v~~~~~s~~id~si~~g~~G~~~~lvn~~Ikv~a~~Y~~~v~Wi~~~l~~~VfD~~Ri~~~~~~~ 675 (1022) T KOG0961 596 LDGTIPTPVLTSADDVAKHFTSDLIDHSIQVGVSGLYDRLVNLRIKVGADKYPLLVKWIQIFLQGVVFDPSRIHQCAQKL 675 (1022) T ss_pred HCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHEEEEEEECCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHH T ss_conf 56889950413499999987765533332035455643150589997468863568999998655000889999999998 Q ss_pred HHHHHHCCCCCCCCCHHHCCCCCCCCCCCCCCC--CCHHHHHHHHH-------------HHHHHHHHCCCCCCCCCHHHH Q ss_conf 666651133211100000012222222222222--21034420001-------------667764201124566300000 Q gi|254780464|r 124 LEEIGMSEDDSWDFLDARFSEMVWKDQIIGRPI--LGKPETISSFT-------------PEKIISFVSRNYTADRMYVVC 188 (424) Q Consensus 124 ~~e~~~~~~~p~~~~~~~~~~~~f~~~~~~~~~--~g~~~~i~~lt-------------~~~l~~f~~~~y~p~n~~l~i 188 (424) +.++..+..|...++.-.+...+|+++.+..+. +--++-++.+. .++++. |.- ..+...+.+ T Consensus 676 l~~i~~~KRdg~~vlss~~~~~lY~~~slk~s~d~L~~Ek~l~ei~~~v~n~~~~Il~~~e~mR~-y~l--~~n~~~ihv 752 (1022) T KOG0961 676 LGEIRDRKRDGCTVLSSAVASMLYGKNSLKISFDELVLEKLLEEISKDVMNNPEAILEKLEQMRS-YAL--FSNGVNIHV 752 (1022) T ss_pred HHHHHHHHCCCCEEHHHHHHHHHHCCCCHHHCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHH-HHH--HHCCEEEEE T ss_conf 75425540476076688888887334303320318889999999999985097989999999999-987--516647999 Q ss_pred CCCCCH-HHHHHHHHHCCCCCCCCCC----CC-----CCCCCCCCC--EEE-EECCCCCCCEEEECCCC-CCCCCCHHHH Q ss_conf 254551-2211221101235877778----86-----642233221--023-30368432102301000-0135211356 Q gi|254780464|r 189 VGAVDH-EFCVSQVESYFNVCSVAKI----KE-----SMKPAVYVG--GEY-IQKRDLAEEHMMLGFNG-CAYQSRDFYL 254 (424) Q Consensus 189 ~Gd~~~-~~~~~~i~~~f~~~~~~~~----~~-----~~~p~~~~~--~~~-~~~~~~~~~~i~~~~~~-~~~~~~d~~a 254 (424) +||++. +....-.........-+.+ .. ...+..... ... +..+....+.++...|. .+|.+|++.+ T Consensus 753 vgDI~kid~~~~~Wn~l~~~~~~~nP~~~f~~tf~~~~~~s~e~gsssk~~~I~~p~sESs~l~~sip~~~~w~dpel~~ 832 (1022) T KOG0961 753 VGDIDKIDPKMLSWNWLQADPRFGNPGHQFSATFEAGENVSLELGSSSKELLIGVPGSESSFLYQSIPLDANWNDPELIP 832 (1022) T ss_pred EEEHHCCCCCCCCCHHHHCCCCCCCCHHHCCCCCCCCCCCCEECCCCCCEEEECCCCCCCCCEEEECCCCCCCCCCCHHH T ss_conf 82120078644573555508222894142013001576443220687632685178864533145245446667832128 Q ss_pred HHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCC---CCHHHH Q ss_conf 89987776248765322220002686210133322210122013444258302477787789999853148---998999 Q gi|254780464|r 255 TNILASILGDGMSSRLFQEVREKRGLCYSISAHHENFSDNGVLYIASATAKENIMALTSSIVEVVQSLLEN---IEQREI 331 (424) Q Consensus 255 l~ll~~iLg~g~sSrL~~~lRe~~glaY~v~s~~~~~~~~g~~~i~~~~~~~~~~~~~~~i~~~i~~l~~~---i~~~el 331 (424) ..++.++|+- +-+++.+.|| ..||||+++.+...-...-.+.||.+.++-+. ...-.++++++..| +++.++ T Consensus 833 ~~l~~~YL~~-~eGPfW~~IR-G~GLAYGanm~~~~d~~~~~~~iyr~ad~~ka---ye~~rdiV~~~vsG~~e~s~~~~ 907 (1022) T KOG0961 833 AMLFGQYLSQ-CEGPFWRAIR-GDGLAYGANMFVKPDRKQITLSIYRCADPAKA---YERTRDIVRKIVSGSGEISKAEF 907 (1022) T ss_pred HHHHHHHHHH-CCCCHHHHHC-CCCHHCCCEEEEECCCCEEEEEEECCCCHHHH---HHHHHHHHHHHHCCCEEECHHHH T ss_conf 9999999985-4460465403-54010154058841687789996237718999---99888999998637502318875 Q ss_pred HHHHHHHHHHHHHHCCC-H-HHHHHHHHHHHHHCCCCCCHHHHHHHHHCCCHHHHHHHHHHH----CCCCCEEEEEE Q ss_conf 99999999999851389-8-999999999999689847989999988727989999999986----27787799981 Q gi|254780464|r 332 DKECAKIHAKLIKSQER-S-YLRALEISKQVMFCGSILCSEKIIDTISAITCEDIVGVAKKI----FSSTPTLAILG 402 (424) Q Consensus 332 ~~ak~~~~~~~~~~~~~-~-~~~a~~l~~~~~~~~~~~~~~~~~~~i~~vt~~di~~~~~~~----l~~~~~i~~vg 402 (424) +-||......+.. .|+ . +.-+....-....+.+......+.++|.+||.+++.+..+.| |+++..+..|. T Consensus 908 egAk~s~~~~~~~-~Eng~~~~a~~~~~l~~~~q~~~~fn~~~leri~nvT~~~~~~~~~~y~~~~Fds~~~va~i~ 983 (1022) T KOG0961 908 EGAKRSTVFEMMK-RENGTVSGAAKISILNNFRQTPHPFNIDLLERIWNVTSEEMVKIGGPYLARLFDSKCFVASIA 983 (1022) T ss_pred CCCHHHHHHHHHH-HHCCCEECHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHCCCCEEHHHCCCCCEEEEE T ss_conf 2325779999998-743613005779999998716996218999999875099998733650324315667468996 No 18 >pfam03410 Peptidase_M44 Protein G1. Protein G1, named after the vaccinia virus protein, is a glycoprotein expressed by many Poxviridae. Probab=98.55 E-value=1.9e-07 Score=68.94 Aligned_cols=186 Identities=15% Similarity=0.144 Sum_probs=103.7 Q ss_pred EECCCCCEEEEECCCCCEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHCCEEEEEECCCEEE Q ss_conf 98798719999858998799999983311448888983489999897335688898899999997087679975322379 Q gi|254780464|r 6 SKTSSGITVITEVMPIDSAFVKVNIRAGSRNERQEEHGMAHFLEHMLFKGTTKRTAKEIVEEIEKVGGDINAYTSLEHTS 85 (424) Q Consensus 6 ~~L~NGl~v~~~~~~~~~~~i~~~~~~Gs~~e~~~~~G~ah~lehl~~~gt~~~~~~~~~~~l~~~G~~~na~t~~d~~~ 85 (424) ..|+||+||...+.-..-+.+++. +.|--.|=.+-.|+||+|||.+- .++...+ --||+|++.+.+ T Consensus 2 IvL~NGVRiFin~~M~KDIYlGIs-~FGFE~DI~~iLGiAHLLEHILI----sFD~~~F---------~ANASTaRsYMS 67 (590) T pfam03410 2 IVLENGVRVFINPGMGKDIYIGLS-NFGFENDIPEILGIAHLLEHILI----SFDYTRF---------VANASTARSYMS 67 (590) T ss_pred EEECCCEEEEECCCCCCCEEEEEC-CCCCCCCHHHHHHHHHHHHHHEE----ECCHHHH---------CCCCCHHHHHHH T ss_conf 682485599856776664599653-45664433668769998867527----5355550---------135410344565 Q ss_pred EEEECCCCC-CCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCC-----CCCCCCCHHHCCCCCCCCCCCCCCCCCH Q ss_conf 996126656-5210012223221035662001124556766665113-----3211100000012222222222222210 Q gi|254780464|r 86 YHAWVLKEH-VPLALEIIGDMLSNSSFNPSDIERERNVVLEEIGMSE-----DDSWDFLDARFSEMVWKDQIIGRPILGK 159 (424) Q Consensus 86 ~~~~~~~~~-~~~~l~ll~~~~~~p~~~~~~~~~ek~~v~~e~~~~~-----~~p~~~~~~~~~~~~f~~~~~~~~~~g~ 159 (424) |-+...+.. ...++.-+..|+..-.--...|.+ ..+...+++.+ .+....+++.+. .+.++.-|. .|. T Consensus 68 FWC~sirg~~~~DAvrtliSWFF~~g~Lr~~F~~--~~i~~~ikELENEYYFRnEvfHCmDvLT-fL~gGDLYN---GGR 141 (590) T pfam03410 68 FWCRAIRAETYLDALRTLVSWFFARGALKDDFSR--SRIRNHIKELENEYYFRNEVFHCMDILT-FLGGGDLYN---GGR 141 (590) T ss_pred HHHHHHCCCCHHHHHHHHHHHHHCCCCCCCCCCH--HHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHCCCCCCC---CCH T ss_conf 6657502996678999999982048963024646--5699999987432422324677888988-853886557---724 Q ss_pred HHHHHHHH--HHHHHHHHCCCCCCCCCHHHHCCCCCHHHHHHHHHHCCCCCCCCCCC Q ss_conf 34420001--66776420112456630000025455122112211012358777788 Q gi|254780464|r 160 PETISSFT--PEKIISFVSRNYTADRMYVVCVGAVDHEFCVSQVESYFNVCSVAKIK 214 (424) Q Consensus 160 ~~~i~~lt--~~~l~~f~~~~y~p~n~~l~i~Gd~~~~~~~~~i~~~f~~~~~~~~~ 214 (424) ...|++++ ++.+.+-.++.-.| |.++. |-.++ +.+..++++.||.+|.-+.. T Consensus 142 i~ML~~l~~i~~~L~~RM~~I~Gp-niVIF-Vk~l~-~~~l~ll~~TFG~LP~~P~~ 195 (590) T pfam03410 142 LSMLDQLDAVRELLGKRMRRIAGP-NVVIF-VRELS-PAALALLERTFGTLPACPST 195 (590) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCC-CEEEE-EECCC-HHHHHHHHHHCCCCCCCCCC T ss_conf 888762578999999988861089-68999-84078-89999999762788789620 No 19 >COG0612 PqqL Predicted Zn-dependent peptidases [General function prediction only] Probab=98.48 E-value=1.2e-05 Score=56.95 Aligned_cols=161 Identities=14% Similarity=0.138 Sum_probs=104.9 Q ss_pred CCCCEEEECCCCCCCCCCH--HHHHHHHHHHHHCCCCCC----CCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEEECCCC Q ss_conf 4321023010000135211--356899877762487653----2222000268621013332221012201344425830 Q gi|254780464|r 233 LAEEHMMLGFNGCAYQSRD--FYLTNILASILGDGMSSR----LFQEVREKRGLCYSISAHHENFSDNGVLYIASATAKE 306 (424) Q Consensus 233 ~~~~~i~~~~~~~~~~~~d--~~al~ll~~iLg~g~sSr----L~~~lRe~~glaY~v~s~~~~~~~~g~~~i~~~~~~~ 306 (424) .+...+.+.+...+...++ .=...++.+++..|...+ +.+. -+..|....+++++. ....+..+.++ T Consensus 36 ~~~vs~~~~v~~Gs~~e~~~~~G~AH~lehm~fkgt~~~~~~~i~~~-~~~~G~~~na~ts~d------~t~y~~~~l~~ 108 (438) T COG0612 36 APTVSLDVWVKAGSRAEPAGKAGIAHFLEHMAFKGTTGLPSAELAEA-FEKLGGQLNAFTSFD------YTVYYLSVLPD 108 (438) T ss_pred CCEEEEEEEEECCCCCCCCCHHHHHHHHHHHHHCCCCCCCHHHHHHH-HHHCCCCEEEEECCC------CEEEEEECCHH T ss_conf 87489999994364467886233999999997179988876789999-997599145675255------13455214665 Q ss_pred HHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCH--HHHHHHHHCCCHHHH Q ss_conf 247778778999985314899899999999999999851389899999999999968984798--999998872798999 Q gi|254780464|r 307 NIMALTSSIVEVVQSLLENIEQREIDKECAKIHAKLIKSQERSYLRALEISKQVMFCGSILCS--EKIIDTISAITCEDI 384 (424) Q Consensus 307 ~~~~~~~~i~~~i~~l~~~i~~~el~~ak~~~~~~~~~~~~~~~~~a~~l~~~~~~~~~~~~~--~~~~~~i~~vt~~di 384 (424) +.+++++.+.+.+..- .+++++|++.|..+...+....+++..++.......++++.+... .-..+.|+++|++|+ T Consensus 109 ~~~~~l~llad~l~~p--~f~~~~~e~Ek~vil~ei~~~~d~p~~~~~~~l~~~~~~~~p~~~~~~G~~e~I~~it~~dl 186 (438) T COG0612 109 NLDKALDLLADILLNP--TFDEEEVEREKGVILEEIRMRQDDPDDLAFERLLEALYGNHPLGRPILGTEESIEAITREDL 186 (438) T ss_pred HHHHHHHHHHHHHHCC--CCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHCCHHHH T ss_conf 3899999999998578--99989999999999999998403878899999999974899788899889788734799999 Q ss_pred HHHHHHHCCCCCE-EEEEE Q ss_conf 9999986277877-99981 Q gi|254780464|r 385 VGVAKKIFSSTPT-LAILG 402 (424) Q Consensus 385 ~~~~~~~l~~~~~-i~~vg 402 (424) +++.++|+.++.. |+++| T Consensus 187 ~~f~~k~Y~p~n~~l~vvG 205 (438) T COG0612 187 KDFYQKWYQPDNMVLVVVG 205 (438) T ss_pred HHHHHHHCCCCCEEEEEEC T ss_conf 9999985681473899977 No 20 >pfam08367 M16C_assoc Peptidase M16C associated. This domain appears in eukaryotes as well as bacteria and tends to be found near the C-terminus of the metalloprotease M16C (pfam05193). Probab=98.25 E-value=1e-05 Score=57.27 Aligned_cols=26 Identities=15% Similarity=0.243 Sum_probs=12.4 Q ss_pred CEEEEEECCCCHHHHHHHHHHHHHHH Q ss_conf 01344425830247778778999985 Q gi|254780464|r 296 VLYIASATAKENIMALTSSIVEVVQS 321 (424) Q Consensus 296 ~~~i~~~~~~~~~~~~~~~i~~~i~~ 321 (424) .|.+.+.|-..++.++.+.+.+.+.+ T Consensus 161 ~l~vs~k~L~~n~~~~~~L~~eil~~ 186 (248) T pfam08367 161 SFVISGKALNSKVPKLFELISEILNE 186 (248) T ss_pred EEEEEEEEHHHHHHHHHHHHHHHHHC T ss_conf 79999761753189999999999873 No 21 >pfam00675 Peptidase_M16 Insulinase (Peptidase family M16). Probab=97.15 E-value=0.0088 Score=37.64 Aligned_cols=131 Identities=10% Similarity=0.035 Sum_probs=81.8 Q ss_pred EECCCCCCCEEEECCCCCCCCCCHHH--HHHHHHHHHHCCCCCCCC---CCCCCCCCCCCHHHHHHHHHHCCCCEEEEEE Q ss_conf 30368432102301000013521135--689987776248765322---2200026862101333222101220134442 Q gi|254780464|r 228 IQKRDLAEEHMMLGFNGCAYQSRDFY--LTNILASILGDGMSSRLF---QEVREKRGLCYSISAHHENFSDNGVLYIASA 302 (424) Q Consensus 228 ~~~~~~~~~~i~~~~~~~~~~~~d~~--al~ll~~iLg~g~sSrL~---~~lRe~~glaY~v~s~~~~~~~~g~~~i~~~ 302 (424) +++++.+.+.+.+.+.+-+..++... ...++.+++..|...+=- .+.-+..|...++++. . -.+.+++. T Consensus 5 ~~~~~~p~~si~l~v~~Gs~~E~~~~~Glahlle~~~f~Gt~~~~~~~i~~~~e~~G~~~na~t~----~--d~t~~~~~ 78 (149) T pfam00675 5 EHDPPADTSAVGLWVDAGSRYEPDDNNGLAHFLEHMAFKGTKKYPSNELEEELEKLGGSLNAYTS----R--EHTVYYVE 78 (149) T ss_pred EECCCCCEEEEEEEECCCCCCCCCCCCHHHHHHHHHHCCCCCCCCCHHHHHHHHHCCCEEEEEEC----C--CCEEEEEE T ss_conf 97699997999999770368889997229999999972899889826899999984966876415----7--85699960 Q ss_pred CCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCC Q ss_conf 5830247778778999985314899899999999999999851389899999999999968984 Q gi|254780464|r 303 TAKENIMALTSSIVEVVQSLLENIEQREIDKECAKIHAKLIKSQERSYLRALEISKQVMFCGSI 366 (424) Q Consensus 303 ~~~~~~~~~~~~i~~~i~~l~~~i~~~el~~ak~~~~~~~~~~~~~~~~~a~~l~~~~~~~~~~ 366 (424) +.+++.+.+++.+.+.+..- .++++++++.|+.+...+....+++..++.+.....++++.+ T Consensus 79 ~l~~~~~~~l~ll~d~l~~p--~f~~~e~e~ek~~v~~e~~~~~~~p~~~~~~~l~~~~f~~hp 140 (149) T pfam00675 79 VLNDDLPKAVDRLADFFLNP--LFSPSEVERERLVVEYEVEAVDAEPQEVLLDNLHAAAYRGTP 140 (149) T ss_pred CCHHHHHHHHHHHHHHHCCC--CCCHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCC T ss_conf 66889999999999985188--999999999999999999988769899999999998529998 No 22 >KOG0960 consensus Probab=96.59 E-value=0.063 Score=31.92 Aligned_cols=10 Identities=10% Similarity=0.225 Sum_probs=6.1 Q ss_pred CCCHHHHHHH Q ss_conf 2798999999 Q gi|254780464|r 378 AITCEDIVGV 387 (424) Q Consensus 378 ~vt~~di~~~ 387 (424) +||-++|.++ T Consensus 369 ~vteaEV~RA 378 (467) T KOG0960 369 SVTEAEVERA 378 (467) T ss_pred HCCHHHHHHH T ss_conf 6169999999 No 23 >KOG2067 consensus Probab=96.46 E-value=0.012 Score=36.69 Aligned_cols=12 Identities=33% Similarity=0.420 Sum_probs=4.2 Q ss_pred HHHHHHHHHCCC Q ss_conf 899877762487 Q gi|254780464|r 255 TNILASILGDGM 266 (424) Q Consensus 255 l~ll~~iLg~g~ 266 (424) +.+...+|+... T Consensus 219 v~~~~~~~~~~~ 230 (472) T KOG2067 219 VEIAEKLLGDLP 230 (472) T ss_pred HHHHHHHHCCCC T ss_conf 999999735487 No 24 >TIGR02110 PQQ_syn_pqqF coenzyme PQQ biosynthesis protein PqqF; InterPro: IPR011844 Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site . The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This family of metallopeptidases belong to MEROPS peptidase family M16, subfamily M16A (clan ME). In the subset of species that make coenzyme PQQ (pyrrolo-quinoline-quinone), this peptidase is found in the PQQ biosynthesis region and is thought to act as a protease on PqqA (IPR011725 from INTERPRO), a probable peptide precursor of the coenzyme. PQQ is required for some glucose dehydrogenases and alcohol dehydrogenases .; GO: 0008234 cysteine-type peptidase activity, 0008270 zinc ion binding, 0018189 pyrroloquinoline quinone biosynthetic process. Probab=95.06 E-value=0.063 Score=31.91 Aligned_cols=223 Identities=13% Similarity=0.119 Sum_probs=114.7 Q ss_pred HCCEEEEEECCCEEEEEEECCCCCCCCCHHHHHHHHHCCCCCCCHHHHHHHHH-HHHHHHCCCCCCCCCHHHCCCCCCCC Q ss_conf 08767997532237999612665652100122232210356620011245567-66665113321110000001222222 Q gi|254780464|r 71 VGGDINAYTSLEHTSYHAWVLKEHVPLALEIIGDMLSNSSFNPSDIERERNVV-LEEIGMSEDDSWDFLDARFSEMVWKD 149 (424) Q Consensus 71 ~G~~~na~t~~d~~~~~~~~~~~~~~~~l~ll~~~~~~p~~~~~~~~~ek~~v-~~e~~~~~~~p~~~~~~~~~~~~f~~ 149 (424) -|+++.-....++=.....+|..-...++..-......+ ...+....+.| -.|. ..++..+.-+++.+.+ ++ T Consensus 509 AGv~~~~~~~g~~W~L~L~gp~~~~~A~~~~~l~~l~~a---~P~~~~~~~~~ssaer---~as~p~i~IRQLL~aL-p~ 581 (737) T TIGR02110 509 AGVELEFEALGEDWQLKLNGPRSPMRAVVSLALALLALA---APSLAQEAARLSSAER---RASAPSIPIRQLLAAL-PE 581 (737) T ss_pred CCCCCEEEEECCCHHHHHCCCCCCCCHHHHHHHHHHHCC---CCCHHCCCHHHHHHHH---CCCCCCCHHHHHHHHH-HH T ss_conf 483311121164212220487977727899999997268---8720215201004442---1577761689999960-10 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCC-CCCCCEEEE Q ss_conf 2222222210344200016677642011245663000002545512211221101235877778866422-332210233 Q gi|254780464|r 150 QIIGRPILGKPETISSFTPEKIISFVSRNYTADRMYVVCVGAVDHEFCVSQVESYFNVCSVAKIKESMKP-AVYVGGEYI 228 (424) Q Consensus 150 ~~~~~~~~g~~~~i~~lt~~~l~~f~~~~y~p~n~~l~i~Gd~~~~~~~~~i~~~f~~~~~~~~~~~~~p-~~~~~~~~~ 228 (424) ++.. +.....+.++|+..|..-.. +.+....++++........ . ..... .......| +...+.... T Consensus 582 ~l~~-------~~~A~~d~e~L~~~W~~arw-~~l~~~l~a~~~~a~~~a~-~--~~p~T--a~~~~~~P~Par~g~~~~ 648 (737) T TIGR02110 582 RLLK-------SKPAQVDDEDLQALWLAARW-DSLALALPAAVQLALSVAL-Q--LSPGT--ADVERPTPLPARLGRGWE 648 (737) T ss_pred CCCC-------CCCCCCCCHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHH-H--CCCCC--CCCCCCCCCCCCCCCCEE T ss_conf 2577-------77788981789999999756-5667666899999987776-4--68887--876777888688888604 Q ss_pred E-CCCCCCCEEEECCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHH-CCCCEEEEEECCCC Q ss_conf 0-368432102301000013521135689987776248765322220002686210133322210-12201344425830 Q gi|254780464|r 229 Q-KRDLAEEHMMLGFNGCAYQSRDFYLTNILASILGDGMSSRLFQEVREKRGLCYSISAHHENFS-DNGVLYIASATAKE 306 (424) Q Consensus 229 ~-~~~~~~~~i~~~~~~~~~~~~d~~al~ll~~iLg~g~sSrL~~~lRe~~glaY~v~s~~~~~~-~~g~~~i~~~~~~~ 306 (424) . -.|.....+.+-+|.|..+.++-.|+.||..++- .+.|+.||.+.++.|.|++.+..-. ..|++ +...++.- T Consensus 649 ~la~~gge~ALLlF~PlP~a~~a~eAAwRLLA~l~e----~~F~qRLRVElQlGY~Vf~~~r~va~R~GLL-fa~QSP~~ 723 (737) T TIGR02110 649 ELAVDGGESALLLFCPLPTADVASEAAWRLLAQLLE----PPFFQRLRVELQLGYAVFSRYRRVADRDGLL-FAVQSPDA 723 (737) T ss_pred ECCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHC----CHHHHHHHHHHHHHHHHHHHHHHCCCCCCHH-HCCCCCCC T ss_conf 445687765276431788998453799999998716----3100122477788677764233203654011-10458300 Q ss_pred HHHHHHHHHHHH Q ss_conf 247778778999 Q gi|254780464|r 307 NIMALTSSIVEV 318 (424) Q Consensus 307 ~~~~~~~~i~~~ 318 (424) ...+++.-+... T Consensus 724 ~~~~LL~H~~~F 735 (737) T TIGR02110 724 SAAELLQHIKTF 735 (737) T ss_pred HHHHHHHHHHHH T ss_conf 089999999986 No 25 >pfam05193 Peptidase_M16_C Peptidase M16 inactive domain. Peptidase M16 consists of two structurally related domains. One is the active peptidase, whereas the other is inactive. The two domains hold the substrate like a clamp. Probab=91.47 E-value=0.55 Score=25.63 Aligned_cols=101 Identities=15% Similarity=0.151 Sum_probs=40.1 Q ss_pred EECCCCCEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHCCCCCCCHHHHHHHH-HHHCCEEEEEEC----C--CEEEEEE Q ss_conf 98589987999999833114488889834899998973356888988999999-970876799753----2--2379996 Q gi|254780464|r 16 TEVMPIDSAFVKVNIRAGSRNERQEEHGMAHFLEHMLFKGTTKRTAKEIVEEI-EKVGGDINAYTS----L--EHTSYHA 88 (424) Q Consensus 16 ~~~~~~~~~~i~~~~~~Gs~~e~~~~~G~ah~lehl~~~gt~~~~~~~~~~~l-~~~G~~~na~t~----~--d~~~~~~ 88 (424) ....+.+...+.+.++..+..+.+. .-...++.+++..+.. ..+.+.+ +..|...+++.+ . ..+.+++ T Consensus 70 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~l~~lL~~~~~----s~L~~~lr~~~gl~y~~~~~~~~~~~~g~~~i~~ 144 (182) T pfam05193 70 VPDKDVPQAKLALAFPGPPLGNDPD-SAALDVLAELLGGGAS----SRLFQELREKEGLAYSVSAFLDSYSDTGLFGIYA 144 (182) T ss_pred EECCCCCCEEEEEEEECCCCCCCHH-HHHHHHHHHHHCCCCC----CHHHHHHHHHCCCEEEEEEEEECCCCCEEEEEEE T ss_conf 9279986269999998488665588-9999998887258744----3789999875488048998655588971799999 Q ss_pred ECCCCCCCCCHHHHHHHHHCC---CCCCCHHHHHHH Q ss_conf 126656521001222322103---566200112455 Q gi|254780464|r 89 WVLKEHVPLALEIIGDMLSNS---SFNPSDIERERN 121 (424) Q Consensus 89 ~~~~~~~~~~l~ll~~~~~~p---~~~~~~~~~ek~ 121 (424) .+.+++++++++.+.+.+..- -|++++|++.|. T Consensus 145 ~~~~~~~~~~~~~i~~~l~~l~~~g~s~~el~~ak~ 180 (182) T pfam05193 145 DLDPENLDEVIELILEELKKLAEEGITEEELERAKA 180 (182) T ss_pred EECCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHH T ss_conf 978788999999999999999871979999999996 No 26 >TIGR01334 modD modD protein; InterPro: IPR006242 The gene modD for a member of this family is found with molybdenum transport genes modABC in Rhodobacter capsulatus. However, disruption of modD causes only a 4-fold (rather than 500-fold for modA, modB, modC) change in the external molybdenum concentration required to suppress an alternative nitrogenase. ModD proteins are highly similar to nicotinate-nucleotide pyrophosphorylase (also called quinolinate phosphoribosyltransferase). Their function is unknown. ; GO: 0016763 transferase activity transferring pentosyl groups. Probab=72.03 E-value=1.7 Score=22.26 Aligned_cols=27 Identities=19% Similarity=0.320 Sum_probs=17.6 Q ss_pred HHCCCHHHHHHHHHH--HCCCCCEEEEEE Q ss_conf 872798999999998--627787799981 Q gi|254780464|r 376 ISAITCEDIVGVAKK--IFSSTPTLAILG 402 (424) Q Consensus 376 i~~vt~~di~~~~~~--~l~~~~~i~~vg 402 (424) +++-|+++++++.+. .++++.++++-| T Consensus 214 LdKFtpqQ~~~~~~~l~~~~~~~tLa~aG 242 (277) T TIGR01334 214 LDKFTPQQVKELLELLKLLDKIITLAVAG 242 (277) T ss_pred CCCCCHHHHHHHHHHHHHHCCCCCEEEEC T ss_conf 23688889999999999737676534007 No 27 >TIGR02545 ATP_syn_fliI flagellar protein export ATPase FliI; InterPro: IPR013379 FliI proteins are involved in bacterial flagellum systems, acting to drive protein export for flagellar biosynthesis. The most closely related proteins are the YscN proteins of bacterial type III secretion systems.; GO: 0016887 ATPase activity, 0001539 ciliary or flagellar motility, 0009296 flagellum biogenesis, 0009288 flagellin-based flagellum. Probab=66.21 E-value=2.1 Score=21.67 Aligned_cols=37 Identities=14% Similarity=0.204 Sum_probs=16.6 Q ss_pred HHHHHHHCCCCCCCCCHHHHCCCCCHHHHHHHHHHCCC Q ss_conf 67764201124566300000254551221122110123 Q gi|254780464|r 169 EKIISFVSRNYTADRMYVVCVGAVDHEFCVSQVESYFN 206 (424) Q Consensus 169 ~~l~~f~~~~y~p~n~~l~i~Gd~~~~~~~~~i~~~f~ 206 (424) ..|.-...|+-..+-.||..+|-= --+|.+.||+.+| T Consensus 172 STLLGMiAr~t~ADV~VIALIGER-GREV~EFiE~~LG 208 (439) T TIGR02545 172 STLLGMIARYTEADVNVIALIGER-GREVKEFIEDDLG 208 (439) T ss_pred HHHHHHHHCCCCCCEEEEEEECCC-CCCCHHHHHHHCC T ss_conf 889888750665887899844465-6643135543035 No 28 >TIGR01026 fliI_yscN ATPase FliI/YscN family; InterPro: IPR005714 Proteins in this entry show extensive homology to the ATP synthase F1 beta subunit, and are involved in type III protein secretion. They fall into the two separate functional groups outlined below. The first group, exemplified by the Salmonella typhimurium FliI protein (P26465 from SWISSPROT), is needed for flagellar assembly. Most structural components of the bacterial flagellum are translocated through the central channel of the growing flagellar structure by the type III flagellar protein-export apparatus in an ATPase-driven manner, to be assembled at the growing end. FliI is the ATPase that couples ATP hydrolysis to the translocation reaction , . The second group couples ATP hydrolysis to protein translocation in non-flagellar type III secretion systems. Often these systems are involved in virulence and pathogenicity. YscN (P40290 from SWISSPROT) from pathogenic Yersinia species, for example, energises the injection of antihost factors directly into eukaryotic cells, thus overcoming host defences .; GO: 0016887 ATPase activity, 0009058 biosynthetic process, 0015031 protein transport, 0005737 cytoplasm. Probab=50.16 E-value=12 Score=16.64 Aligned_cols=81 Identities=12% Similarity=0.146 Sum_probs=36.2 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCCCCCCCC--CC---HHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCCC Q ss_conf 35689987776248765322220002686--21---01333222101220134442583024777877899998531489 Q gi|254780464|r 252 FYLTNILASILGDGMSSRLFQEVREKRGL--CY---SISAHHENFSDNGVLYIASATAKENIMALTSSIVEVVQSLLENI 326 (424) Q Consensus 252 ~~al~ll~~iLg~g~sSrL~~~lRe~~gl--aY---~v~s~~~~~~~~g~~~i~~~~~~~~~~~~~~~i~~~i~~l~~~i 326 (424) |+|.++++.+ ||++.+|.-+... +- .+-|.|..+.+-=-++.|...+...++++++..-+...-+.+++ T Consensus 364 YPAIdvl~S~------SR~~~~i~s~~~~~~~~~fR~lLs~y~~~EdLiriG~Y~~GSd~e~D~AI~~~~~le~FL~Qg~ 437 (455) T TIGR01026 364 YPAIDVLASI------SRLMTAIVSEEHKKAARKFRELLSKYKENEDLIRIGAYKKGSDKELDKAIAKYPKLERFLKQGI 437 (455) T ss_pred CCCCCHHHHH------HHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHCCCC T ss_conf 7863755568------8988740748899999999999998876132312202558788668999857999888754794 Q ss_pred CH-HHHHHHHHHH Q ss_conf 98-9999999999 Q gi|254780464|r 327 EQ-REIDKECAKI 338 (424) Q Consensus 327 ~~-~el~~ak~~~ 338 (424) .| ..++...+.+ T Consensus 438 ~E~~~fE~s~~~L 450 (455) T TIGR01026 438 NEKVNFEESLQQL 450 (455) T ss_pred CCCCHHHHHHHHH T ss_conf 3463089999999 No 29 >TIGR03395 sphingomy sphingomyelin phosphodiesterase. Members of this family are bacterial proteins that act as sphingomyelin phosphodiesterase (EC 3.1.4.12), also called sphingomyelinase. Some members of this family have been shown to act as hemolysins. Probab=48.60 E-value=14 Score=16.26 Aligned_cols=39 Identities=15% Similarity=0.197 Sum_probs=32.0 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCHHHHCCCCCHHHHH Q ss_conf 344200016677642011245663000002545512211 Q gi|254780464|r 160 PETISSFTPEKIISFVSRNYTADRMYVVCVGAVDHEFCV 198 (424) Q Consensus 160 ~~~i~~lt~~~l~~f~~~~y~p~n~~l~i~Gd~~~~~~~ 198 (424) ...++.....++++|.+....|.+-.++++||++.+.-. T Consensus 154 ~~~~R~~Q~~ei~~fi~~~~ip~~e~vi~aGDfNV~k~s 192 (283) T TIGR03395 154 PASIRANQLNEIQDFIDSKNIPKDETVLIGGDLNVNKGS 192 (283) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCC T ss_conf 589999999999999985599989718994014426898 No 30 >TIGR01574 miaB-methiolase tRNA-i(6)A37 thiotransferase enzyme MiaB; InterPro: IPR006463 These sequences are homologs of the MiaB enzyme responsible for the modification of the isopentenylated adenine-37 base of most bacterial and eukaryotic tRNAs that read codons beginning with uracil (all except tRNA(I,V) Ser). Adenine-37 is next to the anticodon on the 3 side in these tRNAs, and lack of modification at this site leads to an increased spontaneous mutation frequency. Isopentenylated A-37 is modified by methylthiolation at position 2, either by MiaB alone or in concert with a separate methylase yet to be discovered (MiaC?) , , .. Probab=47.06 E-value=17 Score=15.60 Aligned_cols=18 Identities=22% Similarity=0.331 Sum_probs=9.5 Q ss_pred CCHHHHHHHHHHHHHHHC Q ss_conf 200112455676666511 Q gi|254780464|r 113 PSDIERERNVVLEEIGMS 130 (424) Q Consensus 113 ~~~~~~ek~~v~~e~~~~ 130 (424) -+++.+--+.|++|++.. T Consensus 179 G~E~Sr~~~~Il~Ev~~l 196 (456) T TIGR01574 179 GEEISRPLDDILQEVQKL 196 (456) T ss_pred CCCCCCCHHHHHHHHHHH T ss_conf 820125744699999999 No 31 >KOG1110 consensus Probab=46.22 E-value=14 Score=16.22 Aligned_cols=75 Identities=13% Similarity=0.168 Sum_probs=43.4 Q ss_pred HHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHH Q ss_conf 877762487653222200026862101333222101220134442583024777877899998531489989999999 Q gi|254780464|r 258 LASILGDGMSSRLFQEVREKRGLCYSISAHHENFSDNGVLYIASATAKENIMALTSSIVEVVQSLLENIEQREIDKEC 335 (424) Q Consensus 258 l~~iLg~g~sSrL~~~lRe~~glaY~v~s~~~~~~~~g~~~i~~~~~~~~~~~~~~~i~~~i~~l~~~i~~~el~~ak 335 (424) |..+=|...+.+++-.|+ |.+|+|...-..|...|.+...++++..+.-.....-.+....-..+++++|++... T Consensus 63 L~~ydGs~~d~~Il~AI~---G~VYDVT~Gr~FYGp~GpY~~fAG~DASR~La~~s~d~~d~~ddlsdL~a~e~eal~ 137 (183) T KOG1110 63 LRQYDGSDPDKPILLAIN---GKVYDVTRGREFYGPGGPYSLFAGKDASRGLAKMSFDLSDETDDLSDLTAEELEALN 137 (183) T ss_pred HHHCCCCCCCCCEEEEEC---CEEEEECCCCCCCCCCCCCHHHCCCCHHHHHHHCCCCHHHCCCCCCCCCHHHHHHHH T ss_conf 986479898872489863---668885477620379998400025506778875466511125653324899999999 No 32 >TIGR01481 ccpA catabolite control protein A; InterPro: IPR006377 Catabolite control protein A is a LacI family global transcriptional regulator found in Gram-positive bacteria. CcpA is involved in repressing carbohydrate utilization genes [ex: alpha-amylase (amyE), acetyl-coenzyme A synthase (acsA)] and in activating genes involved in transporting excess carbon from the cell [e.g. acetate kinase (ackA), alpha-acetolactate synthase (alsS)]. Additionally, disruption of CcpA in Bacillus megaterium, Staphylococcus xylosus, Lactobacillus casei and Lactocacillus pentosus also decreases growth rate, which suggests CcpA is involved in the regulation of other metabolic pathways.; GO: 0030528 transcription regulator activity, 0045449 regulation of transcription. Probab=44.09 E-value=13 Score=16.40 Aligned_cols=63 Identities=25% Similarity=0.289 Sum_probs=35.8 Q ss_pred CHHHHHHHHHHHCCEEEE----EECCCEEEEEEECCC------CCCCCCHHHHHHHH-HCCCCCCCHHHHHHHH Q ss_conf 988999999970876799----753223799961266------56521001222322-1035662001124556 Q gi|254780464|r 60 TAKEIVEEIEKVGGDINA----YTSLEHTSYHAWVLK------EHVPLALEIIGDML-SNSSFNPSDIERERNV 122 (424) Q Consensus 60 ~~~~~~~~l~~~G~~~na----~t~~d~~~~~~~~~~------~~~~~~l~ll~~~~-~~p~~~~~~~~~ek~~ 122 (424) ++....+-|+.+|=.=|| =+|.-.|+.-+=.|. ..|.++++=+|.|. -|--+..++-+.||+. T Consensus 33 TrKKVL~VI~RL~YRPNAVARGLASKrTTTVGViIPDIsN~yya~LarGIeDIA~MYKYNIILsNSD~~~ekE~ 106 (332) T TIGR01481 33 TRKKVLEVIKRLDYRPNAVARGLASKRTTTVGVIIPDISNIYYAELARGIEDIATMYKYNIILSNSDEDEEKEV 106 (332) T ss_pred HHHHHHHHHHHCCCCCCHHHHCCCCCCEEEEEEEECCCCHHHHHHCCCCCHHHHHHHCCCEEEECCCCCCCCHH T ss_conf 63359999965279854453226677402346881586157685016760257511012335652689875216 No 33 >PRK06220 consensus Probab=41.47 E-value=22 Score=14.84 Aligned_cols=10 Identities=10% Similarity=0.282 Sum_probs=4.1 Q ss_pred HHHHHHHHHH Q ss_conf 8999998843 Q gi|254780464|r 411 TSELIHALEG 420 (424) Q Consensus 411 ~~~l~~~l~~ 420 (424) .+.|.++++- T Consensus 364 ~e~l~ea~~R 373 (384) T PRK06220 364 LENIAEGLDR 373 (384) T ss_pred HHHHHHHHHH T ss_conf 9999999999 No 34 >PRK07682 hypothetical protein; Validated Probab=40.92 E-value=21 Score=14.94 Aligned_cols=11 Identities=18% Similarity=0.308 Sum_probs=4.9 Q ss_pred HHHHHHHHHHH Q ss_conf 89999988434 Q gi|254780464|r 411 TSELIHALEGF 421 (424) Q Consensus 411 ~~~l~~~l~~~ 421 (424) .+.|.++++-. T Consensus 360 ~e~l~eal~Rl 370 (378) T PRK07682 360 LEQLQEAMKRM 370 (378) T ss_pred HHHHHHHHHHH T ss_conf 99999999999 No 35 >PRK08361 aspartate aminotransferase; Provisional Probab=40.84 E-value=22 Score=14.92 Aligned_cols=11 Identities=36% Similarity=0.582 Sum_probs=4.9 Q ss_pred CHHHHHHHHHH Q ss_conf 98999998843 Q gi|254780464|r 410 TTSELIHALEG 420 (424) Q Consensus 410 ~~~~l~~~l~~ 420 (424) ..++|.++++- T Consensus 372 ~~e~l~eal~R 382 (390) T PRK08361 372 SKEKLIEAMER 382 (390) T ss_pred CHHHHHHHHHH T ss_conf 99999999999 No 36 >PRK06056 consensus Probab=40.61 E-value=21 Score=14.93 Aligned_cols=15 Identities=13% Similarity=0.091 Sum_probs=6.6 Q ss_pred CHHHHHHHHHHHCCC Q ss_conf 989999999986277 Q gi|254780464|r 380 TCEDIVGVAKKIFSS 394 (424) Q Consensus 380 t~~di~~~~~~~l~~ 394 (424) +..+-.+++++.+.. T Consensus 345 ~~~~~~~~~~~ll~e 359 (402) T PRK06056 345 RPQTSAELAALILEE 359 (402) T ss_pred CCCCHHHHHHHHHHH T ss_conf 989999999999970 No 37 >pfam08593 MUG2_C Meiotically up-regulated C-terminal. This is the C-terminal part of some meiotically up-regulated gene products from fungi. The actual function is not yet known. The proteins are likely to be cell-surface glycoproteins and may bes situated close to telomeres, hence their being expressed during meiosis. Probab=39.25 E-value=17 Score=15.63 Aligned_cols=31 Identities=13% Similarity=0.262 Sum_probs=25.1 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHHCCC---CCCC Q ss_conf 222222103442000166776420112---4566 Q gi|254780464|r 152 IGRPILGKPETISSFTPEKIISFVSRN---YTAD 182 (424) Q Consensus 152 ~~~~~~g~~~~i~~lt~~~l~~f~~~~---y~p~ 182 (424) ++.|++|+.++|+.++.+++..|-+-| |.|+ T Consensus 33 ~~LPpi~si~Did~Ls~~qc~~yL~Gy~v~~~~n 66 (94) T pfam08593 33 PLLPPIGSIQDIDILSKEQCSRYLPGYGVEFHPN 66 (94) T ss_pred CCCCCCCHHHHHHHCCHHHHHHHHCCCCCCCCCH T ss_conf 7899620145244338999998841368644833 No 38 >COG5023 Tubulin [Cytoskeleton] Probab=39.21 E-value=23 Score=14.79 Aligned_cols=10 Identities=20% Similarity=0.285 Sum_probs=3.9 Q ss_pred HHCCCCCCCC Q ss_conf 1012358777 Q gi|254780464|r 202 ESYFNVCSVA 211 (424) Q Consensus 202 ~~~f~~~~~~ 211 (424) ...|+..|++ T Consensus 164 ~~tfSV~P~p 173 (443) T COG5023 164 KLTFSVFPAP 173 (443) T ss_pred EEEEEECCCC T ss_conf 2578742687 No 39 >cd01568 QPRTase_NadC Quinolinate phosphoribosyl transferase (QAPRTase or QPRTase), also called nicotinate-nucleotide pyrophosphorylase, is involved in the de novo synthesis of NAD in both prokaryotes and eukaryotes. It catalyses the reaction of quinolinic acid (QA) with 5-phosphoribosyl-1-pyrophosphate (PRPP) in the presence of Mg2+ to produce nicotinic acid mononucleotide (NAMN), pyrophosphate and carbon dioxide. QPRTase functions as a homodimer with two active sites, each formed by the C-terminal region of one subunit and the N-terminal region of the other. Probab=38.05 E-value=16 Score=15.88 Aligned_cols=11 Identities=45% Similarity=0.480 Sum_probs=4.9 Q ss_pred CCHHHHHHHHH Q ss_conf 79899999999 Q gi|254780464|r 379 ITCEDIVGVAK 389 (424) Q Consensus 379 vt~~di~~~~~ 389 (424) |+.+.|.++++ T Consensus 236 I~~~ni~~ya~ 246 (269) T cd01568 236 ITLENIRAYAE 246 (269) T ss_pred CCHHHHHHHHH T ss_conf 99999999997 No 40 >PRK07309 aromatic amino acid aminotransferase; Validated Probab=37.56 E-value=25 Score=14.45 Aligned_cols=11 Identities=18% Similarity=0.474 Sum_probs=4.3 Q ss_pred CCCCCHHHHCC Q ss_conf 56630000025 Q gi|254780464|r 180 TADRMYVVCVG 190 (424) Q Consensus 180 ~p~n~~l~i~G 190 (424) .|++-+++..| T Consensus 89 ~p~~~I~it~G 99 (390) T PRK07309 89 APENEILVTIG 99 (390) T ss_pred CCCCEEEECCC T ss_conf 98887998889 No 41 >cd01569 PBEF_like pre-B-cell colony-enhancing factor (PBEF)-like. The mammalian members of this group of nicotinate phosphoribosyltransferases (NAPRTases) were originally identified as genes whose expression is upregulated upon activation in lymphoid cells. In general, nicotinate phosphoribosyltransferase catalyses the formation of NAMN and PPi from 5-phosphoribosy -1-pyrophosphate (PRPP) and nicotinic acid, this is the first, and also rate limiting, reaction in the NAD salvage synthesis. Probab=37.02 E-value=5.3 Score=18.99 Aligned_cols=60 Identities=12% Similarity=0.031 Sum_probs=29.3 Q ss_pred CCHHHHHHHHHHHCCEE--EEEECCCEE-------EEEEECCCCCCCCCHHHHHHHHHCCCCCCCHHHH Q ss_conf 89889999999708767--997532237-------9996126656521001222322103566200112 Q gi|254780464|r 59 RTAKEIVEEIEKVGGDI--NAYTSLEHT-------SYHAWVLKEHVPLALEIIGDMLSNSSFNPSDIER 118 (424) Q Consensus 59 ~~~~~~~~~l~~~G~~~--na~t~~d~~-------~~~~~~~~~~~~~~l~ll~~~~~~p~~~~~~~~~ 118 (424) +........++.++|.+ .-.+-.|.+ .+++..+..++..+...|..++.+..|.+.-+.. T Consensus 78 fn~~gw~~I~~~h~G~LPi~I~AvpEGt~Vp~~~pl~tvenT~p~~~WL~nylET~Ll~~iWyp~TvAT 146 (407) T cd01569 78 FNEDGWRHILELHDGYLPIEIRAVPEGTVVPTKNVLMTVENTDPDFFWLTNYLETILLRNVWYPSTVAT 146 (407) T ss_pred CCHHHHHHHHHHCCCCCCEEEEECCCCCEECCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 338889999997499763599985898663489867999818987425888999999987566588999 No 42 >TIGR02414 pepN_proteo aminopeptidase N; InterPro: IPR012779 Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site . The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. The M1 family of zinc metallopeptidases contains a number of distinct, well-separated clades of proteins with aminopeptidase activity. Several are designated aminopeptidase N, 3.4.11.2 from EC, after the Escherichia coli enzyme, suggesting a similar activity profile (see P04825 from SWISSPROT for a description of catalytic activity). This family of zinc metallopeptidases belong to MEROPS peptidase family M1 (aminopeptidase N, clan MA); the majority are identified as alanyl aminopeptidases (proteobacteria) that are closely related to Escherichia coli PepN and presumed to have a similar (not identical) function. Nearly all are found in proteobacteria, but members are found also in cyanobacteria, plants, and apicomplexan parasites , . This family differs greatly in sequence from the family of aminopeptidases typified by Streptomyces lividans PepN (IPR012778 from INTERPRO) and from the membrane bound aminopeptidase N family in animals.; GO: 0004179 membrane alanyl aminopeptidase activity, 0008270 zinc ion binding. Probab=35.19 E-value=27 Score=14.22 Aligned_cols=77 Identities=14% Similarity=0.169 Sum_probs=38.9 Q ss_pred HHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCC-----CCCCHHHH---------HHHHHCCCHHH Q ss_conf 99853148998999999999999998513898999999999999689-----84798999---------99887279899 Q gi|254780464|r 318 VVQSLLENIEQREIDKECAKIHAKLIKSQERSYLRALEISKQVMFCG-----SILCSEKI---------IDTISAITCED 383 (424) Q Consensus 318 ~i~~l~~~i~~~el~~ak~~~~~~~~~~~~~~~~~a~~l~~~~~~~~-----~~~~~~~~---------~~~i~~vt~~d 383 (424) .|.++-+.++.+-|-.||..++.++...+.+....+..-.+...... .+...... ...+......+ T Consensus 644 ~La~~~~~~Dp~al~~AR~~~~~~iA~~L~d~l~~~y~~~~~~~~~~~Y~nDeH~~p~~~G~R~LrN~~L~yL~~~~~~~ 723 (898) T TIGR02414 644 YLAELMENIDPDALHAAREFLRAAIARQLADDLLALYDALQEEGPYSTYVNDEHVDPAAAGRRALRNAALSYLAAADDAE 723 (898) T ss_pred HHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCHH T ss_conf 99999860496799999999999999876589999999850457551311357768889988999999999987369888 Q ss_pred HHHHHHHHCCC Q ss_conf 99999986277 Q gi|254780464|r 384 IVGVAKKIFSS 394 (424) Q Consensus 384 i~~~~~~~l~~ 394 (424) +.+.+.+.|.. T Consensus 724 ~~~~~~~q~~~ 734 (898) T TIGR02414 724 IRNLALEQFKS 734 (898) T ss_pred HHHHHHHHHCC T ss_conf 99999997426 No 43 >PRK08362 consensus Probab=35.19 E-value=28 Score=14.20 Aligned_cols=10 Identities=20% Similarity=0.288 Sum_probs=3.4 Q ss_pred HHHHHHHCCC Q ss_conf 6776420112 Q gi|254780464|r 169 EKIISFVSRN 178 (424) Q Consensus 169 ~~l~~f~~~~ 178 (424) +.+.+|+++. T Consensus 70 ~aia~~~~~~ 79 (389) T PRK08362 70 EAIAEKLKKQ 79 (389) T ss_pred HHHHHHHHHH T ss_conf 9999999884 No 44 >PRK11174 cysteine/glutathione ABC transporter membrane/ATP-binding component; Reviewed Probab=33.94 E-value=21 Score=15.02 Aligned_cols=85 Identities=6% Similarity=0.105 Sum_probs=49.1 Q ss_pred CCCHHHHHHHHHHHH-----HHHHHHCCCH---------HHH--HHHHHHHHHHCCCCCCHHHHHHHHHCCCHHHHHHHH Q ss_conf 899899999999999-----9998513898---------999--999999999689847989999988727989999999 Q gi|254780464|r 325 NIEQREIDKECAKIH-----AKLIKSQERS---------YLR--ALEISKQVMFCGSILCSEKIIDTISAITCEDIVGVA 388 (424) Q Consensus 325 ~i~~~el~~ak~~~~-----~~~~~~~~~~---------~~~--a~~l~~~~~~~~~~~~~~~~~~~i~~vt~~di~~~~ 388 (424) +.+++|+.+|.+... .++..+.++. .+. --.++|..+..-+..-++|--..+|.-|...|++.. T Consensus 449 ~atdeei~~A~~~a~~~~~I~~Lp~GldT~vge~G~~LSGGQrQRiaiARAll~~~~ILILDEaTSaLD~~tE~~i~~~L 528 (588) T PRK11174 449 DASDEQLQQALENAWVSEFVPLLPQGLDTPIGDQAAGLSVGQAQRLALARALLQPCQLLLLDEPTASLDAHSEQLVMQAL 528 (588) T ss_pred CCCHHHHHHHHHHHHHHHHHHHCCCCCCCHHCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHHHHHH T ss_conf 43345799999986247899845132236322888877999999999999983798989998987798999999999999 Q ss_pred HHHCCCCCEEEEEECCCCCCCC Q ss_conf 9862778779998166010899 Q gi|254780464|r 389 KKIFSSTPTLAILGPPMDHVPT 410 (424) Q Consensus 389 ~~~l~~~~~i~~vgp~~~~~P~ 410 (424) +++.+ +.|++++--....+.. T Consensus 529 ~~~~~-~rTviiIaHRlsti~~ 549 (588) T PRK11174 529 NAASQ-GQTTLMVTHQLEDLAQ 549 (588) T ss_pred HHHCC-CCEEEEEECCHHHHHH T ss_conf 98679-9989998067999984 No 45 >PRK06207 aspartate aminotransferase; Provisional Probab=33.05 E-value=30 Score=13.98 Aligned_cols=23 Identities=13% Similarity=0.274 Sum_probs=9.1 Q ss_pred HHHHHHHHCCCCCCCCCHHHHCC Q ss_conf 66776420112456630000025 Q gi|254780464|r 168 PEKIISFVSRNYTADRMYVVCVG 190 (424) Q Consensus 168 ~~~l~~f~~~~y~p~n~~l~i~G 190 (424) .+.++++|..-+.|.+-+++..| T Consensus 88 a~~~~~~~g~~~d~~d~Iivt~G 110 (406) T PRK06207 88 AARLAEFTGAPVDARDGLIITPG 110 (406) T ss_pred HHHHHHHHCCCCCCCCEEEECCC T ss_conf 99999986899998786998587 No 46 >PRK07683 aminotransferase A; Validated Probab=33.02 E-value=30 Score=13.97 Aligned_cols=10 Identities=20% Similarity=0.268 Sum_probs=4.2 Q ss_pred HHHHHHHHHH Q ss_conf 8999998843 Q gi|254780464|r 411 TSELIHALEG 420 (424) Q Consensus 411 ~~~l~~~l~~ 420 (424) .++|.++++- T Consensus 367 ~e~l~eal~R 376 (387) T PRK07683 367 IETLKEGLDR 376 (387) T ss_pred HHHHHHHHHH T ss_conf 9999999999 No 47 >PRK06195 DNA polymerase III subunit epsilon; Validated Probab=32.91 E-value=30 Score=14.00 Aligned_cols=30 Identities=10% Similarity=0.097 Sum_probs=12.1 Q ss_pred CHHHHHHHHHHCCCCCCCHHHHHHHHHHHCCE Q ss_conf 34899998973356888988999999970876 Q gi|254780464|r 43 GMAHFLEHMLFKGTTKRTAKEIVEEIEKVGGD 74 (424) Q Consensus 43 G~ah~lehl~~~gt~~~~~~~~~~~l~~~G~~ 74 (424) .+..+...|+ .+.-.+.. -+.+..+.+|+. T Consensus 52 ~ihGIt~~~v-~~aP~f~e-v~~~~~~fi~~~ 81 (309) T PRK06195 52 GIHGIRPHMV-QDELEFDK-IWEKIKDYFNDN 81 (309) T ss_pred EECCCCHHHH-HCCCCHHH-HHHHHHHHHCCC T ss_conf 4037799999-65999999-999999985799 No 48 >PRK07337 aminotransferase; Validated Probab=32.84 E-value=29 Score=14.05 Aligned_cols=11 Identities=27% Similarity=0.232 Sum_probs=5.3 Q ss_pred HHHHHHHHHHH Q ss_conf 89999988434 Q gi|254780464|r 411 TSELIHALEGF 421 (424) Q Consensus 411 ~~~l~~~l~~~ 421 (424) .+.|.++++-. T Consensus 373 ~~~l~eal~RL 383 (388) T PRK07337 373 YSRLEEAVARL 383 (388) T ss_pred HHHHHHHHHHH T ss_conf 99999999999 No 49 >COG0157 NadC Nicotinate-nucleotide pyrophosphorylase [Coenzyme metabolism] Probab=31.99 E-value=22 Score=14.88 Aligned_cols=12 Identities=42% Similarity=0.418 Sum_probs=6.1 Q ss_pred CCCHHHHHHHHH Q ss_conf 279899999999 Q gi|254780464|r 378 AITCEDIVGVAK 389 (424) Q Consensus 378 ~vt~~di~~~~~ 389 (424) .||.+.|.+++. T Consensus 242 gIt~~ni~~yA~ 253 (280) T COG0157 242 GITLENIREYAE 253 (280) T ss_pred CCCHHHHHHHHH T ss_conf 978778999862 No 50 >PRK06507 consensus Probab=31.98 E-value=31 Score=13.86 Aligned_cols=17 Identities=18% Similarity=0.294 Sum_probs=7.4 Q ss_pred HHHHHHHHCCC----CCCCCC Q ss_conf 66776420112----456630 Q gi|254780464|r 168 PEKIISFVSRN----YTADRM 184 (424) Q Consensus 168 ~~~l~~f~~~~----y~p~n~ 184 (424) ++.+.+||++. +.|+++ T Consensus 74 Reaia~~~~~~~g~~~~p~~I 94 (400) T PRK06507 74 KAAIQRKFKRENNLDYDPSEI 94 (400) T ss_pred HHHHHHHHHHHHCCCCCCCEE T ss_conf 999999999984899982438 No 51 >PRK10046 dpiA two-component response regulator DpiA; Provisional Probab=31.56 E-value=31 Score=13.82 Aligned_cols=24 Identities=8% Similarity=0.084 Sum_probs=10.3 Q ss_pred CCCCHHHHHHHHHCCCHHHHHHHHH Q ss_conf 8479899999887279899999999 Q gi|254780464|r 365 SILCSEKIIDTISAITCEDIVGVAK 389 (424) Q Consensus 365 ~~~~~~~~~~~i~~vt~~di~~~~~ 389 (424) ...+.++..+.+. ++.-.++++.. T Consensus 176 ~~~ta~eva~~~g-iSrvTaRRYLe 199 (225) T PRK10046 176 VQHTAETVAQALT-ISRTTARRYLE 199 (225) T ss_pred CCCCHHHHHHHHC-CCHHHHHHHHH T ss_conf 9868999999858-51999999999 No 52 >TIGR01478 STEVOR variant surface antigen, stevor family; InterPro: IPR006374 Malaria is still a major cause of mortality in many areas of the world. Plasmodium falciparum causes the most severe human form of the disease and is responsible for most fatalities. Severe cases of malaria can occur when the parasite invades and then proliferates within red blood cell erythrocytes. The parasite produces many variant antigenic proteins, encoded by multigene families, which are present on the surface of the infected erythrocyte and play important roles in virulence. A crucial survival mechanism for the malaria parasite is its ability to evade the immune response by switching these variant surface antigens. The high virulence of P. falciparum relative to other malarial parasites is in large part due to the fact that in this organism many of these surface antigens mediate the binding of infected erythrocytes to the vascular endothelium (cytoadherence) and non-infected erythrocytes (rosetting). This can lead to the accumulation of infected cells in the vasculature of a variety of organs, blocking the blood flow and reducing the oxygen supply. Clinical symptoms of severe infection can include fever, progressive anaemia, multi-organ dysfunction and coma. For more information see . This entry represents the stevor (short for "subtelomeric variable open reading frame") family of predicted variant surface antigens. This is the third largest variant surface antigen family in P. falciparum, with 33 genes identified in the genome of this parasite . The function of these proteins is not known, but they are expressed during several stages of the parasite life-cycle and may play a variety of roles in parasite survival and infection .. Probab=31.49 E-value=32 Score=13.81 Aligned_cols=51 Identities=16% Similarity=0.062 Sum_probs=31.2 Q ss_pred HHCCCCCCCCCC-CCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHCCCCCHHHHHHHHHHC Q ss_conf 000122222222-22222210344200016677642011245663000002545512211221101 Q gi|254780464|r 140 ARFSEMVWKDQI-IGRPILGKPETISSFTPEKIISFVSRNYTADRMYVVCVGAVDHEFCVSQVESY 204 (424) Q Consensus 140 ~~~~~~~f~~~~-~~~~~~g~~~~i~~lt~~~l~~f~~~~y~p~n~~l~i~Gd~~~~~~~~~i~~~ 204 (424) |.|.+.=-|.+| |-+.+- =+|-|++++-|-|++|.++| | +.|++.++|++- T Consensus 48 R~LAeiq~P~NPHYhNDPE-LKeIIDKLNeEAIKKYq~TH------------d-PYEqL~elVeKn 99 (315) T TIGR01478 48 RLLAEIQRPKNPHYHNDPE-LKEIIDKLNEEAIKKYQKTH------------D-PYEQLKELVEKN 99 (315) T ss_pred HHHHHHCCCCCCCCCCCHH-HHHHHHHHHHHHHHHHHCCC------------C-CHHHHHHHHHHC T ss_conf 6567535763229898734-66686240267786653058------------8-157774455420 No 53 >PRK08960 hypothetical protein; Provisional Probab=30.49 E-value=31 Score=13.84 Aligned_cols=11 Identities=27% Similarity=0.314 Sum_probs=5.1 Q ss_pred CHHHHHHHHHH Q ss_conf 98999998843 Q gi|254780464|r 410 TTSELIHALEG 420 (424) Q Consensus 410 ~~~~l~~~l~~ 420 (424) ..+.|.++++- T Consensus 370 ~~~~l~eal~R 380 (387) T PRK08960 370 SLPRLQEAVER 380 (387) T ss_pred CHHHHHHHHHH T ss_conf 99999999999 No 54 >PRK07778 consensus Probab=30.12 E-value=32 Score=13.76 Aligned_cols=12 Identities=25% Similarity=0.382 Sum_probs=5.9 Q ss_pred CHHHHHHHHHHH Q ss_conf 989999988434 Q gi|254780464|r 410 TTSELIHALEGF 421 (424) Q Consensus 410 ~~~~l~~~l~~~ 421 (424) ..++|.++++-. T Consensus 369 ~~~~l~eal~RL 380 (386) T PRK07778 369 PTADIEEALRRI 380 (386) T ss_pred CHHHHHHHHHHH T ss_conf 899999999999 No 55 >TIGR01754 flav_RNR ribonucleotide reductase-associated flavodoxin, putative; InterPro: IPR010088 This entry represents a family of proteins found immediately downstream of ribonucleotide reductase genes in Xyella fastidiosa and some Gram-positive bacteria. It appears to be a highly divergent flavodoxin of the short chain type, more like the flavodoxins of the sulphate-reducing genus Desulfovibrio than like the NifF flavodoxins associated with nitrogen fixation.. Probab=29.58 E-value=17 Score=15.65 Aligned_cols=57 Identities=14% Similarity=0.262 Sum_probs=38.3 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHHCCC-C---CCCCCHHHHCCCCCH---H---HHHHHHHHCCCCC Q ss_conf 222222103442000166776420112-4---566300000254551---2---2112211012358 Q gi|254780464|r 152 IGRPILGKPETISSFTPEKIISFVSRN-Y---TADRMYVVCVGAVDH---E---FCVSQVESYFNVC 208 (424) Q Consensus 152 ~~~~~~g~~~~i~~lt~~~l~~f~~~~-y---~p~n~~l~i~Gd~~~---~---~~~~~i~~~f~~~ 208 (424) |-.-++|+=-.=..-|+.++|+|.... + .|+|++|..+|+--. | ..+..+.++||+- T Consensus 52 ydL~~LGtwT~~~GrTP~e~KdFiaEl~~liGKP~nvaiFGTGeTQwGG~d~yCgAVdr~a~ffgss 118 (145) T TIGR01754 52 YDLYLLGTWTVERGRTPDEMKDFIAELAVLIGKPKNVAIFGTGETQWGGDDYYCGAVDRLAKFFGSS 118 (145) T ss_pred CCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCEEEECCCCCCCCCCCCCHHHHHHHHHHHCCC T ss_conf 3157743122357899666899999999983699824886588755288654024788878650468 No 56 >TIGR02873 spore_ylxY probable sporulation protein, polysaccharide deacetylase family; InterPro: IPR014228 Members of this entry include YlxY, and are related to IPR014132 from INTERPRO, which represents a subset of the polysaccharide deacetylase family found in a species if, and only if, the species forms endospores e.g. Bacillus subtilis or Clostridium tetani. Proteins in this entry are likewise restricted to spore-formers, but are not universally found among endospore-forming species.. Probab=28.04 E-value=36 Score=13.42 Aligned_cols=26 Identities=12% Similarity=0.266 Sum_probs=15.6 Q ss_pred CCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCC Q ss_conf 22222222222210344200016677642011 Q gi|254780464|r 146 VWKDQIIGRPILGKPETISSFTPEKIISFVSR 177 (424) Q Consensus 146 ~f~~~~~~~~~~g~~~~i~~lt~~~l~~f~~~ 177 (424) .-|||.|.||=+ ..||.+.+.+--++ T Consensus 140 eiGNHaY~HPdM------~~Ls~~~~~dql~~ 165 (269) T TIGR02873 140 EIGNHAYNHPDM------KTLSKERISDQLDK 165 (269) T ss_pred EECCCCCCCCCH------HHCCHHHHHHHHHH T ss_conf 463534588562------23034778999988 No 57 >PRK08385 nicotinate-nucleotide pyrophosphorylase; Provisional Probab=27.99 E-value=21 Score=15.04 Aligned_cols=16 Identities=25% Similarity=0.439 Sum_probs=8.1 Q ss_pred CCCCCHHHHHHHHHHH Q ss_conf 1489989999999999 Q gi|254780464|r 323 LENIEQREIDKECAKI 338 (424) Q Consensus 323 ~~~i~~~el~~ak~~~ 338 (424) .++++.+++.++...+ T Consensus 209 LDn~s~e~~~~~v~~l 224 (279) T PRK08385 209 LDNMTPEEIREVIEAL 224 (279) T ss_pred ECCCCHHHHHHHHHHH T ss_conf 8499999999999998 No 58 >pfam01837 DUF39 Domain of unknown function DUF39. This presumed domain is about is about 320 residues long. It is found in proteins that have two C-terminal pfam00571 domains. The function of this domain is unknown. One member has been misannotated as an inosine monophosphate dehydrogenase based on the similarity to the CBS domains. Probab=27.70 E-value=22 Score=14.91 Aligned_cols=29 Identities=24% Similarity=0.397 Sum_probs=14.1 Q ss_pred EEEEEEECCCCCCCCCCCCCHHHHHHHHH Q ss_conf 99999983311448888983489999897 Q gi|254780464|r 24 AFVKVNIRAGSRNERQEEHGMAHFLEHML 52 (424) Q Consensus 24 ~~i~~~~~~Gs~~e~~~~~G~ah~lehl~ 52 (424) .++-+++.+....|.+...|=+|+++.|+ T Consensus 72 gaVD~~lg~T~~s~~~~~YGGghvi~dLv 100 (300) T pfam01837 72 GAVDLYLGATQHSEDDERYGGGHVIEDLV 100 (300) T ss_pred EEEEEEEECCCCCCCCCCCCCHHHHHHHH T ss_conf 06889975466678875456424589986 No 59 >PRK09225 threonine synthase; Validated Probab=27.56 E-value=37 Score=13.37 Aligned_cols=31 Identities=26% Similarity=0.373 Sum_probs=20.3 Q ss_pred CCCCCC-HHHHCCCCCHHHHHHHHHHCCCCCCCC Q ss_conf 456630-000025455122112211012358777 Q gi|254780464|r 179 YTADRM-YVVCVGAVDHEFCVSQVESYFNVCSVA 211 (424) Q Consensus 179 y~p~n~-~l~i~Gd~~~~~~~~~i~~~f~~~~~~ 211 (424) ...+|. ++.|-|+|| +...++++.|.+..-. T Consensus 180 ~~~~NV~~i~v~G~FD--DcQ~lVK~~f~D~~~~ 211 (465) T PRK09225 180 LQGDNIHVVAVEGDFD--DCQALVKAAFNDEELK 211 (465) T ss_pred CCCCCEEEEEECCCHH--HHHHHHHHHHHCHHHH T ss_conf 1799779998468767--8999999997197663 No 60 >PRK09148 aminotransferase; Validated Probab=26.60 E-value=35 Score=13.54 Aligned_cols=12 Identities=8% Similarity=0.158 Sum_probs=5.3 Q ss_pred CHHHHHHHHHHH Q ss_conf 989999988434 Q gi|254780464|r 410 TTSELIHALEGF 421 (424) Q Consensus 410 ~~~~l~~~l~~~ 421 (424) ..+.|+++++-. T Consensus 374 ~~~~L~eal~Rl 385 (406) T PRK09148 374 NEQRIRQAARNI 385 (406) T ss_pred CHHHHHHHHHHH T ss_conf 899999999999 No 61 >PRK08636 aspartate aminotransferase; Provisional Probab=26.46 E-value=38 Score=13.24 Aligned_cols=23 Identities=26% Similarity=0.414 Sum_probs=9.6 Q ss_pred HHHHHHHHCCCC----CCCCCHHHHCC Q ss_conf 667764201124----56630000025 Q gi|254780464|r 168 PEKIISFVSRNY----TADRMYVVCVG 190 (424) Q Consensus 168 ~~~l~~f~~~~y----~p~n~~l~i~G 190 (424) ++.+.+||++.| .|++-+++..| T Consensus 77 R~aia~~~~~~~g~~~~p~~~I~vt~G 103 (403) T PRK08636 77 RLAICNWYKRKYNVDLDPDTEVVATMG 103 (403) T ss_pred HHHHHHHHHHHCCCCCCCCCEEEEECC T ss_conf 999999999970999999874999179 No 62 >PRK13539 cytochrome c biogenesis protein CcmA; Provisional Probab=26.22 E-value=14 Score=16.16 Aligned_cols=62 Identities=6% Similarity=0.008 Sum_probs=42.7 Q ss_pred HHHHHHHHCCCCCCHHHHHHHHHCCCHHHHHHHHHHHCCCCCEEEEEECCCCCCCCHHHHHH Q ss_conf 99999996898479899999887279899999999862778779998166010899899999 Q gi|254780464|r 355 EISKQVMFCGSILCSEKIIDTISAITCEDIVGVAKKIFSSTPTLAILGPPMDHVPTTSELIH 416 (424) Q Consensus 355 ~l~~~~~~~~~~~~~~~~~~~i~~vt~~di~~~~~~~l~~~~~i~~vgp~~~~~P~~~~l~~ 416 (424) .+++.+...-+..-++|-...++..+.+++.+.++++.....+|+++-=....+|++.+++- T Consensus 136 ~la~al~~~p~vllLDEPtsgLD~~~~~~v~~~i~~~~~~g~tiIi~tH~p~~~~~~~~~~~ 197 (206) T PRK13539 136 ALARLLVSNRPIWLLDEPTAALDSASQALFAELIRAHLAQGGIVIAATHIPLGLPGARELDL 197 (206) T ss_pred HHHHHHHHCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCEECC T ss_conf 99999986989899979977789999999999999999589999999389887888989805 No 63 >PRK08072 nicotinate-nucleotide pyrophosphorylase; Provisional Probab=26.03 E-value=31 Score=13.86 Aligned_cols=34 Identities=12% Similarity=0.141 Sum_probs=17.5 Q ss_pred ECCCCHHHHHHHHHHHHHHHH-CCCCCHHHHHHHH Q ss_conf 258302477787789999853-1489989999999 Q gi|254780464|r 302 ATAKENIMALTSSIVEVVQSL-LENIEQREIDKEC 335 (424) Q Consensus 302 ~~~~~~~~~~~~~i~~~i~~l-~~~i~~~el~~ak 335 (424) .+.-++.+++..++..-...+ .++++.+++.++. T Consensus 192 eVEv~s~~q~~~a~~~g~diImLDNm~p~~ik~~v 226 (277) T PRK08072 192 EVETETEEQVREAVAAGADIIMFDNRTPDEIREFV 226 (277) T ss_pred EEEECCHHHHHHHHHCCCCEEEECCCCHHHHHHHH T ss_conf 99719899999998679989998798999999999 No 64 >PRK09147 aminotransferase; Provisional Probab=24.63 E-value=41 Score=13.02 Aligned_cols=12 Identities=25% Similarity=0.288 Sum_probs=5.6 Q ss_pred CHHHHHHHHHHH Q ss_conf 989999988434 Q gi|254780464|r 410 TTSELIHALEGF 421 (424) Q Consensus 410 ~~~~l~~~l~~~ 421 (424) ..+.|.++++-+ T Consensus 380 ~~e~l~eal~Rl 391 (397) T PRK09147 380 PLAECVEAAERI 391 (397) T ss_pred CHHHHHHHHHHH T ss_conf 899999999999 No 65 >PRK06106 nicotinate-nucleotide pyrophosphorylase; Provisional Probab=24.62 E-value=33 Score=13.70 Aligned_cols=35 Identities=14% Similarity=0.248 Sum_probs=17.2 Q ss_pred CCCCHHHHHHHHHHHHHHHH-CCCCCHHHHHHHHHH Q ss_conf 58302477787789999853-148998999999999 Q gi|254780464|r 303 TAKENIMALTSSIVEVVQSL-LENIEQREIDKECAK 337 (424) Q Consensus 303 ~~~~~~~~~~~~i~~~i~~l-~~~i~~~el~~ak~~ 337 (424) +.-++.+++..++..-...+ .++++.+++.++... T Consensus 197 VEv~sl~q~~~al~~g~d~ImLDNmsp~~~~~av~~ 232 (281) T PRK06106 197 VEVDTLEQLEEALELGVDAVLLDNMSPDTLREAVAI 232 (281) T ss_pred EEECCHHHHHHHHHCCCCEEEECCCCHHHHHHHHHH T ss_conf 996999999999727998999879999999999998 No 66 >pfam00427 PBS_linker_poly Phycobilisome Linker polypeptide. Probab=24.57 E-value=41 Score=13.01 Aligned_cols=86 Identities=19% Similarity=0.246 Sum_probs=44.7 Q ss_pred HHCCC-CCHHHHHHHHHHHHHHHHHH---CCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHCCCHHHHHHHHHHHCCCCC Q ss_conf 53148-99899999999999999851---389899999999999968984798999998872798999999998627787 Q gi|254780464|r 321 SLLEN-IEQREIDKECAKIHAKLIKS---QERSYLRALEISKQVMFCGSILCSEKIIDTISAITCEDIVGVAKKIFSSTP 396 (424) Q Consensus 321 ~l~~~-i~~~el~~ak~~~~~~~~~~---~~~~~~~a~~l~~~~~~~~~~~~~~~~~~~i~~vt~~di~~~~~~~l~~~~ 396 (424) .+++| |+-.|+-++.. ++.+... .-.+..++-++..-.++|.++....+...-+.-+-..=..+++..++++.- T Consensus 39 ~lrng~IsVreFVr~la--~S~~yr~~f~~~~~~~r~iEl~~khlLGRap~~~~E~~~~~~i~a~~G~~a~Id~lldS~E 116 (131) T pfam00427 39 QLRNGEISVREFVRGLA--KSELYRKRFFEPNSNYRVIELNFKHLLGRAPYNQAEISAHSIILAEKGLDAFIDSLLDSDE 116 (131) T ss_pred HHHCCCCCHHHHHHHHH--CCHHHHHHHCCCCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHCCHH T ss_conf 98869982999999998--2699999860266602999999999718999999999999999994292999999968699 Q ss_pred EEEEEECCCCCCCC Q ss_conf 79998166010899 Q gi|254780464|r 397 TLAILGPPMDHVPT 410 (424) Q Consensus 397 ~i~~vgp~~~~~P~ 410 (424) -.-.-| .+.||+ T Consensus 117 Y~~~FG--~d~VPy 128 (131) T pfam00427 117 YLENFG--EDTVPY 128 (131) T ss_pred HHHHCC--CCCCCC T ss_conf 998738--888897 No 67 >PRK09939 putative oxidoreductase; Provisional Probab=23.94 E-value=33 Score=13.66 Aligned_cols=12 Identities=8% Similarity=0.180 Sum_probs=5.1 Q ss_pred CCCCHHHHCCCC Q ss_conf 663000002545 Q gi|254780464|r 181 ADRMYVVCVGAV 192 (424) Q Consensus 181 p~n~~l~i~Gd~ 192 (424) |+.+.++-+|-. T Consensus 144 P~~a~FYtSgR~ 155 (759) T PRK09939 144 PNQVEFYTSGRT 155 (759) T ss_pred CCEEEEEECCCC T ss_conf 774889833776 No 68 >pfam05838 DUF847 Predicted lysozyme (DUF847). This family consists of several hypothetical bacterial sequences as well as one viral sequence, the function of this family is unknown. However, these proteins are predicted to be related to lysozyme enzymes. Probab=23.87 E-value=33 Score=13.64 Aligned_cols=25 Identities=16% Similarity=0.387 Sum_probs=18.8 Q ss_pred HHHHHHHHHHHHHHHHHCCCC-CCCC Q ss_conf 034420001667764201124-5663 Q gi|254780464|r 159 KPETISSFTPEKIISFVSRNY-TADR 183 (424) Q Consensus 159 ~~~~i~~lt~~~l~~f~~~~y-~p~n 183 (424) +..+++++|.++.+.+|++.| .+.. T Consensus 42 ~~~dvr~Lt~~~a~~IY~~~YW~~~~ 67 (87) T pfam05838 42 DAGDIKALTRDQAVAIYRADYWDRPR 67 (87) T ss_pred CHHHHHHCCHHHHHHHHHHHCCCCCC T ss_conf 86889878999999999997418035 No 69 >PRK05742 nicotinate-nucleotide pyrophosphorylase; Provisional Probab=23.56 E-value=36 Score=13.40 Aligned_cols=31 Identities=10% Similarity=0.250 Sum_probs=13.6 Q ss_pred CCHHHHHHHHHHHHHHHH-CCCCCHHHHHHHH Q ss_conf 302477787789999853-1489989999999 Q gi|254780464|r 305 KENIMALTSSIVEVVQSL-LENIEQREIDKEC 335 (424) Q Consensus 305 ~~~~~~~~~~i~~~i~~l-~~~i~~~el~~ak 335 (424) .++.+++..++..-..-+ .++++.+++.++. T Consensus 196 v~tl~q~~~a~~~gaDiI~LDnms~~~lk~av 227 (277) T PRK05742 196 VESLDELRQALAAGADIVMLDELSLDDMREAV 227 (277) T ss_pred ECCHHHHHHHHHCCCCEEEECCCCHHHHHHHH T ss_conf 67799999987469989998699999999999 No 70 >TIGR02416 CO_dehy_Mo_lg carbon-monoxide dehydrogenase, large subunit; InterPro: IPR012780 Carbon monoxide dehydrogenases catalyse the reversible oxidation of carbon monoxide to carbon dioxide as shown below , :CO + H(2)O + A == CO(2) + AH(2) A variety of electron acceptors can be used by these enzymes including ferredoxin, methyl viologen and benzyl viologen. Under anaerobic conditions, carbon monoxide is utilised by a number of methanogenic archaea and acetogenic, sulphate-reducing or phototrophic bacteria. Under aerobic conditions carbon monoxide can be used as sole carbon and energy source by the carboxidotrophic bacteria, a taxonomically diverse group of obligate or facultative chemolithoautotrophic species. The anaerobic dehydrogenases are mostly nickel-containing iron-sulphur enzymes, while the The aerobic enzymes categorised so far are molybdenum-containing iron-sulphur proteins. This entry represents the large subunit of the molybdenum-containing carbon monoxide dehydrogenase found in aerobic bacteria. This enzyme is a dimer of heterotrimers, where each heterotrimer consists of a small iron-sulphur subunit, a medium FAD-containing flavoprotein subunit and a large molybdopterin-containign subunit which contains the active site , . The large subunit is divided into two domains. The N-terminal domain is composed of mainly of a five-and four-stranded mixed beta-sheet and interacts with the small and medium subunits. The C-terminal domain contains a region which interacts with its counterpart in the overall dimer, providing the main dimer contact, and a beta-sheet surrounded by alpha helices. The active site located in this subunit contains a novel dinuclear heterometal [CuSMo(=O)OH] cluster.; GO: 0005507 copper ion binding, 0018492 carbon-monoxide dehydrogenase (acceptor) activity, 0030151 molybdenum ion binding. Probab=23.42 E-value=19 Score=15.32 Aligned_cols=68 Identities=15% Similarity=0.189 Sum_probs=30.5 Q ss_pred HHHHHHHHHHHHHHHCC--CCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHCCCHHHHH Q ss_conf 47778778999985314--8998999999999999998513898999999999999689847989999988727989999 Q gi|254780464|r 308 IMALTSSIVEVVQSLLE--NIEQREIDKECAKIHAKLIKSQERSYLRALEISKQVMFCGSILCSEKIIDTISAITCEDIV 385 (424) Q Consensus 308 ~~~~~~~i~~~i~~l~~--~i~~~el~~ak~~~~~~~~~~~~~~~~~a~~l~~~~~~~~~~~~~~~~~~~i~~vt~~di~ 385 (424) +.+++=.|.+.+.-|+. ..+..+| |.||.++.. +-|+. ..++-. +..-++...+...++.|.-+.++ T Consensus 364 vTEAVY~iER~vd~LA~~L~MD~A~l-R~~N~~~~E-----QFPY~--~~~Gw~---YDSGdY~~a~k~~~d~VdY~~lR 432 (775) T TIGR02416 364 VTEAVYLIERLVDVLAQKLKMDKAEL-RLKNFIKAE-----QFPYT--SKLGWE---YDSGDYEKAMKKAMDAVDYEALR 432 (775) T ss_pred HHHHHHHHHHHHHHHHHHHCCCHHHH-HHHHHHHCC-----CCCCC--CCCCCE---ECCCCHHHHHHHHHHHCCHHHHH T ss_conf 67899999999999998726883688-888632004-----78753--567633---40788799999997521804431 Q ss_pred H Q ss_conf 9 Q gi|254780464|r 386 G 386 (424) Q Consensus 386 ~ 386 (424) + T Consensus 433 A 433 (775) T TIGR02416 433 A 433 (775) T ss_pred H T ss_conf 8 No 71 >TIGR02915 PEP_resp_reg putative PEP-CTERM system response regulator; InterPro: IPR014264 Members of this protein family share full-length homology with (but do not include) the acetoacetate metabolism regulatory protein AtoC (see Q06065 from SWISSPROT). These proteins have a Fis family DNA binding sequence, a response regulator receiver domain, and sigma-54 interaction domain. They are found strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria, where members of IPR014265 from INTERPRO and IPR014266 from INTERPRO also occur.. Probab=23.23 E-value=44 Score=12.84 Aligned_cols=36 Identities=14% Similarity=0.193 Sum_probs=16.6 Q ss_pred CCCCCCCHHHHH-HHHHCCCCCCCHHHHHH--HHHHHHHH Q ss_conf 656521001222-32210356620011245--56766665 Q gi|254780464|r 92 KEHVPLALEIIG-DMLSNSSFNPSDIERER--NVVLEEIG 128 (424) Q Consensus 92 ~~~~~~~l~ll~-~~~~~p~~~~~~~~~ek--~~v~~e~~ 128 (424) .+|.-+++.+=+ |....| |+++.+.... .--+.+++ T Consensus 86 r~NAlkAi~lGAYDFyqKP-~d~d~L~liv~RAf~L~~Le 124 (451) T TIGR02915 86 RENALKAIGLGAYDFYQKP-IDPDVLKLIVDRAFRLYTLE 124 (451) T ss_pred CHHHHHHHHCCCCCCCCCC-CCHHHHHHHHHHHHHHHHHH T ss_conf 3889999643751013578-75789999999988888888 No 72 >cd01572 QPRTase Quinolinate phosphoribosyl transferase (QAPRTase or QPRTase), also called nicotinate-nucleotide pyrophosphorylase, is involved in the de novo synthesis of NAD in both prokaryotes and eukaryotes. It catalyses the reaction of quinolinic acid (QA) with 5-phosphoribosyl-1-pyrophosphate (PRPP) in the presence of Mg2+ to produce nicotinic acid mononucleotide (NAMN), pyrophosphate and carbon dioxide. QPRTase functions as a homodimer with two active sites, each formed by the C-terminal region of one subunit and the N-terminal region of the other. Probab=23.22 E-value=36 Score=13.44 Aligned_cols=35 Identities=17% Similarity=0.278 Sum_probs=16.5 Q ss_pred CCCCHHHHHHHHHHHHHHHH-CCCCCHHHHHHHHHH Q ss_conf 58302477787789999853-148998999999999 Q gi|254780464|r 303 TAKENIMALTSSIVEVVQSL-LENIEQREIDKECAK 337 (424) Q Consensus 303 ~~~~~~~~~~~~i~~~i~~l-~~~i~~~el~~ak~~ 337 (424) +.-++.+++..++..-...+ .++++.+++.++... T Consensus 187 VEv~s~~e~~~a~~~gadiI~LDn~spe~~~~~v~~ 222 (268) T cd01572 187 VEVETLEQLKEALEAGADIIMLDNMSPEELREAVAL 222 (268) T ss_pred EEECCHHHHHHHHHCCCCEEEECCCCHHHHHHHHHH T ss_conf 994889999999975999999779999999999998 No 73 >pfam01729 QRPTase_C Quinolinate phosphoribosyl transferase, C-terminal domain. Quinolinate phosphoribosyl transferase (QPRTase) or nicotinate-nucleotide pyrophosphorylase EC:2.4.2.19 is involved in the de novo synthesis of NAD in both prokaryotes and eukaryotes. It catalyses the reaction of quinolinic acid with 5-phosphoribosyl-1-pyrophosphate (PRPP) in the presence of Mg2+ to give rise to nicotinic acid mononucleotide (NaMN), pyrophosphate and carbon dioxide. The QA substrate is bound between the C-terminal domain of one subunit, and the N-terminal domain of the other. The C-terminal domain has a 7 beta-stranded TIM barrel-like fold. Probab=23.18 E-value=32 Score=13.78 Aligned_cols=40 Identities=18% Similarity=0.326 Sum_probs=22.2 Q ss_pred HHHHHHHHHHHHHCCCC-CCCCCHHHHCCCCCHHHHHHHHH Q ss_conf 20001667764201124-56630000025455122112211 Q gi|254780464|r 163 ISSFTPEKIISFVSRNY-TADRMYVVCVGAVDHEFCVSQVE 202 (424) Q Consensus 163 i~~lt~~~l~~f~~~~y-~p~n~~l~i~Gd~~~~~~~~~i~ 202 (424) +++++++++++..+..= ...++.+-++|+++.+.+.++.+ T Consensus 106 LDn~spe~l~~~v~~l~~~~~~v~iEaSGgI~~~ni~~yA~ 146 (169) T pfam01729 106 LDNFSPEEVREAVEELDERAGRVLLEVSGGITLDNVLEYAK 146 (169) T ss_pred ECCCCHHHHHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHH T ss_conf 77999999999999999758967999618999999999997 No 74 >cd01573 modD_like ModD; Quinolinate phosphoribosyl transferase (QAPRTase or QPRTase) present in some modABC operons in bacteria, which are involved in molybdate transport. In general, QPRTases are part of the de novo synthesis pathway of NAD in both prokaryotes and eukaryotes. They catalyse the reaction of quinolinic acid (QA) with 5-phosphoribosyl-1-pyrophosphate (PRPP) in the presence of Mg2+ to produce nicotinic acid mononucleotide (NAMN), pyrophosphate and carbon dioxide. Probab=22.75 E-value=38 Score=13.30 Aligned_cols=12 Identities=25% Similarity=0.291 Sum_probs=6.3 Q ss_pred CCCHHHHHHHHH Q ss_conf 279899999999 Q gi|254780464|r 378 AITCEDIVGVAK 389 (424) Q Consensus 378 ~vt~~di~~~~~ 389 (424) .||.+.|.++++ T Consensus 238 gI~~~ni~~yA~ 249 (272) T cd01573 238 GINIENAAAYAA 249 (272) T ss_pred CCCHHHHHHHHH T ss_conf 999999999997 No 75 >PRK07896 nicotinate-nucleotide pyrophosphorylase; Provisional Probab=21.33 E-value=44 Score=12.86 Aligned_cols=33 Identities=12% Similarity=0.227 Sum_probs=14.4 Q ss_pred CCCHHHHHHHHHHHHHHHH-CCCCCHHHHHHHHH Q ss_conf 8302477787789999853-14899899999999 Q gi|254780464|r 304 AKENIMALTSSIVEVVQSL-LENIEQREIDKECA 336 (424) Q Consensus 304 ~~~~~~~~~~~i~~~i~~l-~~~i~~~el~~ak~ 336 (424) .-++.+++..++..-..-+ .++++.+++.++.. T Consensus 204 Ev~sl~q~~ea~~~gaDiImLDNms~e~~~~av~ 237 (288) T PRK07896 204 EVDSLEQLDEVLAEGAELILLDNFPVWQTQEAVQ 237 (288) T ss_pred EECCHHHHHHHHHCCCCEEEECCCCHHHHHHHHH T ss_conf 9797999999874699999977999999999999 No 76 >TIGR00365 TIGR00365 glutaredoxin homolog; InterPro: IPR004480 Glutaredoxins , , , also known as thioltransferases (disulphide reductases, are small proteins of approximately one hundred amino-acid residues which utilise glutathione and NADPH as cofactors. Oxidized glutathione is regenerated by glutathione reductase. Together these components compose the glutathione system . Glutaredoxin functions as an electron carrier in the glutathione-dependent synthesis of deoxyribonucleotides by the enzyme ribonucleotide reductase. Like thioredoxin, which functions in a similar way, glutaredoxin possesses an active centre disulphide bond . It exists in either a reduced or an oxidized form where the two cysteine residues are linked in an intramolecular disulphide bond. Glutaredoxin has been sequenced in a variety of species. On the basis of extensive sequence similarity, it has been proposed that Vaccinia virus protein O2L is most probably a glutaredoxin. Finally, it must be noted that Bacteriophage T4 thioredoxin seems also to be evolutionary related. In position 5 of the pattern T4 thioredoxin has Val instead of Pro. This family groups a number of hypothetical proteins from different organisms which are related to glutaredoxin proteins.. Probab=21.31 E-value=48 Score=12.59 Aligned_cols=68 Identities=16% Similarity=0.270 Sum_probs=33.9 Q ss_pred HHH-CCCCCCCHHHHHHH----HHHHCCEEEEEECCCEEEEEEECCCC-CCCCCHHHHHHHHHCCCCCCCHHHHH----H Q ss_conf 973-35688898899999----99708767997532237999612665-65210012223221035662001124----5 Q gi|254780464|r 51 MLF-KGTTKRTAKEIVEE----IEKVGGDINAYTSLEHTSYHAWVLKE-HVPLALEIIGDMLSNSSFNPSDIERE----R 120 (424) Q Consensus 51 l~~-~gt~~~~~~~~~~~----l~~~G~~~na~t~~d~~~~~~~~~~~-~~~~~l~ll~~~~~~p~~~~~~~~~e----k 120 (424) |+| |||.++|...|.+. |..+|..- ..+. +.+++.+ ++-..|+-+++| |+|++=-++.| - T Consensus 15 ~lfMKGsP~~P~CGFS~~~v~~L~~~~~~P------akF~-~~DiL~d~~iR~~lk~ysnW---PT~PQLyVnGEf~GG~ 84 (103) T TIGR00365 15 LLFMKGSPKLPKCGFSRQAVGILRECGIRP------AKFA-YVDILQDPEIRQGLKEYSNW---PTFPQLYVNGEFVGGC 84 (103) T ss_pred EEEECCCCCCCCCCCHHHHHHHHHHCCCCC------CCCC-CCCCCCCCHHHHHCHHCCCC---CCCCCEEECCEEECHH T ss_conf 684368899889988178999998548982------1132-00024781332201000357---9998356655355615 Q ss_pred HHHHHHHH Q ss_conf 56766665 Q gi|254780464|r 121 NVVLEEIG 128 (424) Q Consensus 121 ~~v~~e~~ 128 (424) ++|++..+ T Consensus 85 Dii~e~~~ 92 (103) T TIGR00365 85 DIIIEMYQ 92 (103) T ss_pred HHHHHHHH T ss_conf 89999976 No 77 >KOG0431 consensus Probab=20.98 E-value=49 Score=12.55 Aligned_cols=18 Identities=11% Similarity=0.159 Sum_probs=13.6 Q ss_pred HHHHHCCCHHHHHHHHHH Q ss_conf 998872798999999998 Q gi|254780464|r 373 IDTISAITCEDIVGVAKK 390 (424) Q Consensus 373 ~~~i~~vt~~di~~~~~~ 390 (424) +..-+-||++.|+.+.+| T Consensus 394 VsltDLVtp~~VKKaYrK 411 (453) T KOG0431 394 VSLTDLVTPAQVKKAYRK 411 (453) T ss_pred CCHHHCCCHHHHHHHHHH T ss_conf 756641278999999886 No 78 >COG1448 TyrB Aspartate/tyrosine/aromatic aminotransferase [Amino acid transport and metabolism] Probab=20.84 E-value=44 Score=12.86 Aligned_cols=161 Identities=15% Similarity=0.208 Sum_probs=68.2 Q ss_pred EECCCCCC-CCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHH-HCCCCEEEEEECCCCHHHHHHHHHH Q ss_conf 30100001-352113568998777624876532222000268621013332221-0122013444258302477787789 Q gi|254780464|r 239 MLGFNGCA-YQSRDFYLTNILASILGDGMSSRLFQEVREKRGLCYSISAHHENF-SDNGVLYIASATAKENIMALTSSIV 316 (424) Q Consensus 239 ~~~~~~~~-~~~~d~~al~ll~~iLg~g~sSrL~~~lRe~~glaY~v~s~~~~~-~~~g~~~i~~~~~~~~~~~~~~~i~ 316 (424) -++|.+.. --+.|.+++.++.... . ++- .+-|++=.+.-| .+.|.+.+. ..+.+....+.+.+. T Consensus 211 D~AYQGF~~GleeDa~~lR~~a~~~---~------~~l----va~S~SKnfgLYgERVGa~~vv-a~~~~~a~~v~sqlk 276 (396) T COG1448 211 DIAYQGFADGLEEDAYALRLFAEVG---P------ELL----VASSFSKNFGLYGERVGALSVV-AEDAEEADRVLSQLK 276 (396) T ss_pred EHHHHHHCCCHHHHHHHHHHHHHHC---C------CEE----EEEHHHHHHHHHHHCCCEEEEE-ECCHHHHHHHHHHHH T ss_conf 3244442124677899999999729---8------589----9710001244343002315888-278899999999999 Q ss_pred HHHHHHCCCCCHHHHHHHHH-----HHHHHHHHHCCCHHHHHHHHHHHHH----HCCCCCCHHHHHHH-----HHCCCHH Q ss_conf 99985314899899999999-----9999998513898999999999999----68984798999998-----8727989 Q gi|254780464|r 317 EVVQSLLENIEQREIDKECA-----KIHAKLIKSQERSYLRALEISKQVM----FCGSILCSEKIIDT-----ISAITCE 382 (424) Q Consensus 317 ~~i~~l~~~i~~~el~~ak~-----~~~~~~~~~~~~~~~~a~~l~~~~~----~~~~~~~~~~~~~~-----i~~vt~~ 382 (424) ..++..-.++...--.-+.. .++.......+.-+.+...+...+. ..+......-+.+. +-..|++ T Consensus 277 ~~iR~~ySnPP~~Ga~vva~IL~~p~Lra~W~~El~~Mr~Ri~~mR~~lv~~L~~~~~~~~f~~i~~Q~GMFsy~Gls~~ 356 (396) T COG1448 277 AIIRTNYSNPPAHGAAVVATILNNPELRAEWEQELEEMRQRILEMRQALVDALKALGAPRNFDFIISQRGMFSYTGLSPE 356 (396) T ss_pred HHHHHCCCCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHCCCEEECCCCCHH T ss_conf 99986358984123999999948999999999999999999999999999999862788662247546752445899999 Q ss_pred HHHHHHHHH---CCCCCEEEEEECCCCCCCCHHH Q ss_conf 999999986---2778779998166010899899 Q gi|254780464|r 383 DIVGVAKKI---FSSTPTLAILGPPMDHVPTTSE 413 (424) Q Consensus 383 di~~~~~~~---l~~~~~i~~vgp~~~~~P~~~~ 413 (424) +|.+.-.++ +-..-.+.+-|-+..++.+.++ T Consensus 357 QV~rLree~~IY~v~sGRi~vaGl~~~ni~~va~ 390 (396) T COG1448 357 QVDRLREEFGIYLVASGRINVAGLNTSNIDYVAK 390 (396) T ss_pred HHHHHHHHCCEEEECCCEEEECCCCHHHHHHHHH T ss_conf 9999998566799537814653687440899999 No 79 >PRK09265 aminotransferase AlaT; Validated Probab=20.72 E-value=49 Score=12.51 Aligned_cols=12 Identities=25% Similarity=0.437 Sum_probs=5.7 Q ss_pred CCHHHHHHHHHH Q ss_conf 998999998843 Q gi|254780464|r 409 PTTSELIHALEG 420 (424) Q Consensus 409 P~~~~l~~~l~~ 420 (424) ...+.|.++++- T Consensus 383 ~~~e~l~eal~R 394 (404) T PRK09265 383 PRVDDLEDAIGR 394 (404) T ss_pred CCHHHHHHHHHH T ss_conf 999999999999 No 80 >PRK06543 nicotinate-nucleotide pyrophosphorylase; Provisional Probab=20.62 E-value=44 Score=12.85 Aligned_cols=12 Identities=8% Similarity=0.310 Sum_probs=6.7 Q ss_pred CCCHHHHHHHHH Q ss_conf 279899999999 Q gi|254780464|r 378 AITCEDIVGVAK 389 (424) Q Consensus 378 ~vt~~di~~~~~ 389 (424) .||.+.|.++++ T Consensus 245 gI~~~ni~~yA~ 256 (281) T PRK06543 245 NVSLNTVPAIAS 256 (281) T ss_pred CCCHHHHHHHHH T ss_conf 999999999997 No 81 >PRK09016 quinolinate phosphoribosyltransferase; Validated Probab=20.60 E-value=45 Score=12.76 Aligned_cols=12 Identities=25% Similarity=0.310 Sum_probs=5.9 Q ss_pred CCCHHHHHHHHH Q ss_conf 279899999999 Q gi|254780464|r 378 AITCEDIVGVAK 389 (424) Q Consensus 378 ~vt~~di~~~~~ 389 (424) .||.+.|.++++ T Consensus 260 gI~l~ni~~yA~ 271 (296) T PRK09016 260 NVTLETLREFAE 271 (296) T ss_pred CCCHHHHHHHHH T ss_conf 998999999997 No 82 >cd04913 ACT_AKii-LysC-BS-like_1 ACT domains of the lysine-sensitive aspartokinase isoenzyme AKII of Bacillus subtilis (BS) strain 168 and related proteins. This CD includes the N-terminal of the two ACT domains of the lysine-sensitive aspartokinase isoenzyme AKII of Bacillus subtilis (BS) strain 168, and the lysine plus threonine-sensitive aspartokinase of Corynebacterium glutamicum, and related sequences. In B. subtilis 168, the regulation of the diaminopimelate (Dap)-lysine biosynthetic pathway involves dual control by Dap and lysine, effected through separate Dap- and lysine-sensitive aspartokinase isoenzymes. The B. subtilis 168 AKII is induced by methionine and repressed and inhibited by lysine. Although Corynebacterium glutamicum is known to contain a single aspartokinase, both the succinylase and dehydrogenase variant pathways of DAP-lysine synthesis operate simultaneously in this organism. In corynebacteria and other various Gram-positive bacteria, the DAP-lysine pathway is fee Probab=20.02 E-value=49 Score=12.52 Aligned_cols=27 Identities=19% Similarity=0.161 Sum_probs=18.1 Q ss_pred CEEEEEEECCCCCCCCCHHHHHHHHHC Q ss_conf 237999612665652100122232210 Q gi|254780464|r 82 EHTSYHAWVLKEHVPLALEIIGDMLSN 108 (424) Q Consensus 82 d~~~~~~~~~~~~~~~~l~ll~~~~~~ 108 (424) ..+.++++++.++++++++.+.+.... T Consensus 41 ~~~disFTv~~~dl~~a~~vl~~~~~~ 67 (75) T cd04913 41 GTTDISFTVPKSDLKKALAVLEKLKKE 67 (75) T ss_pred CCEEEEEEECHHHHHHHHHHHHHHHHH T ss_conf 945899990499999999999999986 Done!