Query 044668
Match_columns 353
No_of_seqs 325 out of 1669
Neff 8.3
Searched_HMMs 46136
Date Fri Mar 29 05:31:46 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/044668.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/044668hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG2067 Mitochondrial processi 100.0 2.5E-56 5.4E-61 411.6 21.7 308 36-349 31-459 (472)
2 KOG0960 Mitochondrial processi 100.0 1.3E-51 2.9E-56 379.9 24.6 304 36-349 40-464 (467)
3 COG0612 PqqL Predicted Zn-depe 100.0 2.8E-43 6E-48 348.2 27.4 288 36-339 23-436 (438)
4 TIGR02110 PQQ_syn_pqqF coenzym 100.0 1.6E-37 3.4E-42 317.1 24.7 212 37-265 7-284 (696)
5 PRK15101 protease3; Provisiona 100.0 5.9E-34 1.3E-38 306.6 24.6 259 36-311 50-419 (961)
6 PTZ00432 falcilysin; Provision 100.0 6.7E-29 1.5E-33 267.7 27.1 256 37-313 99-524 (1119)
7 KOG2583 Ubiquinol cytochrome c 100.0 1.4E-27 3E-32 221.6 22.0 323 1-343 1-429 (429)
8 PRK15101 protease3; Provisiona 99.9 1.3E-21 2.8E-26 211.1 17.5 270 21-319 526-888 (961)
9 COG1025 Ptr Secreted/periplasm 99.9 2E-21 4.4E-26 198.6 16.0 152 57-208 46-239 (937)
10 KOG0959 N-arginine dibasic con 99.9 2.1E-21 4.5E-26 201.4 15.7 168 37-209 35-245 (974)
11 COG1026 Predicted Zn-dependent 99.8 9.6E-18 2.1E-22 172.5 20.2 223 62-296 44-390 (978)
12 KOG2019 Metalloendoprotease HM 99.8 4.6E-18 9.9E-23 167.1 15.3 248 26-290 25-396 (998)
13 PF00675 Peptidase_M16: Insuli 99.8 1.5E-18 3.2E-23 146.6 7.8 111 41-156 2-148 (149)
14 PF05193 Peptidase_M16_C: Pept 99.7 4.1E-16 8.9E-21 134.2 11.6 117 162-289 1-184 (184)
15 KOG0961 Predicted Zn2+-depende 99.6 4.5E-14 9.7E-19 139.2 18.0 162 29-202 22-233 (1022)
16 COG1026 Predicted Zn-dependent 98.1 0.00014 3.1E-09 76.5 17.3 252 29-304 526-904 (978)
17 PTZ00432 falcilysin; Provision 97.8 0.0005 1.1E-08 75.7 16.4 276 37-337 667-1106(1119)
18 COG1025 Ptr Secreted/periplasm 97.4 0.013 2.8E-07 61.9 19.0 241 57-310 527-859 (937)
19 KOG2019 Metalloendoprotease HM 97.2 0.0067 1.4E-07 61.7 13.8 236 37-293 568-924 (998)
20 KOG0959 N-arginine dibasic con 96.5 0.16 3.4E-06 54.7 17.9 165 23-204 515-711 (974)
21 PF03410 Peptidase_M44: Protei 96.0 0.055 1.2E-06 53.0 10.2 148 55-208 18-193 (590)
22 PHA03081 putative metalloprote 94.6 0.21 4.6E-06 49.1 9.2 148 55-208 18-193 (595)
23 COG0612 PqqL Predicted Zn-depe 91.6 0.5 1.1E-05 46.8 6.9 76 272-347 121-220 (438)
24 KOG0961 Predicted Zn2+-depende 82.2 4.9 0.00011 41.7 7.4 67 217-298 825-922 (1022)
25 TIGR02110 PQQ_syn_pqqF coenzym 82.1 15 0.00032 38.9 11.2 103 225-335 42-195 (696)
26 PF08367 M16C_assoc: Peptidase 78.5 8.5 0.00019 35.0 7.3 81 38-124 78-182 (248)
27 KOG2067 Mitochondrial processi 77.3 14 0.00031 35.9 8.5 125 59-187 267-442 (472)
28 PF00675 Peptidase_M16: Insuli 66.1 50 0.0011 26.9 8.6 86 225-318 33-141 (149)
29 PF09851 SHOCT: Short C-termin 45.4 21 0.00046 21.3 2.1 18 273-290 13-30 (31)
30 KOG0960 Mitochondrial processi 36.5 4.3E+02 0.0094 26.1 11.8 33 157-189 417-450 (467)
31 PRK05986 cob(I)alamin adenolsy 31.5 62 0.0013 28.4 3.7 38 160-199 130-168 (191)
32 PF01729 QRPTase_C: Quinolinat 30.6 51 0.0011 28.2 3.0 41 159-199 105-147 (169)
33 PRK09016 quinolinate phosphori 30.5 89 0.0019 29.4 4.8 38 160-199 234-272 (296)
34 COG0285 FolC Folylpolyglutamat 30.2 62 0.0013 32.2 3.9 53 242-294 51-112 (427)
35 PRK06559 nicotinate-nucleotide 28.3 1E+02 0.0022 29.0 4.7 39 159-199 222-261 (290)
36 PRK06978 nicotinate-nucleotide 27.2 1.2E+02 0.0025 28.6 4.9 38 160-199 231-269 (294)
37 TIGR00708 cobA cob(I)alamin ad 26.6 88 0.0019 26.9 3.7 37 161-199 113-150 (173)
38 COG0157 NadC Nicotinate-nucleo 26.0 1.3E+02 0.0027 28.1 4.8 40 160-199 214-254 (280)
39 PRK08385 nicotinate-nucleotide 25.9 1.2E+02 0.0026 28.3 4.8 41 159-199 207-251 (278)
40 PRK06543 nicotinate-nucleotide 25.1 1.2E+02 0.0027 28.2 4.7 39 159-199 218-257 (281)
41 PRK07414 cob(I)yrinic acid a,c 25.0 92 0.002 27.0 3.6 37 160-198 130-167 (178)
42 KOG3432 Vacuolar H+-ATPase V1 24.8 1.8E+02 0.0038 23.1 4.7 45 162-206 44-89 (121)
43 PRK06106 nicotinate-nucleotide 23.4 1.4E+02 0.003 27.9 4.7 38 160-199 220-258 (281)
44 PRK07896 nicotinate-nucleotide 22.2 1.8E+02 0.0038 27.3 5.1 41 159-199 224-266 (289)
45 PF00719 Pyrophosphatase: Inor 20.9 1.7E+02 0.0036 24.7 4.3 44 158-201 106-151 (156)
46 PLN02373 soluble inorganic pyr 20.8 1.6E+02 0.0035 25.7 4.3 48 155-202 128-176 (188)
47 PRK01250 inorganic pyrophospha 20.3 1.2E+02 0.0025 26.3 3.3 50 155-204 122-173 (176)
No 1
>KOG2067 consensus Mitochondrial processing peptidase, alpha subunit [Posttranslational modification, protein turnover, chaperones]
Probab=100.00 E-value=2.5e-56 Score=411.58 Aligned_cols=308 Identities=43% Similarity=0.585 Sum_probs=280.6
Q ss_pred CCCCCcccccCCCCcccCCCCCceeEEEccccCCCCCCCCcHHHHHHHHhcccCCCCC-------------cCccccCcc
Q 044668 36 STSSPSLDFPLPGICLSHPSSLPDYFYVDCGSIYESPISFVKTDLLERMAFRSTRDRS-------------NIQASPSRE 102 (353)
Q Consensus 36 ~~~~~~l~~~~p~~~~~~~~~v~~~~~v~~Gs~~E~~~~~G~ahlle~llf~gt~~~~-------------~~~a~t~~d 102 (353)
.||++|.+++-||.=. ++++++++|||+|.+...|++||+|.|.|++|.+++ +..|+++||
T Consensus 31 ~NGlkvase~~pg~f~------~vGlyIdsGsrYE~~~~~GisH~lerLAF~ST~~~~~~ei~~~LE~~GGn~~cqsSRe 104 (472)
T KOG2067|consen 31 PNGLKVASENTPGQFC------TVGLYIDSGSRYEAKYFSGISHFLERLAFKSTERFSSKEILAELEKLGGNCDCQSSRE 104 (472)
T ss_pred CCccEEeccCCCCCce------EEEEEEecCccccCcCcccHHHHHHHHhhccccCCcHHHHHHHHHHhCCcccccccHh
Confidence 4688888888888766 899999999999999999999999999999999998 899999999
Q ss_pred eeEEEeecccCCHHHHHHHHHH----------------------HHHHhcChHHHHHHHHHHHhc-CCCCCCCCCCChHh
Q 044668 103 QMGCSFAALETYVPEMVELLTD----------------------ISEATRNPQSLLSEAIFSACY-SVVLANPLLAPECA 159 (353)
Q Consensus 103 ~t~~~~~~~~~~l~~~l~ll~d----------------------i~~~~~~p~~~~~~~l~~~~f-~~p~g~~~~G~~e~ 159 (353)
.+.|.+++.+++++.++++|+| +++...+|+.++.|.+|.++| ++++|.|.+++.+.
T Consensus 105 tm~Yaas~~~~~v~sm~~lLadtV~~P~~~d~ev~~~~~~v~~E~~el~~~Pe~lL~e~iH~Aay~~ntlg~pl~cp~~~ 184 (472)
T KOG2067|consen 105 TMMYAASADSDGVDSMVELLADTVLNPKFTDQEVEEARRAVKYEIEELWMRPEPLLTEMIHSAAYSGNTLGLPLLCPEEN 184 (472)
T ss_pred hhHHHHHhhhcccHHHHHHHHHHHhcccccHHHHHHHHHhhhheccccccCchhhHHHHHHHHHhccCcccccccCChhh
Confidence 9999999999999999999999 334557899999999999999 56999999999999
Q ss_pred hcccCHHHHHHHHHhhcCCCCeEEEEEccCHHHHHHHHHHhhCCCCCCCCCC--CCc-----------------------
Q 044668 160 ISRLNSTLIKEFFAENYTAPWMVLAASGVEHDQLVSVEEPLLSDLPISILTK--SPD----------------------- 214 (353)
Q Consensus 160 l~~it~edL~~f~~~~y~p~n~~lvivGid~~el~~~v~~~f~~l~~~~~~~--~p~----------------------- 214 (353)
|++|+.+.|.+|.+.+|+|++|+++.|||+||++.+.++++|++||+...++ .++
T Consensus 185 i~~I~~~~l~~yl~~~ytp~rmVlA~vGV~heelv~~~~~~~~~~~s~~~p~i~~~~aQYtGG~~~~~~d~~~~~~g~El 264 (472)
T KOG2067|consen 185 IDKINREVLEEYLKYFYTPERMVLAGVGVEHEELVEIAEKLLGDLPSTKVPPIDESKAQYTGGELKIDTDAPQVTGGPEL 264 (472)
T ss_pred hhhhhHHHHHHHHHhcCChhheEeeecCCCHHHHHHHHHHHhccCCccCCCCcccchhhccccccccCCCCccccCccce
Confidence 9999999999999999999999999999999999999999999999843332 111
Q ss_pred -cc-------cCCCCchHHHHHHHHhhcCCCCCCCCCCCCCcccHHHHHHHhhCCceee-----ecee----ee------
Q 044668 215 -LC-------TLEDKDAMTLTVTQMLLEGGGSFSAGGPGKGVYSRLHRRVLNEIPRVQQ-----GVYF----CG------ 271 (353)
Q Consensus 215 -~i-------~~~~~d~~~~~vl~~iLGgg~s~~~g~pg~g~~SrL~~~lre~~gl~Ys-----~~y~----fg------ 271 (353)
|| ..+|+|++++.|||.+||||+|||+|||||||.||||.+|.+++.|+|+ ++|+ ||
T Consensus 265 tHv~lg~Eg~~~~deD~v~~avLq~lmGGGGSFSAGGPGKGMySrLY~~vLNry~wv~sctAfnhsy~DtGlfgi~~s~~ 344 (472)
T KOG2067|consen 265 THVVLGFEGCSWNDEDFVALAVLQMLMGGGGSFSAGGPGKGMYSRLYLNVLNRYHWVYSCTAFNHSYSDTGLFGIYASAP 344 (472)
T ss_pred eeeeEeeccCCCCChhHHHHHHHHHHhcCCcccCCCCCCcchHHHHHHHHHhhhHHHHHhhhhhccccCCceeEEeccCC
Confidence 22 3356899999999999999999999999999999999999999999999 5565 55
Q ss_pred ----------------ecCCCCCHHHHHHHHHHHHHHHHHhccCHHHHHHHHHHHHHHcCCCcc----------------
Q 044668 272 ----------------ITPGEVNQVQLDRAVQSTNSAILMNLESRIVVSEDIDRQVQTHGEMKP---------------- 319 (353)
Q Consensus 272 ----------------l~~~~is~~EL~~aK~~l~~~~~~~~es~~~~a~~l~~~~~~~g~~~~---------------- 319 (353)
....+++++||+|||+++++.++|++|+.--.+++++||++.+|.++.
T Consensus 345 P~~a~~aveli~~e~~~~~~~v~~~el~RAK~qlkS~LlMNLESR~V~~EDvGRQVL~~g~rk~p~e~~~~Ie~lt~~DI 424 (472)
T KOG2067|consen 345 PQAANDAVELIAKEMINMAGGVTQEELERAKTQLKSMLLMNLESRPVAFEDVGRQVLTTGERKPPDEFIKKIEQLTPSDI 424 (472)
T ss_pred HHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHhcccccchhHHHHhHHHHhccCcCCHHHHHHHHHhcCHHHH
Confidence 245779999999999999999999999999999999999999998876
Q ss_pred -----cccCCCceEEEEcCCCCCCCHHHHHHHHhc
Q 044668 320 -----KLILSPLTMASYGDVINVPSYDAESAASSS 349 (353)
Q Consensus 320 -----~~l~~~~~~~~vG~~~~~p~~~~~~~~~~~ 349 (353)
|+|.++|+|+..||..++|.|+.|.+++++
T Consensus 425 ~rva~kvlt~~p~va~~Gd~~~lpt~~~i~~~~~~ 459 (472)
T KOG2067|consen 425 SRVASKVLTGKPSVAAFGDGTGLPTYDHIGNAVSS 459 (472)
T ss_pred HHHHHHHhcCCceeccCCcccCCcchhhhhhhccc
Confidence 788999999999999999999999998864
No 2
>KOG0960 consensus Mitochondrial processing peptidase, beta subunit, and related enzymes (insulinase superfamily) [Posttranslational modification, protein turnover, chaperones]
Probab=100.00 E-value=1.3e-51 Score=379.88 Aligned_cols=304 Identities=25% Similarity=0.390 Sum_probs=265.0
Q ss_pred CCCCCcccccCCCCcccCCCCCceeEEEccccCCCCCCCCcHHHHHHHHhcccCCCCC-------------cCccccCcc
Q 044668 36 STSSPSLDFPLPGICLSHPSSLPDYFYVDCGSIYESPISFVKTDLLERMAFRSTRDRS-------------NIQASPSRE 102 (353)
Q Consensus 36 ~~~~~~l~~~~p~~~~~~~~~v~~~~~v~~Gs~~E~~~~~G~ahlle~llf~gt~~~~-------------~~~a~t~~d 102 (353)
.||++|.++.. +... .++++|+++|||+|.+.++|+|||||||.|+||++++ .+|++|+||
T Consensus 40 ~NGlrVaTE~~-~a~T-----ATVGVwidaGSR~EnekNNG~ahFLEhlaFKGT~~Rs~~alElEieniGahLNAytSRe 113 (467)
T KOG0960|consen 40 PNGLRVATEHN-SAST-----ATVGVWIDAGSRFENEKNNGTAHFLEHLAFKGTKNRSQAALELEIENIGAHLNAYTSRE 113 (467)
T ss_pred CCCcEEEeccC-CCcc-----eEEEEEeccCccccccccccHHHHHHHHHhcCCCcchhHHHHHHHHHHHHHhccccccc
Confidence 35677777777 6666 7999999999999999999999999999999999998 899999999
Q ss_pred eeEEEeecccCCHHHHHHHHHH----------------------HHHHhcChHHHHHHHHHHHhc-CCCCCCCCCCChHh
Q 044668 103 QMGCSFAALETYVPEMVELLTD----------------------ISEATRNPQSLLSEAIFSACY-SVVLANPLLAPECA 159 (353)
Q Consensus 103 ~t~~~~~~~~~~l~~~l~ll~d----------------------i~~~~~~p~~~~~~~l~~~~f-~~p~g~~~~G~~e~ 159 (353)
.|.|+++++++++++++++|+| +++.+++-..++++.+|..+| ++|+|+++.|+.|+
T Consensus 114 qT~yyakal~~dv~kavdiLaDIlqns~L~~s~IerER~vILrEmqevd~~~~eVVfdhLHatafQgtPL~~tilGp~en 193 (467)
T KOG0960|consen 114 QTVYYAKALSKDVPKAVDILADILQNSKLEESAIERERDVILREMQEVDKNHQEVVFDHLHATAFQGTPLGRTILGPSEN 193 (467)
T ss_pred ceeeehhhccccchHHHHHHHHHHHhCccchhHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHhcCCcccccccChhhh
Confidence 9999999999999999999999 234556778899999999999 67999999999999
Q ss_pred hcccCHHHHHHHHHhhcCCCCeEEEEEc-cCHHHHHHHHHHhhCCCCCCCC---CC-C-Cc-----------------cc
Q 044668 160 ISRLNSTLIKEFFAENYTAPWMVLAASG-VEHDQLVSVEEPLLSDLPISIL---TK-S-PD-----------------LC 216 (353)
Q Consensus 160 l~~it~edL~~f~~~~y~p~n~~lvivG-id~~el~~~v~~~f~~l~~~~~---~~-~-p~-----------------~i 216 (353)
|++|+++||++|.++||.++||+|+.+| |+||++.++++++||+++.... ++ . |. ++
T Consensus 194 I~si~r~DL~~yi~thY~~~RmVlaaaGgV~He~lv~la~k~fg~~~~~~~~~~~~~~~~~~FtgsEvR~rdd~lP~a~~ 273 (467)
T KOG0960|consen 194 IKSISRADLKDYINTHYKASRMVLAAAGGVKHEELVKLAEKYFGDLSKLQTGDKVPLVPPARFTGSEVRVRDDDLPLAHI 273 (467)
T ss_pred hhhhhHHHHHHHHHhcccCccEEEEecCCcCHHHHHHHHHHHcCCCcccccCcCCCCCCCccccCceeeecCCCCchhhe
Confidence 9999999999999999999999999999 9999999999999999764221 11 1 11 11
Q ss_pred --------cCCCCchHHHHHHHHhhcCCCCCCCCCCCCCcccHHHHHHHhhCCceee-----ecee----ee--------
Q 044668 217 --------TLEDKDAMTLTVTQMLLEGGGSFSAGGPGKGVYSRLHRRVLNEIPRVQQ-----GVYF----CG-------- 271 (353)
Q Consensus 217 --------~~~~~d~~~~~vl~~iLGgg~s~~~g~pg~g~~SrL~~~lre~~gl~Ys-----~~y~----fg-------- 271 (353)
|. +||++++.|+|.|+|....+--||. ...|||-+.+-+. .+|.+ .+|. ||
T Consensus 274 AiAVEG~~w~-~pD~~~l~van~iiG~wdr~~g~g~--~~~s~La~~~~~~-~l~~sfqsFnt~YkDTGLwG~y~V~~~~ 349 (467)
T KOG0960|consen 274 AIAVEGVSWA-HPDYFALMVANTIIGNWDRTEGGGR--NLSSRLAQKIQQD-QLCHSFQSFNTSYKDTGLWGIYFVTDNL 349 (467)
T ss_pred eeeEecCCcC-CccHHHHHHHHHHhhhhhcccCCcc--CCccHHHHHHHHH-HHHHHHhhhhcccccccceeEEEEecCh
Confidence 54 5999999999999999887665554 4799999999886 56655 3454 55
Q ss_pred ---------------ecCCCCCHHHHHHHHHHHHHHHHHhccCHHHHHHHHHHHHHHcCCCcc-----------------
Q 044668 272 ---------------ITPGEVNQVQLDRAVQSTNSAILMNLESRIVVSEDIDRQVQTHGEMKP----------------- 319 (353)
Q Consensus 272 ---------------l~~~~is~~EL~~aK~~l~~~~~~~~es~~~~a~~l~~~~~~~g~~~~----------------- 319 (353)
-....+|+.|++|||++|+.++++.+|++..++++||+|++.+|+.++
T Consensus 350 ~~iddl~~~vl~eW~rL~~~vteaEV~RAKn~Lkt~Lll~ldgttpi~ediGrqlL~~Grri~l~El~~rId~vt~~~Vr 429 (467)
T KOG0960|consen 350 TMIDDLIHSVLKEWMRLATSVTEAEVERAKNQLKTNLLLSLDGTTPIAEDIGRQLLTYGRRIPLAELEARIDAVTAKDVR 429 (467)
T ss_pred hhHHHHHHHHHHHHHHHHhhccHHHHHHHHHHHHHHHHHHhcCCCchHHHHHHHHhhcCCcCChHHHHHHHhhccHHHHH
Confidence 123479999999999999999999999999999999999999999987
Q ss_pred ----c-ccCCCceEEEEcCCCCCCCHHHHHHHHhc
Q 044668 320 ----K-LILSPLTMASYGDVINVPSYDAESAASSS 349 (353)
Q Consensus 320 ----~-~l~~~~~~~~vG~~~~~p~~~~~~~~~~~ 349 (353)
| +++.+++++.||+++.+|.|..|+..|+-
T Consensus 430 ~va~k~iyd~~iAia~vG~ie~lpdy~~irs~m~w 464 (467)
T KOG0960|consen 430 EVASKYIYDKDIAIAAVGPIEGLPDYNRIRSGMSW 464 (467)
T ss_pred HHHHHHhhcCCcceeeecccccCchHHHHhccchh
Confidence 4 55889999999999999999999998863
No 3
>COG0612 PqqL Predicted Zn-dependent peptidases [General function prediction only]
Probab=100.00 E-value=2.8e-43 Score=348.23 Aligned_cols=288 Identities=24% Similarity=0.325 Sum_probs=242.6
Q ss_pred CCCCCcccccCCCCcccCCCCCceeEEEccccCCCCCCCCcHHHHHHHHhcccCCCCC-------------cCccccCcc
Q 044668 36 STSSPSLDFPLPGICLSHPSSLPDYFYVDCGSIYESPISFVKTDLLERMAFRSTRDRS-------------NIQASPSRE 102 (353)
Q Consensus 36 ~~~~~~l~~~~p~~~~~~~~~v~~~~~v~~Gs~~E~~~~~G~ahlle~llf~gt~~~~-------------~~~a~t~~d 102 (353)
.||+.++..|+|.++. +++.++|++|+++|++++.|+|||||||+|+||++++ ..||+|++|
T Consensus 23 ~nGl~~~~~~~~~~~~-----vs~~~~v~~Gs~~e~~~~~G~AH~lehm~fkgt~~~~~~~i~~~~~~~G~~~na~ts~d 97 (438)
T COG0612 23 PNGLRVITYPNPTAPT-----VSLDVWVKAGSRAEPAGKAGIAHFLEHMAFKGTTGLPSAELAEAFEKLGGQLNAFTSFD 97 (438)
T ss_pred CCCCEEEEEeCCCCCE-----EEEEEEEeecccCCCCCcccHHHHHHHHHccCCCCCChHHHHHHHHHhcCeeeccccch
Confidence 4577777778877777 9999999999999999999999999999999987755 889999999
Q ss_pred eeEEEeecccCCHHHHHHHHHH----------------------HHHHhcChHHHHHHHHHHHhc-CCCCCCCCCCChHh
Q 044668 103 QMGCSFAALETYVPEMVELLTD----------------------ISEATRNPQSLLSEAIFSACY-SVVLANPLLAPECA 159 (353)
Q Consensus 103 ~t~~~~~~~~~~l~~~l~ll~d----------------------i~~~~~~p~~~~~~~l~~~~f-~~p~g~~~~G~~e~ 159 (353)
+|.|++++++++++.+|++++| |++..++|..++++.+++.+| +|||+++++|++++
T Consensus 98 ~t~y~~~~l~~~~~~~l~llad~l~~p~f~~~~~e~Ek~vil~ei~~~~d~p~~~~~~~l~~~~~~~~p~~~~~~G~~e~ 177 (438)
T COG0612 98 YTVYYLSVLPDNLDKALDLLADILLNPTFDEEEVEREKGVILEEIRMRQDDPDDLAFERLLEALYGNHPLGRPILGTEES 177 (438)
T ss_pred hhhhhhhhchhhhHHHHHHHHHHHhCCCCCHHHHHHHHHHHHHHHHhhccCchHHHHHHHHHHhhccCCCCCCCCCCHHH
Confidence 9999999999999999999999 455678999999999999999 68999999999999
Q ss_pred hcccCHHHHHHHHHhhcCCCCeEEEEEc-cCHHHHHHHHHHhhCCCCC-CCCCCC---Cc-------c------------
Q 044668 160 ISRLNSTLIKEFFAENYTAPWMVLAASG-VEHDQLVSVEEPLLSDLPI-SILTKS---PD-------L------------ 215 (353)
Q Consensus 160 l~~it~edL~~f~~~~y~p~n~~lvivG-id~~el~~~v~~~f~~l~~-~~~~~~---p~-------~------------ 215 (353)
|+++|++||++||++||+|+||+|+||| |+++++.++++++|++|+. .++... |. +
T Consensus 178 I~~it~~dl~~f~~k~Y~p~n~~l~vvGdi~~~~v~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 257 (438)
T COG0612 178 IEAITREDLKDFYQKWYQPDNMVLVVVGDVDAEEVVELIEKYFGDLPGAAPPPKIPPEPPLGPERVVRVNDPEQPDLEQA 257 (438)
T ss_pred HHhCCHHHHHHHHHHhcCcCceEEEEecCCCHHHHHHHHHHHHccCCccCCCCCCCCccccCCCceEEecCCCCchhhhh
Confidence 9999999999999999999999999999 9999999999999999986 221111 10 0
Q ss_pred -c-------cCCC-CchHHHHHHHHhhcCCCCCCCCCCCCCcccHHHHHHHhhCCceee-ecee--------ee------
Q 044668 216 -C-------TLED-KDAMTLTVTQMLLEGGGSFSAGGPGKGVYSRLHRRVLNEIPRVQQ-GVYF--------CG------ 271 (353)
Q Consensus 216 -i-------~~~~-~d~~~~~vl~~iLGgg~s~~~g~pg~g~~SrL~~~lre~~gl~Ys-~~y~--------fg------ 271 (353)
+ ...+ .|++++.|++.+|||+ ++||||+++||++||+|+ .++. ++
T Consensus 258 ~~~~g~~~~~~~~~~~~~~~~l~~~llgg~-----------~~SrLf~~~re~~glay~~~~~~~~~~~~~~~~~~~~~~ 326 (438)
T COG0612 258 WLALGYPGPDYDSPDDYAALLLLNGLLGGG-----------FSSRLFQELREKRGLAYSVSSFSDFLSDSGLFSIYAGTA 326 (438)
T ss_pred hhhccccCcCcCcchhhHHHHHHHHHhCCC-----------cchHHHHHHHHhcCceeeeccccccccccCCceEEEEec
Confidence 0 1111 2678999999999987 999999999999999998 2211 11
Q ss_pred -----------------ecC---CCCCHHHHHHHHHHHHHHHHHhccCHHHHHHHHHHHHHHcCCCcc------------
Q 044668 272 -----------------ITP---GEVNQVQLDRAVQSTNSAILMNLESRIVVSEDIDRQVQTHGEMKP------------ 319 (353)
Q Consensus 272 -----------------l~~---~~is~~EL~~aK~~l~~~~~~~~es~~~~a~~l~~~~~~~g~~~~------------ 319 (353)
+.. +.+|++|++++|..+.+.++...+++..+++.++......+....
T Consensus 327 ~~~~~~~~~~i~~~~~~~~~~~~~~~t~~~~~~~k~~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~vt 406 (438)
T COG0612 327 PENPEKTAELVEEILKALKKGLKGPFTEEELDAAKQLLIGLLLLSLDSPSSIAELLGQYLLLGGSLITLEELLERIEAVT 406 (438)
T ss_pred CCChhhHHHHHHHHHHHHHHHhccCCCHHHHHHHHHHHHHHhhhccCCHHHHHHHHHHHHHhcCCccCHHHHHHHHHhcC
Confidence 222 359999999999999999999999999999999966553334333
Q ss_pred ---------cccC-CCceEEEEcCCCCCCC
Q 044668 320 ---------KLIL-SPLTMASYGDVINVPS 339 (353)
Q Consensus 320 ---------~~l~-~~~~~~~vG~~~~~p~ 339 (353)
+++. .+++++++|+....+.
T Consensus 407 ~~dv~~~a~~~~~~~~~~~~~~~p~~~~~~ 436 (438)
T COG0612 407 LEDVNAVAKKLLAPENLTIVVLGPEKALKD 436 (438)
T ss_pred HHHHHHHHHHhcCCCCcEEEEEcccccccc
Confidence 4553 4599999999887654
No 4
>TIGR02110 PQQ_syn_pqqF coenzyme PQQ biosynthesis probable peptidase PqqF. In a subset of species that make coenzyme PQQ (pyrrolo-quinoline-quinone), this probable peptidase is found in the PQQ biosynthesis region and is thought to act as a protease on PqqA (TIGR02107), a probable peptide precursor of the coenzyme. PQQ is required for some glucose dehydrogenases and alcohol dehydrogenases.
Probab=100.00 E-value=1.6e-37 Score=317.13 Aligned_cols=212 Identities=15% Similarity=0.105 Sum_probs=182.0
Q ss_pred CCCCcccccCCCCcccCCCCCceeEEEccccCCCCCCCCcHHHHHHHHhcccCCCCC--------------cCccccCcc
Q 044668 37 TSSPSLDFPLPGICLSHPSSLPDYFYVDCGSIYESPISFVKTDLLERMAFRSTRDRS--------------NIQASPSRE 102 (353)
Q Consensus 37 ~~~~~l~~~~p~~~~~~~~~v~~~~~v~~Gs~~E~~~~~G~ahlle~llf~gt~~~~--------------~~~a~t~~d 102 (353)
||++|+..+.|+.++ +++.++|++||++|++++.|+|||+|||+|+||++++ ++|++|++|
T Consensus 7 NGLrVllv~~p~~p~-----vav~l~v~aGS~~Ep~~~~GLAHfLEHMLFkGT~~~~~~~~i~~~le~lGG~lNA~Ts~d 81 (696)
T TIGR02110 7 NGLRVHLYHQPDAKR-----AAALLRVAAGSHDEPSAWPGLAHFLEHLLFLGGERFQGDDRLMPWVQRQGGQVNATTLER 81 (696)
T ss_pred CCCEEEEEECCCCCE-----EEEEEEEeeccCCCCCCCCcHHHHHHHHHhcCCCCCCcHHHHHHHHHHhCCeEEEEEcCC
Confidence 355555555555555 8999999999999999999999999999999999987 899999999
Q ss_pred eeEEEeecccCCHHHHHHHHHH----------------------HHHHhcChHHHHHHHHHHHhc-CCCCCCCCCCChHh
Q 044668 103 QMGCSFAALETYVPEMVELLTD----------------------ISEATRNPQSLLSEAIFSACY-SVVLANPLLAPECA 159 (353)
Q Consensus 103 ~t~~~~~~~~~~l~~~l~ll~d----------------------i~~~~~~p~~~~~~~l~~~~f-~~p~g~~~~G~~e~ 159 (353)
+|.|++++++++++.+|++++| ++...++|..++.+.+.+.+| +|||+++.+|+.++
T Consensus 82 ~T~y~~~v~~~~l~~aL~lLaD~l~~P~f~eeeierEr~vvl~Ei~~~~ddp~~~~~~~l~~~l~~~HPy~~~~iGt~es 161 (696)
T TIGR02110 82 TTAFFFELPAAALAAGLARLCDMLARPLLTAEDQQREREVLEAEYIAWQNDADTLREAALLDALQAGHPLRRFHAGSRDS 161 (696)
T ss_pred eEEEEEEecHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHHHHHHHhcCHHHHHHHHHHHHcCCCCCCCCCCCCCHHH
Confidence 9999999999999999999999 333557999999999999999 78999999999999
Q ss_pred hcccC---HHHHHHHHHhhcCCCCeEEEEEc-cCHHHHHHHHHHhhCCCCCCCCCC----CC----c------------c
Q 044668 160 ISRLN---STLIKEFFAENYTAPWMVLAASG-VEHDQLVSVEEPLLSDLPISILTK----SP----D------------L 215 (353)
Q Consensus 160 l~~it---~edL~~f~~~~y~p~n~~lvivG-id~~el~~~v~~~f~~l~~~~~~~----~p----~------------~ 215 (353)
|++++ .+||++||++||.|+||+|+|+| +++++++++++++|++|+.+..+. .| . +
T Consensus 162 L~~it~~t~edL~~F~~~~Y~p~NmvLvIvGdvs~eel~~l~e~~f~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~q~~ 241 (696)
T TIGR02110 162 LALPNTAFQQALRDFHRRHYQAGNMQLWLQGPQSLDELEQLAARFGASLAAGGECAQAPPAPLLRFDRLTLAGGSEPRLW 241 (696)
T ss_pred HhCcccchHHHHHHHHHHhcchhcEEEEEEeCCCHHHHHHHHHHHhCCCCCCCCCCCCCCCCCCCCceeEEEecCcceEE
Confidence 99876 99999999999999999999999 999999999999999997553221 11 0 1
Q ss_pred c-----cCCCCchHHHHHHHHhhcCCCCCCCCCCCCCcccHHHHHHHhhCCceee
Q 044668 216 C-----TLEDKDAMTLTVTQMLLEGGGSFSAGGPGKGVYSRLHRRVLNEIPRVQQ 265 (353)
Q Consensus 216 i-----~~~~~d~~~~~vl~~iLGgg~s~~~g~pg~g~~SrL~~~lre~~gl~Ys 265 (353)
+ .....|..++.+++++|||+ ++|+||++||| +||+|+
T Consensus 242 l~~~~p~~~~~d~~al~lL~~iLg~g-----------~sSrL~~~LRe-~GLays 284 (696)
T TIGR02110 242 LLFALAGLPATARDNVTLLCEFLQDE-----------APGGLLAQLRE-RGLAES 284 (696)
T ss_pred EEEeecCCCCCChHHHHHHHHHhCCC-----------cchHHHHHHHH-CCCEEE
Confidence 1 12223444689999999998 89999999997 799999
No 5
>PRK15101 protease3; Provisional
Probab=100.00 E-value=5.9e-34 Score=306.61 Aligned_cols=259 Identities=10% Similarity=0.063 Sum_probs=209.5
Q ss_pred CCCCCcccccCCCCcccCCCCCceeEEEccccCCCCCCCCcHHHHHHHHhcccCCCCC--------------cCccccCc
Q 044668 36 STSSPSLDFPLPGICLSHPSSLPDYFYVDCGSIYESPISFVKTDLLERMAFRSTRDRS--------------NIQASPSR 101 (353)
Q Consensus 36 ~~~~~~l~~~~p~~~~~~~~~v~~~~~v~~Gs~~E~~~~~G~ahlle~llf~gt~~~~--------------~~~a~t~~ 101 (353)
.||++|+..+.|..++ +++.++|++||++||+++.|+|||+|||+|+||++++ .+||+|+.
T Consensus 50 ~NGL~v~l~~~~~~~~-----~~~~l~v~~Gs~~ep~~~~GlAHflEHmlf~GT~~~p~~~~~~~~l~~~Gg~~NA~T~~ 124 (961)
T PRK15101 50 DNGMTVLLVSDPQAVK-----SLAALALPVGSLEDPDAQQGLAHYLEHMVLMGSKKYPQPDSLAEFLKKHGGSHNASTAS 124 (961)
T ss_pred CCCCEEEEEeCCCCcc-----eeEEEEeCcCCCCCCCCCCchHHHHHHHHhcCCccCCCcchHHHHHHHhCCCccceECC
Confidence 4577888878877777 8999999999999999999999999999999999985 88999999
Q ss_pred ceeEEEeecccCCHHHHHHHHHH----------------------HHHHhcChHHHHHHHHHHHhc-CCCCCCCCCCChH
Q 044668 102 EQMGCSFAALETYVPEMVELLTD----------------------ISEATRNPQSLLSEAIFSACY-SVVLANPLLAPEC 158 (353)
Q Consensus 102 d~t~~~~~~~~~~l~~~l~ll~d----------------------i~~~~~~p~~~~~~~l~~~~f-~~p~g~~~~G~~e 158 (353)
|+|.|++++++++++.+|+++++ ++...++|...+.+.+.+.+| +|||+++..|+.+
T Consensus 125 d~T~y~~~~~~~~l~~aL~~~ad~~~~P~f~~~~~erE~~~v~~E~~~~~~~~~~~~~~~~~~~~~~~hp~~~~~~G~~e 204 (961)
T PRK15101 125 YRTAFYLEVENDALPPAVDRLADAIAEPLLDPKNADRERNAVNAELTMARSRDGMRMAQVSAETINPAHPGSRFSGGNLE 204 (961)
T ss_pred CceEEEEEcCHHHHHHHHHHHHHHHhccCCCHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHhhCCCCCCcccCCCCCHH
Confidence 99999999999999999999999 233457788999999988999 7999999999999
Q ss_pred hhccc----CHHHHHHHHHhhcCCCCeEEEEEc-cCHHHHHHHHHHhhCCCCCCCCC--C--CC------c---------
Q 044668 159 AISRL----NSTLIKEFFAENYTAPWMVLAASG-VEHDQLVSVEEPLLSDLPISILT--K--SP------D--------- 214 (353)
Q Consensus 159 ~l~~i----t~edL~~f~~~~y~p~n~~lvivG-id~~el~~~v~~~f~~l~~~~~~--~--~p------~--------- 214 (353)
+|.++ ++++|++||++||+|+||+|+|+| ++++++.++++++|++||.+..+ . .| .
T Consensus 205 tl~~~~~~~~~~~L~~f~~~~Y~p~nm~lvv~G~~~~~~l~~~~~~~F~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 284 (961)
T PRK15101 205 TLSDKPGSKLQDALVDFYQRYYSANLMKAVIYSNQPLPELAKLAADTFGRVPNKNASVPEITVPVVTDAQKGIIIHYVPA 284 (961)
T ss_pred HhhcCCchHHHHHHHHHHHHhCcccceEEEEEcCCCHHHHHHHHHHHhccCCCCCCCCCCCCCCCCCHHHcCeEEEEEEC
Confidence 99997 699999999999999999999999 99999999999999999865321 1 01 0
Q ss_pred ------cc-cCC---CC--chHHHHHHHHhhcCCCCCCCCCCCCCcccHHHHHHHhhCCceee-ec----ee------ee
Q 044668 215 ------LC-TLE---DK--DAMTLTVTQMLLEGGGSFSAGGPGKGVYSRLHRRVLNEIPRVQQ-GV----YF------CG 271 (353)
Q Consensus 215 ------~i-~~~---~~--d~~~~~vl~~iLGgg~s~~~g~pg~g~~SrL~~~lre~~gl~Ys-~~----y~------fg 271 (353)
++ |.- ++ +..+..+++.+||++ ..++|+..| +++||+|+ .+ +. |.
T Consensus 285 ~~~~~l~l~~~~p~~~~~~~~~~~~~l~~ll~~~-----------~~g~l~~~L-~~~gla~~v~s~~~~~~~~~~g~f~ 352 (961)
T PRK15101 285 QPRKVLRVEFRIDNNSAKFRSKTDEYISYLIGNR-----------SPGTLSDWL-QKQGLAEGISAGADPMVDRNSGVFA 352 (961)
T ss_pred CCCcEEEEEEecCCcHHHHhhCHHHHHHHHhcCC-----------CCCcHHHHH-HHcCccceeeeccccccCCCceEEE
Confidence 01 211 11 223578899999987 445677666 47899998 21 11 11
Q ss_pred --------------------------ecCCCCCHHHHHHHHHHHHHHHHHh-ccCHHHHHHHHHHHH
Q 044668 272 --------------------------ITPGEVNQVQLDRAVQSTNSAILMN-LESRIVVSEDIDRQV 311 (353)
Q Consensus 272 --------------------------l~~~~is~~EL~~aK~~l~~~~~~~-~es~~~~a~~l~~~~ 311 (353)
+++.+++++||+++|+.+..++... ..++...+..++..+
T Consensus 353 i~~~~~~~~~~~~~~v~~~i~~~i~~l~~~g~~~~el~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~ 419 (961)
T PRK15101 353 ISVSLTDKGLAQRDQVVAAIFSYLNLLREKGIDKSYFDELAHVLDLDFRYPSITRDMDYIEWLADTM 419 (961)
T ss_pred EEEEcChHHHHhHHHHHHHHHHHHHHHHhcCCcHHHHHHHHHHHhccccCCCCCChHHHHHHHHHHh
Confidence 5678999999999999999887543 344444556666543
No 6
>PTZ00432 falcilysin; Provisional
Probab=99.97 E-value=6.7e-29 Score=267.69 Aligned_cols=256 Identities=16% Similarity=0.119 Sum_probs=200.9
Q ss_pred CCCCcccccCCC--CcccCCCCCceeEEEccccCCCCCCCCcHHHHHHHHhcccCCCCC---------------cCcccc
Q 044668 37 TSSPSLDFPLPG--ICLSHPSSLPDYFYVDCGSIYESPISFVKTDLLERMAFRSTRDRS---------------NIQASP 99 (353)
Q Consensus 37 ~~~~~l~~~~p~--~~~~~~~~v~~~~~v~~Gs~~E~~~~~G~ahlle~llf~gt~~~~---------------~~~a~t 99 (353)
+|++|+..+.+. .+. +++.|+|++|+ .+..|++|++|||+|+||++++ .+||+|
T Consensus 99 nGl~vl~~~~~d~~~~~-----~~f~i~f~T~~----~d~~G~aH~LEH~~f~GS~k~p~~~~~~~l~~~gl~~~lNA~T 169 (1119)
T PTZ00432 99 TGLQVISLKTNDSSGKE-----MCFDFYVPTPP----HNDKGIPHILEHSVLSGSKKYNYKDSFSLLVQGGFNSFLNAYT 169 (1119)
T ss_pred CCCEEEEEecCCCccce-----eEEEEEecCCC----CCCcchhHHHHHHHhCCCCCCCcccHHHHHHhcCcCCCccccC
Confidence 355655554433 122 67889999997 4568999999999999999988 579999
Q ss_pred CcceeEEEeecccC-CHHHHHHHHHH-----------H---------------------------------------HH-
Q 044668 100 SREQMGCSFAALET-YVPEMVELLTD-----------I---------------------------------------SE- 127 (353)
Q Consensus 100 ~~d~t~~~~~~~~~-~l~~~l~ll~d-----------i---------------------------------------~~- 127 (353)
+.|+|+|++.++.+ ++..+|+++.| + ++
T Consensus 170 ~~D~T~Y~~~~~~e~d~~~~ldv~~d~v~~P~~~~~~~~f~qEgwh~E~~~~~~~~~~~~e~~~~~~~~l~~kgVV~~Em 249 (1119)
T PTZ00432 170 FKDRTSYLFASTNEKDFYNTADVYMDSVFQPNILEDKDIFKQEGWHYKVTKLKDDEKNADELGNVHDRHVSYSGIVYSEM 249 (1119)
T ss_pred CCCceEEEeccCCHHHHHHHHHHHHHHHhCcCcccccchhhhhhhhccccccccccccccccccccccccchhhHHHHHH
Confidence 99999999999875 79999999998 1 01
Q ss_pred --HhcChHHHHHHHHHHHhcCCCCCCCCCCChHhhcccCHHHHHHHHHhhcCCCCeEEEEEc-cCHHHHHHHHHHhhCCC
Q 044668 128 --ATRNPQSLLSEAIFSACYSVVLANPLLAPECAISRLNSTLIKEFFAENYTAPWMVLAASG-VEHDQLVSVEEPLLSDL 204 (353)
Q Consensus 128 --~~~~p~~~~~~~l~~~~f~~p~g~~~~G~~e~l~~it~edL~~f~~~~y~p~n~~lvivG-id~~el~~~v~~~f~~l 204 (353)
..++|..++.+.+.+.+|+|||+++..|++++|.++|.++|++||++||+|+||+|+++| +|++++.++++++|+++
T Consensus 250 k~~~~~p~~~~~~~~~~~lf~~pY~~~~~G~~~~I~~lt~e~l~~Fh~~~Y~P~N~~l~v~Gdid~~~~l~~l~~~f~~~ 329 (1119)
T PTZ00432 250 KKRFSDPLSFGYSVIYQNLFSNVYKYDSGGDPKDIVELTYEELVEFYKTYYGPKTATVYFYGPNDVTERLEFVDNYLTKH 329 (1119)
T ss_pred HHhhCCHHHHHHHHHHHHHhCCCCCCCCCCChHhhccCCHHHHHHHHHHhcCccceEEEEEcCCCHHHHHHHHHHHHhhc
Confidence 225899999999999989779999999999999999999999999999999999999999 99999999999999777
Q ss_pred CCCC----------C---CC------CCc--------------c---c-cC---------------CC-CchHHHHHHHH
Q 044668 205 PISI----------L---TK------SPD--------------L---C-TL---------------ED-KDAMTLTVTQM 231 (353)
Q Consensus 205 ~~~~----------~---~~------~p~--------------~---i-~~---------------~~-~d~~~~~vl~~ 231 (353)
|... . .+ .|+ + + |. .+ +|++++.||++
T Consensus 330 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~e~~~l~~~~w~~~p~~~~~~~~~~~~~d~~~~~AL~VLs~ 409 (1119)
T PTZ00432 330 PKTGQLSHTAYREDADENLLYEEYKDKPKHVKKKFSSHSEEEENLMSVSWLLNPKHNGSKDYDKSLIDPVDYLALLVLNY 409 (1119)
T ss_pred ccccccccccccccccccccccccccCCeEEEeccCCCccccccEEEEEEEcCCccccccccccccCCHHHHHHHHHHHH
Confidence 5431 0 00 010 1 1 31 22 57899999999
Q ss_pred hhcCCCCCCCCCCCCCcccHHHHHHHhhCCceee--e-cee-------e--e----------------------------
Q 044668 232 LLEGGGSFSAGGPGKGVYSRLHRRVLNEIPRVQQ--G-VYF-------C--G---------------------------- 271 (353)
Q Consensus 232 iLGgg~s~~~g~pg~g~~SrL~~~lre~~gl~Ys--~-~y~-------f--g---------------------------- 271 (353)
+|||+ .+|+|+++||| .||+|+ . .+. | +
T Consensus 410 lLggg-----------~sS~L~q~LrE-~GLa~svv~~~~~~~~~~~~f~I~l~g~~~~~~~~~~~~~~ev~~~I~~~L~ 477 (1119)
T PTZ00432 410 LLLGT-----------PESVLYKALID-SGLGKKVVGSGLDDYFKQSIFSIGLKGIKETNEKRKDKVHYTFEKVVLNALT 477 (1119)
T ss_pred HHcCC-----------CccHHHHHHHh-cCCCcCCCcCcccCCCCceEEEEEEEcCChHhccchhhhHHHHHHHHHHHHH
Confidence 99998 89999999997 699877 1 111 1 1
Q ss_pred -ecCCCCCHHHHHHHHHHHHHHHHHhcc----CHHHHHHHHHHHHHH
Q 044668 272 -ITPGEVNQVQLDRAVQSTNSAILMNLE----SRIVVSEDIDRQVQT 313 (353)
Q Consensus 272 -l~~~~is~~EL~~aK~~l~~~~~~~~e----s~~~~a~~l~~~~~~ 313 (353)
++.+|+++++|++||+++.-.+...-- ..-.++..+...+++
T Consensus 478 ~l~~eGi~~eele~a~~qlef~~rE~~~~~~p~gl~~~~~~~~~~~~ 524 (1119)
T PTZ00432 478 KVVTEGFNKSAVEASLNNIEFVMKELNLGTYPKGLMLIFLMQSRLQY 524 (1119)
T ss_pred HHHHhCCCHHHHHHHHHHHHHHhhhccCCCCCcHHHHHHHHHHHHhc
Confidence 457899999999999999887764321 134566666655553
No 7
>KOG2583 consensus Ubiquinol cytochrome c reductase, subunit QCR2 [Energy production and conversion]
Probab=99.96 E-value=1.4e-27 Score=221.62 Aligned_cols=323 Identities=21% Similarity=0.275 Sum_probs=233.9
Q ss_pred CCcccccccccccccccccccCCCCCceecccCCCCCCCCcccccCCCCcccCCCCCceeEEEccccCCCCCCCCcHHHH
Q 044668 1 MRCRMPATARFASSSAVASTSSSSGGFYSWLSGEQSTSSPSLDFPLPGICLSHPSSLPDYFYVDCGSIYESPISFVKTDL 80 (353)
Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~p~~~~~~~~~v~~~~~v~~Gs~~E~~~~~G~ahl 80 (353)
|+||--.+.||.+.++.+ +| ..-++..--|+|.|.+.+.|+... .+.+.|++|||||+.+++|++|+
T Consensus 1 M~sr~~~~~r~~~~a~~~------~~-~~~~~~kl~ngL~Vas~e~~~~is------~l~l~~~AGSRYe~~~~~G~sHl 67 (429)
T KOG2583|consen 1 MLSRASSFVRFATAAAPA------AG-QISKTTKLVNGLTVASREAPTAIS------SLSLAFRAGSRYEPADQQGLSHL 67 (429)
T ss_pred CCccccchhhcccccccc------cc-chhhhhccccceEEEeccCCCcce------EEEEEEecCccCCccccccHHHH
Confidence 899999999996665522 33 222222224588888888876655 78899999999999999999999
Q ss_pred HHHHhcccCCCCC-------------cCccccCcceeEEEeecccCCHHHHHHHHHH-----------HH----------
Q 044668 81 LERMAFRSTRDRS-------------NIQASPSREQMGCSFAALETYVPEMVELLTD-----------IS---------- 126 (353)
Q Consensus 81 le~llf~gt~~~~-------------~~~a~t~~d~t~~~~~~~~~~l~~~l~ll~d-----------i~---------- 126 (353)
++...++.|++++ .++.+++||.+.|++++++++++..|.+|.+ ++
T Consensus 68 lr~f~g~~Tq~~sal~ivr~se~~GG~Lss~~tRe~~~~tvt~lrd~~~~~l~~L~~V~~~paFkPwEl~D~~~~ti~~~ 147 (429)
T KOG2583|consen 68 LRNFVGRDTQERSALKIVRESEQLGGTLSSTATRELIGLTVTFLRDDLEYYLSLLGDVLDAPAFKPWELEDVVLATIDAD 147 (429)
T ss_pred HHHhcccCccccchhhhhhhhHhhCceeeeeeecceEEEEEEEecccHHHHHHHHHHhhcccCcCchhhhhhhhhhhHHH
Confidence 9999999999998 8999999999999999999999999999999 11
Q ss_pred HHhcChHHHHHHHHHHHhcCCCCCCCCCCChHhhcccCHHHHHHHHHhhcCCCCeEEEEEccCHHHHHHHHHHhhC---C
Q 044668 127 EATRNPQSLLSEAIFSACYSVVLANPLLAPECAISRLNSTLIKEFFAENYTAPWMVLAASGVEHDQLVSVEEPLLS---D 203 (353)
Q Consensus 127 ~~~~~p~~~~~~~l~~~~f~~p~g~~~~G~~e~l~~it~edL~~f~~~~y~p~n~~lvivGid~~el~~~v~~~f~---~ 203 (353)
-...+|..+++|.+|+++|++.+|++++.+.-.+.+++.++|.+|-+++|...|++++.+|+|++.++...++++- .
T Consensus 148 l~~~t~~~~a~e~lH~aAfRngLgnslY~p~~~vg~vss~eL~~Fa~k~fv~gn~~lvg~nvd~~~L~~~~~~~~~~~~~ 227 (429)
T KOG2583|consen 148 LAYQTPYTIAIEQLHAAAFRNGLGNSLYSPGYQVGSVSSSELKDFAAKHFVKGNAVLVGVNVDHDDLKQFADEYAPIRDG 227 (429)
T ss_pred hhhcChHHHHHHHHHHHHHhcccCCcccCCcccccCccHHHHHHHHHHHhhccceEEEecCCChHHHHHHHHHhccccCC
Confidence 1347899999999999999779999999988889999999999999999999999999999999999999999832 1
Q ss_pred CCCCCCCCC------Cc-------cc--------cCCCCchHHHHHHHHhhcCCCCCCCCCCCCCcccHHHHHHHhhCCc
Q 044668 204 LPISILTKS------PD-------LC--------TLEDKDAMTLTVTQMLLEGGGSFSAGGPGKGVYSRLHRRVLNEIPR 262 (353)
Q Consensus 204 l~~~~~~~~------p~-------~i--------~~~~~d~~~~~vl~~iLGgg~s~~~g~pg~g~~SrL~~~lre~~gl 262 (353)
++..+.+.. ++ || ..+.++..+..|+.++||-+...+.| + |..+.+-...-+ .|.
T Consensus 228 ~~~k~a~a~~~gGe~Rk~~~g~~~~v~vagegAAa~~~k~~~a~av~~~~Lg~~~~~k~~-t--~~~~~aa~~a~~-~~~ 303 (429)
T KOG2583|consen 228 LPLKPAPAKYSGGEARKDARGNRVHVAVAGEGAAAGNLKVLAAQAVLLAALGNSAPVKRG-T--GLLSEAAGAAGE-QGA 303 (429)
T ss_pred CCCCCCCccccCCccccccCCceeEEEEecCcccccchHHHHHHHHHHHHHhcccccccc-c--chHHHHHhhccc-cCc
Confidence 222221111 11 22 23457778889999999988666665 2 234433322222 233
Q ss_pred eee---ecee----ee-----------------------ecCCCCCHHHHHHHHHHHHHHHHHhccCHHHHHHHHHHHHH
Q 044668 263 VQQ---GVYF----CG-----------------------ITPGEVNQVQLDRAVQSTNSAILMNLESRIVVSEDIDRQVQ 312 (353)
Q Consensus 263 ~Ys---~~y~----fg-----------------------l~~~~is~~EL~~aK~~l~~~~~~~~es~~~~a~~l~~~~~ 312 (353)
.-+ .+|+ || ....+++....+.+++.++.....+.+. +-.....+..
T Consensus 304 s~sA~~a~ysDsGL~gv~~~~~~~~a~~~v~s~v~~lks~~~~~id~~~~~a~~~~l~~~~~ss~~a---~~~~~~~~a~ 380 (429)
T KOG2583|consen 304 SASAFNAPYSDSGLFGVYVSAQGSQAGKVVSSEVKKLKSALVSDIDNAKVKAAIKALKASYLSSVEA---LELATGSQAN 380 (429)
T ss_pred eeeeecccccCCceEEEEEEecCccHHHHHHHHHHHHHHHHhcCCcchHHHHHHHHHHHHhhcchHH---HHHhhHHHhc
Confidence 322 3343 44 2345555555666666666665554433 2222222211
Q ss_pred HcCCC------cc------------cccCCCceEEEEcCCCCCCCHHHH
Q 044668 313 THGEM------KP------------KLILSPLTMASYGDVINVPSYDAE 343 (353)
Q Consensus 313 ~~g~~------~~------------~~l~~~~~~~~vG~~~~~p~~~~~ 343 (353)
.-+.+ ++ +++..++++++|||..++||+|++
T Consensus 381 ~~~~~d~~i~~id~Vt~sdV~~a~kk~~s~kls~aA~Gnl~~vPY~DEL 429 (429)
T KOG2583|consen 381 LVSEPDAFIQQIDKVTASDVQKAAKKFLSGKLSLAAYGNLSNVPYLDEL 429 (429)
T ss_pred CCCChHHHHHHhccccHHHHHHHHHHhccCcceeeeeccccCCcccccC
Confidence 01111 01 555689999999999999999985
No 8
>PRK15101 protease3; Provisional
Probab=99.87 E-value=1.3e-21 Score=211.06 Aligned_cols=270 Identities=9% Similarity=0.013 Sum_probs=202.4
Q ss_pred cCCCCCceecccCCCCC-CCCcccccCCCCcccCCCCCceeEEEccccCCCCCCCCcHHHHHHHHhcccCCCCC------
Q 044668 21 SSSSGGFYSWLSGEQST-SSPSLDFPLPGICLSHPSSLPDYFYVDCGSIYESPISFVKTDLLERMAFRSTRDRS------ 93 (353)
Q Consensus 21 ~~~~~~~~~~~~~~~~~-~~~~l~~~~p~~~~~~~~~v~~~~~v~~Gs~~E~~~~~G~ahlle~llf~gt~~~~------ 93 (353)
...+.|++.|+.+++.= ++|. +.+.+.+++|...+++.+.|++.|+..|+......+.
T Consensus 526 i~~~~g~~vw~~~d~~f~~~Pk---------------~~i~~~~~~~~~~~~~~~~~l~~L~~~ll~~~l~e~~y~a~~a 590 (961)
T PRK15101 526 IVDEPGLRVVYMPSQYFADEPK---------------ADISLVLRNPKAMDSARNQVLFALNDYLAGLALDQLSNQASVG 590 (961)
T ss_pred EEcCCCeEEEEeCCCccccCCC---------------EEEEEEEeCCCccCCHHHHHHHHHHHHHHHHHHHHHhchHHhc
Confidence 45578999999988611 2343 7889999999999999999999999998743322222
Q ss_pred --cCccccCcceeEEEeecccCCHHHHHHHHHH-------------------HHHHhcCh-HHHHHHHHHH-Hhc-CCCC
Q 044668 94 --NIQASPSREQMGCSFAALETYVPEMVELLTD-------------------ISEATRNP-QSLLSEAIFS-ACY-SVVL 149 (353)
Q Consensus 94 --~~~a~t~~d~t~~~~~~~~~~l~~~l~ll~d-------------------i~~~~~~p-~~~~~~~l~~-~~f-~~p~ 149 (353)
+++.. +.+.+.+++++++++++.+|+++.+ +++..+.- ...+...+.. ..| .|||
T Consensus 591 G~~~~~~-~~~g~~i~v~g~s~~l~~ll~~l~d~l~~~~~~~~~fe~~k~~~~~~l~~~~~~~~~~~~~~~~~~~~~~py 669 (961)
T PRK15101 591 GISFSTN-ANNGLMVNANGYTQRLPQLLQALLEGYFSFTPTEEQLAQAKSWYREQLDSAEKGKAYEQAIMPAQMLSQVPY 669 (961)
T ss_pred CcEEEEc-cCCCEEEEEEecChhHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHhhhcccCcHHHHHHHHHHHhcCCC
Confidence 55556 6899999999999999999999988 11111100 1111122221 235 7899
Q ss_pred CCCCCCChHhhcccCHHHHHHHHHhhcCCCCeEEEEEc-cCHHHHHHHHHHhhCCCCCCCC---C------CCCc-----
Q 044668 150 ANPLLAPECAISRLNSTLIKEFFAENYTAPWMVLAASG-VEHDQLVSVEEPLLSDLPISIL---T------KSPD----- 214 (353)
Q Consensus 150 g~~~~G~~e~l~~it~edL~~f~~~~y~p~n~~lvivG-id~~el~~~v~~~f~~l~~~~~---~------~~p~----- 214 (353)
+.+ .++.+.|+++|.+||++||+++|.+.|++++|+| ++.+++.++++++++.++..+. . ..+.
T Consensus 670 ~~~-~~~~~~l~~it~edl~~f~~~~~~~~~~~~~v~GNi~~~ea~~l~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 748 (961)
T PRK15101 670 FER-DERRKLLPSITLKDVLAYRDALLSGATPEFLVVGNLTEEQVTTLARDVQKQLGADGTEWWRGKDVVVDKKQSVNFE 748 (961)
T ss_pred CCH-HHHHHHHhcCCHHHHHHHHHHHHHhceEEEEEEcCCCHHHHHHHHHHHHHHhccCCcccccccceEeCCCCeEEEe
Confidence 875 6789999999999999999999999999999999 9999999999988877653211 0 0000
Q ss_pred --------cc---c--CCCCchHHHHHHHHhhcCCCCCCCCCCCCCcccHHHHHHHhhCCceee-ecee------ee---
Q 044668 215 --------LC---T--LEDKDAMTLTVTQMLLEGGGSFSAGGPGKGVYSRLHRRVLNEIPRVQQ-GVYF------CG--- 271 (353)
Q Consensus 215 --------~i---~--~~~~d~~~~~vl~~iLGgg~s~~~g~pg~g~~SrL~~~lre~~gl~Ys-~~y~------fg--- 271 (353)
.+ + .. .+.++..+++.+|||+ ++||||++||+++||+|+ .++. ++
T Consensus 749 ~~~~~~~~~~~~~~~~~g-~~~~~~~v~~~lLg~~-----------~ssrlf~~LRtk~qLgY~V~s~~~~~~~~~~~~~ 816 (961)
T PRK15101 749 KAGSSTDSALAAVYVPTG-YDEYQSSAYSSLLGQI-----------IQPWFYNQLRTEEQLGYAVFAFPMSVGRQWGMGF 816 (961)
T ss_pred cCCCCCCCeEEEEEEeCC-CCCHHHHHHHHHHHHH-----------HhHHHHHHHHHHhhhceEEEEEeeccCCeeeEEE
Confidence 00 1 12 3458899999999998 999999999999999999 2211 11
Q ss_pred ----------------------e--cCCCCCHHHHHHHHHHHHHHHHHhccCHHHHHHHHHHHHHHcCCCcc
Q 044668 272 ----------------------I--TPGEVNQVQLDRAVQSTNSAILMNLESRIVVSEDIDRQVQTHGEMKP 319 (353)
Q Consensus 272 ----------------------l--~~~~is~~EL~~aK~~l~~~~~~~~es~~~~a~~l~~~~~~~g~~~~ 319 (353)
+ .-.++|++||+++|+.+++++....++....+..++..+...+.+.+
T Consensus 817 ~vqs~~~~~~~l~~~i~~f~~~~~~~l~~lt~eE~~~~k~~l~~~~~~~~~sl~~~a~~~~~~i~~~~~~fd 888 (961)
T PRK15101 817 LLQSNDKQPAYLWQRYQAFFPQAEAKLRAMKPEEFAQYQQALINQLLQAPQTLGEEASRLSKDFDRGNMRFD 888 (961)
T ss_pred EEECCCCCHHHHHHHHHHHHHHHHHHHHhCCHHHHHHHHHHHHHHhcCCCCCHHHHHHHHHHHHhcCCCCcC
Confidence 1 12479999999999999999999999999999999877764444444
No 9
>COG1025 Ptr Secreted/periplasmic Zn-dependent peptidases, insulinase-like [Posttranslational modification, protein turnover, chaperones]
Probab=99.87 E-value=2e-21 Score=198.58 Aligned_cols=152 Identities=18% Similarity=0.181 Sum_probs=132.3
Q ss_pred CceeEEEccccCCCCCCCCcHHHHHHHHhcccCCCCC--------------cCccccCcceeEEEeecccCCHHHHHHHH
Q 044668 57 LPDYFYVDCGSIYESPISFVKTDLLERMAFRSTRDRS--------------NIQASPSREQMGCSFAALETYVPEMVELL 122 (353)
Q Consensus 57 v~~~~~v~~Gs~~E~~~~~G~ahlle~llf~gt~~~~--------------~~~a~t~~d~t~~~~~~~~~~l~~~l~ll 122 (353)
-+..+.|++||..||.+.+|+|||+|||+|.|+++|+ +.||+|..+.|+|++++..+.|+.+|+.+
T Consensus 46 s~aAL~V~vGs~~DP~e~~GLAHflEHmlfmGseKYP~~~~f~~fLskhgGs~NA~T~~~~T~fyFeV~~~al~~ALDrF 125 (937)
T COG1025 46 SSAALVVPVGSFDDPEEYPGLAHFLEHMLFMGSEKYPDEGGFSEFLSKHGGSHNASTAGERTAFYFEVENDALEGALDRF 125 (937)
T ss_pred cceeEEeecCCCCChhhcccHHHHHHHHHHhcCccCCCccchHHHHHHcCCccccccCCCceeEEEEecHHHHHHHHHHH
Confidence 5678999999999999999999999999999999998 89999999999999999999999999999
Q ss_pred HH-----------HH--------HH---hcChHHHHHHHHHHHhc-CCCCCCCCCCChHhhcc----cCHHHHHHHHHhh
Q 044668 123 TD-----------IS--------EA---TRNPQSLLSEAIFSACY-SVVLANPLLAPECAISR----LNSTLIKEFFAEN 175 (353)
Q Consensus 123 ~d-----------i~--------~~---~~~p~~~~~~~l~~~~f-~~p~g~~~~G~~e~l~~----it~edL~~f~~~~ 175 (353)
++ ++ +. ..+-..+..+.....+- +||+.+...|..++|.. ...++|++||++|
T Consensus 126 a~ff~~PLf~~e~~dRE~~AV~sE~~~~~~~D~~R~~~~~~~~~np~HP~srFs~GN~~TL~~~p~~~v~~el~ef~~~~ 205 (937)
T COG1025 126 ADFFIEPLFNKEALDRERNAVNSEFTMNLTSDGWRMYQVQALTANPGHPLSKFSTGNLETLSDKPGLVVQQELKEFHEKH 205 (937)
T ss_pred HHHHhccccChHHHHHHHHHHHHHHhcCcCchHHHHHHHHHhhcCCCCCccccCCCChhhhccCCCchHHHHHHHHHHHh
Confidence 99 11 11 12333444444444444 68999999999999988 6689999999999
Q ss_pred cCCCCeEEEEEc-cCHHHHHHHHHHhhCCCCCCC
Q 044668 176 YTAPWMVLAASG-VEHDQLVSVEEPLLSDLPISI 208 (353)
Q Consensus 176 y~p~n~~lvivG-id~~el~~~v~~~f~~l~~~~ 208 (353)
|.++||+++|.| -+.+++.+++.++||.+|...
T Consensus 206 YSa~~M~lviyg~q~ldeL~~~a~~~F~~Ipn~~ 239 (937)
T COG1025 206 YSANNMKLVIYGNQPLDELAKLAADLFGDIPNRA 239 (937)
T ss_pred cChhheEEEEecCCCHHHHHHHHHHHhCcCCCCC
Confidence 999999999999 999999999999999998654
No 10
>KOG0959 consensus N-arginine dibasic convertase NRD1 and related Zn2+-dependent endopeptidases, insulinase superfamily [Posttranslational modification, protein turnover, chaperones]
Probab=99.86 E-value=2.1e-21 Score=201.42 Aligned_cols=168 Identities=13% Similarity=0.096 Sum_probs=139.1
Q ss_pred CCCCcccccCCCCcccCCCCCceeEEEccccCCCCCCCCcHHHHHHHHhcccCCCCC--------------cCccccCcc
Q 044668 37 TSSPSLDFPLPGICLSHPSSLPDYFYVDCGSIYESPISFVKTDLLERMAFRSTRDRS--------------NIQASPSRE 102 (353)
Q Consensus 37 ~~~~~l~~~~p~~~~~~~~~v~~~~~v~~Gs~~E~~~~~G~ahlle~llf~gt~~~~--------------~~~a~t~~d 102 (353)
||+.++....|+... -+..+-|++||..||.+.+|+|||+|||+|.||++|+ +.||+|+.|
T Consensus 35 Ngl~alLisDp~tD~-----ssaal~V~vGS~~DP~dl~GLAHF~EHMlFmGS~KYP~En~y~~~lsk~gGssNA~T~~e 109 (974)
T KOG0959|consen 35 NGLRALLISDPKTDK-----SSAALDVKVGSFSDPEDLQGLAHFCEHMLFMGSEKYPDENEYSKFLSKNGGSSNAYTDSE 109 (974)
T ss_pred CCceEEEecCCCCCc-----cceeeeeeccccCCccccccHHHHHHHHHhhccccCCCcchhHHHHHhcCCccccccccc
Confidence 344444444444333 5677889999999999999999999999999999998 889999999
Q ss_pred eeEEEeecccCCHHHHHHHHHH-----------HH--------HHh--cChHHHHHHHHHHHhc--CCCCCCCCCCChHh
Q 044668 103 QMGCSFAALETYVPEMVELLTD-----------IS--------EAT--RNPQSLLSEAIFSACY--SVVLANPLLAPECA 159 (353)
Q Consensus 103 ~t~~~~~~~~~~l~~~l~ll~d-----------i~--------~~~--~~p~~~~~~~l~~~~f--~~p~g~~~~G~~e~ 159 (353)
+|+|++.+..++|+.+|+.+++ ++ +.. -+-+......+.+.+. +|||++...|..++
T Consensus 110 ~T~y~F~V~~~~l~~ALDrFaqFf~~Plf~~~a~eREv~AVdSE~~~nl~~D~wr~~ql~~~l~~~~hp~~kF~tGN~~t 189 (974)
T KOG0959|consen 110 HTNYYFDVQHDHLEGALDRFAQFFSDPLFNKSATEREVGAVDSEHEKNLNSDGWRFDQLLRSLSNPGHPYSKFSTGNKKT 189 (974)
T ss_pred cceEEEecchHHHHHHHHHHHHHhhCcccChHHHHHHHHHHHHHHHhccCcchhHHHHHHHHhcCCCCcchhccccchhh
Confidence 9999999999999999999999 11 111 1122333444555555 58999999999999
Q ss_pred hcccC-----HHHHHHHHHhhcCCCCeEEEEEc-cCHHHHHHHHHHhhCCCCCCCC
Q 044668 160 ISRLN-----STLIKEFFAENYTAPWMVLAASG-VEHDQLVSVEEPLLSDLPISIL 209 (353)
Q Consensus 160 l~~it-----~edL~~f~~~~y~p~n~~lvivG-id~~el~~~v~~~f~~l~~~~~ 209 (353)
|.... ++.|++||++||.+++|++||+| .+.|++..++.+.|++++....
T Consensus 190 L~~~p~~~~~r~~L~kF~k~~Yssn~M~l~i~G~eslD~Le~lv~~~F~~i~N~~~ 245 (974)
T KOG0959|consen 190 LLEGPREIDLRDELLKFYKNWYSSNIMTLVIVGKESLDVLESLVTRLFDEISNKKK 245 (974)
T ss_pred hhhccccchHHHHHHHHHHhhcccccceEEEEcCCChhHHHHHHHHHcccccccCC
Confidence 99998 99999999999999999999999 9999999999999999876543
No 11
>COG1026 Predicted Zn-dependent peptidases, insulinase-like [General function prediction only]
Probab=99.78 E-value=9.6e-18 Score=172.47 Aligned_cols=223 Identities=14% Similarity=0.148 Sum_probs=175.9
Q ss_pred EEccccCCCCCCCCcHHHHHHHHhcccCCCCC---------------cCccccCcceeEEEeecc-cCCHHHHHHHHHH-
Q 044668 62 YVDCGSIYESPISFVKTDLLERMAFRSTRDRS---------------NIQASPSREQMGCSFAAL-ETYVPEMVELLTD- 124 (353)
Q Consensus 62 ~v~~Gs~~E~~~~~G~ahlle~llf~gt~~~~---------------~~~a~t~~d~t~~~~~~~-~~~l~~~l~ll~d- 124 (353)
.|...-+.+|.+..|++|.|||+.|.|+++|+ .+||.|+.|.|+|-++.. .+++-..|.+..|
T Consensus 44 vFsi~F~T~p~dstGVaHiLEHtvlcGS~kYPvkdPF~~ml~rSLntF~NA~T~~D~T~YP~sS~~~~Df~NLl~VYlDa 123 (978)
T COG1026 44 VFSIAFKTEPHDSTGVAHILEHTVLCGSKKYPVKDPFFKMLKRSLNTFLNAFTFPDKTVYPASSANEKDFYNLLSVYLDA 123 (978)
T ss_pred eEEEEeecCCCCCCCcchHHHHHhhhCCCCCCCCChHHHHHHHhHHHHHhhccCCCcceeeccccCcchHHHHHHHHHHh
Confidence 34445556888999999999999999999998 589999999999987655 4688999999988
Q ss_pred -----------------------------------HHHHhcChHHHHHHHHHHHhc-CCCCCCCCCCChHhhcccCHHHH
Q 044668 125 -----------------------------------ISEATRNPQSLLSEAIFSACY-SVVLANPLLAPECAISRLNSTLI 168 (353)
Q Consensus 125 -----------------------------------i~~~~~~p~~~~~~~l~~~~f-~~p~g~~~~G~~e~l~~it~edL 168 (353)
++....+|..++++.+++.+| +..||....|.++.|..+|.|++
T Consensus 124 vf~PlL~~e~F~QEgwr~e~~~~~~l~~~GVVyNEMKGa~ss~~~~~~~~~~~slfp~~ty~~~SGG~P~~I~~LtyE~~ 203 (978)
T COG1026 124 VFHPLLTKESFLQEGWRIEFKDESNLKYKGVVYNEMKGAYSSGESVLSRAMQQSLFPGTTYGVNSGGDPKNIPDLTYEEF 203 (978)
T ss_pred hhCcccchHHHhhhhhccccCCCccceeeeEEeehhcccccCchhHHHHHHHHhhCCCccccccCCCCcccccccCHHHH
Confidence 333457899999999999999 56899999999999999999999
Q ss_pred HHHHHhhcCCCCeEEEEEc-cCHHHHHHHHHHh-hCCCCCCCCC-C--------CCc----------------c----c-
Q 044668 169 KEFFAENYTAPWMVLAASG-VEHDQLVSVEEPL-LSDLPISILT-K--------SPD----------------L----C- 216 (353)
Q Consensus 169 ~~f~~~~y~p~n~~lvivG-id~~el~~~v~~~-f~~l~~~~~~-~--------~p~----------------~----i- 216 (353)
++||+++|+|+|+.|.++| ++.++..+.+++. +...++.... + .|. + +
T Consensus 204 r~FHkk~Y~pSN~~i~~yGni~~~~~L~~iee~~l~~~~k~~~~~~i~~~~~~~~~~~~~~~ypi~~~~~de~q~~~~ls 283 (978)
T COG1026 204 RAFHKKHYHPSNCKIFVYGNIPTERLLDFIEEKVLRPFGKRELDVPIPDQKAFKKPRRKVLEYPISFDEEDEDQGLLSLS 283 (978)
T ss_pred HHHHHHhCCccceEEEEECCCCHHHHHHHHHHhhhccccccccCCCCCcccccCcccccceeeccCCCCCCCceeEEEEE
Confidence 9999999999999999999 9999999999876 5544332210 0 010 0 1
Q ss_pred c-----CCCCchHHHHHHHHhhcCCCCCCCCCCCCCcccHHHHHHHhhCCce-ee--ecee---------ee--------
Q 044668 217 T-----LEDKDAMTLTVTQMLLEGGGSFSAGGPGKGVYSRLHRRVLNEIPRV-QQ--GVYF---------CG-------- 271 (353)
Q Consensus 217 ~-----~~~~d~~~~~vl~~iLGgg~s~~~g~pg~g~~SrL~~~lre~~gl~-Ys--~~y~---------fg-------- 271 (353)
| .+-.+.+++.||..+|-|+ ..|.|.++|.|. |+. +. .+|. +|
T Consensus 284 Wl~~~~~d~~~~lal~vL~~iLl~~-----------~asPl~~~lies-glg~~~~~g~~~~~~~~~~f~v~~~gv~~ek 351 (978)
T COG1026 284 WLGGSASDAEDSLALEVLEEILLDS-----------AASPLTQALIES-GLGFADVSGSYDSDLKETIFSVGLKGVSEEK 351 (978)
T ss_pred EecCCcccHHHHHHHHHHHHHHccC-----------cccHHHHHHHHc-CCCcccccceeccccceeEEEEEecCCCHHH
Confidence 3 2335679999999999887 799999999996 554 22 2122 22
Q ss_pred --------------ecCCCCCHHHHHHHHHHHHHHHHHh
Q 044668 272 --------------ITPGEVNQVQLDRAVQSTNSAILMN 296 (353)
Q Consensus 272 --------------l~~~~is~~EL~~aK~~l~~~~~~~ 296 (353)
+..+++..+.++.++.++.-++.-.
T Consensus 352 ~~~~k~lV~~~L~~l~~~gi~~~~ie~~~~q~E~s~ke~ 390 (978)
T COG1026 352 IAKLKNLVLSTLKELVKNGIDKKLIEAILHQLEFSLKEV 390 (978)
T ss_pred HHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHhhhhh
Confidence 4567788888888888877666554
No 12
>KOG2019 consensus Metalloendoprotease HMP1 (insulinase superfamily) [General function prediction only; Posttranslational modification, protein turnover, chaperones]
Probab=99.77 E-value=4.6e-18 Score=167.11 Aligned_cols=248 Identities=14% Similarity=0.111 Sum_probs=188.5
Q ss_pred CceecccCCCCCCCCcccc-cCCCCcccCCCCCceeE-------------------EEccccCCCCCCCCcHHHHHHHHh
Q 044668 26 GFYSWLSGEQSTSSPSLDF-PLPGICLSHPSSLPDYF-------------------YVDCGSIYESPISFVKTDLLERMA 85 (353)
Q Consensus 26 ~~~~~~~~~~~~~~~~l~~-~~p~~~~~~~~~v~~~~-------------------~v~~Gs~~E~~~~~G~ahlle~ll 85 (353)
-.+.|+.++.-.+..++.. ++|.+.+ -++.+ +|..+-+.-|.+..||.|+|||..
T Consensus 25 a~q~yp~G~~~hgfevv~~~~VpEl~l-----~av~lkH~~Tgae~lhl~reD~N~vFsI~FrTpp~dstGiPHILEHtv 99 (998)
T KOG2019|consen 25 APQRYPHGDKKHGFEVVSEEFVPELKL-----TAVLLKHKKTGAEVLHLDREDENNVFSIVFRTPPKDSTGIPHILEHTV 99 (998)
T ss_pred HHHhCCCCccccceEEeeeeechhhhh-----hheeeeecCCCceeEeeccCCCCceeEEEeecCCCccCCCchhhhhhe
Confidence 3456776444345555443 5666655 33333 778888888999999999999999
Q ss_pred cccCCCCC---------------cCccccCcceeEEEee-cccCCHHHHHHHHHH-------------------------
Q 044668 86 FRSTRDRS---------------NIQASPSREQMGCSFA-ALETYVPEMVELLTD------------------------- 124 (353)
Q Consensus 86 f~gt~~~~---------------~~~a~t~~d~t~~~~~-~~~~~l~~~l~ll~d------------------------- 124 (353)
..|+.+|+ .+||+|..|+|.|-+. +.+.|+....++.-|
T Consensus 100 LCGS~KYPvrdPFfkmLnrSLatFmNAfT~pD~T~yPfattN~kDf~NL~dVYLDAtffPklr~~dF~QEGWr~Eh~dps 179 (998)
T KOG2019|consen 100 LCGSRKYPVRDPFFKMLNRSLATFMNAFTAPDYTFYPFATTNTKDFYNLRDVYLDATFFPKLRKLDFQQEGWRLEHNDPS 179 (998)
T ss_pred eeccCcCcccChHHHHHHHHHHHHHhhccCCCcceeecccCChHHHHHHHHHhhhcccchHHHhhhhhhhcceeecCCCC
Confidence 99999998 7899999999999764 455788888887777
Q ss_pred ---------------HHHHhcChHHHHHHHHHHHhc-CCCCCCCCCCChHhhcccCHHHHHHHHHhhcCCCCeEEEEEc-
Q 044668 125 ---------------ISEATRNPQSLLSEAIFSACY-SVVLANPLLAPECAISRLNSTLIKEFFAENYTAPWMVLAASG- 187 (353)
Q Consensus 125 ---------------i~~~~~~p~~~~~~~l~~~~f-~~p~g~~~~G~~e~l~~it~edL~~f~~~~y~p~n~~lvivG- 187 (353)
++....||+.++...+++++| .+.||....|.+-.|..++.+++++||++||+|+|+-+...|
T Consensus 180 d~~SpivfkGVVfNEMKG~~S~~~~if~~~~Qq~L~p~~tYgv~SGGDPl~IpdLt~eelk~FHr~~YHPSNAri~tYGn 259 (998)
T KOG2019|consen 180 DPISPIVFKGVVFNEMKGQYSDPDYIFGMLFQQALFPENTYGVNSGGDPLDIPDLTYEELKEFHRQHYHPSNARIFTYGN 259 (998)
T ss_pred CCcccceeeeeeeecccccccChhHHHHHHHHHhhCccccccccCCCCcccCccccHHHHHHHHHhccCCCcceeEeecC
Confidence 233457899999999999999 689999999999999999999999999999999999999999
Q ss_pred cCHHHHHHHHHHhhCCCCCCCCC---C------CCc----------------c----c-c-----CCCCchHHHHHHHHh
Q 044668 188 VEHDQLVSVEEPLLSDLPISILT---K------SPD----------------L----C-T-----LEDKDAMTLTVTQML 232 (353)
Q Consensus 188 id~~el~~~v~~~f~~l~~~~~~---~------~p~----------------~----i-~-----~~~~d~~~~~vl~~i 232 (353)
++.++...+++.-|....+.... . .|+ + + | .+--+.+++.||.++
T Consensus 260 ~Pl~~~l~~l~e~~~~~sk~~~s~kv~~qk~f~kp~rvve~~p~d~~~~p~Kq~~~s~s~L~~~p~d~~etfaL~~L~~L 339 (998)
T KOG2019|consen 260 FPLEDLLKQLEEDFSPFSKRELSSKVTFQKLFDKPRRVVEKGPADPGDLPKKQTKCSNSFLSNDPLDTYETFALKVLSHL 339 (998)
T ss_pred chHHHHHHHHHHhhcccccccccCccccccccccCceeeeecCCCCCCCccceeEEEEEeecCCchhHHHHHHHHHHHHH
Confidence 99999999988777655332211 0 111 1 1 1 122456899999999
Q ss_pred hcCCCCCCCCCCCCCcccHHHHHHHhhCCce--ee--ecee-------eeecCCCCCHHHHHHHHHHHH
Q 044668 233 LEGGGSFSAGGPGKGVYSRLHRRVLNEIPRV--QQ--GVYF-------CGITPGEVNQVQLDRAVQSTN 290 (353)
Q Consensus 233 LGgg~s~~~g~pg~g~~SrL~~~lre~~gl~--Ys--~~y~-------fgl~~~~is~~EL~~aK~~l~ 290 (353)
|-+| .+|.+|+.|.|. |+- .+ +.|. |++--+|++|+.++..++-+-
T Consensus 340 l~~g-----------psSp~yk~LiES-GLGtEfsvnsG~~~~t~~~~fsVGLqGvseediekve~lV~ 396 (998)
T KOG2019|consen 340 LLDG-----------PSSPFYKALIES-GLGTEFSVNSGYEDTTLQPQFSVGLQGVSEEDIEKVEELVM 396 (998)
T ss_pred hcCC-----------CccHHHHHHHHc-CCCcccccCCCCCcccccceeeeeeccccHHHHHHHHHHHH
Confidence 9887 899999999996 442 33 3333 555556788888887765543
No 13
>PF00675 Peptidase_M16: Insulinase (Peptidase family M16) This is family M16 in the peptidase classification. ; InterPro: IPR011765 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. The majority of the sequences in this entry are metallopeptidases and non-peptidase homologs belong to MEROPS peptidase family M16 (clan ME), subfamilies M16A, M16B and M16C; they include: Insulinase, insulin-degrading enzyme (3.4.24.56 from EC) Mitochondrial processing peptidase alpha subunit, (Alpha-MPP, 3.4.24.64 from EC) Pitrlysin, Protease III precursor (3.4.24.55 from EC) Nardilysin, (3.4.24.61 from EC) Ubiquinol-cytochrome C reductase complex core protein I,mitochondrial precursor (1.10.2.2 from EC) Coenzyme PQQ synthesis protein F (3.4.99 from EC) These proteins do not share many regions of sequence similarity; the most noticeable is in the N-terminal section. This region includes a conserved histidine followed, two residues later by a glutamate and another histidine. In pitrilysin, it has been shown [] that this H-x-x-E-H motif is involved in enzymatic activity; the two histidines bind zinc and the glutamate is necessary for catalytic activity. The proteins classified as non-peptidase homologues either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity. ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3P7L_A 3P7O_A 3TUV_A 3GO9_A 1BE3_B 1PP9_B 2A06_B 1SQB_B 1SQP_B 1L0N_B ....
Probab=99.76 E-value=1.5e-18 Score=146.60 Aligned_cols=111 Identities=33% Similarity=0.448 Sum_probs=98.7
Q ss_pred cccccCCCCcccCCCCCceeEEEccccCCCCCCCCcHHHHHHHHhcccCCCCC-------------cCccccCcceeEEE
Q 044668 41 SLDFPLPGICLSHPSSLPDYFYVDCGSIYESPISFVKTDLLERMAFRSTRDRS-------------NIQASPSREQMGCS 107 (353)
Q Consensus 41 ~l~~~~p~~~~~~~~~v~~~~~v~~Gs~~E~~~~~G~ahlle~llf~gt~~~~-------------~~~a~t~~d~t~~~ 107 (353)
|++...|+.++ +++.++|++|+++|++++.|++|+++||+++||.+++ .+++.+++|++.|+
T Consensus 2 V~~~~~~~~~~-----~~~~l~~~~Gs~~e~~~~~G~a~ll~~l~~~gs~~~~~~~l~~~l~~~G~~~~~~t~~d~t~~~ 76 (149)
T PF00675_consen 2 VVLVEDPGSPV-----VSVSLVFKAGSRYEPPGKPGLAHLLEHLLFRGSKKYSSDELQEELESLGASFNASTSRDSTSYS 76 (149)
T ss_dssp EEEEESTTSSE-----EEEEEEES-SGGGSCTTTTTHHHHHHHHTTSBBSSSBHHHHHHHHHHTTCEEEEEEESSEEEEE
T ss_pred EEEEEcCCCCE-----EEEEEEEeeccCCCCCCCCchhhhhhhhcccccchhhhhhhHHHhhhhccccceEecccceEEE
Confidence 34455566666 8999999999999999999999999999999999998 88999999999999
Q ss_pred eecccCCHHHHHHHHHH----------------------HHHHhcChHHHHHHHHHHHhc-CCCCCCCCCCC
Q 044668 108 FAALETYVPEMVELLTD----------------------ISEATRNPQSLLSEAIFSACY-SVVLANPLLAP 156 (353)
Q Consensus 108 ~~~~~~~l~~~l~ll~d----------------------i~~~~~~p~~~~~~~l~~~~f-~~p~g~~~~G~ 156 (353)
+++++++++.+|++|++ +++..++|..++.+.+++.+| +||||+++.||
T Consensus 77 ~~~~~~~~~~~l~~l~~~~~~P~f~~~~~~~~r~~~~~ei~~~~~~~~~~~~~~l~~~~f~~~p~~~~~~~~ 148 (149)
T PF00675_consen 77 ASVLSEDLEKALELLADMLFNPSFDEEEFEREREQILQEIEEIKENPQELAFEKLHSAAFRGHPYGNPLLGP 148 (149)
T ss_dssp EEEEGGGHHHHHHHHHHHHHSBGGCHHHHHHHHHHHHHHHHHHTTHHHHHHHHHHHHHHHTTSGGGSHSS-T
T ss_pred EEEecccchhHHHHHHHHHhCCCCCHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHhccCCCCCCCCCC
Confidence 99999999999999999 445668899999999999999 68999999987
No 14
>PF05193 Peptidase_M16_C: Peptidase M16 inactive domain; InterPro: IPR007863 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. These metallopeptidases belong to MEROPS peptidase family M16 (clan ME). They include proteins, which are classified as non-peptidase homologues either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity. The peptidases in this group of sequences include: Insulinase, insulin-degrading enzyme (3.4.24.56 from EC) Mitochondrial processing peptidase alpha subunit, (Alpha-MPP, 3.4.24.64 from EC) Pitrlysin, Protease III precursor (3.4.24.55 from EC) Nardilysin, (3.4.24.61 from EC) Ubiquinol-cytochrome C reductase complex core protein I,mitochondrial precursor (1.10.2.2 from EC) Coenzyme PQQ synthesis protein F (3.4.99 from EC) These proteins do not share many regions of sequence similarity; the most noticeable is in the N-terminal section. This region includes a conserved histidine followed, two residues later by a glutamate and another histidine. In pitrilysin, it has been shown [] that this H-x-x-E-H motif is involved in enzymatic activity; the two histidines bind zinc and the glutamate is necessary for catalytic activity. The mitochondrial processing peptidase consists of two structurally related domains. One is the active peptidase whereas the other, the C-terminal region, is inactive. The two domains hold the substrate like a clamp [].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis; PDB: 1BE3_B 1PP9_B 2A06_B 1SQB_B 1SQP_B 1L0N_B 1SQX_B 1NU1_B 1L0L_B 2FYU_B ....
Probab=99.67 E-value=4.1e-16 Score=134.23 Aligned_cols=117 Identities=22% Similarity=0.384 Sum_probs=93.3
Q ss_pred ccCHHHHHHHHHhhcCCCCeEEEEEc-cCHHHHHHHHHHhhCCCCCCC---C-CC----CC----c--------------
Q 044668 162 RLNSTLIKEFFAENYTAPWMVLAASG-VEHDQLVSVEEPLLSDLPISI---L-TK----SP----D-------------- 214 (353)
Q Consensus 162 ~it~edL~~f~~~~y~p~n~~lvivG-id~~el~~~v~~~f~~l~~~~---~-~~----~p----~-------------- 214 (353)
++|.++|++||++||+|+||+++++| ++++++.++++++|+.|+... . .. .+ .
T Consensus 1 ~it~e~l~~f~~~~y~p~n~~l~i~Gd~~~~~~~~~i~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (184)
T PF05193_consen 1 NITLEDLRAFYKKFYRPSNMTLVIVGDIDPDELEKLIEKYFGSLPKSSIPPKPKPRSPPLPPSEPQGKEIVIPSKDESQS 80 (184)
T ss_dssp C--HHHHHHHHHHHSSGGGEEEEEEESSGHHHHHHHHHHHHTTSSHSCHGGSSSCSSSSSSCGGSSEEEEEEEESSSSSE
T ss_pred CCCHHHHHHHHHHhcCccceEEEEEcCccHHHHHHHHHhhhhhhcccccccccccccccccccccccccccccccccccc
Confidence 58999999999999999999999999 999999999999999998542 1 11 01 0
Q ss_pred cc-------cC-CCCchHHHHHHHHhhcCCCCCCCCCCCCCcccHHHHHHHhhCCceee-ec----ee----ee------
Q 044668 215 LC-------TL-EDKDAMTLTVTQMLLEGGGSFSAGGPGKGVYSRLHRRVLNEIPRVQQ-GV----YF----CG------ 271 (353)
Q Consensus 215 ~i-------~~-~~~d~~~~~vl~~iLGgg~s~~~g~pg~g~~SrL~~~lre~~gl~Ys-~~----y~----fg------ 271 (353)
.+ .. ++++..++.+++.+||++ +.|+|+.++|++.+++|+ .. +. |+
T Consensus 81 ~v~~~~~~~~~~~~~~~~~~~~l~~~l~~~-----------~~s~l~~~lr~~~~l~y~v~~~~~~~~~~~~~~i~~~~~ 149 (184)
T PF05193_consen 81 IVSIAFPGPPIKDSKDYFALNLLSSLLGNG-----------MSSRLFQELREKQGLAYSVSASNSSYRDSGLFSISFQVT 149 (184)
T ss_dssp EEEEEEEEEETGTSTTHHHHHHHHHHHHCS-----------TTSHHHHHHHTTTTSESEEEEEEEEESSEEEEEEEEEEE
T ss_pred ccccccccccccccchhhHHHHHHHHHhcC-----------ccchhHHHHHhccccceEEEeeeeccccceEEEEEEEcC
Confidence 01 23 668999999999999998 999999999999999998 11 11 22
Q ss_pred -----------------ecCCCCCHHHHHHHHHHH
Q 044668 272 -----------------ITPGEVNQVQLDRAVQST 289 (353)
Q Consensus 272 -----------------l~~~~is~~EL~~aK~~l 289 (353)
+.++|+|++||+++|++|
T Consensus 150 ~~~~~~~~~~~~~~l~~l~~~~~s~~el~~~k~~L 184 (184)
T PF05193_consen 150 PENLDEAIEAILQELKRLREGGISEEELERAKNQL 184 (184)
T ss_dssp GGGHHHHHHHHHHHHHHHHHHCS-HHHHHHHHHHH
T ss_pred cccHHHHHHHHHHHHHHHHHcCCCHHHHHHHHhcC
Confidence 566789999999999876
No 15
>KOG0961 consensus Predicted Zn2+-dependent endopeptidase, insulinase superfamily [Posttranslational modification, protein turnover, chaperones]
Probab=99.60 E-value=4.5e-14 Score=139.17 Aligned_cols=162 Identities=14% Similarity=0.178 Sum_probs=131.0
Q ss_pred ecccCCCCCCCCcccccCCCCcccCCCCCceeEEEccccCCCCCCCCcHHHHHHHHhcccCCCCC--------------c
Q 044668 29 SWLSGEQSTSSPSLDFPLPGICLSHPSSLPDYFYVDCGSIYESPISFVKTDLLERMAFRSTRDRS--------------N 94 (353)
Q Consensus 29 ~~~~~~~~~~~~~l~~~~p~~~~~~~~~v~~~~~v~~Gs~~E~~~~~G~ahlle~llf~gt~~~~--------------~ 94 (353)
-|..+.. ++.|.....|+..+ ...+.|.+ |..+..|+.|-||||.|.|+++|+ .
T Consensus 22 kyis~~T--kl~va~~~~pts~v------hG~f~v~T----Ea~~d~G~PHTLEHL~FMGSKkYP~kGvLd~~anr~l~d 89 (1022)
T KOG0961|consen 22 KYISKNT--KLRVAIGEVPTSMV------HGAFSVVT----EADSDDGLPHTLEHLVFMGSKKYPFKGVLDVIANRCLAD 89 (1022)
T ss_pred EEecccc--ceEEEEeecCCcce------eeeEEeee----eecCCCCCchhHHHHhhhccccCCcccHHHHhhcchhcc
Confidence 6777777 77776665555533 44444433 455678999999999999999998 8
Q ss_pred CccccCcceeEEEeeccc-CCHHHHHHHHHH--------------------------------HHHHhcChHHHHHHHHH
Q 044668 95 IQASPSREQMGCSFAALE-TYVPEMVELLTD--------------------------------ISEATRNPQSLLSEAIF 141 (353)
Q Consensus 95 ~~a~t~~d~t~~~~~~~~-~~l~~~l~ll~d--------------------------------i~~~~~~p~~~~~~~l~ 141 (353)
.||+|+.|+|.|.++... +.+-+.|..+.| ++..+..-..+..+..+
T Consensus 90 tNAwTDtD~T~YtLStag~dGFlklLPvy~dHiL~P~Ltdeaf~TEVyHI~geg~d~GVVySEMq~~es~~~~im~~~~~ 169 (1022)
T KOG0961|consen 90 TNAWTDTDHTAYTLSTAGSDGFLKLLPVYIDHILTPMLTDEAFATEVYHITGEGNDAGVVYSEMQDHESEMESIMDRKTK 169 (1022)
T ss_pred cccccccCcceEEeecccccchHHHhHHHHHhhcCcccchhhhhhheeeecCCCCccceeehhhhhhhcccchhhhhhhh
Confidence 999999999999998765 467788888777 33344556677778888
Q ss_pred HHhc-C-CCCCCCCCCChHhhcccCHHHHHHHHHhhcCCCCeEEEEEc-cCHHHHHHHHHHhhC
Q 044668 142 SACY-S-VVLANPLLAPECAISRLNSTLIKEFFAENYTAPWMVLAASG-VEHDQLVSVEEPLLS 202 (353)
Q Consensus 142 ~~~f-~-~p~g~~~~G~~e~l~~it~edL~~f~~~~y~p~n~~lvivG-id~~el~~~v~~~f~ 202 (353)
+.+| . .+|.....|-..+|..+|.|++++||+++|.++||++.|+| |+++++....+..-.
T Consensus 170 ~~~yP~~sgY~~eTGG~~knLR~lt~ekIR~yHK~~Y~~sN~cviVcG~v~~d~lL~~m~~~~n 233 (1022)
T KOG0961|consen 170 EVIYPPFSGYAVETGGRLKNLRELTLEKIRDYHKKFYHLSNMCVIVCGMVDHDQLLEIMNNVEN 233 (1022)
T ss_pred eeecCCCCCceeccCCChhhHHHhhHHHHHHHHHHhccccceEEEEecCcCHHHHHHHHHHHHh
Confidence 8899 3 48988889999999999999999999999999999999999 999999887765544
No 16
>COG1026 Predicted Zn-dependent peptidases, insulinase-like [General function prediction only]
Probab=98.10 E-value=0.00014 Score=76.46 Aligned_cols=252 Identities=16% Similarity=0.135 Sum_probs=156.7
Q ss_pred ecccCCCCCCCCcccccCCCCcccCCCCCceeEEEccccCCCCCCCCcHHHHHHHHhcccCCCCC---------------
Q 044668 29 SWLSGEQSTSSPSLDFPLPGICLSHPSSLPDYFYVDCGSIYESPISFVKTDLLERMAFRSTRDRS--------------- 93 (353)
Q Consensus 29 ~~~~~~~~~~~~~l~~~~p~~~~~~~~~v~~~~~v~~Gs~~E~~~~~G~ahlle~llf~gt~~~~--------------- 93 (353)
.-+..+..+..+++..+...-.+ +.++++|+. +.-+....+=+.-|...+.--||++++
T Consensus 526 ~~l~~~~~~~~~v~~~~~~tn~i-----~yl~~~~~~-~~l~~~llpyL~L~~~~l~~lgt~~~~y~e~~~~i~~~TGgi 599 (978)
T COG1026 526 TSLETEVSNEAKVLHHDLFTNGI-----TYLRLYFDL-DMLPSELLPYLPLFAFALTNLGTETYSYKELLNQIERHTGGI 599 (978)
T ss_pred cceeeeccCCcceEEeecCCCCe-----EEEEEEeec-CCCChhhhhhHHHHHHHHHhcCCCCcCHHHHHHHHHHHhCCc
Confidence 33555556677888888877777 889999999 444556677777777777778998887
Q ss_pred --cCccccC-------cceeEEEeecccCCHHHHHHHHHH----------------HHHHh--------cChHHHHHHHH
Q 044668 94 --NIQASPS-------REQMGCSFAALETYVPEMVELLTD----------------ISEAT--------RNPQSLLSEAI 140 (353)
Q Consensus 94 --~~~a~t~-------~d~t~~~~~~~~~~l~~~l~ll~d----------------i~~~~--------~~p~~~~~~~l 140 (353)
++++.++ +..+.+++.++.+..+++++++.+ |++.. +.+..+|.-..
T Consensus 600 s~~~~~~~~~~~~~~~~~~~~i~~K~l~~k~~~~~~~i~~~l~~~~F~D~~Rlkell~q~~~~l~~~vr~sG~~~A~~~~ 679 (978)
T COG1026 600 SVSLSVDTDPGDDGEYRPSFSISGKALRSKVEKLFELIREILANTDFHDRERLKELLEQYLSDLTSSVRNSGHSIASSLA 679 (978)
T ss_pred eeeEeeccCCCccccccceEEEEEEehhhhhhHHHHHHHHHHhcCCcCcHHHHHHHHHHHHhhhHHhhhccchHHHHHHh
Confidence 3333333 345788999999999999999988 12211 22333333332
Q ss_pred HHHhc-CCCCCCCCCCC--hHhhcccC-----------HHHHHHHHHhhcCCCCeEEEEEc-cCHHHHHHHHHHhhCCCC
Q 044668 141 FSACY-SVVLANPLLAP--ECAISRLN-----------STLIKEFFAENYTAPWMVLAASG-VEHDQLVSVEEPLLSDLP 205 (353)
Q Consensus 141 ~~~~f-~~p~g~~~~G~--~e~l~~it-----------~edL~~f~~~~y~p~n~~lvivG-id~~el~~~v~~~f~~l~ 205 (353)
...++ ...+.....|- .+-|..+. .+.|++.+++.+..+|+-+++.| ++ ++.+.+++-|-++.
T Consensus 680 ~s~~~~~~~l~e~~~Gl~q~k~i~~l~~~~~~~~~~ei~~kL~~l~~~i~~~~n~~i~i~~~~~--~~~~~~e~~l~~~~ 757 (978)
T COG1026 680 NSRLSSAGALKELLNGLSQVKFLRELSSNFEENFEKEIADKLQALRKKIFQTNNLRIAIIGDID--KILDLLENPLLKFL 757 (978)
T ss_pred hcccccchhHHHHhcChhHHHHHHHHHHhhcccccHHHHHHHHHHHHHHhhcCceEEEEecChh--hhHHHHHHHhhhhh
Confidence 22222 11111111111 11122221 35778889999999999888888 53 33344444443332
Q ss_pred CC-----CCCC---CC---------c------------------cccCCCCchHHHHHHHHhhcCCCCCCCCCCCCCccc
Q 044668 206 IS-----ILTK---SP---------D------------------LCTLEDKDAMTLTVTQMLLEGGGSFSAGGPGKGVYS 250 (353)
Q Consensus 206 ~~-----~~~~---~p---------~------------------~i~~~~~d~~~~~vl~~iLGgg~s~~~g~pg~g~~S 250 (353)
.. ..+. .+ + ++...|||+.++.|+.++|+-+
T Consensus 758 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ii~~p~a~~~l~fs~~~~~y~hpd~~~l~vls~~L~~~-------------- 823 (978)
T COG1026 758 EHLLPGFELPTPPKNPHLDLISSLSEATIIPSPVAYNALAFSIGGLPYTHPDYAALQVLSEYLGSG-------------- 823 (978)
T ss_pred cccCcccccCCCCCCcchhhhccccceEEeccHHHHHHHhhhccCCCCCCccchHHHHHHHHhccc--------------
Confidence 10 0000 00 0 1234679999999999999744
Q ss_pred HHHHHHHhhCCceee--ecee-----ee----------------------ecCCCCCHHHHHHHHHHHHHHHHHhccCHH
Q 044668 251 RLHRRVLNEIPRVQQ--GVYF-----CG----------------------ITPGEVNQVQLDRAVQSTNSAILMNLESRI 301 (353)
Q Consensus 251 rL~~~lre~~gl~Ys--~~y~-----fg----------------------l~~~~is~~EL~~aK~~l~~~~~~~~es~~ 301 (353)
.|..+||++ |+||+ .++. |+ +..+.++++|++.|+--+++.+.. .+++.
T Consensus 824 ~lw~~IR~~-GGAYGa~as~~~~~G~f~f~sYRDPn~~kt~~v~~~~v~~l~s~~~~~~d~~~~ilg~i~~~d~-p~sp~ 901 (978)
T COG1026 824 YLWNKIREK-GGAYGASASIDANRGVFSFASYRDPNILKTYKVFRKSVKDLASGNFDERDLEEAILGIISTLDT-PESPA 901 (978)
T ss_pred hhHHHHHhh-ccccccccccccCCCeEEEEecCCCcHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHHhhccccc-ccCCc
Confidence 599999996 99999 2222 11 567799999999999888876644 45554
Q ss_pred HHH
Q 044668 302 VVS 304 (353)
Q Consensus 302 ~~a 304 (353)
..+
T Consensus 902 ~~~ 904 (978)
T COG1026 902 SEG 904 (978)
T ss_pred cee
Confidence 444
No 17
>PTZ00432 falcilysin; Provisional
Probab=97.83 E-value=0.0005 Score=75.70 Aligned_cols=276 Identities=13% Similarity=0.031 Sum_probs=160.2
Q ss_pred CCCCcccccCCCCcccCCCCCceeEEEccccCCCCCCCCcHHHHHHHHh-cccCCCCC-------------cCc----cc
Q 044668 37 TSSPSLDFPLPGICLSHPSSLPDYFYVDCGSIYESPISFVKTDLLERMA-FRSTRDRS-------------NIQ----AS 98 (353)
Q Consensus 37 ~~~~~l~~~~p~~~~~~~~~v~~~~~v~~Gs~~E~~~~~G~ahlle~ll-f~gt~~~~-------------~~~----a~ 98 (353)
+++|++.+++|.-.+ +.+.++|+.....+ +..-...|+..++ --||++++ .++ ..
T Consensus 667 ~~~~~~~~~~~TnGi-----~y~~~~fdl~~l~~--e~~~yl~L~~~~l~~~gT~~~s~~el~~~i~~~tGg~~~~~~~~ 739 (1119)
T PTZ00432 667 GSVTVLVHPIESRGI-----LYLDFAFSLDSLTV--DELKYLNLFKALLKENGTDKLSSEEFTYKREKNLGGLSASTAFY 739 (1119)
T ss_pred CCcceEEEecCCCCe-----EEEEEEecCCCCCH--HHHhhHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCCeEEEEEEe
Confidence 477888888887777 89999998886543 3334445555444 44888887 222 11
Q ss_pred cC------------cceeEEEeecccCCHHHHHHHHHH----------------HHHHh--------cChHHHHHHHHHH
Q 044668 99 PS------------REQMGCSFAALETYVPEMVELLTD----------------ISEAT--------RNPQSLLSEAIFS 142 (353)
Q Consensus 99 t~------------~d~t~~~~~~~~~~l~~~l~ll~d----------------i~~~~--------~~p~~~~~~~l~~ 142 (353)
++ ...+.+++.++.++++++++++.+ +++.. ++...++...+..
T Consensus 740 ~~~~~~~~~~~~~~~~~~~v~~k~l~~~~~~~~~l~~eil~~~~f~d~~rl~~il~~~~~~~~~~~~~~Gh~~A~~~~~s 819 (1119)
T PTZ00432 740 SETNNLTYDDPYNGVGYLNVRAKVLKHKVNEMVDIVLEALKDADFSNSKKGVEILKRKINGMKTVFSSKGHKFALKRMKS 819 (1119)
T ss_pred ccccccccCcccccceEEEEEEEEhhhhHHHHHHHHHHHHhcCCCCcHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHh
Confidence 22 346888999999999999999998 11111 1222222211111
Q ss_pred HhcC-CCCCCC-CCC--ChHhhcccC-----------HHHHHHHHHhhcCCCCeEEEEEc-c-CHHHHHHHHHHhhCCCC
Q 044668 143 ACYS-VVLANP-LLA--PECAISRLN-----------STLIKEFFAENYTAPWMVLAASG-V-EHDQLVSVEEPLLSDLP 205 (353)
Q Consensus 143 ~~f~-~p~g~~-~~G--~~e~l~~it-----------~edL~~f~~~~y~p~n~~lvivG-i-d~~el~~~v~~~f~~l~ 205 (353)
-+. ..+-.. ..| -..-|+.+. .+.|.+.+++.++++|+.+.++| . ..+.+.+.+..++..++
T Consensus 820 -~~S~~~~~~e~~~G~~~~~fl~~l~~~~~e~~~~~v~~~L~~i~~~i~~~~~l~~~vt~~~~~~~~~~~~~~~~~~~l~ 898 (1119)
T PTZ00432 820 -KFSVSDYADELVNGYSQLLFLKETLVPLAEKDWSKVESKLNEIRNKLLSMKNLTVNVTGDSELLDSLLDDSTTFLKKLS 898 (1119)
T ss_pred -cCCHHHHHHHHhcCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHhCcCCcEEEEEeCHHHHHHHHHHHHHHHHhcc
Confidence 000 000000 111 111111111 34577888899999999999999 5 44566665555555553
Q ss_pred CC----C-----C--C--------CCC--c--------c---c-------cCCCCchHHHHHHHHhhcCCCCCCCCCCCC
Q 044668 206 IS----I-----L--T--------KSP--D--------L---C-------TLEDKDAMTLTVTQMLLEGGGSFSAGGPGK 246 (353)
Q Consensus 206 ~~----~-----~--~--------~~p--~--------~---i-------~~~~~d~~~~~vl~~iLGgg~s~~~g~pg~ 246 (353)
.. . . . ..+ + . + ...+++..++.|+..+|..
T Consensus 899 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~p~~V~yv~~~~~~~~~~~~~~~~l~Vl~~~L~~----------- 967 (1119)
T PTZ00432 899 STFKENDNKSSDKVWVKEVLDKKLMESVDKNEFIVLPTRVNFVGMGGKLFDKSDKVDGSFQVIVHYLKN----------- 967 (1119)
T ss_pred cccccccccccccccccccccccccCCcccceEEEccCceeEEEEecccccCCCccCHHHHHHHHHHcc-----------
Confidence 10 1 0 0 001 1 1 1 1235678999999999953
Q ss_pred CcccHHHHHHHhhCCceee--ecee------ee--------------------ecC--CCCCHHHHHHHHHHHHHHHHHh
Q 044668 247 GVYSRLHRRVLNEIPRVQQ--GVYF------CG--------------------ITP--GEVNQVQLDRAVQSTNSAILMN 296 (353)
Q Consensus 247 g~~SrL~~~lre~~gl~Ys--~~y~------fg--------------------l~~--~~is~~EL~~aK~~l~~~~~~~ 296 (353)
+.|+++||++ |.||+ ..+. |. +.. ..+|++||+++|-..++.+-.
T Consensus 968 ---~yLw~~IR~~-GGAYG~~~~~~~~G~~~f~SYRDPn~~~Tl~~f~~~~~~l~~~~~~~~~~~l~~~iig~~~~~D~- 1042 (1119)
T PTZ00432 968 ---SYLWKTVRMS-LGAYGVFADLLYTGHVIFMSYADPNFEKTLEVYKEVASALREAAETLTDKDLLRYKIGKISNIDK- 1042 (1119)
T ss_pred ---ccchHHHccc-CCccccCCccCCCCeEEEEEecCCCHHHHHHHHHHHHHHHHhcCCCCCHHHHHHHHHHHHhccCC-
Confidence 4599999996 99999 1111 11 445 569999999998888776644
Q ss_pred ccCHHHHHHHHHHHHHHcCCCcc----------------------ccc--CCCceEEEEcCCCCC
Q 044668 297 LESRIVVSEDIDRQVQTHGEMKP----------------------KLI--LSPLTMASYGDVINV 337 (353)
Q Consensus 297 ~es~~~~a~~l~~~~~~~g~~~~----------------------~~l--~~~~~~~~vG~~~~~ 337 (353)
-.++...+.......+ .|.... +++ ...-.++++|+-..+
T Consensus 1043 p~~p~~~g~~~~~~~l-~g~t~e~rq~~R~~il~~t~edi~~~a~~~~~~~~~~~~~v~g~~~~~ 1106 (1119)
T PTZ00432 1043 PLHVDELSKLALLRII-RNESDEDRQKFRKDILETTKEDFYRLADLMEKSKEWEKVIAVVNSKTS 1106 (1119)
T ss_pred CCChHHHHHHHHHHHH-cCCCHHHHHHHHHHHHcCCHHHHHHHHHHHHhhhccCeEEEEECHHHh
Confidence 2344444433332222 454433 333 245678888875543
No 18
>COG1025 Ptr Secreted/periplasmic Zn-dependent peptidases, insulinase-like [Posttranslational modification, protein turnover, chaperones]
Probab=97.43 E-value=0.013 Score=61.91 Aligned_cols=241 Identities=11% Similarity=0.062 Sum_probs=153.8
Q ss_pred CceeEEEccccCCCCCCCCcHHHHHHHHhcccCCCCC--------cCccccCcceeEEEeecccCCHHHHHHHHHH-HHH
Q 044668 57 LPDYFYVDCGSIYESPISFVKTDLLERMAFRSTRDRS--------NIQASPSREQMGCSFAALETYVPEMVELLTD-ISE 127 (353)
Q Consensus 57 v~~~~~v~~Gs~~E~~~~~G~ahlle~llf~gt~~~~--------~~~a~t~~d~t~~~~~~~~~~l~~~l~ll~d-i~~ 127 (353)
..+.+.++.--....+...=+..|...+++....++. +++...+.+...+++++.++.++.++..+.+ +..
T Consensus 527 ~~v~~~irsp~~~~s~r~~Vl~~l~~~la~dal~~~~y~A~~aG~sfs~~~~~~Gl~ltisGft~~lp~L~~~~l~~l~~ 606 (937)
T COG1025 527 ASVSLAIRSPHASRSPRNQVLTELYAYLANDALDKLSYQASLAGLSFSLAANSNGLDLTISGFTQRLPQLLRAFLDGLFS 606 (937)
T ss_pred ceeEEEEeCcccccCHHHHHHHHHHHHHHHHHHHhhhhHHHhcceEEEeecCCCceEEEeeccccchHHHHHHHHHHHhc
Confidence 5667777764444444444455556666666555544 5666666688899999999999988877776 111
Q ss_pred ----------------------HhcChHHHHHHHHHHHhcCCCCCCCCCCChHhhcccCHHHHHHHHHhhcCCCCeEEEE
Q 044668 128 ----------------------ATRNPQSLLSEAIFSACYSVVLANPLLAPECAISRLNSTLIKEFFAENYTAPWMVLAA 185 (353)
Q Consensus 128 ----------------------~~~~p~~~~~~~l~~~~f~~p~g~~~~G~~e~l~~it~edL~~f~~~~y~p~n~~lvi 185 (353)
....|...+.+.+.-..- +.-.+.---.+.++.++.+++..|....+.+....+.+
T Consensus 607 ~~~~~~~f~~~K~~~~~~~~~a~~~~p~~~~~~~l~~l~~--~~~~s~~e~~~~l~~v~~~e~~~f~~~l~~~~~lE~lv 684 (937)
T COG1025 607 LPVDEDRFEQAKSQLSEELKNALTGKPYRQALDGLTGLLQ--VPYWSREERRNALESVSVEEFAAFRDTLLNGVHLEMLV 684 (937)
T ss_pred CCCCHHHHHHHHHHHHHHHHhhhhcCCHHHHHHHhhhhhC--CCCcCHHHHHHHhhhccHHHHHHHHHHhhhccceeeee
Confidence 113454444444432221 11111111246678999999999999999999999999
Q ss_pred Ec-cCHHHHHHHHHHhhCCCCCCCC---CCC----Cc---c---------------c-cCCCCchHHHHHHHHhhcCCCC
Q 044668 186 SG-VEHDQLVSVEEPLLSDLPISIL---TKS----PD---L---------------C-TLEDKDAMTLTVTQMLLEGGGS 238 (353)
Q Consensus 186 vG-id~~el~~~v~~~f~~l~~~~~---~~~----p~---~---------------i-~~~~~d~~~~~vl~~iLGgg~s 238 (353)
+| +..+++.++++..-..++.... ..+ ++ + + .....+.+...++..+++--
T Consensus 685 ~Gn~~~~da~~l~~~~~~~l~~~~s~~~~~~~~~~~~~~~~~~e~~~~~~~~an~~i~~~~~~~~~~~~a~s~Ll~~l-- 762 (937)
T COG1025 685 LGNLTEADATNLAETLQKKLPAIGSTWYRNPSVYLLKGGTRIFETVGGESDSANAAILYPQQYDEIKSSALSSLLGQL-- 762 (937)
T ss_pred eccchHHHHHHHHHHHHhhhcccCCcccCCCceeccCCCeeEeeeccCCcccccceeEeccccchHHHHHHHHHHHHH--
Confidence 99 9999988888755443432221 000 11 0 1 11112223344444454444
Q ss_pred CCCCCCCCCcccHHHHHHHhhCCceee--ecee-----ee---------------------------ecCCCCCHHHHHH
Q 044668 239 FSAGGPGKGVYSRLHRRVLNEIPRVQQ--GVYF-----CG---------------------------ITPGEVNQVQLDR 284 (353)
Q Consensus 239 ~~~g~pg~g~~SrL~~~lre~~gl~Ys--~~y~-----fg---------------------------l~~~~is~~EL~~ 284 (353)
+.-..|..||.|..|=|- +.+. .| -.-.++++++++.
T Consensus 763 ---------~~~~ff~~LRTkeQLGY~Vfs~~~~v~~~~gi~f~vqS~~~~p~~L~~r~~~F~~~~~~~l~~ms~e~Fe~ 833 (937)
T COG1025 763 ---------IHPWFFDQLRTKEQLGYAVFSGPREVGRTPGIGFLVQSNSKSPSYLLERINAFLETAEPELREMSEEDFEQ 833 (937)
T ss_pred ---------HhHHhHHHhhhhhhcceEEEecceeecCccceEEEEeCCCCChHHHHHHHHHHHHHHHHHHHhCCHHHHHH
Confidence 677899999999887775 2221 11 1234689999999
Q ss_pred HHHHHHHHHHHhccCHHHHHHHHHHH
Q 044668 285 AVQSTNSAILMNLESRIVVSEDIDRQ 310 (353)
Q Consensus 285 aK~~l~~~~~~~~es~~~~a~~l~~~ 310 (353)
.|..++++++....+....+..+...
T Consensus 834 ~k~alin~il~~~~nl~e~a~r~~~~ 859 (937)
T COG1025 834 IKKALINQILQPPQNLAEEASRLWKA 859 (937)
T ss_pred HHHHHHHHHHccCCCHHHHHHHHHHH
Confidence 99999999999888888888888733
No 19
>KOG2019 consensus Metalloendoprotease HMP1 (insulinase superfamily) [General function prediction only; Posttranslational modification, protein turnover, chaperones]
Probab=97.25 E-value=0.0067 Score=61.66 Aligned_cols=236 Identities=17% Similarity=0.111 Sum_probs=142.0
Q ss_pred CCCCcccccCCCCcccCCCCCceeEEEccccCCCCCCCCcHHHHHHHHhcccCCCCC---------------cCc--ccc
Q 044668 37 TSSPSLDFPLPGICLSHPSSLPDYFYVDCGSIYESPISFVKTDLLERMAFRSTRDRS---------------NIQ--ASP 99 (353)
Q Consensus 37 ~~~~~l~~~~p~~~~~~~~~v~~~~~v~~Gs~~E~~~~~G~ahlle~llf~gt~~~~---------------~~~--a~t 99 (353)
||+++..+.++.-.+ +.+++++.-|+.-| .=.+=+.-|++.++-.||...+ ++. +.+
T Consensus 568 ngvkv~~~dl~tngi-----~Y~r~~~~l~~~p~-eL~PylPlfc~sll~lGt~~lsf~el~qqI~rkTGGiS~~p~~~s 641 (998)
T KOG2019|consen 568 NGVKVQRCDLFTNGI-----TYTRVVFDLNSLPE-ELLPYLPLFCQSLLNLGTGDLSFVELEQQIGRKTGGISVSPLVSS 641 (998)
T ss_pred cCceeEEeeccCCce-----EEEEEeeccccCcH-HhhcchHHHHHHHHhcCCCcccHHHHHHHhhhhcCceeecceecc
Confidence 567777777766656 78888888888655 2345577888999999998877 111 111
Q ss_pred Ccc------eeEEEeecccCCHHHHHHHHHHH-HHH---hcChHHHHHHHHHHHh----c--CCCCCC-------CCCCC
Q 044668 100 SRE------QMGCSFAALETYVPEMVELLTDI-SEA---TRNPQSLLSEAIFSAC----Y--SVVLAN-------PLLAP 156 (353)
Q Consensus 100 ~~d------~t~~~~~~~~~~l~~~l~ll~di-~~~---~~~p~~~~~~~l~~~~----f--~~p~g~-------~~~G~ 156 (353)
+.+ .+.++..++..+.+.++++...+ .+. +++.-.++.......+ - +|.|+. ..-|.
T Consensus 642 ~~~~d~p~~~i~~~~~~l~rn~~dlfel~n~il~e~~f~n~dkfkvlvk~s~s~~~n~i~dsGH~~A~~rs~a~l~~ag~ 721 (998)
T KOG2019|consen 642 DDGMDEPELGIVFSGSMLDRNADDLFELWNKILQETCFTNQDKFKVLVKQSASRMTNGIADSGHGFAAARSAAMLTPAGW 721 (998)
T ss_pred CCCCCccceeEEechhhhcCChhHHHHHHHHHhcccCcccHHHHHHHHHHHHHHhhccCCcccchhHhhhhhcccCcccc
Confidence 111 24567788888999999988871 110 0111111111100000 0 111111 00111
Q ss_pred -hHhhcccC-------------------HHHHHHHHHhhcCCCCeEEEEEc--cCHHHHHHHHHHhhCCCCCCCCCCC--
Q 044668 157 -ECAISRLN-------------------STLIKEFFAENYTAPWMVLAASG--VEHDQLVSVEEPLLSDLPISILTKS-- 212 (353)
Q Consensus 157 -~e~l~~it-------------------~edL~~f~~~~y~p~n~~lvivG--id~~el~~~v~~~f~~l~~~~~~~~-- 212 (353)
.|++..++ .+.|.+..+-....+||.+.|.. ..+..+++.|++++..+|...+...
T Consensus 722 i~EqlgGl~ql~fl~~L~~~~d~d~~~i~~kL~eIrk~ll~~ng~~~~itAd~~q~~~vEkav~kFl~~lp~e~p~g~~s 801 (998)
T KOG2019|consen 722 ISEQLGGLSQLEFLHRLEEKVDNDWEPIVSKLTEIRKSLLNTNGMIVNITADPKQLTNVEKAVEKFLDSLPRENPSGSKS 801 (998)
T ss_pred hHhHhcchHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHhcCCCeEEEEecCcccchhHHHHHHHHHHhccccCCCCCcc
Confidence 12222211 12344444555688999999887 7888999999999887773221110
Q ss_pred ------Cc-----c-----------------ccCCCCchHHHHHHHHhhcCCCCCCCCCCCCCcccHHHHHHHhhCCcee
Q 044668 213 ------PD-----L-----------------CTLEDKDAMTLTVTQMLLEGGGSFSAGGPGKGVYSRLHRRVLNEIPRVQ 264 (353)
Q Consensus 213 ------p~-----~-----------------i~~~~~d~~~~~vl~~iLGgg~s~~~g~pg~g~~SrL~~~lre~~gl~Y 264 (353)
|- + +...+||-.+++|++.+| ....|..+|||+ |.+|
T Consensus 802 t~d~r~p~~~~~i~~~~P~fqvnyvgka~~~vpyt~~d~asl~vlS~~l--------------t~k~Lh~evRek-GGAY 866 (998)
T KOG2019|consen 802 TWDARLPLRSEAIRVVIPTFQVNYVGKAGLGVPYTHPDGASLQVLSKLL--------------TNKWLHDEVREK-GGAY 866 (998)
T ss_pred CccccCCCCceeEEEeccccchhhhhhhcccccCCCCCCcHHHHHHHHH--------------HHHHHHHHHHHh-cCcc
Confidence 10 0 145679999999999997 456799999997 7888
Q ss_pred e--ecee----------------------ee-----ecCCCCCHHHHHHHHHHHHHHH
Q 044668 265 Q--GVYF----------------------CG-----ITPGEVNQVQLDRAVQSTNSAI 293 (353)
Q Consensus 265 s--~~y~----------------------fg-----l~~~~is~~EL~~aK~~l~~~~ 293 (353)
+ -+|+ |+ +....+++++|+.||-...+..
T Consensus 867 Ggg~s~~sh~GvfSf~SYRDpn~lktL~~f~~tgd~~~~~~~~~~dldeAkl~~f~~V 924 (998)
T KOG2019|consen 867 GGGCSYSSHSGVFSFYSYRDPNPLKTLDIFDGTGDFLRGLDVDQQDLDEAKLGTFGDV 924 (998)
T ss_pred CCccccccccceEEEEeccCCchhhHHHhhcchhhhhhcCCccccchhhhhhhhcccc
Confidence 7 1121 22 5566689999999987776554
No 20
>KOG0959 consensus N-arginine dibasic convertase NRD1 and related Zn2+-dependent endopeptidases, insulinase superfamily [Posttranslational modification, protein turnover, chaperones]
Probab=96.55 E-value=0.16 Score=54.69 Aligned_cols=165 Identities=15% Similarity=0.186 Sum_probs=111.0
Q ss_pred CCCCceecccCCCCCCCCcccccCCCCcccCCCCCceeEEEccccCCCCCCCCcHHHHHHHHhcccCCCCC--------c
Q 044668 23 SSGGFYSWLSGEQSTSSPSLDFPLPGICLSHPSSLPDYFYVDCGSIYESPISFVKTDLLERMAFRSTRDRS--------N 94 (353)
Q Consensus 23 ~~~~~~~~~~~~~~~~~~~l~~~~p~~~~~~~~~v~~~~~v~~Gs~~E~~~~~G~ahlle~llf~gt~~~~--------~ 94 (353)
..+.-+.|+..+....+| + ..+.+.+.+=-....+...+++++...++.--...+. .
T Consensus 515 ~~~~~~lw~k~dd~f~~P--------------k-a~~~~~~~~p~~~~~~~~~~l~~l~~~~l~d~l~E~~Y~A~~aGl~ 579 (974)
T KOG0959|consen 515 DTPFSELWYKQDDKFNVP--------------K-AYTKFDFICPGATQSPLNSVLSTLYVRLLKDQLNEYLYPALLAGLT 579 (974)
T ss_pred cCCcceeEEecccccccc--------------h-hheeeeecCcccccCHHHHHHHHHHHHHHHHHHhHHHHHHHhccce
Confidence 345667888877733322 1 3556666664444556778899998888766554433 5
Q ss_pred CccccCcceeEEEeecccCCHHHHHHHHHH--------------H--------HH-HhcChHHHHHHHHHHHhcCCCCCC
Q 044668 95 IQASPSREQMGCSFAALETYVPEMVELLTD--------------I--------SE-ATRNPQSLLSEAIFSACYSVVLAN 151 (353)
Q Consensus 95 ~~a~t~~d~t~~~~~~~~~~l~~~l~ll~d--------------i--------~~-~~~~p~~~~~~~l~~~~f~~p~g~ 151 (353)
+....+.......+.+..+.+..+++.+.+ + +. ..++|...+.+.++-.+=+.-+..
T Consensus 580 ~~~~~s~~G~~~~v~Gfnekl~~ll~~~~~~~~~f~~~~~rf~iike~~~~~~~n~~~~~p~~~a~~~~~lll~~~~W~~ 659 (974)
T KOG0959|consen 580 YSLSSSSKGVELRVSGFNEKLPLLLEKVVQMMANFELDEDRFEIIKELLKRELRNHAFDNPYQLANDYLLLLLEESIWSK 659 (974)
T ss_pred EEeeecCCceEEEEeccCcccHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHhhhhhccHHHHHHHHHHHHhhccccch
Confidence 666666677788899999999888888877 1 11 224555555544433221222211
Q ss_pred CCCCChHhhcccCHHHHHHHHHhhcCCCCeEEEEEc-cCHHHHHHHHHHhhCCC
Q 044668 152 PLLAPECAISRLNSTLIKEFFAENYTAPWMVLAASG-VEHDQLVSVEEPLLSDL 204 (353)
Q Consensus 152 ~~~G~~e~l~~it~edL~~f~~~~y~p~n~~lvivG-id~~el~~~v~~~f~~l 204 (353)
.--.+.++.++.+|+..|-..++.+--+...|+| ++.+++.++++..+..+
T Consensus 660 --~e~~~al~~~~le~~~~F~~~~~~~~~~e~~i~GN~te~~A~~l~~~v~d~l 711 (974)
T KOG0959|consen 660 --EELLEALDDVTLEDLESFISEFLQPFHLELLIHGNLTEKEALQLLKSVLDIL 711 (974)
T ss_pred --HHHHHHhhcccHHHHHHHHHHHhhhhheEEEEecCcchHHHHHHHHHHHhhh
Confidence 1123566789999999999999999999999999 99999999876665544
No 21
>PF03410 Peptidase_M44: Protein G1; InterPro: IPR005072 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M44 (clan ME). The active site residues for members of this family and family M16 occur in the motif HXXEHProtein. The type example is the vaccinia virus-type metalloendopeptidase G1 from vaccinia virus, it is a metalloendopeptidase expressed by many Poxviridae which appears to play a role in the maturation of viral proteins.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0019067 viral assembly, maturation, egress, and release
Probab=96.04 E-value=0.055 Score=52.99 Aligned_cols=148 Identities=14% Similarity=0.090 Sum_probs=82.9
Q ss_pred CCCceeEEEccccCCCCCCCCcHHHHHHHHhcccCCCCCcCccccCcceeEEEeecccCC-HHHHHHHHHH---------
Q 044668 55 SSLPDYFYVDCGSIYESPISFVKTDLLERMAFRSTRDRSNIQASPSREQMGCSFAALETY-VPEMVELLTD--------- 124 (353)
Q Consensus 55 ~~v~~~~~v~~Gs~~E~~~~~G~ahlle~llf~gt~~~~~~~a~t~~d~t~~~~~~~~~~-l~~~l~ll~d--------- 124 (353)
+++.+++ -+.|--.|-++-.|+||||||.+-.=...+=..||+|+|.++.|=+...+.. -..++.-|..
T Consensus 18 KDIYlGI-s~FGFe~DI~~iLGiAHLLEHILIsFD~~~F~ANASTaRsYMSFWC~si~g~~~~DAvrtliSWFF~~g~Lk 96 (590)
T PF03410_consen 18 KDIYLGI-SNFGFENDIGEILGIAHLLEHILISFDSSKFLANASTARSYMSFWCKSIRGRTYIDAVRTLISWFFDNGKLK 96 (590)
T ss_pred cceEEee-cccccccchHHHHhHHHHHHHHeeecchHHhhcccchhhhhhhhhhhhccCCChhHHHHHHHHHhhcCCccc
Confidence 3456664 3456666777899999999999843211111679999999999877666543 2333333332
Q ss_pred -----------HHHHhcChHHHHHHH---HHHHhc--CCCCCCCCCCChHhhcccC--HHHHHHHHHhhcCCCCeEEEEE
Q 044668 125 -----------ISEATRNPQSLLSEA---IFSACY--SVVLANPLLAPECAISRLN--STLIKEFFAENYTAPWMVLAAS 186 (353)
Q Consensus 125 -----------i~~~~~~p~~~~~~~---l~~~~f--~~p~g~~~~G~~e~l~~it--~edL~~f~~~~y~p~n~~lviv 186 (353)
+++.++.- ..-.|. +.-..| ++.+- ..|....+++++ ++.|.+-.++. ...|++|.+=
T Consensus 97 ~~F~~~~i~~hikELENEY-YFRnEvfHCmDvLtfL~gGDLY--NGGRi~ML~~l~~i~~mL~~RM~~I-~GpniVIFVk 172 (590)
T PF03410_consen 97 DNFSRSKIKNHIKELENEY-YFRNEVFHCMDVLTFLGGGDLY--NGGRIDMLNNLNDIRNMLSNRMHRI-IGPNIVIFVK 172 (590)
T ss_pred ccccHhHHHHHHHHHhhhh-hhhhhHHHHHHHHHHhcCCccc--CCchHHHHhhhHHHHHHHHHHHHhh-cCCcEEEEEe
Confidence 44544321 111122 222334 33221 124444555544 23444555444 3345555444
Q ss_pred ccCHHHHHHHHHHhhCCCCCCC
Q 044668 187 GVEHDQLVSVEEPLLSDLPISI 208 (353)
Q Consensus 187 Gid~~el~~~v~~~f~~l~~~~ 208 (353)
-+++. +..++++.||.+|+-+
T Consensus 173 ~l~~~-~l~lL~~TFGtLP~cP 193 (590)
T PF03410_consen 173 ELNPN-ILSLLSNTFGTLPSCP 193 (590)
T ss_pred ccCHH-HHHHHHHhcCCCCCCc
Confidence 48865 5589999999998754
No 22
>PHA03081 putative metalloprotease; Provisional
Probab=94.64 E-value=0.21 Score=49.05 Aligned_cols=148 Identities=14% Similarity=0.086 Sum_probs=81.9
Q ss_pred CCCceeEEEccccCCCCCCCCcHHHHHHHHhcccCCCCCcCccccCcceeEEEeecccC-CHHHHHHHHHH---------
Q 044668 55 SSLPDYFYVDCGSIYESPISFVKTDLLERMAFRSTRDRSNIQASPSREQMGCSFAALET-YVPEMVELLTD--------- 124 (353)
Q Consensus 55 ~~v~~~~~v~~Gs~~E~~~~~G~ahlle~llf~gt~~~~~~~a~t~~d~t~~~~~~~~~-~l~~~l~ll~d--------- 124 (353)
+++.+++ -+.|--.|-.+-.|+|||+||.+-.=...+=..||+|.+.++.|-+..... ....++.-+..
T Consensus 18 kdiy~gi-~~fgfe~di~~~lg~ahllehili~fd~~~f~anast~r~ymsfwc~sirg~~y~DAvrtliSWFF~~~~Lr 96 (595)
T PHA03081 18 KDIYLGI-SNFGFENDIGEILGIAHLLEHILISFDSSKFVANASTARSYMSFWCKSIRGRSYIDAIRTLISWFFDNGKLK 96 (595)
T ss_pred cceEEee-cccccccchHHHHhHHHHHHHHeeecchHHhcccchhhhhhHhHhhHhhcCCchHHHHHHHHHHhccCCccc
Confidence 3345553 345666677788999999999984322111167899999999887665553 22334433333
Q ss_pred -----------HHHHhcChHHHHHHH---HHHHhc--CCCCCCCCCCChHhhcccC--HHHHHHHHHhhcCCCCeEEEEE
Q 044668 125 -----------ISEATRNPQSLLSEA---IFSACY--SVVLANPLLAPECAISRLN--STLIKEFFAENYTAPWMVLAAS 186 (353)
Q Consensus 125 -----------i~~~~~~p~~~~~~~---l~~~~f--~~p~g~~~~G~~e~l~~it--~edL~~f~~~~y~p~n~~lviv 186 (353)
+++.++.- ..-.|. +.-..| ++.+- ..|....|++++ ++.|.+-.++. ...|++|.+=
T Consensus 97 ~~F~~~~ik~~ikELENEY-YFRnEvfHCmDvLTfL~gGDLY--NGGRi~ML~~l~~i~~~L~~RM~~I-~GpniVIFVk 172 (595)
T PHA03081 97 DNFSLSKIRNHIKELENEY-YFRNEVFHCMDVLTFLGGGDLY--NGGRIDMLDNLNDVRDMLSNRMHRI-SGPNIVIFVK 172 (595)
T ss_pred cccchhhHHHHHHHHhhhh-hhhhhhHHHHHHHHHhcCCccc--CCchHHHHhhhHHHHHHHHHHHHhh-cCCcEEEEEe
Confidence 44444321 111122 222234 33221 134455555544 23444444443 3445555444
Q ss_pred ccCHHHHHHHHHHhhCCCCCCC
Q 044668 187 GVEHDQLVSVEEPLLSDLPISI 208 (353)
Q Consensus 187 Gid~~el~~~v~~~f~~l~~~~ 208 (353)
-+++. +..++++.||.+|.-+
T Consensus 173 ~ln~~-~l~lL~~TFGtLP~~P 193 (595)
T PHA03081 173 ELNPN-TLSLLNNTFGTLPSCP 193 (595)
T ss_pred ccCHH-HHHHHHHhcCCCCCCc
Confidence 48865 4589999999998765
No 23
>COG0612 PqqL Predicted Zn-dependent peptidases [General function prediction only]
Probab=91.57 E-value=0.5 Score=46.81 Aligned_cols=76 Identities=9% Similarity=-0.031 Sum_probs=58.3
Q ss_pred ecCCCCCHHHHHHHHHHHHHHHHHhccCHHHHHHHHHHHHHHcCCCcc-----------------------ccc-CCCce
Q 044668 272 ITPGEVNQVQLDRAVQSTNSAILMNLESRIVVSEDIDRQVQTHGEMKP-----------------------KLI-LSPLT 327 (353)
Q Consensus 272 l~~~~is~~EL~~aK~~l~~~~~~~~es~~~~a~~l~~~~~~~g~~~~-----------------------~~l-~~~~~ 327 (353)
+..--++++++++.|..++..+.+..+++..++.......++.+.+.. +++ +++++
T Consensus 121 l~~p~f~~~~~e~Ek~vil~ei~~~~d~p~~~~~~~l~~~~~~~~p~~~~~~G~~e~I~~it~~dl~~f~~k~Y~p~n~~ 200 (438)
T COG0612 121 LLNPTFDEEEVEREKGVILEEIRMRQDDPDDLAFERLLEALYGNHPLGRPILGTEESIEAITREDLKDFYQKWYQPDNMV 200 (438)
T ss_pred HhCCCCCHHHHHHHHHHHHHHHHhhccCchHHHHHHHHHHhhccCCCCCCCCCCHHHHHhCCHHHHHHHHHHhcCcCceE
Confidence 455669999999999999999999999999999988877765444433 566 57899
Q ss_pred EEEEcCCCCCCCHHHHHHHH
Q 044668 328 MASYGDVINVPSYDAESAAS 347 (353)
Q Consensus 328 ~~~vG~~~~~p~~~~~~~~~ 347 (353)
+++|||+..---.+.+.+.+
T Consensus 201 l~vvGdi~~~~v~~~~~~~f 220 (438)
T COG0612 201 LVVVGDVDAEEVVELIEKYF 220 (438)
T ss_pred EEEecCCCHHHHHHHHHHHH
Confidence 99999987644444444433
No 24
>KOG0961 consensus Predicted Zn2+-dependent endopeptidase, insulinase superfamily [Posttranslational modification, protein turnover, chaperones]
Probab=82.16 E-value=4.9 Score=41.67 Aligned_cols=67 Identities=16% Similarity=0.176 Sum_probs=50.3
Q ss_pred cCCCCchHHHHHHHHhhcCCCCCCCCCCCCCcccHHHHHHHhhCCceeeecee-------ee------------------
Q 044668 217 TLEDKDAMTLTVTQMLLEGGGSFSAGGPGKGVYSRLHRRVLNEIPRVQQGVYF-------CG------------------ 271 (353)
Q Consensus 217 ~~~~~d~~~~~vl~~iLGgg~s~~~g~pg~g~~SrL~~~lre~~gl~Ys~~y~-------fg------------------ 271 (353)
|. ||++++..++..+||. +...+.+.+|-. ||+|+.+.. ++
T Consensus 825 w~-dpel~~~~l~~~YL~~------------~eGPfW~~IRG~-GLAYGanm~~~~d~~~~~~~iyr~ad~~kaye~~rd 890 (1022)
T KOG0961|consen 825 WN-DPELIPAMLFGQYLSQ------------CEGPFWRAIRGD-GLAYGANMFVKPDRKQITLSIYRCADPAKAYERTRD 890 (1022)
T ss_pred cC-CcchhHHHHHHHHHHh------------cccchhhhhccc-chhccceeEEeccCCEEEEEeecCCcHHHHHHHHHH
Confidence 54 4999999999999986 466789999985 999991111 11
Q ss_pred ----e--cCCCCCHHHHHHHHHHHHHHHHHhcc
Q 044668 272 ----I--TPGEVNQVQLDRAVQSTNSAILMNLE 298 (353)
Q Consensus 272 ----l--~~~~is~~EL~~aK~~l~~~~~~~~e 298 (353)
+ -+..+++.|++.||......+.. +|
T Consensus 891 iV~~~vsG~~e~s~~~~egAk~s~~~~~~~-~E 922 (1022)
T KOG0961|consen 891 IVRKIVSGSGEISKAEFEGAKRSTVFEMMK-RE 922 (1022)
T ss_pred HHHHHhcCceeecHHHhccchHHHHHHHHH-Hh
Confidence 2 25679999999999998876643 45
No 25
>TIGR02110 PQQ_syn_pqqF coenzyme PQQ biosynthesis probable peptidase PqqF. In a subset of species that make coenzyme PQQ (pyrrolo-quinoline-quinone), this probable peptidase is found in the PQQ biosynthesis region and is thought to act as a protease on PqqA (TIGR02107), a probable peptide precursor of the coenzyme. PQQ is required for some glucose dehydrogenases and alcohol dehydrogenases.
Probab=82.06 E-value=15 Score=38.92 Aligned_cols=103 Identities=9% Similarity=-0.006 Sum_probs=67.8
Q ss_pred HHHHHHHhhcCCCCCCCCCCCCCcccH-HHHHHHhhCCceee--e-----ceeee----------------ecCCCCCHH
Q 044668 225 TLTVTQMLLEGGGSFSAGGPGKGVYSR-LHRRVLNEIPRVQQ--G-----VYFCG----------------ITPGEVNQV 280 (353)
Q Consensus 225 ~~~vl~~iLGgg~s~~~g~pg~g~~Sr-L~~~lre~~gl~Ys--~-----~y~fg----------------l~~~~is~~ 280 (353)
-..++.++|.-|. +.+... =+.+..|+.|..++ + .|.+- +..--++++
T Consensus 42 LAHfLEHMLFkGT--------~~~~~~~~i~~~le~lGG~lNA~Ts~d~T~y~~~v~~~~l~~aL~lLaD~l~~P~f~ee 113 (696)
T TIGR02110 42 LAHFLEHLLFLGG--------ERFQGDDRLMPWVQRQGGQVNATTLERTTAFFFELPAAALAAGLARLCDMLARPLLTAE 113 (696)
T ss_pred HHHHHHHHHhcCC--------CCCCcHHHHHHHHHHhCCeEEEEEcCCeEEEEEEecHHHHHHHHHHHHHHHhCCCCCHH
Confidence 4577788875441 124442 24455666777665 1 12211 445569999
Q ss_pred HHHHHHHHHHHHHHHhccCHHHHHHHHHHHHHHcCCCcc--------------------------ccc-CCCceEEEEcC
Q 044668 281 QLDRAVQSTNSAILMNLESRIVVSEDIDRQVQTHGEMKP--------------------------KLI-LSPLTMASYGD 333 (353)
Q Consensus 281 EL~~aK~~l~~~~~~~~es~~~~a~~l~~~~~~~g~~~~--------------------------~~l-~~~~~~~~vG~ 333 (353)
|+++.|+.++..+....+++..++.......++.+.+.. +++ .++++++++||
T Consensus 114 eierEr~vvl~Ei~~~~ddp~~~~~~~l~~~l~~~HPy~~~~iGt~esL~~it~~t~edL~~F~~~~Y~p~NmvLvIvGd 193 (696)
T TIGR02110 114 DQQREREVLEAEYIAWQNDADTLREAALLDALQAGHPLRRFHAGSRDSLALPNTAFQQALRDFHRRHYQAGNMQLWLQGP 193 (696)
T ss_pred HHHHHHHHHHHHHHHHhcCHHHHHHHHHHHHcCCCCCCCCCCCCCHHHHhCcccchHHHHHHHHHHhcchhcEEEEEEeC
Confidence 999999999999999999988888876655553222211 344 56899999998
Q ss_pred CC
Q 044668 334 VI 335 (353)
Q Consensus 334 ~~ 335 (353)
++
T Consensus 194 vs 195 (696)
T TIGR02110 194 QS 195 (696)
T ss_pred CC
Confidence 64
No 26
>PF08367 M16C_assoc: Peptidase M16C associated; InterPro: IPR013578 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This domain appears in eukaryotes as well as bacteria and tends to be found near the C terminus of metalloproteases and related sequences belonging to MEROPS peptidase family M16 (subfamily M16C, clan ME). These include: eupitrilysin, falcilysin, PreP peptidase, CYM1 peptidase and subfamily M16C non-peptidase homologues.; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis; PDB: 2FGE_B 3S5I_A 3S5H_A 3S5M_A 3S5K_A.
Probab=78.48 E-value=8.5 Score=35.05 Aligned_cols=81 Identities=21% Similarity=0.163 Sum_probs=52.2
Q ss_pred CCCcccccCCCCcccCCCCCceeEEEccccCCCCCCCCcHHHHHHHHhcccCCCCC-----------------cCc----
Q 044668 38 SSPSLDFPLPGICLSHPSSLPDYFYVDCGSIYESPISFVKTDLLERMAFRSTRDRS-----------------NIQ---- 96 (353)
Q Consensus 38 ~~~~l~~~~p~~~~~~~~~v~~~~~v~~Gs~~E~~~~~G~ahlle~llf~gt~~~~-----------------~~~---- 96 (353)
+.|++..+.|.-.+ +.+.++|+.....+ .+.+=+.-|..-+---||++++ ++.
T Consensus 78 ~~~v~~~~~~TnGI-----~Y~~l~fdl~~l~~-e~l~yl~Ll~~ll~~lgT~~~sy~el~~~i~~~tGGis~~~~~~~~ 151 (248)
T PF08367_consen 78 GIPVLFHEQPTNGI-----VYVRLYFDLSDLPE-EDLPYLPLLTDLLGELGTKNYSYEELSNEIDLYTGGISFSIEVYTD 151 (248)
T ss_dssp TCEEEEEE---TTE-----EEEEEEEE-TTS-C-CCHCCHHHHHHHCCCS-BSSS-HHHHHHHHHHHSSEEEEEEEEEEE
T ss_pred CccEEEEEcCCCCe-----EEEEEEecCCCCCH-HHHHhHHHHHHHHHhCCCCCCCHHHHHHHHHHhCCCeEEEeeeccC
Confidence 55666666665555 88899999985543 4556666666666667999988 111
Q ss_pred ---cccCcceeEEEeecccCCHHHHHHHHHH
Q 044668 97 ---ASPSREQMGCSFAALETYVPEMVELLTD 124 (353)
Q Consensus 97 ---a~t~~d~t~~~~~~~~~~l~~~l~ll~d 124 (353)
.....-.+.+++.|+.++++++++++.+
T Consensus 152 ~~~~~~~~~~l~is~k~L~~~~~~~~~ll~e 182 (248)
T PF08367_consen 152 YDDDDKYRPYLVISAKCLDEKLDEAFELLSE 182 (248)
T ss_dssp ECTECCCEEEEEEEEEEEGGGHHHHHHHHHH
T ss_pred CCCccceeEEEEEEEEeHhhhHHHHHHHHHH
Confidence 1122335678999999999999999998
No 27
>KOG2067 consensus Mitochondrial processing peptidase, alpha subunit [Posttranslational modification, protein turnover, chaperones]
Probab=77.26 E-value=14 Score=35.91 Aligned_cols=125 Identities=15% Similarity=0.032 Sum_probs=75.6
Q ss_pred eeEEEccccCCCCCCCCcHHHHHHHHhcccCCCCC----------------------------cCccccCcceeEEEeec
Q 044668 59 DYFYVDCGSIYESPISFVKTDLLERMAFRSTRDRS----------------------------NIQASPSREQMGCSFAA 110 (353)
Q Consensus 59 ~~~~v~~Gs~~E~~~~~G~ahlle~llf~gt~~~~----------------------------~~~a~t~~d~t~~~~~~ 110 (353)
+.+-+.+=+..|+ .=++-.+-.||..|.+.++ .-..+.+..-+++++++
T Consensus 267 v~lg~Eg~~~~de---D~v~~avLq~lmGGGGSFSAGGPGKGMySrLY~~vLNry~wv~sctAfnhsy~DtGlfgi~~s~ 343 (472)
T KOG2067|consen 267 VVLGFEGCSWNDE---DFVALAVLQMLMGGGGSFSAGGPGKGMYSRLYLNVLNRYHWVYSCTAFNHSYSDTGLFGIYASA 343 (472)
T ss_pred eeEeeccCCCCCh---hHHHHHHHHHHhcCCcccCCCCCCcchHHHHHHHHHhhhHHHHHhhhhhccccCCceeEEeccC
Confidence 3444555444443 2366667778888888777 33445566678899999
Q ss_pred ccCCHHHHHHHHHH-------------HHHHhc--------C--hHHHHHHHHHHHhcCCCCCCCCCCChHhhcccCHHH
Q 044668 111 LETYVPEMVELLTD-------------ISEATR--------N--PQSLLSEAIFSACYSVVLANPLLAPECAISRLNSTL 167 (353)
Q Consensus 111 ~~~~l~~~l~ll~d-------------i~~~~~--------~--p~~~~~~~l~~~~f~~p~g~~~~G~~e~l~~it~ed 167 (353)
+++...+++++++. +++.+. | -.-+++|-+-+.+..+.-..+.---.+.|++++.+|
T Consensus 344 ~P~~a~~aveli~~e~~~~~~~v~~~el~RAK~qlkS~LlMNLESR~V~~EDvGRQVL~~g~rk~p~e~~~~Ie~lt~~D 423 (472)
T KOG2067|consen 344 PPQAANDAVELIAKEMINMAGGVTQEELERAKTQLKSMLLMNLESRPVAFEDVGRQVLTTGERKPPDEFIKKIEQLTPSD 423 (472)
T ss_pred CHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHhcccccchhHHHHhHHHHhccCcCCHHHHHHHHHhcCHHH
Confidence 99999999999988 111110 0 011233444444442222222222246788899999
Q ss_pred HHHHHHhhcCCCCeEEEEEc
Q 044668 168 IKEFFAENYTAPWMVLAASG 187 (353)
Q Consensus 168 L~~f~~~~y~p~n~~lvivG 187 (353)
+.++-++.++. +.+++..|
T Consensus 424 I~rva~kvlt~-~p~va~~G 442 (472)
T KOG2067|consen 424 ISRVASKVLTG-KPSVAAFG 442 (472)
T ss_pred HHHHHHHHhcC-CceeccCC
Confidence 99998888754 34555556
No 28
>PF00675 Peptidase_M16: Insulinase (Peptidase family M16) This is family M16 in the peptidase classification. ; InterPro: IPR011765 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. The majority of the sequences in this entry are metallopeptidases and non-peptidase homologs belong to MEROPS peptidase family M16 (clan ME), subfamilies M16A, M16B and M16C; they include: Insulinase, insulin-degrading enzyme (3.4.24.56 from EC) Mitochondrial processing peptidase alpha subunit, (Alpha-MPP, 3.4.24.64 from EC) Pitrlysin, Protease III precursor (3.4.24.55 from EC) Nardilysin, (3.4.24.61 from EC) Ubiquinol-cytochrome C reductase complex core protein I,mitochondrial precursor (1.10.2.2 from EC) Coenzyme PQQ synthesis protein F (3.4.99 from EC) These proteins do not share many regions of sequence similarity; the most noticeable is in the N-terminal section. This region includes a conserved histidine followed, two residues later by a glutamate and another histidine. In pitrilysin, it has been shown [] that this H-x-x-E-H motif is involved in enzymatic activity; the two histidines bind zinc and the glutamate is necessary for catalytic activity. The proteins classified as non-peptidase homologues either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity. ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3P7L_A 3P7O_A 3TUV_A 3GO9_A 1BE3_B 1PP9_B 2A06_B 1SQB_B 1SQP_B 1L0N_B ....
Probab=66.08 E-value=50 Score=26.92 Aligned_cols=86 Identities=13% Similarity=0.043 Sum_probs=56.9
Q ss_pred HHHHHHHhhcCCCCCCCCCCCCCcccHHHHHHHhhCCceee-------eceeee----------------ecCCCCCHHH
Q 044668 225 TLTVTQMLLEGGGSFSAGGPGKGVYSRLHRRVLNEIPRVQQ-------GVYFCG----------------ITPGEVNQVQ 281 (353)
Q Consensus 225 ~~~vl~~iLGgg~s~~~g~pg~g~~SrL~~~lre~~gl~Ys-------~~y~fg----------------l~~~~is~~E 281 (353)
...++.+++--|+ +++++.-+.+..++.|..++ ..|.+. +..-.+++++
T Consensus 33 ~a~ll~~l~~~gs--------~~~~~~~l~~~l~~~G~~~~~~t~~d~t~~~~~~~~~~~~~~l~~l~~~~~~P~f~~~~ 104 (149)
T PF00675_consen 33 LAHLLEHLLFRGS--------KKYSSDELQEELESLGASFNASTSRDSTSYSASVLSEDLEKALELLADMLFNPSFDEEE 104 (149)
T ss_dssp HHHHHHHHTTSBB--------SSSBHHHHHHHHHHTTCEEEEEEESSEEEEEEEEEGGGHHHHHHHHHHHHHSBGGCHHH
T ss_pred hhhhhhhhccccc--------chhhhhhhHHHhhhhccccceEecccceEEEEEEecccchhHHHHHHHHHhCCCCCHHH
Confidence 3566777664431 12555445556666787766 112211 4455699999
Q ss_pred HHHHHHHHHHHHHHhccCHHHHHHHHHHHHHHcCCCc
Q 044668 282 LDRAVQSTNSAILMNLESRIVVSEDIDRQVQTHGEMK 318 (353)
Q Consensus 282 L~~aK~~l~~~~~~~~es~~~~a~~l~~~~~~~g~~~ 318 (353)
+++.|..++..+....+++..++........+.+.+.
T Consensus 105 ~~~~r~~~~~ei~~~~~~~~~~~~~~l~~~~f~~~p~ 141 (149)
T PF00675_consen 105 FEREREQILQEIEEIKENPQELAFEKLHSAAFRGHPY 141 (149)
T ss_dssp HHHHHHHHHHHHHHHTTHHHHHHHHHHHHHHHTTSGG
T ss_pred HHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHhccCCC
Confidence 9999999999998887777777777776776666554
No 29
>PF09851 SHOCT: Short C-terminal domain; InterPro: IPR018649 This family of hypothetical prokaryotic proteins has no known function.
Probab=45.35 E-value=21 Score=21.28 Aligned_cols=18 Identities=11% Similarity=0.233 Sum_probs=15.3
Q ss_pred cCCCCCHHHHHHHHHHHH
Q 044668 273 TPGEVNQVQLDRAVQSTN 290 (353)
Q Consensus 273 ~~~~is~~EL~~aK~~l~ 290 (353)
..+-||++|+++.|+.+.
T Consensus 13 ~~G~IseeEy~~~k~~ll 30 (31)
T PF09851_consen 13 DKGEISEEEYEQKKARLL 30 (31)
T ss_pred HcCCCCHHHHHHHHHHHh
Confidence 357799999999999875
No 30
>KOG0960 consensus Mitochondrial processing peptidase, beta subunit, and related enzymes (insulinase superfamily) [Posttranslational modification, protein turnover, chaperones]
Probab=36.52 E-value=4.3e+02 Score=26.08 Aligned_cols=33 Identities=21% Similarity=0.293 Sum_probs=29.9
Q ss_pred hHhhcccCHHHHHHHHHhhcCCCCeEEEEEc-cC
Q 044668 157 ECAISRLNSTLIKEFFAENYTAPWMVLAASG-VE 189 (353)
Q Consensus 157 ~e~l~~it~edL~~f~~~~y~p~n~~lvivG-id 189 (353)
...|++|+.++++++-.+++--....++.|| +.
T Consensus 417 ~~rId~vt~~~Vr~va~k~iyd~~iAia~vG~ie 450 (467)
T KOG0960|consen 417 EARIDAVTAKDVREVASKYIYDKDIAIAAVGPIE 450 (467)
T ss_pred HHHHhhccHHHHHHHHHHHhhcCCcceeeecccc
Confidence 4678999999999999999999999999999 74
No 31
>PRK05986 cob(I)alamin adenolsyltransferase/cobinamide ATP-dependent adenolsyltransferase; Validated
Probab=31.50 E-value=62 Score=28.38 Aligned_cols=38 Identities=11% Similarity=0.114 Sum_probs=32.2
Q ss_pred hcccCHHHHHHHHHhhcCCCCeEEEEEc-cCHHHHHHHHHH
Q 044668 160 ISRLNSTLIKEFFAENYTAPWMVLAASG-VEHDQLVSVEEP 199 (353)
Q Consensus 160 l~~it~edL~~f~~~~y~p~n~~lvivG-id~~el~~~v~~ 199 (353)
..=|+.++|.++.+. .|.++.|++.| -.++++.++++-
T Consensus 130 ~gli~~eevi~~L~~--rp~~~evVlTGR~~p~~Lie~ADl 168 (191)
T PRK05986 130 YGYLDVEEVLEALNA--RPGMQHVVITGRGAPRELIEAADL 168 (191)
T ss_pred CCCccHHHHHHHHHc--CCCCCEEEEECCCCCHHHHHhCch
Confidence 345889999999976 89999999999 888888887653
No 32
>PF01729 QRPTase_C: Quinolinate phosphoribosyl transferase, C-terminal domain; InterPro: IPR002638 Quinolinate phosphoribosyl transferase (QPRTase) or nicotinate-nucleotide pyrophosphorylase 2.4.2.19 from EC is involved in the de novo synthesis of NAD in both prokaryotes and eukaryotes. It catalyses the reaction of quinolinic acid with 5-phosphoribosyl-1-pyrophosphate (PRPP) in the presence of Mg2+ to give rise to nicotinic acid mononucleotide (NaMN), pyrophosphate and carbon dioxide [, ]. Unlike IPR004393 from INTERPRO, this domain also includes the molybdenum transport system protein ModD.; GO: 0004514 nicotinate-nucleotide diphosphorylase (carboxylating) activity, 0009435 NAD biosynthetic process; PDB: 3C2O_A 3C2F_A 3C2E_A 3C2R_A 3C2V_A 1QPN_E 1QPO_A 1QPQ_F 1QPR_E 2I14_D ....
Probab=30.62 E-value=51 Score=28.24 Aligned_cols=41 Identities=17% Similarity=0.288 Sum_probs=31.6
Q ss_pred hhcccCHHHHHHHHHhhc-CCCCeEEEEEc-cCHHHHHHHHHH
Q 044668 159 AISRLNSTLIKEFFAENY-TAPWMVLAASG-VEHDQLVSVEEP 199 (353)
Q Consensus 159 ~l~~it~edL~~f~~~~y-~p~n~~lvivG-id~~el~~~v~~ 199 (353)
.+++.+.+++++..+..- .+.++.|.++| |+++.+.++.+.
T Consensus 105 ~lD~~~~~~~~~~v~~l~~~~~~v~ie~SGGI~~~ni~~ya~~ 147 (169)
T PF01729_consen 105 MLDNMSPEDLKEAVEELRELNPRVKIEASGGITLENIAEYAKT 147 (169)
T ss_dssp EEES-CHHHHHHHHHHHHHHTTTSEEEEESSSSTTTHHHHHHT
T ss_pred EecCcCHHHHHHHHHHHhhcCCcEEEEEECCCCHHHHHHHHhc
Confidence 567889999999888432 34459999999 999999988764
No 33
>PRK09016 quinolinate phosphoribosyltransferase; Validated
Probab=30.52 E-value=89 Score=29.42 Aligned_cols=38 Identities=21% Similarity=0.314 Sum_probs=32.1
Q ss_pred hcccCHHHHHHHHHhhcCCCCeEEEEEc-cCHHHHHHHHHH
Q 044668 160 ISRLNSTLIKEFFAENYTAPWMVLAASG-VEHDQLVSVEEP 199 (353)
Q Consensus 160 l~~it~edL~~f~~~~y~p~n~~lvivG-id~~el~~~v~~ 199 (353)
+++++.+++++..+. .+.++.|-++| |+.+.+.++++.
T Consensus 234 LDn~s~e~~~~av~~--~~~~~~ieaSGGI~~~ni~~yA~t 272 (296)
T PRK09016 234 LDNFTTEQMREAVKR--TNGRALLEVSGNVTLETLREFAET 272 (296)
T ss_pred eCCCChHHHHHHHHh--hcCCeEEEEECCCCHHHHHHHHhc
Confidence 468999999999984 35689999999 999999888764
No 34
>COG0285 FolC Folylpolyglutamate synthase [Coenzyme metabolism]
Probab=30.18 E-value=62 Score=32.16 Aligned_cols=53 Identities=21% Similarity=0.220 Sum_probs=39.5
Q ss_pred CCCCCCcccHHHHHHHhhCCc---eeeecee--e----eecCCCCCHHHHHHHHHHHHHHHH
Q 044668 242 GGPGKGVYSRLHRRVLNEIPR---VQQGVYF--C----GITPGEVNQVQLDRAVQSTNSAIL 294 (353)
Q Consensus 242 g~pg~g~~SrL~~~lre~~gl---~Ys~~y~--f----gl~~~~is~~EL~~aK~~l~~~~~ 294 (353)
|-.|||.+++..+.+.++.|+ +|++.+. | .+..+.|+++++.++-.++.....
T Consensus 51 GTNGKGSt~afl~siL~~aG~~VG~yTSPHL~~~~ERI~ing~~Isd~~~~~~~~~ve~~~~ 112 (427)
T COG0285 51 GTNGKGSTCAFLESILREAGYKVGVYTSPHLLSFNERIRINGEPISDEELAAAFERVEEAAG 112 (427)
T ss_pred CCCCchhHHHHHHHHHHHcCCCceEECCCccCccceEEEECCEECCHHHHHHHHHHHHHHhc
Confidence 456899999999999888886 4543332 3 366779999999999987765543
No 35
>PRK06559 nicotinate-nucleotide pyrophosphorylase; Provisional
Probab=28.27 E-value=1e+02 Score=28.99 Aligned_cols=39 Identities=10% Similarity=0.147 Sum_probs=32.4
Q ss_pred hhcccCHHHHHHHHHhhcCCCCeEEEEEc-cCHHHHHHHHHH
Q 044668 159 AISRLNSTLIKEFFAENYTAPWMVLAASG-VEHDQLVSVEEP 199 (353)
Q Consensus 159 ~l~~it~edL~~f~~~~y~p~n~~lvivG-id~~el~~~v~~ 199 (353)
.+++++.+++++..+. .+++..+-++| |+.+.+.++++.
T Consensus 222 mLDnmspe~l~~av~~--~~~~~~leaSGGI~~~ni~~yA~t 261 (290)
T PRK06559 222 MLDNMSLEQIEQAITL--IAGRSRIECSGNIDMTTISRFRGL 261 (290)
T ss_pred EECCCCHHHHHHHHHH--hcCceEEEEECCCCHHHHHHHHhc
Confidence 3579999999999973 35689999999 999999888764
No 36
>PRK06978 nicotinate-nucleotide pyrophosphorylase; Provisional
Probab=27.23 E-value=1.2e+02 Score=28.63 Aligned_cols=38 Identities=26% Similarity=0.368 Sum_probs=31.6
Q ss_pred hcccCHHHHHHHHHhhcCCCCeEEEEEc-cCHHHHHHHHHH
Q 044668 160 ISRLNSTLIKEFFAENYTAPWMVLAASG-VEHDQLVSVEEP 199 (353)
Q Consensus 160 l~~it~edL~~f~~~~y~p~n~~lvivG-id~~el~~~v~~ 199 (353)
+++++.+++++..+. .+.++.+-++| |+.+.+.++++.
T Consensus 231 LDnmspe~l~~av~~--~~~~~~lEaSGGIt~~ni~~yA~t 269 (294)
T PRK06978 231 LDNFTLDMMREAVRV--TAGRAVLEVSGGVNFDTVRAFAET 269 (294)
T ss_pred ECCCCHHHHHHHHHh--hcCCeEEEEECCCCHHHHHHHHhc
Confidence 468999999998874 35689999999 999999888764
No 37
>TIGR00708 cobA cob(I)alamin adenosyltransferase. Alternate name: corrinoid adenosyltransferase.
Probab=26.57 E-value=88 Score=26.95 Aligned_cols=37 Identities=14% Similarity=0.134 Sum_probs=31.6
Q ss_pred cccCHHHHHHHHHhhcCCCCeEEEEEc-cCHHHHHHHHHH
Q 044668 161 SRLNSTLIKEFFAENYTAPWMVLAASG-VEHDQLVSVEEP 199 (353)
Q Consensus 161 ~~it~edL~~f~~~~y~p~n~~lvivG-id~~el~~~v~~ 199 (353)
.=|+.++|.++.+. .|.++.|++.| --++++.++++-
T Consensus 113 gli~~~~v~~lL~~--rp~~~evVlTGR~~p~~l~e~AD~ 150 (173)
T TIGR00708 113 GYLDVEEVVEALQE--RPGHQHVIITGRGCPQDLLELADL 150 (173)
T ss_pred CCcCHHHHHHHHHh--CCCCCEEEEECCCCCHHHHHhCce
Confidence 45888999999976 89999999999 888888887653
No 38
>COG0157 NadC Nicotinate-nucleotide pyrophosphorylase [Coenzyme metabolism]
Probab=26.03 E-value=1.3e+02 Score=28.08 Aligned_cols=40 Identities=20% Similarity=0.204 Sum_probs=34.4
Q ss_pred hcccCHHHHHHHHHhhcCCCCeEEEEEc-cCHHHHHHHHHH
Q 044668 160 ISRLNSTLIKEFFAENYTAPWMVLAASG-VEHDQLVSVEEP 199 (353)
Q Consensus 160 l~~it~edL~~f~~~~y~p~n~~lvivG-id~~el~~~v~~ 199 (353)
+++++.+++++..+..=.+.++.+-++| |+.+.+.++.+.
T Consensus 214 LDNm~~e~~~~av~~l~~~~~~~lEaSGgIt~~ni~~yA~t 254 (280)
T COG0157 214 LDNMSPEELKEAVKLLGLAGRALLEASGGITLENIREYAET 254 (280)
T ss_pred ecCCCHHHHHHHHHHhccCCceEEEEeCCCCHHHHHHHhhc
Confidence 4689999999999877677899999999 999999887754
No 39
>PRK08385 nicotinate-nucleotide pyrophosphorylase; Provisional
Probab=25.88 E-value=1.2e+02 Score=28.28 Aligned_cols=41 Identities=10% Similarity=0.164 Sum_probs=31.5
Q ss_pred hhcccCHHHHHHHHHhh---cCCCCeEEEEEc-cCHHHHHHHHHH
Q 044668 159 AISRLNSTLIKEFFAEN---YTAPWMVLAASG-VEHDQLVSVEEP 199 (353)
Q Consensus 159 ~l~~it~edL~~f~~~~---y~p~n~~lvivG-id~~el~~~v~~ 199 (353)
.+++++.+++++..+.. ..+.++.+.++| |+.+.+.++++.
T Consensus 207 ~LDn~~~e~l~~~v~~l~~~~~~~~~~leaSGGI~~~ni~~yA~t 251 (278)
T PRK08385 207 MLDNMTPEEIREVIEALKREGLRERVKIEVSGGITPENIEEYAKL 251 (278)
T ss_pred EECCCCHHHHHHHHHHHHhcCcCCCEEEEEECCCCHHHHHHHHHc
Confidence 35688899998877643 123589999999 999999988764
No 40
>PRK06543 nicotinate-nucleotide pyrophosphorylase; Provisional
Probab=25.09 E-value=1.2e+02 Score=28.23 Aligned_cols=39 Identities=13% Similarity=0.176 Sum_probs=32.3
Q ss_pred hhcccCHHHHHHHHHhhcCCCCeEEEEEc-cCHHHHHHHHHH
Q 044668 159 AISRLNSTLIKEFFAENYTAPWMVLAASG-VEHDQLVSVEEP 199 (353)
Q Consensus 159 ~l~~it~edL~~f~~~~y~p~n~~lvivG-id~~el~~~v~~ 199 (353)
.+++++.+++++.... .+++..+-++| |+.+.+.++++.
T Consensus 218 mLDn~s~e~l~~av~~--~~~~~~leaSGgI~~~ni~~yA~t 257 (281)
T PRK06543 218 MLDNFSLDDLREGVEL--VDGRAIVEASGNVNLNTVGAIAST 257 (281)
T ss_pred EECCCCHHHHHHHHHH--hCCCeEEEEECCCCHHHHHHHHhc
Confidence 3579999999999973 45677999999 999999888764
No 41
>PRK07414 cob(I)yrinic acid a,c-diamide adenosyltransferase; Validated
Probab=25.05 E-value=92 Score=26.97 Aligned_cols=37 Identities=14% Similarity=0.296 Sum_probs=31.7
Q ss_pred hcccCHHHHHHHHHhhcCCCCeEEEEEc-cCHHHHHHHHH
Q 044668 160 ISRLNSTLIKEFFAENYTAPWMVLAASG-VEHDQLVSVEE 198 (353)
Q Consensus 160 l~~it~edL~~f~~~~y~p~n~~lvivG-id~~el~~~v~ 198 (353)
..=|+.+++.++.+. .|.++.|++.| -.++++.++++
T Consensus 130 ~gli~~eeVl~~L~~--rp~~~evILTGR~~p~~Lie~AD 167 (178)
T PRK07414 130 FGLIPETEVLEFLEK--RPSHVDVILTGPEMPESLLAIAD 167 (178)
T ss_pred CCCccHHHHHHHHHh--CCCCCEEEEECCCCCHHHHHhCC
Confidence 345889999999986 79999999999 88888888764
No 42
>KOG3432 consensus Vacuolar H+-ATPase V1 sector, subunit F [Energy production and conversion]
Probab=24.83 E-value=1.8e+02 Score=23.07 Aligned_cols=45 Identities=7% Similarity=0.043 Sum_probs=38.6
Q ss_pred ccCHHHHHHHHHhhcCCCCeEEEEEc-cCHHHHHHHHHHhhCCCCC
Q 044668 162 RLNSTLIKEFFAENYTAPWMVLAASG-VEHDQLVSVEEPLLSDLPI 206 (353)
Q Consensus 162 ~it~edL~~f~~~~y~p~n~~lvivG-id~~el~~~v~~~f~~l~~ 206 (353)
+-|.+++.+-++++...+.+.|+.+. +-.+.+...++.+-..+|+
T Consensus 44 ~Tt~~eiedaF~~f~~RdDIaIiLInq~~Ae~iR~~vD~h~~~vPA 89 (121)
T KOG3432|consen 44 KTTVEEIEDAFKSFTARDDIAIILINQFIAEMIRDRVDAHTQAVPA 89 (121)
T ss_pred cCCHHHHHHHHHhhccccCeEEEEEhHHHHHHHHHHHHhccccCCe
Confidence 56789999999999999999999999 8888888888887665554
No 43
>PRK06106 nicotinate-nucleotide pyrophosphorylase; Provisional
Probab=23.35 E-value=1.4e+02 Score=27.93 Aligned_cols=38 Identities=16% Similarity=0.268 Sum_probs=31.0
Q ss_pred hcccCHHHHHHHHHhhcCCCCeEEEEEc-cCHHHHHHHHHH
Q 044668 160 ISRLNSTLIKEFFAENYTAPWMVLAASG-VEHDQLVSVEEP 199 (353)
Q Consensus 160 l~~it~edL~~f~~~~y~p~n~~lvivG-id~~el~~~v~~ 199 (353)
+++++.+++++..+. .+.+..+-++| |+.+.+.++++.
T Consensus 220 LDn~s~e~l~~av~~--~~~~~~leaSGGI~~~ni~~yA~t 258 (281)
T PRK06106 220 LDNMTPDTLREAVAI--VAGRAITEASGRITPETAPAIAAS 258 (281)
T ss_pred eCCCCHHHHHHHHHH--hCCCceEEEECCCCHHHHHHHHhc
Confidence 468999999999883 34566799999 999999888764
No 44
>PRK07896 nicotinate-nucleotide pyrophosphorylase; Provisional
Probab=22.17 E-value=1.8e+02 Score=27.35 Aligned_cols=41 Identities=20% Similarity=0.235 Sum_probs=32.0
Q ss_pred hhcccCHHHHHHHHHhh-cCCCCeEEEEEc-cCHHHHHHHHHH
Q 044668 159 AISRLNSTLIKEFFAEN-YTAPWMVLAASG-VEHDQLVSVEEP 199 (353)
Q Consensus 159 ~l~~it~edL~~f~~~~-y~p~n~~lvivG-id~~el~~~v~~ 199 (353)
.+++++.+++++..... -...+..+.++| |+++.+.++++.
T Consensus 224 ~LDnm~~e~vk~av~~~~~~~~~v~ieaSGGI~~~ni~~yA~t 266 (289)
T PRK07896 224 LLDNFPVWQTQEAVQRRDARAPTVLLESSGGLTLDTAAAYAET 266 (289)
T ss_pred EeCCCCHHHHHHHHHHHhccCCCEEEEEECCCCHHHHHHHHhc
Confidence 34689999999988642 024678999999 999999888764
No 45
>PF00719 Pyrophosphatase: Inorganic pyrophosphatase; InterPro: IPR008162 Inorganic pyrophosphatase (3.6.1.1 from EC) (PPase) [, ] is the enzyme responsible for the hydrolysis of pyrophosphate (PPi) which is formed principally as the product of the many biosynthetic reactions that utilise ATP. All known PPases require the presence of divalent metal cations, with magnesium conferring the highest activity. Among other residues, a lysine has been postulated to be part of or close to the active site. PPases have been sequenced from bacteria such as Escherichia coli (homohexamer), Bacillus PS3 (Thermophilic bacterium PS-3) and Thermus thermophilus, from the archaebacteria Thermoplasma acidophilum, from fungi (homodimer), from a plant, and from bovine retina. In yeast, a mitochondrial isoform of PPase has been characterised which seems to be involved in energy production and whose activity is stimulated by uncouplers of ATP synthesis. The sequences of PPases share some regions of similarities, among which is a region that contains three conserved aspartates that are involved in the binding of cations.; GO: 0000287 magnesium ion binding, 0004427 inorganic diphosphatase activity, 0006796 phosphate-containing compound metabolic process, 0005737 cytoplasm; PDB: 2UXS_A 1WCF_A 1SXV_A 3I4Q_A 2PRD_A 2IHP_B 1PYP_A 2IK7_A 2IK4_A 1E9G_A ....
Probab=20.88 E-value=1.7e+02 Score=24.73 Aligned_cols=44 Identities=11% Similarity=0.068 Sum_probs=34.9
Q ss_pred HhhcccCHHHHHHHHHhhcCC-CCeEEEEEc-cCHHHHHHHHHHhh
Q 044668 158 CAISRLNSTLIKEFFAENYTA-PWMVLAASG-VEHDQLVSVEEPLL 201 (353)
Q Consensus 158 e~l~~it~edL~~f~~~~y~p-~n~~lvivG-id~~el~~~v~~~f 201 (353)
+.+..-..++++.|++.|=.. .|-.+.+-| .+.+++.+.|++..
T Consensus 106 ~dl~~~~~~~i~~fF~~YK~l~~~k~~~~~~~~~~~~A~~~i~~~~ 151 (156)
T PF00719_consen 106 EDLPPHLLDEIEHFFRNYKDLEENKWVEVGGWEDAEEALKVIKEAH 151 (156)
T ss_dssp GGSSHHHHHHHHHHHHHTTTTSTTEEEEEEEEEEHHHHHHHHHHHH
T ss_pred HHhChhHHHHHHHHHHHhcCcCCCCeEEeCCCcCHHHHHHHHHHHH
Confidence 333444578899999987777 888899999 99999999998754
No 46
>PLN02373 soluble inorganic pyrophosphatase
Probab=20.82 E-value=1.6e+02 Score=25.75 Aligned_cols=48 Identities=8% Similarity=0.079 Sum_probs=38.7
Q ss_pred CChHhhcccCHHHHHHHHHhhcCCCCeEEEEEc-cCHHHHHHHHHHhhC
Q 044668 155 APECAISRLNSTLIKEFFAENYTAPWMVLAASG-VEHDQLVSVEEPLLS 202 (353)
Q Consensus 155 G~~e~l~~it~edL~~f~~~~y~p~n~~lvivG-id~~el~~~v~~~f~ 202 (353)
-+.+.|.....++++.|++.|=...+-.+.+.| .+.+++.+.|++...
T Consensus 128 ~dl~Dl~~~~l~~I~~fF~~YK~legK~v~v~g~~~~~~A~~~I~~~~~ 176 (188)
T PLN02373 128 TDIKELPPHRLAEIRRFFEDYKKNENKEVAVNDFLPAEAAIEAIQYSMD 176 (188)
T ss_pred CChHHCCHHHHHHHHHHHHHhcccCCCeEEeCCccCHHHHHHHHHHHHH
Confidence 345666777789999999988788887788888 999999999877543
No 47
>PRK01250 inorganic pyrophosphatase; Provisional
Probab=20.28 E-value=1.2e+02 Score=26.29 Aligned_cols=50 Identities=12% Similarity=0.047 Sum_probs=38.2
Q ss_pred CChHhhcccCHHHHHHHHHhhcCCC-CeEEEEEc-cCHHHHHHHHHHhhCCC
Q 044668 155 APECAISRLNSTLIKEFFAENYTAP-WMVLAASG-VEHDQLVSVEEPLLSDL 204 (353)
Q Consensus 155 G~~e~l~~it~edL~~f~~~~y~p~-n~~lvivG-id~~el~~~v~~~f~~l 204 (353)
-+.+.|..-..++++.|++.|=..+ +-.+-+.| .+.+++.+.|++....+
T Consensus 122 ~dl~dl~~~~l~eI~~fF~~YK~le~gk~~~v~g~~~~~~A~~~I~~~~~~y 173 (176)
T PRK01250 122 KDVNDLPELLKAQIKHFFEHYKDLEKGKWVKVEGWGGAEEAKAEIVEAIERA 173 (176)
T ss_pred CChHHCCHHHHHHHHHHHHHhcCCCCCCCEEecCccCHHHHHHHHHHHHHHH
Confidence 3556677777899999999876654 56677888 99999999998765433
Done!