Query psy2731
Match_columns 537
No_of_seqs 321 out of 1974
Neff 7.9
Searched_HMMs 46136
Date Fri Aug 16 23:46:40 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy2731.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/2731hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1047|consensus 100.0 4E-113 1E-117 874.7 32.1 458 7-524 136-613 (613)
2 TIGR02411 leuko_A4_hydro leuko 100.0 7E-105 1E-109 872.9 41.1 455 6-521 130-601 (601)
3 KOG1046|consensus 100.0 2.7E-63 5.8E-68 564.6 21.5 316 6-376 159-490 (882)
4 TIGR02412 pepN_strep_liv amino 100.0 1.8E-61 3.9E-66 549.8 25.7 314 6-375 124-452 (831)
5 PRK14015 pepN aminopeptidase N 100.0 6.6E-59 1.4E-63 524.6 24.1 316 6-374 125-457 (875)
6 TIGR02414 pepN_proteo aminopep 100.0 2.1E-58 4.5E-63 519.1 24.1 316 6-374 112-444 (863)
7 COG0308 PepN Aminopeptidase N 100.0 1.5E-55 3.2E-60 501.2 23.1 319 6-377 140-475 (859)
8 PF01433 Peptidase_M1: Peptida 100.0 2.7E-51 5.9E-56 432.3 15.9 250 5-296 129-387 (390)
9 PF09127 Leuk-A4-hydro_C: Leuk 100.0 2.1E-42 4.5E-47 311.2 14.3 143 377-521 1-143 (143)
10 KOG1932|consensus 100.0 2.1E-27 4.6E-32 260.3 21.3 214 8-258 182-399 (1180)
11 PF13485 Peptidase_MA_2: Pepti 99.1 1.2E-10 2.5E-15 102.4 5.6 97 198-313 23-121 (128)
12 COG3975 Predicted protease wit 98.7 3.1E-07 6.8E-12 96.2 15.5 240 84-370 169-432 (558)
13 KOG1047|consensus 98.1 1E-05 2.2E-10 85.4 9.4 221 267-532 286-538 (613)
14 PF10460 Peptidase_M30: Peptid 96.7 0.0079 1.7E-07 62.1 9.7 47 198-244 137-187 (366)
15 PF05299 Peptidase_M61: M61 gl 96.5 0.0019 4.2E-08 56.3 3.1 45 199-243 3-58 (122)
16 PF07607 DUF1570: Protein of u 95.6 0.0071 1.5E-07 53.3 2.0 39 201-240 2-43 (128)
17 TIGR02411 leuko_A4_hydro leuko 93.9 0.74 1.6E-05 51.6 13.1 131 399-532 367-530 (601)
18 PF10026 DUF2268: Predicted Zn 93.9 0.19 4.1E-06 47.9 7.2 47 198-244 63-113 (195)
19 PF10023 DUF2265: Predicted am 90.8 0.25 5.5E-06 50.5 3.9 41 196-242 161-201 (337)
20 PRK04860 hypothetical protein; 86.7 0.99 2.1E-05 41.5 4.5 25 103-127 6-31 (160)
21 PF04450 BSP: Peptidase of pla 86.2 0.58 1.3E-05 44.9 2.8 38 198-238 94-131 (205)
22 COG4324 Predicted aminopeptida 84.9 0.71 1.5E-05 45.1 2.7 45 192-242 188-233 (376)
23 smart00731 SprT SprT homologue 76.2 2.2 4.7E-05 38.5 2.7 18 197-214 56-73 (146)
24 PF12725 DUF3810: Protein of u 75.0 3.1 6.8E-05 42.7 3.8 32 199-242 195-226 (318)
25 COG2719 SpoVR Uncharacterized 68.2 9 0.00019 40.4 5.2 57 184-247 234-290 (495)
26 PF12315 DUF3633: Protein of u 63.8 10 0.00022 36.2 4.3 40 201-242 94-133 (212)
27 PF04293 SpoVR: SpoVR like pro 59.6 19 0.0004 38.4 5.8 53 184-243 228-280 (426)
28 COG3227 LasB Zinc metalloprote 59.2 46 0.001 35.7 8.5 44 201-246 338-384 (507)
29 PF01447 Peptidase_M4: Thermol 57.3 35 0.00075 31.0 6.5 24 98-121 68-91 (150)
30 PRK11767 SpoVR family protein; 57.0 23 0.0005 38.3 6.0 56 184-246 237-292 (498)
31 PF13574 Reprolysin_2: Metallo 49.6 8.5 0.00018 35.8 1.3 14 199-212 110-123 (173)
32 PHA02456 zinc metallopeptidase 44.5 10 0.00023 32.1 0.9 16 197-212 76-91 (141)
33 PF01863 DUF45: Protein of unk 43.9 65 0.0014 30.3 6.5 24 198-226 162-185 (205)
34 PF10263 SprT-like: SprT-like 43.8 12 0.00026 33.8 1.3 18 197-214 57-74 (157)
35 PF01421 Reprolysin: Reprolysi 43.7 13 0.00029 35.1 1.6 13 199-211 130-142 (199)
36 PRK03001 M48 family peptidase; 42.8 34 0.00073 34.5 4.5 16 197-212 121-136 (283)
37 PF12125 Beta-TrCP_D: D domain 42.5 38 0.00082 23.3 3.1 32 405-438 4-35 (40)
38 PF08325 WLM: WLM domain; Int 42.1 13 0.00027 35.2 1.2 18 198-215 80-97 (186)
39 KOG2661|consensus 41.7 13 0.00028 37.9 1.2 20 195-214 270-289 (424)
40 PRK03982 heat shock protein Ht 41.0 36 0.00079 34.3 4.4 19 196-214 121-139 (288)
41 COG4783 Putative Zn-dependent 39.6 21 0.00046 38.2 2.5 18 198-215 128-145 (484)
42 PF06114 DUF955: Domain of unk 36.5 16 0.00035 30.7 0.9 19 199-217 41-59 (122)
43 PF13688 Reprolysin_5: Metallo 36.1 18 0.0004 33.9 1.3 14 199-212 141-154 (196)
44 cd04272 ZnMc_salivary_gland_MP 35.6 56 0.0012 31.4 4.6 14 199-212 144-157 (220)
45 COG0501 HtpX Zn-dependent prot 34.3 19 0.00041 36.3 1.1 20 196-215 153-172 (302)
46 PRK02391 heat shock protein Ht 33.2 48 0.001 33.7 3.8 19 197-215 130-148 (296)
47 PRK04351 hypothetical protein; 33.1 13 0.00029 33.7 -0.2 15 197-211 58-72 (149)
48 PF05960 DUF885: Bacterial pro 29.6 3.9E+02 0.0084 29.4 10.6 45 198-242 370-426 (549)
49 PF01435 Peptidase_M48: Peptid 29.4 21 0.00046 34.0 0.5 22 198-219 87-108 (226)
50 PRK05457 heat shock protein Ht 29.3 46 0.00099 33.6 2.9 19 197-215 131-149 (284)
51 PF11658 DUF3260: Protein of u 28.5 68 0.0015 34.8 4.1 107 278-393 218-339 (518)
52 PRK03072 heat shock protein Ht 27.1 25 0.00055 35.6 0.6 15 197-211 124-138 (288)
53 COG2856 Predicted Zn peptidase 27.1 79 0.0017 30.5 3.9 42 199-240 71-117 (213)
54 COG5504 Predicted Zn-dependent 25.7 2.8E+02 0.0061 27.4 7.3 39 196-237 136-181 (280)
55 PF13699 DUF4157: Domain of un 25.4 35 0.00077 27.3 1.0 13 200-212 61-73 (79)
56 PRK01345 heat shock protein Ht 25.4 28 0.00061 35.7 0.6 18 197-214 121-138 (317)
57 PF09768 Peptidase_M76: Peptid 25.3 1.4E+02 0.0031 27.8 5.1 18 195-212 66-83 (173)
58 PF09471 Peptidase_M64: IgA Pe 24.8 76 0.0016 31.7 3.5 16 199-214 215-230 (264)
59 PRK04897 heat shock protein Ht 24.5 30 0.00065 35.2 0.6 15 197-211 134-148 (298)
60 cd04269 ZnMc_adamalysin_II_lik 24.3 38 0.00082 31.8 1.2 14 199-212 130-143 (194)
61 TIGR03368 cellulose_yhjU cellu 24.3 93 0.002 33.7 4.1 106 278-393 216-337 (518)
62 PRK02870 heat shock protein Ht 23.7 32 0.00069 35.7 0.6 16 196-211 169-184 (336)
63 PF12174 RST: RCD1-SRO-TAF4 (R 23.4 1.4E+02 0.003 23.4 3.9 46 298-354 12-59 (70)
64 PF12388 Peptidase_M57: Dual-a 22.9 61 0.0013 31.2 2.3 13 199-211 132-144 (211)
65 COG1451 Predicted metal-depend 22.9 35 0.00076 33.2 0.7 19 197-215 172-190 (223)
66 PF14891 Peptidase_M91: Effect 22.8 43 0.00093 31.1 1.3 15 199-213 102-116 (174)
67 PF13582 Reprolysin_3: Metallo 22.7 43 0.00093 28.7 1.2 13 200-212 107-119 (124)
68 cd04270 ZnMc_TACE_like Zinc-de 22.4 43 0.00093 33.0 1.2 13 200-212 167-179 (244)
69 PF15641 Tox-MPTase5: Metallop 21.2 74 0.0016 25.9 2.0 18 199-216 63-81 (109)
70 PRK01265 heat shock protein Ht 20.6 40 0.00087 34.7 0.6 15 197-211 137-151 (324)
71 PF11140 DUF2913: Protein of u 20.5 76 0.0016 30.4 2.4 37 441-477 29-66 (207)
No 1
>KOG1047|consensus
Probab=100.00 E-value=4.5e-113 Score=874.67 Aligned_cols=458 Identities=37% Similarity=0.625 Sum_probs=427.1
Q ss_pred cccCcceecccCCCCCCceEEEEEEEeCCCcccccccccccccCCCCCcceEEEEcccccccceeEEEEeccceeccCCc
Q psy2731 7 KKYLAEFALQSIDTDNFKAHLTSHFAHKPEINQIEWDLWLNTTGMPPHIPKYSFYQPIKVPSYLVAIVVGNLASYKISER 86 (537)
Q Consensus 7 ~~~~Ar~~fPC~D~p~~ka~f~l~i~~p~~~~~~Sn~~~~~~~~~~~~~~~~~F~~t~pip~yl~a~avg~~~~~~~g~~ 86 (537)
+..+||.+|||+|.|+.|.||+..|.+|.+++++.+......+..++++..++|++.+|||+||+||++|++++.++|+|
T Consensus 136 QAIhaRsi~PC~DTPavK~ty~a~v~vp~~l~a~mSai~~~~~~~~~~~~~f~f~q~~pIP~YLiai~~G~L~s~eIgpR 215 (613)
T KOG1047|consen 136 QAIHARSIFPCQDTPAVKSTYTAEVEVPMGLTALMSAIPAGEKPGSNGRAIFRFKQEVPIPSYLIAIAVGDLESREIGPR 215 (613)
T ss_pred HHhHHheeccccCCCcceeEEEEEEEcCCcceeeeeccccccCCCCCCcceEEEEeccCchhhhHHHhhccccccccCCc
Confidence 45789999999999999999999999999999997776655555667789999999999999999999999999999999
Q ss_pred eEEecCcchHHHHHHHHH-HHHHHHHHHHHHhcCCCCCcccEEEeCCCCCCcccccccceeeccccchhhhhccccCCCC
Q psy2731 87 CSVWSEPELVKEAADEFN-ETEKFLSTAEEICGTYVWGVYDLVMLPPSFPFGGMENPYDTSLQDVCNDLANRWISWNHTK 165 (537)
Q Consensus 87 ~~v~~~p~~~~~~~~~~~-~~~~~l~~~e~~~GpYp~~k~d~v~lpp~f~~ggmEn~gltf~~~~~~~~~e~~~~~~~~~ 165 (537)
++||++|+.+++++++|. .++++|...|+++|||+|++||+|++||+||+||||||||||
T Consensus 216 s~VwaEp~~~~a~~~ef~~~~e~~L~~Ae~l~GpY~WgryDllvlPpSFP~gGMENPcltF------------------- 276 (613)
T KOG1047|consen 216 SRVWAEPCLLDACQEEFAGETEDFLKAAEKLFGPYVWGRYDLLVLPPSFPFGGMENPCLTF------------------- 276 (613)
T ss_pred cceecchhhhHHHHHHHHhhhHHHHHHHHHHcCCcccccceEEEecCCCCcccccCcceee-------------------
Confidence 999999999999999998 999999999999999999999999999999999999999999
Q ss_pred CCCCccccccccCCchhhhhhhccccccccCCcchhhhHhhhhhhhhccCcccCCCCchhhHHHHHHHHHHHHHHhhhcC
Q psy2731 166 SPPFSKQDLAAFTPGQKIEFLAKSSSYILAGDGSLAAVVAHEISHSWTGNLVTNRNFEHFWLNEGFTMFVERKITGRLRG 245 (537)
Q Consensus 166 ~~~~~~~~~~l~~p~~~~~~~~~~~~~~~~~~~~~~~viaHEiAHqWfGnlVt~~~w~~~WLnEGfa~y~e~~~~~~~~g 245 (537)
+|| ++|+||++++.||||||||+||||+||+.+|+|+|||||||+|+|++|+..++|
T Consensus 277 -----------~Tp------------TllaGDrsl~~vIaHEIAHSWtGNlVTN~sWehfWLNEGfTvylErrI~g~~~g 333 (613)
T KOG1047|consen 277 -----------VTP------------TLLAGDRSLVDVIAHEIAHSWTGNLVTNASWEHFWLNEGFTVYLERRIVGRLYG 333 (613)
T ss_pred -----------ecc------------hhhcCCcchhhHHHHHhhhhhcccccccCccchhhhcccchhhhhhhhhhhhcc
Confidence 999 999999999999999999999999999999999999999999999999999999
Q ss_pred chhHHHHHHHhHHHHHHHHccCcchhhhhhhccCCCc------ccccccCcCCccch--hhHHHHHHHHHhcccccchhh
Q psy2731 246 EAERHFDALSGLKDLKQAAGDGSLAAVVAHEISHSWT------GNLVTNRNFEHFWL--NEGFTMFVERKITGRLRGEAE 317 (537)
Q Consensus 246 ~~~~~~~~~~~~~~l~~~~~~d~~~~~~~~e~~~pl~------g~~~~~~~f~~i~Y--G~~~l~~L~~~lG~~~f~~~~ 317 (537)
++.+++++.++|..++..+ |.++. +++++ ++.+|+++|++++| |+++|++|++++|+...
T Consensus 334 ~~~~~f~a~~gw~~L~~~~--d~~g~------~~~~tkLv~kl~~~dPDdafs~VpYeKG~~ll~~Le~~lG~~~~---- 401 (613)
T KOG1047|consen 334 EAYRQFEALIGWRELRPSM--DLFGE------TSEFTKLVVKLENVDPDDAFSQVPYEKGFALLFYLEQLLGDPTR---- 401 (613)
T ss_pred hhHHHHHHhcChhhhhhHH--HhcCC------CcccchhhhhccCCChHHhhhcCchhhhhHHHHHHHHHhCChhh----
Confidence 9999999999999999999 88887 66655 23467799999999 99999999999998887
Q ss_pred hhhhhh--hHHHHHh-hhccCCCchhhHHHHHHHHhcCcchHH-----HHhh-hhcCCCCccccccccchhHHHHHHHHH
Q psy2731 318 RHFDAL--SGLKDLK-QAVSSTGPLWDSKRNALDFQKGRHYQC-----KIER-LKCGSAILFIYGYDTSLQDVCNDLANR 388 (537)
Q Consensus 318 ~~~~~l--~y~~~~~-~~~~t~~~~~df~~~l~~~~~~~~~~~-----~~~~-l~~~G~P~~~~~~~~~~~~~~~~l~~~ 388 (537)
|+.+ .|+++|+ +++.+ +||+++|.+++++.+..+ .++. ++++|+|+.+++|++++++.|.+|+++
T Consensus 402 --Fd~FLr~Yv~kfa~ksI~t----~dfld~Lye~fpe~kk~dil~~vd~~~Wl~~~G~Pp~~p~~~~sL~~~~~~La~~ 475 (613)
T KOG1047|consen 402 --FDPFLRAYVHKFAFKSILT----QDFLDFLYEYFPELKKKDILDEVDWDLWLNSPGMPPPKPNFDSSLARPVEALAQK 475 (613)
T ss_pred --HHHHHHHHHHHhccceecH----HHHHHHHHHhCcchhhhhhhccccHHHHhcCCCCCCCCCCccchHHHHHHHHHHH
Confidence 8865 9999999 99999 999999999998743222 2333 999999999999999999999999999
Q ss_pred HhhcCC--CCCCCCChhhcccCChhcHHHHHHHhhccCCCCHHHHHHHHhhcCcCCCCcHHHHHHHHHHHHhCcccchhH
Q psy2731 389 WISWNH--TKETPFSKQDLAAFTPGQKIEFLAILLDKEMYDLPKVKSLQDVYRFNGVRNCEIRFRWLKLCLKARWKEQVP 466 (537)
Q Consensus 389 w~~~~~--~~~~~~~~~~~~~~~~~q~~~FL~~l~~~~~l~~~~l~~ld~~y~l~~~~N~Ei~~rW~~l~i~~~~~~~~~ 466 (537)
|+..+. .....++..|+++|+++|++.||++|+++.++++..+++|+++|+|.+++|+||+|||++|+|+++|+++++
T Consensus 476 W~~~~~~~~~~~~~~a~d~~~~~~~Ql~lfL~~l~q~~~lp~~~~~aL~~~Y~l~~skNAEvr~rw~~L~vksk~~~~~~ 555 (613)
T KOG1047|consen 476 WTAVEELDDAASEFSAEDIEKWSPDQLVLFLDKLLQSKPLPPGTVKALQDTYSLLQSKNAEVRFRWLQLIVKSKWQEAYK 555 (613)
T ss_pred HhhcccccccccccchhhHHhcChHhHHHHHHHHHhcCCCChHHHHHHHHHhhhhhccchHHHHHHHHHHHHhhhHHHHH
Confidence 998443 233457889999999999999999999998899999999999999999999999999999999999999999
Q ss_pred HHHHhHhccCccccccHHHHHHhcCccCHHHHHHHHHHhhhcccHHHHHHHHhhcCCC
Q psy2731 467 HVIDMVTSQGRMKYVRPLYRELYAWEDTRQTAIDTFKQHRKQMMYVTAYTLAKDLKLG 524 (537)
Q Consensus 467 ~~~~fl~~~GRmk~~~PlYr~L~~~~~~r~~A~~~f~~~~~~yhpi~~~~v~k~l~~~ 524 (537)
.|.+||.+||||||||||||+|+.|+..+++|+++|+++|.+|||||+++|++||+++
T Consensus 556 ~v~~fl~~~grmkf~rPlyr~L~~~~~~~~~A~~~F~~~k~qmhpi~~~~V~~~L~~~ 613 (613)
T KOG1047|consen 556 KVLEFLGEQGRMKFTRPLYRDLAGGEAAKQLAIETFAKTKSQMHPICANVVQKDLGLY 613 (613)
T ss_pred HHHHHHHhccceeeehHHHHHhhcchHHHHHHHHHHHHhHhhhCHHHHHHHHHHhcCC
Confidence 9999999999999999999999999889999999999999999999999999999874
No 2
>TIGR02411 leuko_A4_hydro leukotriene A-4 hydrolase/aminopeptidase. Members of this family represent a distinctive subset within the zinc metallopeptidase family M1 (pfam01433). The majority of the members of pfam01433 are aminopeptidases, but the sequences in this family for which the function is known are leukotriene A-4 hydrolase. A dual epoxide hydrolase and aminopeptidase activity at the same active site is indicated. The physiological substrate for aminopeptidase activity is not known.
Probab=100.00 E-value=6.6e-105 Score=872.91 Aligned_cols=455 Identities=34% Similarity=0.582 Sum_probs=403.2
Q ss_pred ccccCcceecccCCCCCCceEEEEEEEeCCCcccccccccccccCCCCCcceEEEEcccccccceeEEEEeccceeccCC
Q psy2731 6 LKKYLAEFALQSIDTDNFKAHLTSHFAHKPEINQIEWDLWLNTTGMPPHIPKYSFYQPIKVPSYLVAIVVGNLASYKISE 85 (537)
Q Consensus 6 l~~~~Ar~~fPC~D~p~~ka~f~l~i~~p~~~~~~Sn~~~~~~~~~~~~~~~~~F~~t~pip~yl~a~avg~~~~~~~g~ 85 (537)
.||.+||+||||||+|++||||+++|+|| +.|++|+.....+ .++..+++|++++|||+||+||+||+|+..+.|+
T Consensus 130 ~qp~~AR~~fPC~D~P~~Katf~~~I~~P--~~av~sg~~~~~~--~~~~~~~~F~~t~pmptYLia~avG~~~~~~~g~ 205 (601)
T TIGR02411 130 CQAIHARSVIPCQDTPSVKSTYTAEVESP--LPVLMSGIPDGET--SNDPGKYLFKQKVPIPAYLIALASGDLASAPIGP 205 (601)
T ss_pred CcccchheeeeecCCcccceEEEEEEeeC--cceeccCCccccc--cCCCceEEEEeCCCcchhhheeeeccceecccCC
Confidence 48999999999999999999999999999 8888665544322 2456789999999999999999999999999999
Q ss_pred ceEEecCcchHHHHHHHHH-HHHHHHHHHHHHhcCCCCCcccEEEeCCCCCCcccccccceeeccccchhhhhccccCCC
Q psy2731 86 RCSVWSEPELVKEAADEFN-ETEKFLSTAEEICGTYVWGVYDLVMLPPSFPFGGMENPYDTSLQDVCNDLANRWISWNHT 164 (537)
Q Consensus 86 ~~~v~~~p~~~~~~~~~~~-~~~~~l~~~e~~~GpYp~~k~d~v~lpp~f~~ggmEn~gltf~~~~~~~~~e~~~~~~~~ 164 (537)
+++||++|+.++++++++. .++++|+++|+++|||||+|||+|++||+|++|||||||+||
T Consensus 206 ~~~v~~~p~~~~~~~~~~~~~~~~~l~~~e~~~~pYp~~k~d~vvlpp~f~~GgMEN~~ltf------------------ 267 (601)
T TIGR02411 206 RSSVYSEPEQLEKCQYEFEHDTENFIKTAEDLIFPYEWGQYDLLVLPPSFPYGGMENPNLTF------------------ 267 (601)
T ss_pred ceEEEccchhHHHHHHHHHHhHHHHHHHHHHhCCCCcCccceEEEecCccccccccccccee------------------
Confidence 9999999999999999998 999999999999889999999999999999999999999999
Q ss_pred CCCCCccccccccCCchhhhhhhccccccccCCcchhhhHhhhhhhhhccCcccCCCCchhhHHHHHHHHHHHHHHhhhc
Q psy2731 165 KSPPFSKQDLAAFTPGQKIEFLAKSSSYILAGDGSLAAVVAHEISHSWTGNLVTNRNFEHFWLNEGFTMFVERKITGRLR 244 (537)
Q Consensus 165 ~~~~~~~~~~~l~~p~~~~~~~~~~~~~~~~~~~~~~~viaHEiAHqWfGnlVt~~~w~~~WLnEGfa~y~e~~~~~~~~ 244 (537)
.+| +++.++++...+||||||||||||+||++||+|+|||||||+|+|++++++++
T Consensus 268 ------------~~~------------~ll~~d~s~~~viaHElAHqWfGNlVT~~~W~d~WLnEGfaty~e~~~~~~~~ 323 (601)
T TIGR02411 268 ------------ATP------------TLIAGDRSNVDVIAHELAHSWSGNLVTNCSWEHFWLNEGWTVYLERRIVGRLY 323 (601)
T ss_pred ------------ecc------------ccccCChhhhhhHHHHHHhhccCceeecCCchHHHHHhhHHHHHHHHHHHHhc
Confidence 677 77888888889999999999999999999999999999999999999999999
Q ss_pred CchhHHHHHHHhHHHHHHHHccCcchhhhhhhccCCCcc--cccccCcCCccch--hhHHHHHHHHHhc-ccccchhhhh
Q psy2731 245 GEAERHFDALSGLKDLKQAAGDGSLAAVVAHEISHSWTG--NLVTNRNFEHFWL--NEGFTMFVERKIT-GRLRGEAERH 319 (537)
Q Consensus 245 g~~~~~~~~~~~~~~l~~~~~~d~~~~~~~~e~~~pl~g--~~~~~~~f~~i~Y--G~~~l~~L~~~lG-~~~f~~~~~~ 319 (537)
|++.+.+....++..++.++. .+.. .++++.+... ...+++.|+.++| |+++|+|||.+|| ++.|
T Consensus 324 ~e~~~~~~~~~~~~~l~~~~~--~~~~--~~~~~~~~~~~~~~dp~~~f~~i~Y~KGa~~L~mL~~~lG~~~~F------ 393 (601)
T TIGR02411 324 GEKTRHFSALIGWGELQESVK--TLGE--DPEYTKLVVDLKDNDPDDAFSSVPYEKGFNFLFYLEQLLGGPAVF------ 393 (601)
T ss_pred CcHHHHHHHHHhHHHHHHHHH--hhcC--CCCCCcccccCCCCChhhhccccchhhHHHHHHHHHHHhCCHHHH------
Confidence 998777877778888877773 2322 1222222210 1146689999999 9999999999999 8888
Q ss_pred hhhh-hHHHHHh-hhccCCCchhhHHHHHHHHhcCc---chHH--HHhh-hhcCCCCccccccccchhHHHHHHHHHHhh
Q psy2731 320 FDAL-SGLKDLK-QAVSSTGPLWDSKRNALDFQKGR---HYQC--KIER-LKCGSAILFIYGYDTSLQDVCNDLANRWIS 391 (537)
Q Consensus 320 ~~~l-~y~~~~~-~~~~t~~~~~df~~~l~~~~~~~---~~~~--~~~~-l~~~G~P~~~~~~~~~~~~~~~~l~~~w~~ 391 (537)
++++ .|+++|+ +++++ +||+++|.+++++. .+.. .|+. ++++|+|+++++|++++.+.|.+|+++|+.
T Consensus 394 ~~~lr~Yl~~~~~~s~~t----~df~~~l~~~~~~~~~~~~l~~~~~~~Wl~~~G~P~~~~~~~~~~~~~~~~l~~~w~~ 469 (601)
T TIGR02411 394 DPFLKHYFKKFAYKSLDT----YQFKDALYEYFKDTGKVDKLNAVDWDTWLYSPGLPPVKPNFDTTLADECYALASRWVD 469 (601)
T ss_pred HHHHHHHHHHhCCCCCCH----HHHHHHHHHHhhhccccchhhhhhHHHHhcCCCCCCcCCCCCchHHHHHHHHHHHHHh
Confidence 8899 9999999 99999 99999999987432 1222 3677 999999999999999999999999999987
Q ss_pred cCCCCCCCCChhhcccCChhcHHHHHHHhhccC---CCCHHHHHHHHhhcCcCCCCcHHHHHHHHHHHHhCcccchhHHH
Q psy2731 392 WNHTKETPFSKQDLAAFTPGQKIEFLAILLDKE---MYDLPKVKSLQDVYRFNGVRNCEIRFRWLKLCLKARWKEQVPHV 468 (537)
Q Consensus 392 ~~~~~~~~~~~~~~~~~~~~q~~~FL~~l~~~~---~l~~~~l~~ld~~y~l~~~~N~Ei~~rW~~l~i~~~~~~~~~~~ 468 (537)
.++.+..+++..++++|+++|+++||+.|+++. +|++++|++||++|+|++++|+||+||||+++|+++|+++++++
T Consensus 470 ~~~~~~~~~~~~~~~~~~~~q~~~fl~~l~~~~~~~~l~~~~~~~l~~~y~~~~~~n~ei~~~w~~~~~~~~~~~~~~~~ 549 (601)
T TIGR02411 470 AAKDDLSSFNAKDIKDFSSHQLVLFLETLTERKGIVSLPEGHIKRMGDIYNFAASKNAEVRFRWNRLAIQAKLEDEYPLI 549 (601)
T ss_pred cCCCCCCCcchhhhhhCCHHHHHHHHHHHHhhccCCCCCHHHHHHHHhhcCccCCCchhhhHHHHHHHHhcCCchhHHHH
Confidence 442222233558999999999999999999963 69999999999999999999999999999999999999999999
Q ss_pred HHhHhccCccccccHHHHHHhcCccCHHHHHHHHHHhhhcccHHHHHHHHhhc
Q psy2731 469 IDMVTSQGRMKYVRPLYRELYAWEDTRQTAIDTFKQHRKQMMYVTAYTLAKDL 521 (537)
Q Consensus 469 ~~fl~~~GRmk~~~PlYr~L~~~~~~r~~A~~~f~~~~~~yhpi~~~~v~k~l 521 (537)
.+||+++|||||||||||+|+++ ++|++|+++|+++|++|||||+++|+|||
T Consensus 550 ~~~l~~~gr~k~~~p~y~~l~~~-~~~~~a~~~f~~~~~~yh~~~~~~v~~~l 601 (601)
T TIGR02411 550 AEWLGTVGRMKFVRPGYRLLNAF-VDKDFAIRTFEKFKDSYHPICAMLVEKDL 601 (601)
T ss_pred HHHHHhcCCcEEehHHHHHHHhc-cCHHHHHHHHHHHhhccCHHHHHHHHhhC
Confidence 99999999999999999999998 68999999999999999999999999997
No 3
>KOG1046|consensus
Probab=100.00 E-value=2.7e-63 Score=564.60 Aligned_cols=316 Identities=22% Similarity=0.259 Sum_probs=276.6
Q ss_pred ccccCcceecccCCCCCCceEEEEEEEeCCCcccccccccccccCCCCCcceEEEEcccccccceeEEEEeccceec---
Q psy2731 6 LKKYLAEFALQSIDTDNFKAHLTSHFAHKPEINQIEWDLWLNTTGMPPHIPKYSFYQPIKVPSYLVAIVVGNLASYK--- 82 (537)
Q Consensus 6 l~~~~Ar~~fPC~D~p~~ka~f~l~i~~p~~~~~~Sn~~~~~~~~~~~~~~~~~F~~t~pip~yl~a~avg~~~~~~--- 82 (537)
++|+.||++|||||+|++||+|.|+|.||++++++|||++......++++.+.+|++|++||+|++||+||+|...+
T Consensus 159 fept~AR~~FPCfDeP~~KAtF~Itl~hp~~~~aLSNm~v~~~~~~~~~~~~~~F~~Tp~MstYLvAf~V~~f~~~e~~~ 238 (882)
T KOG1046|consen 159 FEPTDARRAFPCFDEPAFKATFTITLVHPKGYTALSNMPVIKEEPVDDGWKTTTFEKTPKMSTYLVAFAVGDFVYVETIT 238 (882)
T ss_pred cCccchhhcCCCCCcccccCceEEEEEecCCceEeecCcccccccccCCeeEEEEEecCCCchhhheeeeeccccceeec
Confidence 58999999999999999999999999999999999999999888888889999999999999999999999999876
Q ss_pred -cCCceEEecCcchHHHHHHHHHHHHHHHHHHHHHhc-CCCCCcccEEEeCCCCCCcccccccc-eeeccccchhhhhcc
Q psy2731 83 -ISERCSVWSEPELVKEAADEFNETEKFLSTAEEICG-TYVWGVYDLVMLPPSFPFGGMENPYD-TSLQDVCNDLANRWI 159 (537)
Q Consensus 83 -~g~~~~v~~~p~~~~~~~~~~~~~~~~l~~~e~~~G-pYp~~k~d~v~lpp~f~~ggmEn~gl-tf~~~~~~~~~e~~~ 159 (537)
.|++++||++|+..+++.++++.+.++|++++++|| |||.+|+|+|++ |+|..|+|||||| || +|..+
T Consensus 239 ~~~v~vrv~a~p~~~~~~~~al~~~~~~L~~~e~~f~i~yPLpK~D~iav-Pdf~~GAMENwGLvty--------re~~l 309 (882)
T KOG1046|consen 239 KSGVPVRVYARPEKINQGQFALEVATKVLEFYEDYFGIPYPLPKLDLVAV-PDFSAGAMENWGLVTY--------RETAL 309 (882)
T ss_pred CCCceEEEEeChHHhhHHHHHHHHHHHHHHHHHHHhCCCCCCccccEEec-CCccccchhcCcceee--------eehhh
Confidence 478999999999999999999999999999999999 999999999999 6999999999997 55 77777
Q ss_pred ccCCCCCCCCccccccccCCchhhhhhhccccccccCCcchhhhHhhhhhhhhccCcccCCCCchhhHHHHHHHHHHHHH
Q psy2731 160 SWNHTKSPPFSKQDLAAFTPGQKIEFLAKSSSYILAGDGSLAAVVAHEISHSWTGNLVTNRNFEHFWLNEGFTMFVERKI 239 (537)
Q Consensus 160 ~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~~~~~~~~~~~~~viaHEiAHqWfGnlVt~~~w~~~WLnEGfa~y~e~~~ 239 (537)
+|+++.+ ....+..++.+||||+|||||||+||++||+|+|||||||+|++...
T Consensus 310 L~~~~~s--------------------------s~~~k~~va~vIaHElAHQWFGNLVTm~wW~dLWLnEGfAt~~~~~~ 363 (882)
T KOG1046|consen 310 LYDPQTS--------------------------SSSNKQRVAEVIAHELAHQWFGNLVTMKWWNDLWLNEGFATYVEYLA 363 (882)
T ss_pred ccCCCcC--------------------------cHHHHHHHHHHHHHHHHHHHhcCcccHhhhhhhhhcccHHHHHHHHh
Confidence 7777542 12455678999999999999999999999999999999999999999
Q ss_pred HhhhcCchhHHHHHHHhHHHHHHHHccCcchhhhhhhccCCCccccc----ccCcCCccch--hhHHHHHHHHHhccccc
Q psy2731 240 TGRLRGEAERHFDALSGLKDLKQAAGDGSLAAVVAHEISHSWTGNLV----TNRNFEHFWL--NEGFTMFVERKITGRLR 313 (537)
Q Consensus 240 ~~~~~g~~~~~~~~~~~~~~l~~~~~~d~~~~~~~~e~~~pl~g~~~----~~~~f~~i~Y--G~~~l~~L~~~lG~~~f 313 (537)
++..+++.. +...+....+..++..|.... +||+..... ..+.|+.+.| |++++|||+..||++.|
T Consensus 364 v~~~~p~~~--~~~~~~~~~l~~~l~~D~l~~------shpi~~~v~~~~ei~e~fd~i~Y~KGasvlRML~~~lGe~~F 435 (882)
T KOG1046|consen 364 VDHLFPEWD--IWEQFLLENLERVLSLDALAS------SHPISVPVESPSEIDEIFDEISYQKGASVLRMLESLLGEEVF 435 (882)
T ss_pred hccCCcchh--hHHHHHHHHHHHHhhhhcccc------cCCeeeecCCcchhhhhhhhhhhhHHHHHHHHHHHHHCHHHH
Confidence 998888753 334455677777787787776 566542111 1257999999 99999999999999999
Q ss_pred chhhhhhhhh-hHHHHHh-hhccCCCchhhHHHHHHHHhcCcchH-HHHhh-hhcCCCCcccccccc
Q psy2731 314 GEAERHFDAL-SGLKDLK-QAVSSTGPLWDSKRNALDFQKGRHYQ-CKIER-LKCGSAILFIYGYDT 376 (537)
Q Consensus 314 ~~~~~~~~~l-~y~~~~~-~~~~t~~~~~df~~~l~~~~~~~~~~-~~~~~-l~~~G~P~~~~~~~~ 376 (537)
.+++ .|+.+|+ +++++ +|||+.|. .+...++ ++|+. ..|+|+|++.|.-..
T Consensus 436 ------~~gi~~yL~~~~y~na~~----~DLw~~l~--~~~~~~v~~~M~~Wt~Q~G~Pvv~V~~~~ 490 (882)
T KOG1046|consen 436 ------RKGLRSYLKKHQYSNAKT----EDLWDALE--EGSGLDVSELMDTWTKQMGYPVVTVERNG 490 (882)
T ss_pred ------HHHHHHHHHHhccCCCCc----hhHHHHHh--ccCCCCHHHHHhhhhcCCCCceEEEEecC
Confidence 9999 9999999 99999 99999999 3333444 47777 999999999986544
No 4
>TIGR02412 pepN_strep_liv aminopeptidase N, Streptomyces lividans type. This family is a subset of the members of the zinc metallopeptidase family M1 (pfam01433), with a single member characterized in Streptomyces lividans 66 and designated aminopeptidase N. The spectrum of activity may differ somewhat from the aminopeptidase N clade of E. coli and most other Proteobacteria, well separated phylogenetically within the M1 family. The M1 family also includes leukotriene A-4 hydrolase/aminopeptidase (with a bifunctional active site).
Probab=100.00 E-value=1.8e-61 Score=549.83 Aligned_cols=314 Identities=18% Similarity=0.222 Sum_probs=256.8
Q ss_pred ccccCcceecccCCCCCCceEEEEEEEeCCCcccccccccccccCCCCCcceEEEEcccccccceeEEEEeccceec---
Q psy2731 6 LKKYLAEFALQSIDTDNFKAHLTSHFAHKPEINQIEWDLWLNTTGMPPHIPKYSFYQPIKVPSYLVAIVVGNLASYK--- 82 (537)
Q Consensus 6 l~~~~Ar~~fPC~D~p~~ka~f~l~i~~p~~~~~~Sn~~~~~~~~~~~~~~~~~F~~t~pip~yl~a~avg~~~~~~--- 82 (537)
.||+.||+||||||+|++||+|+|+|++|++|+|+|||++.+... .++.++++|++++|||+|++||+||+|...+
T Consensus 124 ~ep~~Ar~~fPcfDeP~~KAtf~ltit~p~~~~v~sNg~~~~~~~-~~~~~~~~F~~t~pmstYL~a~~vG~f~~~~~~~ 202 (831)
T TIGR02412 124 FEPADARRVFAVFDQPDLKANFKFSVKAPEDWTVISNSRETDVTP-EPADRRWEFPETPKLSTYLTAVAAGPYHSVQDES 202 (831)
T ss_pred CCCcCceeeEecCCCCCCceeEEEEEEECCCceEECCCccccccc-cCCCeEEEecCCCCcccceEEEEEeceEEEeecC
Confidence 589999999999999999999999999999999999999865543 3567789999999999999999999999876
Q ss_pred cCCceEEecCcchHHH--HHHHHHHHHHHHHHHHHHhc-CCCCCcccEEEeCCCCCCcccccccceeeccccchhhhhcc
Q psy2731 83 ISERCSVWSEPELVKE--AADEFNETEKFLSTAEEICG-TYVWGVYDLVMLPPSFPFGGMENPYDTSLQDVCNDLANRWI 159 (537)
Q Consensus 83 ~g~~~~v~~~p~~~~~--~~~~~~~~~~~l~~~e~~~G-pYp~~k~d~v~lpp~f~~ggmEn~gltf~~~~~~~~~e~~~ 159 (537)
.|+++++|++|+..+. +.++++.+.++|+++|++|| ||||+|||+|++ |+|+.|||||||++.+ +|.++
T Consensus 203 ~gvpi~v~~~~~~~~~~~~~~al~~~~~~l~~~e~~fg~pYP~~k~d~V~v-P~f~~GaMEn~Glit~-------~e~~l 274 (831)
T TIGR02412 203 RSYPLGIYARRSLAQYLDADAIFTITRQGLAFFHRKFGYPYPFKKYDQIFV-PEFNAGAMENAGCVTF-------AENFL 274 (831)
T ss_pred CCEEEEEEECcchhhhhhHHHHHHHHHHHHHHHHHHhCCCCCcccCCEEEc-CCCCCCcccccceeee-------chhhc
Confidence 4688999999987664 56788999999999999999 999999999999 5999999999997542 23322
Q ss_pred ccCCCCCCCCccccccccCCchhhhhhhccccccccCCcchhhhHhhhhhhhhccCcccCCCCchhhHHHHHHHHHHHHH
Q psy2731 160 SWNHTKSPPFSKQDLAAFTPGQKIEFLAKSSSYILAGDGSLAAVVAHEISHSWTGNLVTNRNFEHFWLNEGFTMFVERKI 239 (537)
Q Consensus 160 ~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~~~~~~~~~~~~~viaHEiAHqWfGnlVt~~~w~~~WLnEGfa~y~e~~~ 239 (537)
++.. .++ ..+..+..+|+||||||||||+||++||+|+|||||||+|+++++
T Consensus 275 -~~~~------------~~~---------------~~~~~~~~viaHElAHqWFGnlVT~~wW~dlWLnEGFAty~e~~~ 326 (831)
T TIGR02412 275 -HRAE------------ATR---------------AEKENRAGVILHEMAHMWFGDLVTMRWWNDLWLNESFAEYMGTLA 326 (831)
T ss_pred -cCCc------------CCH---------------HHHHHHHHHHHHHHHHHHhCCEeccccccchhHHHHHHHHHHHHH
Confidence 1110 111 223456789999999999999999999999999999999999999
Q ss_pred HhhhcCchhHHHHHHHhHHHHHHHHccCcchhhhhhhccCCCcccc----cccCcCCccch--hhHHHHHHHHHhccccc
Q psy2731 240 TGRLRGEAERHFDALSGLKDLKQAAGDGSLAAVVAHEISHSWTGNL----VTNRNFEHFWL--NEGFTMFVERKITGRLR 313 (537)
Q Consensus 240 ~~~~~g~~~~~~~~~~~~~~l~~~~~~d~~~~~~~~e~~~pl~g~~----~~~~~f~~i~Y--G~~~l~~L~~~lG~~~f 313 (537)
+++.++... ....+.......++..|.... +||+.... .....|+.++| |+++|+||+..||++.|
T Consensus 327 ~~~~~~~~~--~~~~f~~~~~~~a~~~D~~~~------t~Pi~~~~~~~~~~~~~fd~isY~KGa~vL~mL~~~lGee~F 398 (831)
T TIGR02412 327 SAEATEYTD--AWTTFAAQGKQWAYEADQLPT------THPIVADVADLADALSNFDGITYAKGASVLKQLVAWVGEEAF 398 (831)
T ss_pred HHhcCCcch--HHHHHHHHHHHHHHHHhcccC------CCCCccCCCCHHHHHHhccCccchhHHHHHHHHHHHHCHHHH
Confidence 999877532 222222223344454466555 56664211 11257899999 99999999999999999
Q ss_pred chhhhhhhhh-hHHHHHh-hhccCCCchhhHHHHHHHHhcCcchHHHHhh-hhcCCCCccccccc
Q psy2731 314 GEAERHFDAL-SGLKDLK-QAVSSTGPLWDSKRNALDFQKGRHYQCKIER-LKCGSAILFIYGYD 375 (537)
Q Consensus 314 ~~~~~~~~~l-~y~~~~~-~~~~t~~~~~df~~~l~~~~~~~~~~~~~~~-l~~~G~P~~~~~~~ 375 (537)
+++| .|+++|+ +++++ +||++++.+.++.. ...+|++ ++++|+|.+.++++
T Consensus 399 ------~~glr~Yl~~~~~~nat~----~Dl~~~l~~~sg~d-l~~~~~~W~~~~G~P~l~v~~~ 452 (831)
T TIGR02412 399 ------FAGVNAYFKRHAFGNATL----DDLIDSLAKASGRD-LSAWSDAWLETAGVNTLTPEIT 452 (831)
T ss_pred ------HHHHHHHHHHcCCCCCCH----HHHHHHHHHHhCCC-HHHHHHHHHcCCCCceEEEEEE
Confidence 9999 9999999 99999 99999999998643 2357888 99999999987653
No 5
>PRK14015 pepN aminopeptidase N; Provisional
Probab=100.00 E-value=6.6e-59 Score=524.63 Aligned_cols=316 Identities=17% Similarity=0.192 Sum_probs=253.0
Q ss_pred ccccCcceecccCCCCCCceEEEEEEEeCCC-c-ccccccccccccCCCCCcceEEEEcccccccceeEEEEeccceec-
Q psy2731 6 LKKYLAEFALQSIDTDNFKAHLTSHFAHKPE-I-NQIEWDLWLNTTGMPPHIPKYSFYQPIKVPSYLVAIVVGNLASYK- 82 (537)
Q Consensus 6 l~~~~Ar~~fPC~D~p~~ka~f~l~i~~p~~-~-~~~Sn~~~~~~~~~~~~~~~~~F~~t~pip~yl~a~avg~~~~~~- 82 (537)
.||..||+||||+|+|++||+|+++|++|++ | .++|||.+.......+++++++|+.++|||+||+||+||+|+..+
T Consensus 125 ~Ep~gAR~~fPc~D~P~~KAtf~itI~~p~~~~~~~lSNG~l~~~~~~~~g~~~~~w~~~~PmpsYL~Al~aGdf~~~~d 204 (875)
T PRK14015 125 CEAEGFRRITYFLDRPDVLARYTVRIEADKAKYPVLLSNGNLVESGELPDGRHWATWEDPFPKPSYLFALVAGDLDVLED 204 (875)
T ss_pred ccccCcCCcccCCCCCCCCeeEEEEEEEccccCeEEecCCccccceeccCCeEEEEEEeCCCcccceEEEEEeCCEEEEE
Confidence 5899999999999999999999999999995 8 568999988766567889999999999999999999999999865
Q ss_pred -------cCCceEEecCcchHHHHHHHHHHHHHHHHHHHHHhc-CCCCCcccEEEeCCCCCCcccccccceeeccccchh
Q psy2731 83 -------ISERCSVWSEPELVKEAADEFNETEKFLSTAEEICG-TYVWGVYDLVMLPPSFPFGGMENPYDTSLQDVCNDL 154 (537)
Q Consensus 83 -------~g~~~~v~~~p~~~~~~~~~~~~~~~~l~~~e~~~G-pYp~~k~d~v~lpp~f~~ggmEn~gltf~~~~~~~~ 154 (537)
.++++++|++|+..+++.++++.++++|+++|++|| ||||++||+|++ |+|++|||||||++.+
T Consensus 205 ~~~~~~g~~vpl~iy~~p~~~~~~~~al~~~~~~L~~~E~~FG~pYP~~k~diVav-p~f~~GaMEN~Gl~~f------- 276 (875)
T PRK14015 205 TFTTRSGREVALEIYVEPGNLDKCDHAMDSLKKSMKWDEERFGLEYDLDIFMIVAV-DDFNMGAMENKGLNIF------- 276 (875)
T ss_pred EeeccCCCeEEEEEEEeCCcHHHHHHHHHHHHHHHHHHHHHhCCCCChhhhCEEeC-CCCCCccccccccccc-------
Confidence 136889999999999999999999999999999999 899999999998 6999999999997542
Q ss_pred hhhccccCCCCCCCCccccccccCCchhhhhhhccccccccCCcchhhhHhhhhhhhhccCcccCCCCchhhHHHHHHHH
Q psy2731 155 ANRWISWNHTKSPPFSKQDLAAFTPGQKIEFLAKSSSYILAGDGSLAAVVAHEISHSWTGNLVTNRNFEHFWLNEGFTMF 234 (537)
Q Consensus 155 ~e~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~~~~~~~~~~~~~viaHEiAHqWfGnlVt~~~w~~~WLnEGfa~y 234 (537)
++.+++.++.. ....+...+..+||||+|||||||+||++||+++|||||||+|
T Consensus 277 ~~~~lL~~~~~--------------------------~t~~~~~~i~~vIaHElaHqWFGNlVT~~~W~dLWLnEGFAty 330 (875)
T PRK14015 277 NSKYVLADPET--------------------------ATDADYERIESVIAHEYFHNWTGNRVTCRDWFQLSLKEGLTVF 330 (875)
T ss_pred ccceEecCccc--------------------------CCHHHHHHHHHHHHHHHHHHHHhCcceecchhhhhhhhHHHHH
Confidence 23333322211 0012233577899999999999999999999999999999999
Q ss_pred HHHHHHhhhcCchhHHHHHHHhHHHHHHHHccCcchhhhhhhccCCCccc--ccccCcCCccch--hhHHHHHHHHHhcc
Q psy2731 235 VERKITGRLRGEAERHFDALSGLKDLKQAAGDGSLAAVVAHEISHSWTGN--LVTNRNFEHFWL--NEGFTMFVERKITG 310 (537)
Q Consensus 235 ~e~~~~~~~~g~~~~~~~~~~~~~~l~~~~~~d~~~~~~~~e~~~pl~g~--~~~~~~f~~i~Y--G~~~l~~L~~~lG~ 310 (537)
++..+.+...+....++......+ ..++..|.... ++|+... ...++.|+.+.| |+++||||+..||+
T Consensus 331 ~e~~~~~~~~~~~~~~~~~~~~l~--~~~~~~D~~~~------a~pi~p~~~~~i~~~f~~~~Y~KGA~vLrMLr~~lGd 402 (875)
T PRK14015 331 RDQEFSADLGSRAVKRIEDVRVLR--AAQFAEDAGPM------AHPVRPDSYIEINNFYTATVYEKGAEVIRMLHTLLGE 402 (875)
T ss_pred HHHHHHHHhhhHHHHHHHHHHHHh--hhccccccccc------CCCCCCcchhhHHhcccchhhhHHHHHHHHHHHHhCH
Confidence 998887776654433333222111 11232233322 4554311 122357888999 99999999999999
Q ss_pred cccchhhhhhhhh-hHHHHHh-hhccCCCchhhHHHHHHHHhcCcchHHHHhhhhcCCCCcccccc
Q psy2731 311 RLRGEAERHFDAL-SGLKDLK-QAVSSTGPLWDSKRNALDFQKGRHYQCKIERLKCGSAILFIYGY 374 (537)
Q Consensus 311 ~~f~~~~~~~~~l-~y~~~~~-~~~~t~~~~~df~~~l~~~~~~~~~~~~~~~l~~~G~P~~~~~~ 374 (537)
+.| ++++ .|+++|+ +++++ +||+++++++++... ..|+.+++++|+|.+.++.
T Consensus 403 e~F------~~gLr~Yl~~~~~~~at~----~Df~~ale~asg~DL-~~f~~W~~q~G~P~l~v~~ 457 (875)
T PRK14015 403 EGF------RKGMDLYFERHDGQAVTC----EDFVAAMEDASGRDL-SQFRRWYSQAGTPRVTVSD 457 (875)
T ss_pred HHH------HHHHHHHHHHhCCCCCCH----HHHHHHHHHHhCCCH-HHHHHHHcCCCCCeEEEEE
Confidence 999 9999 9999999 99999 999999999986532 2344349999999998854
No 6
>TIGR02414 pepN_proteo aminopeptidase N, Escherichia coli type. The M1 family of zinc metallopeptidases contains a number of distinct, well-separated clades of proteins with aminopeptidase activity. Several are designated aminopeptidase N, EC 3.4.11.2, after the Escherichia coli enzyme, suggesting a similar activity profile. This family consists of all aminopeptidases closely related to E. coli PepN and presumed to have similar (not identical) function. Nearly all are found in Proteobacteria, but members are found also in Cyanobacteria, plants, and apicomplexan parasites. This family differs greatly in sequence from the family of aminopeptidases typified by Streptomyces lividans PepN (TIGR02412), from the membrane bound aminopeptidase N family in animals, etc.
Probab=100.00 E-value=2.1e-58 Score=519.15 Aligned_cols=316 Identities=16% Similarity=0.170 Sum_probs=254.9
Q ss_pred ccccCcceecccCCCCCCceEEEEEEEeCCC-cc-cccccccccccCCCCCcceEEEEcccccccceeEEEEeccceecc
Q psy2731 6 LKKYLAEFALQSIDTDNFKAHLTSHFAHKPE-IN-QIEWDLWLNTTGMPPHIPKYSFYQPIKVPSYLVAIVVGNLASYKI 83 (537)
Q Consensus 6 l~~~~Ar~~fPC~D~p~~ka~f~l~i~~p~~-~~-~~Sn~~~~~~~~~~~~~~~~~F~~t~pip~yl~a~avg~~~~~~~ 83 (537)
.||..||++|||||+|++||+|+++|++|++ |. ++|||.++......+++++++|+.++|||+||+||+||+|+..+.
T Consensus 112 ~Ep~gaR~ifpc~DeP~~kAtf~vtI~~p~~~y~v~lSNg~~~~~~~~~~g~~~~~f~~t~pmptYLfA~vaGdf~~~~~ 191 (863)
T TIGR02414 112 CEAEGFRRITYFPDRPDVMSRYTVTITADKKKYPVLLSNGNKIASGELPDGRHWAEWEDPFPKPSYLFALVAGDLDVLED 191 (863)
T ss_pred ecCCCCCcCCCCCCCCCCceEEEEEEEECCCcceEEEeCCccccceecCCCeEEEEEeCCCCcChhHheEEEeCCEEEEE
Confidence 5899999999999999999999999999996 75 579998876656677889999999999999999999999998652
Q ss_pred --------CCceEEecCcchHHHHHHHHHHHHHHHHHHHHHhc-CCCCCcccEEEeCCCCCCcccccccceeeccccchh
Q psy2731 84 --------SERCSVWSEPELVKEAADEFNETEKFLSTAEEICG-TYVWGVYDLVMLPPSFPFGGMENPYDTSLQDVCNDL 154 (537)
Q Consensus 84 --------g~~~~v~~~p~~~~~~~~~~~~~~~~l~~~e~~~G-pYp~~k~d~v~lpp~f~~ggmEn~gltf~~~~~~~~ 154 (537)
++++++|++|+..+++.++++.++++|+++|++|| |||+++||+|++ |+|++||||||||+.+
T Consensus 192 ~~~t~sg~~v~l~iy~~p~~~~~~~~al~~~~~~L~~~E~~fG~pYPl~k~diVav-pdf~~GaMEN~GLi~f------- 263 (863)
T TIGR02414 192 TFTTKSGREVALRVYVEEGNKDKCDHAMESLKKAMKWDEEVFGLEYDLDIFMIVAV-DDFNMGAMENKGLNIF------- 263 (863)
T ss_pred EeeccCCCceEEEEEEccCcHHHHHHHHHHHHHHHHHHHHHhCCCCChhhccEEec-CCCCCccccccceecc-------
Confidence 36789999999999999999999999999999999 899999999999 6999999999997542
Q ss_pred hhhccccCCCCCCCCccccccccCCchhhhhhhccccccccCCcchhhhHhhhhhhhhccCcccCCCCchhhHHHHHHHH
Q psy2731 155 ANRWISWNHTKSPPFSKQDLAAFTPGQKIEFLAKSSSYILAGDGSLAAVVAHEISHSWTGNLVTNRNFEHFWLNEGFTMF 234 (537)
Q Consensus 155 ~e~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~~~~~~~~~~~~~viaHEiAHqWfGnlVt~~~w~~~WLnEGfa~y 234 (537)
++.++++++.. ....+...+..+||||+|||||||+||++||.++|||||||+|
T Consensus 264 ~e~~lL~~~~~--------------------------~td~~~~~i~~VIaHElaHqWfGNlVT~~~W~~LWLnEGfAty 317 (863)
T TIGR02414 264 NSKYVLADPET--------------------------ATDADYERIESVIAHEYFHNWTGNRVTCRDWFQLSLKEGLTVF 317 (863)
T ss_pred ccceEEeCCCC--------------------------CCHHHHHHHHHHHHHHHHHHHhcceeeecchhhhhhhhhHHHH
Confidence 33334333321 0012334577899999999999999999999999999999999
Q ss_pred HHHHHHhhhcCchhHHHHHHHhHHHHHHHHccCcchhhhhhhccCCCccc--ccccCcCCccch--hhHHHHHHHHHhcc
Q psy2731 235 VERKITGRLRGEAERHFDALSGLKDLKQAAGDGSLAAVVAHEISHSWTGN--LVTNRNFEHFWL--NEGFTMFVERKITG 310 (537)
Q Consensus 235 ~e~~~~~~~~g~~~~~~~~~~~~~~l~~~~~~d~~~~~~~~e~~~pl~g~--~~~~~~f~~i~Y--G~~~l~~L~~~lG~ 310 (537)
++..+.....+....++......+. ..+..|.... .+|+... ...++.|+.+.| |+++++||+..||+
T Consensus 318 ~e~~~~~~~~~~~~~~~~~~~~lr~--~~f~~D~~p~------~~Pi~~~~~~~i~~~y~~i~Y~KGA~vLrML~~~LGe 389 (863)
T TIGR02414 318 RDQEFSADMTSRAVKRIEDVRLLRA--HQFPEDAGPM------AHPVRPESYVEINNFYTATVYEKGAEVIRMLHTLLGE 389 (863)
T ss_pred HHHHHHHHhhhHHHHHHHHHHHHHh--hhhccccccc------CCCCCCcchhhHHhccchHHhHHHHHHHHHHHHHhCH
Confidence 9988877776654444433222221 1233344433 5565311 122367889999 99999999999999
Q ss_pred cccchhhhhhhhh-hHHHHHh-hhccCCCchhhHHHHHHHHhcCcchHHHHhhhhcCCCCcccccc
Q psy2731 311 RLRGEAERHFDAL-SGLKDLK-QAVSSTGPLWDSKRNALDFQKGRHYQCKIERLKCGSAILFIYGY 374 (537)
Q Consensus 311 ~~f~~~~~~~~~l-~y~~~~~-~~~~t~~~~~df~~~l~~~~~~~~~~~~~~~l~~~G~P~~~~~~ 374 (537)
+.| +++| .|+++|+ +++++ +||+++|++.++.... .|..+++++|+|.+.++.
T Consensus 390 e~F------~~gLr~Yl~r~~~~~at~----~Df~~ale~asg~dL~-~f~~W~~q~G~P~v~v~~ 444 (863)
T TIGR02414 390 EGF------RKGMDLYFSRHDGQAVTC----EDFVAAMEDASGRDLN-QFRRWYSQAGTPVLEVKE 444 (863)
T ss_pred HHH------HHHHHHHHHHhCCCCCCH----HHHHHHHHHHhCCCHH-HHHHHHcCCCCceeEEEE
Confidence 999 9999 9999999 99999 9999999999865322 344339999999998854
No 7
>COG0308 PepN Aminopeptidase N [Amino acid transport and metabolism]
Probab=100.00 E-value=1.5e-55 Score=501.25 Aligned_cols=319 Identities=22% Similarity=0.286 Sum_probs=265.8
Q ss_pred ccccCcceecccCCCCCCceEEEEEEEeCCCcccccccccccccCCCCCcceEEEEcccccccceeEEEEeccceecc--
Q psy2731 6 LKKYLAEFALQSIDTDNFKAHLTSHFAHKPEINQIEWDLWLNTTGMPPHIPKYSFYQPIKVPSYLVAIVVGNLASYKI-- 83 (537)
Q Consensus 6 l~~~~Ar~~fPC~D~p~~ka~f~l~i~~p~~~~~~Sn~~~~~~~~~~~~~~~~~F~~t~pip~yl~a~avg~~~~~~~-- 83 (537)
.|+..||+||||+|+|+.||+|+++|..++++.++|||+........+++.+++|.+++|||+||+|+++|+|...++
T Consensus 140 ~Ea~~aR~~fpc~D~P~~katf~~~i~~~k~~~~iSN~~~~~~~~~~~g~~~~~f~~~~~mptYL~al~~G~~~~~~~~~ 219 (859)
T COG0308 140 CEAEGARRIFPCIDEPDVKATFTLTIRADKGPKLISNGNLIDGGTLVDGRKIVKFEDTPPMPTYLFALVAGDLEVFRDKF 219 (859)
T ss_pred cccCCCceeeecCCCCCCcceeEEEEEecCcceeeecCCccccccccCCcEEEEEcCCCCcchHhhheeeecceeeeeee
Confidence 478999999999999999999999999999999999999988877778899999999999999999999999987652
Q ss_pred -----CCceEEecCcchHHHHHHHHHHHHHHHHHHHHHhc-CCCCCcccEEEeCCCCCCcccccccceeeccccchhhhh
Q psy2731 84 -----SERCSVWSEPELVKEAADEFNETEKFLSTAEEICG-TYVWGVYDLVMLPPSFPFGGMENPYDTSLQDVCNDLANR 157 (537)
Q Consensus 84 -----g~~~~v~~~p~~~~~~~~~~~~~~~~l~~~e~~~G-pYp~~k~d~v~lpp~f~~ggmEn~gltf~~~~~~~~~e~ 157 (537)
++++.||++++...++.+.++.+.++++++|++|| |||+++ ++|.+ |+|+.|||||||++.+ ++.
T Consensus 220 ~~~~~~v~l~iy~~~g~~~~a~~~~~~~~~~~~~~e~~fg~~y~l~~-~~V~v-~~f~~GaMEN~Gl~tf-------~~~ 290 (859)
T COG0308 220 DTRSRDVPLEIYVPPGVLDRAKYALDETKRSIEFYEEYFGLPYALPI-DIVAV-PDFSAGAMENWGLVTF-------REK 290 (859)
T ss_pred ccCCCCeeEEEEecCcchhhhhhhHHHHHHHhhhHHHhcCCCCCCcc-cEEec-cCCCCccccccceeEE-------eee
Confidence 56789999998899999999999999999999999 999999 88888 6999999999997442 222
Q ss_pred ccccCCCCCCCCccccccccCCchhhhhhhccccccccCCcchhhhHhhhhhhhhccCcccCCCCchhhHHHHHHHHHHH
Q psy2731 158 WISWNHTKSPPFSKQDLAAFTPGQKIEFLAKSSSYILAGDGSLAAVVAHEISHSWTGNLVTNRNFEHFWLNEGFTMFVER 237 (537)
Q Consensus 158 ~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~~~~~~~~~~~~~viaHEiAHqWfGnlVt~~~w~~~WLnEGfa~y~e~ 237 (537)
+++ +.|. . ........+..+|+||+|||||||+||++||+++|||||||+|++.
T Consensus 291 ~ll----------------~~~~---------~-at~~~~~~~~~viaHElaHqWfGnlVT~~~W~~lWLnEgfat~~e~ 344 (859)
T COG0308 291 YLL----------------ADPE---------T-ATDSDYENVEEVIAHELAHQWFGNLVTMKWWDDLWLNEGFATFREV 344 (859)
T ss_pred EEe----------------eCcc---------c-chhHHHHHHHHHHHHHHhhhcccceeeccCHHHHHHhhhhHHHHHH
Confidence 232 3320 0 1112225667899999999999999999999999999999999999
Q ss_pred HHHhhhcCchhHHHHHHHhHHHHHHHHccCcchhhhhhhccCCCccc----ccccCcCCccch--hhHHHHHHHHHhccc
Q psy2731 238 KITGRLRGEAERHFDALSGLKDLKQAAGDGSLAAVVAHEISHSWTGN----LVTNRNFEHFWL--NEGFTMFVERKITGR 311 (537)
Q Consensus 238 ~~~~~~~g~~~~~~~~~~~~~~l~~~~~~d~~~~~~~~e~~~pl~g~----~~~~~~f~~i~Y--G~~~l~~L~~~lG~~ 311 (537)
++.+.++|.....+...... ....++..|+... +||+.-. ..+++.|+.+.| |++++|||+..+|++
T Consensus 345 ~~~~~~~~~~~~~~~~~~~~-~~~~~~~~D~~~~------~hPi~~~~~~~~ei~~~fD~i~Y~KGs~vlrml~~~lG~e 417 (859)
T COG0308 345 LWSEDLGGRAWKRWEDFRTL-RTSIALAEDSLPS------SHPIRVDVYDPKEINDFFDAIVYEKGASVLRMLETLLGEE 417 (859)
T ss_pred HHHHHhcchHHHHHHHHHHH-hhhHHHhhccccc------cCCcccCCCCccchhhhcchhhcchhHHHHHHHHHHHCHH
Confidence 99999998443333332221 1222666677776 6666511 233588999999 999999999999999
Q ss_pred ccchhhhhhhhh-hHHHHHh-hhccCCCchhhHHHHHHHHhcCcchHHHHhh-hhcCCCCccccccccc
Q psy2731 312 LRGEAERHFDAL-SGLKDLK-QAVSSTGPLWDSKRNALDFQKGRHYQCKIER-LKCGSAILFIYGYDTS 377 (537)
Q Consensus 312 ~f~~~~~~~~~l-~y~~~~~-~~~~t~~~~~df~~~l~~~~~~~~~~~~~~~-l~~~G~P~~~~~~~~~ 377 (537)
.| .+++ .|+++|+ +++++ +||++++.+.+|.... .+|+. +.++|+|++.++.+..
T Consensus 418 ~F------~kgl~~yf~~h~~~~~~~----~Dl~~a~~~~sg~dl~-~~~~~w~~q~G~P~l~v~~~~~ 475 (859)
T COG0308 418 AF------RKGLSLYFKRHAGGNATT----MDLWKALEDASGKDLS-AFFESWLSQAGYPVLTVSVRYD 475 (859)
T ss_pred HH------HHHHHHHHHhcCCCCCCH----HHHHHHHHHHhCCcHH-HHHHHHHhCCCCCceeeeeecc
Confidence 99 9999 9999999 99999 9999999999975432 36777 9999999999875433
No 8
>PF01433 Peptidase_M1: Peptidase family M1 This is family M1 in the peptidase classification.; InterPro: IPR014782 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M1 (clan MA(E)), the type example being aminopeptidase N from Homo sapiens (Human). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Membrane alanine aminopeptidase (3.4.11.2 from EC) is part of the HEXXH+E group; it consists entirely of aminopeptidases, spread across a wide variety of species []. Functional studies show that CD13/APN catalyzes the removal of single amino acids from the amino terminus of small peptides and probably plays a role in their final digestion; one family member (leukotriene-A4 hydrolase) is known to hydrolyse the epoxide leukotriene-A4 to form an inflammatory mediator []. This hydrolase has been shown to have aminopeptidase activity [], and the zinc ligands of the M1 family were identified by site-directed mutagenesis on this enzyme [] CD13 participates in trimming peptides bound to MHC class II molecules [] and cleaves MIP-1 chemokine, which alters target cell specificity from basophils to eosinophils []. CD13 acts as a receptor for specific strains of RNA viruses (coronaviruses) which cause a relatively large percentage of upper respiratory trace infections. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding; PDB: 2XQ0_A 2XPY_A 2XPZ_A 3SE6_B 3EBH_A 3EBG_A 3T8V_A 3Q44_A 3Q43_A 3EBI_A ....
Probab=100.00 E-value=2.7e-51 Score=432.30 Aligned_cols=250 Identities=25% Similarity=0.398 Sum_probs=202.0
Q ss_pred cccccCcceecccCCCCCCceEEEEEEEeCCCcccccccccccccCCCCCcceEEEEcccccccceeEEEEeccceec--
Q psy2731 5 WLKKYLAEFALQSIDTDNFKAHLTSHFAHKPEINQIEWDLWLNTTGMPPHIPKYSFYQPIKVPSYLVAIVVGNLASYK-- 82 (537)
Q Consensus 5 ~l~~~~Ar~~fPC~D~p~~ka~f~l~i~~p~~~~~~Sn~~~~~~~~~~~~~~~~~F~~t~pip~yl~a~avg~~~~~~-- 82 (537)
-++|++||+||||||+|++||+|+++|+||++++|+|||++......++++.+++|+.++|||+|++||+||+|...+
T Consensus 129 ~~~p~~ar~~fPc~D~p~~ka~f~~~i~~p~~~~~~sng~~~~~~~~~~~~~~~~f~~t~p~~~yl~a~~vg~~~~~~~~ 208 (390)
T PF01433_consen 129 QFEPNGARRWFPCFDEPSFKATFDLTITHPKDYTALSNGPLEEEESNDDGWKTTTFETTPPMPTYLFAFAVGDFESVEVT 208 (390)
T ss_dssp E-TTTTGGGTSSB--STTSEEEEEEEEEEETTTEEEESSEEEEEEEETTTEEEEEEEEEEEEEGGG--EEEESEEEEEEE
T ss_pred cccccccceeeeeeccCCccceEEEeeeccccceeeccccccccccccccceeEeeecccccCchhhhhhcCcccccccc
Confidence 368999999999999999999999999999999999999999888778899999999999999999999999999876
Q ss_pred -c-CCceEEecCcchHHHHHHHHHHHHHHHHHHHHHhc-CCCCCcccEEEeCCCCCCcccccccceeeccccchhhhhcc
Q psy2731 83 -I-SERCSVWSEPELVKEAADEFNETEKFLSTAEEICG-TYVWGVYDLVMLPPSFPFGGMENPYDTSLQDVCNDLANRWI 159 (537)
Q Consensus 83 -~-g~~~~v~~~p~~~~~~~~~~~~~~~~l~~~e~~~G-pYp~~k~d~v~lpp~f~~ggmEn~gltf~~~~~~~~~e~~~ 159 (537)
. |+++++|++|+..++++..++.+.++|++++++|| |||++++|+|++ |+|+.|||||||++++ +|..+
T Consensus 209 ~~~~~~v~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~g~~yp~~k~~~v~~-p~~~~~~me~~g~i~~-------~~~~l 280 (390)
T PF01433_consen 209 TKSGVPVRVYARPGDEEQLQFALDIAPKALEYYEEYFGIPYPFKKLDIVAV-PDFPFGGMENWGLITY-------RESYL 280 (390)
T ss_dssp TTTEEEEEEEEECTCGGGHHHHHHHHHHHHHHHHHHHTS--SSSEEEEEEE-ST-SSSEE--TTEEEE-------EGGGT
T ss_pred cccccchheeehhhhHHHHHHHHHhhHHHHHHHHhhccccceecceeEEEE-eccccccccccccccc-------ccccc
Confidence 2 36899999999999999999999999999999999 999999999999 5999999999998542 45556
Q ss_pred ccCCCCCCCCccccccccCCchhhhhhhccccccccCCcchhhhHhhhhhhhhccCcccCCCCchhhHHHHHHHHHHHHH
Q psy2731 160 SWNHTKSPPFSKQDLAAFTPGQKIEFLAKSSSYILAGDGSLAAVVAHEISHSWTGNLVTNRNFEHFWLNEGFTMFVERKI 239 (537)
Q Consensus 160 ~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~~~~~~~~~~~~~viaHEiAHqWfGnlVt~~~w~~~WLnEGfa~y~e~~~ 239 (537)
+++++.+ ....+..+..+||||+|||||||+||+++|+|.||+||||+|++.++
T Consensus 281 ~~~~~~~--------------------------~~~~~~~~~~~iahElahqWfGn~vt~~~w~d~WL~Eg~a~y~~~~~ 334 (390)
T PF01433_consen 281 LYDPDIS--------------------------TIGDKQEIASLIAHELAHQWFGNLVTPKWWSDLWLNEGFATYLEYLI 334 (390)
T ss_dssp S-STTTS---------------------------HHHHHHHHHHHHHHHHTTTBTTTEEESSGGGHHHHHHHHHHHHHHH
T ss_pred ccCcccc--------------------------cchhhhhhHHHHHHHHHHHHhccCCccccchhhhHHHHHHHHHHHHh
Confidence 5554321 11233467889999999999999999999999999999999999999
Q ss_pred HhhhcCchhHHHHHHHhHHHHHHHHccCcchhhhhhhccCCCccc----ccccCcCCccch
Q psy2731 240 TGRLRGEAERHFDALSGLKDLKQAAGDGSLAAVVAHEISHSWTGN----LVTNRNFEHFWL 296 (537)
Q Consensus 240 ~~~~~g~~~~~~~~~~~~~~l~~~~~~d~~~~~~~~e~~~pl~g~----~~~~~~f~~i~Y 296 (537)
+++++|.. ++...+..+.++.++..|.... .+|+... ....+.|+.++|
T Consensus 335 ~~~~~~~~--~~~~~~~~~~~~~~~~~d~~~~------~~pl~~~~~~~~~~~~~f~~~~Y 387 (390)
T PF01433_consen 335 LEKLFGEW--QMMELFLVQEMQRALREDALPN------SHPLSSEVEDPSDIDDMFDDISY 387 (390)
T ss_dssp HHHHHGHH--HHHHHHHHHHHHHHHHHHTSTT------CCCSSSSSSSESCGGGGSSHHHH
T ss_pred HhhccCcc--cchhhhhhhhHHHHHHHhhcCC------CcceEeCCCCCCChHHhcCcccc
Confidence 99999953 3455556666777787777665 5666411 123367888888
No 9
>PF09127 Leuk-A4-hydro_C: Leukotriene A4 hydrolase, C-terminal; InterPro: IPR015211 This C-terminal domain is found in peptidases belonging to MEROPS peptidase family M1, particularly: aminopeptidase-1 of Caenorhabditis elegans, aminopeptidase O, aminopeptidase B and the bifunctional leukotriene A4 hydrolase/aminopeptidase. The domain adopts a structure consisting of two layers of parallel alpha-helices, five in the inner layer and four in the outer, arranged in an antiparallel manner, with perpendicular loops containing short helical segments on top. It is required for the formation of a deep cleft harbouring the catalytic Zn2+ site in leukotriene A4 hydrolase []. ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding, 0019370 leukotriene biosynthetic process; PDB: 3FUJ_A 3FU3_A 3FTX_A 3FTS_A 3B7R_L 2VJ8_A 3FTW_A 3FUF_A 3FU0_A 3CHO_A ....
Probab=100.00 E-value=2.1e-42 Score=311.16 Aligned_cols=143 Identities=39% Similarity=0.720 Sum_probs=127.1
Q ss_pred chhHHHHHHHHHHhhcCCCCCCCCChhhcccCChhcHHHHHHHhhccCCCCHHHHHHHHhhcCcCCCCcHHHHHHHHHHH
Q psy2731 377 SLQDVCNDLANRWISWNHTKETPFSKQDLAAFTPGQKIEFLAILLDKEMYDLPKVKSLQDVYRFNGVRNCEIRFRWLKLC 456 (537)
Q Consensus 377 ~~~~~~~~l~~~w~~~~~~~~~~~~~~~~~~~~~~q~~~FL~~l~~~~~l~~~~l~~ld~~y~l~~~~N~Ei~~rW~~l~ 456 (537)
++.+.|.+|+++|++.... ..+++..|+++|+++|+++||++|.+.+++++++|++||++|+|++++|+||++|||++|
T Consensus 1 sL~~~~~~La~~W~~~~~~-~~~~~~~d~~~~~~~Q~~~FL~~L~~~~~l~~~~l~~Ld~~y~l~~s~NaEI~~rW~~l~ 79 (143)
T PF09127_consen 1 SLADQCYALAEKWISADED-KSPFSAEDIKDWSSNQWVVFLDQLLEPKPLSPEKLQALDKVYKLSNSKNAEIRFRWLRLA 79 (143)
T ss_dssp CCCHHHHHHHHHHHHHHCT-CCC-SGGGGTT--HHHHHHHHHHHHTCCG-CHHHHHHHHHHHCHCT-SSHHHHHHHHHHH
T ss_pred ChHHHHHHHHHHHHhcccc-cCCCCHHHHHhCCHHHHHHHHHHhccCCCCCHHHHHHHHHHhCCCCCCCHHHHHHHHHHH
Confidence 4789999999999987543 446889999999999999999999887789999999999999999999999999999999
Q ss_pred HhCcccchhHHHHHhHhccCccccccHHHHHHhcCccCHHHHHHHHHHhhhcccHHHHHHHHhhc
Q psy2731 457 LKARWKEQVPHVIDMVTSQGRMKYVRPLYRELYAWEDTRQTAIDTFKQHRKQMMYVTAYTLAKDL 521 (537)
Q Consensus 457 i~~~~~~~~~~~~~fl~~~GRmk~~~PlYr~L~~~~~~r~~A~~~f~~~~~~yhpi~~~~v~k~l 521 (537)
|+++|+++++.|.+||+++|||||||||||+|++ +.+|++|+++|+++|++|||||+++|+|||
T Consensus 80 i~~~~~~~~~~v~~fL~~~GRmKfvrPlYr~L~~-~~~~~~A~~~F~~~k~~YHpi~~~~V~kdL 143 (143)
T PF09127_consen 80 IKAKYEPALPQVEEFLGSQGRMKFVRPLYRALAK-PEGRDLAKETFEKAKPFYHPITRQMVEKDL 143 (143)
T ss_dssp HHTT-GGGHHHHHHHHHHS--HHHHHHHHHHHHT-TTCHHHHHHHHHHHGGGS-HHHHHHHHHHT
T ss_pred HhcCcHHHHHHHHHHHHHcCCChhHHHHHHHHHc-cchHHHHHHHHHHHHHhcCHHHHHHHHHhC
Confidence 9999999999999999999999999999999999 789999999999999999999999999997
No 10
>KOG1932|consensus
Probab=99.95 E-value=2.1e-27 Score=260.35 Aligned_cols=214 Identities=14% Similarity=0.209 Sum_probs=180.1
Q ss_pred ccCcceecccCCCCCCceEEEEEEEeCCCcccccccccccccCCCC-CcceEEEEcccccccceeEEEEeccceec--cC
Q psy2731 8 KYLAEFALQSIDTDNFKAHLTSHFAHKPEINQIEWDLWLNTTGMPP-HIPKYSFYQPIKVPSYLVAIVVGNLASYK--IS 84 (537)
Q Consensus 8 ~~~Ar~~fPC~D~p~~ka~f~l~i~~p~~~~~~Sn~~~~~~~~~~~-~~~~~~F~~t~pip~yl~a~avg~~~~~~--~g 84 (537)
+..||.||||.|.+..+|||++.+++|+.+.++|+|.+..+...+| ...+++|..+.|+++.-+|||||+|+... .+
T Consensus 182 ~s~ar~WfPCvD~~~e~~tWeLeftvp~~~~av~~geLl~~v~~~D~~Kkt~~ys~tvPvA~~~I~~AiG~F~~~~~P~~ 261 (1180)
T KOG1932|consen 182 SSSARSWFPCVDSSYERCTWELEFTVPKNLVAVSCGELLEQVETPDLRKKTYHYSLTVPVAPSNIGFAIGPFKSYVEPSM 261 (1180)
T ss_pred ccccceEEeecCCccccceEEEEEEecccceeccchhhhheeecccccccEEEEEEeccCCccccceeeccccccCCCcc
Confidence 3569999999999999999999999999999999999987766666 46899999999999999999999999874 36
Q ss_pred CceEEecCcchHHHHHHHHHHHHHHHHHHHHHhc-CCCCCcccEEEeCCCCCCcccccccceeeccccchhhhhccccCC
Q psy2731 85 ERCSVWSEPELVKEAADEFNETEKFLSTAEEICG-TYVWGVYDLVMLPPSFPFGGMENPYDTSLQDVCNDLANRWISWNH 163 (537)
Q Consensus 85 ~~~~v~~~p~~~~~~~~~~~~~~~~l~~~e~~~G-pYp~~k~d~v~lpp~f~~ggmEn~gltf~~~~~~~~~e~~~~~~~ 163 (537)
+.+..|+.|+...-.....-.+.++++|+|+.+| .|||+.|.+|++|+. +.--|..+.|.+.+..
T Consensus 262 ~~i~~f~LP~~~~~v~nt~~~l~k~iefye~~ls~rYPF~~~k~VFvd~~-~~~i~~~asl~I~st~------------- 327 (1180)
T KOG1932|consen 262 IDITHFCLPGLEPLVKNTTVYLHKAIEFYEEELSSRYPFSCYKTVFVDEA-AVEISSYASLSIFSTS------------- 327 (1180)
T ss_pred CcceeEecCcchHHhhhHHHHHHHHHHHHHHHhccCCCcceeeEEEecCC-cceeeecceeeeeecc-------------
Confidence 7899999999998888888999999999999999 799999999999633 3344555555442211
Q ss_pred CCCCCCccccccccCCchhhhhhhccccccccCCcchhhhHhhhhhhhhccCcccCCCCchhhHHHHHHHHHHHHHHhhh
Q psy2731 164 TKSPPFSKQDLAAFTPGQKIEFLAKSSSYILAGDGSLAAVVAHEISHSWTGNLVTNRNFEHFWLNEGFTMFVERKITGRL 243 (537)
Q Consensus 164 ~~~~~~~~~~~~l~~p~~~~~~~~~~~~~~~~~~~~~~~viaHEiAHqWfGnlVt~~~w~~~WLnEGfa~y~e~~~~~~~ 243 (537)
+|.+- .+|+.-......+|--||-||||.++|+..|.|+||.+|+|.|+..+++++.
T Consensus 328 -----------lLy~~------------~iIDq~~~tr~~La~aLA~Q~fg~yIsp~~wsD~Wl~~GiagYl~~l~~kk~ 384 (1180)
T KOG1932|consen 328 -----------LLYSK------------NIIDQTFLTRRKLAWALASQWFGVYISPVDWSDFWLLKGIAGYLTGLFVKKF 384 (1180)
T ss_pred -----------ccchH------------hhhhHHHHHHHHHHHHHHHhhhEEEeeccchhhhHHHHhHHHHHHHHHHHHH
Confidence 11222 4455555678899999999999999999999999999999999999999999
Q ss_pred cCchhHHHHHHHhHH
Q psy2731 244 RGEAERHFDALSGLK 258 (537)
Q Consensus 244 ~g~~~~~~~~~~~~~ 258 (537)
.|.++++++-....+
T Consensus 385 lGNNEyry~lKk~~d 399 (1180)
T KOG1932|consen 385 LGNNEYRYQLKKALD 399 (1180)
T ss_pred hCchHHHHHHHHHHH
Confidence 999988776444333
No 11
>PF13485 Peptidase_MA_2: Peptidase MA superfamily
Probab=99.09 E-value=1.2e-10 Score=102.38 Aligned_cols=97 Identities=30% Similarity=0.423 Sum_probs=63.5
Q ss_pred cchhhhHhhhhhhhhccCcccCCCCchhhHHHHHHHHHHHHHHhhhcCchhHHHHHHHhHHHHHHHHccCcchhhhhhhc
Q psy2731 198 GSLAAVVAHEISHSWTGNLVTNRNFEHFWLNEGFTMFVERKITGRLRGEAERHFDALSGLKDLKQAAGDGSLAAVVAHEI 277 (537)
Q Consensus 198 ~~~~~viaHEiAHqWfGnlVt~~~w~~~WLnEGfa~y~e~~~~~~~~g~~~~~~~~~~~~~~l~~~~~~d~~~~~~~~e~ 277 (537)
..+..+++||++|+|+++.++.......|++||+|+|++... ... . ...+...+..+...+
T Consensus 23 ~~~~~~l~HE~~H~~~~~~~~~~~~~~~W~~EG~A~y~~~~~----~~~---~------~~~~~~~~~~~~~~~------ 83 (128)
T PF13485_consen 23 DWLDRVLAHELAHQWFGNYFGGDDNAPRWFNEGLAEYVEGRI----EDE---F------DEDLKQAIESGSLPP------ 83 (128)
T ss_pred HHHHHHHHHHHHHHHHHHHcCCCccCchHHHHHHHHHHhcCc----cch---h------HHHHHHHHHcCCCCC------
Confidence 345799999999999999998777788999999999999331 110 0 112223332232222
Q ss_pred cCCCcccccccCcCCccch--hhHHHHHHHHHhccccc
Q psy2731 278 SHSWTGNLVTNRNFEHFWL--NEGFTMFVERKITGRLR 313 (537)
Q Consensus 278 ~~pl~g~~~~~~~f~~i~Y--G~~~l~~L~~~lG~~~f 313 (537)
..++.........+....| |..++++|+...|++.|
T Consensus 84 ~~~l~~~~~~~~~~~~~~Y~~~~~~~~~L~~~~G~~~~ 121 (128)
T PF13485_consen 84 LEPLNSSFDFSWEDDSLAYYQGYLFVRFLEEKYGREKF 121 (128)
T ss_pred hHHHhccccccccccccHHHHHHHHHHHHHHHHHHHHH
Confidence 1122211111344555678 99999999999998776
No 12
>COG3975 Predicted protease with the C-terminal PDZ domain [General function prediction only]
Probab=98.71 E-value=3.1e-07 Score=96.21 Aligned_cols=240 Identities=14% Similarity=0.167 Sum_probs=135.2
Q ss_pred CCceEEecCcchHHHHHHHHHHHHHHHHHHHHHhcCCCCCcccEEEeCCCCCCcccccccceeeccccchhhhhccccCC
Q psy2731 84 SERCSVWSEPELVKEAADEFNETEKFLSTAEEICGTYVWGVYDLVMLPPSFPFGGMENPYDTSLQDVCNDLANRWISWNH 163 (537)
Q Consensus 84 g~~~~v~~~p~~~~~~~~~~~~~~~~l~~~e~~~GpYp~~k~d~v~lpp~f~~ggmEn~gltf~~~~~~~~~e~~~~~~~ 163 (537)
+.++.++.+-. ...+....+.++++++.--+.||+-||.+|.+++--.+-++||||+-.-|.+... .
T Consensus 169 ph~~~~~g~~p-~~d~~~~~~~~k~ii~~~~~vFg~~~~~~Y~Fl~~~s~q~~GGlEH~~St~l~~~------------r 235 (558)
T COG3975 169 PHTIALRGELP-NFDKERLASDTKKIIEAEIKVFGSAPFDKYVFLLHLSDQIYGGLEHRRSTALIYD------------R 235 (558)
T ss_pred ceeEEEeeccc-cccHHHHHHHHHHHHHHHHHHhcCCCccceEEEEEecCCCCCCceeccccccccc------------c
Confidence 34555665432 2244555678999999999999999999999988756667899999753331100 0
Q ss_pred CCCCCCccccccccCCchhhhhhhccccccccCCcchhhhHhhhhhhhhccCcccCCCC-----------chhhHHHHHH
Q psy2731 164 TKSPPFSKQDLAAFTPGQKIEFLAKSSSYILAGDGSLAAVVAHEISHSWTGNLVTNRNF-----------EHFWLNEGFT 232 (537)
Q Consensus 164 ~~~~~~~~~~~~l~~p~~~~~~~~~~~~~~~~~~~~~~~viaHEiAHqWfGnlVt~~~w-----------~~~WLnEGfa 232 (537)
. . .+. . ...+....+++||..|.|-+--+.++.- .-+|+.||||
T Consensus 236 ~---~--------~~~------------~--~ky~~~l~llsHEyfH~WNvKrIrpa~l~p~~~d~en~t~~lW~~EG~T 290 (558)
T COG3975 236 F---G--------FTD------------Q--DKYQDLLGLLSHEYFHAWNVKRIRPAALEPFNLDKENYTPLLWFSEGFT 290 (558)
T ss_pred c---c--------ccc------------h--hHHHHHHHHHHHHHHHhccceeccccccCCccccccCCCcceeeecCch
Confidence 0 0 111 0 1124557899999999998766665432 3589999999
Q ss_pred HHHHHHHHhhhcCchhHHHHHHHhHHHHHHHHccCcchh--hhhhhccCCCccccccc--CcCCccch---hhHHHHHHH
Q psy2731 233 MFVERKITGRLRGEAERHFDALSGLKDLKQAAGDGSLAA--VVAHEISHSWTGNLVTN--RNFEHFWL---NEGFTMFVE 305 (537)
Q Consensus 233 ~y~e~~~~~~~~g~~~~~~~~~~~~~~l~~~~~~d~~~~--~~~~e~~~pl~g~~~~~--~~f~~i~Y---G~~~l~~L~ 305 (537)
.|+..++.-+..--....+.. .+..-...+....... ..+.....++.....++ ..-..++| |+++--+|.
T Consensus 291 ~Yy~~ll~lRsgl~~~~~~l~--~la~tl~~~~~~~gRl~~~laEsS~~awik~yr~d~ns~n~~~sYY~kG~lv~L~lD 368 (558)
T COG3975 291 SYYDRLLALRSGLISLETYLN--YLAKTLARYLNTPGRLRQSLAESSFDAWIKYYRPDENSPNRLVSYYQKGALVALLLD 368 (558)
T ss_pred HHHHHHHHHHhccCcHHHHHH--HHHHHHHHHhcCCceecccccccccchhHHhhcccccccccchhhhhchhHHHHHHH
Confidence 999998876542222222221 1111111111011100 00000001111011121 12233555 999999998
Q ss_pred HHhcccccch--hhhhhhhhhHHHHH--h-hhccCCCchhhHHHHHHHHhcCcchHHHHhh-hhcCCCCcc
Q psy2731 306 RKITGRLRGE--AERHFDALSGLKDL--K-QAVSSTGPLWDSKRNALDFQKGRHYQCKIER-LKCGSAILF 370 (537)
Q Consensus 306 ~~lG~~~f~~--~~~~~~~l~y~~~~--~-~~~~t~~~~~df~~~l~~~~~~~~~~~~~~~-l~~~G~P~~ 370 (537)
-.|-+..-|+ +++.+++| .+.+ . +.+++ +++...+....+. ....||++ +++.--|++
T Consensus 369 l~iR~r~~~~~SLDdvmram--~~~~~~~~~~~t~----e~v~av~~~~tg~-dl~~f~~~~i~~~~~~~l 432 (558)
T COG3975 369 LLIRERGGGQKSLDDVMRAL--WKEFGRAERGYTP----EDVQAVLENVTGL-DLATFFDEYIEGTEPPPL 432 (558)
T ss_pred HHHHhcCCCcccHHHHHHHH--HHHhCcCccCCCH----HHHHHHHHhhccc-cHHHHHHHHhhcCCCCCh
Confidence 8884332221 34444444 2222 4 66788 9999999999872 22347777 666555554
No 13
>KOG1047|consensus
Probab=98.12 E-value=1e-05 Score=85.43 Aligned_cols=221 Identities=29% Similarity=0.409 Sum_probs=138.3
Q ss_pred CcchhhhhhhccCCCcccccccCcCCccchhhHHHHHHHHHhcccccchhhhhhhhhhHHHHHhhhccCCCchhhHHHHH
Q psy2731 267 GSLAAVVAHEISHSWTGNLVTNRNFEHFWLNEGFTMFVERKITGRLRGEAERHFDALSGLKDLKQAVSSTGPLWDSKRNA 346 (537)
Q Consensus 267 d~~~~~~~~e~~~pl~g~~~~~~~f~~i~YG~~~l~~L~~~lG~~~f~~~~~~~~~l~y~~~~~~~~~t~~~~~df~~~l 346 (537)
.++..+++|||+|.|+|+++++..|...+....+-.+||+.|-...+|+....++++ +.. .++...+
T Consensus 286 rsl~~vIaHEIAHSWtGNlVTN~sWehfWLNEGfTvylErrI~g~~~g~~~~~f~a~---------~gw----~~L~~~~ 352 (613)
T KOG1047|consen 286 RSLVDVIAHEIAHSWTGNLVTNASWEHFWLNEGFTVYLERRIVGRLYGEAYRQFEAL---------IGW----RELRPSM 352 (613)
T ss_pred cchhhHHHHHhhhhhcccccccCccchhhhcccchhhhhhhhhhhhcchhHHHHHHh---------cCh----hhhhhHH
Confidence 456779999999999999999999999888333344577765434443333333332 111 1111111
Q ss_pred HHHhcCcchHHHHhhhhcCCCCccccccccchhHHHHHHHHHHhhcCCCCCCCCChhhcccCChhcHHHHHHHhhccCCC
Q psy2731 347 LDFQKGRHYQCKIERLKCGSAILFIYGYDTSLQDVCNDLANRWISWNHTKETPFSKQDLAAFTPGQKIEFLAILLDKEMY 426 (537)
Q Consensus 347 ~~~~~~~~~~~~~~~l~~~G~P~~~~~~~~~~~~~~~~l~~~w~~~~~~~~~~~~~~~~~~~~~~q~~~FL~~l~~~~~l 426 (537)
.. .|-+ ....+-++++. . -+++..|+..+|+ ++.+.+.||+++++.+..
T Consensus 353 d~----------------~g~~-------~~~tkLv~kl~----~--~dPDdafs~VpYe--KG~~ll~~Le~~lG~~~~ 401 (613)
T KOG1047|consen 353 DL----------------FGET-------SEFTKLVVKLE----N--VDPDDAFSQVPYE--KGFALLFYLEQLLGDPTR 401 (613)
T ss_pred Hh----------------cCCC-------cccchhhhhcc----C--CChHHhhhcCchh--hhhHHHHHHHHHhCChhh
Confidence 11 1110 11112222221 1 1344467888888 999999999999998778
Q ss_pred CHHHHHHHHhhcCcCCCCcHHHHHHHHHHHHhCcccchhHHHH--HhHhccC--------ccccccHHHHHHhcCccCHH
Q psy2731 427 DLPKVKSLQDVYRFNGVRNCEIRFRWLKLCLKARWKEQVPHVI--DMVTSQG--------RMKYVRPLYRELYAWEDTRQ 496 (537)
Q Consensus 427 ~~~~l~~ld~~y~l~~~~N~Ei~~rW~~l~i~~~~~~~~~~~~--~fl~~~G--------Rmk~~~PlYr~L~~~~~~r~ 496 (537)
+...|+++-+.|+++++.-.+.+---|.---.-+-.++++.+. .||.+.| +....+|.+....+|....+
T Consensus 402 Fd~FLr~Yv~kfa~ksI~t~dfld~Lye~fpe~kk~dil~~vd~~~Wl~~~G~Pp~~p~~~~sL~~~~~~La~~W~~~~~ 481 (613)
T KOG1047|consen 402 FDPFLRAYVHKFAFKSILTQDFLDFLYEYFPELKKKDILDEVDWDLWLNSPGMPPPKPNFDSSLARPVEALAQKWTAVEE 481 (613)
T ss_pred HHHHHHHHHHHhccceecHHHHHHHHHHhCcchhhhhhhccccHHHHhcCCCCCCCCCCccchHHHHHHHHHHHHhhccc
Confidence 9999999999999998877776544443322222244555554 9999999 55566777776667764111
Q ss_pred --------HH--------------HHHHHHhhhcccHHHHHHHHhhcCCCCcchHHHh
Q psy2731 497 --------TA--------------IDTFKQHRKQMMYVTAYTLAKDLKLGDLDSAEKK 532 (537)
Q Consensus 497 --------~A--------------~~~f~~~~~~yhpi~~~~v~k~l~~~~~~~~~~~ 532 (537)
-| .+.--++++ .-|-..+.+.+.+++...++||+.
T Consensus 482 ~~~~~~~~~a~d~~~~~~~Ql~lfL~~l~q~~~-lp~~~~~aL~~~Y~l~~skNAEvr 538 (613)
T KOG1047|consen 482 LDDAASEFSAEDIEKWSPDQLVLFLDKLLQSKP-LPPGTVKALQDTYSLLQSKNAEVR 538 (613)
T ss_pred ccccccccchhhHHhcChHhHHHHHHHHHhcCC-CChHHHHHHHHHhhhhhccchHHH
Confidence 11 111123333 336678889999999999999875
No 14
>PF10460 Peptidase_M30: Peptidase M30; InterPro: IPR019501 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This family contains metallopeptidases belonging to MEROPS peptidase family M30 (hyicolysin family, clan MA). Hyicolysin has a zinc ion which is liganded by two histidine and one glutamate residue.
Probab=96.74 E-value=0.0079 Score=62.15 Aligned_cols=47 Identities=23% Similarity=0.304 Sum_probs=35.7
Q ss_pred cchhhhHhhhhhhhh--ccCcccCC--CCchhhHHHHHHHHHHHHHHhhhc
Q psy2731 198 GSLAAVVAHEISHSW--TGNLVTNR--NFEHFWLNEGFTMFVERKITGRLR 244 (537)
Q Consensus 198 ~~~~~viaHEiAHqW--fGnlVt~~--~w~~~WLnEGfa~y~e~~~~~~~~ 244 (537)
.....++|||+-|+= --+.|... .-.|.|||||++.-.|.++..++.
T Consensus 137 ~~~~sTlAHEfQHmInfy~~~v~~g~~~~~dtWLnE~lS~~aEdl~s~~~~ 187 (366)
T PF10460_consen 137 DTVYSTLAHEFQHMINFYQRGVLHGKQYAMDTWLNEMLSMSAEDLYSSKID 187 (366)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCcccccHHHHHHHHHHHHHHhcCCC
Confidence 356789999999985 22444443 346999999999999998877663
No 15
>PF05299 Peptidase_M61: M61 glycyl aminopeptidase; InterPro: IPR007963 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M61 (glycyl aminopeptidase family, clan MA(E)).The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The type example is glycyl aminopeptidase from Sphingomonas capsulata.
Probab=96.53 E-value=0.0019 Score=56.30 Aligned_cols=45 Identities=27% Similarity=0.516 Sum_probs=36.1
Q ss_pred chhhhHhhhhhhhhccCccc-----------CCCCchhhHHHHHHHHHHHHHHhhh
Q psy2731 199 SLAAVVAHEISHSWTGNLVT-----------NRNFEHFWLNEGFTMFVERKITGRL 243 (537)
Q Consensus 199 ~~~~viaHEiAHqWfGnlVt-----------~~~w~~~WLnEGfa~y~e~~~~~~~ 243 (537)
....++|||..|.|-+-.+. +.--+.+|+-||+|+|++..++.+.
T Consensus 3 ~~l~l~sHEffH~WnvkrirP~~l~p~dy~~~~~t~~LWv~EG~T~Y~~~l~l~Ra 58 (122)
T PF05299_consen 3 RFLGLLSHEFFHSWNVKRIRPAELGPFDYEKPNYTELLWVYEGFTSYYGDLLLVRA 58 (122)
T ss_pred chhhhhhhhccccccceEeccccccCCCCCCCCCCCCEeeeeCcHHHHHHHHHHHc
Confidence 34679999999999864444 4445578999999999999988765
No 16
>PF07607 DUF1570: Protein of unknown function (DUF1570); InterPro: IPR011464 This entry represents hypothetical proteins confined to bacteria.
Probab=95.56 E-value=0.0071 Score=53.30 Aligned_cols=39 Identities=26% Similarity=0.465 Sum_probs=29.2
Q ss_pred hhhHhhhhhhhhccCc-ccC--CCCchhhHHHHHHHHHHHHHH
Q psy2731 201 AAVVAHEISHSWTGNL-VTN--RNFEHFWLNEGFTMFVERKIT 240 (537)
Q Consensus 201 ~~viaHEiAHqWfGnl-Vt~--~~w~~~WLnEGfa~y~e~~~~ 240 (537)
+.+|+||-+||=.-|. |-+ +.| =.|+.||||+|+|..-+
T Consensus 2 ~~T~~HEa~HQl~~N~Gl~~r~~~~-P~Wv~EGlA~yFE~~~~ 43 (128)
T PF07607_consen 2 IATIAHEATHQLAFNTGLHPRLADW-PRWVSEGLATYFETPGM 43 (128)
T ss_pred chHHHHHHHHHHHHHccccccCCCC-chHHHHhHHHHcCCCcc
Confidence 4589999999987654 111 222 27999999999997765
No 17
>TIGR02411 leuko_A4_hydro leukotriene A-4 hydrolase/aminopeptidase. Members of this family represent a distinctive subset within the zinc metallopeptidase family M1 (pfam01433). The majority of the members of pfam01433 are aminopeptidases, but the sequences in this family for which the function is known are leukotriene A-4 hydrolase. A dual epoxide hydrolase and aminopeptidase activity at the same active site is indicated. The physiological substrate for aminopeptidase activity is not known.
Probab=93.92 E-value=0.74 Score=51.60 Aligned_cols=131 Identities=14% Similarity=0.142 Sum_probs=85.3
Q ss_pred CCChhhcccCChhcHHHHHHHhhccCCCCHHHHHHHHhhcCcCCCCcHHHHHHHHHHHHhCcc-cchhHH--HHHhHhcc
Q psy2731 399 PFSKQDLAAFTPGQKIEFLAILLDKEMYDLPKVKSLQDVYRFNGVRNCEIRFRWLKLCLKARW-KEQVPH--VIDMVTSQ 475 (537)
Q Consensus 399 ~~~~~~~~~~~~~q~~~FL~~l~~~~~l~~~~l~~ld~~y~l~~~~N~Ei~~rW~~l~i~~~~-~~~~~~--~~~fl~~~ 475 (537)
.|+...|. ++...+++|+++++.+..+.+-|+.+-+.|++.+..-.++.-.... ....+. ...++. ...|+...
T Consensus 367 ~f~~i~Y~--KGa~~L~mL~~~lG~~~~F~~~lr~Yl~~~~~~s~~t~df~~~l~~-~~~~~~~~~~l~~~~~~~Wl~~~ 443 (601)
T TIGR02411 367 AFSSVPYE--KGFNFLFYLEQLLGGPAVFDPFLKHYFKKFAYKSLDTYQFKDALYE-YFKDTGKVDKLNAVDWDTWLYSP 443 (601)
T ss_pred hccccchh--hHHHHHHHHHHHhCCHHHHHHHHHHHHHHhCCCCCCHHHHHHHHHH-HhhhccccchhhhhhHHHHhcCC
Confidence 35555566 8999999999999734578889999999999987776665433322 221111 122332 58999999
Q ss_pred C----ccccccHHHHHHh----cCccCH-----H---HH--------HHHHHHhh------hcccHHHHHHHHhhcCCCC
Q psy2731 476 G----RMKYVRPLYRELY----AWEDTR-----Q---TA--------IDTFKQHR------KQMMYVTAYTLAKDLKLGD 525 (537)
Q Consensus 476 G----Rmk~~~PlYr~L~----~~~~~r-----~---~A--------~~~f~~~~------~~yhpi~~~~v~k~l~~~~ 525 (537)
| ..+|..+++.+.. +|.++. + .- +-.|...= .-.++-..+.+++.++++.
T Consensus 444 G~P~~~~~~~~~~~~~~~~l~~~w~~~~~~~~~~~~~~~~~~~~~~q~~~fl~~l~~~~~~~~l~~~~~~~l~~~y~~~~ 523 (601)
T TIGR02411 444 GLPPVKPNFDTTLADECYALASRWVDAAKDDLSSFNAKDIKDFSSHQLVLFLETLTERKGIVSLPEGHIKRMGDIYNFAA 523 (601)
T ss_pred CCCCcCCCCCchHHHHHHHHHHHHHhcCCCCCCCcchhhhhhCCHHHHHHHHHHHHhhccCCCCCHHHHHHHHhhcCccC
Confidence 9 5677777766422 343211 1 11 22333211 1377889999999999999
Q ss_pred cchHHHh
Q psy2731 526 LDSAEKK 532 (537)
Q Consensus 526 ~~~~~~~ 532 (537)
..++|+.
T Consensus 524 ~~n~ei~ 530 (601)
T TIGR02411 524 SKNAEVR 530 (601)
T ss_pred CCchhhh
Confidence 9999875
No 18
>PF10026 DUF2268: Predicted Zn-dependent protease (DUF2268); InterPro: IPR018728 This domain, found in various hypothetical bacterial proteins, as well as predicted zinc dependent proteases, has no known function.
Probab=93.86 E-value=0.19 Score=47.88 Aligned_cols=47 Identities=19% Similarity=0.322 Sum_probs=34.9
Q ss_pred cchhhhHhhhhhhhhccCccc----CCCCchhhHHHHHHHHHHHHHHhhhc
Q psy2731 198 GSLAAVVAHEISHSWTGNLVT----NRNFEHFWLNEGFTMFVERKITGRLR 244 (537)
Q Consensus 198 ~~~~~viaHEiAHqWfGnlVt----~~~w~~~WLnEGfa~y~e~~~~~~~~ 244 (537)
..+..+||||+-|.+--..+. ..+--|.-+.||+|.+++..+.....
T Consensus 63 ~~l~~~iaHE~hH~~r~~~~~~~~~~~TLld~~I~EGlAe~f~~~~~g~~~ 113 (195)
T PF10026_consen 63 EELPALIAHEYHHNCRYEQIGWDPEDTTLLDSLIMEGLAEYFAEELYGEEY 113 (195)
T ss_pred HHHHHHHHHHHHHHHHHhccCCCCCCCCHHHHHHHhhHHHHHHHHHcCCCC
Confidence 478899999999996433332 23344778999999999988776655
No 19
>PF10023 DUF2265: Predicted aminopeptidase (DUF2265); InterPro: IPR014553 This group represents a predicted aminopeptidase.
Probab=90.75 E-value=0.25 Score=50.47 Aligned_cols=41 Identities=37% Similarity=0.506 Sum_probs=30.6
Q ss_pred CCcchhhhHhhhhhhhhccCcccCCCCchhhHHHHHHHHHHHHHHhh
Q psy2731 196 GDGSLAAVVAHEISHSWTGNLVTNRNFEHFWLNEGFTMFVERKITGR 242 (537)
Q Consensus 196 ~~~~~~~viaHEiAHqWfGnlVt~~~w~~~WLnEGfa~y~e~~~~~~ 242 (537)
.+..++.+|-||+|||=+ -.+ +|.=+||+||+++|...+.+
T Consensus 161 ~~~~LA~LIfHELaHq~~----Yv~--~dt~FNEsfAtfVe~~G~~~ 201 (337)
T PF10023_consen 161 PDGELARLIFHELAHQTL----YVK--GDTAFNESFATFVEREGARR 201 (337)
T ss_pred CchHHHHHHHHHHhhcee----ecC--CCchhhHHHHHHHHHHHHHH
Confidence 345789999999999933 222 25568999999999876554
No 20
>PRK04860 hypothetical protein; Provisional
Probab=86.75 E-value=0.99 Score=41.47 Aligned_cols=25 Identities=12% Similarity=0.034 Sum_probs=19.4
Q ss_pred HHHHHHHHHHHHHHhc-CCCCCcccE
Q psy2731 103 FNETEKFLSTAEEICG-TYVWGVYDL 127 (537)
Q Consensus 103 ~~~~~~~l~~~e~~~G-pYp~~k~d~ 127 (537)
...++..+...+++|| |++-++..+
T Consensus 6 ~~~~~~~~~~a~~~f~~~f~~p~~~f 31 (160)
T PRK04860 6 MRRLRECLAQANLYFKRTFPEPKVSY 31 (160)
T ss_pred HHHHHHHHHHHHHHhCCCCCCCEEEE
Confidence 4567888999999999 888765543
No 21
>PF04450 BSP: Peptidase of plants and bacteria; InterPro: IPR007541 These basic secretory proteins (BSPs) are believed to be part of the plants defence mechanism against pathogens [].
Probab=86.17 E-value=0.58 Score=44.86 Aligned_cols=38 Identities=26% Similarity=0.329 Sum_probs=28.9
Q ss_pred cchhhhHhhhhhhhhccCcccCCCCchhhHHHHHHHHHHHH
Q psy2731 198 GSLAAVVAHEISHSWTGNLVTNRNFEHFWLNEGFTMFVERK 238 (537)
Q Consensus 198 ~~~~~viaHEiAHqWfGnlVt~~~w~~~WLnEGfa~y~e~~ 238 (537)
..+..+|.||++|-|=.+--+.. --||-||+|.|+..+
T Consensus 94 ~Ei~Gvl~HE~~H~~Q~~~~~~~---P~~liEGIADyVRl~ 131 (205)
T PF04450_consen 94 DEIIGVLYHEMVHCWQWDGRGTA---PGGLIEGIADYVRLK 131 (205)
T ss_pred HHHHHHHHHHHHHHhhcCCCCCC---ChhheecHHHHHHHH
Confidence 35789999999998876653322 358999999997654
No 22
>COG4324 Predicted aminopeptidase [General function prediction only]
Probab=84.89 E-value=0.71 Score=45.12 Aligned_cols=45 Identities=29% Similarity=0.330 Sum_probs=32.0
Q ss_pred ccccCC-cchhhhHhhhhhhhhccCcccCCCCchhhHHHHHHHHHHHHHHhh
Q psy2731 192 YILAGD-GSLAAVVAHEISHSWTGNLVTNRNFEHFWLNEGFTMFVERKITGR 242 (537)
Q Consensus 192 ~~~~~~-~~~~~viaHEiAHqWfGnlVt~~~w~~~WLnEGfa~y~e~~~~~~ 242 (537)
+.+..+ ..+..+|-||+|||=+. |. +|.=+||+||++.|..-+++
T Consensus 188 tmlr~dd~~lA~LIFHELAHQk~Y--~~----~DtAFNEsFAtaVEt~Gvr~ 233 (376)
T COG4324 188 TMLRQDDTYLASLIFHELAHQKIY--VN----NDTAFNESFATAVETSGVRK 233 (376)
T ss_pred HHhcCChHHHHHHHHHHHhhheEe--ec----CcchHhHHHHHHHHHHhHHH
Confidence 444433 46789999999999664 11 24457999999999776554
No 23
>smart00731 SprT SprT homologues. Predicted to have roles in transcription elongation. Contains a conserved HExxH motif, indicating a metalloprotease function.
Probab=76.20 E-value=2.2 Score=38.54 Aligned_cols=18 Identities=22% Similarity=0.266 Sum_probs=14.7
Q ss_pred CcchhhhHhhhhhhhhcc
Q psy2731 197 DGSLAAVVAHEISHSWTG 214 (537)
Q Consensus 197 ~~~~~~viaHEiAHqWfG 214 (537)
...+..+|.|||+|.|..
T Consensus 56 ~~~l~~~l~HEm~H~~~~ 73 (146)
T smart00731 56 RDRLRETLLHELCHAALY 73 (146)
T ss_pred HHHHHhhHHHHHHHHHHH
Confidence 356788999999999854
No 24
>PF12725 DUF3810: Protein of unknown function (DUF3810); InterPro: IPR024294 This family of bacterial proteins is functionally uncharacterised. Proteins in this family are typically between 333 and 377 amino acids in length and contain a conserved HEXXH sequence motif that is characteristic of metallopeptidases. This family may therefore belong to an as yet uncharacterised family of peptidase enzymes.
Probab=75.00 E-value=3.1 Score=42.73 Aligned_cols=32 Identities=22% Similarity=0.250 Sum_probs=24.0
Q ss_pred chhhhHhhhhhhhhccCcccCCCCchhhHHHHHHHHHHHHHHhh
Q psy2731 199 SLAAVVAHEISHSWTGNLVTNRNFEHFWLNEGFTMFVERKITGR 242 (537)
Q Consensus 199 ~~~~viaHEiAHqWfGnlVt~~~w~~~WLnEGfa~y~e~~~~~~ 242 (537)
..-.++|||+|||= =-..|.=|+|++.++..+
T Consensus 195 ~~P~T~~HElAHq~------------G~a~E~EANFiayLac~~ 226 (318)
T PF12725_consen 195 SLPFTICHELAHQL------------GFASEDEANFIAYLACIN 226 (318)
T ss_pred cccHHHHHHHHHHh------------CCCCHHHHHHHHHHHHhc
Confidence 45679999999993 123678889998887654
No 25
>COG2719 SpoVR Uncharacterized conserved protein [Function unknown]
Probab=68.24 E-value=9 Score=40.36 Aligned_cols=57 Identities=18% Similarity=0.241 Sum_probs=33.1
Q ss_pred hhhhccccccccCCcchhhhHhhhhhhhhccCcccCCCCchhhHHHHHHHHHHHHHHhhhcCch
Q psy2731 184 EFLAKSSSYILAGDGSLAAVVAHEISHSWTGNLVTNRNFEHFWLNEGFTMFVERKITGRLRGEA 247 (537)
Q Consensus 184 ~~~~~~~~~~~~~~~~~~~viaHEiAHqWfGnlVt~~~w~~~WLnEGfa~y~e~~~~~~~~g~~ 247 (537)
-|+.+....+-.-++.+. .|..++| |+|.--.+.+ -||||+|+|..+.++..+|.+.
T Consensus 234 yFiEk~~~~LEpWQReil-~ivR~ea-~YF~PQ~qTk-----VMNEGWAtfWHytiln~lydE~ 290 (495)
T COG2719 234 YFIEKNSPLLEPWQREIL-RIVRKEA-QYFYPQRQTK-----VMNEGWATFWHYTILNHLYDEG 290 (495)
T ss_pred HHHHhcchhcCHHHHHHH-HHHHHHH-HHhcchHHHH-----HhhhhHHHHHHHHHHHhhhhhc
Confidence 455555432222233333 4455555 3443322222 5899999999999999887654
No 26
>PF12315 DUF3633: Protein of unknown function (DUF3633); InterPro: IPR022087 This domain family is found in bacteria and eukaryotes, and is approximately 210 amino acids in length. The family is found in association with PF00412 from PFAM.
Probab=63.80 E-value=10 Score=36.18 Aligned_cols=40 Identities=18% Similarity=0.248 Sum_probs=29.9
Q ss_pred hhhHhhhhhhhhccCcccCCCCchhhHHHHHHHHHHHHHHhh
Q psy2731 201 AAVVAHEISHSWTGNLVTNRNFEHFWLNEGFTMFVERKITGR 242 (537)
Q Consensus 201 ~~viaHEiAHqWfGnlVt~~~w~~~WLnEGfa~y~e~~~~~~ 242 (537)
..++|||+.|-|.- +.--.--..++-||++.-+++++++.
T Consensus 94 gsiLAHE~mHa~Lr--l~g~~~L~~~vEEGiCqvla~~wL~~ 133 (212)
T PF12315_consen 94 GSILAHELMHAWLR--LNGFPNLSPEVEEGICQVLAYLWLES 133 (212)
T ss_pred hhHHHHHHHHHHhc--ccCCCCCChHHHHHHHHHHHHHHHhh
Confidence 57899999999972 11111125689999999999998875
No 27
>PF04293 SpoVR: SpoVR like protein; InterPro: IPR007390 One of the family members P37875 from SWISSPROT is Bacillus subtilis stage V sporulation protein R, which is involved in spore cortex formation []. Little is known about cortex biosynthesis, except that it depends on several sigma E controlled genes, including spoVR [].
Probab=59.56 E-value=19 Score=38.41 Aligned_cols=53 Identities=15% Similarity=0.263 Sum_probs=31.0
Q ss_pred hhhhccccccccCCcchhhhHhhhhhhhhccCcccCCCCchhhHHHHHHHHHHHHHHhhh
Q psy2731 184 EFLAKSSSYILAGDGSLAAVVAHEISHSWTGNLVTNRNFEHFWLNEGFTMFVERKITGRL 243 (537)
Q Consensus 184 ~~~~~~~~~~~~~~~~~~~viaHEiAHqWfGnlVt~~~w~~~WLnEGfa~y~e~~~~~~~ 243 (537)
.|+.+..+-+-.-++.+. .|+.++|.=.+.-.-|- =||||+|+|...+++..+
T Consensus 228 ~Fi~~~ap~Le~WqrdIl-~iVR~ea~YF~PQ~qTK------IMNEGWAsywH~~im~~l 280 (426)
T PF04293_consen 228 YFIEEHAPELEDWQRDIL-RIVREEAQYFYPQIQTK------IMNEGWASYWHYRIMREL 280 (426)
T ss_pred HHHHHcCccccHHHHHHH-HHHHHHHHHhcchhhhh------hhccchHHHHHHHHHhhc
Confidence 566665543323333333 34455554333333222 289999999999999887
No 28
>COG3227 LasB Zinc metalloprotease (elastase) [Amino acid transport and metabolism]
Probab=59.25 E-value=46 Score=35.74 Aligned_cols=44 Identities=30% Similarity=0.360 Sum_probs=30.7
Q ss_pred hhhHhhhhhhhhcc---CcccCCCCchhhHHHHHHHHHHHHHHhhhcCc
Q psy2731 201 AAVVAHEISHSWTG---NLVTNRNFEHFWLNEGFTMFVERKITGRLRGE 246 (537)
Q Consensus 201 ~~viaHEiAHqWfG---nlVt~~~w~~~WLnEGfa~y~e~~~~~~~~g~ 246 (537)
..|+||||.|.-.+ ||+--..-. =|||+|+.-+...+-....++
T Consensus 338 LDVvAHElTHGvtq~tA~L~Y~~qsG--ALNEsfSDvfG~~i~~~~~~~ 384 (507)
T COG3227 338 LDVVAHELTHGVTQQTAGLIYRGQSG--ALNESFSDVFGTLIEQYVKNQ 384 (507)
T ss_pred cceehhhhcchhhhhccCceecCCCC--chhhHHHHHHHHHHHHHhccC
Confidence 57999999997655 666555333 389999988886665444333
No 29
>PF01447 Peptidase_M4: Thermolysin metallopeptidase, catalytic domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification. ; InterPro: IPR013856 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases that belong to the MEROPS peptidase family M4 (thermolysin family, clan MA(E)). The protein fold of the peptidase domain of thermolysin, is the type example for members of the clan MA. The thermolysin family is composed only of secreted eubacterial endopeptidases. The zinc-binding residues are H-142, H-146 and E-166, with E-143 acting as the catalytic residue. Thermolysin also contains 4 calcium-binding sites, which contribute to its unusual thermostability. The family also includes enzymes from a number of pathogens, including Legionella and Listeria, and the protein pseudolysin, all with a substrate specificity for an aromatic residue in the P1' position. Three-dimensional structure analysis has shown that the enzymes undergo a hinge-bend motion during catalysis. Pseudolysin has a broader specificity, acting on large molecules such as elastin and collagen, possibly due to its wider active site cleft []. This entry represents a domain found in peptidase M4 family members.; GO: 0004222 metalloendopeptidase activity; PDB: 3NQX_A 3NQZ_B 3NQY_B 1BQB_A 1U4G_A 1EZM_A 3DBK_A 1ESP_A 1NPC_A 1LND_E ....
Probab=57.30 E-value=35 Score=31.01 Aligned_cols=24 Identities=17% Similarity=0.203 Sum_probs=17.3
Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCC
Q psy2731 98 EAADEFNETEKFLSTAEEICGTYV 121 (537)
Q Consensus 98 ~~~~~~~~~~~~l~~~e~~~GpYp 121 (537)
.+..+...+.+..++|.+.||.=+
T Consensus 68 ~~vdA~~~~~~v~d~y~~~~gr~s 91 (150)
T PF01447_consen 68 AAVDAHYNAGKVYDYYKNVFGRNS 91 (150)
T ss_dssp HHHHHHHHHHHHHHHHHHHHSS-S
T ss_pred HHHHHHHhHHHHHHHHHHHHCCCC
Confidence 444555678888999999999433
No 30
>PRK11767 SpoVR family protein; Provisional
Probab=56.97 E-value=23 Score=38.34 Aligned_cols=56 Identities=18% Similarity=0.275 Sum_probs=33.3
Q ss_pred hhhhccccccccCCcchhhhHhhhhhhhhccCcccCCCCchhhHHHHHHHHHHHHHHhhhcCc
Q psy2731 184 EFLAKSSSYILAGDGSLAAVVAHEISHSWTGNLVTNRNFEHFWLNEGFTMFVERKITGRLRGE 246 (537)
Q Consensus 184 ~~~~~~~~~~~~~~~~~~~viaHEiAHqWfGnlVt~~~w~~~WLnEGfa~y~e~~~~~~~~g~ 246 (537)
.|+.+..+-+-.-++.+ -.|+.++|. +|---.-.+ -||||+|+|--+.++..++.+
T Consensus 237 ~Fi~~~ap~Le~WQReI-l~IVR~ea~-YF~PQ~qTK-----IMNEGWAsyWH~~im~~l~d~ 292 (498)
T PRK11767 237 YFIEKNAPLLEPWQREI-LRIVRKIAQ-YFYPQRQTQ-----VMNEGWATFWHYTILNHLYDE 292 (498)
T ss_pred HHHHHcCcccchHHHHH-HHHHHHHHH-hccchhhhh-----HHhhHhHHHHHHHHHHHhhhc
Confidence 55666654332333343 345556564 343222222 489999999999999988654
No 31
>PF13574 Reprolysin_2: Metallo-peptidase family M12B Reprolysin-like; PDB: 1KAP_P 1JIW_P 1AKL_A 1OM7_A 1OM8_A 1O0T_A 1OM6_A 1H71_P 1O0Q_A 1OMJ_A ....
Probab=49.59 E-value=8.5 Score=35.75 Aligned_cols=14 Identities=29% Similarity=0.299 Sum_probs=12.0
Q ss_pred chhhhHhhhhhhhh
Q psy2731 199 SLAAVVAHEISHSW 212 (537)
Q Consensus 199 ~~~~viaHEiAHqW 212 (537)
.-..++||||+|++
T Consensus 110 ~~~~~~aHElGH~l 123 (173)
T PF13574_consen 110 FGIDTFAHELGHQL 123 (173)
T ss_dssp HHHHHHHHHHHHHH
T ss_pred eeeeeehhhhHhhc
Confidence 35678999999997
No 32
>PHA02456 zinc metallopeptidase motif-containing protein
Probab=44.55 E-value=10 Score=32.12 Aligned_cols=16 Identities=31% Similarity=0.790 Sum_probs=13.6
Q ss_pred CcchhhhHhhhhhhhh
Q psy2731 197 DGSLAAVVAHEISHSW 212 (537)
Q Consensus 197 ~~~~~~viaHEiAHqW 212 (537)
.+.-..+++||++|-|
T Consensus 76 ~KGC~~TL~HEL~H~W 91 (141)
T PHA02456 76 NKGCRDTLAHELNHAW 91 (141)
T ss_pred ccchHHHHHHHHHHHH
Confidence 3456789999999999
No 33
>PF01863 DUF45: Protein of unknown function DUF45; InterPro: IPR002725 Members of this family are found in some archaebacteria, as well as Helicobacter pylori. The proteins are 190-240 amino acids long, with the C terminus being the most conserved region, containing three conserved histidines.
Probab=43.86 E-value=65 Score=30.34 Aligned_cols=24 Identities=29% Similarity=0.556 Sum_probs=17.1
Q ss_pred cchhhhHhhhhhhhhccCcccCCCCchhh
Q psy2731 198 GSLAAVVAHEISHSWTGNLVTNRNFEHFW 226 (537)
Q Consensus 198 ~~~~~viaHEiAHqWfGnlVt~~~w~~~W 226 (537)
.-+..||+||+||--.- +--..||
T Consensus 162 ~~idYVvvHEL~Hl~~~-----nHs~~Fw 185 (205)
T PF01863_consen 162 EVIDYVVVHELCHLRHP-----NHSKRFW 185 (205)
T ss_pred cHHHHHHHHHHHHhccC-----CCCHHHH
Confidence 35678999999997654 3444566
No 34
>PF10263 SprT-like: SprT-like family; InterPro: IPR006640 This is a family of uncharacterised bacterial proteins which includes Escherichia coli SprT (P39902 from SWISSPROT). SprT is described as a regulator of bolA gene in stationary phase []. The majority of members contain the metallopeptidase zinc binding signature which has a HExxH motif, however there is no evidence for them being metallopeptidases.
Probab=43.81 E-value=12 Score=33.82 Aligned_cols=18 Identities=22% Similarity=0.351 Sum_probs=15.0
Q ss_pred CcchhhhHhhhhhhhhcc
Q psy2731 197 DGSLAAVVAHEISHSWTG 214 (537)
Q Consensus 197 ~~~~~~viaHEiAHqWfG 214 (537)
...+..+|.|||+|.|..
T Consensus 57 ~~~~~~tL~HEm~H~~~~ 74 (157)
T PF10263_consen 57 EEELIDTLLHEMAHAAAY 74 (157)
T ss_pred HHHHHHHHHHHHHHHHhh
Confidence 346789999999999973
No 35
>PF01421 Reprolysin: Reprolysin (M12B) family zinc metalloprotease This Prosite motif covers only the active site.; InterPro: IPR001590 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12B (adamalysin family, clan (MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The adamalysins are zinc dependent endopeptidases found in snake venom. There are some mammalian proteins such as P78325 from SWISSPROT, and fertilin Q28472 from SWISSPROT. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes. CD156 (also called ADAM8 (3.4.24 from EC) or MS2 human) has been implicated in extravasation of leukocytes. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2E3X_A 2W15_A 2W14_A 2W13_A 2W12_A 1ND1_A 3K7L_A 2DW2_A 2DW0_B 2DW1_A ....
Probab=43.72 E-value=13 Score=35.09 Aligned_cols=13 Identities=46% Similarity=0.897 Sum_probs=10.9
Q ss_pred chhhhHhhhhhhh
Q psy2731 199 SLAAVVAHEISHS 211 (537)
Q Consensus 199 ~~~~viaHEiAHq 211 (537)
..+.++||||+|.
T Consensus 130 ~~a~~~AHelGH~ 142 (199)
T PF01421_consen 130 SFAVIIAHELGHN 142 (199)
T ss_dssp HHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHh
Confidence 4568999999996
No 36
>PRK03001 M48 family peptidase; Provisional
Probab=42.83 E-value=34 Score=34.51 Aligned_cols=16 Identities=31% Similarity=0.563 Sum_probs=13.4
Q ss_pred CcchhhhHhhhhhhhh
Q psy2731 197 DGSLAAVVAHEISHSW 212 (537)
Q Consensus 197 ~~~~~~viaHEiAHqW 212 (537)
+..+..|+|||++|-=
T Consensus 121 ~~El~aVlAHElgHi~ 136 (283)
T PRK03001 121 EREIRGVMAHELAHVK 136 (283)
T ss_pred HHHHHHHHHHHHHHHh
Confidence 4568899999999964
No 37
>PF12125 Beta-TrCP_D: D domain of beta-TrCP; InterPro: IPR021977 This domain is found in eukaryotes, and is approximately 40 amino acids in length. It is found associated with PF00646 from PFAM, PF00400 from PFAM. The protein that contains this domain functions as a ubiquitin ligase. Ubiquitination is required to direct proteins towards the proteasome for degradation. This protein is part of the WD40 class of F box proteins. The D domain of these F box proteins is involved in mediating the dimerisation of the protein. Dimerisation is necessary to polyubiquitinate substrates so this D domain is vital in directing substrates towards the proteasome for degradation. ; PDB: 2P64_A 1P22_A.
Probab=42.50 E-value=38 Score=23.32 Aligned_cols=32 Identities=9% Similarity=0.236 Sum_probs=25.2
Q ss_pred cccCChhcHHHHHHHhhccCCCCHHHHHHHHhhc
Q psy2731 405 LAAFTPGQKIEFLAILLDKEMYDLPKVKSLQDVY 438 (537)
Q Consensus 405 ~~~~~~~q~~~FL~~l~~~~~l~~~~l~~ld~~y 438 (537)
+..|+..+.+.|.++|+. .+++-+.-+++...
T Consensus 4 F~~Wse~eQvdFVe~Ll~--rM~H~Qhg~In~fL 35 (40)
T PF12125_consen 4 FDKWSESEQVDFVEQLLS--RMCHYQHGQINAFL 35 (40)
T ss_dssp HTTS-HHHHHHHHHHHHH--TS-HHHHHHHHHHH
T ss_pred hhccCchhHHHHHHHHHH--HHHHHHHHhHHHHH
Confidence 568999999999999998 58888888877643
No 38
>PF08325 WLM: WLM domain; InterPro: IPR013536 The WLM (WSS1-like metalloprotease) domain is a globular domain related to the zincin-like superfamily of Zn-dependent peptidase. Since the WLM domain contains all known active site residues of zincins, it is predicted to be a catalytically active peptidase domain. The WLM domain is a eukaryotic domain represented in plants, fungi, Plasmodium, and kinetoplastids. By contrast, it is absent in animals, Cryptosporidium, and Microsporidia, suggesting that it has been lost on multiple occasions during the evolution of eukaryotes. The WLM domain is found either in stand-alone form or in association with other domains such as the RanBP2 zinc finger , the ubiquitin domain, or the PUB/PUG domain. This domain could function as a specific de-SUMOylating domain of distinct protein complexes in the nucleus and the cytoplasm []. It has been suggested to form a segregated alpha/beta structure with eight helices and five strands. Proteins containign this domain include yeast WSS1 (a weak suppressor of the Ub-related protein SMT3), and various putative metalloproteases from plant and fungal species.
Probab=42.08 E-value=13 Score=35.15 Aligned_cols=18 Identities=22% Similarity=0.490 Sum_probs=15.8
Q ss_pred cchhhhHhhhhhhhhccC
Q psy2731 198 GSLAAVVAHEISHSWTGN 215 (537)
Q Consensus 198 ~~~~~viaHEiAHqWfGn 215 (537)
..+..++.|||||.++|+
T Consensus 80 ~~i~~t~lHELaH~~~~~ 97 (186)
T PF08325_consen 80 ETILGTMLHELAHNVHGP 97 (186)
T ss_pred HHHHHHHHHHHHhcccCC
Confidence 467899999999999887
No 39
>KOG2661|consensus
Probab=41.69 E-value=13 Score=37.92 Aligned_cols=20 Identities=40% Similarity=0.607 Sum_probs=16.6
Q ss_pred cCCcchhhhHhhhhhhhhcc
Q psy2731 195 AGDGSLAAVVAHEISHSWTG 214 (537)
Q Consensus 195 ~~~~~~~~viaHEiAHqWfG 214 (537)
.++..++.|++||+|||=-+
T Consensus 270 k~ddglAtvLgHE~aHaVar 289 (424)
T KOG2661|consen 270 KDDDGLATVLGHEIAHAVAR 289 (424)
T ss_pred cChHHHHHHHHHHHHHHHHH
Confidence 56678899999999999544
No 40
>PRK03982 heat shock protein HtpX; Provisional
Probab=41.05 E-value=36 Score=34.34 Aligned_cols=19 Identities=32% Similarity=0.543 Sum_probs=14.5
Q ss_pred CCcchhhhHhhhhhhhhcc
Q psy2731 196 GDGSLAAVVAHEISHSWTG 214 (537)
Q Consensus 196 ~~~~~~~viaHEiAHqWfG 214 (537)
.+..+..|+|||++|-==|
T Consensus 121 ~~~El~AVlAHElgHi~~~ 139 (288)
T PRK03982 121 NEDELEGVIAHELTHIKNR 139 (288)
T ss_pred CHHHHHHHHHHHHHHHHcC
Confidence 3456889999999997433
No 41
>COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only]
Probab=39.56 E-value=21 Score=38.23 Aligned_cols=18 Identities=44% Similarity=0.593 Sum_probs=13.7
Q ss_pred cchhhhHhhhhhhhhccC
Q psy2731 198 GSLAAVVAHEISHSWTGN 215 (537)
Q Consensus 198 ~~~~~viaHEiAHqWfGn 215 (537)
..++.||||||+|-==+.
T Consensus 128 sElagViAHEigHv~qrH 145 (484)
T COG4783 128 SELAGVIAHEIGHVAQRH 145 (484)
T ss_pred HHHHHHHHHHHHHHhhhh
Confidence 367899999999964333
No 42
>PF06114 DUF955: Domain of unknown function (DUF955); InterPro: IPR010359 This is a family of bacterial and viral proteins with undetermined function. A conserved H-E-X-X-H motif is suggestive of a catalytic active site and shows similarity to IPR001915 from INTERPRO.; PDB: 3DTE_A 3DTK_A 3DTI_A.
Probab=36.47 E-value=16 Score=30.68 Aligned_cols=19 Identities=21% Similarity=0.125 Sum_probs=14.9
Q ss_pred chhhhHhhhhhhhhccCcc
Q psy2731 199 SLAAVVAHEISHSWTGNLV 217 (537)
Q Consensus 199 ~~~~viaHEiAHqWfGnlV 217 (537)
....+++|||+|.+++..-
T Consensus 41 ~~~f~laHELgH~~~~~~~ 59 (122)
T PF06114_consen 41 RQRFTLAHELGHILLHHGD 59 (122)
T ss_dssp HHHHHHHHHHHHHHHHH-H
T ss_pred HHHHHHHHHHHHHHhhhcc
Confidence 4568999999999987543
No 43
>PF13688 Reprolysin_5: Metallo-peptidase family M12; PDB: 2FV5_B 3EWJ_A 3KME_A 3L0T_B 1BKC_E 3G42_D 2I47_D 2FV9_B 3LEA_A 1ZXC_B ....
Probab=36.13 E-value=18 Score=33.92 Aligned_cols=14 Identities=43% Similarity=0.496 Sum_probs=11.9
Q ss_pred chhhhHhhhhhhhh
Q psy2731 199 SLAAVVAHEISHSW 212 (537)
Q Consensus 199 ~~~~viaHEiAHqW 212 (537)
....++||||+|.+
T Consensus 141 ~~~~~~AHEiGH~l 154 (196)
T PF13688_consen 141 NGAITFAHEIGHNL 154 (196)
T ss_dssp HHHHHHHHHHHHHT
T ss_pred ceehhhHHhHHHhc
Confidence 44689999999997
No 44
>cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods.
Probab=35.58 E-value=56 Score=31.42 Aligned_cols=14 Identities=21% Similarity=0.252 Sum_probs=11.9
Q ss_pred chhhhHhhhhhhhh
Q psy2731 199 SLAAVVAHEISHSW 212 (537)
Q Consensus 199 ~~~~viaHEiAHqW 212 (537)
..+.++||||+|..
T Consensus 144 ~~~~~~AHElGH~l 157 (220)
T cd04272 144 YGVYTMTHELAHLL 157 (220)
T ss_pred ccHHHHHHHHHHHh
Confidence 45789999999985
No 45
>COG0501 HtpX Zn-dependent protease with chaperone function [Posttranslational modification, protein turnover, chaperones]
Probab=34.32 E-value=19 Score=36.28 Aligned_cols=20 Identities=40% Similarity=0.561 Sum_probs=15.2
Q ss_pred CCcchhhhHhhhhhhhhccC
Q psy2731 196 GDGSLAAVVAHEISHSWTGN 215 (537)
Q Consensus 196 ~~~~~~~viaHEiAHqWfGn 215 (537)
.+..+..|++||++|.=-++
T Consensus 153 ~~dEl~aVlaHElgHi~~rd 172 (302)
T COG0501 153 NDDELEAVLAHELGHIKNRH 172 (302)
T ss_pred CHHHHHHHHHHHHHHHhccc
Confidence 34568899999999985443
No 46
>PRK02391 heat shock protein HtpX; Provisional
Probab=33.18 E-value=48 Score=33.71 Aligned_cols=19 Identities=42% Similarity=0.552 Sum_probs=14.6
Q ss_pred CcchhhhHhhhhhhhhccC
Q psy2731 197 DGSLAAVVAHEISHSWTGN 215 (537)
Q Consensus 197 ~~~~~~viaHEiAHqWfGn 215 (537)
+..+..|+|||++|--=++
T Consensus 130 ~~El~aVlaHElgHi~~~d 148 (296)
T PRK02391 130 PDELEAVLAHELSHVKNRD 148 (296)
T ss_pred HHHHHHHHHHHHHHHHcCC
Confidence 3568899999999965443
No 47
>PRK04351 hypothetical protein; Provisional
Probab=33.12 E-value=13 Score=33.68 Aligned_cols=15 Identities=27% Similarity=0.479 Sum_probs=12.6
Q ss_pred CcchhhhHhhhhhhh
Q psy2731 197 DGSLAAVVAHEISHS 211 (537)
Q Consensus 197 ~~~~~~viaHEiAHq 211 (537)
...+..+|+||++|-
T Consensus 58 ~~~l~~vv~HElcH~ 72 (149)
T PRK04351 58 LEELIGIIKHELCHY 72 (149)
T ss_pred HHHHHhhHHHHHHHH
Confidence 456789999999995
No 48
>PF05960 DUF885: Bacterial protein of unknown function (DUF885); InterPro: IPR010281 This family consists of hypothetical bacterial proteins.; PDB: 3O0Y_B 3U24_A 3IUK_A.
Probab=29.57 E-value=3.9e+02 Score=29.41 Aligned_cols=45 Identities=22% Similarity=0.239 Sum_probs=28.4
Q ss_pred cchhhhHhhhhh--hhhccCc---cc--CCCCch-----hhHHHHHHHHHHHHHHhh
Q psy2731 198 GSLAAVVAHEIS--HSWTGNL---VT--NRNFEH-----FWLNEGFTMFVERKITGR 242 (537)
Q Consensus 198 ~~~~~viaHEiA--HqWfGnl---Vt--~~~w~~-----~WLnEGfa~y~e~~~~~~ 242 (537)
..+..+++||-. |++=... .. ...++. .=..||+|.|.|.+.++.
T Consensus 370 ~~~~~~~~HE~~PGHh~Q~~~~~~~~~~~~~~r~~~~~~~~~~EGWAlY~E~l~~~e 426 (549)
T PF05960_consen 370 YSLYTTAAHEAYPGHHLQIAYANEHPDKLPPFRRLLGSYTAFVEGWALYAEQLMYDE 426 (549)
T ss_dssp TCHHHHHHHHSTTTHHHHHHHHHHTCCGS-HHHHHC---HHHHHHHHHHHHHHHHHH
T ss_pred hhHHHHHHhhcCCcHHHHHHHHHHccccCcHHHHhccccCchhhHHHHHHHHHHHHh
Confidence 467789999974 7763311 11 122222 246899999999988454
No 49
>PF01435 Peptidase_M48: Peptidase family M48 This is family M48 in the peptidase classification. ; InterPro: IPR001915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M48 (Ste24 endopeptidase family, clan M-); members of both subfamily are represented. The members of this set of proteins are mostly described as probable protease htpX homologue (3.4.24 from EC) or CAAX prenyl protease 1, which proteolytically removes the C-terminal three residues of farnesylated proteins. They are integral membrane proteins associated with the endoplasmic reticulum and Golgi, binding one zinc ion per subunit. In Saccharomyces cerevisiae (Baker's yeast) Ste24p is required for the first NH2-terminal proteolytic processing event within the a-factor precursor, which takes place after COOH-terminal CAAX modification is complete. The Ste24p contains multiple predicted membrane spans, a zinc metalloprotease motif (HEXXH), and a COOH-terminal ER retrieval signal (KKXX). The HEXXH protease motif is critical for Ste24p activity, since Ste24p fails to function when conserved residues within this motif are mutated. The Ste24p homologues occur in a diverse group of organisms, including Escherichia coli, Schizosaccharomyces pombe (Fission yeast), Haemophilus influenzae, and Homo sapiens (Human), which indicates that the gene is highly conserved throughout evolution. Ste24p and the proteins related to it define a subfamily of proteins that are likely to function as intracellular, membrane-associated zinc metalloproteases []. HtpX is a zinc-dependent endoprotease member of the membrane-localized proteolytic system in E. coli, which participates in the proteolytic quality control of membrane proteins in conjunction with FtsH, a membrane-bound and ATP-dependent protease. Biochemical characterisation revealed that HtpX undergoes self-degradation upon cell disruption or membrane solubilization. It can also degraded casein and cleaves solubilized membrane proteins, for example, SecY []. Expression of HtpX in the plasma membrane is under the control of CpxR, with the metalloproteinase active site of HtpX located on the cytosolic side of the membrane. This suggests a potential role for HtpX in the response to mis-folded proteins [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0016020 membrane; PDB: 3CQB_A 3C37_B.
Probab=29.36 E-value=21 Score=34.03 Aligned_cols=22 Identities=36% Similarity=0.542 Sum_probs=17.7
Q ss_pred cchhhhHhhhhhhhhccCcccC
Q psy2731 198 GSLAAVVAHEISHSWTGNLVTN 219 (537)
Q Consensus 198 ~~~~~viaHEiAHqWfGnlVt~ 219 (537)
..+..++|||++|-.-+...-.
T Consensus 87 ~el~aVlaHElgH~~~~h~~~~ 108 (226)
T PF01435_consen 87 DELAAVLAHELGHIKHRHILKS 108 (226)
T ss_dssp HHHHHHHHHHHHHHHTTHCCCC
T ss_pred HHHHHHHHHHHHHHHcCCcchH
Confidence 4678899999999988765544
No 50
>PRK05457 heat shock protein HtpX; Provisional
Probab=29.31 E-value=46 Score=33.61 Aligned_cols=19 Identities=47% Similarity=0.658 Sum_probs=14.8
Q ss_pred CcchhhhHhhhhhhhhccC
Q psy2731 197 DGSLAAVVAHEISHSWTGN 215 (537)
Q Consensus 197 ~~~~~~viaHEiAHqWfGn 215 (537)
+..+..|+|||++|.=-|+
T Consensus 131 ~~El~aVlAHElgHi~~~d 149 (284)
T PRK05457 131 RDEVEAVLAHEISHIANGD 149 (284)
T ss_pred HHHHHHHHHHHHHHHHcCC
Confidence 3578899999999975443
No 51
>PF11658 DUF3260: Protein of unknown function (DUF3260); InterPro: IPR017744 This protein was identified by the partial phylogenetic profiling algorithm [] as part of the system for cellulose biosynthesis in bacteria, and in fact is found in cellulose biosynthesis gene regions. The protein was designated YhjU in Salmonella enteritidis, where disruption of its gene disrupts cellulose biosynthesis and biofilm formation [].
Probab=28.47 E-value=68 Score=34.84 Aligned_cols=107 Identities=11% Similarity=0.077 Sum_probs=66.9
Q ss_pred cCCCccccccc-CcCCc-cch-hhHHHHHHHHHhcccccchhhhhhhhh--hH-HHH-----Hh-hh-ccCCCchhhHHH
Q psy2731 278 SHSWTGNLVTN-RNFEH-FWL-NEGFTMFVERKITGRLRGEAERHFDAL--SG-LKD-----LK-QA-VSSTGPLWDSKR 344 (537)
Q Consensus 278 ~~pl~g~~~~~-~~f~~-i~Y-G~~~l~~L~~~lG~~~f~~~~~~~~~l--~y-~~~-----~~-~~-~~t~~~~~df~~ 344 (537)
.||+-...+.- +.|++ .+| |++.+|.||.--|+..-..+. ..+= -| +++ |. +. .+..+..++|..
T Consensus 218 ~hPl~~~FDi~F~nFNSAtSYSGPAaIRLLRASCGQ~sH~~Ly--~pa~~qC~LF~nLa~lGf~~~l~mnHdG~Fd~Fl~ 295 (518)
T PF11658_consen 218 NHPLWKRFDIVFDNFNSATSYSGPAAIRLLRASCGQPSHSDLY--QPAPQQCYLFDNLAKLGFTQQLMMNHDGHFDNFLQ 295 (518)
T ss_pred cCchHHhhcchhcccccccccchHHHHHHHHhccCCcchHhhc--CCCcccccHHHHHHhcCCchhhccCCCCccccHHH
Confidence 56754233322 56654 789 999999999999976531100 0000 11 222 22 11 222355688888
Q ss_pred HHHHHhcCcchHHHHhh--hhcCCCCccccccccchhHHHHHHHHHHhhcC
Q psy2731 345 NALDFQKGRHYQCKIER--LKCGSAILFIYGYDTSLQDVCNDLANRWISWN 393 (537)
Q Consensus 345 ~l~~~~~~~~~~~~~~~--l~~~G~P~~~~~~~~~~~~~~~~l~~~w~~~~ 393 (537)
.+.+. |+ ++. +.+.|+|+....||.+-.-...++-++|.+..
T Consensus 296 ~ir~~-G~------l~~pl~s~~g~~~~~~~FDgSpI~~D~~vL~rW~~~r 339 (518)
T PF11658_consen 296 EIRED-GG------LQAPLMSQAGLPVALHSFDGSPIYDDLAVLNRWLQQR 339 (518)
T ss_pred HHHHc-CC------CCCCCcCCCCCchHhhccCCCcccchHHHHHHHHHHH
Confidence 88877 32 223 78899999999999887777777778998654
No 52
>PRK03072 heat shock protein HtpX; Provisional
Probab=27.12 E-value=25 Score=35.56 Aligned_cols=15 Identities=40% Similarity=0.691 Sum_probs=12.6
Q ss_pred CcchhhhHhhhhhhh
Q psy2731 197 DGSLAAVVAHEISHS 211 (537)
Q Consensus 197 ~~~~~~viaHEiAHq 211 (537)
+..+..|+|||++|-
T Consensus 124 ~~El~aVlAHElgHi 138 (288)
T PRK03072 124 ERELRGVLGHELSHV 138 (288)
T ss_pred HHHHHHHHHHHHHHH
Confidence 356889999999995
No 53
>COG2856 Predicted Zn peptidase [Amino acid transport and metabolism]
Probab=27.08 E-value=79 Score=30.51 Aligned_cols=42 Identities=12% Similarity=0.060 Sum_probs=26.8
Q ss_pred chhhhHhhhhhhhhccCcc-----cCCCCchhhHHHHHHHHHHHHHH
Q psy2731 199 SLAAVVAHEISHSWTGNLV-----TNRNFEHFWLNEGFTMFVERKIT 240 (537)
Q Consensus 199 ~~~~viaHEiAHqWfGnlV-----t~~~w~~~WLnEGfa~y~e~~~~ 240 (537)
.-..++|||++|-|++--. ....|..-=.-|--|++++..++
T Consensus 71 r~rFtlAHELGH~llH~~~~~~~~~~~~~~~~~~~E~~AN~FAa~lL 117 (213)
T COG2856 71 RKRFTLAHELGHALLHTDLNTRFDAEPTLQQDRKIEAEANAFAAELL 117 (213)
T ss_pred HHHHHHHHHHhHHHhccccchhhhcccccchhHHHHHHHHHHHHHHh
Confidence 3467999999999997332 22333444456677777665543
No 54
>COG5504 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=25.68 E-value=2.8e+02 Score=27.43 Aligned_cols=39 Identities=26% Similarity=0.237 Sum_probs=26.9
Q ss_pred CCcchhhhHhhhhhhh-------hccCcccCCCCchhhHHHHHHHHHHH
Q psy2731 196 GDGSLAAVVAHEISHS-------WTGNLVTNRNFEHFWLNEGFTMFVER 237 (537)
Q Consensus 196 ~~~~~~~viaHEiAHq-------WfGnlVt~~~w~~~WLnEGfa~y~e~ 237 (537)
...++-.+||||.=|- |=+..||-- |.=+-||+|.+..+
T Consensus 136 ~~~~v~aliaHE~HH~~R~~~i~~~eg~vtLl---e~lV~EGLAE~av~ 181 (280)
T COG5504 136 TITSVPALIAHEYHHNCRLRYIDYGEGSVTLL---EALVMEGLAEHAVF 181 (280)
T ss_pred CccchHHHHHHHHHhhheecccccCCCceeHH---HHHHHHHHHHHHHH
Confidence 4457788999999986 333334433 66788999988543
No 55
>PF13699 DUF4157: Domain of unknown function (DUF4157)
Probab=25.39 E-value=35 Score=27.30 Aligned_cols=13 Identities=38% Similarity=0.496 Sum_probs=10.8
Q ss_pred hhhhHhhhhhhhh
Q psy2731 200 LAAVVAHEISHSW 212 (537)
Q Consensus 200 ~~~viaHEiAHqW 212 (537)
-..+++||++|-+
T Consensus 61 ~~~llaHEl~Hv~ 73 (79)
T PF13699_consen 61 GRALLAHELAHVV 73 (79)
T ss_pred cchhHhHHHHHHH
Confidence 3679999999965
No 56
>PRK01345 heat shock protein HtpX; Provisional
Probab=25.36 E-value=28 Score=35.74 Aligned_cols=18 Identities=33% Similarity=0.470 Sum_probs=14.1
Q ss_pred CcchhhhHhhhhhhhhcc
Q psy2731 197 DGSLAAVVAHEISHSWTG 214 (537)
Q Consensus 197 ~~~~~~viaHEiAHqWfG 214 (537)
+..+..|+||||+|-==+
T Consensus 121 ~dEL~aVlAHElgHi~~~ 138 (317)
T PRK01345 121 PEEVAGVMAHELAHVKNR 138 (317)
T ss_pred HHHHHHHHHHHHHHHHcC
Confidence 357889999999997433
No 57
>PF09768 Peptidase_M76: Peptidase M76 family; InterPro: IPR019165 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. Mitochondrial inner membrane protease ATP23 has two roles in the assembly of mitochondrial ATPase. Firstly, it acts as a protease that removes the N-terminal 10 residues of mitochondrial ATPase CF(0) subunit 6 (ATP6) at the intermembrane space side. Secondly, it is involved in the correct assembly of the membrane-embedded ATPase CF(0) particle, probably mediating association of ATP6 with the subunit 9 ring [, ].; GO: 0004222 metalloendopeptidase activity
Probab=25.27 E-value=1.4e+02 Score=27.78 Aligned_cols=18 Identities=28% Similarity=0.649 Sum_probs=14.8
Q ss_pred cCCcchhhhHhhhhhhhh
Q psy2731 195 AGDGSLAAVVAHEISHSW 212 (537)
Q Consensus 195 ~~~~~~~~viaHEiAHqW 212 (537)
.....+..+|+|||.|.|
T Consensus 66 ~~~~~l~~~l~HELIHay 83 (173)
T PF09768_consen 66 RSQGHLEDTLTHELIHAY 83 (173)
T ss_pred CCHHHHHHHHHHHHHHHH
Confidence 444567899999999987
No 58
>PF09471 Peptidase_M64: IgA Peptidase M64; InterPro: IPR019026 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This is a family of highly selective metallo-endopeptidases belonging to the MEROPS peptidase family M64 (IgA peptidase, clan MA). The primary structure of the Clostridium ramosum IgA peptidase shows no significant overall similarity to any other known metallo-endopeptidase []. ; PDB: 3P1V_A 4DF9_D.
Probab=24.80 E-value=76 Score=31.71 Aligned_cols=16 Identities=50% Similarity=0.808 Sum_probs=12.7
Q ss_pred chhhhHhhhhhhhhcc
Q psy2731 199 SLAAVVAHEISHSWTG 214 (537)
Q Consensus 199 ~~~~viaHEiAHqWfG 214 (537)
....+++||+.|+..|
T Consensus 215 ~~~~v~vHE~GHsf~~ 230 (264)
T PF09471_consen 215 SFKQVVVHEFGHSFGG 230 (264)
T ss_dssp THHHHHHHHHHHHTT-
T ss_pred cccceeeeeccccccc
Confidence 5678999999999643
No 59
>PRK04897 heat shock protein HtpX; Provisional
Probab=24.47 E-value=30 Score=35.18 Aligned_cols=15 Identities=47% Similarity=0.680 Sum_probs=12.8
Q ss_pred CcchhhhHhhhhhhh
Q psy2731 197 DGSLAAVVAHEISHS 211 (537)
Q Consensus 197 ~~~~~~viaHEiAHq 211 (537)
+..+..|+|||++|-
T Consensus 134 ~~El~aVlAHElgHi 148 (298)
T PRK04897 134 REELEGVIGHEISHI 148 (298)
T ss_pred HHHHHHHHHHHHHHH
Confidence 457889999999995
No 60
>cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=24.30 E-value=38 Score=31.79 Aligned_cols=14 Identities=36% Similarity=0.605 Sum_probs=11.9
Q ss_pred chhhhHhhhhhhhh
Q psy2731 199 SLAAVVAHEISHSW 212 (537)
Q Consensus 199 ~~~~viaHEiAHqW 212 (537)
..+.++||||+|+.
T Consensus 130 ~~a~~~AHElGH~l 143 (194)
T cd04269 130 LFAVTMAHELGHNL 143 (194)
T ss_pred HHHHHHHHHHHhhc
Confidence 44789999999996
No 61
>TIGR03368 cellulose_yhjU cellulose synthase operon protein YhjU. This protein was identified by the partial phylogenetic profiling algorithm (PubMed:16930487) as part of the system for cellulose biosynthesis in bacteria, and in fact is found in cellulose biosynthesis gene regions. The protein was designated YhjU in Salmonella enteritidis, where disruption of its gene disrupts cellulose biosynthesis and biofilm formation (PubMed:11929533).
Probab=24.29 E-value=93 Score=33.73 Aligned_cols=106 Identities=10% Similarity=0.116 Sum_probs=65.2
Q ss_pred cCCCccccccc-CcCCc-cch-hhHHHHHHHHHhcccccchhhhhhhhh---hH-HHH-----Hh-hh-ccCCCchhhHH
Q psy2731 278 SHSWTGNLVTN-RNFEH-FWL-NEGFTMFVERKITGRLRGEAERHFDAL---SG-LKD-----LK-QA-VSSTGPLWDSK 343 (537)
Q Consensus 278 ~~pl~g~~~~~-~~f~~-i~Y-G~~~l~~L~~~lG~~~f~~~~~~~~~l---~y-~~~-----~~-~~-~~t~~~~~df~ 343 (537)
.||+-...+.- +.|++ .+| |+..+|.||.--|++.-..+ ++.- -| +++ |. +. .+..+..++|.
T Consensus 216 ~hPl~~~FDilF~nFnSAtSYSGPAaIRlLRASCGQ~~H~~L---Y~pa~~qCyLF~nLa~lGf~~~l~lnHdG~Fd~fl 292 (518)
T TIGR03368 216 NHPLLSRFDILFDNFNSATSYSGPAAIRLLRASCGQESHADL---YDPAPEQCHLFSNLAKLGFTENLLLNHDGHFDNFL 292 (518)
T ss_pred cCchHHhhcchhhccccccccchHHHHHHHHhccCCcchHhh---cCCccccchHHHHHHHcCCchhhcccCCCccchHH
Confidence 57764233322 56655 789 99999999999996543110 1110 12 122 22 11 22234467777
Q ss_pred HHHHHHhcCcchHHHHhh--hhcCCCCccccccccchhHHHHHHHHHHhhcC
Q psy2731 344 RNALDFQKGRHYQCKIER--LKCGSAILFIYGYDTSLQDVCNDLANRWISWN 393 (537)
Q Consensus 344 ~~l~~~~~~~~~~~~~~~--l~~~G~P~~~~~~~~~~~~~~~~l~~~w~~~~ 393 (537)
+.+.+. |+ ++. ..+.|+|+-...||.+......++-++|.+..
T Consensus 293 ~~ir~~-G~------~~~pl~~~~g~~~~~~aFDGSpIy~D~~vL~rW~~~r 337 (518)
T TIGR03368 293 QLVREN-GG------MQSPLMSQTGLPVAQRSFDGSPIYDDYAVLNRWLQER 337 (518)
T ss_pred HHHHHc-CC------CCCCCcCccCCcHHHhccCCCcccchHHHHHHHHHHh
Confidence 777766 32 222 78899999888999887777677778997654
No 62
>PRK02870 heat shock protein HtpX; Provisional
Probab=23.69 E-value=32 Score=35.67 Aligned_cols=16 Identities=50% Similarity=0.677 Sum_probs=13.3
Q ss_pred CCcchhhhHhhhhhhh
Q psy2731 196 GDGSLAAVVAHEISHS 211 (537)
Q Consensus 196 ~~~~~~~viaHEiAHq 211 (537)
.+..+..|+|||++|-
T Consensus 169 ~~dEL~aVlAHELgHi 184 (336)
T PRK02870 169 DRDELQAVMAHELSHI 184 (336)
T ss_pred CHHHHHHHHHHHHHHH
Confidence 3457889999999995
No 63
>PF12174 RST: RCD1-SRO-TAF4 (RST) plant domain; InterPro: IPR022003 This domain is found in many plant proteins including SROs and RCD1s; it is required for interaction with multiple plant transcription factors.
Probab=23.44 E-value=1.4e+02 Score=23.43 Aligned_cols=46 Identities=11% Similarity=0.107 Sum_probs=37.4
Q ss_pred hHHHHHHHHHhcccccchhhhhhhhh-hHHHHHh-hhccCCCchhhHHHHHHHHhcCcc
Q psy2731 298 EGFTMFVERKITGRLRGEAERHFDAL-SGLKDLK-QAVSSTGPLWDSKRNALDFQKGRH 354 (537)
Q Consensus 298 ~~~l~~L~~~lG~~~f~~~~~~~~~l-~y~~~~~-~~~~t~~~~~df~~~l~~~~~~~~ 354 (537)
+.++.+|...++.+. ..-+ .++++|+ +-++- ++|...+....|++.
T Consensus 12 ~~L~~~l~~~l~~~~-------~~~l~~~Y~~~k~~kIsR----~~fvr~lR~IVGD~l 59 (70)
T PF12174_consen 12 PMLFSALSKHLPPSK-------MDLLQKHYEEFKKKKISR----EEFVRKLRQIVGDQL 59 (70)
T ss_pred HHHHHHHHHHCCHHH-------HHHHHHHHHHHHHCCCCH----HHHHHHHHHHHHHHH
Confidence 467888999888644 3445 8999999 99999 999999999988543
No 64
>PF12388 Peptidase_M57: Dual-action HEIGH metallo-peptidase; InterPro: IPR024653 This entry represents the metallopeptidases M10, M27 and M57. The catalytic triad for proteases in this entry is HE-H-H, which in many members is in the sequence motif HEIGH [].
Probab=22.89 E-value=61 Score=31.16 Aligned_cols=13 Identities=38% Similarity=0.723 Sum_probs=10.8
Q ss_pred chhhhHhhhhhhh
Q psy2731 199 SLAAVVAHEISHS 211 (537)
Q Consensus 199 ~~~~viaHEiAHq 211 (537)
....+|+|||.|-
T Consensus 132 ~~~hvi~HEiGH~ 144 (211)
T PF12388_consen 132 VIEHVITHEIGHC 144 (211)
T ss_pred HHHHHHHHHhhhh
Confidence 3567999999997
No 65
>COG1451 Predicted metal-dependent hydrolase [General function prediction only]
Probab=22.86 E-value=35 Score=33.17 Aligned_cols=19 Identities=32% Similarity=0.433 Sum_probs=15.8
Q ss_pred CcchhhhHhhhhhhhhccC
Q psy2731 197 DGSLAAVVAHEISHSWTGN 215 (537)
Q Consensus 197 ~~~~~~viaHEiAHqWfGn 215 (537)
+.-+..+++||+||-=..|
T Consensus 172 ~~~i~YVvvHELaHLke~n 190 (223)
T COG1451 172 EEVIDYVVVHELAHLKEKN 190 (223)
T ss_pred HHHHHHHHHHHHHHHhhhh
Confidence 3457889999999987777
No 66
>PF14891 Peptidase_M91: Effector protein
Probab=22.84 E-value=43 Score=31.06 Aligned_cols=15 Identities=27% Similarity=0.461 Sum_probs=12.4
Q ss_pred chhhhHhhhhhhhhc
Q psy2731 199 SLAAVVAHEISHSWT 213 (537)
Q Consensus 199 ~~~~viaHEiAHqWf 213 (537)
.-..+++|||+|-|=
T Consensus 102 ~p~v~L~HEL~HA~~ 116 (174)
T PF14891_consen 102 PPFVVLYHELIHAYD 116 (174)
T ss_pred HHHHHHHHHHHHHHH
Confidence 346799999999984
No 67
>PF13582 Reprolysin_3: Metallo-peptidase family M12B Reprolysin-like; PDB: 3P24_C.
Probab=22.70 E-value=43 Score=28.67 Aligned_cols=13 Identities=38% Similarity=0.465 Sum_probs=10.9
Q ss_pred hhhhHhhhhhhhh
Q psy2731 200 LAAVVAHEISHSW 212 (537)
Q Consensus 200 ~~~viaHEiAHqW 212 (537)
...+++|||.|..
T Consensus 107 ~~~~~~HEiGH~l 119 (124)
T PF13582_consen 107 GVDTFAHEIGHNL 119 (124)
T ss_dssp STTHHHHHHHHHT
T ss_pred cceEeeehhhHhc
Confidence 3589999999974
No 68
>cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha.
Probab=22.43 E-value=43 Score=32.95 Aligned_cols=13 Identities=38% Similarity=0.654 Sum_probs=11.2
Q ss_pred hhhhHhhhhhhhh
Q psy2731 200 LAAVVAHEISHSW 212 (537)
Q Consensus 200 ~~~viaHEiAHqW 212 (537)
...++||||+|.+
T Consensus 167 ~a~t~AHElGHnl 179 (244)
T cd04270 167 SDLVTAHELGHNF 179 (244)
T ss_pred HHHHHHHHHHHhc
Confidence 3579999999997
No 69
>PF15641 Tox-MPTase5: Metallopeptidase toxin 5
Probab=21.17 E-value=74 Score=25.88 Aligned_cols=18 Identities=28% Similarity=0.540 Sum_probs=11.6
Q ss_pred chhhhHhhh-hhhhhccCc
Q psy2731 199 SLAAVVAHE-ISHSWTGNL 216 (537)
Q Consensus 199 ~~~~viaHE-iAHqWfGnl 216 (537)
.+..+|+|| +-|-||---
T Consensus 63 ~lr~~iiheelhhrw~~rg 81 (109)
T PF15641_consen 63 ELRNTIIHEELHHRWWKRG 81 (109)
T ss_pred HHHHHHHHHHHHHHHHHhh
Confidence 456777775 666787543
No 70
>PRK01265 heat shock protein HtpX; Provisional
Probab=20.58 E-value=40 Score=34.73 Aligned_cols=15 Identities=33% Similarity=0.567 Sum_probs=12.7
Q ss_pred CcchhhhHhhhhhhh
Q psy2731 197 DGSLAAVVAHEISHS 211 (537)
Q Consensus 197 ~~~~~~viaHEiAHq 211 (537)
+..+..|+|||++|-
T Consensus 137 ~~El~aVlAHElgHi 151 (324)
T PRK01265 137 RDEIKAVAGHELGHL 151 (324)
T ss_pred HHHHHHHHHHHHHHH
Confidence 356889999999994
No 71
>PF11140 DUF2913: Protein of unknown function (DUF2913); InterPro: IPR021316 This family of proteins with unknown function appear to be restricted to Gammaproteobacteria.
Probab=20.50 E-value=76 Score=30.43 Aligned_cols=37 Identities=27% Similarity=0.483 Sum_probs=28.7
Q ss_pred CCCCcHHHHHHHHHHHHhCc-ccchhHHHHHhHhccCc
Q psy2731 441 NGVRNCEIRFRWLKLCLKAR-WKEQVPHVIDMVTSQGR 477 (537)
Q Consensus 441 ~~~~N~Ei~~rW~~l~i~~~-~~~~~~~~~~fl~~~GR 477 (537)
+..++.++++||+.-++|.+ |...+..=.+++..+||
T Consensus 29 s~~~~n~fL~~WL~~alK~krF~k~v~~dlk~ll~~gR 66 (207)
T PF11140_consen 29 SPAQRNHFLVRWLKPALKQKRFSKLVKKDLKWLLQQGR 66 (207)
T ss_pred CHHHHHHHHHHHHHHHHcCcccchhHHHHHHHHHHhCC
Confidence 45667779999999999965 64555556678888898
Done!