Query psy1977
Match_columns 764
No_of_seqs 252 out of 1358
Neff 7.9
Searched_HMMs 46136
Date Fri Aug 16 21:03:02 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy1977.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/1977hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG3624|consensus 100.0 1E-131 3E-136 1171.4 63.3 645 83-764 38-687 (687)
2 COG3590 PepO Predicted metallo 100.0 6E-126 1E-130 1012.8 50.0 636 96-764 9-654 (654)
3 PF05649 Peptidase_M13_N: Pept 100.0 1.7E-64 3.8E-69 562.3 35.8 383 108-496 1-390 (390)
4 PF01431 Peptidase_M13: Peptid 100.0 1.9E-59 4E-64 476.1 11.4 204 557-763 1-206 (206)
5 PF09768 Peptidase_M76: Peptid 84.1 1.1 2.5E-05 43.7 3.8 63 534-608 24-87 (173)
6 PF14247 DUF4344: Domain of un 83.1 0.64 1.4E-05 47.3 1.7 52 554-605 47-105 (220)
7 PF01447 Peptidase_M4: Thermol 66.8 2.4 5.2E-05 40.6 0.8 41 554-605 108-148 (150)
8 PF04228 Zn_peptidase: Putativ 61.6 3.8 8.2E-05 43.7 1.2 86 553-644 134-226 (292)
9 KOG2661|consensus 61.2 5.6 0.00012 42.3 2.3 52 543-605 235-288 (424)
10 KOG0506|consensus 60.7 7.3 0.00016 43.3 3.1 102 489-613 167-272 (622)
11 KOG3314|consensus 56.4 4.8 0.0001 38.4 0.7 21 590-610 89-109 (194)
12 PRK02870 heat shock protein Ht 51.1 29 0.00063 37.9 5.8 40 554-603 142-184 (336)
13 PF01435 Peptidase_M48: Peptid 45.1 26 0.00056 35.4 4.2 42 555-606 60-103 (226)
14 cd04270 ZnMc_TACE_like Zinc-de 44.2 14 0.00031 38.4 2.1 21 591-614 166-186 (244)
15 cd04269 ZnMc_adamalysin_II_lik 43.8 21 0.00046 35.5 3.2 25 588-615 127-151 (194)
16 PF13485 Peptidase_MA_2: Pepti 40.6 30 0.00065 31.0 3.5 55 558-627 4-58 (128)
17 PRK01265 heat shock protein Ht 38.4 60 0.0013 35.3 5.8 40 554-603 109-151 (324)
18 cd04272 ZnMc_salivary_gland_MP 38.1 38 0.00083 34.5 4.1 19 593-614 146-164 (220)
19 PRK01345 heat shock protein Ht 37.6 52 0.0011 35.7 5.2 41 554-604 93-136 (317)
20 cd04273 ZnMc_ADAMTS_like Zinc- 37.5 15 0.00033 37.0 1.0 20 592-614 140-159 (207)
21 KOG1047|consensus 35.6 21 0.00047 40.8 1.8 36 592-636 288-329 (613)
22 COG4227 Antirestriction protei 33.5 25 0.00054 36.3 1.8 42 557-609 179-220 (316)
23 PF03002 Somatostatin: Somatos 33.3 15 0.00033 21.6 0.1 9 104-112 3-11 (18)
24 PF12315 DUF3633: Protein of u 32.9 22 0.00047 35.7 1.3 15 591-605 92-106 (212)
25 COG0501 HtpX Zn-dependent prot 31.2 64 0.0014 34.3 4.7 41 554-604 126-169 (302)
26 COG3227 LasB Zinc metalloprote 30.9 31 0.00067 38.9 2.1 42 595-644 340-384 (507)
27 PF01421 Reprolysin: Reprolysi 30.7 35 0.00075 34.1 2.3 18 587-604 126-143 (199)
28 TIGR03076 near_not_gcvH Chlamy 30.5 1E+02 0.0023 35.1 6.0 67 616-701 211-277 (686)
29 PRK03001 M48 family peptidase; 27.2 93 0.002 33.1 5.0 41 554-604 93-136 (283)
30 cd04267 ZnMc_ADAM_like Zinc-de 26.3 38 0.00082 33.5 1.7 22 590-614 131-152 (192)
31 TIGR01089 fucI L-fucose isomer 26.1 4.4E+02 0.0094 30.9 10.1 56 569-624 311-417 (587)
32 PF14891 Peptidase_M91: Effect 24.7 30 0.00064 34.0 0.6 14 593-606 104-117 (174)
33 PRK03072 heat shock protein Ht 24.6 1.2E+02 0.0026 32.4 5.2 55 537-604 82-139 (288)
34 PF15601 Imm42: Immunity prote 24.2 2.6E+02 0.0057 26.2 6.6 83 431-530 37-124 (134)
35 PRK03982 heat shock protein Ht 23.0 1.3E+02 0.0028 32.2 5.1 41 554-604 94-137 (288)
36 PF13688 Reprolysin_5: Metallo 23.0 56 0.0012 32.4 2.2 26 587-615 137-162 (196)
37 PF13253 DUF4044: Protein of u 22.4 1.8E+02 0.0039 20.6 3.9 28 12-40 3-30 (35)
38 PRK04897 heat shock protein Ht 21.3 1.4E+02 0.003 32.0 5.0 40 554-603 106-148 (298)
39 PF10835 DUF2573: Protein of u 21.2 3.8E+02 0.0082 22.6 6.1 64 396-463 4-72 (82)
40 KOG0718|consensus 21.0 1.4E+02 0.0031 33.7 4.8 66 438-512 35-106 (546)
No 1
>KOG3624|consensus
Probab=100.00 E-value=1.5e-131 Score=1171.45 Aligned_cols=645 Identities=41% Similarity=0.795 Sum_probs=604.9
Q ss_pred CCCCchhHHHHHHHHHhcCCCCCCCCCchhhhhcchhhhccCCCCCCcccchHHHHHHHHHHHHHHHhcCCCCCCCCcHH
Q psy1977 83 KICFTEGCVHTASNLLKNMDLNVDPCDDFYRFSCGKFLKETQIPDDKTAQTAFSVISDALEEQLRTIVEEPIRENEPKPF 162 (764)
Q Consensus 83 ~~C~s~~Ci~~a~~l~~~md~svdPCdDFY~yaCG~W~~~~~ip~~~~~~~~f~~l~~~~~~~lk~lLe~~~~~~~~~a~ 162 (764)
.+|.+++|+..|+.|+++||.|||||+|||+||||+|.+++++|+.. ++++|..+++.+...++++|+++....+++++
T Consensus 38 ~~~~~~~~~~~a~~l~~~~d~svdPC~dFy~~ACG~~~~~~~~~~~~-~~~~~~~~~~~~~~~l~~~L~~~~~~~~s~~~ 116 (687)
T KOG3624|consen 38 DVCESPECVTLAHQLLNYMDTSVDPCEDFYQYACGNWAENHPIPKDK-RSSTLDELQDQVLRQLKELLEEPNSSSTSKAE 116 (687)
T ss_pred CccCCHHHHHHHHHHHHcCCCCCCcchhHHHHHhCcHhhcCCCcccc-cccHHHHHHHHHHHHHHHHHhCCCCCCCCHHH
Confidence 68999999999999999999999999999999999999999999988 99999999999999999999987766688999
Q ss_pred HHHHHHHHHHHhhhhhhhcc-cchHHHHHHhcCCCCCCCCCCCcccchhHHHHHHHHHHccCCCcceEEEEEeecCCCCc
Q psy1977 163 KLVKNLYKACMNVKNIEILG-LEPMKRMLQDLGGWPVLEGQNWNEETFSWKDSVYKFHENGYSVDYFIDFSIVINVKNNS 241 (764)
Q Consensus 163 ~kak~~Y~sCm~~~~ie~~g-~~pL~~~L~~lGgWP~~~~~~w~~~~~~~~~~l~~l~~~g~g~~~l~~~~V~~D~~nss 241 (764)
++++.||+|||+....+..+ .++++++|+.+||||+.++ +|++++|+|.++++.+... ||..+||.+.|..|..+++
T Consensus 117 ~~~k~~Y~sC~~~~~~~~~~~~~~l~~~i~~~G~wP~l~~-~w~~~~f~~~~~l~~~~~~-yg~~~l~~~~v~~~~~~~~ 194 (687)
T KOG3624|consen 117 RKAKRFYESCLDAKALESSGALQLLFRIIQSIGGWPLLEG-NWDESKFNLNEMLANLLRR-YGLTTLFLLEVALDYKNSS 194 (687)
T ss_pred HHHHHHHHHHhchhhhhhhcchHHHHHHHHHhCCCcCCCC-CCCcccCCHHHHHHHHHHH-cCccceeEEEEecccccCc
Confidence 99999999999977766664 7899999999999999998 4999999999999999998 9999999999999999887
Q ss_pred ceEEEEeCCcCCCchh--hccccccHHHHHHHHHHHHHHHHHhCCChHHHHHHHHHHHHHHHHHhcccCChhhhcccccc
Q psy1977 242 ERIIDLDQATLGLSRE--YLVKGINEKLVNAYYRYMIDIAVLLGADKDVATEELKESLEFEIKLANISLPLEQRRNATKL 319 (764)
Q Consensus 242 ~~ii~idqp~l~lp~r--yl~~~~~~~~~~aY~~~~~~~~~llg~~~~~~~~~~~~v~~fE~~La~i~~~~~~r~~~~~~ 319 (764)
|+.+++|.+ |+......+.+.+|..++..++.++|.+...++..++++++||.+|++++.+.++|++...+
T Consensus 195 -------~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~e~~l~~~~~~~~~~~~~~~~ 267 (687)
T KOG3624|consen 195 -------QPGLILPSRSKYLATDSDSEKLRAYLLLANELLQLLGLDSDEAEEYARLVIELERQLANITKPDENRRELQAL 267 (687)
T ss_pred -------ccccCcchHhhhhccccHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHhccchhhhhhhHHHH
Confidence 788888877 66665555778899999999999999999999999999999999999999998889999999
Q ss_pred cCccCHHHHhhhCCCCCHHHHHHHhcCCCCCCCCCcEEEEcChhHHHHHHHHHhcCChHHHHHHHHHHHHHHHhhhhcHH
Q psy1977 320 YNPMSLDELQTKFPSIPWVEYLNTLLYPKDRLRETDIVVVDVPNYISKLEELLQTTPKKVQANYALWRATAASVSYLTEA 399 (764)
Q Consensus 320 y~~~tl~eLq~~~p~i~W~~~l~~i~~~~~~i~~~~~V~v~~~~Yl~~L~~ll~~t~~~~l~NYl~wr~v~~~~~~L~~~ 399 (764)
|+.+++.+|++.+|.++|..++..++.. +...+.|+|++++|+++|..+|.+|++++|+|||+||++..+.+.++..
T Consensus 268 ~~~~~l~~l~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~y~~~l~~ii~~t~~~~laNYl~~~~~~~~~~~~~~~ 344 (687)
T KOG3624|consen 268 YNSVNLAELQKKFPSIDWKQLLRSVLGE---ILPHNEVVVFNPDYLVNLSAIIQSTPKRTLANYLIWRLLDDLLDLLPKP 344 (687)
T ss_pred cccccHHHHHHhcccccHHHHhhhhccc---cccCCceEEecHHHHHHHHHHHHhCCHHHHHHHHHHHHHHHHhcccchh
Confidence 9999999999999999999999999876 2333489999999999999999999999999999999999988755443
Q ss_pred HHHHHHHHHHhhcCCCCCCchHhhHHHHHHHHHhhHHHHhHHhhcccHHHHHHHHHHHHHHHHHHHHhhhcCCCCCHHHH
Q psy1977 400 IRNRQLQYSTALSGRTQMEPRWKECVDTVSSSISLAMGALYVRKFFKEDAKANAIEMVELIREEMNKILNTVEWMDNKTR 479 (764)
Q Consensus 400 ~~~~~~~~~~~l~G~~~~~~rw~~Cv~~~~~~~~~a~~~lyv~~~f~~~~k~~~~~m~~~Ik~~f~~~l~~~~Wmd~~tk 479 (764)
.+ ||..|++.+...||.+++++|++++|+++++..+.+|+.+||.+|.++|++++|||++||
T Consensus 345 ~~------------------~~~~Cv~~~~~~~p~a~~~l~~r~~~~~~~~~~~~~m~~~lk~~f~~~l~~~~Wl~~~tr 406 (687)
T KOG3624|consen 345 RK------------------RWADCVELVRTLLPLALGRLYVRNFFDKENKKEVSEMIEDLKKAFEEMLQELDWLDEETR 406 (687)
T ss_pred hH------------------HHHHHHHHHHHhhhhhhHHHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHhCcccCHHHH
Confidence 21 799999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHHHHhcccccccCCccCChhhHHHHhcccccCCCChHHHHHHHHHHHHHHHHH-hhCCCCCCCCcccCCCCceeee
Q psy1977 480 RKALEKAKAMTAHIAYPDELLDDNKISEFYDKLEIDPNNYYKSILNLTKFGTGYSFS-LLRKPVNKTEWISHGHPALVNA 558 (764)
Q Consensus 480 ~~a~~Kl~~m~~~igyP~~~~~~~~L~~~Y~~l~~~~~~yf~n~l~~~~~~~~~~~~-~L~~p~~~~~w~~~~~p~~vna 558 (764)
+.|++|+++|+.+||||++..++..|+++|.++.++.++|++|+..+.++....... .+..++++..|... |..|||
T Consensus 407 ~~a~~K~~~m~~~ig~p~~~~~~~~ld~~y~~l~l~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~na 484 (687)
T KOG3624|consen 407 KSAIKKLNAMKKNIGYPDEFLPPGTLDKEYENLKLSLDDYDSNLEILLKLQRRRSEQLELRAPVDPLDWVGS--PAQVNA 484 (687)
T ss_pred HHHHHHHHHHHHHcCCHHHhcCchhHHHHHhcccCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccc--cceeec
Confidence 999999999999999999999888899999999999999999999999998887777 57888889999988 999999
Q ss_pred eeecCCCeeeeccccccCCCCCCCccccchhhhHHHHHHHHHhhccCccccccCCCCCCcCCCCHhhHHHHHHHHHHHHH
Q psy1977 559 FYSSIENSITFPAGILQGSFFSNDRPKYMNFGAIGFVIGHEITHGFDDQGRQFDKNGNLVDWWQEDTKRSYIEKARCIIE 638 (764)
Q Consensus 559 ~Y~~~~N~Iv~PagiLq~Pff~~~~p~a~nyg~iG~vighEitHaFD~~G~~yd~~Gn~~~wW~~~s~~~f~~~~~C~~~ 638 (764)
||.|.+|.|+|||||||+|||+..+|.++|||+||++|||||+||||+.|++||+.||+++||+.+|.++|.++++|+++
T Consensus 485 ~Y~~~~N~i~~pa~ilq~P~f~~~~P~~~nyg~iG~vigHEl~H~FD~~G~~~D~~Gn~~~ww~~~s~~~f~~~~qC~~~ 564 (687)
T KOG3624|consen 485 FYSPEKNEIVFPAGLLQPPFFDLSYPDYLNYGGIGFVIGHELTHGFDDQGRQYDESGNLRDWWDTESESEFQERAQCLVK 564 (687)
T ss_pred cccCCCceEEEehhcccCCCCCcccchhhhhHHHHHHHHHHHhhcccccccccCCCcccccCCCHhHHHHHHHHHHHHHH
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHcCCccCCcccccCCccchhhhhhhhccHHHHHHHHHHHHhcCCCCCC-CCCCCCCChhhHHHHHHHHHhcccCChhHH
Q psy1977 639 QYGNYSVPEVNLTLNGVNNQGENIADNGGMKEAYNAYVNWAATHPAEPR-LPGLQNFTPQQMFWISAASTWCSKHRPETL 717 (764)
Q Consensus 639 qy~~~~~~~~~~~~ng~~tl~EnIAD~gGl~~Ay~AY~~~~~~~~~e~~-Lpgl~~~t~~QLFFl~~Aq~~C~~~~~e~~ 717 (764)
||++|..+ .++||..|++|||||+||+++||+||++|......+.+ |||++ +|++||||++|||.||+..+|+..
T Consensus 565 qy~~y~~~---~~~ng~~t~~EnIAD~~Gl~~A~~AY~~~~~~~~~~~~~lp~l~-~t~~QLFFl~~Aq~~C~~~~~~~~ 640 (687)
T KOG3624|consen 565 QYSNYLDP---RRLNGSKTLGENIADNGGLKLAYRAYKKWKLDLSGEPRELPGLD-LTPEQLFFLSYAQFFCSSEDPKKL 640 (687)
T ss_pred HHhccCCC---cccCCccccchhhcchhhHHHHHHHHHHHHHhccCCCCCCCCCC-CChhHHHHHHHHHHHhccCChhhh
Confidence 99999988 67899999999999999999999999988765444434 99999 999999999999999999999999
Q ss_pred HhhhhcCCCCCCceeEEEeCCCchhHHhhcCCCCCCCCCCCCccccC
Q psy1977 718 KLRITTGFHSPGEFRVLGPVSNLESFSRDFQCPSGSRMNPVHKCKVW 764 (764)
Q Consensus 718 ~~~~~~d~Hsp~~~RVng~L~N~~eFa~aF~C~~Gs~MnP~~kC~vW 764 (764)
...+.+++|+|+++||||+|+|+|+|++||+||.||||||.+||.||
T Consensus 641 ~~~~~~~~H~~~~~RVng~lsN~~eFa~aF~C~~gS~MnP~~kC~v~ 687 (687)
T KOG3624|consen 641 PESLLTDPHSPESFRVNGVLSNSPEFAEAFNCPIGSPMNPEKKCKVW 687 (687)
T ss_pred hHHhhcCCCCCcceeEeeeccCcHHHHHhcCCCCCCCCCccccCcCC
Confidence 99999999999999999999999999999999999999999999999
No 2
>COG3590 PepO Predicted metalloendopeptidase [Posttranslational modification, protein turnover, chaperones]
Probab=100.00 E-value=6.5e-126 Score=1012.78 Aligned_cols=636 Identities=32% Similarity=0.590 Sum_probs=587.8
Q ss_pred HHHhcCCCCCCCCCchhhhhcchhhhccCCCCCCcccchHHHHHHHHHHHHHHHhcCCCC---CCCCcHHHHHHHHHHHH
Q psy1977 96 NLLKNMDLNVDPCDDFYRFSCGKFLKETQIPDDKTAQTAFSVISDALEEQLRTIVEEPIR---ENEPKPFKLVKNLYKAC 172 (764)
Q Consensus 96 ~l~~~md~svdPCdDFY~yaCG~W~~~~~ip~~~~~~~~f~~l~~~~~~~lk~lLe~~~~---~~~~~a~~kak~~Y~sC 172 (764)
.=++.||.++.||||||+||.|.|.+.++||.++++++.|.+|.++.+..++.+++.-.. ..+....+++..||++-
T Consensus 9 fd~s~~d~~~rpqdDly~~vNG~Wl~~~~IPaDrsr~G~F~~L~d~~e~~~~~~i~~a~a~~~ap~~~~~~~~g~~y~~~ 88 (654)
T COG3590 9 FDLSHMDAMTRPQDDLYGYVNGEWLKTAEIPADRSRDGAFDKLDDRAEALVRDIIEAAAANEQAPEDAILQRIGKLYRSF 88 (654)
T ss_pred cChhcCCCcCCcchHHHHHhhhhhhhcCcCCCCccccccHHHHHHHHHHHHHHHHHHHhcCCCCCcchHHHHHHHHHHHh
Confidence 335799999999999999999999999999999999999999999999999998876431 12235788999999999
Q ss_pred HhhhhhhhcccchHHHHHHhcCCCCCCCCCCCcccchhHHHHHHHHHHccCCCcceEEEEEeecCCCCcceEEEEeCCcC
Q psy1977 173 MNVKNIEILGLEPMKRMLQDLGGWPVLEGQNWNEETFSWKDSVYKFHENGYSVDYFIDFSIVINVKNNSERIIDLDQATL 252 (764)
Q Consensus 173 m~~~~ie~~g~~pL~~~L~~lGgWP~~~~~~w~~~~~~~~~~l~~l~~~g~g~~~l~~~~V~~D~~nss~~ii~idqp~l 252 (764)
||+..+|+.|.+||+..|+.+.. +.+ . + ++...++.+.+. |.+.+|.++|++|.||++++++++.||+|
T Consensus 89 mD~~~~E~~g~~Pl~~~La~i~~---~~s--~--s--df~a~l~~~~~~--g~~~~f~~~vspD~kds~~~v~~~sq~Gl 157 (654)
T COG3590 89 MDEAKREKAGVDPLKPELAEIDS---LAS--F--S--DFAAALGQLERA--GQGNPFGFSVSPDFKDSTRYVLYFSQSGL 157 (654)
T ss_pred ccHHHHHhcCCCchhHHHHHHHh---hcc--H--H--HHHHHHHHHHhc--cCCCCceeeeccccccchhheeeeccCCC
Confidence 99999999999999999988753 111 1 2 567888888875 88999999999999999999999999999
Q ss_pred CCchh-hccccccHHHHHHHHHHHHHHHHHhCCChH--HHHHHHHHHHHHHHHHhcccCChhhhcccccccCccCHHHHh
Q psy1977 253 GLSRE-YLVKGINEKLVNAYYRYMIDIAVLLGADKD--VATEELKESLEFEIKLANISLPLEQRRNATKLYNPMSLDELQ 329 (764)
Q Consensus 253 ~lp~r-yl~~~~~~~~~~aY~~~~~~~~~llg~~~~--~~~~~~~~v~~fE~~La~i~~~~~~r~~~~~~y~~~tl~eLq 329 (764)
+||++ |..++..++++.+|++.+.+++.++|.+++ .+.+.+.+|+.||++||+.+....++|+....||++|+.+||
T Consensus 158 gLPD~~YY~de~~~e~l~ay~~hv~rml~~~~~~~~~~d~a~~A~~v~a~et~lA~~~~~~~~~rd~~~~Yn~~tf~~l~ 237 (654)
T COG3590 158 GLPDTTYYRDEQHAELLAAYKEHVARMLGLFGFSEEEEDAAKHALRVVALETKLANASWEVVKYRDLYHTYNPATFAELQ 237 (654)
T ss_pred CCCchhhhhhhhHHHHHHHHHHHHHHHHHHhCCchhhhhHHHHHHHHHHHHHHHhhhhhHHHHHHhhhcCcChhhHHHhc
Confidence 99999 766667889999999999999999998765 678889999999999999999888999999999999999999
Q ss_pred hhCCCCCHHHHHHHhcCCCCCCCCCcEEEEcChhHHHHHHHHHhcCChHHHHHHHHHHHHHHHhhhhcHHHHHHHHH-HH
Q psy1977 330 TKFPSIPWVEYLNTLLYPKDRLRETDIVVVDVPNYISKLEELLQTTPKKVQANYALWRATAASVSYLTEAIRNRQLQ-YS 408 (764)
Q Consensus 330 ~~~p~i~W~~~l~~i~~~~~~i~~~~~V~v~~~~Yl~~L~~ll~~t~~~~l~NYl~wr~v~~~~~~L~~~~~~~~~~-~~ 408 (764)
...|..+|..++..+ ...|+..|++.+|.|++.+..++.+++...+..|+.|++++..+|+|++++++..++ |.
T Consensus 238 ~~~p~~~w~~~~~~~-----G~~PD~~iv~~~p~y~~~~~~~~~e~~~~~wKawl~~~~~~~~ap~Lt~dl~~~~f~fy~ 312 (654)
T COG3590 238 PELPGDDWSLLFSAL-----GQLPDKVIVVENPFYLKEFASLLAEENWADWKAWLRWKLIRAAAPYLTEDLVDEHFDFYG 312 (654)
T ss_pred ccCCCccHHHHHHHh-----cCCCCeeEeecCchHHHHHHHHHhcCcHHHHHHHHHHHHHhhcchhhhhHHHHhhhHHHh
Confidence 999999999988775 223558899999999999999999999999999999999999999999999999995 57
Q ss_pred HhhcCCCCCCchHhhHHHHHHHHHhhHHHHhHHhhcccHHHHHHHHHHHHHHHHHHHHhhhcCCCCCHHHHHHHHHHHHh
Q psy1977 409 TALSGRTQMEPRWKECVDTVSSSISLAMGALYVRKFFKEDAKANAIEMVELIREEMNKILNTVEWMDNKTRRKALEKAKA 488 (764)
Q Consensus 409 ~~l~G~~~~~~rw~~Cv~~~~~~~~~a~~~lyv~~~f~~~~k~~~~~m~~~Ik~~f~~~l~~~~Wmd~~tk~~a~~Kl~~ 488 (764)
+.|+|.++..+||+.-+..+++.|+.|+|.+||++||++++|..+++|+..++++++.+|++++||.++||++|++||+.
T Consensus 313 r~LsG~~E~r~rwKr~~~l~~~~~geaiG~~Yv~~~Fpp~aKa~m~~lVa~l~~ayk~Ri~~ldWmtpeTrekAl~KL~~ 392 (654)
T COG3590 313 RTLSGQPEARDRWKRAVNLAERLFGEAIGLLYVKRYFPPEAKADMEELVANLIKAYKARISKLDWMTPETREKALEKLNK 392 (654)
T ss_pred hhccCChhhhHHHHHHHHHHHHHHHHHHHHHHHHHhCChHHHHHHHHHHHHHHHHHHHHhhhccccCHHHHHHHHHHHhc
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred cccccccCCccCChhhHHHHhcccccCCCChHHHHHHHHHHHHHHHHHhhCCCCCCCCcccCCCCceeeeeeecCCCeee
Q psy1977 489 MTAHIAYPDELLDDNKISEFYDKLEIDPNNYYKSILNLTKFGTGYSFSLLRKPVNKTEWISHGHPALVNAFYSSIENSIT 568 (764)
Q Consensus 489 m~~~igyP~~~~~~~~L~~~Y~~l~~~~~~yf~n~l~~~~~~~~~~~~~L~~p~~~~~w~~~~~p~~vna~Y~~~~N~Iv 568 (764)
++.+||||+... +|+.|.+..+++++|.++..++...+.++|+++||||++|.|. |++|||||+|.+|.||
T Consensus 393 ft~kIGYP~~wr-------~y~kL~~~~~sl~~N~~r~~~~~~~~~l~K~~kPVDr~eW~M~--pq~VNAYYnp~~N~IV 463 (654)
T COG3590 393 FTAKIGYPDPWR-------YYSKLEIKRDSLYGNVLRASAFNHAHELSKIGKPVDRDEWEMP--PQTVNAYYNPQKNEIV 463 (654)
T ss_pred cccccCCCchhh-------hhhhhccCchhHHHHHHHHHHHHHhhhHHHhCCCCchhhcCCC--HHHhhhhcCCCCceEe
Confidence 999999996543 4889999999999999999999999999999999999999999 9999999999999999
Q ss_pred eccccccCCCCCCCccccchhhhHHHHHHHHHhhccCccccccCCCCCCcCCCCHhhHHHHHHHHHHHHHHHcCCccCCc
Q psy1977 569 FPAGILQGSFFSNDRPKYMNFGAIGFVIGHEITHGFDDQGRQFDKNGNLVDWWQEDTKRSYIEKARCIIEQYGNYSVPEV 648 (764)
Q Consensus 569 ~PagiLq~Pff~~~~p~a~nyg~iG~vighEitHaFD~~G~~yd~~Gn~~~wW~~~s~~~f~~~~~C~~~qy~~~~~~~~ 648 (764)
|||+|||+|||+.+.+.+.||||||.||||||+|||||||++||.+||+++||+.++..+|+++++-++.||+.|...+
T Consensus 464 FPAAILQ~PFfd~ea~~a~NYGgIGaVIgHEI~HgFDdqGakfD~~GnL~dWWT~eD~~aF~~~T~~lvaqf~~~e~~~- 542 (654)
T COG3590 464 FPAAILQAPFFDPEADSAANYGGIGAVIGHEIGHGFDDQGAKFDGDGNLNDWWTDEDAAAFKERTKKLVAQFDGYEPEG- 542 (654)
T ss_pred eeHHhcCCCCCCCCcchhhcccCccceehhhhcccccCCccccCCCCcHHhhcCHHHHHHHHHHHHHHHHHhcCccccc-
Confidence 9999999999999999999999999999999999999999999999999999999999999999999999999998653
Q ss_pred ccccCCc-cchhhhhhhhccHHHHHHHHHHHHhcCCCCCCCCCCCCCChhhHHHHHHHHHhcccCChhHHHhhhhcCCCC
Q psy1977 649 NLTLNGV-NNQGENIADNGGMKEAYNAYVNWAATHPAEPRLPGLQNFTPQQMFWISAASTWCSKHRPETLKLRITTGFHS 727 (764)
Q Consensus 649 ~~~~ng~-~tl~EnIAD~gGl~~Ay~AY~~~~~~~~~e~~Lpgl~~~t~~QLFFl~~Aq~~C~~~~~e~~~~~~~~d~Hs 727 (764)
| ..+|+ .|+||||||.|||..||.||+.... .. |=++.+|..|-||++|||+|-.+.++|....++..|+||
T Consensus 543 ~-~~~g~~lTvgENIaDlGGl~~Al~Ayk~~~~---~~---pv~dg~tg~qrfF~~wAqiWR~K~r~e~~~~~l~~DpHs 615 (654)
T COG3590 543 G-KDNGNALTVGENIADLGGLAIALDAYKLSLD---PA---PVIDGFTGLQRFFLNWAQIWRMKARPEEQQMRLSVDPHS 615 (654)
T ss_pred c-ccccceeeeccccccchhHHHHHHHHHHhcC---Cc---ccccccchHHHHHHHHHHHHhhccCHHHHHHHHccCCCC
Confidence 3 33566 9999999999999999999998664 23 445558999999999999999999999999999999999
Q ss_pred CCceeEEEeCCCchhHHhhcCCCCCCCC--CCCCccccC
Q psy1977 728 PGEFRVLGPVSNLESFSRDFQCPSGSRM--NPVHKCKVW 764 (764)
Q Consensus 728 p~~~RVng~L~N~~eFa~aF~C~~Gs~M--nP~~kC~vW 764 (764)
|++|||||++.|+++|.+||.|++|++| -|.++-.+|
T Consensus 616 P~~fR~Ng~~~N~~~FyeaFdVke~D~myr~pe~Rv~IW 654 (654)
T COG3590 616 PAEFRVNGPVRNMDEFYEAFDVKEGDAMYRAPEERVVIW 654 (654)
T ss_pred CceeeccCCCCChHHHHHHcCCCCCcccccChhhccccC
Confidence 9999999999999999999999999999 588999998
No 3
>PF05649 Peptidase_M13_N: Peptidase family M13 This is family M13 in the peptidase classification. ; InterPro: IPR008753 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M13 (neprilysin family, clan MA(E)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. M13 peptidases are well-studied proteases found in a wide range of organisms including mammals and bacteria. In mammals they participate in processes such as cardiovascular development, blood-pressure regulation, nervous control of respiration, and regulation of the function of neuropeptides in the central nervous system. In bacteria they may be used for digestion of milk [, ]. The family includes eukaryotic and prokaryotic oligopeptidases, as well as some of the proteins responsible for the molecular basis of the blood group antigens e.g. Kell []. Neprilysin (3.4.24.11 from EC), is another member of this group, it is variously known as common acute lymphoblastic leukemia antigen (CALLA), enkephalinase (gp100) and neutral endopeptidase metalloendopeptidase (NEP). It is a plasma membrane-bound mammalian enzyme that is able to digest biologically-active peptides, including enkephalins []. The zinc ligands of neprilysin are known and are analogous to those in thermolysin, a related peptidase [, ]. Neprilysins, like thermolysin, are inhibited by phosphoramidon, which appears to selectively inhibit this family in mammals. The enzymes are all oligopeptidases, digesting oligo- and polypeptides, but not proteins []. Neprilysin consists of a short cytoplasmic domain, a membrane-spanning region and a large extracellular domain. The cytoplasmic domain contains a conformationally-restrained octapeptide, which is thought to act as a stop transfer sequence that prevents proteolysis and secretion [, ].; GO: 0008237 metallopeptidase activity, 0006508 proteolysis; PDB: 3DWB_A 3ZUK_A 2QPJ_A 1R1I_A 1R1J_A 1Y8J_A 1R1H_A 1DMT_A 2YB9_A.
Probab=100.00 E-value=1.7e-64 Score=562.31 Aligned_cols=383 Identities=36% Similarity=0.717 Sum_probs=329.2
Q ss_pred CCchhhhhcchhhhccCCCCCCcccchHHHHHHHHHHHHHHHhcCCCCC-----CCCcHHHHHHHHHHHHHhhhhhhhcc
Q psy1977 108 CDDFYRFSCGKFLKETQIPDDKTAQTAFSVISDALEEQLRTIVEEPIRE-----NEPKPFKLVKNLYKACMNVKNIEILG 182 (764)
Q Consensus 108 CdDFY~yaCG~W~~~~~ip~~~~~~~~f~~l~~~~~~~lk~lLe~~~~~-----~~~~a~~kak~~Y~sCm~~~~ie~~g 182 (764)
|||||+||||+|.++|++|.++..+++|+.+++.+.+.++++|+++... .++++++|++.||++||+.+.++..|
T Consensus 1 CdDFY~yvCg~w~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~~~~~~~~~~~~~~~~~k~~~~Y~sC~~~~~~~~~~ 80 (390)
T PF05649_consen 1 CDDFYQYVCGNWIKSHPIPSGQSSYSTFSDLQDNIDRQLKEILEKPSSSDTSPAEDSSAEQKAKALYDSCMDTDAREKDG 80 (390)
T ss_dssp TT-HHHHHHHHHHHH----TT-SEEEHHHHHHHHHHHHHHHHHSCCS--HCC-TTTTHHHHHHHHHHHHHH-HHHHHHHT
T ss_pred CccHHHHHhCcHHhhCCCCcCcccccHHHHHHHHhhhhhhhhhcccccccccccCCChHHHHHHHHHHHHHHhhhcchhh
Confidence 9999999999999999999999999999999999999999999997432 25789999999999999999999999
Q ss_pred cchHHHHHHhcCCCCCCCCCCCcccchhHHHHHHHHHHccCCCcceEEEEEeecCCCCcceEEEEeCCcCCCchh-hccc
Q psy1977 183 LEPMKRMLQDLGGWPVLEGQNWNEETFSWKDSVYKFHENGYSVDYFIDFSIVINVKNNSERIIDLDQATLGLSRE-YLVK 261 (764)
Q Consensus 183 ~~pL~~~L~~lGgWP~~~~~~w~~~~~~~~~~l~~l~~~g~g~~~l~~~~V~~D~~nss~~ii~idqp~l~lp~r-yl~~ 261 (764)
.++|+++|+++||||+... |+.+.++|.++++.+... +|.++||++.|.+|++|++.++|+|++|.+++|++ |...
T Consensus 81 ~~~l~~~l~~~~~~p~~~~--~~~~~~~~~~~l~~l~~~-~~~~~l~~~~v~~d~~~~~~~~l~i~~~~~~l~~~~~~~~ 157 (390)
T PF05649_consen 81 LEPLKEFLRSIGGWPFLSD--WNESKFDLLDTLARLSRR-YGIDPLFSLYVDPDPQNPSKYILYIDPPELGLPSKEYYRD 157 (390)
T ss_dssp THHHHHHHHHCTCBCCCSS--HHTTCCHHHHHHHHHHHT-C---SSSEEEEEEETTEEEEEEEEEEE---SSSSGGGGCT
T ss_pred hhhHHHHHHHhhhcccCCc--ccCCHhHHHHHHHHHHhh-ccccceeeeEeeccccchheeEeecccCCCCCcchHHhhc
Confidence 9999999999999999854 767788999999999999 99999999999999999999999999999999887 4443
Q ss_pred cc-cHHHHHHHHHHHHHHHHHhCCChHHHHHHHHHHHHHHHHHhcccCChhhhcccccccCccCHHHHhhhCCCCCHHHH
Q psy1977 262 GI-NEKLVNAYYRYMIDIAVLLGADKDVATEELKESLEFEIKLANISLPLEQRRNATKLYNPMSLDELQTKFPSIPWVEY 340 (764)
Q Consensus 262 ~~-~~~~~~aY~~~~~~~~~llg~~~~~~~~~~~~v~~fE~~La~i~~~~~~r~~~~~~y~~~tl~eLq~~~p~i~W~~~ 340 (764)
+. .....++|.++|++++..+|.+...+.+.+++|++||.+|+++....++++++...|+++|++||+..+|.++|..+
T Consensus 158 ~~~~~~~~~~y~~~v~~~l~~~g~~~~~~~~~~~~i~~~e~~L~~~~~~~~~~~~~~~~~~~~tl~eL~~~~p~i~w~~~ 237 (390)
T PF05649_consen 158 PHSSKEYLQAYREYVREVLKLLGVDDDEASKLADDIVKFEKQLAKLSSEDEERRDPQKLYNPMTLSELQSQYPQIDWDRY 237 (390)
T ss_dssp CGGCHHHHHHHHHHHHHHHHHCSS-HHHHHHHHHHHHHHHHHHHHHS--GGGCT-HHHH-EEEEHHHHHHHSTTS-HHHH
T ss_pred chhhHHHHHHHHHHHHHHHhhccCChHHHHHHHHHHHHHHHHHHHHhhhhHhhccHHHhcCcccHHHHHHHHhhhhHHHH
Confidence 32 37888999999999999999988899999999999999999999888888888888999999999999999999999
Q ss_pred HHHhcCCCCCCCCCcEEEEcChhHHHHHHHHHhcCChHHHHHHHHHHHHHHHhhhhcHHHHHHHHHHHHhhcCCCCCCch
Q psy1977 341 LNTLLYPKDRLRETDIVVVDVPNYISKLEELLQTTPKKVQANYALWRATAASVSYLTEAIRNRQLQYSTALSGRTQMEPR 420 (764)
Q Consensus 341 l~~i~~~~~~i~~~~~V~v~~~~Yl~~L~~ll~~t~~~~l~NYl~wr~v~~~~~~L~~~~~~~~~~~~~~l~G~~~~~~r 420 (764)
|+.++++ +++++.|+|.+|+||++|.++|.++++++|+|||+||++.+++++++.++++....|..++.|.....+|
T Consensus 238 l~~~~~~---~~~~~~V~v~~~~y~~~l~~ll~~~~~~~l~nYl~~~~i~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~ 314 (390)
T PF05649_consen 238 LKALFGE---VTESDEVIVRSPDYFKKLSELLKQTSKETLANYLGWRLIRELSPYLSSELRDLYFEFYRTLSGVEESRPR 314 (390)
T ss_dssp HHHHCCC---CCTTSEEEES-HHHHHHHHHHHHHS-HHHHHHHHHHHHHHHHGGGS-HHHHHHHHHHHTHHH--SS---H
T ss_pred HHHHccc---cCCCceEEEcCcHHHHHHHHHHHhCCcHHHHHHHHHHHHHHHHhhccHHHHHHHHHHHHHhcCCccchhh
Confidence 9999976 5677899999999999999999999999999999999999999999999999999999999999988999
Q ss_pred HhhHHHHHHHHHhhHHHHhHHhhcccHHHHHHHHHHHHHHHHHHHHhhhcCCCCCHHHHHHHHHHHHhcccccccC
Q psy1977 421 WKECVDTVSSSISLAMGALYVRKFFKEDAKANAIEMVELIREEMNKILNTVEWMDNKTRRKALEKAKAMTAHIAYP 496 (764)
Q Consensus 421 w~~Cv~~~~~~~~~a~~~lyv~~~f~~~~k~~~~~m~~~Ik~~f~~~l~~~~Wmd~~tk~~a~~Kl~~m~~~igyP 496 (764)
|+.|++.+...||++++++|++.+|+++.++.|.+|+++||++|.++|++++|||++||+.|++|+++|+.+||||
T Consensus 315 ~~~C~~~~~~~l~~~~~~~y~~~~~~~~~~~~v~~~~~~ik~~f~~~i~~~~Wld~~tk~~a~~Kl~~m~~~iGyP 390 (390)
T PF05649_consen 315 WQRCVRHVEKLLPFALGRLYVRRYFDKENKKEVEDMVENIKEAFRERIEESDWLDEETKKEAIEKLNNMKLNIGYP 390 (390)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHH--CCHHHHHHHHHHHHHHHHHHHHCT-TTS-HHHHHHHHHHHHH-EEEEES-
T ss_pred HHHHHHHHHHHHHHHHHhhHhhhcCchhHHHHHHHHHHHHHHHHHHHHhhCCCCCHHHHHHHHHHHHhhhhhccCC
Confidence 9999999999999999999999999999999999999999999999999999999999999999999999999999
No 4
>PF01431 Peptidase_M13: Peptidase family M13 This is family M13 in the peptidase classification. ; InterPro: IPR018497 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M13 (neprilysin family, clan MA(E)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. M13 peptidases are well-studied proteases found in a wide range of organisms including mammals and bacteria. In mammals they participate in processes such as cardiovascular development, blood-pressure regulation, nervous control of respiration, and regulation of the function of neuropeptides in the central nervous system. In bacteria they may be used for digestion of milk [, ]. The family includes eukaryotic and prokaryotic oligopeptidases, as well as some of the proteins responsible for the molecular basis of the blood group antigens e.g. Kell []. Neprilysin (3.4.24.11 from EC), is another member of this group, it is variously known as common acute lymphoblastic leukemia antigen (CALLA), enkephalinase (gp100) and neutral endopeptidase metalloendopeptidase (NEP). It is a plasma membrane-bound mammalian enzyme that is able to digest biologically-active peptides, including enkephalins []. The zinc ligands of neprilysin are known and are analogous to those in thermolysin, a related peptidase [, ]. Neprilysins, like thermolysin, are inhibited by phosphoramidon, which appears to selectively inhibit this family in mammals. The enzymes are all oligopeptidases, digesting oligo- and polypeptides, but not proteins []. Neprilysin consists of a short cytoplasmic domain, a membrane-spanning region and a large extracellular domain. The cytoplasmic domain contains a conformationally-restrained octapeptide, which is thought to act as a stop transfer sequence that prevents proteolysis and secretion [, ].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QPJ_A 1R1I_A 1R1J_A 1Y8J_A 1R1H_A 1DMT_A 2YB9_A 3DWB_A 3ZUK_A.
Probab=100.00 E-value=1.9e-59 Score=476.10 Aligned_cols=204 Identities=49% Similarity=0.947 Sum_probs=168.0
Q ss_pred eeeeecCCCeeeeccccccCCCCCCCccccchhhhHHHHHHHHHhhccCccccccCCCCCCcCCCCHhhHHHHHHHHHHH
Q psy1977 557 NAFYSSIENSITFPAGILQGSFFSNDRPKYMNFGAIGFVIGHEITHGFDDQGRQFDKNGNLVDWWQEDTKRSYIEKARCI 636 (764)
Q Consensus 557 na~Y~~~~N~Iv~PagiLq~Pff~~~~p~a~nyg~iG~vighEitHaFD~~G~~yd~~Gn~~~wW~~~s~~~f~~~~~C~ 636 (764)
||||.|..|.|+||+|+||+|||+.+.|.++|||+||++|||||+||||..|+.||++|++++||++.+.+.|.++.+|+
T Consensus 1 na~Y~~~~N~i~ip~~~l~~P~f~~~~p~~~~yg~lG~ilahel~hafd~~g~~~D~~g~~~~wWs~~~~~~~~~~~~Cl 80 (206)
T PF01431_consen 1 NAYYSPRFNSIVIPAGILQPPFFDPNYPPALNYGGLGFILAHELMHAFDPEGINYDEDGNLRNWWSEESRSNFKERAQCL 80 (206)
T ss_dssp --EEETTTTEEEEEGGGSSTTT--TTS-HHHHHHTHHHHHHHHHHHCTSTTGGGB-TTS-BS--S-HHHHHHHHHHHHHH
T ss_pred CCCCCcccCEEEecHHHhCCccCCCCCCHHHHHHHHHHHHHHHHHHHHHHhHhhcCcccccccccchhhHHHHHHHHHHH
Confidence 89999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHcCCccCCccc--ccCCccchhhhhhhhccHHHHHHHHHHHHhcCCCCCCCCCCCCCChhhHHHHHHHHHhcccCCh
Q psy1977 637 IEQYGNYSVPEVNL--TLNGVNNQGENIADNGGMKEAYNAYVNWAATHPAEPRLPGLQNFTPQQMFWISAASTWCSKHRP 714 (764)
Q Consensus 637 ~~qy~~~~~~~~~~--~~ng~~tl~EnIAD~gGl~~Ay~AY~~~~~~~~~e~~Lpgl~~~t~~QLFFl~~Aq~~C~~~~~ 714 (764)
++||+++..+..+. +++|..|++|||||++||++||+||++ ....+.+|||++.+|++||||++|||.||+..++
T Consensus 81 ~~qy~~~~~~~~~~~~~~~~~~~~~E~iaD~~gl~~a~~ay~~---~~~~~~~l~~~~~~t~~QlFF~~~a~~~C~~~~~ 157 (206)
T PF01431_consen 81 RDQYSNYSVPEGNKSVSLNGSDTLNENIADNAGLRLAYRAYRK---KSSNEERLPGLEGLTPDQLFFISFAQSFCEKMSP 157 (206)
T ss_dssp HHHHHT-BCGGGTTCSB--TTTTHHHHHHHHHHHHHHHHHHHH---HHTTT-B-TTCTTB-HHHHHHHHHHHHT-EEE-H
T ss_pred HHHHhcccccccccccccchHHHHHHHHHHHHhHHHHHHHHhh---cccccccccccccchhHHHHHHHHHHHHhcCcch
Confidence 99999998775432 689999999999999999999999998 2235678999999999999999999999999998
Q ss_pred hHHHhhhhcCCCCCCceeEEEeCCCchhHHhhcCCCCCCCCCCCCcccc
Q psy1977 715 ETLKLRITTGFHSPGEFRVLGPVSNLESFSRDFQCPSGSRMNPVHKCKV 763 (764)
Q Consensus 715 e~~~~~~~~d~Hsp~~~RVng~L~N~~eFa~aF~C~~Gs~MnP~~kC~v 763 (764)
+....++..++|+|+++|||++|+|+|+|++||+||.||+|||.+||++
T Consensus 158 ~~~~~~~~~~~h~p~~~RvN~~L~n~~~Fa~aF~C~~gs~Mnp~~kC~~ 206 (206)
T PF01431_consen 158 ESVYRQLQTDPHSPARFRVNGPLSNMPEFAEAFNCPSGSPMNPKKKCRC 206 (206)
T ss_dssp HHHHHHHHH-SS--HHHHHHHHHHT-HHHHHHHT--TTSTTS-SSGG--
T ss_pred hhhhhhhhcCCCCchHHHHHHHHhcCHHHHHHcCCCCCCCCCCCCCCcC
Confidence 8888888899999999999999999999999999999999999999986
No 5
>PF09768 Peptidase_M76: Peptidase M76 family; InterPro: IPR019165 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. Mitochondrial inner membrane protease ATP23 has two roles in the assembly of mitochondrial ATPase. Firstly, it acts as a protease that removes the N-terminal 10 residues of mitochondrial ATPase CF(0) subunit 6 (ATP6) at the intermembrane space side. Secondly, it is involved in the correct assembly of the membrane-embedded ATPase CF(0) particle, probably mediating association of ATP6 with the subunit 9 ring [, ].; GO: 0004222 metalloendopeptidase activity
Probab=84.08 E-value=1.1 Score=43.74 Aligned_cols=63 Identities=16% Similarity=0.210 Sum_probs=42.3
Q ss_pred HHHhhCCC-CCCCCcccCCCCceeeeeeecCCCeeeeccccccCCCCCCCccccchhhhHHHHHHHHHhhccCccc
Q psy1977 534 SFSLLRKP-VNKTEWISHGHPALVNAFYSSIENSITFPAGILQGSFFSNDRPKYMNFGAIGFVIGHEITHGFDDQG 608 (764)
Q Consensus 534 ~~~~L~~p-~~~~~w~~~~~p~~vna~Y~~~~N~Iv~PagiLq~Pff~~~~p~a~nyg~iG~vighEitHaFD~~G 608 (764)
.+++++-+ +.+....-..=+..+.+.|+|..+.|++=.--+ .+.+-+--+|.|||+|+||+--
T Consensus 24 ~~~~~gc~~~~~~~i~c~~C~~~~~Ggf~p~~~~I~lC~N~~------------~~~~~l~~~l~HELIHayD~cr 87 (173)
T PF09768_consen 24 ALKKLGCPPVPPRHIKCEPCDSSVSGGFDPSKKGIVLCQNRI------------RSQGHLEDTLTHELIHAYDHCR 87 (173)
T ss_pred HHHHcCCCCCCCCCeEEEECcCCCcCCccCCCCCEEEeeCCC------------CCHHHHHHHHHHHHHHHHHHHh
Confidence 35667755 433322211003346678999899998866544 4678889999999999999754
No 6
>PF14247 DUF4344: Domain of unknown function (DUF4344)
Probab=83.07 E-value=0.64 Score=47.34 Aligned_cols=52 Identities=27% Similarity=0.384 Sum_probs=35.0
Q ss_pred ceeeeeeecCCCeeeeccccccCC--CCCC-Ccccc----chhhhHHHHHHHHHhhccC
Q psy1977 554 ALVNAFYSSIENSITFPAGILQGS--FFSN-DRPKY----MNFGAIGFVIGHEITHGFD 605 (764)
Q Consensus 554 ~~vna~Y~~~~N~Iv~PagiLq~P--ff~~-~~p~a----~nyg~iG~vighEitHaFD 605 (764)
-..||||+|..++|+||..++..- .|.. +.|.- .-.|.+=.++.||+.|++=
T Consensus 47 Ge~nA~ydPe~~~I~iCYEf~~~~~~~f~~~~~~~~~~~~~~~~~~~~~l~HE~GHAlI 105 (220)
T PF14247_consen 47 GEDNAFYDPENRSITICYEFVDEILDRFAKANDPDEEYGQAAIGNVLFTLYHELGHALI 105 (220)
T ss_pred CCCCCccCCCCCEEEECHHHHHHHHHHHHhCCcCcchHHHHHHHHHHHHHHHHHHHHHH
Confidence 456999999999999998776431 1221 22211 1355666778899999984
No 7
>PF01447 Peptidase_M4: Thermolysin metallopeptidase, catalytic domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification. ; InterPro: IPR013856 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases that belong to the MEROPS peptidase family M4 (thermolysin family, clan MA(E)). The protein fold of the peptidase domain of thermolysin, is the type example for members of the clan MA. The thermolysin family is composed only of secreted eubacterial endopeptidases. The zinc-binding residues are H-142, H-146 and E-166, with E-143 acting as the catalytic residue. Thermolysin also contains 4 calcium-binding sites, which contribute to its unusual thermostability. The family also includes enzymes from a number of pathogens, including Legionella and Listeria, and the protein pseudolysin, all with a substrate specificity for an aromatic residue in the P1' position. Three-dimensional structure analysis has shown that the enzymes undergo a hinge-bend motion during catalysis. Pseudolysin has a broader specificity, acting on large molecules such as elastin and collagen, possibly due to its wider active site cleft []. This entry represents a domain found in peptidase M4 family members.; GO: 0004222 metalloendopeptidase activity; PDB: 3NQX_A 3NQZ_B 3NQY_B 1BQB_A 1U4G_A 1EZM_A 3DBK_A 1ESP_A 1NPC_A 1LND_E ....
Probab=66.84 E-value=2.4 Score=40.60 Aligned_cols=41 Identities=29% Similarity=0.486 Sum_probs=20.7
Q ss_pred ceeeeeeecCCCeeeeccccccCCCCCCCccccchhhhHHHHHHHHHhhccC
Q psy1977 554 ALVNAFYSSIENSITFPAGILQGSFFSNDRPKYMNFGAIGFVIGHEITHGFD 605 (764)
Q Consensus 554 ~~vna~Y~~~~N~Iv~PagiLq~Pff~~~~p~a~nyg~iG~vighEitHaFD 605 (764)
.-.|||++- .+|+|--|-- ..| -.+.+===|+||||+||+-
T Consensus 108 ~~~NAfW~g--~~m~yGdG~~--~~f-------~~~~~~lDVvaHEltHGVt 148 (150)
T PF01447_consen 108 NYNNAFWNG--SQMVYGDGDG--QIF-------KPFASSLDVVAHELTHGVT 148 (150)
T ss_dssp STT-EEE-S--SSEEEE---S--SSB-------S-GGG-HHHHHHHHHHHHH
T ss_pred CccCccccC--CEEEEECCCC--ccc-------ccCccccceeeeccccccc
Confidence 346899864 5787765532 011 1112212389999999973
No 8
>PF04228 Zn_peptidase: Putative neutral zinc metallopeptidase; InterPro: IPR007343 Members of this family of bacterial proteins are described as hypothetical proteins or zinc metallopeptidases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=61.65 E-value=3.8 Score=43.72 Aligned_cols=86 Identities=14% Similarity=0.212 Sum_probs=48.6
Q ss_pred CceeeeeeecCCCeeeecccccc---CCCCCCCccccchhhhHHHHHHHHHhhccCccccccCCCCCCcCCCC----Hhh
Q psy1977 553 PALVNAFYSSIENSITFPAGILQ---GSFFSNDRPKYMNFGAIGFVIGHEITHGFDDQGRQFDKNGNLVDWWQ----EDT 625 (764)
Q Consensus 553 p~~vna~Y~~~~N~Iv~PagiLq---~Pff~~~~p~a~nyg~iG~vighEitHaFD~~G~~yd~~Gn~~~wW~----~~s 625 (764)
+..++|||-|..+.|+++..++. ..| ...-| -+.-.|||||..|..-..---.++......=.+ ++-
T Consensus 134 ~s~~gpFYCp~D~tIYlD~~f~~~L~~~~-ga~G~-----~a~ayVlAHEyGHHVQ~l~Gil~~~~~~~~~~~~~~~~~~ 207 (292)
T PF04228_consen 134 SSATGPFYCPADQTIYLDLSFFDELQQRF-GASGD-----FAQAYVLAHEYGHHVQNLLGILDAVRQAQQGRSPAEANEL 207 (292)
T ss_pred cCCCCCEeCCCCCEEEechHHHHHHHHHh-CCccH-----HHHHHHHHHHHHHHHHHHhhhHHHHHHHhhccCccccchH
Confidence 34568999999999999987542 222 11112 234568999999987443111111000000001 122
Q ss_pred HHHHHHHHHHHHHHHcCCc
Q psy1977 626 KRSYIEKARCIIEQYGNYS 644 (764)
Q Consensus 626 ~~~f~~~~~C~~~qy~~~~ 644 (764)
..+..-++.|+.--|..+.
T Consensus 208 svr~ELQADC~AGvw~~~~ 226 (292)
T PF04228_consen 208 SVRLELQADCFAGVWAGHA 226 (292)
T ss_pred HHHHHHHHHHHHHHHhhhc
Confidence 3466778999987765543
No 9
>KOG2661|consensus
Probab=61.19 E-value=5.6 Score=42.28 Aligned_cols=52 Identities=29% Similarity=0.483 Sum_probs=30.9
Q ss_pred CCCCcccC--CCCceeeeeeecCCCeeeeccccccCCCCCCCccccchhhhHHHHHHHHHhhccC
Q psy1977 543 NKTEWISH--GHPALVNAFYSSIENSITFPAGILQGSFFSNDRPKYMNFGAIGFVIGHEITHGFD 605 (764)
Q Consensus 543 ~~~~w~~~--~~p~~vna~Y~~~~N~Iv~PagiLq~Pff~~~~p~a~nyg~iG~vighEitHaFD 605 (764)
+.-.|..+ . -.++|||--|..-.+|| .||| |.--+-.++.+|+|||+.|+.-
T Consensus 235 sgIkWeihVVn-dPipNAFvLPgGKvfVF-tgiL---------n~ck~ddglAtvLgHE~aHaVa 288 (424)
T KOG2661|consen 235 SGIKWEIHVVN-DPIPNAFVLPGGKVFVF-TGIL---------NSCKDDDGLATVLGHEIAHAVA 288 (424)
T ss_pred cCceeEEEEec-CCCCceeeccCCeEEEE-echh---------hcccChHHHHHHHHHHHHHHHH
Confidence 34557654 1 12356665555433333 4555 3334456899999999999874
No 10
>KOG0506|consensus
Probab=60.70 E-value=7.3 Score=43.34 Aligned_cols=102 Identities=17% Similarity=0.271 Sum_probs=65.9
Q ss_pred cccccccCCccCChhhHHHHhcccc-cCCCChHHHHHHHHHHHHHHHHHhhCCCCCCCCcccCCCCceeeeeeecCCCee
Q psy1977 489 MTAHIAYPDELLDDNKISEFYDKLE-IDPNNYYKSILNLTKFGTGYSFSLLRKPVNKTEWISHGHPALVNAFYSSIENSI 567 (764)
Q Consensus 489 m~~~igyP~~~~~~~~L~~~Y~~l~-~~~~~yf~n~l~~~~~~~~~~~~~L~~p~~~~~w~~~~~p~~vna~Y~~~~N~I 567 (764)
++.++--|+|..=...++..|+... .+++..-.-+ -+|. ...++.|.++ .-+|...-
T Consensus 167 LrkqmVIPdw~~Fts~I~tIFEscke~seG~vA~YI------------PQLa-r~sPdlW~vS--vCTvDGQR------- 224 (622)
T KOG0506|consen 167 LRKQMVIPDWEEFTSHIDTIFESCKESSEGKVATYI------------PQLA-RQSPDLWGVS--VCTVDGQR------- 224 (622)
T ss_pred HhcCccCCcHHHHHHHHHHHHHHHHhcCCccHHHhh------------HHHh-ccCCccceeE--EeeecCcc-------
Confidence 3455566888655555666666653 3444321111 1222 2346789988 66665432
Q ss_pred eeccccccCCCCCCCccccchhhhHHHHHHHHHhhcc---CccccccCC
Q psy1977 568 TFPAGILQGSFFSNDRPKYMNFGAIGFVIGHEITHGF---DDQGRQFDK 613 (764)
Q Consensus 568 v~PagiLq~Pff~~~~p~a~nyg~iG~vighEitHaF---D~~G~~yd~ 613 (764)
...|----||--....+++||+..-.-||.|..|.+ ...||.|++
T Consensus 225 -~s~Gd~k~pFClQSvsK~f~YAiv~sdlG~~~vH~yVG~EPSGrlfN~ 272 (622)
T KOG0506|consen 225 -HSLGDTKKPFCLQSVSKPFNYAIVASDLGTEVVHQYVGQEPSGRLFNE 272 (622)
T ss_pred -ccccccCCchhHhhhcccceeeeeeccccHHHHHHHhcCCCccchhhh
Confidence 234555568877778899999999999999999987 456888753
No 11
>KOG3314|consensus
Probab=56.40 E-value=4.8 Score=38.38 Aligned_cols=21 Identities=33% Similarity=0.547 Sum_probs=16.9
Q ss_pred hhHHHHHHHHHhhccCccccc
Q psy1977 590 GAIGFVIGHEITHGFDDQGRQ 610 (764)
Q Consensus 590 g~iG~vighEitHaFD~~G~~ 610 (764)
.-+--++.|||.|+|||--.+
T Consensus 89 ~h~n~vv~HElIH~fDd~r~~ 109 (194)
T KOG3314|consen 89 DHVNQVVIHELIHAFDDCRAK 109 (194)
T ss_pred HHHHHHHHHHHHHHHHhhhhh
Confidence 346789999999999986544
No 12
>PRK02870 heat shock protein HtpX; Provisional
Probab=51.14 E-value=29 Score=37.92 Aligned_cols=40 Identities=25% Similarity=0.453 Sum_probs=30.0
Q ss_pred ceeeeeee---cCCCeeeeccccccCCCCCCCccccchhhhHHHHHHHHHhhc
Q psy1977 554 ALVNAFYS---SIENSITFPAGILQGSFFSNDRPKYMNFGAIGFVIGHEITHG 603 (764)
Q Consensus 554 ~~vna~Y~---~~~N~Iv~PagiLq~Pff~~~~p~a~nyg~iG~vighEitHa 603 (764)
..+|||-. +....|++.-|+|+. ++-.-+-.||||||.|-
T Consensus 142 ~~~NAFA~G~~~~~~~Ivvt~GLL~~----------L~~dEL~aVlAHELgHi 184 (336)
T PRK02870 142 PYMNAFASGYSEKSAMVAITTGLLEK----------LDRDELQAVMAHELSHI 184 (336)
T ss_pred CCCceEEecCCCCCcEEEEehHHhhh----------CCHHHHHHHHHHHHHHH
Confidence 35899865 244579999998852 34456899999999996
No 13
>PF01435 Peptidase_M48: Peptidase family M48 This is family M48 in the peptidase classification. ; InterPro: IPR001915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M48 (Ste24 endopeptidase family, clan M-); members of both subfamily are represented. The members of this set of proteins are mostly described as probable protease htpX homologue (3.4.24 from EC) or CAAX prenyl protease 1, which proteolytically removes the C-terminal three residues of farnesylated proteins. They are integral membrane proteins associated with the endoplasmic reticulum and Golgi, binding one zinc ion per subunit. In Saccharomyces cerevisiae (Baker's yeast) Ste24p is required for the first NH2-terminal proteolytic processing event within the a-factor precursor, which takes place after COOH-terminal CAAX modification is complete. The Ste24p contains multiple predicted membrane spans, a zinc metalloprotease motif (HEXXH), and a COOH-terminal ER retrieval signal (KKXX). The HEXXH protease motif is critical for Ste24p activity, since Ste24p fails to function when conserved residues within this motif are mutated. The Ste24p homologues occur in a diverse group of organisms, including Escherichia coli, Schizosaccharomyces pombe (Fission yeast), Haemophilus influenzae, and Homo sapiens (Human), which indicates that the gene is highly conserved throughout evolution. Ste24p and the proteins related to it define a subfamily of proteins that are likely to function as intracellular, membrane-associated zinc metalloproteases []. HtpX is a zinc-dependent endoprotease member of the membrane-localized proteolytic system in E. coli, which participates in the proteolytic quality control of membrane proteins in conjunction with FtsH, a membrane-bound and ATP-dependent protease. Biochemical characterisation revealed that HtpX undergoes self-degradation upon cell disruption or membrane solubilization. It can also degraded casein and cleaves solubilized membrane proteins, for example, SecY []. Expression of HtpX in the plasma membrane is under the control of CpxR, with the metalloproteinase active site of HtpX located on the cytosolic side of the membrane. This suggests a potential role for HtpX in the response to mis-folded proteins [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0016020 membrane; PDB: 3CQB_A 3C37_B.
Probab=45.09 E-value=26 Score=35.43 Aligned_cols=42 Identities=24% Similarity=0.324 Sum_probs=30.9
Q ss_pred eeeeeeecCCC--eeeeccccccCCCCCCCccccchhhhHHHHHHHHHhhccCc
Q psy1977 555 LVNAFYSSIEN--SITFPAGILQGSFFSNDRPKYMNFGAIGFVIGHEITHGFDD 606 (764)
Q Consensus 555 ~vna~Y~~~~N--~Iv~PagiLq~Pff~~~~p~a~nyg~iG~vighEitHaFD~ 606 (764)
.+|||-.+..+ .|++..|+|.. ++=.-+-.||||||.|.-..
T Consensus 60 ~~NA~~~g~~~~~~I~v~~~ll~~----------~~~~el~aVlaHElgH~~~~ 103 (226)
T PF01435_consen 60 SPNAFATGGGPRKRIVVTSGLLES----------LSEDELAAVLAHELGHIKHR 103 (226)
T ss_dssp SEEEEEETTTC--EEEEEHHHHHH----------SSHHHHHHHHHHHHHHHHTT
T ss_pred CCcEEEEccCCCcEEEEeChhhhc----------ccHHHHHHHHHHHHHHHHcC
Confidence 38998776643 58888887742 34456889999999997643
No 14
>cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha.
Probab=44.15 E-value=14 Score=38.40 Aligned_cols=21 Identities=33% Similarity=0.646 Sum_probs=16.3
Q ss_pred hHHHHHHHHHhhccCccccccCCC
Q psy1977 591 AIGFVIGHEITHGFDDQGRQFDKN 614 (764)
Q Consensus 591 ~iG~vighEitHaFD~~G~~yd~~ 614 (764)
.+..+|||||.|.| |-..|..
T Consensus 166 ~~a~t~AHElGHnl---Gm~HD~~ 186 (244)
T cd04270 166 ESDLVTAHELGHNF---GSPHDPD 186 (244)
T ss_pred HHHHHHHHHHHHhc---CCCCCCC
Confidence 37789999999999 5555554
No 15
>cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=43.80 E-value=21 Score=35.48 Aligned_cols=25 Identities=28% Similarity=0.577 Sum_probs=18.6
Q ss_pred hhhhHHHHHHHHHhhccCccccccCCCC
Q psy1977 588 NFGAIGFVIGHEITHGFDDQGRQFDKNG 615 (764)
Q Consensus 588 nyg~iG~vighEitHaFD~~G~~yd~~G 615 (764)
++...+.+++|||.|.| |-.+|..+
T Consensus 127 ~~~~~a~~~AHElGH~l---G~~HD~~~ 151 (194)
T cd04269 127 NLLLFAVTMAHELGHNL---GMEHDDGG 151 (194)
T ss_pred chHHHHHHHHHHHHhhc---CCCcCCCC
Confidence 35667899999999998 55555543
No 16
>PF13485 Peptidase_MA_2: Peptidase MA superfamily
Probab=40.64 E-value=30 Score=31.01 Aligned_cols=55 Identities=20% Similarity=0.277 Sum_probs=31.7
Q ss_pred eeeecCCCeeeeccccccCCCCCCCccccchhhhHHHHHHHHHhhccCccccccCCCCCCcCCCCHhhHH
Q psy1977 558 AFYSSIENSITFPAGILQGSFFSNDRPKYMNFGAIGFVIGHEITHGFDDQGRQFDKNGNLVDWWQEDTKR 627 (764)
Q Consensus 558 a~Y~~~~N~Iv~PagiLq~Pff~~~~p~a~nyg~iG~vighEitHaFD~~G~~yd~~Gn~~~wW~~~s~~ 627 (764)
.+|.|..|.|+++.. . ...-.+-.+++||++|.+-. ...+.......||++-..+
T Consensus 4 G~~~~~~~~i~~~~~-------~------~~~~~~~~~l~HE~~H~~~~--~~~~~~~~~~~W~~EG~A~ 58 (128)
T PF13485_consen 4 GVYYPGFNRIVVYFQ-------G------SDEDWLDRVLAHELAHQWFG--NYFGGDDNAPRWFNEGLAE 58 (128)
T ss_pred EEEecCCCEEEEecC-------C------CCHHHHHHHHHHHHHHHHHH--HHcCCCccCchHHHHHHHH
Confidence 467788888875331 1 11223448999999999843 2333333455566655433
No 17
>PRK01265 heat shock protein HtpX; Provisional
Probab=38.38 E-value=60 Score=35.29 Aligned_cols=40 Identities=30% Similarity=0.332 Sum_probs=30.1
Q ss_pred ceeeeeee---cCCCeeeeccccccCCCCCCCccccchhhhHHHHHHHHHhhc
Q psy1977 554 ALVNAFYS---SIENSITFPAGILQGSFFSNDRPKYMNFGAIGFVIGHEITHG 603 (764)
Q Consensus 554 ~~vna~Y~---~~~N~Iv~PagiLq~Pff~~~~p~a~nyg~iG~vighEitHa 603 (764)
...|||-. +....|++.-|+|+. ++=.-+-.|+||||.|-
T Consensus 109 ~~~NAfa~G~~~~~~~Ivvt~gLl~~----------l~~~El~aVlAHElgHi 151 (324)
T PRK01265 109 PFPNAFAYGSPIAGKRIAITLPLLKI----------LNRDEIKAVAGHELGHL 151 (324)
T ss_pred CCCCeEEeccCCCCCEEEEehHHHhh----------CCHHHHHHHHHHHHHHH
Confidence 34788753 445789999999874 34456899999999994
No 18
>cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods.
Probab=38.06 E-value=38 Score=34.49 Aligned_cols=19 Identities=26% Similarity=0.527 Sum_probs=14.2
Q ss_pred HHHHHHHHhhccCccccccCCC
Q psy1977 593 GFVIGHEITHGFDDQGRQFDKN 614 (764)
Q Consensus 593 G~vighEitHaFD~~G~~yd~~ 614 (764)
..+|||||.|.| |...|.+
T Consensus 146 ~~~~AHElGH~l---G~~HD~~ 164 (220)
T cd04272 146 VYTMTHELAHLL---GAPHDGS 164 (220)
T ss_pred HHHHHHHHHHHh---CCCCCCC
Confidence 699999999988 4444433
No 19
>PRK01345 heat shock protein HtpX; Provisional
Probab=37.64 E-value=52 Score=35.71 Aligned_cols=41 Identities=24% Similarity=0.345 Sum_probs=29.9
Q ss_pred ceeeeeee---cCCCeeeeccccccCCCCCCCccccchhhhHHHHHHHHHhhcc
Q psy1977 554 ALVNAFYS---SIENSITFPAGILQGSFFSNDRPKYMNFGAIGFVIGHEITHGF 604 (764)
Q Consensus 554 ~~vna~Y~---~~~N~Iv~PagiLq~Pff~~~~p~a~nyg~iG~vighEitHaF 604 (764)
...|||-. +.+..|++--|+|+. ++=--+-.||||||.|--
T Consensus 93 ~~~NAFa~G~~~~~~~V~vt~gLL~~----------L~~dEL~aVlAHElgHi~ 136 (317)
T PRK01345 93 PQPNAFATGRNPENAAVAATTGLLQR----------LSPEEVAGVMAHELAHVK 136 (317)
T ss_pred CCcceEEecCCCCCeEEEechHHHhh----------CCHHHHHHHHHHHHHHHH
Confidence 45799876 334478888888863 233468999999999964
No 20
>cd04273 ZnMc_ADAMTS_like Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
Probab=37.46 E-value=15 Score=37.05 Aligned_cols=20 Identities=35% Similarity=0.611 Sum_probs=15.2
Q ss_pred HHHHHHHHHhhccCccccccCCC
Q psy1977 592 IGFVIGHEITHGFDDQGRQFDKN 614 (764)
Q Consensus 592 iG~vighEitHaFD~~G~~yd~~ 614 (764)
.+.+|+|||.|.| |-..|.+
T Consensus 140 ~a~~~aHElGH~L---G~~HD~~ 159 (207)
T cd04273 140 SAFTIAHELGHVL---GMPHDGD 159 (207)
T ss_pred eEEeeeeechhhc---CCCCCCC
Confidence 4688999999988 5555554
No 21
>KOG1047|consensus
Probab=35.55 E-value=21 Score=40.79 Aligned_cols=36 Identities=39% Similarity=0.742 Sum_probs=26.4
Q ss_pred HHHHHHHHHhhccCccccccCCCCCCc---CC---CCHhhHHHHHHHHHHH
Q psy1977 592 IGFVIGHEITHGFDDQGRQFDKNGNLV---DW---WQEDTKRSYIEKARCI 636 (764)
Q Consensus 592 iG~vighEitHaFD~~G~~yd~~Gn~~---~w---W~~~s~~~f~~~~~C~ 636 (764)
++.||+|||.|+- .||+. +| |=|+=..-|.+|..|=
T Consensus 288 l~~vIaHEIAHSW---------tGNlVTN~sWehfWLNEGfTvylErrI~g 329 (613)
T KOG1047|consen 288 LVDVIAHEIAHSW---------TGNLVTNASWEHFWLNEGFTVYLERRIVG 329 (613)
T ss_pred hhhHHHHHhhhhh---------cccccccCccchhhhcccchhhhhhhhhh
Confidence 5899999999986 46663 34 7777777777776553
No 22
>COG4227 Antirestriction protein [DNA replication, recombination, and repair]
Probab=33.47 E-value=25 Score=36.34 Aligned_cols=42 Identities=29% Similarity=0.393 Sum_probs=27.9
Q ss_pred eeeeecCCCeeeeccccccCCCCCCCccccchhhhHHHHHHHHHhhccCcccc
Q psy1977 557 NAFYSSIENSITFPAGILQGSFFSNDRPKYMNFGAIGFVIGHEITHGFDDQGR 609 (764)
Q Consensus 557 na~Y~~~~N~Iv~PagiLq~Pff~~~~p~a~nyg~iG~vighEitHaFD~~G~ 609 (764)
.|||+|....|.+|.- ..++.+.||- ..++||+.|---..+|
T Consensus 179 rAfy~Pa~D~Iq~p~~--------~aF~da~~yy---aTl~HElghwtgh~~r 220 (316)
T COG4227 179 RAFYSPASDSIQLPPF--------EAFRDAINYY---ATLLHELGHWTGHEAR 220 (316)
T ss_pred cccccCccCeeccCCh--------hhccchHhHH---HHHHHHhccccCchhh
Confidence 4899999999998853 1234455553 4678888886544333
No 23
>PF03002 Somatostatin: Somatostatin/Cortistatin family; InterPro: IPR018142 Somatostatin inhibits the release of the pituitary growth hormone, somatotropin and inhibits the release of glucagon and insulin from the pancreas of fasted animals. Cortistatin is a cortical neuropeptide with neuronal depressant and sleep-modulating properties [].; GO: 0005179 hormone activity, 0005576 extracellular region
Probab=33.25 E-value=15 Score=21.59 Aligned_cols=9 Identities=22% Similarity=0.851 Sum_probs=7.0
Q ss_pred CCCCCCchh
Q psy1977 104 NVDPCDDFY 112 (764)
Q Consensus 104 svdPCdDFY 112 (764)
...||.|||
T Consensus 3 ~k~~Cknff 11 (18)
T PF03002_consen 3 RKAGCKNFF 11 (18)
T ss_pred cccccccee
Confidence 357999998
No 24
>PF12315 DUF3633: Protein of unknown function (DUF3633); InterPro: IPR022087 This domain family is found in bacteria and eukaryotes, and is approximately 210 amino acids in length. The family is found in association with PF00412 from PFAM.
Probab=32.95 E-value=22 Score=35.71 Aligned_cols=15 Identities=27% Similarity=0.585 Sum_probs=13.3
Q ss_pred hHHHHHHHHHhhccC
Q psy1977 591 AIGFVIGHEITHGFD 605 (764)
Q Consensus 591 ~iG~vighEitHaFD 605 (764)
..|.++|||++||+-
T Consensus 92 l~gsiLAHE~mHa~L 106 (212)
T PF12315_consen 92 LTGSILAHELMHAWL 106 (212)
T ss_pred HHhhHHHHHHHHHHh
Confidence 359999999999995
No 25
>COG0501 HtpX Zn-dependent protease with chaperone function [Posttranslational modification, protein turnover, chaperones]
Probab=31.19 E-value=64 Score=34.27 Aligned_cols=41 Identities=24% Similarity=0.322 Sum_probs=31.1
Q ss_pred ceeeeeeecCC---CeeeeccccccCCCCCCCccccchhhhHHHHHHHHHhhcc
Q psy1977 554 ALVNAFYSSIE---NSITFPAGILQGSFFSNDRPKYMNFGAIGFVIGHEITHGF 604 (764)
Q Consensus 554 ~~vna~Y~~~~---N~Iv~PagiLq~Pff~~~~p~a~nyg~iG~vighEitHaF 604 (764)
...|||..+.. -.|++--|+|. - ++=--|=.|||||+.|--
T Consensus 126 ~~~NAFa~g~~~~~~~V~vt~gLl~--~--------l~~dEl~aVlaHElgHi~ 169 (302)
T COG0501 126 PQPNAFALGGGPKNGRVVVTTGLLD--L--------LNDDELEAVLAHELGHIK 169 (302)
T ss_pred CCccceecCCCCCCeeEEecHHHHh--h--------CCHHHHHHHHHHHHHHHh
Confidence 45789987774 58888888886 2 333468899999999975
No 26
>COG3227 LasB Zinc metalloprotease (elastase) [Amino acid transport and metabolism]
Probab=30.91 E-value=31 Score=38.86 Aligned_cols=42 Identities=36% Similarity=0.606 Sum_probs=27.6
Q ss_pred HHHHHHhhccCcc--ccccC-CCCCCcCCCCHhhHHHHHHHHHHHHHHHcCCc
Q psy1977 595 VIGHEITHGFDDQ--GRQFD-KNGNLVDWWQEDTKRSYIEKARCIIEQYGNYS 644 (764)
Q Consensus 595 vighEitHaFD~~--G~~yd-~~Gn~~~wW~~~s~~~f~~~~~C~~~qy~~~~ 644 (764)
|+|||||||+-.+ |-.|. ..|-|+ +.|-...--+++||.++.
T Consensus 340 VvAHElTHGvtq~tA~L~Y~~qsGALN--------EsfSDvfG~~i~~~~~~~ 384 (507)
T COG3227 340 VVAHELTHGVTQQTAGLIYRGQSGALN--------ESFSDVFGTLIEQYVKNQ 384 (507)
T ss_pred eehhhhcchhhhhccCceecCCCCchh--------hHHHHHHHHHHHHHhccC
Confidence 7999999999654 33343 335443 456666666777777654
No 27
>PF01421 Reprolysin: Reprolysin (M12B) family zinc metalloprotease This Prosite motif covers only the active site.; InterPro: IPR001590 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12B (adamalysin family, clan (MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The adamalysins are zinc dependent endopeptidases found in snake venom. There are some mammalian proteins such as P78325 from SWISSPROT, and fertilin Q28472 from SWISSPROT. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes. CD156 (also called ADAM8 (3.4.24 from EC) or MS2 human) has been implicated in extravasation of leukocytes. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2E3X_A 2W15_A 2W14_A 2W13_A 2W12_A 1ND1_A 3K7L_A 2DW2_A 2DW0_B 2DW1_A ....
Probab=30.67 E-value=35 Score=34.10 Aligned_cols=18 Identities=22% Similarity=0.364 Sum_probs=15.7
Q ss_pred chhhhHHHHHHHHHhhcc
Q psy1977 587 MNFGAIGFVIGHEITHGF 604 (764)
Q Consensus 587 ~nyg~iG~vighEitHaF 604 (764)
..+...+.+|||||.|.+
T Consensus 126 ~~~~~~a~~~AHelGH~l 143 (199)
T PF01421_consen 126 RSGLSFAVIIAHELGHNL 143 (199)
T ss_dssp SSHHHHHHHHHHHHHHHT
T ss_pred chhHHHHHHHHHHHHHhc
Confidence 456788999999999988
No 28
>TIGR03076 near_not_gcvH Chlamydial GcvH-like protein upstream region protein. The H protein (GcvH) of the glycine cleavage system shuttles the methylamine group of glycine from the P protein to the T protein. Most Chlamydia but lack the P and T proteins, and have a single homolog of GcvH that appears deeply split from canonical GcvH in molecular phylogenetic trees. The protein family modeled here is observed so far only in the Chlamydiae, always as part of a two-gene operon, upstream of the homolog of GcvH. Its function is unknown.
Probab=30.52 E-value=1e+02 Score=35.05 Aligned_cols=67 Identities=16% Similarity=0.233 Sum_probs=47.9
Q ss_pred CCcCCCCHhhHHHHHHHHHHHHHHHcCCccCCcccccCCccchhhhhhhhccHHHHHHHHHHHHhcCCCCCCCCCCCCCC
Q psy1977 616 NLVDWWQEDTKRSYIEKARCIIEQYGNYSVPEVNLTLNGVNNQGENIADNGGMKEAYNAYVNWAATHPAEPRLPGLQNFT 695 (764)
Q Consensus 616 n~~~wW~~~s~~~f~~~~~C~~~qy~~~~~~~~~~~~ng~~tl~EnIAD~gGl~~Ay~AY~~~~~~~~~e~~Lpgl~~~t 695 (764)
|+.||+.+.-...-.+..-|+++.=..|. +.-|++|-++|- ++-|-+||.+..+. |+++ +|
T Consensus 211 ~i~DWFg~~yvs~v~e~ll~fi~EQkk~v---------~mps~~EA~~Df--~dkaq~af~~~sk~-------~~~~-ls 271 (686)
T TIGR03076 211 NIKDWFGDAYVSAAVEALLRFIDEQKKNI---------AMPSLKEAQQDF--YDKAKQAFTKLSKH-------AEFN-LT 271 (686)
T ss_pred cHHHhhhHHHHHHHHHHHHHHHHHhcccc---------cCCcHHHHHHHH--HHHHHHHHHHhccC-------CCcC-cC
Confidence 67899998877776777777775323322 123889999994 78888999876432 7887 88
Q ss_pred hhhHHH
Q psy1977 696 PQQMFW 701 (764)
Q Consensus 696 ~~QLFF 701 (764)
.+|.|=
T Consensus 272 ~~e~v~ 277 (686)
T TIGR03076 272 FDQFVS 277 (686)
T ss_pred HHHHHH
Confidence 888663
No 29
>PRK03001 M48 family peptidase; Provisional
Probab=27.16 E-value=93 Score=33.10 Aligned_cols=41 Identities=29% Similarity=0.356 Sum_probs=29.6
Q ss_pred ceeeeeeecC---CCeeeeccccccCCCCCCCccccchhhhHHHHHHHHHhhcc
Q psy1977 554 ALVNAFYSSI---ENSITFPAGILQGSFFSNDRPKYMNFGAIGFVIGHEITHGF 604 (764)
Q Consensus 554 ~~vna~Y~~~---~N~Iv~PagiLq~Pff~~~~p~a~nyg~iG~vighEitHaF 604 (764)
...|||-... +-.|++.-|+|+. ++. .-+-.||||||.|--
T Consensus 93 ~~~NAfa~G~~~~~~~Ivvt~gLl~~--l~~--------~El~aVlAHElgHi~ 136 (283)
T PRK03001 93 DQPNAFATGRNPEHAAVAATTGILRV--LSE--------REIRGVMAHELAHVK 136 (283)
T ss_pred CCcceEEecCCCCCeEEEecHHHHhh--CCH--------HHHHHHHHHHHHHHh
Confidence 4679987643 2369999998873 222 458899999999953
No 30
>cd04267 ZnMc_ADAM_like Zinc-dependent metalloprotease, ADAM_like or reprolysin_like subgroup. The adamalysin_like or ADAM family of metalloproteases contains proteolytic domains from snake venoms, proteases from the mammalian reproductive tract, and the tumor necrosis factor alpha convertase, TACE. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=26.33 E-value=38 Score=33.55 Aligned_cols=22 Identities=23% Similarity=0.465 Sum_probs=16.5
Q ss_pred hhHHHHHHHHHhhccCccccccCCC
Q psy1977 590 GAIGFVIGHEITHGFDDQGRQFDKN 614 (764)
Q Consensus 590 g~iG~vighEitHaFD~~G~~yd~~ 614 (764)
...+.+++|||.|.| |-.+|..
T Consensus 131 ~~~~~~~aHElGH~l---G~~HD~~ 152 (192)
T cd04267 131 LLTALTMAHELGHNL---GAEHDGG 152 (192)
T ss_pred eeehhhhhhhHHhhc---CCcCCCC
Confidence 356889999999998 5555544
No 31
>TIGR01089 fucI L-fucose isomerase. This enzyme catalyzes the first step in fucose metabolism, and has been characterized in Escherichia coli and Bacteroides thetaiotaomicron.
Probab=26.11 E-value=4.4e+02 Score=30.91 Aligned_cols=56 Identities=27% Similarity=0.515 Sum_probs=38.0
Q ss_pred eccccccCCCC-C-CCccc-----cchhhhHHHHHHHHHh----------------------------------------
Q psy1977 569 FPAGILQGSFF-S-NDRPK-----YMNFGAIGFVIGHEIT---------------------------------------- 601 (764)
Q Consensus 569 ~PagiLq~Pff-~-~~~p~-----a~nyg~iG~vighEit---------------------------------------- 601 (764)
+..++|..||= + ...|- .-.+|++++.+||.||
T Consensus 311 ~aeA~LNs~fDwnG~repi~~AcEnD~lgaltMl~g~lLTg~p~iFaDVRtyws~~~v~r~tg~~~~g~~~~g~ihl~Ns 390 (587)
T TIGR01089 311 TAEAILNSSFDWNGVREPFVVATENDSLNGVAMLFGHQLTGTAQIFADVRTYWSPEAVERVTGHKLDGLAEHGIIHLINS 390 (587)
T ss_pred hHHHHhCCccccCCCcCCeeeeeccchhhHHHHHHHHHHhCCCceeeecccccChhhhhhcccccccccccCCEEEEecC
Confidence 77788877762 1 11221 2358899999998885
Q ss_pred --hccCccccccCCCCC--CcCCCCHh
Q psy1977 602 --HGFDDQGRQFDKNGN--LVDWWQED 624 (764)
Q Consensus 602 --HaFD~~G~~yd~~Gn--~~~wW~~~ 624 (764)
|+.|-.|+..|.+|| +.+||.-.
T Consensus 391 Ga~~ld~~~~~~d~~gn~~~k~~~~~~ 417 (587)
T TIGR01089 391 GSAALDGSGKQRDAEGNPTMKPHWEIT 417 (587)
T ss_pred cHHHhHHhhcccCcccCcccCcchhcc
Confidence 334556788899988 78999743
No 32
>PF14891 Peptidase_M91: Effector protein
Probab=24.69 E-value=30 Score=34.00 Aligned_cols=14 Identities=36% Similarity=0.534 Sum_probs=12.2
Q ss_pred HHHHHHHHhhccCc
Q psy1977 593 GFVIGHEITHGFDD 606 (764)
Q Consensus 593 G~vighEitHaFD~ 606 (764)
-++++|||.||.|.
T Consensus 104 ~v~L~HEL~HA~~~ 117 (174)
T PF14891_consen 104 FVVLYHELIHAYDY 117 (174)
T ss_pred HHHHHHHHHHHHHH
Confidence 47899999999986
No 33
>PRK03072 heat shock protein HtpX; Provisional
Probab=24.63 E-value=1.2e+02 Score=32.39 Aligned_cols=55 Identities=35% Similarity=0.515 Sum_probs=32.8
Q ss_pred hhCCCCCCCCcccCCCCceeeeeeecC--CC-eeeeccccccCCCCCCCccccchhhhHHHHHHHHHhhcc
Q psy1977 537 LLRKPVNKTEWISHGHPALVNAFYSSI--EN-SITFPAGILQGSFFSNDRPKYMNFGAIGFVIGHEITHGF 604 (764)
Q Consensus 537 ~L~~p~~~~~w~~~~~p~~vna~Y~~~--~N-~Iv~PagiLq~Pff~~~~p~a~nyg~iG~vighEitHaF 604 (764)
+.+-|.. ..+.+. ....|||-.-. ++ .|++.-|+|+ .++=.-+-.||||||.|--
T Consensus 82 ~~g~p~p-~vyv~~--~~~~NAFa~G~~~~~~~v~vt~gLl~----------~l~~~El~aVlAHElgHi~ 139 (288)
T PRK03072 82 AARQPMP-RLYISP--TAAPNAFATGRNPRNAAVCCTEGILQ----------ILNERELRGVLGHELSHVY 139 (288)
T ss_pred HcCCCCC-CEEEec--CCCCceEEecCCCCCcEEEecHHHHH----------hCCHHHHHHHHHHHHHHHh
Confidence 3444432 334444 34578976521 22 3666777774 2344568899999999953
No 34
>PF15601 Imm42: Immunity protein 42
Probab=24.18 E-value=2.6e+02 Score=26.25 Aligned_cols=83 Identities=20% Similarity=0.422 Sum_probs=53.9
Q ss_pred HHhhHHHHhHHhhcccHHHHHHHHHHHHHHHHHHHHhhh-cCCCCCHHHHHHHHHHHHhcccccccCCc---c-CChhhH
Q psy1977 431 SISLAMGALYVRKFFKEDAKANAIEMVELIREEMNKILN-TVEWMDNKTRRKALEKAKAMTAHIAYPDE---L-LDDNKI 505 (764)
Q Consensus 431 ~~~~a~~~lyv~~~f~~~~k~~~~~m~~~Ik~~f~~~l~-~~~Wmd~~tk~~a~~Kl~~m~~~igyP~~---~-~~~~~L 505 (764)
.+|..+..+| +...+.+.-..|.+-+.+||+.|.+.-- +..|--+. +.. -|+| + .+.+.|
T Consensus 37 kfP~Lm~~LY-~g~L~~~~~~~A~~eL~~I~~~l~~~~p~~ViWD~~d-----------l~~---~ppWg~~i~~~i~~L 101 (134)
T PF15601_consen 37 KFPLLMNELY-RGYLRYEELEKALKELEEIRKELKKFPPSEVIWDIED-----------LSK---QPPWGDNISPDITSL 101 (134)
T ss_pred cchHHHHHHH-cCCCCHHHHHHHHHHHHHHHHHHhcCChhhheechhh-----------ccc---CCCCcccCCCCCCcH
Confidence 5788888887 5667777778888888999999977652 34573221 111 2777 2 234557
Q ss_pred HHHhcccccCCCChHHHHHHHHHHH
Q psy1977 506 SEFYDKLEIDPNNYYKSILNLTKFG 530 (764)
Q Consensus 506 ~~~Y~~l~~~~~~yf~n~l~~~~~~ 530 (764)
.++| +.....+||+.+.++....
T Consensus 102 ~~yF--vt~dG~~~f~~l~~a~~~a 124 (134)
T PF15601_consen 102 SDYF--VTSDGKDLFEVLFRALESA 124 (134)
T ss_pred HHHh--cCcchhhHHHHHHHHHHHH
Confidence 7776 3334456888877776554
No 35
>PRK03982 heat shock protein HtpX; Provisional
Probab=23.04 E-value=1.3e+02 Score=32.15 Aligned_cols=41 Identities=32% Similarity=0.343 Sum_probs=28.0
Q ss_pred ceeeeeee---cCCCeeeeccccccCCCCCCCccccchhhhHHHHHHHHHhhcc
Q psy1977 554 ALVNAFYS---SIENSITFPAGILQGSFFSNDRPKYMNFGAIGFVIGHEITHGF 604 (764)
Q Consensus 554 ~~vna~Y~---~~~N~Iv~PagiLq~Pff~~~~p~a~nyg~iG~vighEitHaF 604 (764)
...|||-. |.+-.|++--|+++. ++=.-+-.|+||||.|--
T Consensus 94 ~~~NAfa~G~~~~~~~V~vt~gLl~~----------l~~~El~AVlAHElgHi~ 137 (288)
T PRK03982 94 QTPNAFATGRDPKHAVVAVTEGILNL----------LNEDELEGVIAHELTHIK 137 (288)
T ss_pred CCcceEEeccCCCCeEEEeehHHHhh----------CCHHHHHHHHHHHHHHHH
Confidence 34699876 333456688887753 223458899999999953
No 36
>PF13688 Reprolysin_5: Metallo-peptidase family M12; PDB: 2FV5_B 3EWJ_A 3KME_A 3L0T_B 1BKC_E 3G42_D 2I47_D 2FV9_B 3LEA_A 1ZXC_B ....
Probab=22.96 E-value=56 Score=32.40 Aligned_cols=26 Identities=23% Similarity=0.221 Sum_probs=15.5
Q ss_pred chhhhHHHHHHHHHhhccCccccccCCCC
Q psy1977 587 MNFGAIGFVIGHEITHGFDDQGRQFDKNG 615 (764)
Q Consensus 587 ~nyg~iG~vighEitHaFD~~G~~yd~~G 615 (764)
.....-..+++|||.|.| |...|.++
T Consensus 137 ~~~~~~~~~~AHEiGH~l---Ga~HD~~~ 162 (196)
T PF13688_consen 137 PPTYNGAITFAHEIGHNL---GAPHDGDY 162 (196)
T ss_dssp --HHHHHHHHHHHHHHHT---T-----SS
T ss_pred CCCCceehhhHHhHHHhc---CCCCCCCC
Confidence 345666789999999998 56666654
No 37
>PF13253 DUF4044: Protein of unknown function (DUF4044)
Probab=22.38 E-value=1.8e+02 Score=20.63 Aligned_cols=28 Identities=18% Similarity=0.193 Sum_probs=18.2
Q ss_pred ccccchhhHHHHHHHHHHHHHHHHHHHHH
Q psy1977 12 WHRRTSLEKYLMLVLGTLLTSTFIFIFCV 40 (764)
Q Consensus 12 ~~~~~~~~~~l~~~~~~~~~~~~~~~~~~ 40 (764)
++.+|..|+.-.+++. +++++.++.+++
T Consensus 3 kkkKS~fekiT~v~v~-lM~i~tvg~v~~ 30 (35)
T PF13253_consen 3 KKKKSTFEKITMVVVW-LMLILTVGSVVA 30 (35)
T ss_pred CccccHHHHHHHHHHH-HHHHHHHHHHHH
Confidence 3467888987777666 666655555444
No 38
>PRK04897 heat shock protein HtpX; Provisional
Probab=21.27 E-value=1.4e+02 Score=32.04 Aligned_cols=40 Identities=35% Similarity=0.448 Sum_probs=28.4
Q ss_pred ceeeeeeec---CCCeeeeccccccCCCCCCCccccchhhhHHHHHHHHHhhc
Q psy1977 554 ALVNAFYSS---IENSITFPAGILQGSFFSNDRPKYMNFGAIGFVIGHEITHG 603 (764)
Q Consensus 554 ~~vna~Y~~---~~N~Iv~PagiLq~Pff~~~~p~a~nyg~iG~vighEitHa 603 (764)
...|||... .+..|++--|+|+. ++=--+-.|+||||.|-
T Consensus 106 ~~~NAfa~G~~~~~~~v~vt~gLl~~----------l~~~El~aVlAHElgHi 148 (298)
T PRK04897 106 PSPNAFATGSSPKNAAVAVTTGLLAI----------MNREELEGVIGHEISHI 148 (298)
T ss_pred CCCceEEeccCCCCcEEEeehHHHhh----------CCHHHHHHHHHHHHHHH
Confidence 357998764 34468888888863 12245889999999995
No 39
>PF10835 DUF2573: Protein of unknown function (DUF2573); InterPro: IPR020393 This entry contains proteins with no known function.
Probab=21.19 E-value=3.8e+02 Score=22.61 Aligned_cols=64 Identities=20% Similarity=0.245 Sum_probs=38.7
Q ss_pred hcHHHHHHHHHHHHhhcCCCCC-----CchHhhHHHHHHHHHhhHHHHhHHhhcccHHHHHHHHHHHHHHHHH
Q psy1977 396 LTEAIRNRQLQYSTALSGRTQM-----EPRWKECVDTVSSSISLAMGALYVRKFFKEDAKANAIEMVELIREE 463 (764)
Q Consensus 396 L~~~~~~~~~~~~~~l~G~~~~-----~~rw~~Cv~~~~~~~~~a~~~lyv~~~f~~~~k~~~~~m~~~Ik~~ 463 (764)
|.+.|..+...|..-|.|-+.. .-.|- -...+.+.||..+... +---+++|..+.++|.+||+-
T Consensus 4 l~eq~dgLveKytELL~Ge~~~e~~EkVk~W~-lYshiaKsMPpL~kHW---N~~~PeaK~~ik~li~~Ik~l 72 (82)
T PF10835_consen 4 LQEQFDGLVEKYTELLLGETSPEMKEKVKQWA-LYSHIAKSMPPLAKHW---NGTYPEAKEEIKELIEEIKQL 72 (82)
T ss_pred HHHHHHHHHHHHHHHHhcCCCHHHHHHHHHHH-HHHHHHHhCcHHHHhh---cccCchHHHHHHHHHHHHHHH
Confidence 3456666777788888885532 12341 2344566777643221 111357899999999999864
No 40
>KOG0718|consensus
Probab=20.97 E-value=1.4e+02 Score=33.70 Aligned_cols=66 Identities=21% Similarity=0.292 Sum_probs=41.3
Q ss_pred HhHH-hhcccHHHHHHHHHHHHHHHHHHHHhhhcCCCCCHHHHHHHHHH-----HHhcccccccCCccCChhhHHHHhcc
Q psy1977 438 ALYV-RKFFKEDAKANAIEMVELIREEMNKILNTVEWMDNKTRRKALEK-----AKAMTAHIAYPDELLDDNKISEFYDK 511 (764)
Q Consensus 438 ~lyv-~~~f~~~~k~~~~~m~~~Ik~~f~~~l~~~~Wmd~~tk~~a~~K-----l~~m~~~igyP~~~~~~~~L~~~Y~~ 511 (764)
.||. ++|.+++.|++|+++|+.|++|+.-.. |+.+| +.-+- |+.-...+|++- .+++++.++|+.
T Consensus 35 ~lfHPDkh~dpd~K~~AE~~F~~i~~AyEVLs------Dp~kR-aIYD~~G~qGL~t~gwEl~~r~--~tpeEIreE~Er 105 (546)
T KOG0718|consen 35 RLFHPDKHTDPDQKKAAEEKFQRIQRAYEVLS------DPQKR-AIYDNYGEQGLKTEGWELGFRG--KTPEEIREEYER 105 (546)
T ss_pred HhcCCcccCChhHHHHHHHHHHHHHHHHHHhc------ChHHH-HHHHHhhhccccccCceeecCC--CCHHHHHHHHHH
Confidence 4555 577889999999999999999995432 44443 33222 222223455543 456667777654
Q ss_pred c
Q psy1977 512 L 512 (764)
Q Consensus 512 l 512 (764)
+
T Consensus 106 l 106 (546)
T KOG0718|consen 106 L 106 (546)
T ss_pred H
Confidence 3
Done!