Query psy9963
Match_columns 800
No_of_seqs 292 out of 2054
Neff 8.6
Searched_HMMs 46136
Date Sat Aug 17 00:08:03 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy9963.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/9963hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 COG3590 PepO Predicted metallo 100.0 2E-114 4E-119 916.9 47.6 631 1-693 14-654 (654)
2 KOG3624|consensus 100.0 3E-111 6E-116 998.2 58.6 627 1-693 56-687 (687)
3 PF05649 Peptidase_M13_N: Pept 100.0 1.9E-59 4.1E-64 524.2 31.1 383 8-425 1-390 (390)
4 PF01431 Peptidase_M13: Peptid 100.0 1.1E-53 2.5E-58 432.7 11.3 204 486-692 1-206 (206)
5 COG3590 PepO Predicted metallo 99.7 3.4E-17 7.4E-22 176.1 6.5 96 695-790 316-411 (654)
6 KOG3624|consensus 99.5 4E-14 8.7E-19 168.8 7.2 97 703-799 347-443 (687)
7 PF05649 Peptidase_M13_N: Pept 99.4 1.1E-13 2.4E-18 154.6 5.9 79 701-779 312-390 (390)
8 PF14247 DUF4344: Domain of un 79.9 0.84 1.8E-05 46.1 1.2 51 484-534 48-105 (220)
9 KOG0506|consensus 60.0 6.9 0.00015 43.1 2.7 165 416-606 165-348 (622)
10 PF09768 Peptidase_M76: Peptid 59.2 7.9 0.00017 37.6 2.8 41 485-537 47-87 (173)
11 PF04228 Zn_peptidase: Putativ 58.0 4.1 8.8E-05 43.2 0.6 84 483-572 135-225 (292)
12 PF01421 Reprolysin: Reprolysi 57.1 13 0.00028 37.1 4.1 18 516-533 126-143 (199)
13 KOG2661|consensus 42.8 26 0.00057 37.1 3.6 23 513-535 267-289 (424)
14 cd04272 ZnMc_salivary_gland_MP 42.0 33 0.00072 34.8 4.4 12 522-533 146-157 (220)
15 cd04270 ZnMc_TACE_like Zinc-de 41.8 18 0.00039 37.5 2.3 21 520-543 166-186 (244)
16 cd04269 ZnMc_adamalysin_II_lik 40.5 27 0.00059 34.6 3.4 24 517-543 127-150 (194)
17 KOG3314|consensus 36.3 21 0.00045 33.8 1.6 21 519-539 89-109 (194)
18 PF01447 Peptidase_M4: Thermol 36.0 18 0.0004 34.3 1.3 12 523-534 137-148 (150)
19 cd04273 ZnMc_ADAMTS_like Zinc- 34.9 17 0.00038 36.5 1.0 20 521-543 140-159 (207)
20 PF03002 Somatostatin: Somatos 34.5 13 0.00028 21.3 -0.0 9 4-12 3-11 (18)
21 PF10871 DUF2748: Protein of u 33.9 3.8E+02 0.0081 28.1 10.1 141 367-534 80-223 (447)
22 PF12315 DUF3633: Protein of u 31.3 27 0.00059 34.7 1.6 22 510-535 86-107 (212)
23 cd04267 ZnMc_ADAM_like Zinc-de 26.3 40 0.00086 33.3 1.8 22 519-543 131-152 (192)
24 KOG1047|consensus 24.4 44 0.00096 38.1 1.9 38 520-566 287-330 (613)
25 PF13069 DUF3933: Protein of u 24.0 60 0.0013 23.5 1.8 19 759-777 20-38 (53)
26 PF13485 Peptidase_MA_2: Pepti 21.8 1E+02 0.0022 27.4 3.6 53 487-554 4-56 (128)
No 1
>COG3590 PepO Predicted metalloendopeptidase [Posttranslational modification, protein turnover, chaperones]
Probab=100.00 E-value=1.6e-114 Score=916.88 Aligned_cols=631 Identities=30% Similarity=0.516 Sum_probs=584.2
Q ss_pred CCCCCCCCCchHHhhhcCcccCCCCCCCCCccchHHHHHHHHHHHHHHHhcCcC---CCCCChHHHHHHHHHHHHhCccc
Q psy9963 1 MDPGVQPCDNFYKFACGNFLKTSVIPPDRASVSTFSKIEDRLSSQIKTVLEEPQ---SASGPHIFNLMKKFYATCMDEDT 77 (800)
Q Consensus 1 mn~svdPCdDFY~yaCG~w~~~~~ip~~~~~~~~~~~~~~~~~~~l~~lL~~~~---~~~~~~~~~~~~~~Y~sC~~~~~ 77 (800)
||..+.||||||+||+|.|.++++||+|+++++.|.+|.++.+.+++++++... ..++....+++.+||++.||..+
T Consensus 14 ~d~~~rpqdDly~~vNG~Wl~~~~IPaDrsr~G~F~~L~d~~e~~~~~~i~~a~a~~~ap~~~~~~~~g~~y~~~mD~~~ 93 (654)
T COG3590 14 MDAMTRPQDDLYGYVNGEWLKTAEIPADRSRDGAFDKLDDRAEALVRDIIEAAAANEQAPEDAILQRIGKLYRSFMDEAK 93 (654)
T ss_pred CCCcCCcchHHHHHhhhhhhhcCcCCCCccccccHHHHHHHHHHHHHHHHHHHhcCCCCCcchHHHHHHHHHHHhccHHH
Confidence 899999999999999999999999999999999999999999999999998753 23344689999999999999999
Q ss_pred ccccccccHHHHHHHhCCCCCCCCCCCCCCcchHHHHHHHHHhcccCcccceeeeeccccCCCCCCCCCCCCCCCCCCCc
Q psy9963 78 LDKLGVQPMKVIIEQLGGLPVVDGDRWDSSRFSWINTTYKLRDLGFNHNLLINIVFCKQTQHPINTTQIPGAGPDFKNST 157 (800)
Q Consensus 78 ~~~~~~~~l~~~l~~~g~wp~~~~~~~~~~~~~~~~~l~~l~~~~~g~~~l~~~~~~~~~~~~~~~~~~~~v~~d~~~~~ 157 (800)
+|+.|..||+..|.++...... + ++...++++.+. |.+.+|++ +|.+|.+|++
T Consensus 94 ~E~~g~~Pl~~~La~i~~~~s~-------s--df~a~l~~~~~~--g~~~~f~~----------------~vspD~kds~ 146 (654)
T COG3590 94 REKAGVDPLKPELAEIDSLASF-------S--DFAAALGQLERA--GQGNPFGF----------------SVSPDFKDST 146 (654)
T ss_pred HHhcCCCchhHHHHHHHhhccH-------H--HHHHHHHHHHhc--cCCCCcee----------------eeccccccch
Confidence 9999999999999987643211 1 677889999886 78899999 9999999999
Q ss_pred ceeeeecCCCCCchhh-hhcccccHHHHHHHHHHHHHHHHHcCCChh--HHHHHHHHHHHHHHHHhccCCChhhccchhh
Q psy9963 158 RRRLLIDQVDLKPLED-FLREGWRNKNVRAYHSYMVKTAVLFGAPVS--RAFRELKDSIDFEITMTKFIVPDQELRNLSY 234 (800)
Q Consensus 158 ~~~l~l~~~~l~l~~~-~~~~~~~~~~~~~y~~~~~~~~~~~g~~~~--~~~~~~~~v~~fE~~La~~~~~~~~~~~~~~ 234 (800)
.+++++.||+|+||++ |+.+.....++.+|.+.+.+++.++|.+.+ .+.+.+..|+.||.+|++......+.++...
T Consensus 147 ~~v~~~sq~GlgLPD~~YY~de~~~e~l~ay~~hv~rml~~~~~~~~~~d~a~~A~~v~a~et~lA~~~~~~~~~rd~~~ 226 (654)
T COG3590 147 RYVLYFSQSGLGLPDTTYYRDEQHAELLAAYKEHVARMLGLFGFSEEEEDAAKHALRVVALETKLANASWEVVKYRDLYH 226 (654)
T ss_pred hheeeeccCCCCCCchhhhhhhhHHHHHHHHHHHHHHHHHHhCCchhhhhHHHHHHHHHHHHHHHhhhhhHHHHHHhhhc
Confidence 9999999999999998 888777888999999999999999998765 6788899999999999999988888888888
Q ss_pred hccccCHHHHhhhCCCCCccccccCCCCchhHHHHHHHhhCCCCCCCCCCeEEEeChhHHHHHHHHHhcCChhHHHHHHH
Q psy9963 235 LYNLMSVHTLNKYYPSIPCFYLEIDGPSLNKKLEFLNRIIGPKISIDSEELVMVTQPKYMSSIERLVQITPKRILANYML 314 (800)
Q Consensus 235 ~y~~~tl~eL~~~~p~i~w~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~V~v~~~~yl~~l~~ll~~t~~~~l~nYl~ 314 (800)
.||+.++.+|+...|..+| ..++..+ +.-++..|++.+|.|++.++.++.+++...++.|+.
T Consensus 227 ~Yn~~tf~~l~~~~p~~~w-------------~~~~~~~-----G~~PD~~iv~~~p~y~~~~~~~~~e~~~~~wKawl~ 288 (654)
T COG3590 227 TYNPATFAELQPELPGDDW-------------SLLFSAL-----GQLPDKVIVVENPFYLKEFASLLAEENWADWKAWLR 288 (654)
T ss_pred CcChhhHHHhcccCCCccH-------------HHHHHHh-----cCCCCeeEeecCchHHHHHHHHHhcCcHHHHHHHHH
Confidence 8999999999999999999 7777764 323457888999999999999999999999999999
Q ss_pred HHHHHHhhhhcCHHHhhhhhhh-hHhhcCCCCCchhhhhHHHHHHHHhhhhHhhhHHhhccChhhHHHHHHHHHHHHHHH
Q psy9963 315 MQTVLSSSAYLCKSARRPYHDF-KYVLTGVNSTRIRSQRCVDEVAEKFSLAVGSLYVRHFFDRTSKENALELVNNLRQEM 393 (800)
Q Consensus 315 ~~~~~~~~~~l~~~~~~~~~~~-~~~~~g~~~~~~~~~~C~~~~~~~~p~~~~~ly~~~~~~~~~~~~v~~m~~~ik~~~ 393 (800)
|+++...+|||+++++...++| .+.++|.++..+||+..+.+++..|+.++|.+||+++|++++|..++++|.+++.+|
T Consensus 289 ~~~~~~~ap~Lt~dl~~~~f~fy~r~LsG~~E~r~rwKr~~~l~~~~~geaiG~~Yv~~~Fpp~aKa~m~~lVa~l~~ay 368 (654)
T COG3590 289 WKLIRAAAPYLTEDLVDEHFDFYGRTLSGQPEARDRWKRAVNLAERLFGEAIGLLYVKRYFPPEAKADMEELVANLIKAY 368 (654)
T ss_pred HHHHhhcchhhhhHHHHhhhHHHhhhccCChhhhHHHHHHHHHHHHHHHHHHHHHHHHHhCChHHHHHHHHHHHHHHHHH
Confidence 9999999999999999999955 699999999999999999999999999999999999999999999999999999999
Q ss_pred HHhhhcCCCCCHHHHHHHHHHHhhcccccccCCCCCChhhHHHhcccceecCCChHHHHHHHHHHHHHHHHHHhcCCCCC
Q psy9963 394 DYILQNVSWMDPQTKKNALEKSKVLKVHIGYPDELLDDDKIDLHYKDLRIVPGDLLTSVLNLYKKSVDYAYTQLHEPINK 473 (800)
Q Consensus 394 ~~~l~~~~Wmd~~tk~~a~~Kl~~m~~~ig~p~~~~~~~~l~~~y~~l~~~~~~~~~n~~~~~~~~~~~~~~~l~~p~~~ 473 (800)
+.+|++++||+++||++|+.||+.++.+||||+.+. +|+.|.+..+++++|.++..++...+.++++++|||+
T Consensus 369 k~Ri~~ldWmtpeTrekAl~KL~~ft~kIGYP~~wr-------~y~kL~~~~~sl~~N~~r~~~~~~~~~l~K~~kPVDr 441 (654)
T COG3590 369 KARISKLDWMTPETREKALEKLNKFTAKIGYPDPWR-------YYSKLEIKRDSLYGNVLRASAFNHAHELSKIGKPVDR 441 (654)
T ss_pred HHHhhhccccCHHHHHHHHHHHhccccccCCCchhh-------hhhhhccCchhHHHHHHHHHHHHHhhhHHHhCCCCch
Confidence 999999999999999999999999999999997543 5889999889999999999999999999999999999
Q ss_pred CCCCcCCCcceeeeeeecCCCeeeecccccCCCccccccccccchhhHHHHHHHHHHhhccccCCccCCCCCccCCCCHH
Q psy9963 474 TSWKEFGSVATVNAFNHIDKNAIEFPAGILQEPYFAADRPCFLNYGAIGTVIGHEITHTFDDEGSQYDKFGNVMNWWDDE 553 (800)
Q Consensus 474 ~~w~~~~~p~~v~a~Y~~~~N~i~vP~~~l~~P~~~~~~p~~~nyg~lG~~iahel~h~fD~~G~~~d~~G~~~~ww~~~ 553 (800)
..|.|. |++|||||+|..|.|+||++|||+|||+++.+.+.|||+||.+|||||+||||++|++||.+|++.+|||++
T Consensus 442 ~eW~M~--pq~VNAYYnp~~N~IVFPAAILQ~PFfd~ea~~a~NYGgIGaVIgHEI~HgFDdqGakfD~~GnL~dWWT~e 519 (654)
T COG3590 442 DEWEMP--PQTVNAYYNPQKNEIVFPAAILQAPFFDPEADSAANYGGIGAVIGHEIGHGFDDQGAKFDGDGNLNDWWTDE 519 (654)
T ss_pred hhcCCC--HHHhhhhcCCCCceEeeeHHhcCCCCCCCCcchhhcccCccceehhhhcccccCCccccCCCCcHHhhcCHH
Confidence 999999 999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHHHHHHHHHHHhccccccCCCcccCCc-cchhhhhhhccChHHHHHHHHHHHHhcCcccCCCCCCCCchhHHHHHH
Q psy9963 554 SKMRYNEKASCIIKQYSNYVIEDIGFRINGI-TTQGENIADNGGVKESYRAYLECAKQKNAEYKLPGLENFSDKQMFWIS 632 (800)
Q Consensus 554 ~~~~f~~~~~Cl~~qy~~~~~~~~~~~~~g~-~tl~E~iaD~~gl~~ay~ay~~~~~~~~~~~~lpgl~~~s~~QLFFi~ 632 (800)
+...|+++++-++.||+.+.... +.. +|. +|++|||||.|||..|+.||+.... .-|-+++++..|-||++
T Consensus 520 D~~aF~~~T~~lvaqf~~~e~~~-~~~-~g~~lTvgENIaDlGGl~~Al~Ayk~~~~------~~pv~dg~tg~qrfF~~ 591 (654)
T COG3590 520 DAAAFKERTKKLVAQFDGYEPEG-GKD-NGNALTVGENIADLGGLAIALDAYKLSLD------PAPVIDGFTGLQRFFLN 591 (654)
T ss_pred HHHHHHHHHHHHHHHhcCccccc-ccc-ccceeeeccccccchhHHHHHHHHHHhcC------CcccccccchHHHHHHH
Confidence 99999999999999999997553 332 345 9999999999999999999998654 22445668999999999
Q ss_pred HHHHhhccCcHHHHHHhhccCCCCCCccEEEecccCchhHHhhcCCCCCCCCC--ccCceeeC
Q psy9963 633 FANVWCTKYRKEAMELLFKNDDHPPAEIRVNGPLSNSYDFARDFQCYSGSKMN--PKHKCEVW 693 (800)
Q Consensus 633 ~a~~~C~~~~~~~~~~~~~~~~h~p~~~RvN~~L~n~~eF~~AF~C~~gs~Mn--p~~kC~lw 693 (800)
|||.|....+++...+++..|+|+|+++|||+++.|+++|.+||.|++|++|. |++|..||
T Consensus 592 wAqiWR~K~r~e~~~~~l~~DpHsP~~fR~Ng~~~N~~~FyeaFdVke~D~myr~pe~Rv~IW 654 (654)
T COG3590 592 WAQIWRMKARPEEQQMRLSVDPHSPAEFRVNGPVRNMDEFYEAFDVKEGDAMYRAPEERVVIW 654 (654)
T ss_pred HHHHHhhccCHHHHHHHHccCCCCCceeeccCCCCChHHHHHHcCCCCCcccccChhhccccC
Confidence 99999999999999999999999999999999999999999999999999995 88999998
No 2
>KOG3624|consensus
Probab=100.00 E-value=2.8e-111 Score=998.23 Aligned_cols=627 Identities=38% Similarity=0.694 Sum_probs=577.5
Q ss_pred CCCCCCCCCchHHhhhcCcccCCCCCCCCCccchHHHHHHHHHHHHHHHhcCcCCCCCChHHHHHHHHHHHHhCcccccc
Q psy9963 1 MDPGVQPCDNFYKFACGNFLKTSVIPPDRASVSTFSKIEDRLSSQIKTVLEEPQSASGPHIFNLMKKFYATCMDEDTLDK 80 (800)
Q Consensus 1 mn~svdPCdDFY~yaCG~w~~~~~ip~~~~~~~~~~~~~~~~~~~l~~lL~~~~~~~~~~~~~~~~~~Y~sC~~~~~~~~ 80 (800)
||+|||||+|||+||||+|.+.|+++.+. +++.+..+++.+...++++|+++....+++++++++.||++|++....+.
T Consensus 56 ~d~svdPC~dFy~~ACG~~~~~~~~~~~~-~~~~~~~~~~~~~~~l~~~L~~~~~~~~s~~~~~~k~~Y~sC~~~~~~~~ 134 (687)
T KOG3624|consen 56 MDTSVDPCEDFYQYACGNWAENHPIPKDK-RSSTLDELQDQVLRQLKELLEEPNSSSTSKAERKAKRFYESCLDAKALES 134 (687)
T ss_pred CCCCCCcchhHHHHHhCcHhhcCCCcccc-cccHHHHHHHHHHHHHHHHHhCCCCCCCCHHHHHHHHHHHHHhchhhhhh
Confidence 79999999999999999999999999888 88999999999999999999988766789999999999999999887777
Q ss_pred cc-cccHHHHHHHhCCCCCCCCCCCCCCcchHHHHHHHHHhcccCcccceeeeeccccCCCCCCCCCCCCCCCCCCCcce
Q psy9963 81 LG-VQPMKVIIEQLGGLPVVDGDRWDSSRFSWINTTYKLRDLGFNHNLLINIVFCKQTQHPINTTQIPGAGPDFKNSTRR 159 (800)
Q Consensus 81 ~~-~~~l~~~l~~~g~wp~~~~~~~~~~~~~~~~~l~~l~~~~~g~~~l~~~~~~~~~~~~~~~~~~~~v~~d~~~~~~~ 159 (800)
.+ ..++.+.|+.+||||.++++ |++++|+|.++++.+... ||..+||.+ .|..|..+++
T Consensus 135 ~~~~~~l~~~i~~~G~wP~l~~~-w~~~~f~~~~~l~~~~~~-yg~~~l~~~----------------~v~~~~~~~~-- 194 (687)
T KOG3624|consen 135 SGALQLLFRIIQSIGGWPLLEGN-WDESKFNLNEMLANLLRR-YGLTTLFLL----------------EVALDYKNSS-- 194 (687)
T ss_pred hcchHHHHHHHHHhCCCcCCCCC-CCcccCCHHHHHHHHHHH-cCccceeEE----------------EEecccccCc--
Confidence 65 77999999999999999975 999999999999998888 999999999 9999988876
Q ss_pred eeeecCCCCCchhh--hhcccccHHHHHHHHHHHHHHHHHcCCChhHHHHHHHHHHHHHHHHhccCCChhhccchhhhcc
Q psy9963 160 RLLIDQVDLKPLED--FLREGWRNKNVRAYHSYMVKTAVLFGAPVSRAFRELKDSIDFEITMTKFIVPDQELRNLSYLYN 237 (800)
Q Consensus 160 ~l~l~~~~l~l~~~--~~~~~~~~~~~~~y~~~~~~~~~~~g~~~~~~~~~~~~v~~fE~~La~~~~~~~~~~~~~~~y~ 237 (800)
++.+.+|.+ |.......+.+.+|..++..++.++|.+.+.+.+.++.+++||..|++++.+.+.+++....++
T Consensus 195 -----~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~e~~l~~~~~~~~~~~~~~~~~~ 269 (687)
T KOG3624|consen 195 -----QPGLILPSRSKYLATDSDSEKLRAYLLLANELLQLLGLDSDEAEEYARLVIELERQLANITKPDENRRELQALYN 269 (687)
T ss_pred -----ccccCcchHhhhhccccHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHhccchhhhhhhHHHHcc
Confidence 677777754 4444443466788999999999999999999999999999999999999988877788888999
Q ss_pred ccCHHHHhhhCCCCCccccccCCCCchhHHHHHHHhhCCCCCCCCCCeEEEeChhHHHHHHHHHhcCChhHHHHHHHHHH
Q psy9963 238 LMSVHTLNKYYPSIPCFYLEIDGPSLNKKLEFLNRIIGPKISIDSEELVMVTQPKYMSSIERLVQITPKRILANYMLMQT 317 (800)
Q Consensus 238 ~~tl~eL~~~~p~i~w~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~V~v~~~~yl~~l~~ll~~t~~~~l~nYl~~~~ 317 (800)
.+++++|++.+|.++| ..++..++.. +.....|++.+++|+.+|..+|.+|++++|+|||+|++
T Consensus 270 ~~~l~~l~~~~~~~~~-------------~~~~~~~~~~---~~~~~~~~~~~~~y~~~l~~ii~~t~~~~laNYl~~~~ 333 (687)
T KOG3624|consen 270 SVNLAELQKKFPSIDW-------------KQLLRSVLGE---ILPHNEVVVFNPDYLVNLSAIIQSTPKRTLANYLIWRL 333 (687)
T ss_pred cccHHHHHHhcccccH-------------HHHhhhhccc---cccCCceEEecHHHHHHHHHHHHhCCHHHHHHHHHHHH
Confidence 9999999999999999 9999998876 22333899999999999999999999999999999999
Q ss_pred HHHhhhhcCHHHhhhhhhhhHhhcCCCCCchhhhhHHHHHHHHhhhhHhhhHHhhccChhhHHHHHHHHHHHHHHHHHhh
Q psy9963 318 VLSSSAYLCKSARRPYHDFKYVLTGVNSTRIRSQRCVDEVAEKFSLAVGSLYVRHFFDRTSKENALELVNNLRQEMDYIL 397 (800)
Q Consensus 318 ~~~~~~~l~~~~~~~~~~~~~~~~g~~~~~~~~~~C~~~~~~~~p~~~~~ly~~~~~~~~~~~~v~~m~~~ik~~~~~~l 397 (800)
+..+.+.++... .+|..|++.+...||.+++++|++++|+++++..+.+|++.||.+|.++|
T Consensus 334 ~~~~~~~~~~~~------------------~~~~~Cv~~~~~~~p~a~~~l~~r~~~~~~~~~~~~~m~~~lk~~f~~~l 395 (687)
T KOG3624|consen 334 LDDLLDLLPKPR------------------KRWADCVELVRTLLPLALGRLYVRNFFDKENKKEVSEMIEDLKKAFEEML 395 (687)
T ss_pred HHHHhcccchhh------------------HHHHHHHHHHHHhhhhhhHHHHHHHhcChhHHHHHHHHHHHHHHHHHHHH
Confidence 999887443321 17899999999999999999999999999999999999999999999999
Q ss_pred hcCCCCCHHHHHHHHHHHhhcccccccCCCCCChhhHHHhcccceecCCChHHHHHHHHHHHHHHHHH-HhcCCCCCCCC
Q psy9963 398 QNVSWMDPQTKKNALEKSKVLKVHIGYPDELLDDDKIDLHYKDLRIVPGDLLTSVLNLYKKSVDYAYT-QLHEPINKTSW 476 (800)
Q Consensus 398 ~~~~Wmd~~tk~~a~~Kl~~m~~~ig~p~~~~~~~~l~~~y~~l~~~~~~~~~n~~~~~~~~~~~~~~-~l~~p~~~~~w 476 (800)
++++|||++||+.|++|+++|+.+||||++..++..|++.|..+.+..++|+.++..+.++.+..... .+..++++..|
T Consensus 396 ~~~~Wl~~~tr~~a~~K~~~m~~~ig~p~~~~~~~~ld~~y~~l~l~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~ 475 (687)
T KOG3624|consen 396 QELDWLDEETRKSAIKKLNAMKKNIGYPDEFLPPGTLDKEYENLKLSLDDYDSNLEILLKLQRRRSEQLELRAPVDPLDW 475 (687)
T ss_pred HhCcccCHHHHHHHHHHHHHHHHHcCCHHHhcCchhHHHHHhcccCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccc
Confidence 99999999999999999999999999999988888899999999988888999999999988776666 47777788889
Q ss_pred CcCCCcceeeeeeecCCCeeeecccccCCCccccccccccchhhHHHHHHHHHHhhccccCCccCCCCCccCCCCHHHHH
Q psy9963 477 KEFGSVATVNAFNHIDKNAIEFPAGILQEPYFAADRPCFLNYGAIGTVIGHEITHTFDDEGSQYDKFGNVMNWWDDESKM 556 (800)
Q Consensus 477 ~~~~~p~~v~a~Y~~~~N~i~vP~~~l~~P~~~~~~p~~~nyg~lG~~iahel~h~fD~~G~~~d~~G~~~~ww~~~~~~ 556 (800)
... |..+||+|.+..|.|+||+++||+|||+..+|.++|||++|++|||||+||||+.|++||+.|+.++||+.++..
T Consensus 476 ~~~--~~~~na~Y~~~~N~i~~pa~ilq~P~f~~~~P~~~nyg~iG~vigHEl~H~FD~~G~~~D~~Gn~~~ww~~~s~~ 553 (687)
T KOG3624|consen 476 VGS--PAQVNAFYSPEKNEIVFPAGLLQPPFFDLSYPDYLNYGGIGFVIGHELTHGFDDQGRQYDESGNLRDWWDTESES 553 (687)
T ss_pred ccc--cceeeccccCCCceEEEehhcccCCCCCcccchhhhhHHHHHHHHHHHhhcccccccccCCCcccccCCCHhHHH
Confidence 886 999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHHHHHHHHhccccccCCCcccCCccchhhhhhhccChHHHHHHHHHHHHhcCcccC-CCCCCCCchhHHHHHHHHH
Q psy9963 557 RYNEKASCIIKQYSNYVIEDIGFRINGITTQGENIADNGGVKESYRAYLECAKQKNAEYK-LPGLENFSDKQMFWISFAN 635 (800)
Q Consensus 557 ~f~~~~~Cl~~qy~~~~~~~~~~~~~g~~tl~E~iaD~~gl~~ay~ay~~~~~~~~~~~~-lpgl~~~s~~QLFFi~~a~ 635 (800)
+|.++.+|+++||++|..+ .++||+.|++|||||++|+++||+||+++........+ |||++ +|++||||++|||
T Consensus 554 ~f~~~~qC~~~qy~~y~~~---~~~ng~~t~~EnIAD~~Gl~~A~~AY~~~~~~~~~~~~~lp~l~-~t~~QLFFl~~Aq 629 (687)
T KOG3624|consen 554 EFQERAQCLVKQYSNYLDP---RRLNGSKTLGENIADNGGLKLAYRAYKKWKLDLSGEPRELPGLD-LTPEQLFFLSYAQ 629 (687)
T ss_pred HHHHHHHHHHHHHhccCCC---cccCCccccchhhcchhhHHHHHHHHHHHHHhccCCCCCCCCCC-CChhHHHHHHHHH
Confidence 9999999999999999887 67899999999999999999999999988766544434 99999 9999999999999
Q ss_pred HhhccCcHHHHHHhhccCCCCCCccEEEecccCchhHHhhcCCCCCCCCCccCceeeC
Q psy9963 636 VWCTKYRKEAMELLFKNDDHPPAEIRVNGPLSNSYDFARDFQCYSGSKMNPKHKCEVW 693 (800)
Q Consensus 636 ~~C~~~~~~~~~~~~~~~~h~p~~~RvN~~L~n~~eF~~AF~C~~gs~Mnp~~kC~lw 693 (800)
.||+...++.....+..++|+|+++|||++|+|+|+|++||+||.||+|||++||.||
T Consensus 630 ~~C~~~~~~~~~~~~~~~~H~~~~~RVng~lsN~~eFa~aF~C~~gS~MnP~~kC~v~ 687 (687)
T KOG3624|consen 630 FFCSSEDPKKLPESLLTDPHSPESFRVNGVLSNSPEFAEAFNCPIGSPMNPEKKCKVW 687 (687)
T ss_pred HHhccCChhhhhHHhhcCCCCCcceeEeeeccCcHHHHHhcCCCCCCCCCccccCcCC
Confidence 9999998888888888899999999999999999999999999999999999999999
No 3
>PF05649 Peptidase_M13_N: Peptidase family M13 This is family M13 in the peptidase classification. ; InterPro: IPR008753 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M13 (neprilysin family, clan MA(E)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. M13 peptidases are well-studied proteases found in a wide range of organisms including mammals and bacteria. In mammals they participate in processes such as cardiovascular development, blood-pressure regulation, nervous control of respiration, and regulation of the function of neuropeptides in the central nervous system. In bacteria they may be used for digestion of milk [, ]. The family includes eukaryotic and prokaryotic oligopeptidases, as well as some of the proteins responsible for the molecular basis of the blood group antigens e.g. Kell []. Neprilysin (3.4.24.11 from EC), is another member of this group, it is variously known as common acute lymphoblastic leukemia antigen (CALLA), enkephalinase (gp100) and neutral endopeptidase metalloendopeptidase (NEP). It is a plasma membrane-bound mammalian enzyme that is able to digest biologically-active peptides, including enkephalins []. The zinc ligands of neprilysin are known and are analogous to those in thermolysin, a related peptidase [, ]. Neprilysins, like thermolysin, are inhibited by phosphoramidon, which appears to selectively inhibit this family in mammals. The enzymes are all oligopeptidases, digesting oligo- and polypeptides, but not proteins []. Neprilysin consists of a short cytoplasmic domain, a membrane-spanning region and a large extracellular domain. The cytoplasmic domain contains a conformationally-restrained octapeptide, which is thought to act as a stop transfer sequence that prevents proteolysis and secretion [, ].; GO: 0008237 metallopeptidase activity, 0006508 proteolysis; PDB: 3DWB_A 3ZUK_A 2QPJ_A 1R1I_A 1R1J_A 1Y8J_A 1R1H_A 1DMT_A 2YB9_A.
Probab=100.00 E-value=1.9e-59 Score=524.21 Aligned_cols=383 Identities=32% Similarity=0.627 Sum_probs=327.8
Q ss_pred CCchHHhhhcCcccCCCCCCCCCccchHHHHHHHHHHHHHHHhcCcCCC-----CCChHHHHHHHHHHHHhCcccccccc
Q psy9963 8 CDNFYKFACGNFLKTSVIPPDRASVSTFSKIEDRLSSQIKTVLEEPQSA-----SGPHIFNLMKKFYATCMDEDTLDKLG 82 (800)
Q Consensus 8 CdDFY~yaCG~w~~~~~ip~~~~~~~~~~~~~~~~~~~l~~lL~~~~~~-----~~~~~~~~~~~~Y~sC~~~~~~~~~~ 82 (800)
|||||+||||+|.+.|++|.+...+++|+.+++.+.+.++++|+++... ..++++++++.||++||+.++++..|
T Consensus 1 CdDFY~yvCg~w~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~~~~~~~~~~~~~~~~~k~~~~Y~sC~~~~~~~~~~ 80 (390)
T PF05649_consen 1 CDDFYQYVCGNWIKSHPIPSGQSSYSTFSDLQDNIDRQLKEILEKPSSSDTSPAEDSSAEQKAKALYDSCMDTDAREKDG 80 (390)
T ss_dssp TT-HHHHHHHHHHHH----TT-SEEEHHHHHHHHHHHHHHHHHSCCS--HCC-TTTTHHHHHHHHHHHHHH-HHHHHHHT
T ss_pred CccHHHHHhCcHHhhCCCCcCcccccHHHHHHHHhhhhhhhhhcccccccccccCCChHHHHHHHHHHHHHHhhhcchhh
Confidence 9999999999999999999999999999999999999999999997432 25899999999999999999999999
Q ss_pred cccHHHHHHHhCCCCCCCCCCCCCCcchHHHHHHHHHhcccCcccceeeeeccccCCCCCCCCCCCCCCCCCCCcceeee
Q psy9963 83 VQPMKVIIEQLGGLPVVDGDRWDSSRFSWINTTYKLRDLGFNHNLLINIVFCKQTQHPINTTQIPGAGPDFKNSTRRRLL 162 (800)
Q Consensus 83 ~~~l~~~l~~~g~wp~~~~~~~~~~~~~~~~~l~~l~~~~~g~~~l~~~~~~~~~~~~~~~~~~~~v~~d~~~~~~~~l~ 162 (800)
..+|++.|+++|+||..+. |..+.++|.++++.+.+. +|+++||++ .|.+|+.|++.++|+
T Consensus 81 ~~~l~~~l~~~~~~p~~~~--~~~~~~~~~~~l~~l~~~-~~~~~l~~~----------------~v~~d~~~~~~~~l~ 141 (390)
T PF05649_consen 81 LEPLKEFLRSIGGWPFLSD--WNESKFDLLDTLARLSRR-YGIDPLFSL----------------YVDPDPQNPSKYILY 141 (390)
T ss_dssp THHHHHHHHHCTCBCCCSS--HHTTCCHHHHHHHHHHHT-C---SSSEE----------------EEEEETTEEEEEEEE
T ss_pred hhhHHHHHHHhhhcccCCc--ccCCHhHHHHHHHHHHhh-ccccceeee----------------EeeccccchheeEee
Confidence 9999999999999999843 656788999999999998 899999999 999999999999999
Q ss_pred ecCCCCCchhh-hhcccc-cHHHHHHHHHHHHHHHHHcCCChhHHHHHHHHHHHHHHHHhccCCChhhccchhhhccccC
Q psy9963 163 IDQVDLKPLED-FLREGW-RNKNVRAYHSYMVKTAVLFGAPVSRAFRELKDSIDFEITMTKFIVPDQELRNLSYLYNLMS 240 (800)
Q Consensus 163 l~~~~l~l~~~-~~~~~~-~~~~~~~y~~~~~~~~~~~g~~~~~~~~~~~~v~~fE~~La~~~~~~~~~~~~~~~y~~~t 240 (800)
|++|+++++++ ++.++. .......|.+++++++..+|.+...+.+.+++|++||.+|+++....++++++...|+++|
T Consensus 142 i~~~~~~l~~~~~~~~~~~~~~~~~~y~~~v~~~l~~~g~~~~~~~~~~~~i~~~e~~L~~~~~~~~~~~~~~~~~~~~t 221 (390)
T PF05649_consen 142 IDPPELGLPSKEYYRDPHSSKEYLQAYREYVREVLKLLGVDDDEASKLADDIVKFEKQLAKLSSEDEERRDPQKLYNPMT 221 (390)
T ss_dssp EEE---SSSSGGGGCTCGGCHHHHHHHHHHHHHHHHHCSS-HHHHHHHHHHHHHHHHHHHHHS--GGGCT-HHHH-EEEE
T ss_pred cccCCCCCcchHHhhcchhhHHHHHHHHHHHHHHHhhccCChHHHHHHHHHHHHHHHHHHHHhhhhHhhccHHHhcCccc
Confidence 99999999876 554432 3678899999999999999998889999999999999999999988888888888899999
Q ss_pred HHHHhhhCCCCCccccccCCCCchhHHHHHHHhhCCCCCCCCCCeEEEeChhHHHHHHHHHhcCChhHHHHHHHHHHHHH
Q psy9963 241 VHTLNKYYPSIPCFYLEIDGPSLNKKLEFLNRIIGPKISIDSEELVMVTQPKYMSSIERLVQITPKRILANYMLMQTVLS 320 (800)
Q Consensus 241 l~eL~~~~p~i~w~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~V~v~~~~yl~~l~~ll~~t~~~~l~nYl~~~~~~~ 320 (800)
+++|++.+|.++| ..+|+.++++ ++.++.|+|.+|+||++|.++|.++++++|+||++|+++.+
T Consensus 222 l~eL~~~~p~i~w-------------~~~l~~~~~~---~~~~~~V~v~~~~y~~~l~~ll~~~~~~~l~nYl~~~~i~~ 285 (390)
T PF05649_consen 222 LSELQSQYPQIDW-------------DRYLKALFGE---VTESDEVIVRSPDYFKKLSELLKQTSKETLANYLGWRLIRE 285 (390)
T ss_dssp HHHHHHHSTTS-H-------------HHHHHHHCCC---CCTTSEEEES-HHHHHHHHHHHHHS-HHHHHHHHHHHHHHH
T ss_pred HHHHHHHHhhhhH-------------HHHHHHHccc---cCCCceEEEcCcHHHHHHHHHHHhCCcHHHHHHHHHHHHHH
Confidence 9999999999999 9999999976 66778999999999999999999999999999999999999
Q ss_pred hhhhcCHHHhhhhhhhhHhhcCCCCCchhhhhHHHHHHHHhhhhHhhhHHhhccChhhHHHHHHHHHHHHHHHHHhhhcC
Q psy9963 321 SSAYLCKSARRPYHDFKYVLTGVNSTRIRSQRCVDEVAEKFSLAVGSLYVRHFFDRTSKENALELVNNLRQEMDYILQNV 400 (800)
Q Consensus 321 ~~~~l~~~~~~~~~~~~~~~~g~~~~~~~~~~C~~~~~~~~p~~~~~ly~~~~~~~~~~~~v~~m~~~ik~~~~~~l~~~ 400 (800)
++++++.+++.....|...+.|..+..++|..|++.+...||++++++|++++++.+.++.|.+|+++||++|.++|+++
T Consensus 286 ~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~C~~~~~~~l~~~~~~~y~~~~~~~~~~~~v~~~~~~ik~~f~~~i~~~ 365 (390)
T PF05649_consen 286 LSPYLSSELRDLYFEFYRTLSGVEESRPRWQRCVRHVEKLLPFALGRLYVRRYFDKENKKEVEDMVENIKEAFRERIEES 365 (390)
T ss_dssp HGGGS-HHHHHHHHHHHTHHH--SS---HHHHHHHHHHHHHHHHHHHHHHHHH--CCHHHHHHHHHHHHHHHHHHHHCT-
T ss_pred HHhhccHHHHHHHHHHHHHhcCCccchhhHHHHHHHHHHHHHHHHHhhHhhhcCchhHHHHHHHHHHHHHHHHHHHHhhC
Confidence 99999999999999999999999988999999999999999999999999999999999999999999999999999999
Q ss_pred CCCCHHHHHHHHHHHhhcccccccC
Q psy9963 401 SWMDPQTKKNALEKSKVLKVHIGYP 425 (800)
Q Consensus 401 ~Wmd~~tk~~a~~Kl~~m~~~ig~p 425 (800)
+|||++||+.|++||++|+++||||
T Consensus 366 ~Wld~~tk~~a~~Kl~~m~~~iGyP 390 (390)
T PF05649_consen 366 DWLDEETKKEAIEKLNNMKLNIGYP 390 (390)
T ss_dssp TTS-HHHHHHHHHHHHH-EEEEES-
T ss_pred CCCCHHHHHHHHHHHHhhhhhccCC
Confidence 9999999999999999999999998
No 4
>PF01431 Peptidase_M13: Peptidase family M13 This is family M13 in the peptidase classification. ; InterPro: IPR018497 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M13 (neprilysin family, clan MA(E)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. M13 peptidases are well-studied proteases found in a wide range of organisms including mammals and bacteria. In mammals they participate in processes such as cardiovascular development, blood-pressure regulation, nervous control of respiration, and regulation of the function of neuropeptides in the central nervous system. In bacteria they may be used for digestion of milk [, ]. The family includes eukaryotic and prokaryotic oligopeptidases, as well as some of the proteins responsible for the molecular basis of the blood group antigens e.g. Kell []. Neprilysin (3.4.24.11 from EC), is another member of this group, it is variously known as common acute lymphoblastic leukemia antigen (CALLA), enkephalinase (gp100) and neutral endopeptidase metalloendopeptidase (NEP). It is a plasma membrane-bound mammalian enzyme that is able to digest biologically-active peptides, including enkephalins []. The zinc ligands of neprilysin are known and are analogous to those in thermolysin, a related peptidase [, ]. Neprilysins, like thermolysin, are inhibited by phosphoramidon, which appears to selectively inhibit this family in mammals. The enzymes are all oligopeptidases, digesting oligo- and polypeptides, but not proteins []. Neprilysin consists of a short cytoplasmic domain, a membrane-spanning region and a large extracellular domain. The cytoplasmic domain contains a conformationally-restrained octapeptide, which is thought to act as a stop transfer sequence that prevents proteolysis and secretion [, ].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QPJ_A 1R1I_A 1R1J_A 1Y8J_A 1R1H_A 1DMT_A 2YB9_A 3DWB_A 3ZUK_A.
Probab=100.00 E-value=1.1e-53 Score=432.73 Aligned_cols=204 Identities=47% Similarity=0.861 Sum_probs=166.5
Q ss_pred eeeeecCCCeeeecccccCCCccccccccccchhhHHHHHHHHHHhhccccCCccCCCCCccCCCCHHHHHHHHHHHHHH
Q psy9963 486 NAFNHIDKNAIEFPAGILQEPYFAADRPCFLNYGAIGTVIGHEITHTFDDEGSQYDKFGNVMNWWDDESKMRYNEKASCI 565 (800)
Q Consensus 486 ~a~Y~~~~N~i~vP~~~l~~P~~~~~~p~~~nyg~lG~~iahel~h~fD~~G~~~d~~G~~~~ww~~~~~~~f~~~~~Cl 565 (800)
||+|.+..|+|+||+++|++|||+.+.|.++|||+||++|||||+|+||..|+.||++|+.++||++.+...|.++.+|+
T Consensus 1 na~Y~~~~N~i~ip~~~l~~P~f~~~~p~~~~yg~lG~ilahel~hafd~~g~~~D~~g~~~~wWs~~~~~~~~~~~~Cl 80 (206)
T PF01431_consen 1 NAYYSPRFNSIVIPAGILQPPFFDPNYPPALNYGGLGFILAHELMHAFDPEGINYDEDGNLRNWWSEESRSNFKERAQCL 80 (206)
T ss_dssp --EEETTTTEEEEEGGGSSTTT--TTS-HHHHHHTHHHHHHHHHHHCTSTTGGGB-TTS-BS--S-HHHHHHHHHHHHHH
T ss_pred CCCCCcccCEEEecHHHhCCccCCCCCCHHHHHHHHHHHHHHHHHHHHHHhHhhcCcccccccccchhhHHHHHHHHHHH
Confidence 79999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHhccccccCCCc--ccCCccchhhhhhhccChHHHHHHHHHHHHhcCcccCCCCCCCCchhHHHHHHHHHHhhccCcH
Q psy9963 566 IKQYSNYVIEDIGF--RINGITTQGENIADNGGVKESYRAYLECAKQKNAEYKLPGLENFSDKQMFWISFANVWCTKYRK 643 (800)
Q Consensus 566 ~~qy~~~~~~~~~~--~~~g~~tl~E~iaD~~gl~~ay~ay~~~~~~~~~~~~lpgl~~~s~~QLFFi~~a~~~C~~~~~ 643 (800)
++||+++..+..+. .++|..|+.|+|||++||++||+||++ ....+.+||+++++|.+||||++||+.||+..++
T Consensus 81 ~~qy~~~~~~~~~~~~~~~~~~~~~E~iaD~~gl~~a~~ay~~---~~~~~~~l~~~~~~t~~QlFF~~~a~~~C~~~~~ 157 (206)
T PF01431_consen 81 RDQYSNYSVPEGNKSVSLNGSDTLNENIADNAGLRLAYRAYRK---KSSNEERLPGLEGLTPDQLFFISFAQSFCEKMSP 157 (206)
T ss_dssp HHHHHT-BCGGGTTCSB--TTTTHHHHHHHHHHHHHHHHHHHH---HHTTT-B-TTCTTB-HHHHHHHHHHHHT-EEE-H
T ss_pred HHHHhcccccccccccccchHHHHHHHHHHHHhHHHHHHHHhh---cccccccccccccchhHHHHHHHHHHHHhcCcch
Confidence 99999987765432 679999999999999999999999998 2334678999999999999999999999998887
Q ss_pred HHHHHhhccCCCCCCccEEEecccCchhHHhhcCCCCCCCCCccCceee
Q psy9963 644 EAMELLFKNDDHPPAEIRVNGPLSNSYDFARDFQCYSGSKMNPKHKCEV 692 (800)
Q Consensus 644 ~~~~~~~~~~~h~p~~~RvN~~L~n~~eF~~AF~C~~gs~Mnp~~kC~l 692 (800)
.....++..+.|+|+++|||+||+|+|+|++||+||.||+|||.+||++
T Consensus 158 ~~~~~~~~~~~h~p~~~RvN~~L~n~~~Fa~aF~C~~gs~Mnp~~kC~~ 206 (206)
T PF01431_consen 158 ESVYRQLQTDPHSPARFRVNGPLSNMPEFAEAFNCPSGSPMNPKKKCRC 206 (206)
T ss_dssp HHHHHHHHH-SS--HHHHHHHHHHT-HHHHHHHT--TTSTTS-SSGG--
T ss_pred hhhhhhhhcCCCCchHHHHHHHHhcCHHHHHHcCCCCCCCCCCCCCCcC
Confidence 7776677789999999999999999999999999999999999999985
No 5
>COG3590 PepO Predicted metalloendopeptidase [Posttranslational modification, protein turnover, chaperones]
Probab=99.68 E-value=3.4e-17 Score=176.08 Aligned_cols=96 Identities=32% Similarity=0.545 Sum_probs=91.0
Q ss_pred cccccccccccchHHHHHHHhhhhhhhhhhcccCChhhHHHHHHHHHHHHHHHHhhcccCCCCCHHHHHHHHHHHHhchh
Q psy9963 695 ESVNSTRIRSQRCVDEVAEKFSLAVGSLYVRHFFDRTSKENALELVNNLRQEMDYILQNVSWMDPQTKKNALEKSKVLKV 774 (800)
Q Consensus 695 ~~~~~~~~r~~~C~~~~~~~~~~~~g~~yv~~~f~~~~~~~~~~m~~~ik~~~~~~l~~~~Wmd~~Tr~~a~~Kl~~~~~ 774 (800)
+++++..+||++.+.+++..||.++|.+||++||++++|.+|++||.+++++|+.+|.+++||..+||++|++||++++.
T Consensus 316 sG~~E~r~rwKr~~~l~~~~~geaiG~~Yv~~~Fpp~aKa~m~~lVa~l~~ayk~Ri~~ldWmtpeTrekAl~KL~~ft~ 395 (654)
T COG3590 316 SGQPEARDRWKRAVNLAERLFGEAIGLLYVKRYFPPEAKADMEELVANLIKAYKARISKLDWMTPETREKALEKLNKFTA 395 (654)
T ss_pred cCChhhhHHHHHHHHHHHHHHHHHHHHHHHHHhCChHHHHHHHHHHHHHHHHHHHHhhhccccCHHHHHHHHHHHhcccc
Confidence 45678899999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred hhccCCccCCcchHHh
Q psy9963 775 HIGYPDELLDDDKIDL 790 (800)
Q Consensus 775 ~igyp~~~~~~~~l~~ 790 (800)
+||||+.+.+++.|..
T Consensus 396 kIGYP~~wr~y~kL~~ 411 (654)
T COG3590 396 KIGYPDPWRYYSKLEI 411 (654)
T ss_pred ccCCCchhhhhhhhcc
Confidence 9999998888777653
No 6
>KOG3624|consensus
Probab=99.47 E-value=4e-14 Score=168.83 Aligned_cols=97 Identities=37% Similarity=0.770 Sum_probs=94.1
Q ss_pred cccchHHHHHHHhhhhhhhhhhcccCChhhHHHHHHHHHHHHHHHHhhcccCCCCCHHHHHHHHHHHHhchhhhccCCcc
Q psy9963 703 RSQRCVDEVAEKFSLAVGSLYVRHFFDRTSKENALELVNNLRQEMDYILQNVSWMDPQTKKNALEKSKVLKVHIGYPDEL 782 (800)
Q Consensus 703 r~~~C~~~~~~~~~~~~g~~yv~~~f~~~~~~~~~~m~~~ik~~~~~~l~~~~Wmd~~Tr~~a~~Kl~~~~~~igyp~~~ 782 (800)
||..|++.+...||.++|++|++++|++++++.+.+|+++||++|.++|++++|||++||+.|++|+++|+.+||||+++
T Consensus 347 ~~~~Cv~~~~~~~p~a~~~l~~r~~~~~~~~~~~~~m~~~lk~~f~~~l~~~~Wl~~~tr~~a~~K~~~m~~~ig~p~~~ 426 (687)
T KOG3624|consen 347 RWADCVELVRTLLPLALGRLYVRNFFDKENKKEVSEMIEDLKKAFEEMLQELDWLDEETRKSAIKKLNAMKKNIGYPDEF 426 (687)
T ss_pred HHHHHHHHHHHhhhhhhHHHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHhCcccCHHHHHHHHHHHHHHHHHcCCHHHh
Confidence 89999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred CCcchHHhhcCCccccc
Q psy9963 783 LDDDKIDLHYKDLRIVT 799 (800)
Q Consensus 783 ~~~~~l~~~y~~l~~~~ 799 (800)
.++..|++.|.++.+++
T Consensus 427 ~~~~~ld~~y~~l~l~~ 443 (687)
T KOG3624|consen 427 LPPGTLDKEYENLKLSL 443 (687)
T ss_pred cCchhHHHHHhcccCCh
Confidence 99888999999987764
No 7
>PF05649 Peptidase_M13_N: Peptidase family M13 This is family M13 in the peptidase classification. ; InterPro: IPR008753 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M13 (neprilysin family, clan MA(E)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. M13 peptidases are well-studied proteases found in a wide range of organisms including mammals and bacteria. In mammals they participate in processes such as cardiovascular development, blood-pressure regulation, nervous control of respiration, and regulation of the function of neuropeptides in the central nervous system. In bacteria they may be used for digestion of milk [, ]. The family includes eukaryotic and prokaryotic oligopeptidases, as well as some of the proteins responsible for the molecular basis of the blood group antigens e.g. Kell []. Neprilysin (3.4.24.11 from EC), is another member of this group, it is variously known as common acute lymphoblastic leukemia antigen (CALLA), enkephalinase (gp100) and neutral endopeptidase metalloendopeptidase (NEP). It is a plasma membrane-bound mammalian enzyme that is able to digest biologically-active peptides, including enkephalins []. The zinc ligands of neprilysin are known and are analogous to those in thermolysin, a related peptidase [, ]. Neprilysins, like thermolysin, are inhibited by phosphoramidon, which appears to selectively inhibit this family in mammals. The enzymes are all oligopeptidases, digesting oligo- and polypeptides, but not proteins []. Neprilysin consists of a short cytoplasmic domain, a membrane-spanning region and a large extracellular domain. The cytoplasmic domain contains a conformationally-restrained octapeptide, which is thought to act as a stop transfer sequence that prevents proteolysis and secretion [, ].; GO: 0008237 metallopeptidase activity, 0006508 proteolysis; PDB: 3DWB_A 3ZUK_A 2QPJ_A 1R1I_A 1R1J_A 1Y8J_A 1R1H_A 1DMT_A 2YB9_A.
Probab=99.42 E-value=1.1e-13 Score=154.64 Aligned_cols=79 Identities=39% Similarity=0.780 Sum_probs=70.7
Q ss_pred cccccchHHHHHHHhhhhhhhhhhcccCChhhHHHHHHHHHHHHHHHHhhcccCCCCCHHHHHHHHHHHHhchhhhccC
Q psy9963 701 RIRSQRCVDEVAEKFSLAVGSLYVRHFFDRTSKENALELVNNLRQEMDYILQNVSWMDPQTKKNALEKSKVLKVHIGYP 779 (800)
Q Consensus 701 ~~r~~~C~~~~~~~~~~~~g~~yv~~~f~~~~~~~~~~m~~~ik~~~~~~l~~~~Wmd~~Tr~~a~~Kl~~~~~~igyp 779 (800)
.+||+.|+..+...||+++|++|++++|++++++.|++|+++||++|.++|++++|||++||+.|++||++|+.+||||
T Consensus 312 ~~~~~~C~~~~~~~l~~~~~~~y~~~~~~~~~~~~v~~~~~~ik~~f~~~i~~~~Wld~~tk~~a~~Kl~~m~~~iGyP 390 (390)
T PF05649_consen 312 RPRWQRCVRHVEKLLPFALGRLYVRRYFDKENKKEVEDMVENIKEAFRERIEESDWLDEETKKEAIEKLNNMKLNIGYP 390 (390)
T ss_dssp --HHHHHHHHHHHHHHHHHHHHHHHHH--CCHHHHHHHHHHHHHHHHHHHHCT-TTS-HHHHHHHHHHHHH-EEEEES-
T ss_pred hhhHHHHHHHHHHHHHHHHHhhHhhhcCchhHHHHHHHHHHHHHHHHHHHHhhCCCCCHHHHHHHHHHHHhhhhhccCC
Confidence 4599999999999999999999999999999999999999999999999999999999999999999999999999999
No 8
>PF14247 DUF4344: Domain of unknown function (DUF4344)
Probab=79.93 E-value=0.84 Score=46.14 Aligned_cols=51 Identities=24% Similarity=0.264 Sum_probs=35.3
Q ss_pred eeeeeeecCCCeeeecccccCCC--cc-ccccccc----cchhhHHHHHHHHHHhhcc
Q psy9963 484 TVNAFNHIDKNAIEFPAGILQEP--YF-AADRPCF----LNYGAIGTVIGHEITHTFD 534 (800)
Q Consensus 484 ~v~a~Y~~~~N~i~vP~~~l~~P--~~-~~~~p~~----~nyg~lG~~iahel~h~fD 534 (800)
..||||+|..++|+||..++..- .| ..+.|.- .-.|.+=+++.||+.|++-
T Consensus 48 e~nA~ydPe~~~I~iCYEf~~~~~~~f~~~~~~~~~~~~~~~~~~~~~l~HE~GHAlI 105 (220)
T PF14247_consen 48 EDNAFYDPENRSITICYEFVDEILDRFAKANDPDEEYGQAAIGNVLFTLYHELGHALI 105 (220)
T ss_pred CCCCccCCCCCEEEECHHHHHHHHHHHHhCCcCcchHHHHHHHHHHHHHHHHHHHHHH
Confidence 46899999999999999886631 11 1121111 1356677889999999983
No 9
>KOG0506|consensus
Probab=60.00 E-value=6.9 Score=43.15 Aligned_cols=165 Identities=16% Similarity=0.300 Sum_probs=90.7
Q ss_pred hhcccccccCCCCCChhhHHHhcccce-ecCCChHHHHHHHHHHHHHHHHHHhcCCCCCCCCCcCCCcceeeeeeecCCC
Q psy9963 416 KVLKVHIGYPDELLDDDKIDLHYKDLR-IVPGDLLTSVLNLYKKSVDYAYTQLHEPINKTSWKEFGSVATVNAFNHIDKN 494 (800)
Q Consensus 416 ~~m~~~ig~p~~~~~~~~l~~~y~~l~-~~~~~~~~n~~~~~~~~~~~~~~~l~~p~~~~~w~~~~~p~~v~a~Y~~~~N 494 (800)
++++.++-.|+|+.=...++..|+..+ ..++. +..++ -+|.+. ....|.++ .-+|..- +.
T Consensus 165 qALrkqmVIPdw~~Fts~I~tIFEscke~seG~-vA~YI-----------PQLar~-sPdlW~vS--vCTvDGQ----R~ 225 (622)
T KOG0506|consen 165 QALRKQMVIPDWEEFTSHIDTIFESCKESSEGK-VATYI-----------PQLARQ-SPDLWGVS--VCTVDGQ----RH 225 (622)
T ss_pred HHHhcCccCCcHHHHHHHHHHHHHHHHhcCCcc-HHHhh-----------HHHhcc-CCccceeE--EeeecCc----cc
Confidence 345556777888665566776676655 33332 12222 222221 24578876 4444322 11
Q ss_pred eeeecccccCCCccccccccccchhhHHHHHHHHHHhhc---cccCCccC-----CCCCccCCCCH----------HHHH
Q psy9963 495 AIEFPAGILQEPYFAADRPCFLNYGAIGTVIGHEITHTF---DDEGSQYD-----KFGNVMNWWDD----------ESKM 556 (800)
Q Consensus 495 ~i~vP~~~l~~P~~~~~~p~~~nyg~lG~~iahel~h~f---D~~G~~~d-----~~G~~~~ww~~----------~~~~ 556 (800)
..|----||-......++||+..-.-||.|..|.+ .+.|+.|+ +++...|-.-+ ....
T Consensus 226 ----s~Gd~k~pFClQSvsK~f~YAiv~sdlG~~~vH~yVG~EPSGrlfN~i~Ld~~~kPHNPmvNaGAIVvtSliK~~~ 301 (622)
T KOG0506|consen 226 ----SLGDTKKPFCLQSVSKPFNYAIVASDLGTEVVHQYVGQEPSGRLFNEICLDSNNKPHNPMVNAGAIVVTSLIKPKH 301 (622)
T ss_pred ----cccccCCchhHhhhcccceeeeeeccccHHHHHHHhcCCCccchhhhhhccCCCCCCCcccccccEEehhhhccCC
Confidence 23444457877788889999999999999999988 56788874 45544432211 1112
Q ss_pred HHHHHHHHHHHHhccccccCCCcccCCccchhhhhhhccChHHHHHHHHH
Q psy9963 557 RYNEKASCIIKQYSNYVIEDIGFRINGITTQGENIADNGGVKESYRAYLE 606 (800)
Q Consensus 557 ~f~~~~~Cl~~qy~~~~~~~~~~~~~g~~tl~E~iaD~~gl~~ay~ay~~ 606 (800)
...+|-.-+.+||+++.-.+. +-++....+.|. +.+-=.-|..-|.+
T Consensus 302 n~AdrFDfvl~~~rK~aGn~y-vGF~NatFlSEr--etaDRNyAlsYymk 348 (622)
T KOG0506|consen 302 NMADRFDFVLQQYRKMAGNEY-VGFDNATFLSER--ETADRNYALSYYMK 348 (622)
T ss_pred ChHHHHHHHHHHHHHhhCCCc-ccccchhhhcch--hcccchhHHHHHHH
Confidence 334566667778877643220 112444455553 22223345454554
No 10
>PF09768 Peptidase_M76: Peptidase M76 family; InterPro: IPR019165 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. Mitochondrial inner membrane protease ATP23 has two roles in the assembly of mitochondrial ATPase. Firstly, it acts as a protease that removes the N-terminal 10 residues of mitochondrial ATPase CF(0) subunit 6 (ATP6) at the intermembrane space side. Secondly, it is involved in the correct assembly of the membrane-embedded ATPase CF(0) particle, probably mediating association of ATP6 with the subunit 9 ring [, ].; GO: 0004222 metalloendopeptidase activity
Probab=59.19 E-value=7.9 Score=37.63 Aligned_cols=41 Identities=20% Similarity=0.262 Sum_probs=32.1
Q ss_pred eeeeeecCCCeeeecccccCCCccccccccccchhhHHHHHHHHHHhhccccC
Q psy9963 485 VNAFNHIDKNAIEFPAGILQEPYFAADRPCFLNYGAIGTVIGHEITHTFDDEG 537 (800)
Q Consensus 485 v~a~Y~~~~N~i~vP~~~l~~P~~~~~~p~~~nyg~lG~~iahel~h~fD~~G 537 (800)
+.+.|+|..+.|+|=..-+ .+.+-+-.+|+|||+||||.--
T Consensus 47 ~~Ggf~p~~~~I~lC~N~~------------~~~~~l~~~l~HELIHayD~cr 87 (173)
T PF09768_consen 47 VSGGFDPSKKGIVLCQNRI------------RSQGHLEDTLTHELIHAYDHCR 87 (173)
T ss_pred CcCCccCCCCCEEEeeCCC------------CCHHHHHHHHHHHHHHHHHHHh
Confidence 5577888888887755444 4667899999999999999753
No 11
>PF04228 Zn_peptidase: Putative neutral zinc metallopeptidase; InterPro: IPR007343 Members of this family of bacterial proteins are described as hypothetical proteins or zinc metallopeptidases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=57.96 E-value=4.1 Score=43.21 Aligned_cols=84 Identities=18% Similarity=0.223 Sum_probs=46.9
Q ss_pred ceeeeeeecCCCeeeecccccCC--CccccccccccchhhHHHHHHHHHHhhcccc-CCccCCCCCccCCCC----HHHH
Q psy9963 483 ATVNAFNHIDKNAIEFPAGILQE--PYFAADRPCFLNYGAIGTVIGHEITHTFDDE-GSQYDKFGNVMNWWD----DESK 555 (800)
Q Consensus 483 ~~v~a~Y~~~~N~i~vP~~~l~~--P~~~~~~p~~~nyg~lG~~iahel~h~fD~~-G~~~d~~G~~~~ww~----~~~~ 555 (800)
..++|||.|..++|+++..++.. --|....+ -..-.+||||..|++-.. |..- .....+.-.+ ....
T Consensus 135 s~~gpFYCp~D~tIYlD~~f~~~L~~~~ga~G~-----~a~ayVlAHEyGHHVQ~l~Gil~-~~~~~~~~~~~~~~~~~s 208 (292)
T PF04228_consen 135 SATGPFYCPADQTIYLDLSFFDELQQRFGASGD-----FAQAYVLAHEYGHHVQNLLGILD-AVRQAQQGRSPAEANELS 208 (292)
T ss_pred CCCCCEeCCCCCEEEechHHHHHHHHHhCCccH-----HHHHHHHHHHHHHHHHHHhhhHH-HHHHHhhccCccccchHH
Confidence 34679999999999999876431 01111111 134568999999998543 3221 0000000001 1223
Q ss_pred HHHHHHHHHHHHHhccc
Q psy9963 556 MRYNEKASCIIKQYSNY 572 (800)
Q Consensus 556 ~~f~~~~~Cl~~qy~~~ 572 (800)
.+.+-.++||.--|-..
T Consensus 209 vr~ELQADC~AGvw~~~ 225 (292)
T PF04228_consen 209 VRLELQADCFAGVWAGH 225 (292)
T ss_pred HHHHHHHHHHHHHHhhh
Confidence 45677799998766543
No 12
>PF01421 Reprolysin: Reprolysin (M12B) family zinc metalloprotease This Prosite motif covers only the active site.; InterPro: IPR001590 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12B (adamalysin family, clan (MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The adamalysins are zinc dependent endopeptidases found in snake venom. There are some mammalian proteins such as P78325 from SWISSPROT, and fertilin Q28472 from SWISSPROT. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes. CD156 (also called ADAM8 (3.4.24 from EC) or MS2 human) has been implicated in extravasation of leukocytes. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2E3X_A 2W15_A 2W14_A 2W13_A 2W12_A 1ND1_A 3K7L_A 2DW2_A 2DW0_B 2DW1_A ....
Probab=57.14 E-value=13 Score=37.15 Aligned_cols=18 Identities=22% Similarity=0.377 Sum_probs=15.7
Q ss_pred cchhhHHHHHHHHHHhhc
Q psy9963 516 LNYGAIGTVIGHEITHTF 533 (800)
Q Consensus 516 ~nyg~lG~~iahel~h~f 533 (800)
..+..+|.+|||||.|.+
T Consensus 126 ~~~~~~a~~~AHelGH~l 143 (199)
T PF01421_consen 126 RSGLSFAVIIAHELGHNL 143 (199)
T ss_dssp SSHHHHHHHHHHHHHHHT
T ss_pred chhHHHHHHHHHHHHHhc
Confidence 456788999999999998
No 13
>KOG2661|consensus
Probab=42.81 E-value=26 Score=37.11 Aligned_cols=23 Identities=30% Similarity=0.511 Sum_probs=18.3
Q ss_pred ccccchhhHHHHHHHHHHhhccc
Q psy9963 513 PCFLNYGAIGTVIGHEITHTFDD 535 (800)
Q Consensus 513 p~~~nyg~lG~~iahel~h~fD~ 535 (800)
|.--+-.++.++||||+.|+.-.
T Consensus 267 n~ck~ddglAtvLgHE~aHaVar 289 (424)
T KOG2661|consen 267 NSCKDDDGLATVLGHEIAHAVAR 289 (424)
T ss_pred hcccChHHHHHHHHHHHHHHHHH
Confidence 44556678999999999999843
No 14
>cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods.
Probab=42.04 E-value=33 Score=34.81 Aligned_cols=12 Identities=25% Similarity=0.443 Sum_probs=11.4
Q ss_pred HHHHHHHHHhhc
Q psy9963 522 GTVIGHEITHTF 533 (800)
Q Consensus 522 G~~iahel~h~f 533 (800)
..++||||.|.|
T Consensus 146 ~~~~AHElGH~l 157 (220)
T cd04272 146 VYTMTHELAHLL 157 (220)
T ss_pred HHHHHHHHHHHh
Confidence 699999999998
No 15
>cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha.
Probab=41.79 E-value=18 Score=37.52 Aligned_cols=21 Identities=38% Similarity=0.494 Sum_probs=16.1
Q ss_pred hHHHHHHHHHHhhccccCCccCCC
Q psy9963 520 AIGTVIGHEITHTFDDEGSQYDKF 543 (800)
Q Consensus 520 ~lG~~iahel~h~fD~~G~~~d~~ 543 (800)
....++||||.|.| |...|..
T Consensus 166 ~~a~t~AHElGHnl---Gm~HD~~ 186 (244)
T cd04270 166 ESDLVTAHELGHNF---GSPHDPD 186 (244)
T ss_pred HHHHHHHHHHHHhc---CCCCCCC
Confidence 37789999999999 5555544
No 16
>cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=40.46 E-value=27 Score=34.59 Aligned_cols=24 Identities=25% Similarity=0.379 Sum_probs=18.1
Q ss_pred chhhHHHHHHHHHHhhccccCCccCCC
Q psy9963 517 NYGAIGTVIGHEITHTFDDEGSQYDKF 543 (800)
Q Consensus 517 nyg~lG~~iahel~h~fD~~G~~~d~~ 543 (800)
++...+.++||||.|.| |...|..
T Consensus 127 ~~~~~a~~~AHElGH~l---G~~HD~~ 150 (194)
T cd04269 127 NLLLFAVTMAHELGHNL---GMEHDDG 150 (194)
T ss_pred chHHHHHHHHHHHHhhc---CCCcCCC
Confidence 35567899999999998 5555544
No 17
>KOG3314|consensus
Probab=36.30 E-value=21 Score=33.76 Aligned_cols=21 Identities=33% Similarity=0.607 Sum_probs=16.7
Q ss_pred hhHHHHHHHHHHhhccccCCc
Q psy9963 519 GAIGTVIGHEITHTFDDEGSQ 539 (800)
Q Consensus 519 g~lG~~iahel~h~fD~~G~~ 539 (800)
.-+-.++.|||+|+||+--..
T Consensus 89 ~h~n~vv~HElIH~fDd~r~~ 109 (194)
T KOG3314|consen 89 DHVNQVVIHELIHAFDDCRAK 109 (194)
T ss_pred HHHHHHHHHHHHHHHHhhhhh
Confidence 347789999999999986443
No 18
>PF01447 Peptidase_M4: Thermolysin metallopeptidase, catalytic domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification. ; InterPro: IPR013856 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases that belong to the MEROPS peptidase family M4 (thermolysin family, clan MA(E)). The protein fold of the peptidase domain of thermolysin, is the type example for members of the clan MA. The thermolysin family is composed only of secreted eubacterial endopeptidases. The zinc-binding residues are H-142, H-146 and E-166, with E-143 acting as the catalytic residue. Thermolysin also contains 4 calcium-binding sites, which contribute to its unusual thermostability. The family also includes enzymes from a number of pathogens, including Legionella and Listeria, and the protein pseudolysin, all with a substrate specificity for an aromatic residue in the P1' position. Three-dimensional structure analysis has shown that the enzymes undergo a hinge-bend motion during catalysis. Pseudolysin has a broader specificity, acting on large molecules such as elastin and collagen, possibly due to its wider active site cleft []. This entry represents a domain found in peptidase M4 family members.; GO: 0004222 metalloendopeptidase activity; PDB: 3NQX_A 3NQZ_B 3NQY_B 1BQB_A 1U4G_A 1EZM_A 3DBK_A 1ESP_A 1NPC_A 1LND_E ....
Probab=36.03 E-value=18 Score=34.31 Aligned_cols=12 Identities=42% Similarity=0.747 Sum_probs=10.3
Q ss_pred HHHHHHHHhhcc
Q psy9963 523 TVIGHEITHTFD 534 (800)
Q Consensus 523 ~~iahel~h~fD 534 (800)
=++||||+||+-
T Consensus 137 DVvaHEltHGVt 148 (150)
T PF01447_consen 137 DVVAHELTHGVT 148 (150)
T ss_dssp HHHHHHHHHHHH
T ss_pred ceeeeccccccc
Confidence 489999999973
No 19
>cd04273 ZnMc_ADAMTS_like Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
Probab=34.89 E-value=17 Score=36.48 Aligned_cols=20 Identities=30% Similarity=0.371 Sum_probs=14.8
Q ss_pred HHHHHHHHHHhhccccCCccCCC
Q psy9963 521 IGTVIGHEITHTFDDEGSQYDKF 543 (800)
Q Consensus 521 lG~~iahel~h~fD~~G~~~d~~ 543 (800)
.+.++||||.|.+ |...|..
T Consensus 140 ~a~~~aHElGH~L---G~~HD~~ 159 (207)
T cd04273 140 SAFTIAHELGHVL---GMPHDGD 159 (207)
T ss_pred eEEeeeeechhhc---CCCCCCC
Confidence 4689999999998 4444444
No 20
>PF03002 Somatostatin: Somatostatin/Cortistatin family; InterPro: IPR018142 Somatostatin inhibits the release of the pituitary growth hormone, somatotropin and inhibits the release of glucagon and insulin from the pancreas of fasted animals. Cortistatin is a cortical neuropeptide with neuronal depressant and sleep-modulating properties [].; GO: 0005179 hormone activity, 0005576 extracellular region
Probab=34.54 E-value=13 Score=21.34 Aligned_cols=9 Identities=33% Similarity=0.866 Sum_probs=7.0
Q ss_pred CCCCCCchH
Q psy9963 4 GVQPCDNFY 12 (800)
Q Consensus 4 svdPCdDFY 12 (800)
...||.|||
T Consensus 3 ~k~~Cknff 11 (18)
T PF03002_consen 3 RKAGCKNFF 11 (18)
T ss_pred cccccccee
Confidence 457899997
No 21
>PF10871 DUF2748: Protein of unknown function (DUF2748); InterPro: IPR020183 This entry represents proteins that are mainly confined to Rickettsia and Orientia. The proteins, which include RP364 and RC0048, are currently uncharacterised.
Probab=33.92 E-value=3.8e+02 Score=28.12 Aligned_cols=141 Identities=16% Similarity=0.179 Sum_probs=81.4
Q ss_pred hhHHhhccChhhHHHHHHHHHHHHHHHHHhhhcCCCCCHHHHHHHHHHHhhcccccccCC---CCCChhhHHHhccccee
Q psy9963 367 SLYVRHFFDRTSKENALELVNNLRQEMDYILQNVSWMDPQTKKNALEKSKVLKVHIGYPD---ELLDDDKIDLHYKDLRI 443 (800)
Q Consensus 367 ~ly~~~~~~~~~~~~v~~m~~~ik~~~~~~l~~~~Wmd~~tk~~a~~Kl~~m~~~ig~p~---~~~~~~~l~~~y~~l~~ 443 (800)
.+|-...-+....+.+..+|++++....... ..++.+.++|..+=++-.-|= |++ |++. ++=+
T Consensus 80 nlYkn~~~~dkvi~Ki~si~d~LkKqi~~~~--------~Vkkev~~~LARlFVQSAHPIVI~WLL----L~kt--EvFi 145 (447)
T PF10871_consen 80 NLYKNSASDDKVIQKINSIFDNLKKQIQKLQ--------PVKKEVTEMLARLFVQSAHPIVIRWLL----LNKT--EVFI 145 (447)
T ss_pred HHHHhhhhhhhHHHHHHHHHHHHHHHHHHhh--------hhHHHHHHHHHHHHHhccCcceeeeeh----hcce--eEEE
Confidence 3443333333445778889999998887644 678889999999977777663 532 1110 1112
Q ss_pred cCCChHHHHHHHHHHHHHHHHHHhcCCCCCCCCCcCCCcceeeeeeecCCCeeeecccccCCCccccccccccchhhHHH
Q psy9963 444 VPGDLLTSVLNLYKKSVDYAYTQLHEPINKTSWKEFGSVATVNAFNHIDKNAIEFPAGILQEPYFAADRPCFLNYGAIGT 523 (800)
Q Consensus 444 ~~~~~~~n~~~~~~~~~~~~~~~l~~p~~~~~w~~~~~p~~v~a~Y~~~~N~i~vP~~~l~~P~~~~~~p~~~nyg~lG~ 523 (800)
+-+.-.+.++.+..|+. .++...-. ... .-+|--|-+...|-.++.-..-.-|.|..++|.. +++-.
T Consensus 146 tYS~nIGDmMDiv~Wq~------vG~NSGMQ--S~N--GkdvAIfVSCGGNPffFae~~ke~~~YG~G~~A~---ARLQI 212 (447)
T PF10871_consen 146 TYSHNIGDMMDIVSWQR------VGGNSGMQ--STN--GKDVAIFVSCGGNPFFFAENNKEHPIYGDGFAAV---ARLQI 212 (447)
T ss_pred EeccchhHHHHHHHHHH------hcCcCccc--ccC--CCcEEEEEecCCCchhhhhccccCCCcCCcHHHH---HHHHH
Confidence 22334556777766543 23211000 000 1233334455555555555555567787777654 67888
Q ss_pred HHHHHHHhhcc
Q psy9963 524 VIGHEITHTFD 534 (800)
Q Consensus 524 ~iahel~h~fD 534 (800)
+-|.||+|--|
T Consensus 213 IAAQELGHfAD 223 (447)
T PF10871_consen 213 IAAQELGHFAD 223 (447)
T ss_pred HHHHhhhhhhh
Confidence 88999999765
No 22
>PF12315 DUF3633: Protein of unknown function (DUF3633); InterPro: IPR022087 This domain family is found in bacteria and eukaryotes, and is approximately 210 amino acids in length. The family is found in association with PF00412 from PFAM.
Probab=31.31 E-value=27 Score=34.69 Aligned_cols=22 Identities=27% Similarity=0.587 Sum_probs=16.7
Q ss_pred cccccccchhhHHHHHHHHHHhhccc
Q psy9963 510 ADRPCFLNYGAIGTVIGHEITHTFDD 535 (800)
Q Consensus 510 ~~~p~~~nyg~lG~~iahel~h~fD~ 535 (800)
.++|..+ .|.++|||++||+-.
T Consensus 86 ~GLPrll----~gsiLAHE~mHa~Lr 107 (212)
T PF12315_consen 86 YGLPRLL----TGSILAHELMHAWLR 107 (212)
T ss_pred CCCCHHH----HhhHHHHHHHHHHhc
Confidence 3555543 799999999999953
No 23
>cd04267 ZnMc_ADAM_like Zinc-dependent metalloprotease, ADAM_like or reprolysin_like subgroup. The adamalysin_like or ADAM family of metalloproteases contains proteolytic domains from snake venoms, proteases from the mammalian reproductive tract, and the tumor necrosis factor alpha convertase, TACE. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=26.27 E-value=40 Score=33.30 Aligned_cols=22 Identities=23% Similarity=0.326 Sum_probs=16.6
Q ss_pred hhHHHHHHHHHHhhccccCCccCCC
Q psy9963 519 GAIGTVIGHEITHTFDDEGSQYDKF 543 (800)
Q Consensus 519 g~lG~~iahel~h~fD~~G~~~d~~ 543 (800)
...|.++||||.|.| |...|..
T Consensus 131 ~~~~~~~aHElGH~l---G~~HD~~ 152 (192)
T cd04267 131 LLTALTMAHELGHNL---GAEHDGG 152 (192)
T ss_pred eeehhhhhhhHHhhc---CCcCCCC
Confidence 356889999999998 5555543
No 24
>KOG1047|consensus
Probab=24.35 E-value=44 Score=38.09 Aligned_cols=38 Identities=34% Similarity=0.638 Sum_probs=26.5
Q ss_pred hHHHHHHHHHHhhccccCCccCCCCCc---cCC---CCHHHHHHHHHHHHHHH
Q psy9963 520 AIGTVIGHEITHTFDDEGSQYDKFGNV---MNW---WDDESKMRYNEKASCII 566 (800)
Q Consensus 520 ~lG~~iahel~h~fD~~G~~~d~~G~~---~~w---w~~~~~~~f~~~~~Cl~ 566 (800)
.++.+|||||.|+-- ||+ .+| |-++-.+-|.+|..|-+
T Consensus 287 sl~~vIaHEIAHSWt---------GNlVTN~sWehfWLNEGfTvylErrI~g~ 330 (613)
T KOG1047|consen 287 SLVDVIAHEIAHSWT---------GNLVTNASWEHFWLNEGFTVYLERRIVGR 330 (613)
T ss_pred chhhHHHHHhhhhhc---------ccccccCccchhhhcccchhhhhhhhhhh
Confidence 379999999999863 443 233 77777777877766643
No 25
>PF13069 DUF3933: Protein of unknown function (DUF3933)
Probab=24.05 E-value=60 Score=23.48 Aligned_cols=19 Identities=37% Similarity=0.658 Sum_probs=16.1
Q ss_pred HHHHHHHHHHHHhchhhhc
Q psy9963 759 PQTKKNALEKSKVLKVHIG 777 (800)
Q Consensus 759 ~~Tr~~a~~Kl~~~~~~ig 777 (800)
.+||+.|++|.+-+....|
T Consensus 20 aetkqeaiekaellglrtg 38 (53)
T PF13069_consen 20 AETKQEAIEKAELLGLRTG 38 (53)
T ss_pred HhhHHHHHHHHHHhccccC
Confidence 5899999999998877665
No 26
>PF13485 Peptidase_MA_2: Peptidase MA superfamily
Probab=21.80 E-value=1e+02 Score=27.38 Aligned_cols=53 Identities=15% Similarity=0.162 Sum_probs=30.3
Q ss_pred eeeecCCCeeeecccccCCCccccccccccchhhHHHHHHHHHHhhccccCCccCCCCCccCCCCHHH
Q psy9963 487 AFNHIDKNAIEFPAGILQEPYFAADRPCFLNYGAIGTVIGHEITHTFDDEGSQYDKFGNVMNWWDDES 554 (800)
Q Consensus 487 a~Y~~~~N~i~vP~~~l~~P~~~~~~p~~~nyg~lG~~iahel~h~fD~~G~~~d~~G~~~~ww~~~~ 554 (800)
.++.+..|.|+++... ...-.+..+|+||++|.+-..- .+.......||++..
T Consensus 4 G~~~~~~~~i~~~~~~-------------~~~~~~~~~l~HE~~H~~~~~~--~~~~~~~~~W~~EG~ 56 (128)
T PF13485_consen 4 GVYYPGFNRIVVYFQG-------------SDEDWLDRVLAHELAHQWFGNY--FGGDDNAPRWFNEGL 56 (128)
T ss_pred EEEecCCCEEEEecCC-------------CCHHHHHHHHHHHHHHHHHHHH--cCCCccCchHHHHHH
Confidence 4677777888754311 1222355899999999984332 222233445665543
Done!