Query psy1391
Match_columns 614
No_of_seqs 343 out of 2014
Neff 7.8
Searched_HMMs 46136
Date Fri Aug 16 23:35:35 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy1391.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/1391hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG0045|consensus 100.0 3E-87 6.5E-92 738.2 27.6 489 5-581 72-607 (612)
2 smart00230 CysPc Calpain-like 100.0 2.9E-68 6.2E-73 555.0 22.8 257 4-289 55-314 (318)
3 PF00648 Peptidase_C2: Calpain 100.0 2.2E-65 4.8E-70 531.8 20.4 254 5-285 43-298 (298)
4 cd00044 CysPc Calpains, domain 100.0 1.1E-62 2.4E-67 514.4 21.2 247 5-283 64-315 (315)
5 cd00214 Calpain_III Calpain, s 100.0 1.8E-29 4E-34 235.3 14.2 135 300-435 2-150 (150)
6 smart00720 calpain_III calpain 99.9 1.4E-27 3E-32 221.2 13.3 130 301-432 1-142 (143)
7 PF01067 Calpain_III: Calpain 99.9 2.6E-25 5.7E-30 207.0 13.5 133 300-433 1-147 (147)
8 KOG0037|consensus 99.7 3.9E-19 8.5E-24 168.4 -5.9 125 457-597 55-211 (221)
9 KOG0045|consensus 99.2 4.1E-11 9E-16 133.9 7.7 120 489-613 241-366 (612)
10 smart00230 CysPc Calpain-like 98.8 9.7E-08 2.1E-12 100.2 15.4 87 492-580 225-314 (318)
11 PF00648 Peptidase_C2: Calpain 98.4 5.4E-07 1.2E-11 93.9 8.8 84 490-575 209-297 (298)
12 cd00044 CysPc Calpains, domain 98.4 2.8E-06 6.1E-11 89.2 12.2 64 508-574 251-315 (315)
13 PTZ00184 calmodulin; Provision 96.2 0.00043 9.2E-09 63.6 -3.0 82 457-550 9-102 (149)
14 PTZ00183 centrin; Provisional 95.7 0.001 2.2E-08 61.9 -3.1 82 457-550 15-108 (158)
15 KOG0027|consensus 95.5 0.0017 3.7E-08 60.5 -2.1 100 457-568 6-148 (151)
16 KOG0037|consensus 94.7 0.0028 6E-08 61.3 -3.4 90 459-567 124-218 (221)
17 COG5126 FRQ1 Ca2+-binding prot 92.8 0.012 2.6E-07 54.9 -2.7 99 457-568 18-155 (160)
18 PF13499 EF-hand_7: EF-hand do 91.7 0.046 1E-06 42.9 -0.2 29 460-488 1-30 (66)
19 PF00036 EF-hand_1: EF hand; 91.1 0.05 1.1E-06 35.6 -0.5 28 534-570 2-29 (29)
20 cd05026 S-100Z S-100Z: S-100Z 90.4 0.093 2E-06 44.6 0.4 74 458-573 9-85 (93)
21 PF13405 EF-hand_6: EF-hand do 90.2 0.29 6.3E-06 32.3 2.6 26 460-485 1-27 (31)
22 PF13202 EF-hand_5: EF hand; P 89.9 0.32 6.9E-06 30.6 2.4 24 461-484 1-25 (25)
23 cd05023 S-100A11 S-100A11: S-1 89.0 0.15 3.3E-06 43.0 0.6 75 457-573 7-84 (89)
24 cd05024 S-100A10 S-100A10: A s 88.6 0.057 1.2E-06 45.5 -2.2 75 458-574 7-81 (91)
25 cd05022 S-100A13 S-100A13: S-1 86.9 0.15 3.3E-06 43.0 -0.6 31 458-488 7-39 (89)
26 PF00036 EF-hand_1: EF hand; 85.5 0.91 2E-05 29.6 2.6 27 460-486 1-28 (29)
27 KOG0027|consensus 84.0 0.24 5.2E-06 46.0 -0.9 54 457-522 83-137 (151)
28 COG5126 FRQ1 Ca2+-binding prot 80.3 0.36 7.8E-06 45.2 -1.1 62 457-530 90-156 (160)
29 cd02619 Peptidase_C1 C1 Peptid 79.3 5.6 0.00012 38.8 6.9 51 195-276 168-218 (223)
30 cd05029 S-100A6 S-100A6: S-100 77.8 0.57 1.2E-05 39.4 -0.6 30 457-486 8-40 (88)
31 KOG0034|consensus 77.7 0.66 1.4E-05 44.8 -0.3 98 457-570 31-176 (187)
32 cd05027 S-100B S-100B: S-100B 74.2 0.76 1.7E-05 38.6 -0.8 31 458-488 7-40 (88)
33 smart00027 EH Eps15 homology d 74.0 0.18 3.8E-06 43.0 -4.8 73 457-578 8-81 (96)
34 cd00052 EH Eps15 homology doma 73.8 0.28 6.2E-06 38.1 -3.3 25 462-486 2-27 (67)
35 cd05030 calgranulins Calgranul 73.5 0.77 1.7E-05 38.5 -0.9 72 457-570 6-80 (88)
36 KOG0036|consensus 73.5 0.64 1.4E-05 49.3 -1.7 86 454-550 9-100 (463)
37 PF01023 S_100: S-100/ICaBP ty 70.3 5.8 0.00013 28.7 3.2 33 458-490 5-40 (44)
38 PF13833 EF-hand_8: EF-hand do 70.1 0.24 5.2E-06 37.1 -4.3 15 471-485 1-15 (54)
39 cd05031 S-100A10_like S-100A10 64.0 1.1 2.5E-05 37.9 -1.8 30 457-486 6-38 (94)
40 COG4870 Cysteine protease [Pos 64.0 6.4 0.00014 41.5 3.3 53 195-275 261-318 (372)
41 smart00054 EFh EF-hand, calciu 58.2 12 0.00027 22.3 2.8 26 461-486 2-28 (29)
42 KOG0044|consensus 57.3 1.8 3.9E-05 41.9 -2.0 80 458-550 25-118 (193)
43 PF05543 Peptidase_C47: Stapho 51.8 58 0.0013 30.9 7.1 32 194-229 115-146 (175)
44 cd05025 S-100A1 S-100A1: S-100 50.1 3.1 6.8E-05 34.9 -1.4 73 458-572 8-83 (92)
45 cd02248 Peptidase_C1A Peptidas 48.9 58 0.0013 31.4 7.2 43 196-271 156-198 (210)
46 PF13833 EF-hand_8: EF-hand do 48.7 7.3 0.00016 28.9 0.5 27 534-569 27-53 (54)
47 smart00027 EH Eps15 homology d 48.1 2.8 6.1E-05 35.6 -2.1 29 532-569 10-38 (96)
48 PF08726 EFhand_Ca_insen: Ca2+ 46.5 21 0.00045 28.6 2.8 29 457-485 4-32 (69)
49 cd00213 S-100 S-100: S-100 dom 46.2 4.9 0.00011 33.3 -0.8 30 457-486 6-38 (88)
50 cd00051 EFh EF-hand, calcium b 42.9 6 0.00013 29.0 -0.7 26 461-486 2-28 (63)
51 PLN02964 phosphatidylserine de 42.6 4.2 9.1E-05 46.6 -2.3 79 457-550 141-233 (644)
52 KOG4251|consensus 40.8 16 0.00035 36.2 1.7 104 398-527 55-165 (362)
53 PF00112 Peptidase_C1: Papain 38.4 46 0.001 32.1 4.6 44 196-272 163-206 (219)
54 cd02859 AMPKbeta_GBD_like AMP- 37.9 36 0.00077 27.8 3.1 24 50-76 44-67 (79)
55 PF07910 Peptidase_C78: Peptid 34.7 1.2E+02 0.0026 30.0 6.8 55 93-150 61-135 (218)
56 PF09279 EF-hand_like: Phospho 31.8 46 0.00099 27.1 2.8 67 460-567 1-67 (83)
57 KOG0028|consensus 28.4 13 0.00029 34.6 -1.0 82 457-550 31-124 (172)
58 PF09069 EF-hand_3: EF-hand; 26.6 79 0.0017 26.7 3.4 30 459-488 3-32 (90)
59 cd02620 Peptidase_C1A_Cathepsi 24.9 2.2E+02 0.0047 28.4 6.9 27 198-229 184-210 (236)
60 KOG0036|consensus 23.7 20 0.00044 38.4 -0.8 54 457-522 80-134 (463)
61 PTZ00183 centrin; Provisional 22.2 21 0.00045 32.6 -1.0 27 534-569 128-154 (158)
62 PF03633 Glyco_hydro_65C: Glyc 21.7 1E+02 0.0022 22.9 2.9 22 51-72 7-28 (54)
63 cd00585 Peptidase_C1B Peptidas 21.4 4.2E+02 0.0091 29.3 8.7 29 198-229 358-386 (437)
64 PF14658 EF-hand_9: EF-hand do 20.8 47 0.001 26.3 0.9 12 556-567 51-62 (66)
65 KOG0041|consensus 20.6 29 0.00063 33.6 -0.4 32 536-567 103-161 (244)
No 1
>KOG0045|consensus
Probab=100.00 E-value=3e-87 Score=738.17 Aligned_cols=489 Identities=36% Similarity=0.573 Sum_probs=412.5
Q ss_pred eccCCcccCCCCchhHHHHHHHhcCCcCcccccccCCCCCCCCCCCCccEEEeeccCCeEEEEEecccccccCCcEEEee
Q psy1391 5 VSCLGVLYLSKGDKWLVSCLGVLYLSKGLFYRVVPADQSFSSPDQYAGVFRFRLWWCGEWVEVLVDDRLPTVNGKLAFLQ 84 (614)
Q Consensus 5 ~~~~~v~QG~lgDcw~laal~~la~~~~~l~~~~~~~q~~~~~~~~~G~y~~r~~~~G~w~~VvVDD~lP~~~g~~~f~~ 84 (614)
+++.||.||.|||||||||+|+||.+|+|+.+++|++|++..+ |+|+|+||||++|+|++|+|||+|||.+|+++|++
T Consensus 72 ~~~~di~Qg~lgdCw~laA~a~la~~~~ll~~vip~~~~~~~~--yaGif~f~~w~~G~W~~VvIDD~LP~~~~~~~~~~ 149 (612)
T KOG0045|consen 72 PSRFDVKQGLLGDCWFLAACAALALRPELLDKVIPQDQSFQEN--YAGIFHFRFWQNGEWVEVVIDDRLPTSNGGLLFSH 149 (612)
T ss_pred CCcceeEEeeecchHHHHHHHHhhcCHHHHHhccCCCcccccc--cceEEEEEEEeCCeEEEEEeeeecceEcCCEEEEe
Confidence 5889999999999999999999999999999999999999887 99999999999999999999999999999999999
Q ss_pred ccCCCCchhHHHhhhhhhhcCccccccccccccceecccCCceEEEeccCCCC----cHHHHHHHhhcCCCeEEEEecCC
Q psy1391 85 SVHSDQFWPGLLEKAYAKLHGSYEALKYGTMLDGLADLTGGITESISIRQDPT----SSGRLLNKLLDMTSLITCTVQSS 160 (614)
Q Consensus 85 s~~~~e~W~~LlEKAyAKl~GsY~~l~gg~~~~al~dlTG~~~~~~~l~~~~~----~l~~~l~~~~~~~~l~~~~~~~~ 160 (614)
++..+|||+|||||||||++|||++|.||.+.+||++|||++++.+.++..+. +++..+....++|.+++|++...
T Consensus 150 s~~~~efW~aLlEKAyaKl~GsY~~l~gg~~~~a~~~lTG~~~e~~~l~~~~~~~~~~l~~~~~~~~~~~~~l~c~~~~~ 229 (612)
T KOG0045|consen 150 SSGKNEFWAALLEKAYAKLLGSYEALHGGSTIDALVDLTGGVTEPFDLNKTPKSFKNNLVWALLKSAHRGSLLLCSIESK 229 (612)
T ss_pred ecCCceeHHHHHHHHHHHHhCcccCCCCCchhhHHHhccCCccceeEcccCcchhHHHHHHHHHHhhhccCceeeecccc
Confidence 98889999999999999999999999999999999999999999999998776 77888888889999999988651
Q ss_pred CCCCCcccccccccccccccCCCCCCCcchhccCCccccceEeeeeEEEEEccCCceeEEEEeeCCCCCCCCccccCCCC
Q psy1391 161 QPQPVGLVEFRRSSRTIRRIPDTKTRPNSEKLANGIQIGINYRLYALERIETYDGEPVQLVKLRNPLGGGGEYIGAWSRD 240 (614)
Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~GL~~~HaY~v~~~~~v~~~~g~~~~LlrlrNPwg~~~ew~G~wsd~ 240 (614)
... ..+.+....+||+++|||+|+++++++.+.+ .++|||||||||. .||+|.|||.
T Consensus 230 ~~~---------------------~~~~~~~~~~gL~~~HaYsit~~~~~~~~~~-~~~lirlrNPwg~-~~W~G~wsd~ 286 (612)
T KOG0045|consen 230 DPT---------------------EEEEEAKLRNGLVKGHAYAITDVREVQGRGG-KHRLIRLRNPWGE-SEWNGPWSDG 286 (612)
T ss_pred ccc---------------------hhHHHHHhhcCccccccEEEEEEEEeecccc-cceeEEecCCcCC-ceeccccccC
Confidence 111 0012567899999999999999999987555 8999999999998 9999999999
Q ss_pred CCCcccCCHHHHhhhCccccCCceEeeehHHHHhccceeEEEeeCCCCCCCCCCcccCCCeEEEEEEeEE--EeCccCCC
Q psy1391 241 SSNWDAVSAHDKETLGVRHLADGEFWISYADFLKAFTHLEVVHLDSETSRDEPSLHHKNTWQMRLFQGTW--QRGVTAGG 318 (614)
Q Consensus 241 s~~W~~~~~~~~~~l~~~~~~dG~FWm~~~df~~~F~~~~ic~~~~~~~~~~~~l~~~~~w~~~~~~G~W--~~g~tAGG 318 (614)
+++|..+.+..++.+.....+||+|||+++||+++|+.++||++.++....... ..|....+.|+| .++.+|||
T Consensus 287 ~~~W~~v~~~~~~~~~~~~~~dGeFWms~~dF~~~F~~~~vC~~~~~~~~~~~~----~~~~~~~~~~~w~~~~~~t~gg 362 (612)
T KOG0045|consen 287 SEEWHLVDKSKLSELGRQPLDDGEFWMSFDDFLREFDSLTVCRLRPDWLESRNQ----LQWVKLSLDGEWELARGVTAGG 362 (612)
T ss_pred CcchhhhCHHHHhhcccccccCCCeeeeHHHHHhhCCeEeecCCCcchhhhhhe----eeeeeeecCCccceeecccCCC
Confidence 999999998889988876689999999999999999999999999987643322 368999999999 88999999
Q ss_pred CCCCCCCcccCceEEEEecccc----eEEEEEEecCCCCCc-------eeEEEEEEccCCCCCcCccccccccCcccccC
Q psy1391 319 CRNNPETFHINPQLHLILSELE----EVIISLNQHSIMEPK-------VIGFTGYSLPKSSSENTGRAFFKKNKSLVNSQ 387 (614)
Q Consensus 319 ~~n~~~tf~~NPQy~l~v~~~~----~v~i~L~Q~~~r~~~-------~Igf~v~~v~~~~~~~l~~~~f~~~~~v~~s~ 387 (614)
|.++.++|+.||||.|.+.+++ .+++.|+|++++... .|||++|+++.+ +..++..+.+++.+.
T Consensus 363 c~~~~~tF~~npq~~~~~~~~~~~~~~~v~~~~q~~~~~~~~~~~~~~~ig~~i~~v~~~-----~~~~~~~~~~~~~~~ 437 (612)
T KOG0045|consen 363 CRNSVDTFDRNPQYILAVRKPTKSLCAVVLALFQKTRRGERSFGANILDIGFHIYEVPLE-----GKYFVLDNAPIASSS 437 (612)
T ss_pred CccCcccccCCceEEEEecCCCccceEEEEEeecccccccccccceeeecceEEEEecCC-----CCceEecccchhccc
Confidence 9999999999999999998865 688999999887652 299999998854 334455566777766
Q ss_pred -cccccceeeeEEecCCcEEEEecccCCCCccCceEEEeeCCCceeeecCCCCcchhhhcccCCCCCCcchhhhHHHHHH
Q psy1391 388 -YTNSRQVSHRCQLEQGSYLVLPTTFEPGQESGFTLRVYSSNPLKLKLLDSLPSVLKSAIVKAPSSLDTKSFSQYEAVFL 466 (614)
Q Consensus 388 -y~~~reVs~~~~L~pG~YvVVPsT~~p~~e~~F~Lrv~s~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~F~ 466 (614)
+++.|+|+.++.+|||.|++||+||+|+++++|+|||||+.++..++
T Consensus 438 ~~i~~r~v~~~~~~P~~~y~~~pst~~~~~~~~f~lrvfs~~~~~~~~-------------------------------- 485 (612)
T KOG0045|consen 438 SFINNREVSVRFRLPPGTYVIVPSTFEPGEEGEFLLRVFSNVKVKSEE-------------------------------- 485 (612)
T ss_pred ccccceeEEEEecCCCcceeecccCCCCCCCccEEEEEeecccccCcc--------------------------------
Confidence 99999999999999999999999999999999999999998876543
Q ss_pred HHhCCCCCCChHHHHHHhhhcCCCCccCCCeeeeeeeeCCcceEEEEeeCCCCCCc----ccccCCCCCCCCccccccch
Q psy1391 467 QLADEHRTVNAFELQELLDACLPNDYIKSCACMEIETYDGEPVQLVKLRNPLGGGG----EYIGAWSRDSSNWDAVSAHD 542 (614)
Q Consensus 467 ~~a~~~~~i~~~eLq~~L~~~~~~~~~~~~f~~~~~~~~~~~~~li~l~d~~g~G~----Ef~~lW~~~~~w~~~f~~~d 542 (614)
+++|...+++..++..... +|+.+.| ..+..+.+..++++ ++.-+|.++.+| + |
T Consensus 486 -----~~~i~~~~~~~~~~~~~~~-----~~~~~~~------~~~~~~~~~~~~~k~s~~~~~~~~~~~~~~--~----~ 543 (612)
T KOG0045|consen 486 -----DMEISLDETKRSTNIIVMK-----GFSLGEC------GDKWKLSSTLVNTKVSRSSEFILTVEVVSP--L----D 543 (612)
T ss_pred -----ceEEeecccccceeeeeec-----ceehhhh------chhhhccccccccccchhhceeeeeccccc--E----E
Confidence 2344444444444433322 4888888 88888888777666 788888888877 2 3
Q ss_pred hhhhccccc-------------------------cccceeEeHHHHHHhhceeeeeeccccCCC
Q psy1391 543 KETLGVRHL-------------------------ADGEFWISYADFLKAFTHLEVVHLDSETSR 581 (614)
Q Consensus 543 ~d~sG~~~~-------------------------~~~~~~m~f~dFi~~~~~L~~~~~~~~~~~ 581 (614)
-++++++.. -+.+.+.+++.|..|.+.+..+...+..++
T Consensus 544 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 607 (612)
T KOG0045|consen 544 IEGESTLVVDIPIAIESKGSGDVAPLLNVIRLRIADPEIAYSFDSTSCCATEGPLVLDELFDLS 607 (612)
T ss_pred EeccccccccccceeeccCCcccccceeeeeeeccChhheeeccccccccccCcchhhhhhcCC
Confidence 333333221 011667888888888888887776666544
No 2
>smart00230 CysPc Calpain-like thiol protease family. Calpain-like thiol protease family (peptidase family C2). Calcium activated neutral protease (large subunit).
Probab=100.00 E-value=2.9e-68 Score=554.99 Aligned_cols=257 Identities=42% Similarity=0.781 Sum_probs=234.5
Q ss_pred eeccCCcccCCCCchhHHHHHHHhcCCcCcccccccCCCCCCCCCCCCccEEEeeccCCeEEEEEecccccccCCcEEEe
Q psy1391 4 LVSCLGVLYLSKGDKWLVSCLGVLYLSKGLFYRVVPADQSFSSPDQYAGVFRFRLWWCGEWVEVLVDDRLPTVNGKLAFL 83 (614)
Q Consensus 4 ~~~~~~v~QG~lgDcw~laal~~la~~~~~l~~~~~~~q~~~~~~~~~G~y~~r~~~~G~w~~VvVDD~lP~~~g~~~f~ 83 (614)
-|++.||+||.||||||||||++||++|+++++|||+++.+.++ +.|+|+||||.+|+|+.|+|||+||+.+|.++|+
T Consensus 55 ~i~~~di~QG~lgDC~~lsal~~la~~~~~i~~if~~~~~~~~~--~~G~y~vrl~~~G~w~~V~VDd~lP~~~~~~~~~ 132 (318)
T smart00230 55 GASRTDICQGVLGDCWLLAALASLTLREKLLDRVIPHDQEFSEN--YAGIFHFRFWRFGKWVDVVIDDRLPTYNGELVFM 132 (318)
T ss_pred CCChhhccCcccccHHHHHHHHHHHhCHHHHhheEeCCcccccc--cCCEEEEEEEECCEEEEEEecCCCeeeCCceEEE
Confidence 37899999999999999999999999999999999977666544 7899999999999999999999999998889999
Q ss_pred eccCCCCchhHHHhhhhhhhcCccccccccccccceecccCCceEEEeccCC---CCcHHHHHHHhhcCCCeEEEEecCC
Q psy1391 84 QSVHSDQFWPGLLEKAYAKLHGSYEALKYGTMLDGLADLTGGITESISIRQD---PTSSGRLLNKLLDMTSLITCTVQSS 160 (614)
Q Consensus 84 ~s~~~~e~W~~LlEKAyAKl~GsY~~l~gg~~~~al~dlTG~~~~~~~l~~~---~~~l~~~l~~~~~~~~l~~~~~~~~ 160 (614)
++++.+|+|++||||||||+||||++|.+|.+.+||++|||++++.++++.. +..+|+.|.+++++|++|+|++...
T Consensus 133 ~~~~~~e~W~~LLEKAyAK~~GsY~~i~gg~~~~al~~LTG~~~~~i~l~~~~~~~~~~w~~l~~~~~~g~lv~~~t~~~ 212 (318)
T smart00230 133 HSNSRNEFWSALLEKAYAKLNGCYEALKGGSTTEALEDLTGGVAESIDLKEASKDPDNLFEDLFKAFERGSLMGCSIGAG 212 (318)
T ss_pred EeCCCCcchhHHHHHHHHHHcCCCcccCCCCHHHHHHHhcCCCeEEEEcccccCCHHHHHHHHHHHHhCCCeEEEEcCCC
Confidence 9999999999999999999999999999999999999999999999999864 5789999999999999999998754
Q ss_pred CCCCCcccccccccccccccCCCCCCCcchhccCCccccceEeeeeEEEEEccCCceeEEEEeeCCCCCCCCccccCCCC
Q psy1391 161 QPQPVGLVEFRRSSRTIRRIPDTKTRPNSEKLANGIQIGINYRLYALERIETYDGEPVQLVKLRNPLGGGGEYIGAWSRD 240 (614)
Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~GL~~~HaY~v~~~~~v~~~~g~~~~LlrlrNPwg~~~ew~G~wsd~ 240 (614)
... ..+.....||+++|||+|+++.++. +.+++||+||||||. .||+|+|||+
T Consensus 213 ~~~-----------------------~~~~~~~~GLv~~HaYsVl~v~~~~---~~~~~Ll~lrNPWg~-~eW~G~wsd~ 265 (318)
T smart00230 213 TAV-----------------------EEEEQKDCGLVKGHAYSVTDVREVQ---GRRQELLRLRNPWGQ-VEWNGPWSDD 265 (318)
T ss_pred Ccc-----------------------hhhhhhhcCcccCccEEEEEEEEEe---cCCeEEEEEECCCCC-CCcCCCCCCC
Confidence 211 1234567999999999999999984 224679999999998 9999999999
Q ss_pred CCCcccCCHHHHhhhCccccCCceEeeehHHHHhccceeEEEeeCCCCC
Q psy1391 241 SSNWDAVSAHDKETLGVRHLADGEFWISYADFLKAFTHLEVVHLDSETS 289 (614)
Q Consensus 241 s~~W~~~~~~~~~~l~~~~~~dG~FWm~~~df~~~F~~~~ic~~~~~~~ 289 (614)
|++|++++++.+++|++...+||+|||+|+||+++|+.++||++.|+.+
T Consensus 266 s~~W~~~~~~~~~~l~~~~~~dG~FWM~~~df~~~F~~~~vc~~~~~~~ 314 (318)
T smart00230 266 SPEWRSVSASEKKNLGLTFDDDGEFWMSFEDFLRHFDKVEICNLNPDSL 314 (318)
T ss_pred CccccccCHHHHHHhCCCCCCCCEEEEEhHHHHhhCCeEEEeccCCccc
Confidence 9999999889999999888899999999999999999999999999754
No 3
>PF00648 Peptidase_C2: Calpain family cysteine protease This is family C2 in the peptidase classification. ; InterPro: IPR001300 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of cysteine peptidases belong to the MEROPS peptidase family C2 (calpain family, clan CA). A type example is calpain, which is an intracellular protease involved in many important cellular functions that are regulated by calcium []. The protein is a complex of 2 polypeptide chains (light and heavy), with three known forms in mammals [, ]: a highly calcium-sensitive (i.e., micro-molar range) form known as mu-calpain, mu-CANP or calpain I; a form sensitive to calcium in the milli-molar range, known as m-calpain, m-CANP or calpain II; and a third form, known as p94, which is found in skeletal muscle only []. All forms have identical light but different heavy chains. Both mu- and m-calpain are heterodimers containing an identical 28kDa subunit and an 80kDa subunit that shares 55-65% sequence homology between the two proteases [, ]. The crystallographic structure of m-calpain reveals six "domains" in the 80kDa subunit: A 19-amino acid NH2-terminal sequence; Active site domain IIa; Active site domain IIb. Domain 2 shows low levels of sequence similarity to papain; although the catalytic His has not been located by biochemical means, it is likely that calpain and papain are related []. Domain III; An 18-amino acid extended sequence linking domain III to domain IV; Domain IV, which resembles the penta EF-hand family of polypeptides, binds calcium and regulates activity []. />]. Ca2+-binding causes a rearrangement of the protein backbone, the net effect of which is that a Trp side chain, which acts as a wedge between catalytic domains IIa and IIb in the apo state, moves away from the active site cleft allowing for the proper formation of the catalytic triad []. Calpain-like mRNAs have been identified in other organisms including bacteria, but the molecules encoded by these mRNAs have not been isolated, so little is known about their properties. How calpain activity is regulated in these organisms cells is still unclear In metazoans, the activity of calpain is controlled by a single proteinase inhibitor, calpastatin (IPR001259 from INTERPRO). The calpastatin gene can produce eight or more calpastatin polypeptides ranging from 17 to 85 kDa by use of different promoters and alternative splicing events. The physiological significance of these different calpastatins is unclear, although all bind to three different places on the calpain molecule; binding to at least two of the sites is Ca2+ dependent. The calpains ostensibly participate in a variety of cellular processes including remodelling of cytoskeletal/membrane attachments, different signal transduction pathways, and apoptosis. Deregulated calpain activity following loss of Ca2+ homeostasis results in tissue damage in response to events such as myocardial infarcts, stroke, and brain trauma []. Calpains are a family of cytosolic cysteine proteinases (see PDOC00126 from PROSITEDOC). Members of the calpain family are believed to function in various biological processes, including integrin-mediated cell migration, cytoskeletal remodeling, cell differentiation and apoptosis [, ]. The calpain family includes numerous members from C. elegans to mammals and with homologues in yeast and bacteria. The best characterised members are the m- and mu-calpains, both proteins are heterodimer composed of a large catalytic subunit and a small regulatory subunit. The large subunit comprises four domains (dI-dIV) while the small subunit has two domains (dV-dVI). Domain dI is a short region cleaved by autolysis, dII is the catalytic core, dIII is a C2-like domain, dIV consists of five calcium binding EF-hand motifs []. The crystal structure of calpain has been solved [, ]. The catalytic region consists of two distinct structural domains (dIIa and dIIb). dIIa contains a central helix flanked on three faces by a cluster of alpha-helices and is entirely unrelated to the corresponding domain in the typical thiol proteinases. The fold of dIIb is similar to the corresponding domain in other cysteine proteinases and contains two three-stranded anti-parallel beta-sheets. The catalytic triad residues (C,H,N) are located in dIIa and dIIb. The activation of the domain is dependent on the binding of two calcium atoms in two non EF-hand calcium binding sites located in the catalytic core, one close to the Cys active site in dIIa and one at the end of dIIb. Calcium-binding induced conformational changes in the catalytic domain which align the active site [][]. The profile covers the whole catalytic domain.; GO: 0004198 calcium-dependent cysteine-type endopeptidase activity, 0006508 proteolysis, 0005622 intracellular; PDB: 2NQA_A 1KFU_L 1KFX_L 1QXP_B 2R9C_A 1TL9_A 2G8E_A 1KXR_B 2G8J_A 2NQG_A ....
Probab=100.00 E-value=2.2e-65 Score=531.79 Aligned_cols=254 Identities=42% Similarity=0.715 Sum_probs=216.3
Q ss_pred eccCCcccCCCCchhHHHHHHHhcCCcCcccccccCCCCCCCCCCCCccEEEeeccCCeEEEEEecccccccCCcEEEee
Q psy1391 5 VSCLGVLYLSKGDKWLVSCLGVLYLSKGLFYRVVPADQSFSSPDQYAGVFRFRLWWCGEWVEVLVDDRLPTVNGKLAFLQ 84 (614)
Q Consensus 5 ~~~~~v~QG~lgDcw~laal~~la~~~~~l~~~~~~~q~~~~~~~~~G~y~~r~~~~G~w~~VvVDD~lP~~~g~~~f~~ 84 (614)
+++.||.||.||||||||||++||++|++|+++||.++++.++ +.|+|+||||.+|+|+.|+|||+||+.+|+++|++
T Consensus 43 ~~~~di~QG~lgDc~llaaL~~la~~~~~i~~i~~~~~~~~~~--~~G~y~v~l~~~G~w~~V~VDd~lP~~~g~~~f~~ 120 (298)
T PF00648_consen 43 ISPSDIRQGSLGDCWLLAALAALAEHPDLIKKIFPVNQSFNEN--YNGIYTVRLFKNGEWREVTVDDRLPCKNGKPLFAR 120 (298)
T ss_dssp SSGGGEBE-SSSSHHHHHHHHHHTTSHHHHHHHS-SS--SSTT---SSEEEEEEEETTEEEEEEEES-EEEETTEESSSB
T ss_pred CccccccccccCChhHHHHHHHHHhcccccccccccccccccc--cCceeeEeeccCCeeeeeccchhhhccccceeeec
Confidence 5688999999999999999999999999999999877766655 78999999999999999999999999999999999
Q ss_pred ccCCCCchhHHHhhhhhhhcCccccccccccccceecccCCceEEEeccCCC--CcHHHHHHHhhcCCCeEEEEecCCCC
Q psy1391 85 SVHSDQFWPGLLEKAYAKLHGSYEALKYGTMLDGLADLTGGITESISIRQDP--TSSGRLLNKLLDMTSLITCTVQSSQP 162 (614)
Q Consensus 85 s~~~~e~W~~LlEKAyAKl~GsY~~l~gg~~~~al~dlTG~~~~~~~l~~~~--~~l~~~l~~~~~~~~l~~~~~~~~~~ 162 (614)
+.+++|+|++||||||||+||||++|.||.+.+||++|||++++.+.++... .++++.+.+..+++++++|.+.....
T Consensus 121 s~~~~elW~~LlEKAyAKl~GsY~~l~gg~~~~al~~LTG~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 200 (298)
T PF00648_consen 121 SSDPNELWPSLLEKAYAKLHGSYSALEGGNPSEALQDLTGGPPESIDLRDDSSDDELWELWKKLLKSGSLVGCSTGSSTP 200 (298)
T ss_dssp ESSTTB-HHHHHHHHHHHHTTSSGGGSSBSHHHHHHHHHSSEEEEEEGGG--T--THHHHHHHHHHCT-EEEEE--SSSG
T ss_pred cCCcccchhhhhhchhhhccccccccCCCChhhhhHhhcCCcceeeeccccchhhhHHHHHHHHHHhccccccccccccc
Confidence 9899999999999999999999999999999999999999999999997643 57999999999999999998765321
Q ss_pred CCCcccccccccccccccCCCCCCCcchhccCCccccceEeeeeEEEEEccCCceeEEEEeeCCCCCCCCccccCCCCCC
Q psy1391 163 QPVGLVEFRRSSRTIRRIPDTKTRPNSEKLANGIQIGINYRLYALERIETYDGEPVQLVKLRNPLGGGGEYIGAWSRDSS 242 (614)
Q Consensus 163 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~GL~~~HaY~v~~~~~v~~~~g~~~~LlrlrNPwg~~~ew~G~wsd~s~ 242 (614)
. ..+.....||+++|||+|++++++.. +|+.++||+||||||. .+|+|+||+.++
T Consensus 201 ~-----------------------~~~~~~~~gl~~~HaY~Vl~~~~~~~-~~~~~~lv~LrNPwg~-~~w~G~ws~~s~ 255 (298)
T PF00648_consen 201 F-----------------------DSEEYEKNGLVPGHAYAVLDVREVNG-NGEGHRLVKLRNPWGS-TEWKGDWSDDSP 255 (298)
T ss_dssp G-----------------------GTTSBCTTSBBTTS-EEEEEEEEEEE-TTEEEEEEEEE-TTSS----SSTTSTTSG
T ss_pred c-----------------------ccccccccCcccceeEEEEEEEeecc-ccceeEEEEEcCCCcc-cccccccccccc
Confidence 1 23446789999999999999999865 3457999999999998 999999999999
Q ss_pred CcccCCHHHHhhhCccccCCceEeeehHHHHhccceeEEEeeC
Q psy1391 243 NWDAVSAHDKETLGVRHLADGEFWISYADFLKAFTHLEVVHLD 285 (614)
Q Consensus 243 ~W~~~~~~~~~~l~~~~~~dG~FWm~~~df~~~F~~~~ic~~~ 285 (614)
+|+++.++.|++|.....+||+|||+|+||+++|+.|+||+++
T Consensus 256 ~W~~~~~~~~~~~~~~~~~dg~FWM~~~df~~~F~~i~vc~~~ 298 (298)
T PF00648_consen 256 EWTEIHPSLRKRLNQSSSDDGTFWMSFEDFLKYFSSIYVCRLT 298 (298)
T ss_dssp GGGGS-HHHHHHHTTTSSSSSEEEEEHHHHHHHSEEEEEEESC
T ss_pred ccccCCHHHHhhcccccccCccHhHhHHHHHhhCCceEEEeeC
Confidence 9998888899999988889999999999999999999999974
No 4
>cd00044 CysPc Calpains, domains IIa, IIb; calcium-dependent cytoplasmic cysteine proteinases, papain-like. Functions in cytoskeletal remodeling processes, cell differentiation, apoptosis and signal transduction.
Probab=100.00 E-value=1.1e-62 Score=514.38 Aligned_cols=247 Identities=43% Similarity=0.753 Sum_probs=222.7
Q ss_pred eccCCcccCCCCchhHHHHHHHhcCCcCcccccccCCCCCCCCCCCCccEEEeeccCCeEEEEEecccccccCCcEEEee
Q psy1391 5 VSCLGVLYLSKGDKWLVSCLGVLYLSKGLFYRVVPADQSFSSPDQYAGVFRFRLWWCGEWVEVLVDDRLPTVNGKLAFLQ 84 (614)
Q Consensus 5 ~~~~~v~QG~lgDcw~laal~~la~~~~~l~~~~~~~q~~~~~~~~~G~y~~r~~~~G~w~~VvVDD~lP~~~g~~~f~~ 84 (614)
|++.||.||.+|||||||||++||++|++|+++|+..+...++ +.|+|+||||.+|+|+.|+|||+||+.++.++|++
T Consensus 64 ~~~~dI~QG~lgDC~~lsaL~~la~~~~~i~~lf~~~~~~~~~--~~G~y~v~l~~~G~w~~V~VDD~lP~~~~~~~~~~ 141 (315)
T cd00044 64 ASPSDVCQGILGDCWFLAALAALAERPELLKRVIPPDQSFEEN--YAGIYHFRFWKNGEWVEVVIDDRLPTSNGGLLFMH 141 (315)
T ss_pred CChhhcccCcccchHHHHHHHHHHcCHHHHhheEcCCcccccC--cCcEEEEEEEECCEEEEEEecCCCeecCCceEEEE
Confidence 6899999999999999999999999999999999876543323 67999999999999999999999999888899999
Q ss_pred ccCCCCchhHHHhhhhhhhcCccccccccccccceecccCCceEEEeccCC-----CCcHHHHHHHhhcCCCeEEEEecC
Q psy1391 85 SVHSDQFWPGLLEKAYAKLHGSYEALKYGTMLDGLADLTGGITESISIRQD-----PTSSGRLLNKLLDMTSLITCTVQS 159 (614)
Q Consensus 85 s~~~~e~W~~LlEKAyAKl~GsY~~l~gg~~~~al~dlTG~~~~~~~l~~~-----~~~l~~~l~~~~~~~~l~~~~~~~ 159 (614)
+.+.+|+|++||||||||+||||++|.||++.+||++|||++++.++++.. ...+|+.|.++.+.+++|+|++..
T Consensus 142 s~~~~e~W~~LlEKAyAK~~GsY~~i~gg~~~~al~~LTG~~~~~i~~~~~~~~~~~~~~~~~l~~~~~~~~lv~~~t~~ 221 (315)
T cd00044 142 SRDRNELWVALLEKAYAKLHGSYEALVGGNTAEALEDLTGGPTERIDLKSADASSGDNDLFALLLSFLQGGSLIGCSTGS 221 (315)
T ss_pred ECCCCeEcHHHHHHHHHhhcCCccccCCCCHHHHHHHhhCCCcEEEEccccccccCHHHHHHHHHHHhhCCCEEEEEcCC
Confidence 988899999999999999999999999999999999999999999999864 467999999999999999999876
Q ss_pred CCCCCCcccccccccccccccCCCCCCCcchhccCCccccceEeeeeEEEEEccCCceeEEEEeeCCCCCCCCccccCCC
Q psy1391 160 SQPQPVGLVEFRRSSRTIRRIPDTKTRPNSEKLANGIQIGINYRLYALERIETYDGEPVQLVKLRNPLGGGGEYIGAWSR 239 (614)
Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~GL~~~HaY~v~~~~~v~~~~g~~~~LlrlrNPwg~~~ew~G~wsd 239 (614)
... .+.....||+.+|||+|+++.+++. .+++||+||||||. .||+|+|||
T Consensus 222 ~~~-------------------------~~~~~~~Gl~~~HaY~Vl~~~~~~~---~~~~lv~lrNPWg~-~~w~G~ws~ 272 (315)
T cd00044 222 RSE-------------------------EEARTANGLVKGHAYSVLDVREVQE---EGLRLLRLRNPWGV-GEWWGGWSD 272 (315)
T ss_pred CCc-------------------------chhhccCCcccCcceEEeEEEEEcc---CceEEEEecCCccC-CCccCCCCC
Confidence 431 1135678999999999999999843 15899999999999 999999999
Q ss_pred CCCCcccCCHHHHhhhCccccCCceEeeehHHHHhccceeEEEe
Q psy1391 240 DSSNWDAVSAHDKETLGVRHLADGEFWISYADFLKAFTHLEVVH 283 (614)
Q Consensus 240 ~s~~W~~~~~~~~~~l~~~~~~dG~FWm~~~df~~~F~~~~ic~ 283 (614)
.|++|... +..++.+.....+||+|||+|+||+++|+.++||+
T Consensus 273 ~~~~w~~~-~~~~~~~~~~~~~dG~Fwm~~~df~~~F~~~~vc~ 315 (315)
T cd00044 273 DSSEWWVI-DAERKKLLLSGKDDGEFWMSFEDFLRNFDGLYVCN 315 (315)
T ss_pred CCchhccC-hHHHHHhcCCCCCCCEEEEEhHHhheeeCeEEEeC
Confidence 99999744 66777777778899999999999999999999995
No 5
>cd00214 Calpain_III Calpain, subdomain III. Calpains are calcium-activated cytoplasmic cysteine proteinases, participate in cytoskeletal remodeling processes, cell differentiation, apoptosis and signal transduction. Catalytic domain and the two calmodulin-like domains are separated by C2-like domain III. Domain III plays an important role in calcium-induced activation of calpain involving electrostatic interactions with subdomain II. Proposed to mediate calpain's interaction with phospholipids and translocation to cytoplasmic/nuclear membranes. CD includes subdomain III of typical and atypical calpains.
Probab=99.96 E-value=1.8e-29 Score=235.27 Aligned_cols=135 Identities=41% Similarity=0.775 Sum_probs=119.9
Q ss_pred CeEEEEEEeEEEeCccCCCCCCCCCCcccCceEEEEeccc------ceEEEEEEecCCCCC-------ceeEEEEEEccC
Q psy1391 300 TWQMRLFQGTWQRGVTAGGCRNNPETFHINPQLHLILSEL------EEVIISLNQHSIMEP-------KVIGFTGYSLPK 366 (614)
Q Consensus 300 ~w~~~~~~G~W~~g~tAGG~~n~~~tf~~NPQy~l~v~~~------~~v~i~L~Q~~~r~~-------~~Igf~v~~v~~ 366 (614)
.|+..+++|+|.+|.|||||++++.||++||||.|+|.++ ++|+|+|+|+++|.. ..|||+||+++.
T Consensus 2 ~W~~~~~~G~W~~g~tAGGc~~~~~tf~~NPQf~l~v~~~~~~~~~~~v~i~L~q~~~r~~~~~~~~~~~IGf~v~~~~~ 81 (150)
T cd00214 2 KWHTKSFNGEWRRGQTAGGCRNNPDTFWTNPQFRIRVPEPDDDEGKCTVLIALMQKNRRHLRKKGLDLLTIGFHVYKVPG 81 (150)
T ss_pred cceEEEEeCeEeCCcccCCCCCcccccccCceEEEEecCCCCCCCccEEEEEeccCCcchhcccCCCcceEEEEEEEeCC
Confidence 5999999999999999999999999999999999999998 789999999987632 249999999876
Q ss_pred CCCCcCccccccccCcccccC-cccccceeeeEEecCCcEEEEecccCCCCccCceEEEeeCCCceeeec
Q psy1391 367 SSSENTGRAFFKKNKSLVNSQ-YTNSRQVSHRCQLEQGSYLVLPTTFEPGQESGFTLRVYSSNPLKLKLL 435 (614)
Q Consensus 367 ~~~~~l~~~~f~~~~~v~~s~-y~~~reVs~~~~L~pG~YvVVPsT~~p~~e~~F~Lrv~s~~~~~~~~l 435 (614)
. ..++++.+|..+.+++.+. |.+.|+|+.++.|+||.|+||||||+|+++|+|.||||++.++.+++|
T Consensus 82 ~-~~~~~~~~~~~~~~~~~s~~~~~~rev~~~~~L~pG~YvIIPsT~~p~~~g~F~LrVfs~~~~~~~~~ 150 (150)
T cd00214 82 E-NRHLRRDFFLHKAPRARSSTFINTREVSLRFRLPPGEYVIVPSTFEPGEEGEFLLRVFSEKSIKSSEL 150 (150)
T ss_pred c-CcccChhhhhccCcccccCccccccEEEEEEEcCCCCEEEEeeecCCCCcccEEEEEEecCCCccccC
Confidence 3 2367777787777766554 999999999999999999999999999999999999999999988764
No 6
>smart00720 calpain_III calpain_III.
Probab=99.95 E-value=1.4e-27 Score=221.25 Aligned_cols=130 Identities=46% Similarity=0.833 Sum_probs=115.6
Q ss_pred eEEEEEEeEEEeCccCCCCCCCCCCcccCceEEEEecccc----eEEEEEEecCCCCC-------ceeEEEEEEccCCCC
Q psy1391 301 WQMRLFQGTWQRGVTAGGCRNNPETFHINPQLHLILSELE----EVIISLNQHSIMEP-------KVIGFTGYSLPKSSS 369 (614)
Q Consensus 301 w~~~~~~G~W~~g~tAGG~~n~~~tf~~NPQy~l~v~~~~----~v~i~L~Q~~~r~~-------~~Igf~v~~v~~~~~ 369 (614)
|+..+++|+|.+|.|||||+|++.+|++||||.|+|.+++ +|+|+|+|+++|.. ..|||+||+++...
T Consensus 1 w~~~~~~G~W~~~~tAGG~~~~~~tf~~NPqy~l~v~~~~~~~~~v~i~L~q~~~r~~~~~~~~~~~iGf~v~~~~~~~- 79 (143)
T smart00720 1 WHTKSVQGSWTRGQTAGGCRNYPATFWTNPQFRITLEEPDDDDCTVLIALMQKNRRRLRRKGADFLTIGFAVYKVPKEL- 79 (143)
T ss_pred CcEEEEeCeEECCCccCCccccccccccCCeEEEEecCCCCCceEEEEEecccCcccccccCCccceEeEEEEEecccc-
Confidence 7889999999999999999999999999999999999987 49999999987742 23999999987532
Q ss_pred CcCccccccccCcccccC-cccccceeeeEEecCCcEEEEecccCCCCccCceEEEeeCCCcee
Q psy1391 370 ENTGRAFFKKNKSLVNSQ-YTNSRQVSHRCQLEQGSYLVLPTTFEPGQESGFTLRVYSSNPLKL 432 (614)
Q Consensus 370 ~~l~~~~f~~~~~v~~s~-y~~~reVs~~~~L~pG~YvVVPsT~~p~~e~~F~Lrv~s~~~~~~ 432 (614)
++++.++.....++.+. |.+.|+|+.++.|+||.|+|||+||+|+++|+|.||||++.++.+
T Consensus 80 -~~~~~~~~~~~~~~~s~~y~~~r~v~~~~~L~~G~Y~iVPsT~~p~~~g~F~LrV~s~~~~~l 142 (143)
T smart00720 80 -HLRRDFFLSNAPRASSGDYINGREVSERFRLPPGEYVIVPSTFEPNQEGDFLLRVFSEGPFKL 142 (143)
T ss_pred -ccchhhhhccCccccccccccCeEEEEEEEcCCCCEEEEEeecCCCCccCEEEEEEecCcccc
Confidence 56777777777777664 999999999999999999999999999999999999999998865
No 7
>PF01067 Calpain_III: Calpain large subunit, domain III; InterPro: IPR022682 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of cysteine peptidases belong to the MEROPS peptidase family C2 (calpain family, clan CA). A type example is calpain, which is an intracellular protease involved in many important cellular functions that are regulated by calcium []. The protein is a complex of 2 polypeptide chains (light and heavy), with three known forms in mammals [, ]: a highly calcium-sensitive (i.e., micro-molar range) form known as mu-calpain, mu-CANP or calpain I; a form sensitive to calcium in the milli-molar range, known as m-calpain, m-CANP or calpain II; and a third form, known as p94, which is found in skeletal muscle only []. All forms have identical light but different heavy chains. Both mu- and m-calpain are heterodimers containing an identical 28kDa subunit and an 80kDa subunit that shares 55-65% sequence homology between the two proteases [, ]. The crystallographic structure of m-calpain reveals six "domains" in the 80kDa subunit: A 19-amino acid NH2-terminal sequence; Active site domain IIa; Active site domain IIb. Domain 2 shows low levels of sequence similarity to papain; although the catalytic His has not been located by biochemical means, it is likely that calpain and papain are related []. Domain III; An 18-amino acid extended sequence linking domain III to domain IV; Domain IV, which resembles the penta EF-hand family of polypeptides, binds calcium and regulates activity []. />]. Ca2+-binding causes a rearrangement of the protein backbone, the net effect of which is that a Trp side chain, which acts as a wedge between catalytic domains IIa and IIb in the apo state, moves away from the active site cleft allowing for the proper formation of the catalytic triad []. Calpain-like mRNAs have been identified in other organisms including bacteria, but the molecules encoded by these mRNAs have not been isolated, so little is known about their properties. How calpain activity is regulated in these organisms cells is still unclear In metazoans, the activity of calpain is controlled by a single proteinase inhibitor, calpastatin (IPR001259 from INTERPRO). The calpastatin gene can produce eight or more calpastatin polypeptides ranging from 17 to 85 kDa by use of different promoters and alternative splicing events. The physiological significance of these different calpastatins is unclear, although all bind to three different places on the calpain molecule; binding to at least two of the sites is Ca2+ dependent. The calpains ostensibly participate in a variety of cellular processes including remodelling of cytoskeletal/membrane attachments, different signal transduction pathways, and apoptosis. Deregulated calpain activity following loss of Ca2+ homeostasis results in tissue damage in response to events such as myocardial infarcts, stroke, and brain trauma []. This entry represents domain III. It is found in association with PF00648 from PFAM. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated by calcium ions. ; PDB: 1QXP_B 2QFE_A 1DF0_A 1U5I_A 3DF0_A 3BOW_A 1KFU_L 1KFX_L.
Probab=99.93 E-value=2.6e-25 Score=206.96 Aligned_cols=133 Identities=43% Similarity=0.761 Sum_probs=102.6
Q ss_pred CeEEEEEEeEEEeCccCCCCCCCCCCcccCceEEEEecccce------EEEEEEecCCCCC-------ceeEEEEEEccC
Q psy1391 300 TWQMRLFQGTWQRGVTAGGCRNNPETFHINPQLHLILSELEE------VIISLNQHSIMEP-------KVIGFTGYSLPK 366 (614)
Q Consensus 300 ~w~~~~~~G~W~~g~tAGG~~n~~~tf~~NPQy~l~v~~~~~------v~i~L~Q~~~r~~-------~~Igf~v~~v~~ 366 (614)
.|+..+++|+|..|.|||||.+++.+|++||||.|+|++++. |+|+|+|++.+.. ..|||+||++..
T Consensus 1 ~W~~~~~~G~W~~~~taGG~~~~~~s~~~NPQy~l~v~~~~~~~~~~~v~i~L~q~~~~~~~~~~~~~~~Ig~~v~~~~~ 80 (147)
T PF01067_consen 1 KWHTVTIEGEWVTGNTAGGCPNNPYSWWNNPQYRLTVSEPTEESNKCTVVISLMQKDRRRKRDVGEKDLPIGFYVFKVQS 80 (147)
T ss_dssp EEEEEEEEEEE-TTTS---STT-TTTGGGS-EEEEEESSGCCCSSBEEEEEEEEECSGCCGCSTTTTTSEEEEEEEEETT
T ss_pred CeeEEEEeCEEeCCCcCCCCcccccccccCcEEEEEEcCCCCCcceeEEEEEEEecCcchhhcccccceEEeEEEEeeec
Confidence 499999999998899999999997799999999999999884 9999999987754 249999999822
Q ss_pred CCCCcCccccccccCccccc-CcccccceeeeEEecCCcEEEEecccCCCCccCceEEEeeCCCceee
Q psy1391 367 SSSENTGRAFFKKNKSLVNS-QYTNSRQVSHRCQLEQGSYLVLPTTFEPGQESGFTLRVYSSNPLKLK 433 (614)
Q Consensus 367 ~~~~~l~~~~f~~~~~v~~s-~y~~~reVs~~~~L~pG~YvVVPsT~~p~~e~~F~Lrv~s~~~~~~~ 433 (614)
....+++.++.....+..+ .|.+.|+|+..+.|+||+|+|||||++|+++|+|.|+||++.+++++
T Consensus 81 -~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~L~~G~YvIVPsT~~~~~~g~F~L~v~s~~~~~l~ 147 (147)
T PF01067_consen 81 -QQKRLPRQYFLFNKPVVSSGDYSNSREVSEEFTLPPGTYVIVPSTYEPGQEGEFTLRVFSDSPFELQ 147 (147)
T ss_dssp -TTSE--HHHHHTS-SSEE-SSEBSSSEEEEEEEE-SEEEEEEEEESSTT--EEEEEEEEESSSEEE-
T ss_pred -ccccCCcceeccccceeeccccccceEEEEEEEcCCCCEEEEEecCCCCCeeeEEEEEEECCCcccC
Confidence 2345666555555555555 49999999999999999999999999999999999999999999863
No 8
>KOG0037|consensus
Probab=99.69 E-value=3.9e-19 Score=168.43 Aligned_cols=125 Identities=16% Similarity=0.184 Sum_probs=110.0
Q ss_pred hhhhHHHHHHHH-hCCCCCCChHHHHHHhhhcCCCCccCCCeeeeeeeeCCcceEEEEeeCCCCCCc----ccccCCCCC
Q psy1391 457 SFSQYEAVFLQL-ADEHRTVNAFELQELLDACLPNDYIKSCACMEIETYDGEPVQLVKLRNPLGGGG----EYIGAWSRD 531 (614)
Q Consensus 457 ~~~~~~~~F~~~-a~~~~~i~~~eLq~~L~~~~~~~~~~~~f~~~~~~~~~~~~~li~l~d~~g~G~----Ef~~lW~~~ 531 (614)
..+.+...|.++ +++.|.|++.|||++|.+...+ +|+++|| |.||.|+|.+.+|+ ||+.||..+
T Consensus 55 ~~~~~~~~f~~vD~d~sg~i~~~eLq~aLsn~~~~-----~Fs~~Tc------rlmI~mfd~~~~G~i~f~EF~~Lw~~i 123 (221)
T KOG0037|consen 55 TFPQLAGWFQSVDRDRSGRILAKELQQALSNGTWS-----PFSIETC------RLMISMFDRDNSGTIGFKEFKALWKYI 123 (221)
T ss_pred ccHHHHHHHHhhCccccccccHHHHHHHhhcCCCC-----CCCHHHH------HHHHHHhcCCCCCccCHHHHHHHHHHH
Confidence 467899999999 8999999999999999876554 5999999 99999999999888 999999999
Q ss_pred CCCccccccchhhhhcccc---cccc------------------------ceeEeHHHHHHhhceeeeeeccccCCCCCC
Q psy1391 532 SSNWDAVSAHDKETLGVRH---LADG------------------------EFWISYADFLKAFTHLEVVHLDSETSRDEP 584 (614)
Q Consensus 532 ~~w~~~f~~~d~d~sG~~~---~~~~------------------------~~~m~f~dFi~~~~~L~~~~~~~~~~~~~~ 584 (614)
.+|+.+|+.+|+|+||+|+ |++| .+.|.||+||.||++|+.+++.|..++..
T Consensus 124 ~~Wr~vF~~~D~D~SG~I~~sEL~~Al~~~Gy~Lspq~~~~lv~kyd~~~~g~i~FD~FI~ccv~L~~lt~~Fr~~D~~- 202 (221)
T KOG0037|consen 124 NQWRNVFRTYDRDRSGTIDSSELRQALTQLGYRLSPQFYNLLVRKYDRFGGGRIDFDDFIQCCVVLQRLTEAFRRRDTA- 202 (221)
T ss_pred HHHHHHHHhcccCCCCcccHHHHHHHHHHcCcCCCHHHHHHHHHHhccccCCceeHHHHHHHHHHHHHHHHHHHHhccc-
Confidence 9999999999999999999 4444 56899999999999999999999987653
Q ss_pred CCcCCCCeEEEEE
Q psy1391 585 SLHHKNTWQMRLF 597 (614)
Q Consensus 585 ~~~~~~~w~~~~~ 597 (614)
...+.+..+
T Consensus 203 ----q~G~i~~~y 211 (221)
T KOG0037|consen 203 ----QQGSITISY 211 (221)
T ss_pred ----cceeEEEeH
Confidence 234665554
No 9
>KOG0045|consensus
Probab=99.16 E-value=4.1e-11 Score=133.87 Aligned_cols=120 Identities=38% Similarity=0.730 Sum_probs=93.4
Q ss_pred CCCccCCCe----eeeeeeeCCcceEEEEeeCCCCCCcccccCCCCCCCCccccccchhhhhccccccccceeEeHHHHH
Q psy1391 489 PNDYIKSCA----CMEIETYDGEPVQLVKLRNPLGGGGEYIGAWSRDSSNWDAVSAHDKETLGVRHLADGEFWISYADFL 564 (614)
Q Consensus 489 ~~~~~~~~f----~~~~~~~~~~~~~li~l~d~~g~G~Ef~~lW~~~~~w~~~f~~~d~d~sG~~~~~~~~~~m~f~dFi 564 (614)
..+++++|. .+...+..++...||||+|||| +.||+|.|++...+|....+..+...+....+||+|||+++||+
T Consensus 241 ~~gL~~~HaYsit~~~~~~~~~~~~~lirlrNPwg-~~~W~G~wsd~~~~W~~v~~~~~~~~~~~~~~dGeFWms~~dF~ 319 (612)
T KOG0045|consen 241 RNGLVKGHAYAITDVREVQGRGGKHRLIRLRNPWG-ESEWNGPWSDGSEEWHLVDKSKLSELGRQPLDDGEFWMSFDDFL 319 (612)
T ss_pred hcCccccccEEEEEEEEeecccccceeEEecCCcC-CceeccccccCCcchhhhCHHHHhhcccccccCCCeeeeHHHHH
Confidence 455666663 3333334556789999999998 45999999997776666555555555544478999999999999
Q ss_pred HhhceeeeeeccccCCCCCCCCcCCCCeEEEEEEeEE--eCCCcCCCCCCC
Q psy1391 565 KAFTHLEVVHLDSETSRDEPSLHHKNTWQMRLFQGTW--QRGVTAGGCRNN 613 (614)
Q Consensus 565 ~~~~~L~~~~~~~~~~~~~~~~~~~~~w~~~~~~g~w--~~g~~agg~~~~ 613 (614)
.+|..+.+|....+...... ...|....++|+| .++.++|||+|+
T Consensus 320 ~~F~~~~vC~~~~~~~~~~~----~~~~~~~~~~~~w~~~~~~t~ggc~~~ 366 (612)
T KOG0045|consen 320 REFDSLTVCRLRPDWLESRN----QLQWVKLSLDGEWELARGVTAGGCRNS 366 (612)
T ss_pred hhCCeEeecCCCcchhhhhh----eeeeeeeecCCccceeecccCCCCccC
Confidence 99999999998888764432 2479999999999 889999999986
No 10
>smart00230 CysPc Calpain-like thiol protease family. Calpain-like thiol protease family (peptidase family C2). Calcium activated neutral protease (large subunit).
Probab=98.80 E-value=9.7e-08 Score=100.17 Aligned_cols=87 Identities=34% Similarity=0.701 Sum_probs=62.7
Q ss_pred ccCCC-ee-eeeeeeCCcceEEEEeeCCCCCCcccccCCCCC-CCCccccccchhhhhccccccccceeEeHHHHHHhhc
Q psy1391 492 YIKSC-AC-MEIETYDGEPVQLVKLRNPLGGGGEYIGAWSRD-SSNWDAVSAHDKETLGVRHLADGEFWISYADFLKAFT 568 (614)
Q Consensus 492 ~~~~~-f~-~~~~~~~~~~~~li~l~d~~g~G~Ef~~lW~~~-~~w~~~f~~~d~d~sG~~~~~~~~~~m~f~dFi~~~~ 568 (614)
+..+| ++ ++.+...+....||+||||||. .||+|.|++. ..|.+. ....++..+.-..++|.|||+|+||+.+|.
T Consensus 225 Lv~~HaYsVl~v~~~~~~~~~Ll~lrNPWg~-~eW~G~wsd~s~~W~~~-~~~~~~~l~~~~~~dG~FWM~~~df~~~F~ 302 (318)
T smart00230 225 LVKGHAYSVTDVREVQGRRQELLRLRNPWGQ-VEWNGPWSDDSPEWRSV-SASEKKNLGLTFDDDGEFWMSFEDFLRHFD 302 (318)
T ss_pred cccCccEEEEEEEEEecCCeEEEEEECCCCC-CCcCCCCCCCCcccccc-CHHHHHHhCCCCCCCCEEEEEhHHHHhhCC
Confidence 44455 44 4444445544569999999993 4999999985 566643 222344455444789999999999999999
Q ss_pred eeeeeeccccCC
Q psy1391 569 HLEVVHLDSETS 580 (614)
Q Consensus 569 ~L~~~~~~~~~~ 580 (614)
.+.+|...++.+
T Consensus 303 ~~~vc~~~~~~~ 314 (318)
T smart00230 303 KVEICNLNPDSL 314 (318)
T ss_pred eEEEeccCCccc
Confidence 999998777654
No 11
>PF00648 Peptidase_C2: Calpain family cysteine protease This is family C2 in the peptidase classification. ; InterPro: IPR001300 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of cysteine peptidases belong to the MEROPS peptidase family C2 (calpain family, clan CA). A type example is calpain, which is an intracellular protease involved in many important cellular functions that are regulated by calcium []. The protein is a complex of 2 polypeptide chains (light and heavy), with three known forms in mammals [, ]: a highly calcium-sensitive (i.e., micro-molar range) form known as mu-calpain, mu-CANP or calpain I; a form sensitive to calcium in the milli-molar range, known as m-calpain, m-CANP or calpain II; and a third form, known as p94, which is found in skeletal muscle only []. All forms have identical light but different heavy chains. Both mu- and m-calpain are heterodimers containing an identical 28kDa subunit and an 80kDa subunit that shares 55-65% sequence homology between the two proteases [, ]. The crystallographic structure of m-calpain reveals six "domains" in the 80kDa subunit: A 19-amino acid NH2-terminal sequence; Active site domain IIa; Active site domain IIb. Domain 2 shows low levels of sequence similarity to papain; although the catalytic His has not been located by biochemical means, it is likely that calpain and papain are related []. Domain III; An 18-amino acid extended sequence linking domain III to domain IV; Domain IV, which resembles the penta EF-hand family of polypeptides, binds calcium and regulates activity []. />]. Ca2+-binding causes a rearrangement of the protein backbone, the net effect of which is that a Trp side chain, which acts as a wedge between catalytic domains IIa and IIb in the apo state, moves away from the active site cleft allowing for the proper formation of the catalytic triad []. Calpain-like mRNAs have been identified in other organisms including bacteria, but the molecules encoded by these mRNAs have not been isolated, so little is known about their properties. How calpain activity is regulated in these organisms cells is still unclear In metazoans, the activity of calpain is controlled by a single proteinase inhibitor, calpastatin (IPR001259 from INTERPRO). The calpastatin gene can produce eight or more calpastatin polypeptides ranging from 17 to 85 kDa by use of different promoters and alternative splicing events. The physiological significance of these different calpastatins is unclear, although all bind to three different places on the calpain molecule; binding to at least two of the sites is Ca2+ dependent. The calpains ostensibly participate in a variety of cellular processes including remodelling of cytoskeletal/membrane attachments, different signal transduction pathways, and apoptosis. Deregulated calpain activity following loss of Ca2+ homeostasis results in tissue damage in response to events such as myocardial infarcts, stroke, and brain trauma []. Calpains are a family of cytosolic cysteine proteinases (see PDOC00126 from PROSITEDOC). Members of the calpain family are believed to function in various biological processes, including integrin-mediated cell migration, cytoskeletal remodeling, cell differentiation and apoptosis [, ]. The calpain family includes numerous members from C. elegans to mammals and with homologues in yeast and bacteria. The best characterised members are the m- and mu-calpains, both proteins are heterodimer composed of a large catalytic subunit and a small regulatory subunit. The large subunit comprises four domains (dI-dIV) while the small subunit has two domains (dV-dVI). Domain dI is a short region cleaved by autolysis, dII is the catalytic core, dIII is a C2-like domain, dIV consists of five calcium binding EF-hand motifs []. The crystal structure of calpain has been solved [, ]. The catalytic region consists of two distinct structural domains (dIIa and dIIb). dIIa contains a central helix flanked on three faces by a cluster of alpha-helices and is entirely unrelated to the corresponding domain in the typical thiol proteinases. The fold of dIIb is similar to the corresponding domain in other cysteine proteinases and contains two three-stranded anti-parallel beta-sheets. The catalytic triad residues (C,H,N) are located in dIIa and dIIb. The activation of the domain is dependent on the binding of two calcium atoms in two non EF-hand calcium binding sites located in the catalytic core, one close to the Cys active site in dIIa and one at the end of dIIb. Calcium-binding induced conformational changes in the catalytic domain which align the active site [][]. The profile covers the whole catalytic domain.; GO: 0004198 calcium-dependent cysteine-type endopeptidase activity, 0006508 proteolysis, 0005622 intracellular; PDB: 2NQA_A 1KFU_L 1KFX_L 1QXP_B 2R9C_A 1TL9_A 2G8E_A 1KXR_B 2G8J_A 2NQG_A ....
Probab=98.44 E-value=5.4e-07 Score=93.87 Aligned_cols=84 Identities=35% Similarity=0.627 Sum_probs=55.1
Q ss_pred CCccCCC-ee-eeeee--eCCcceEEEEeeCCCCCCcccccCCCCC-CCCccccccchhhhhccccccccceeEeHHHHH
Q psy1391 490 NDYIKSC-AC-MEIET--YDGEPVQLVKLRNPLGGGGEYIGAWSRD-SSNWDAVSAHDKETLGVRHLADGEFWISYADFL 564 (614)
Q Consensus 490 ~~~~~~~-f~-~~~~~--~~~~~~~li~l~d~~g~G~Ef~~lW~~~-~~w~~~f~~~d~d~sG~~~~~~~~~~m~f~dFi 564 (614)
.++..+| ++ ++.+. ..|....||+||||||. .||+|.|++. ..|.++ ....++..+.-..+++.|||+|+||+
T Consensus 209 ~gl~~~HaY~Vl~~~~~~~~~~~~~lv~LrNPwg~-~~w~G~ws~~s~~W~~~-~~~~~~~~~~~~~~dg~FWM~~~df~ 286 (298)
T PF00648_consen 209 NGLVPGHAYAVLDVREVNGNGEGHRLVKLRNPWGS-TEWKGDWSDDSPEWTEI-HPSLRKRLNQSSSDDGTFWMSFEDFL 286 (298)
T ss_dssp TSBBTTS-EEEEEEEEEEETTEEEEEEEEE-TTSS----SSTTSTTSGGGGGS--HHHHHHHTTTSSSSSEEEEEHHHHH
T ss_pred cCcccceeEEEEEEEeeccccceeEEEEEcCCCcc-ccccccccccccccccC-CHHHHhhcccccccCccHhHhHHHHH
Confidence 3455555 33 23222 23456899999999984 4999999984 567753 23334444433367899999999999
Q ss_pred Hhhceeeeeec
Q psy1391 565 KAFTHLEVVHL 575 (614)
Q Consensus 565 ~~~~~L~~~~~ 575 (614)
.+|..|.+|..
T Consensus 287 ~~F~~i~vc~~ 297 (298)
T PF00648_consen 287 KYFSSIYVCRL 297 (298)
T ss_dssp HHSEEEEEEES
T ss_pred hhCCceEEEee
Confidence 99999999974
No 12
>cd00044 CysPc Calpains, domains IIa, IIb; calcium-dependent cytoplasmic cysteine proteinases, papain-like. Functions in cytoskeletal remodeling processes, cell differentiation, apoptosis and signal transduction.
Probab=98.38 E-value=2.8e-06 Score=89.21 Aligned_cols=64 Identities=42% Similarity=0.839 Sum_probs=49.2
Q ss_pred ceEEEEeeCCCCCCcccccCCCCC-CCCccccccchhhhhccccccccceeEeHHHHHHhhceeeeee
Q psy1391 508 PVQLVKLRNPLGGGGEYIGAWSRD-SSNWDAVSAHDKETLGVRHLADGEFWISYADFLKAFTHLEVVH 574 (614)
Q Consensus 508 ~~~li~l~d~~g~G~Ef~~lW~~~-~~w~~~f~~~d~d~sG~~~~~~~~~~m~f~dFi~~~~~L~~~~ 574 (614)
...||+||||||.+ ||+|.|++. ..|+.. ...+...+.-..++|.|||+|+||+.+|..+.+|.
T Consensus 251 ~~~lv~lrNPWg~~-~w~G~ws~~~~~w~~~--~~~~~~~~~~~~~dG~Fwm~~~df~~~F~~~~vc~ 315 (315)
T cd00044 251 GLRLLRLRNPWGVG-EWWGGWSDDSSEWWVI--DAERKKLLLSGKDDGEFWMSFEDFLRNFDGLYVCN 315 (315)
T ss_pred ceEEEEecCCccCC-CccCCCCCCCchhccC--hHHHHHhcCCCCCCCEEEEEhHHhheeeCeEEEeC
Confidence 38899999999864 999999984 567632 22333333333789999999999999999998874
No 13
>PTZ00184 calmodulin; Provisional
Probab=96.21 E-value=0.00043 Score=63.56 Aligned_cols=82 Identities=16% Similarity=0.247 Sum_probs=60.7
Q ss_pred hhhhHHHHHHHH-hCCCCCCChHHHHHHhhhcCCCCccCCCeeeeeeeeCCcceEEEEeeCCCCCCc----ccccCCCCC
Q psy1391 457 SFSQYEAVFLQL-ADEHRTVNAFELQELLDACLPNDYIKSCACMEIETYDGEPVQLVKLRNPLGGGG----EYIGAWSRD 531 (614)
Q Consensus 457 ~~~~~~~~F~~~-a~~~~~i~~~eLq~~L~~~~~~~~~~~~f~~~~~~~~~~~~~li~l~d~~g~G~----Ef~~lW~~~ 531 (614)
....++..|..+ ++++|.|+..||..+|...... .+.+.+ ..|++.+|.+++|. ||...+...
T Consensus 9 ~~~~~~~~F~~~D~~~~G~i~~~e~~~~l~~~~~~------~~~~~~------~~~~~~~d~~~~g~i~~~ef~~~l~~~ 76 (149)
T PTZ00184 9 QIAEFKEAFSLFDKDGDGTITTKELGTVMRSLGQN------PTEAEL------QDMINEVDADGNGTIDFPEFLTLMARK 76 (149)
T ss_pred HHHHHHHHHHHHcCCCCCcCCHHHHHHHHHHhCCC------CCHHHH------HHHHHhcCcCCCCcCcHHHHHHHHHHh
Confidence 456788999998 7899999999999999653211 223345 77888899888877 787766532
Q ss_pred -------CCCccccccchhhhhcccc
Q psy1391 532 -------SSNWDAVSAHDKETLGVRH 550 (614)
Q Consensus 532 -------~~w~~~f~~~d~d~sG~~~ 550 (614)
..+..+|..+|.+++|.++
T Consensus 77 ~~~~~~~~~~~~~F~~~D~~~~g~i~ 102 (149)
T PTZ00184 77 MKDTDSEEEIKEAFKVFDRDGNGFIS 102 (149)
T ss_pred ccCCcHHHHHHHHHHhhCCCCCCeEe
Confidence 2467889999988888755
No 14
>PTZ00183 centrin; Provisional
Probab=95.68 E-value=0.001 Score=61.86 Aligned_cols=82 Identities=16% Similarity=0.155 Sum_probs=59.8
Q ss_pred hhhhHHHHHHHH-hCCCCCCChHHHHHHhhhcCCCCccCCCeeeeeeeeCCcceEEEEeeCCCCCCc----ccccCCCCC
Q psy1391 457 SFSQYEAVFLQL-ADEHRTVNAFELQELLDACLPNDYIKSCACMEIETYDGEPVQLVKLRNPLGGGG----EYIGAWSRD 531 (614)
Q Consensus 457 ~~~~~~~~F~~~-a~~~~~i~~~eLq~~L~~~~~~~~~~~~f~~~~~~~~~~~~~li~l~d~~g~G~----Ef~~lW~~~ 531 (614)
....++++|..+ .+.+|.|+..||+.+|...... .+.+.+ ..+++..|.+++|. ||..++...
T Consensus 15 ~~~~~~~~F~~~D~~~~G~i~~~e~~~~l~~~g~~------~~~~~~------~~l~~~~d~~~~g~i~~~eF~~~~~~~ 82 (158)
T PTZ00183 15 QKKEIREAFDLFDTDGSGTIDPKELKVAMRSLGFE------PKKEEI------KQMIADVDKDGSGKIDFEEFLDIMTKK 82 (158)
T ss_pred HHHHHHHHHHHhCCCCCCcccHHHHHHHHHHhCCC------CCHHHH------HHHHHHhCCCCCCcEeHHHHHHHHHHH
Confidence 456789999988 5889999999999999754211 223344 66777788888877 776654431
Q ss_pred -------CCCccccccchhhhhcccc
Q psy1391 532 -------SSNWDAVSAHDKETLGVRH 550 (614)
Q Consensus 532 -------~~w~~~f~~~d~d~sG~~~ 550 (614)
..++.+|+.+|.+++|.++
T Consensus 83 ~~~~~~~~~l~~~F~~~D~~~~G~i~ 108 (158)
T PTZ00183 83 LGERDPREEILKAFRLFDDDKTGKIS 108 (158)
T ss_pred hcCCCcHHHHHHHHHHhCCCCCCcCc
Confidence 2578889999999888765
No 15
>KOG0027|consensus
Probab=95.54 E-value=0.0017 Score=60.49 Aligned_cols=100 Identities=20% Similarity=0.262 Sum_probs=74.9
Q ss_pred hhhhHHHHHHHH-hCCCCCCChHHHHHHhhhcCCCCccCCCeeeeeeeeCCcceEEEEeeCCCCCCc----ccccCCCCC
Q psy1391 457 SFSQYEAVFLQL-ADEHRTVNAFELQELLDACLPNDYIKSCACMEIETYDGEPVQLVKLRNPLGGGG----EYIGAWSRD 531 (614)
Q Consensus 457 ~~~~~~~~F~~~-a~~~~~i~~~eLq~~L~~~~~~~~~~~~f~~~~~~~~~~~~~li~l~d~~g~G~----Ef~~lW~~~ 531 (614)
...+++.+|... ++.+|.|+..||..+|...... .+.+.. +.|+.-.|++|+|+ ||..+-...
T Consensus 6 ~~~el~~~F~~fD~d~~G~i~~~el~~~lr~lg~~------~t~~el------~~~~~~~D~dg~g~I~~~eF~~l~~~~ 73 (151)
T KOG0027|consen 6 QILELKEAFQLFDKDGDGKISVEELGAVLRSLGQN------PTEEEL------RDLIKEIDLDGDGTIDFEEFLDLMEKL 73 (151)
T ss_pred HHHHHHHHHHHHCCCCCCcccHHHHHHHHHHcCCC------CCHHHH------HHHHHHhCCCCCCeEcHHHHHHHHHhh
Confidence 457799999999 8999999999999999764432 333334 66777788888888 776654421
Q ss_pred -----------CCCccccccchhhhhcccccc---------------------------ccceeEeHHHHHHhhc
Q psy1391 532 -----------SSNWDAVSAHDKETLGVRHLA---------------------------DGEFWISYADFLKAFT 568 (614)
Q Consensus 532 -----------~~w~~~f~~~d~d~sG~~~~~---------------------------~~~~~m~f~dFi~~~~ 568 (614)
..-+.+|+.+|+|++|+|+.. ++++.++|++|+..+.
T Consensus 74 ~~~~~~~~~~~~el~eaF~~fD~d~~G~Is~~el~~~l~~lg~~~~~~e~~~mi~~~d~d~dg~i~f~ef~~~m~ 148 (151)
T KOG0027|consen 74 GEEKTDEEASSEELKEAFRVFDKDGDGFISASELKKVLTSLGEKLTDEECKEMIREVDVDGDGKVNFEEFVKMMS 148 (151)
T ss_pred hcccccccccHHHHHHHHHHHccCCCCcCcHHHHHHHHHHhCCcCCHHHHHHHHHhcCCCCCCeEeHHHHHHHHh
Confidence 156889999999999998811 1277888998888764
No 16
>KOG0037|consensus
Probab=94.66 E-value=0.0028 Score=61.25 Aligned_cols=90 Identities=26% Similarity=0.314 Sum_probs=67.2
Q ss_pred hhHHHHHHHH-hCCCCCCChHHHHHHhhhcCCCCccCCCeeeeeeeeCCcceEEEEeeCCCCCCc----ccccCCCCCCC
Q psy1391 459 SQYEAVFLQL-ADEHRTVNAFELQELLDACLPNDYIKSCACMEIETYDGEPVQLVKLRNPLGGGG----EYIGAWSRDSS 533 (614)
Q Consensus 459 ~~~~~~F~~~-a~~~~~i~~~eLq~~L~~~~~~~~~~~~f~~~~~~~~~~~~~li~l~d~~g~G~----Ef~~lW~~~~~ 533 (614)
.+.+++|..+ .|.+|.|+..||+++|.+..- .++.+.. ..||+-+|+-++|+ +|..+--.+..
T Consensus 124 ~~Wr~vF~~~D~D~SG~I~~sEL~~Al~~~Gy------~Lspq~~------~~lv~kyd~~~~g~i~FD~FI~ccv~L~~ 191 (221)
T KOG0037|consen 124 NQWRNVFRTYDRDRSGTIDSSELRQALTQLGY------RLSPQFY------NLLVRKYDRFGGGRIDFDDFIQCCVVLQR 191 (221)
T ss_pred HHHHHHHHhcccCCCCcccHHHHHHHHHHcCc------CCCHHHH------HHHHHHhccccCCceeHHHHHHHHHHHHH
Confidence 4567788888 689999999999999976321 1344444 66777788776666 66666555566
Q ss_pred CccccccchhhhhccccccccceeEeHHHHHHhh
Q psy1391 534 NWDAVSAHDKETLGVRHLADGEFWISYADFLKAF 567 (614)
Q Consensus 534 w~~~f~~~d~d~sG~~~~~~~~~~m~f~dFi~~~ 567 (614)
..+.|++.|.+..|.+. ++|+||+.+.
T Consensus 192 lt~~Fr~~D~~q~G~i~-------~~y~dfl~~t 218 (221)
T KOG0037|consen 192 LTEAFRRRDTAQQGSIT-------ISYDDFLQMT 218 (221)
T ss_pred HHHHHHHhccccceeEE-------EeHHHHHHHh
Confidence 78889999999888765 9999999864
No 17
>COG5126 FRQ1 Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only]
Probab=92.77 E-value=0.012 Score=54.93 Aligned_cols=99 Identities=20% Similarity=0.279 Sum_probs=70.6
Q ss_pred hhhhHHHHHHHH-hCCCCCCChHHHHHHhhhcCCCCccCCCeeeeeeeeCCcceEEEEeeCCCCCCc----ccccCCCC-
Q psy1391 457 SFSQYEAVFLQL-ADEHRTVNAFELQELLDACLPNDYIKSCACMEIETYDGEPVQLVKLRNPLGGGG----EYIGAWSR- 530 (614)
Q Consensus 457 ~~~~~~~~F~~~-a~~~~~i~~~eLq~~L~~~~~~~~~~~~f~~~~~~~~~~~~~li~l~d~~g~G~----Ef~~lW~~- 530 (614)
...++++.|... .+.+|.|+-.+|..+|. .++.. .+.+.. ..|+.-.|. |++. ||..+.+.
T Consensus 18 qi~~lkeaF~l~D~d~~G~I~~~el~~ilr-~lg~~-----~s~~ei------~~l~~~~d~-~~~~idf~~Fl~~ms~~ 84 (160)
T COG5126 18 QIQELKEAFQLFDRDSDGLIDRNELGKILR-SLGFN-----PSEAEI------NKLFEEIDA-GNETVDFPEFLTVMSVK 84 (160)
T ss_pred HHHHHHHHHHHhCcCCCCCCcHHHHHHHHH-HcCCC-----CcHHHH------HHHHHhccC-CCCccCHHHHHHHHHHH
Confidence 457899999999 68999999999999996 33222 333333 455666666 5555 66555443
Q ss_pred ------CCCCccccccchhhhhcccccc---------------------------ccceeEeHHHHHHhhc
Q psy1391 531 ------DSSNWDAVSAHDKETLGVRHLA---------------------------DGEFWISYADFLKAFT 568 (614)
Q Consensus 531 ------~~~w~~~f~~~d~d~sG~~~~~---------------------------~~~~~m~f~dFi~~~~ 568 (614)
..+....|+.||+|++|.++.. ++++.|+|++|+..+.
T Consensus 85 ~~~~~~~Eel~~aF~~fD~d~dG~Is~~eL~~vl~~lge~~~deev~~ll~~~d~d~dG~i~~~eF~~~~~ 155 (160)
T COG5126 85 LKRGDKEEELREAFKLFDKDHDGYISIGELRRVLKSLGERLSDEEVEKLLKEYDEDGDGEIDYEEFKKLIK 155 (160)
T ss_pred hccCCcHHHHHHHHHHhCCCCCceecHHHHHHHHHhhcccCCHHHHHHHHHhcCCCCCceEeHHHHHHHHh
Confidence 1357889999999999999811 1278899999998654
No 18
>PF13499 EF-hand_7: EF-hand domain pair; PDB: 1TCF_A 2TN4_A 1TN4_A 1A2X_A 2CT9_B 2OTG_B 2OS8_B 1SNL_A 3O4Y_A 3J04_E ....
Probab=91.68 E-value=0.046 Score=42.85 Aligned_cols=29 Identities=17% Similarity=0.209 Sum_probs=25.6
Q ss_pred hHHHHHHHH-hCCCCCCChHHHHHHhhhcC
Q psy1391 460 QYEAVFLQL-ADEHRTVNAFELQELLDACL 488 (614)
Q Consensus 460 ~~~~~F~~~-a~~~~~i~~~eLq~~L~~~~ 488 (614)
.++++|..+ .+++|.|+..||+.++....
T Consensus 1 ~l~~~F~~~D~d~~G~i~~~el~~~~~~~~ 30 (66)
T PF13499_consen 1 RLKEAFKKFDKDGDGYISKEELRRALKHLG 30 (66)
T ss_dssp HHHHHHHHHSTTSSSEEEHHHHHHHHHHTT
T ss_pred CHHHHHHHHcCCccCCCCHHHHHHHHHHhc
Confidence 378899999 79999999999999997754
No 19
>PF00036 EF-hand_1: EF hand; InterPro: IPR018248 Many calcium-binding proteins belong to the same evolutionary family and share a type of calcium-binding domain known as the EF-hand. This type of domain consists of a twelve residue loop flanked on both sides by a twelve residue alpha-helical domain. In an EF-hand loop the calcium ion is coordinated in a pentagonal bipyramidal configuration. The six residues involved in the binding are in positions 1, 3, 5, 7, 9 and 12; these residues are denoted by X, Y, Z, -Y, -X and -Z. The invariant Glu or Asp at position 12 provides two oxygens for liganding Ca (bidentate ligand).; PDB: 1BJF_A 1XFW_R 1XFV_O 2K0J_A 2F3Z_A 3BYA_A 1XFU_Q 2R28_B 1ZOT_B 3G43_D ....
Probab=91.09 E-value=0.05 Score=35.65 Aligned_cols=28 Identities=25% Similarity=0.392 Sum_probs=22.9
Q ss_pred CccccccchhhhhccccccccceeEeHHHHHHhhcee
Q psy1391 534 NWDAVSAHDKETLGVRHLADGEFWISYADFLKAFTHL 570 (614)
Q Consensus 534 w~~~f~~~d~d~sG~~~~~~~~~~m~f~dFi~~~~~L 570 (614)
++.+|+.+|+|++|. |++++|+..+..|
T Consensus 2 ~~~~F~~~D~d~dG~---------I~~~Ef~~~~~~L 29 (29)
T PF00036_consen 2 LKEAFREFDKDGDGK---------IDFEEFKEMMKKL 29 (29)
T ss_dssp HHHHHHHHSTTSSSE---------EEHHHHHHHHHHT
T ss_pred HHHHHHHHCCCCCCc---------CCHHHHHHHHHhC
Confidence 467899999999887 8899999887543
No 20
>cd05026 S-100Z S-100Z: S-100Z domain found in proteins similar to S100Z. S100Z is a member of the S100 domain family within the EF-hand Ca2+-binding proteins superfamily. Note that the S-100 hierarchy, to which this S-100Z group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately.S100 proteins exhibit unique patterns of tissue- and cell type-specific expression and have been implicated in the Ca2+-dependent regulation of diverse physiological processes, including cell cycle regulation, differentiation, growth, and metabolic control. S100Z is normally expressed in various tissues, with its highest level of expression being in spleen and leukocytes. The function of S100Z remains unclear. Preliminary structural data suggests that S100Z is homodimer, however a heterodimer with S100P has been reported. S100Z is capable of binding calcium ions. When calcium binds to S110Z, the protein experiences a conformational change, which exposes hydrophobic surfac
Probab=90.38 E-value=0.093 Score=44.62 Aligned_cols=74 Identities=16% Similarity=0.190 Sum_probs=51.0
Q ss_pred hhhHHHHHHHHhC--CCC-CCChHHHHHHhhhcCCCCccCCCeeeeeeeeCCcceEEEEeeCCCCCCcccccCCCCCCCC
Q psy1391 458 FSQYEAVFLQLAD--EHR-TVNAFELQELLDACLPNDYIKSCACMEIETYDGEPVQLVKLRNPLGGGGEYIGAWSRDSSN 534 (614)
Q Consensus 458 ~~~~~~~F~~~a~--~~~-~i~~~eLq~~L~~~~~~~~~~~~f~~~~~~~~~~~~~li~l~d~~g~G~Ef~~lW~~~~~w 534 (614)
...++.+|.++++ .+| .|+..||+.+|...+...+ + +.. ....|
T Consensus 9 ~~~~~~~F~~~dd~dgdg~~Is~~EL~~ll~~~~~~~~---------~-------------~~~-----------~~~~v 55 (93)
T cd05026 9 MDTLIRIFHNYSGKEGDRYKLSKGELKELLQRELTDFL---------S-------------SQK-----------DPMLV 55 (93)
T ss_pred HHHHHHHHHHHHccCCCCCEECHHHHHHHHHHHhHHhc---------c-------------ccc-----------CHHHH
Confidence 4567888999985 556 5999999999965322110 0 000 11357
Q ss_pred ccccccchhhhhccccccccceeEeHHHHHHhhceeeee
Q psy1391 535 WDAVSAHDKETLGVRHLADGEFWISYADFLKAFTHLEVV 573 (614)
Q Consensus 535 ~~~f~~~d~d~sG~~~~~~~~~~m~f~dFi~~~~~L~~~ 573 (614)
..+++..|.++.|. |+|++|+..+..|-++
T Consensus 56 ~~i~~elD~n~dG~---------Idf~EF~~l~~~l~~~ 85 (93)
T cd05026 56 DKIMNDLDSNKDNE---------VDFNEFVVLVAALTVA 85 (93)
T ss_pred HHHHHHhCCCCCCC---------CCHHHHHHHHHHHHHH
Confidence 78888888887765 8899999998887554
No 21
>PF13405 EF-hand_6: EF-hand domain; PDB: 2AMI_A 3QRX_A 1W7J_B 1OE9_B 1W7I_B 1KFU_S 1KFX_S 2BL0_B 1Y1X_B 3MSE_B ....
Probab=90.24 E-value=0.29 Score=32.30 Aligned_cols=26 Identities=19% Similarity=0.300 Sum_probs=23.7
Q ss_pred hHHHHHHHH-hCCCCCCChHHHHHHhh
Q psy1391 460 QYEAVFLQL-ADEHRTVNAFELQELLD 485 (614)
Q Consensus 460 ~~~~~F~~~-a~~~~~i~~~eLq~~L~ 485 (614)
+++.+|..+ .+++|.|+..||+.+|.
T Consensus 1 ~l~~~F~~~D~d~dG~I~~~el~~~l~ 27 (31)
T PF13405_consen 1 RLREAFKMFDKDGDGFIDFEELRAILR 27 (31)
T ss_dssp HHHHHHHHH-TTSSSEEEHHHHHHHHH
T ss_pred CHHHHHHHHCCCCCCcCcHHHHHHHHH
Confidence 478899999 78999999999999997
No 22
>PF13202 EF-hand_5: EF hand; PDB: 3DD4_A 2Q4U_A 2BE4_A 1UHJ_B 1UHI_A 1UHH_B 1EJ3_B 1UHK_A 2ZFD_A 1UHN_A ....
Probab=89.88 E-value=0.32 Score=30.61 Aligned_cols=24 Identities=25% Similarity=0.382 Sum_probs=21.0
Q ss_pred HHHHHHHH-hCCCCCCChHHHHHHh
Q psy1391 461 YEAVFLQL-ADEHRTVNAFELQELL 484 (614)
Q Consensus 461 ~~~~F~~~-a~~~~~i~~~eLq~~L 484 (614)
++++|.++ .+.+|.|+..|+++++
T Consensus 1 l~~~F~~~D~d~DG~is~~E~~~~~ 25 (25)
T PF13202_consen 1 LKDAFQQFDTDGDGKISFEEFQRLV 25 (25)
T ss_dssp HHHHHHHHTTTSSSEEEHHHHHHHH
T ss_pred CHHHHHHHcCCCCCcCCHHHHHHHC
Confidence 46789999 8999999999999864
No 23
>cd05023 S-100A11 S-100A11: S-100A11 domain found in proteins similar to S100A11. S100A11 is a member of the S-100 domain family within EF-hand Ca2+-binding proteins superfamily. Note that the S-100 hierarchy, to which this S-100A11 group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100 proteins exhibit unique patterns of tissue- and cell type-specific expression and have been implicated in the Ca2+-dependent regulation of diverse physiological processes, including cell cycle regulation, differentiation, growth, and metabolic control . S100 proteins have also been associated with a variety of pathological events, including neoplastic transformation and neurodegenerative diseases such as Alzheimer's, usually via over expression of the protein. S100A11 is expressed in smooth muscle and other tissues and involves in calcium-dependent membrane aggregation, which is important for cell vesiculation . As is the case for many other S100 proteins, S
Probab=88.99 E-value=0.15 Score=42.99 Aligned_cols=75 Identities=19% Similarity=0.229 Sum_probs=50.9
Q ss_pred hhhhHHHHHHHHhCCCC---CCChHHHHHHhhhcCCCCccCCCeeeeeeeeCCcceEEEEeeCCCCCCcccccCCCCCCC
Q psy1391 457 SFSQYEAVFLQLADEHR---TVNAFELQELLDACLPNDYIKSCACMEIETYDGEPVQLVKLRNPLGGGGEYIGAWSRDSS 533 (614)
Q Consensus 457 ~~~~~~~~F~~~a~~~~---~i~~~eLq~~L~~~~~~~~~~~~f~~~~~~~~~~~~~li~l~d~~g~G~Ef~~lW~~~~~ 533 (614)
....+..+|.++++.+| .|+..||+.+|...+++- +. ++- +...
T Consensus 7 ~i~~l~~~F~~y~~~dg~~~~Ls~~Elk~ll~~e~~~~--------------------~~--~~~-----------~~~~ 53 (89)
T cd05023 7 CIESLIAVFQKYAGKDGDSYQLSKTEFLSFMNTELASF--------------------TK--NQK-----------DPGV 53 (89)
T ss_pred HHHHHHHHHHHHhccCCCcCeECHHHHHHHHHHhhhHh--------------------hc--CCC-----------CHHH
Confidence 35678999999875543 799999999997644320 00 000 0124
Q ss_pred CccccccchhhhhccccccccceeEeHHHHHHhhceeeee
Q psy1391 534 NWDAVSAHDKETLGVRHLADGEFWISYADFLKAFTHLEVV 573 (614)
Q Consensus 534 w~~~f~~~d~d~sG~~~~~~~~~~m~f~dFi~~~~~L~~~ 573 (614)
|..+++..|.|+.|. |+|++|+..+..|-++
T Consensus 54 ~~~ll~~~D~d~DG~---------I~f~EF~~l~~~l~~~ 84 (89)
T cd05023 54 LDRMMKKLDLNSDGQ---------LDFQEFLNLIGGLAVA 84 (89)
T ss_pred HHHHHHHcCCCCCCc---------CcHHHHHHHHHHHHHH
Confidence 666777778877665 8899999988877543
No 24
>cd05024 S-100A10 S-100A10: A subgroup of the S-100A10 domain found in proteins similar to S100A10. S100A10 is a member of the S100 family of EF-hand superfamily of calcium-binding proteins. Note that the S-100 hierarchy, to which this S-100A10 group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100 proteins are expressed exclusively in vertebrates, and are implicated in intracellular and extracellular regulatory activities. A unique feature of S100A10 is that it contains mutation in both of the calcium binding sites, making it calcium insensitive. S100A10 has been detected in brain, heart, gastrointestinal tract, kidney, liver, lung, spleen, testes, epidermis, aorta, and thymus. Structural data supports the homo- and hetero-dimeric as well as hetero-tetrameric nature of the protein. S100A10 has multiple binding partners in its calcium free state and is therefore involved in many diverse biological functions.
Probab=88.60 E-value=0.057 Score=45.55 Aligned_cols=75 Identities=16% Similarity=0.248 Sum_probs=52.2
Q ss_pred hhhHHHHHHHHhCCCCCCChHHHHHHhhhcCCCCccCCCeeeeeeeeCCcceEEEEeeCCCCCCcccccCCCCCCCCccc
Q psy1391 458 FSQYEAVFLQLADEHRTVNAFELQELLDACLPNDYIKSCACMEIETYDGEPVQLVKLRNPLGGGGEYIGAWSRDSSNWDA 537 (614)
Q Consensus 458 ~~~~~~~F~~~a~~~~~i~~~eLq~~L~~~~~~~~~~~~f~~~~~~~~~~~~~li~l~d~~g~G~Ef~~lW~~~~~w~~~ 537 (614)
...+..+|.++||..+.++-.||+++|++.+++=+.. --||. .-..+
T Consensus 7 i~~lI~~FhkYaG~~~tLsk~Elk~Ll~~Elp~~l~~-------------------~~d~~--------------~vd~i 53 (91)
T cd05024 7 MEKMMLTFHKFAGEKNYLNRDDLQKLMEKEFSEFLKN-------------------QNDPM--------------AVDKI 53 (91)
T ss_pred HHHHHHHHHHHcCCCCcCCHHHHHHHHHHHhHHHHcC-------------------CCCHH--------------HHHHH
Confidence 4568899999999999999999999998876421100 01121 23455
Q ss_pred cccchhhhhccccccccceeEeHHHHHHhhceeeeee
Q psy1391 538 VSAHDKETLGVRHLADGEFWISYADFLKAFTHLEVVH 574 (614)
Q Consensus 538 f~~~d~d~sG~~~~~~~~~~m~f~dFi~~~~~L~~~~ 574 (614)
+..-|.++.|. ++|++|+.....|.+++
T Consensus 54 m~~LD~n~Dg~---------vdF~EF~~Lv~~l~~ac 81 (91)
T cd05024 54 MKDLDDCRDGK---------VGFQSFFSLIAGLLIAC 81 (91)
T ss_pred HHHhCCCCCCc---------CcHHHHHHHHHHHHHHH
Confidence 55567776655 78999998888876543
No 25
>cd05022 S-100A13 S-100A13: S-100A13 domain found in proteins similar to S100A13. S100A13 is a calcium-binding protein belonging to a large S100 vertebrate-specific protein family within the EF-hand superfamily of calcium-binding proteins. Note that the S-100 hierarchy, to which this S-100A13 group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100A13 is involved in the cellular export of interleukin-1 (IL-1) and of fibroblast growth factor-1 (FGF-1), which plays an important role in angiogenesis and tissue regeneration. Export is based on the CuII-dependent formation of multiprotein complexes containing the S100A13 protein. Assembly of these complexes occurs near the inner surface of the plasma membrane. Binding of two Ca(II) ions per monomer triggers key conformational changes leading to the creation of two identical and symmetrical Cu(II)-binding sites on the surface of the protein, close to the interface between the two monomers. These Cu
Probab=86.92 E-value=0.15 Score=42.97 Aligned_cols=31 Identities=26% Similarity=0.289 Sum_probs=26.1
Q ss_pred hhhHHHHHHHHhC--CCCCCChHHHHHHhhhcC
Q psy1391 458 FSQYEAVFLQLAD--EHRTVNAFELQELLDACL 488 (614)
Q Consensus 458 ~~~~~~~F~~~a~--~~~~i~~~eLq~~L~~~~ 488 (614)
-..++.+|..++. .+|.|+..||+.+|.+.+
T Consensus 7 i~~l~~~F~~fd~~~~~g~i~~~ELk~ll~~el 39 (89)
T cd05022 7 IETLVSNFHKASVKGGKESLTASEFQELLTQQL 39 (89)
T ss_pred HHHHHHHHHHHhCCCCCCeECHHHHHHHHHHHh
Confidence 4578999999964 889999999999997633
No 26
>PF00036 EF-hand_1: EF hand; InterPro: IPR018248 Many calcium-binding proteins belong to the same evolutionary family and share a type of calcium-binding domain known as the EF-hand. This type of domain consists of a twelve residue loop flanked on both sides by a twelve residue alpha-helical domain. In an EF-hand loop the calcium ion is coordinated in a pentagonal bipyramidal configuration. The six residues involved in the binding are in positions 1, 3, 5, 7, 9 and 12; these residues are denoted by X, Y, Z, -Y, -X and -Z. The invariant Glu or Asp at position 12 provides two oxygens for liganding Ca (bidentate ligand).; PDB: 1BJF_A 1XFW_R 1XFV_O 2K0J_A 2F3Z_A 3BYA_A 1XFU_Q 2R28_B 1ZOT_B 3G43_D ....
Probab=85.49 E-value=0.91 Score=29.64 Aligned_cols=27 Identities=15% Similarity=0.372 Sum_probs=23.8
Q ss_pred hHHHHHHHH-hCCCCCCChHHHHHHhhh
Q psy1391 460 QYEAVFLQL-ADEHRTVNAFELQELLDA 486 (614)
Q Consensus 460 ~~~~~F~~~-a~~~~~i~~~eLq~~L~~ 486 (614)
+++.+|..+ .|++|.|+..|+..+|.+
T Consensus 1 E~~~~F~~~D~d~dG~I~~~Ef~~~~~~ 28 (29)
T PF00036_consen 1 ELKEAFREFDKDGDGKIDFEEFKEMMKK 28 (29)
T ss_dssp HHHHHHHHHSTTSSSEEEHHHHHHHHHH
T ss_pred CHHHHHHHHCCCCCCcCCHHHHHHHHHh
Confidence 468899999 899999999999999853
No 27
>KOG0027|consensus
Probab=83.99 E-value=0.24 Score=46.02 Aligned_cols=54 Identities=19% Similarity=0.183 Sum_probs=45.5
Q ss_pred hhhhHHHHHHHH-hCCCCCCChHHHHHHhhhcCCCCccCCCeeeeeeeeCCcceEEEEeeCCCCCCc
Q psy1391 457 SFSQYEAVFLQL-ADEHRTVNAFELQELLDACLPNDYIKSCACMEIETYDGEPVQLVKLRNPLGGGG 522 (614)
Q Consensus 457 ~~~~~~~~F~~~-a~~~~~i~~~eLq~~L~~~~~~~~~~~~f~~~~~~~~~~~~~li~l~d~~g~G~ 522 (614)
...+++.+|..+ .+.+|.|++.||+.+|.....+ .+.+.| ..||+-.|.+|+|+
T Consensus 83 ~~~el~eaF~~fD~d~~G~Is~~el~~~l~~lg~~------~~~~e~------~~mi~~~d~d~dg~ 137 (151)
T KOG0027|consen 83 SSEELKEAFRVFDKDGDGFISASELKKVLTSLGEK------LTDEEC------KEMIREVDVDGDGK 137 (151)
T ss_pred cHHHHHHHHHHHccCCCCcCcHHHHHHHHHHhCCc------CCHHHH------HHHHHhcCCCCCCe
Confidence 356899999999 7899999999999999775432 557777 88999999999886
No 28
>COG5126 FRQ1 Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only]
Probab=80.30 E-value=0.36 Score=45.20 Aligned_cols=62 Identities=18% Similarity=0.130 Sum_probs=50.4
Q ss_pred hhhhHHHHHHHH-hCCCCCCChHHHHHHhhhcCCCCccCCCeeeeeeeeCCcceEEEEeeCCCCCCc----ccccCCCC
Q psy1391 457 SFSQYEAVFLQL-ADEHRTVNAFELQELLDACLPNDYIKSCACMEIETYDGEPVQLVKLRNPLGGGG----EYIGAWSR 530 (614)
Q Consensus 457 ~~~~~~~~F~~~-a~~~~~i~~~eLq~~L~~~~~~~~~~~~f~~~~~~~~~~~~~li~l~d~~g~G~----Ef~~lW~~ 530 (614)
..++++..|+-+ -+++|.|+..||..+|..... ..+.+.+ ..||+.+|++++|. ||..++.+
T Consensus 90 ~~Eel~~aF~~fD~d~dG~Is~~eL~~vl~~lge------~~~deev------~~ll~~~d~d~dG~i~~~eF~~~~~~ 156 (160)
T COG5126 90 KEEELREAFKLFDKDHDGYISIGELRRVLKSLGE------RLSDEEV------EKLLKEYDEDGDGEIDYEEFKKLIKD 156 (160)
T ss_pred cHHHHHHHHHHhCCCCCceecHHHHHHHHHhhcc------cCCHHHH------HHHHHhcCCCCCceEeHHHHHHHHhc
Confidence 567899999999 689999999999999974322 1566777 88999999999998 77776654
No 29
>cd02619 Peptidase_C1 C1 Peptidase family (MEROPS database nomenclature), also referred to as the papain family; composed of two subfamilies of cysteine peptidases (CPs), C1A (papain) and C1B (bleomycin hydrolase). Papain-like enzymes are mostly endopeptidases with some exceptions like cathepsins B, C, H and X, which are exopeptidases. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds while mammalian CPs are primarily lysosomal enzymes responsible for protein degradation in the lysosome. Papain-like CPs are synthesized as inactive proenzymes with N-terminal propeptide regions, which are removed upon activation. Bleomycin hydrolase (BH) is a CP that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. It forms a hexameric ring barrel str
Probab=79.27 E-value=5.6 Score=38.76 Aligned_cols=51 Identities=22% Similarity=0.231 Sum_probs=37.4
Q ss_pred CccccceEeeeeEEEEEccCCceeEEEEeeCCCCCCCCccccCCCCCCCcccCCHHHHhhhCccccCCceEeeehHHHHh
Q psy1391 195 GIQIGINYRLYALERIETYDGEPVQLVKLRNPLGGGGEYIGAWSRDSSNWDAVSAHDKETLGVRHLADGEFWISYADFLK 274 (614)
Q Consensus 195 GL~~~HaY~v~~~~~v~~~~g~~~~LlrlrNPwg~~~ew~G~wsd~s~~W~~~~~~~~~~l~~~~~~dG~FWm~~~df~~ 274 (614)
....+||-+|++...-.. ++.....+||.||. . | .++|-|||+++++..
T Consensus 168 ~~~~~Hav~ivGy~~~~~---~~~~~~i~~NSwG~--~-----------w---------------g~~Gy~~i~~~~~~~ 216 (223)
T cd02619 168 GDLGGHAVVIVGYDDNYV---EGKGAFIVKNSWGT--D-----------W---------------GDNGYGRISYEDVYE 216 (223)
T ss_pred CccCCeEEEEEeecCCCC---CCCCEEEEEeCCCC--c-----------c---------------ccCCEEEEehhhhhh
Confidence 445799999999864211 13578899999997 1 2 358999999999985
Q ss_pred cc
Q psy1391 275 AF 276 (614)
Q Consensus 275 ~F 276 (614)
++
T Consensus 217 ~~ 218 (223)
T cd02619 217 MT 218 (223)
T ss_pred hh
Confidence 44
No 30
>cd05029 S-100A6 S-100A6: S-100A6 domain found in proteins similar to S100A6. S100A6 is a member of the S100 domain family within EF-hand Ca2+-binding proteins superfamily. Note that the S-100 hierarchy, to which this S-100A6 group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100 proteins exhibit unique patterns of tissue- and cell type-specific expression and have been implicated in the Ca2+-dependent regulation of diverse physiological processes, including cell cycle regulation, differentiation, growth, and metabolic control . S100A6 is normally expressed in the G1 phase of the cell cycle in neuronal cells. The function of S100A6 remains unclear, but evidence suggests that it is involved in cell cycle regulation and exocytosis. S100A6 may also be involved in tumorigenesis; the protein is overexpressed in several tumors. Ca2+ binding to S100A6 leads to a conformational change in the protein, which exposes a hydrophobic surface for interact
Probab=77.76 E-value=0.57 Score=39.38 Aligned_cols=30 Identities=23% Similarity=0.391 Sum_probs=24.6
Q ss_pred hhhhHHHHHHHHhCC---CCCCChHHHHHHhhh
Q psy1391 457 SFSQYEAVFLQLADE---HRTVNAFELQELLDA 486 (614)
Q Consensus 457 ~~~~~~~~F~~~a~~---~~~i~~~eLq~~L~~ 486 (614)
.-..+..+|.+++++ +|.|+..||.++|..
T Consensus 8 ~~~~~i~~F~~y~~~~~~~g~Is~~EL~~~l~~ 40 (88)
T cd05029 8 AIGLLVAIFHKYSGREGDKNTLSKKELKELIQK 40 (88)
T ss_pred HHHHHHHHHHHHHccCCCCCEECHHHHHHHHHH
Confidence 346788999999764 569999999999953
No 31
>KOG0034|consensus
Probab=77.66 E-value=0.66 Score=44.76 Aligned_cols=98 Identities=18% Similarity=0.227 Sum_probs=60.2
Q ss_pred hhhhHHHHHHHH-hC-CCCCCChHHHHHHhhhcCCCCccCCCeeeeeeeeCCcceEEEEeeCCCCCCc-----ccccCCC
Q psy1391 457 SFSQYEAVFLQL-AD-EHRTVNAFELQELLDACLPNDYIKSCACMEIETYDGEPVQLVKLRNPLGGGG-----EYIGAWS 529 (614)
Q Consensus 457 ~~~~~~~~F~~~-a~-~~~~i~~~eLq~~L~~~~~~~~~~~~f~~~~~~~~~~~~~li~l~d~~g~G~-----Ef~~lW~ 529 (614)
.-..+...|.++ .+ ..|.|+..|++.|..-.. +. |. ..+++++++.++|- ||...-+
T Consensus 31 EI~~L~~rF~kl~~~~~~g~lt~eef~~i~~~~~-Np-----~~----------~rI~~~f~~~~~~~~v~F~~Fv~~ls 94 (187)
T KOG0034|consen 31 EIERLYERFKKLDRNNGDGYLTKEEFLSIPELAL-NP-----LA----------DRIIDRFDTDGNGDPVDFEEFVRLLS 94 (187)
T ss_pred HHHHHHHHHHHhccccccCccCHHHHHHHHHHhc-Cc-----HH----------HHHHHHHhccCCCCccCHHHHHHHHh
Confidence 446678888998 45 789999999999983221 11 11 33455555554432 4544433
Q ss_pred C--CCCCcc-----ccccchhhhhccccc----------------------------------cccceeEeHHHHHHhhc
Q psy1391 530 R--DSSNWD-----AVSAHDKETLGVRHL----------------------------------ADGEFWISYADFLKAFT 568 (614)
Q Consensus 530 ~--~~~w~~-----~f~~~d~d~sG~~~~----------------------------------~~~~~~m~f~dFi~~~~ 568 (614)
- ...-.. +|+-||.+++|.|+. .|.++.|+|++|...+.
T Consensus 95 ~f~~~~~~~~Kl~faF~vYD~~~~G~I~reel~~iv~~~~~~~~~~~~e~~~~i~d~t~~e~D~d~DG~IsfeEf~~~v~ 174 (187)
T KOG0034|consen 95 VFSPKASKREKLRFAFRVYDLDGDGFISREELKQILRMMVGENDDMSDEQLEDIVDKTFEEADTDGDGKISFEEFCKVVE 174 (187)
T ss_pred hhcCCccHHHHHHHHHHHhcCCCCCcCcHHHHHHHHHHHHccCCcchHHHHHHHHHHHHHHhCCCCCCcCcHHHHHHHHH
Confidence 2 112222 677777777777661 11178899999998765
Q ss_pred ee
Q psy1391 569 HL 570 (614)
Q Consensus 569 ~L 570 (614)
+-
T Consensus 175 ~~ 176 (187)
T KOG0034|consen 175 KQ 176 (187)
T ss_pred cC
Confidence 54
No 32
>cd05027 S-100B S-100B: S-100B domain found in proteins similar to S100B. S100B is a calcium-binding protein belonging to a large S100 vertebrate-specific protein family within the EF-hand superfamily of calcium-binding proteins. Note that the S-100 hierarchy, to which this S-100B group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100B is most abundant in glial cells of the central nervous system, predominately in astrocytes. S100B is involved in signal transduction via the inhibition of protein phoshorylation, regulation of enzyme activity and by affecting the calcium homeostasis. Upon calcium binding the S100B homodimer changes conformation to expose a hydrophobic cleft, which represents the interaction site of S100B with its more than 20 known target proteins. These target proteins include several cellular architecture proteins such as tubulin and GFAP; S100B can inhibit polymerization of these oligomeric molecules. Furthermore, S100B i
Probab=74.16 E-value=0.76 Score=38.61 Aligned_cols=31 Identities=26% Similarity=0.395 Sum_probs=25.8
Q ss_pred hhhHHHHHHHHh--CCCC-CCChHHHHHHhhhcC
Q psy1391 458 FSQYEAVFLQLA--DEHR-TVNAFELQELLDACL 488 (614)
Q Consensus 458 ~~~~~~~F~~~a--~~~~-~i~~~eLq~~L~~~~ 488 (614)
...++.+|..++ +.+| .|+..||+.+|.+.+
T Consensus 7 ~~~l~~aF~~fD~~dgdG~~I~~~eL~~ll~~~~ 40 (88)
T cd05027 7 MVALIDVFHQYSGREGDKHKLKKSELKELINNEL 40 (88)
T ss_pred HHHHHHHHHHhcccCCCcCEECHHHHHHHHHHHh
Confidence 467899999997 5678 599999999997643
No 33
>smart00027 EH Eps15 homology domain. Pair of EF hand motifs that recognise proteins containing Asn-Pro-Phe (NPF) sequences.
Probab=73.99 E-value=0.18 Score=43.03 Aligned_cols=73 Identities=15% Similarity=0.090 Sum_probs=49.6
Q ss_pred hhhhHHHHHHHH-hCCCCCCChHHHHHHhhhcCCCCccCCCeeeeeeeeCCcceEEEEeeCCCCCCcccccCCCCCCCCc
Q psy1391 457 SFSQYEAVFLQL-ADEHRTVNAFELQELLDACLPNDYIKSCACMEIETYDGEPVQLVKLRNPLGGGGEYIGAWSRDSSNW 535 (614)
Q Consensus 457 ~~~~~~~~F~~~-a~~~~~i~~~eLq~~L~~~~~~~~~~~~f~~~~~~~~~~~~~li~l~d~~g~G~Ef~~lW~~~~~w~ 535 (614)
....++.+|..+ .+++|.|+..||..+|... +++.+ ...
T Consensus 8 ~~~~l~~~F~~~D~d~~G~Is~~el~~~l~~~--------~~~~~--------------------------------ev~ 47 (96)
T smart00027 8 DKAKYEQIFRSLDKNQDGTVTGAQAKPILLKS--------GLPQT--------------------------------LLA 47 (96)
T ss_pred HHHHHHHHHHHhCCCCCCeEeHHHHHHHHHHc--------CCCHH--------------------------------HHH
Confidence 456799999999 5789999999999999541 01111 123
Q ss_pred cccccchhhhhccccccccceeEeHHHHHHhhceeeeeecccc
Q psy1391 536 DAVSAHDKETLGVRHLADGEFWISYADFLKAFTHLEVVHLDSE 578 (614)
Q Consensus 536 ~~f~~~d~d~sG~~~~~~~~~~m~f~dFi~~~~~L~~~~~~~~ 578 (614)
.+|..+|.+..|. |+|++|+..+..+..+...+.
T Consensus 48 ~i~~~~d~~~~g~---------I~~~eF~~~~~~~~~~~~g~~ 81 (96)
T smart00027 48 KIWNLADIDNDGE---------LDKDEFALAMHLIYRKLNGYP 81 (96)
T ss_pred HHHHHhcCCCCCC---------cCHHHHHHHHHHHHHHHcCCC
Confidence 4445556555554 778888888877776665443
No 34
>cd00052 EH Eps15 homology domain; found in proteins implicated in endocytosis, vesicle transport, and signal transduction. The alignment contains a pair of EF-hand motifs, typically one of them is canonical and binds to Ca2+, while the other may not bind to Ca2+. A hydrophobic binding pocket is formed by residues from both EF-hand motifs. The EH domain binds to proteins containing NPF (class I), [WF]W or SWG (class II), or H[TS]F (class III) sequence motifs.
Probab=73.78 E-value=0.28 Score=38.13 Aligned_cols=25 Identities=20% Similarity=0.348 Sum_probs=21.0
Q ss_pred HHHHHHH-hCCCCCCChHHHHHHhhh
Q psy1391 462 EAVFLQL-ADEHRTVNAFELQELLDA 486 (614)
Q Consensus 462 ~~~F~~~-a~~~~~i~~~eLq~~L~~ 486 (614)
+++|..+ .+.+|.|+..||+.+|..
T Consensus 2 ~~~F~~~D~~~~G~i~~~el~~~l~~ 27 (67)
T cd00052 2 DQIFRSLDPDGDGLISGDEARPFLGK 27 (67)
T ss_pred hHHHHHhCCCCCCcCcHHHHHHHHHH
Confidence 5678888 578899999999999954
No 35
>cd05030 calgranulins Calgranulins: S-100 domain found in proteins belonging to the Calgranulin subgroup of the S100 family of EF-hand calcium-modulated proteins, including S100A8, S100A9, and S100A12 . Note that the S-100 hierarchy, to which this Calgranulin group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. These proteins are expressed mainly in granulocytes, and are involved in inflammation, allergy, and neuritogenesis, as well as in host-parasite response. Calgranulins are modulated not only by calcium, but also by other metals such as zinc and copper. Structural data suggested that calgranulins may exist in multiple structural forms, homodimers, as well as hetero-oligomers. For example, the S100A8/S100A9 complex called calprotectin plays important roles in the regulation of inflammatory processes, wound repair, and regulating zinc-dependent enzymes as well as microbial growth.
Probab=73.48 E-value=0.77 Score=38.52 Aligned_cols=72 Identities=21% Similarity=0.241 Sum_probs=49.5
Q ss_pred hhhhHHHHHHHHhCC---CCCCChHHHHHHhhhcCCCCccCCCeeeeeeeeCCcceEEEEeeCCCCCCcccccCCCCCCC
Q psy1391 457 SFSQYEAVFLQLADE---HRTVNAFELQELLDACLPNDYIKSCACMEIETYDGEPVQLVKLRNPLGGGGEYIGAWSRDSS 533 (614)
Q Consensus 457 ~~~~~~~~F~~~a~~---~~~i~~~eLq~~L~~~~~~~~~~~~f~~~~~~~~~~~~~li~l~d~~g~G~Ef~~lW~~~~~ 533 (614)
....+..+|+++|.. ++.|+..||..+|...+...+ + .+ .....
T Consensus 6 ~i~~~~~~f~~y~~~~~~~~~Is~~El~~ll~~~~g~~~-----t--~~--------------------------~~~~~ 52 (88)
T cd05030 6 AIETIINVFHQYSVRKGHPDTLYKKEFKQLVEKELPNFL-----K--KE--------------------------KNQKA 52 (88)
T ss_pred HHHHHHHHHHHHhccCCCcccCCHHHHHHHHHHHhhHhh-----c--cC--------------------------CCHHH
Confidence 356788999999855 579999999999964322110 0 00 01134
Q ss_pred CccccccchhhhhccccccccceeEeHHHHHHhhcee
Q psy1391 534 NWDAVSAHDKETLGVRHLADGEFWISYADFLKAFTHL 570 (614)
Q Consensus 534 w~~~f~~~d~d~sG~~~~~~~~~~m~f~dFi~~~~~L 570 (614)
+..+|..+|.++.|. |+|++|+..+..+
T Consensus 53 v~~i~~~~D~d~dG~---------I~f~eF~~~~~~~ 80 (88)
T cd05030 53 IDKIFEDLDTNQDGQ---------LSFEEFLVLVIKV 80 (88)
T ss_pred HHHHHHHcCCCCCCc---------CcHHHHHHHHHHH
Confidence 677778888877665 8899999987664
No 36
>KOG0036|consensus
Probab=73.48 E-value=0.64 Score=49.29 Aligned_cols=86 Identities=15% Similarity=0.109 Sum_probs=62.6
Q ss_pred CcchhhhHHHHHHHH-hCCCCCCChHHHHHHhhhcCCCCccCCCeeeeeeeeCCcceEEEEeeCCCCCCc----ccccCC
Q psy1391 454 DTKSFSQYEAVFLQL-ADEHRTVNAFELQELLDACLPNDYIKSCACMEIETYDGEPVQLVKLRNPLGGGG----EYIGAW 528 (614)
Q Consensus 454 ~~~~~~~~~~~F~~~-a~~~~~i~~~eLq~~L~~~~~~~~~~~~f~~~~~~~~~~~~~li~l~d~~g~G~----Ef~~lW 528 (614)
+.+.+..++.+|+.+ ++.+|.||..+|.+.|.+.-.. +-..+.. ..+++.+|.+.+|+ ||+.--
T Consensus 9 ~~er~~r~~~lf~~lD~~~~g~~d~~~l~k~~~~l~~~-----~~~~~~~------~~l~~~~d~~~dg~vDy~eF~~Y~ 77 (463)
T KOG0036|consen 9 DEERDIRIRCLFKELDSKNDGQVDLDQLEKGLEKLDHP-----KPNYEAA------KMLFSAMDANRDGRVDYSEFKRYL 77 (463)
T ss_pred cHHHHHHHHHHHHHhccCCCCceeHHHHHHHHHhcCCC-----CCchHHH------HHHHHhcccCcCCcccHHHHHHHH
Confidence 445788899999999 6799999999999888664322 1111112 34566677777777 887765
Q ss_pred CC-CCCCccccccchhhhhcccc
Q psy1391 529 SR-DSSNWDAVSAHDKETLGVRH 550 (614)
Q Consensus 529 ~~-~~~w~~~f~~~d~d~sG~~~ 550 (614)
.+ ..+-..+|...|.++.|.+.
T Consensus 78 ~~~E~~l~~~F~~iD~~hdG~i~ 100 (463)
T KOG0036|consen 78 DNKELELYRIFQSIDLEHDGKID 100 (463)
T ss_pred HHhHHHHHHHHhhhccccCCccC
Confidence 54 34678899999999999988
No 37
>PF01023 S_100: S-100/ICaBP type calcium binding domain; InterPro: IPR013787 The calcium-binding domain found in S100 and CaBP-9k proteins is a subfamily of the EF-hand calcium-binding domain []. S100s are small dimeric acidic calcium and zinc-binding proteins abundant in the brain, with S100B playing an important role in modulating the proliferation and differentiation of neurons and glia cells []. S100 proteins have two different types of calcium-binding sites: a low affinity one with a special structure, and a 'normal' EF-hand type high-affinity site. Calbindin-D9k (CaBP-9k) also belong to this family of proteins, but it does not form dimers. CaBP-9k is a cytosolic protein expressed in a variety of tissues. Although its precise function is unknown, it appears to be under the control of the steroid hormones oestrogen and progesterone in the female reproductive system []. In the intestine, CaBP-9k may be involved in calcium absorption by mediating intracellular diffusion []. This entry represents a subdomain of the calcium-binding domain found in S100, CaBP-9k, and related proteins.; PDB: 2RGI_A 4DUQ_B 2KAY_B 2KAX_A 2CNP_A 1CNP_A 1A03_A 1JWD_B 2JTT_A 1XK4_B ....
Probab=70.33 E-value=5.8 Score=28.69 Aligned_cols=33 Identities=36% Similarity=0.505 Sum_probs=26.9
Q ss_pred hhhHHHHHHHHhC---CCCCCChHHHHHHhhhcCCC
Q psy1391 458 FSQYEAVFLQLAD---EHRTVNAFELQELLDACLPN 490 (614)
Q Consensus 458 ~~~~~~~F~~~a~---~~~~i~~~eLq~~L~~~~~~ 490 (614)
...+..+|+++|+ ....++..||.++|...+++
T Consensus 5 i~~iI~vFhkYa~~~Gd~~~Lsk~Elk~Ll~~Elp~ 40 (44)
T PF01023_consen 5 IETIIDVFHKYAGKEGDKDTLSKKELKELLEKELPN 40 (44)
T ss_dssp HHHHHHHHHHHHTSSSSTTSEEHHHHHHHHHHHSTT
T ss_pred HHHHHHHHHHHhccCCCCCeEcHHHHHHHHHHHHHH
Confidence 3568899999994 44589999999999887653
No 38
>PF13833 EF-hand_8: EF-hand domain pair; PDB: 3KF9_A 1TTX_A 1WLZ_A 1ALV_A 1NX3_A 1ALW_A 1NX2_A 1NX1_A 1NX0_A 1DF0_A ....
Probab=70.11 E-value=0.24 Score=37.12 Aligned_cols=15 Identities=13% Similarity=0.237 Sum_probs=12.7
Q ss_pred CCCCCChHHHHHHhh
Q psy1391 471 EHRTVNAFELQELLD 485 (614)
Q Consensus 471 ~~~~i~~~eLq~~L~ 485 (614)
++|.|+..+|+.+|.
T Consensus 1 ~~G~i~~~~~~~~l~ 15 (54)
T PF13833_consen 1 KDGKITREEFRRALS 15 (54)
T ss_dssp SSSEEEHHHHHHHHH
T ss_pred CcCEECHHHHHHHHH
Confidence 468899999999993
No 39
>cd05031 S-100A10_like S-100A10_like: S-100A10 domain found in proteins similar to S100A10. S100A10 is a member of the S100 family of EF-hand superfamily of calcium-binding proteins. Note that the S-100 hierarchy, to which this S-100A1_like group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100 proteins are expressed exclusively in vertebrates, and are implicated in intracellular and extracellular regulatory activities. A unique feature of S100A10 is that it contains mutation in both of the calcium binding sites, making it calcium insensitive. S100A10 has been detected in brain, heart, gastrointestinal tract, kidney, liver, lung, spleen, testes, epidermis, aorta, and thymus. Structural data supports the homo- and hetero-dimeric as well as hetero-tetrameric nature of the protein. S100A10 has multiple binding partners in its calcium free state and is therefore involved in many diverse biological functions.
Probab=64.01 E-value=1.1 Score=37.86 Aligned_cols=30 Identities=20% Similarity=0.325 Sum_probs=24.7
Q ss_pred hhhhHHHHHHHHhC--C-CCCCChHHHHHHhhh
Q psy1391 457 SFSQYEAVFLQLAD--E-HRTVNAFELQELLDA 486 (614)
Q Consensus 457 ~~~~~~~~F~~~a~--~-~~~i~~~eLq~~L~~ 486 (614)
....++.+|..+++ + +|.|+..||+.+|..
T Consensus 6 ~~~~l~~~F~~~D~~dg~dG~Is~~El~~~l~~ 38 (94)
T cd05031 6 AMESLILTFHRYAGKDGDKNTLSRKELKKLMEK 38 (94)
T ss_pred HHHHHHHHHHHHhccCCCCCeECHHHHHHHHHH
Confidence 35678999999964 3 599999999999965
No 40
>COG4870 Cysteine protease [Posttranslational modification, protein turnover, chaperones]
Probab=64.01 E-value=6.4 Score=41.51 Aligned_cols=53 Identities=23% Similarity=0.318 Sum_probs=36.4
Q ss_pred CccccceEeeeeEEEEEccC----C-ceeEEEEeeCCCCCCCCccccCCCCCCCcccCCHHHHhhhCccccCCceEeeeh
Q psy1391 195 GIQIGINYRLYALERIETYD----G-EPVQLVKLRNPLGGGGEYIGAWSRDSSNWDAVSAHDKETLGVRHLADGEFWISY 269 (614)
Q Consensus 195 GL~~~HaY~v~~~~~v~~~~----g-~~~~LlrlrNPwg~~~ew~G~wsd~s~~W~~~~~~~~~~l~~~~~~dG~FWm~~ 269 (614)
+..-|||=+|++...-...+ + .+.--++|||-||. ...++|-|||++
T Consensus 261 ~~~~gHAv~iVGyDDs~~~n~~~~~~~g~GAfiikNSWGt----------------------------~wG~~GYfwisY 312 (372)
T COG4870 261 GENWGHAVLIVGYDDSFDINNFKYGPPGDGAFIIKNSWGT----------------------------NWGENGYFWISY 312 (372)
T ss_pred cccccceEEEEeccccccccccccCCCCCceEEEECcccc----------------------------ccccCceEEEEe
Confidence 35679999999986532211 0 11236889999997 124679999999
Q ss_pred HHHHhc
Q psy1391 270 ADFLKA 275 (614)
Q Consensus 270 ~df~~~ 275 (614)
.+-..-
T Consensus 313 ~ya~~g 318 (372)
T COG4870 313 YYALNG 318 (372)
T ss_pred eecccc
Confidence 887665
No 41
>smart00054 EFh EF-hand, calcium binding motif. EF-hands are calcium-binding motifs that occur at least in pairs. Links between disease states and genes encoding EF-hands, particularly the S100 subclass, are emerging. Each motif consists of a 12 residue loop flanked on either side by a 12 residue alpha-helix. EF-hands undergo a conformational change unpon binding calcium ions.
Probab=58.17 E-value=12 Score=22.34 Aligned_cols=26 Identities=23% Similarity=0.377 Sum_probs=21.3
Q ss_pred HHHHHHHH-hCCCCCCChHHHHHHhhh
Q psy1391 461 YEAVFLQL-ADEHRTVNAFELQELLDA 486 (614)
Q Consensus 461 ~~~~F~~~-a~~~~~i~~~eLq~~L~~ 486 (614)
++.+|..+ .+.+|.|+..|+..++..
T Consensus 2 ~~~~f~~~d~~~~g~i~~~e~~~~~~~ 28 (29)
T smart00054 2 LKEAFRLFDKDGDGKIDFEEFKDLLKA 28 (29)
T ss_pred HHHHHHHHCCCCCCcEeHHHHHHHHHh
Confidence 56788888 577889999999998854
No 42
>KOG0044|consensus
Probab=57.28 E-value=1.8 Score=41.95 Aligned_cols=80 Identities=13% Similarity=0.073 Sum_probs=50.9
Q ss_pred hhhHHHHHHHHh--CCCCCCChHHHHHHhhhcCCCCccCCCeeeeeeeeCCcceEEEEeeCCCCCCc----ccccCCCCC
Q psy1391 458 FSQYEAVFLQLA--DEHRTVNAFELQELLDACLPNDYIKSCACMEIETYDGEPVQLVKLRNPLGGGG----EYIGAWSRD 531 (614)
Q Consensus 458 ~~~~~~~F~~~a--~~~~~i~~~eLq~~L~~~~~~~~~~~~f~~~~~~~~~~~~~li~l~d~~g~G~----Ef~~lW~~~ 531 (614)
..++..++...- -..|.++..+.+.|+...++.+- +..-+ ..+-+.+|.+++|. ||..+++-.
T Consensus 25 ~~ei~~~Yr~Fk~~cP~G~~~~~~F~~i~~~~fp~gd-----~~~y~------~~vF~~fD~~~dg~i~F~Efi~als~~ 93 (193)
T KOG0044|consen 25 KKEIQQWYRGFKNECPSGRLTLEEFREIYASFFPDGD-----ASKYA------ELVFRTFDKNKDGTIDFLEFICALSLT 93 (193)
T ss_pred HHHHHHHHHHhcccCCCCccCHHHHHHHHHHHCCCCC-----HHHHH------HHHHHHhcccCCCCcCHHHHHHHHHHH
Confidence 456777777773 34688999999998877654220 00111 33455677777776 666666542
Q ss_pred --------CCCccccccchhhhhcccc
Q psy1391 532 --------SSNWDAVSAHDKETLGVRH 550 (614)
Q Consensus 532 --------~~w~~~f~~~d~d~sG~~~ 550 (614)
..|. |+-||.|+.|.|+
T Consensus 94 ~rGt~eekl~w~--F~lyD~dgdG~It 118 (193)
T KOG0044|consen 94 SRGTLEEKLKWA--FRLYDLDGDGYIT 118 (193)
T ss_pred cCCcHHHHhhhh--heeecCCCCceEc
Confidence 1354 8888888888776
No 43
>PF05543 Peptidase_C47: Staphopain peptidase C47; InterPro: IPR008750 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of cysteine peptidases belong to the peptidase family C47 (staphopain family, clan CA). The type example are the staphopains, which are one of four major families of proteinases secreted by the Gram-positive Staphylococcus aureus. These staphylococcal cysteine proteases are secreted as preproenzymes that are proteolytically cleaved to generate the mature enzyme [, , ].; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 1X9Y_D 1Y4H_B 1PXV_B 1CV8_A.
Probab=51.84 E-value=58 Score=30.94 Aligned_cols=32 Identities=19% Similarity=0.269 Sum_probs=22.6
Q ss_pred CCccccceEeeeeEEEEEccCCceeEEEEeeCCCCC
Q psy1391 194 NGIQIGINYRLYALERIETYDGEPVQLVKLRNPLGG 229 (614)
Q Consensus 194 ~GL~~~HaY~v~~~~~v~~~~g~~~~LlrlrNPwg~ 229 (614)
+|...|||-+|.+-.... +| .+.+-+.|||-.
T Consensus 115 ~~~~~gHAlavvGya~~~--~g--~~~y~~WNPW~~ 146 (175)
T PF05543_consen 115 NGPHAGHALAVVGYAKPN--NG--QKTYYFWNPWWN 146 (175)
T ss_dssp TTB--EEEEEEEEEEEET--TS--EEEEEEE-TT-S
T ss_pred CCCccceeEEEEeeeecC--CC--CeEEEEeCCccC
Confidence 678899999999986653 43 778999999965
No 44
>cd05025 S-100A1 S-100A1: S-100A1 domain found in proteins similar to S100A1. S100A1 is a calcium-binding protein belonging to a large S100 vertebrate-specific protein family within the EF-hand superfamily of calcium-binding proteins. Note that the S-100 hierarchy, to which this S-100A1 group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. As is the case with many other members of S100 protein family, S100A1 is implicated in intracellular and extracellular regulatory activities, including interaction with myosin-associated twitchin kinase, actin-capping protein CapZ, sinapsin I, and tubulin. Structural data suggests that S100A1 proteins exist within cells as antiparallel homodimers, while heterodimers with S100A4 and S100B also has been reported. Upon binding calcium S100A1 changes conformation to expose a hydrophobic cleft which is the interaction site of S100A1 with its more that 20 known target proteins.
Probab=50.05 E-value=3.1 Score=34.94 Aligned_cols=73 Identities=16% Similarity=0.211 Sum_probs=50.1
Q ss_pred hhhHHHHHHHHh--CCCC-CCChHHHHHHhhhcCCCCccCCCeeeeeeeeCCcceEEEEeeCCCCCCcccccCCCCCCCC
Q psy1391 458 FSQYEAVFLQLA--DEHR-TVNAFELQELLDACLPNDYIKSCACMEIETYDGEPVQLVKLRNPLGGGGEYIGAWSRDSSN 534 (614)
Q Consensus 458 ~~~~~~~F~~~a--~~~~-~i~~~eLq~~L~~~~~~~~~~~~f~~~~~~~~~~~~~li~l~d~~g~G~Ef~~lW~~~~~w 534 (614)
...++.+|..+. +.+| .|+..||+.+|...+... ++. ++. ...+
T Consensus 8 ~~~l~~~F~~fDd~dg~G~~Is~~El~~~l~~~lg~~-----~~~----------------~~s------------~~~v 54 (92)
T cd05025 8 METLINVFHAHSGKEGDKYKLSKKELKDLLQTELSDF-----LDA----------------QKD------------ADAV 54 (92)
T ss_pred HHHHHHHHHHHhcccCCCCeECHHHHHHHHHHHHHHH-----ccC----------------CCC------------HHHH
Confidence 467999999995 5678 599999999996422110 000 010 1346
Q ss_pred ccccccchhhhhccccccccceeEeHHHHHHhhceeee
Q psy1391 535 WDAVSAHDKETLGVRHLADGEFWISYADFLKAFTHLEV 572 (614)
Q Consensus 535 ~~~f~~~d~d~sG~~~~~~~~~~m~f~dFi~~~~~L~~ 572 (614)
..+|..+|.+++|. |+|++|+..+..+-+
T Consensus 55 ~~i~~~~D~d~~G~---------I~f~eF~~l~~~~~~ 83 (92)
T cd05025 55 DKIMKELDENGDGE---------VDFQEFVVLVAALTV 83 (92)
T ss_pred HHHHHHHCCCCCCc---------CcHHHHHHHHHHHHH
Confidence 77887888877765 889999998877654
No 45
>cd02248 Peptidase_C1A Peptidase C1A subfamily (MEROPS database nomenclature); composed of cysteine peptidases (CPs) similar to papain, including the mammalian CPs (cathepsins B, C, F, H, L, K, O, S, V, X and W). Papain is an endopeptidase with specific substrate preferences, primarily for bulky hydrophobic or aromatic residues at the S2 subsite, a hydrophobic pocket in papain that accommodates the P2 sidechain of the substrate (the second residue away from the scissile bond). Most members of the papain subfamily are endopeptidases. Some exceptions to this rule can be explained by specific details of the catalytic domains like the occluding loop in cathepsin B which confers an additional carboxydipeptidyl activity and the mini-chain of cathepsin H resulting in an N-terminal exopeptidase activity. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds. Parasitic CPs act extracellularly to help invade tissues and cells, to h
Probab=48.89 E-value=58 Score=31.40 Aligned_cols=43 Identities=16% Similarity=0.227 Sum_probs=32.5
Q ss_pred ccccceEeeeeEEEEEccCCceeEEEEeeCCCCCCCCccccCCCCCCCcccCCHHHHhhhCccccCCceEeeehHH
Q psy1391 196 IQIGINYRLYALERIETYDGEPVQLVKLRNPLGGGGEYIGAWSRDSSNWDAVSAHDKETLGVRHLADGEFWISYAD 271 (614)
Q Consensus 196 L~~~HaY~v~~~~~v~~~~g~~~~LlrlrNPwg~~~ew~G~wsd~s~~W~~~~~~~~~~l~~~~~~dG~FWm~~~d 271 (614)
...+||=.|++...- . +.....+||.||. . | .++|.|+|++++
T Consensus 156 ~~~~Hav~iVGy~~~---~--~~~ywiv~NSWG~--~-----------W---------------G~~Gy~~i~~~~ 198 (210)
T cd02248 156 TNLNHAVLLVGYGTE---N--GVDYWIVKNSWGT--S-----------W---------------GEKGYIRIARGS 198 (210)
T ss_pred CcCCEEEEEEEEeec---C--CceEEEEEcCCCC--c-----------c---------------ccCcEEEEEcCC
Confidence 446899999998642 1 3678999999996 1 2 366999999877
No 46
>PF13833 EF-hand_8: EF-hand domain pair; PDB: 3KF9_A 1TTX_A 1WLZ_A 1ALV_A 1NX3_A 1ALW_A 1NX2_A 1NX1_A 1NX0_A 1DF0_A ....
Probab=48.66 E-value=7.3 Score=28.88 Aligned_cols=27 Identities=19% Similarity=0.297 Sum_probs=20.2
Q ss_pred CccccccchhhhhccccccccceeEeHHHHHHhhce
Q psy1391 534 NWDAVSAHDKETLGVRHLADGEFWISYADFLKAFTH 569 (614)
Q Consensus 534 w~~~f~~~d~d~sG~~~~~~~~~~m~f~dFi~~~~~ 569 (614)
-..+|..+|.+++|. |+|++|+.++.+
T Consensus 27 ~~~l~~~~D~~~~G~---------I~~~EF~~~~~~ 53 (54)
T PF13833_consen 27 VDRLFREFDTDGDGY---------ISFDEFISMMQR 53 (54)
T ss_dssp HHHHHHHHTTSSSSS---------EEHHHHHHHHHH
T ss_pred HHHHHHhcccCCCCC---------CCHHHHHHHHHh
Confidence 456677777777766 889999988753
No 47
>smart00027 EH Eps15 homology domain. Pair of EF hand motifs that recognise proteins containing Asn-Pro-Phe (NPF) sequences.
Probab=48.08 E-value=2.8 Score=35.56 Aligned_cols=29 Identities=14% Similarity=0.066 Sum_probs=20.9
Q ss_pred CCCccccccchhhhhccccccccceeEeHHHHHHhhce
Q psy1391 532 SSNWDAVSAHDKETLGVRHLADGEFWISYADFLKAFTH 569 (614)
Q Consensus 532 ~~w~~~f~~~d~d~sG~~~~~~~~~~m~f~dFi~~~~~ 569 (614)
...+.+|..+|++++|. |+++++..++..
T Consensus 10 ~~l~~~F~~~D~d~~G~---------Is~~el~~~l~~ 38 (96)
T smart00027 10 AKYEQIFRSLDKNQDGT---------VTGAQAKPILLK 38 (96)
T ss_pred HHHHHHHHHhCCCCCCe---------EeHHHHHHHHHH
Confidence 45788899999888876 566666666544
No 48
>PF08726 EFhand_Ca_insen: Ca2+ insensitive EF hand; InterPro: IPR014837 EF hands are helix-loop-helix binding motifs involved in the regulation of many cellular processes. EF hands usually bind to Ca2+ ions, which cause a major conformational change that allows the protein to interact with its designated targets. This protein corresponds to an EF hand which has partially or entirely lost its calcium-binding properties. The calcium insensitive EF hand is still able to mediate protein-protein recognition []. ; PDB: 1H8B_A 1SJJ_B.
Probab=46.53 E-value=21 Score=28.58 Aligned_cols=29 Identities=31% Similarity=0.277 Sum_probs=25.5
Q ss_pred hhhhHHHHHHHHhCCCCCCChHHHHHHhh
Q psy1391 457 SFSQYEAVFLQLADEHRTVNAFELQELLD 485 (614)
Q Consensus 457 ~~~~~~~~F~~~a~~~~~i~~~eLq~~L~ 485 (614)
..++++..|..+|+....|+..||++.|.
T Consensus 4 s~eqv~~aFr~lA~~KpyVT~~dLr~~l~ 32 (69)
T PF08726_consen 4 SAEQVEEAFRALAGGKPYVTEEDLRRSLT 32 (69)
T ss_dssp TCHHHHHHHHHHCTSSSCEEHHHHHHHS-
T ss_pred CHHHHHHHHHHHHcCCCcccHHHHHHHcC
Confidence 34679999999999999999999999984
No 49
>cd00213 S-100 S-100: S-100 domain, which represents the largest family within the superfamily of proteins carrying the Ca-binding EF-hand motif. Note that this S-100 hierarchy contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100 proteins are expressed exclusively in vertebrates, and are implicated in intracellular and extracellular regulatory activities. Intracellularly, S100 proteins act as Ca-signaling or Ca-buffering proteins. The most unusual characteristic of certain S100 proteins is their occurrence in extracellular space, where they act in a cytokine-like manner through RAGE, the receptor for advanced glycation products. Structural data suggest that many S100 members exist within cells as homo- or heterodimers and even oligomers; oligomerization contributes to their functional diversification. Upon binding calcium, most S100 proteins change conformation to a more open structure exposing a hydrophobic cleft. This hydrophobic surface represents th
Probab=46.17 E-value=4.9 Score=33.28 Aligned_cols=30 Identities=27% Similarity=0.420 Sum_probs=25.2
Q ss_pred hhhhHHHHHHHH-h--CCCCCCChHHHHHHhhh
Q psy1391 457 SFSQYEAVFLQL-A--DEHRTVNAFELQELLDA 486 (614)
Q Consensus 457 ~~~~~~~~F~~~-a--~~~~~i~~~eLq~~L~~ 486 (614)
....++.+|..+ . +.+|.|+..||..++..
T Consensus 6 ~~~~l~~~F~~~D~~~~~~G~Is~~el~~~l~~ 38 (88)
T cd00213 6 AIETIIDVFHKYSGKEGDKDTLSKKELKELLET 38 (88)
T ss_pred HHHHHHHHHHHHhhccCCCCcCcHHHHHHHHHH
Confidence 446688999999 4 58899999999999964
No 50
>cd00051 EFh EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
Probab=42.93 E-value=6 Score=29.03 Aligned_cols=26 Identities=27% Similarity=0.341 Sum_probs=21.9
Q ss_pred HHHHHHHH-hCCCCCCChHHHHHHhhh
Q psy1391 461 YEAVFLQL-ADEHRTVNAFELQELLDA 486 (614)
Q Consensus 461 ~~~~F~~~-a~~~~~i~~~eLq~~L~~ 486 (614)
+..+|..+ .+.+|.|+..|+..++..
T Consensus 2 ~~~~f~~~d~~~~g~l~~~e~~~~l~~ 28 (63)
T cd00051 2 LREAFRLFDKDGDGTISADELKAALKS 28 (63)
T ss_pred HHHHHHHhCCCCCCcCcHHHHHHHHHH
Confidence 56788888 578899999999999965
No 51
>PLN02964 phosphatidylserine decarboxylase
Probab=42.64 E-value=4.2 Score=46.64 Aligned_cols=79 Identities=14% Similarity=0.099 Sum_probs=50.9
Q ss_pred hhhhHHHHHHHH-hCCCCCCChHHHHHHhhhcCCCCccCCCeeeee---eeeCCcceEEEEeeCCCCCCc----ccccCC
Q psy1391 457 SFSQYEAVFLQL-ADEHRTVNAFELQELLDACLPNDYIKSCACMEI---ETYDGEPVQLVKLRNPLGGGG----EYIGAW 528 (614)
Q Consensus 457 ~~~~~~~~F~~~-a~~~~~i~~~eLq~~L~~~~~~~~~~~~f~~~~---~~~~~~~~~li~l~d~~g~G~----Ef~~lW 528 (614)
+..+++..|+.+ .+.+|.| |..+|....... .+.+. . +.|++++|.+++|. ||..+-
T Consensus 141 qi~elkeaF~lfD~dgdG~i----Lg~ilrslG~~~-----pte~e~~fi------~~mf~~~D~DgdG~IdfdEFl~lL 205 (644)
T PLN02964 141 EPESACESFDLLDPSSSNKV----VGSIFVSCSIED-----PVETERSFA------RRILAIVDYDEDGQLSFSEFSDLI 205 (644)
T ss_pred HHHHHHHHHHHHCCCCCCcC----HHHHHHHhCCCC-----CCHHHHHHH------HHHHHHhCCCCCCeEcHHHHHHHH
Confidence 457788999988 5778876 777765432100 11111 2 56788889998887 776543
Q ss_pred CCC------CCCccccccchhhhhcccc
Q psy1391 529 SRD------SSNWDAVSAHDKETLGVRH 550 (614)
Q Consensus 529 ~~~------~~w~~~f~~~d~d~sG~~~ 550 (614)
... .+...+|+.+|+|++|.|+
T Consensus 206 ~~lg~~~seEEL~eaFk~fDkDgdG~Is 233 (644)
T PLN02964 206 KAFGNLVAANKKEELFKAADLNGDGVVT 233 (644)
T ss_pred HHhccCCCHHHHHHHHHHhCCCCCCcCC
Confidence 321 2367788888888887744
No 52
>KOG4251|consensus
Probab=40.84 E-value=16 Score=36.22 Aligned_cols=104 Identities=17% Similarity=0.167 Sum_probs=62.6
Q ss_pred EEecCCcEEEEecccCCCCccCceEEEeeCCCceeeecCCCCcchhhhcccCCCCCCcchhhhHHHHHHHH-hCCCCCCC
Q psy1391 398 CQLEQGSYLVLPTTFEPGQESGFTLRVYSSNPLKLKLLDSLPSVLKSAIVKAPSSLDTKSFSQYEAVFLQL-ADEHRTVN 476 (614)
Q Consensus 398 ~~L~pG~YvVVPsT~~p~~e~~F~Lrv~s~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~F~~~-a~~~~~i~ 476 (614)
--|||...-.||-.....-...|.-.+|-.+... ++.- +.........+.-+|.++ -+.++.|+
T Consensus 55 eiLppDHlngvkLEmDGhLNrgFhQEvflgkdLg-------gfDe--------daeprrsrrklmviFsKvDVNtDrkis 119 (362)
T KOG4251|consen 55 EILPPDHLNGVKLEMDGHLNRGFHQEVFLGKDLG-------GFDE--------DAEPRRSRRKLMVIFSKVDVNTDRKIS 119 (362)
T ss_pred ccCCcccccccccccccccchhhhhhheeccCCC-------Cccc--------ccchhHHHHHHHHHHhhcccCcccccc
Confidence 3477777777776655555556665555544322 0000 001112456788999999 69999999
Q ss_pred hHHHHHHhhhcCCCCccCCCe--eeeeeeeCCcceEEEEeeCCCCCCc----ccccC
Q psy1391 477 AFELQELLDACLPNDYIKSCA--CMEIETYDGEPVQLVKLRNPLGGGG----EYIGA 527 (614)
Q Consensus 477 ~~eLq~~L~~~~~~~~~~~~f--~~~~~~~~~~~~~li~l~d~~g~G~----Ef~~l 527 (614)
|.|+|+-+-..+. .|| +.+.. +.-.+..||+|+|. ||+.-
T Consensus 120 AkEmqrwImekta-----EHfqeameeS------kthFraVDpdgDGhvsWdEykvk 165 (362)
T KOG4251|consen 120 AKEMQRWIMEKTA-----EHFQEAMEES------KTHFRAVDPDGDGHVSWDEYKVK 165 (362)
T ss_pred HHHHHHHHHHHHH-----HHHHHHHhhh------hhheeeeCCCCCCceehhhhhhH
Confidence 9999987654331 112 12222 44567789999887 55544
No 53
>PF00112 Peptidase_C1: Papain family cysteine protease This is family C1 in the peptidase classification. ; InterPro: IPR000668 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of proteins belong to the peptidase family C1, sub-family C1A (papain family, clan CA). It includes proteins classed as non-peptidase homologs. These are have either been shown experimentally to lack peptidase activity or lack one or more of the active site residues. The papain family has a wide variety of activities, including broad-range (papain) and narrow-range endo-peptidases, aminopeptidases, dipeptidyl peptidases and enzymes with both exo- and endo-peptidase activity []. Members of the papain family are widespread, found in baculovirus [], eubacteria, yeast, and practically all protozoa, plants and mammals []. The proteins are typically lysosomal or secreted, and proteolytic cleavage of the propeptide is required for enzyme activation, although bleomycin hydrolase is cytosolic in fungi and mammals []. Papain-like cysteine proteinases are essentially synthesised as inactive proenzymes (zymogens) with N-terminal propeptide regions. The activation process of these enzymes includes the removal of propeptide regions. The propeptide regions serve a variety of functions in vivo and in vitro. The pro-region is required for the proper folding of the newly synthesised enzyme, the inactivation of the peptidase domain and stabilisation of the enzyme against denaturing at neutral to alkaline pH conditions. Amino acid residues within the pro-region mediate their membrane association, and play a role in the transport of the proenzyme to lysosomes. Among the most notable features of propeptides is their ability to inhibit the activity of their cognate enzymes and that certain propeptides exhibit high selectivity for inhibition of the peptidases from which they originate []. The catalytic residues of papain are Cys-25 and His-159, other important residues being Gln-19, which helps form the 'oxyanion hole', and Asn-175, which orientates the imidazole ring of His-159. ; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 3MOR_B 3HHI_B 1S4V_A 3F75_A 1MEG_A 1PCI_C 1PPO_A 3HD3_B 1F29_A 1EWL_A ....
Probab=38.44 E-value=46 Score=32.10 Aligned_cols=44 Identities=20% Similarity=0.229 Sum_probs=33.0
Q ss_pred ccccceEeeeeEEEEEccCCceeEEEEeeCCCCCCCCccccCCCCCCCcccCCHHHHhhhCccccCCceEeeehHHH
Q psy1391 196 IQIGINYRLYALERIETYDGEPVQLVKLRNPLGGGGEYIGAWSRDSSNWDAVSAHDKETLGVRHLADGEFWISYADF 272 (614)
Q Consensus 196 L~~~HaY~v~~~~~v~~~~g~~~~LlrlrNPwg~~~ew~G~wsd~s~~W~~~~~~~~~~l~~~~~~dG~FWm~~~df 272 (614)
...+||=+|++...-. +.....+||.||. . | .++|.|||+.+..
T Consensus 163 ~~~~Hav~iVGy~~~~-----~~~~wiv~NSWG~-~------------W---------------G~~Gy~~i~~~~~ 206 (219)
T PF00112_consen 163 ESGGHAVLIVGYDDEN-----GKGYWIVKNSWGT-D------------W---------------GDNGYFRISYDYN 206 (219)
T ss_dssp SSEEEEEEEEEEEEET-----TEEEEEEE-SBTT-T------------S---------------TBTTEEEEESSSS
T ss_pred cccccccccccccccc-----ceeeEeeehhhCC-c------------c---------------CCCeEEEEeeCCC
Confidence 5679999999986542 4688999999997 1 2 3578999998754
No 54
>cd02859 AMPKbeta_GBD_like AMP-activated protein kinase (AMPK) beta subunit glycogen binding domain (GBD). AMPK is a metabolic stress sensing protein that senses AMP/ATP and has recently been found to act as a glycogen sensor as well. The protein functions as a alpha-beta-gamma heterotrimer. This domain is the glycogen binding domain of the beta subunit.
Probab=37.89 E-value=36 Score=27.76 Aligned_cols=24 Identities=29% Similarity=0.598 Sum_probs=20.1
Q ss_pred CCccEEEeeccCCeEEEEEeccccccc
Q psy1391 50 YAGVFRFRLWWCGEWVEVLVDDRLPTV 76 (614)
Q Consensus 50 ~~G~y~~r~~~~G~w~~VvVDD~lP~~ 76 (614)
+.|.|.+||..+|+|+ +|...|+.
T Consensus 44 ~~g~y~YkF~Vdg~w~---~d~~~~~~ 67 (79)
T cd02859 44 PPGKYQYKFIVDGEWR---HSPDLPTE 67 (79)
T ss_pred CCCCEEEEEEECCEEE---eCCCCCcc
Confidence 4599999999999994 58888874
No 55
>PF07910 Peptidase_C78: Peptidase family C78; InterPro: IPR012462 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This entry contains UfSP1 and UfSP2, which are cysteine peptidases required for the processing and activation of Ubiquitin fold modifier 1 (Ufm1, IPR005375 from INTERPRO) and for its release from conjugated cellular proteins. UfSP1 and UfSP2 are 217 aa and 461 aa respectively [, ]. The peptidases belong to MEROPS peptidase family C78, clan CA. The UfSP2 family have an N-terminal extension with one or more zinc finger domains of the C2H2 type (IPR007087 from INTERPRO), which have been shown to be involved in protein:protein interaction. UfSP2 is present in most, if not all, multi-cellular organisms including plants, nematodes, flies, and mammals, whereas UfSP1 is not present in plants and nematodes []. ; PDB: 3OQC_B 2Z84_A.
Probab=34.67 E-value=1.2e+02 Score=30.04 Aligned_cols=55 Identities=18% Similarity=0.139 Sum_probs=32.9
Q ss_pred hHHHhhhhhh---hcCccccccccc-----------cccceecccCCceEEEecc-CC-----CCcHHHHHHHhhcCC
Q psy1391 93 PGLLEKAYAK---LHGSYEALKYGT-----------MLDGLADLTGGITESISIR-QD-----PTSSGRLLNKLLDMT 150 (614)
Q Consensus 93 ~~LlEKAyAK---l~GsY~~l~gg~-----------~~~al~dlTG~~~~~~~l~-~~-----~~~l~~~l~~~~~~~ 150 (614)
-.+||+|+.| ..|. .+.++. ..-.|..| |-++..++++ .. ...+.+.|.+.++.+
T Consensus 61 Q~~le~awdkG~d~~G~--~~~~~~~GsrkWIGt~E~~~~l~~~-gi~~~i~~f~~~~~~~~~~~~l~~~v~~yF~~~ 135 (218)
T PF07910_consen 61 QQWLEEAWDKGFDPQGA--QLTGGFVGSRKWIGTTEASALLRSL-GIPCKIVDFPKSGSEIRAHPRLLDWVWNYFESG 135 (218)
T ss_dssp HHHHHHCTSS---C---------CGTT------HHHHHHHHHHC--SEEEEEEES-SGCCC---CCGHHHHHHHHCCT
T ss_pred HHHHHHHHhhcCCcccc--cccccccccccEEcHHHHHHHHhhC-CceEEEEEEECCCcccccHHHHHHHHHHHhhcC
Confidence 4789999999 4455 333322 23345556 8888888887 31 167888888888765
No 56
>PF09279 EF-hand_like: Phosphoinositide-specific phospholipase C, efhand-like; InterPro: IPR015359 This domain is predominantly found in the enzyme phosphoinositol-specific phospholipase C. It adopts a structure consisting of a core of four alpha helices, in an EF like fold, and is required for functioning of the enzyme []. ; PDB: 3OHM_B 3QR0_A 2ZKM_X 2FJU_B 3QR1_D 1DJW_B 1DJI_B 1DJG_B 1QAS_B 2ISD_B ....
Probab=31.78 E-value=46 Score=27.12 Aligned_cols=67 Identities=10% Similarity=0.172 Sum_probs=45.7
Q ss_pred hHHHHHHHHhCCCCCCChHHHHHHhhhcCCCCccCCCeeeeeeeeCCcceEEEEeeCCCCCCcccccCCCCCCCCccccc
Q psy1391 460 QYEAVFLQLADEHRTVNAFELQELLDACLPNDYIKSCACMEIETYDGEPVQLVKLRNPLGGGGEYIGAWSRDSSNWDAVS 539 (614)
Q Consensus 460 ~~~~~F~~~a~~~~~i~~~eLq~~L~~~~~~~~~~~~f~~~~~~~~~~~~~li~l~d~~g~G~Ef~~lW~~~~~w~~~f~ 539 (614)
+++.+|.++++..+.|++.+|++.|+..-... . . +...++.++.
T Consensus 1 ei~~if~~ys~~~~~mt~~~f~~FL~~eQ~~~--------------------------~-----~-----~~~~~~~li~ 44 (83)
T PF09279_consen 1 EIEEIFRKYSSDKEYMTAEEFRRFLREEQGEP--------------------------R-----L-----TDEQAKELIE 44 (83)
T ss_dssp HHHHHHHHHCTTSSSEEHHHHHHHHHHTSS-T--------------------------T-----S-----SHHHHHHHHH
T ss_pred CHHHHHHHHhCCCCcCCHHHHHHHHHHHhccc--------------------------c-----C-----cHHHHHHHHH
Confidence 47889999998888999999999997532110 0 0 2345667777
Q ss_pred cchhhhhccccccccceeEeHHHHHHhh
Q psy1391 540 AHDKETLGVRHLADGEFWISYADFLKAF 567 (614)
Q Consensus 540 ~~d~d~sG~~~~~~~~~~m~f~dFi~~~ 567 (614)
++.++..+. ....|+++.|..++
T Consensus 45 ~~~~~~~~~-----~~~~lt~~gF~~fL 67 (83)
T PF09279_consen 45 KFEPDERNR-----QKGQLTLEGFTRFL 67 (83)
T ss_dssp HHHHHHHHH-----CTTEEEHHHHHHHH
T ss_pred HHccchhhc-----ccCCcCHHHHHHHH
Confidence 766554211 13679999999985
No 57
>KOG0028|consensus
Probab=28.38 E-value=13 Score=34.63 Aligned_cols=82 Identities=15% Similarity=0.133 Sum_probs=52.1
Q ss_pred hhhhHHHHHHHH-hCCCCCCChHHHHHHhhhcCCCCccCCCeeeeeeeeCCcceEEEEeeCCCCCCc----ccccCCC--
Q psy1391 457 SFSQYEAVFLQL-ADEHRTVNAFELQELLDACLPNDYIKSCACMEIETYDGEPVQLVKLRNPLGGGG----EYIGAWS-- 529 (614)
Q Consensus 457 ~~~~~~~~F~~~-a~~~~~i~~~eLq~~L~~~~~~~~~~~~f~~~~~~~~~~~~~li~l~d~~g~G~----Ef~~lW~-- 529 (614)
....++..|.-. .+..|.||+.||.-++.+. ||....-.. ..||+=+|..|+|. +|++.-.
T Consensus 31 q~q~i~e~f~lfd~~~~g~iD~~EL~vAmral--------GFE~~k~ei----~kll~d~dk~~~g~i~fe~f~~~mt~k 98 (172)
T KOG0028|consen 31 QKQEIKEAFELFDPDMAGKIDVEELKVAMRAL--------GFEPKKEEI----LKLLADVDKEGSGKITFEDFRRVMTVK 98 (172)
T ss_pred HHhhHHHHHHhhccCCCCcccHHHHHHHHHHc--------CCCcchHHH----HHHHHhhhhccCceechHHHHHHHHHH
Confidence 446677788877 7899999999997776442 243222100 34566667777776 5555422
Q ss_pred ----C-CCCCccccccchhhhhcccc
Q psy1391 530 ----R-DSSNWDAVSAHDKETLGVRH 550 (614)
Q Consensus 530 ----~-~~~w~~~f~~~d~d~sG~~~ 550 (614)
+ ..+-+..|+-+|.|+.|.|+
T Consensus 99 ~~e~dt~eEi~~afrl~D~D~~Gkis 124 (172)
T KOG0028|consen 99 LGERDTKEEIKKAFRLFDDDKTGKIS 124 (172)
T ss_pred HhccCcHHHHHHHHHcccccCCCCcC
Confidence 1 13457778888888888755
No 58
>PF09069 EF-hand_3: EF-hand; InterPro: IPR015154 Like other EF hand domains, this domain forms a helix-loop-helix motif, though since it does not contain the canonical pattern of calcium binding residues found in many EF hand domains, it does not bind calcium ions. The main function of this domain is the provision of specificity in beta-dystroglycan recognition, though in dystrophin it serves an additional role: stabilisation of the WW domain (IPR001202 from INTERPRO), enhancing dystroglycan binding []. ; PDB: 1EG3_A 1EG4_A.
Probab=26.58 E-value=79 Score=26.69 Aligned_cols=30 Identities=23% Similarity=0.396 Sum_probs=24.0
Q ss_pred hhHHHHHHHHhCCCCCCChHHHHHHhhhcC
Q psy1391 459 SQYEAVFLQLADEHRTVNAFELQELLDACL 488 (614)
Q Consensus 459 ~~~~~~F~~~a~~~~~i~~~eLq~~L~~~~ 488 (614)
+.++-+|.+++|.+|.|+...|..+|....
T Consensus 3 dKyRylFslisd~~g~~~~~~l~~lL~d~l 32 (90)
T PF09069_consen 3 DKYRYLFSLISDSNGCMDQRKLGLLLHDVL 32 (90)
T ss_dssp HHHHHHHHHHS-TTS-B-HHHHHHHHHHHH
T ss_pred HHHHHHHHHHcCCCCCCcHHHHHHHHHHHH
Confidence 568999999999999999999999997654
No 59
>cd02620 Peptidase_C1A_CathepsinB Cathepsin B group; composed of cathepsin B and similar proteins, including tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin B is a lysosomal papain-like cysteine peptidase which is expressed in all tissues and functions primarily as an exopeptidase through its carboxydipeptidyl activity. Together with other cathepsins, it is involved in the degradation of proteins, proenzyme activation, Ag processing, metabolism and apoptosis. Cathepsin B has been implicated in a number of human diseases such as cancer, rheumatoid arthritis, osteoporosis and Alzheimer's disease. The unique carboxydipeptidyl activity of cathepsin B is attributed to the presence of an occluding loop in its active site which favors the binding of the C-termini of substrate proteins. Some members of this group do not possess the occluding loop. TIN-Ag is an extracellular matrix basement protein which was originally identified as a target Ag involved in anti-tubular basement membrane
Probab=24.90 E-value=2.2e+02 Score=28.36 Aligned_cols=27 Identities=15% Similarity=0.051 Sum_probs=21.3
Q ss_pred ccceEeeeeEEEEEccCCceeEEEEeeCCCCC
Q psy1391 198 IGINYRLYALERIETYDGEPVQLVKLRNPLGG 229 (614)
Q Consensus 198 ~~HaY~v~~~~~v~~~~g~~~~LlrlrNPwg~ 229 (614)
.+||=.|++...- + +.+...+||-||.
T Consensus 184 ~~HaV~iVGyg~~---~--g~~YWivrNSWG~ 210 (236)
T cd02620 184 GGHAVKIIGWGVE---N--GVPYWLAANSWGT 210 (236)
T ss_pred CCeEEEEEEEecc---C--CeeEEEEEeCCCC
Confidence 5799999997532 3 3678899999996
No 60
>KOG0036|consensus
Probab=23.71 E-value=20 Score=38.39 Aligned_cols=54 Identities=9% Similarity=0.062 Sum_probs=38.1
Q ss_pred hhhhHHHHHHHH-hCCCCCCChHHHHHHhhhcCCCCccCCCeeeeeeeeCCcceEEEEeeCCCCCCc
Q psy1391 457 SFSQYEAVFLQL-ADEHRTVNAFELQELLDACLPNDYIKSCACMEIETYDGEPVQLVKLRNPLGGGG 522 (614)
Q Consensus 457 ~~~~~~~~F~~~-a~~~~~i~~~eLq~~L~~~~~~~~~~~~f~~~~~~~~~~~~~li~l~d~~g~G~ 522 (614)
.+.++..+|+.+ -+.||.|++.|+.+.|.....+ .+.+.. ...+.-+|++|+++
T Consensus 80 ~E~~l~~~F~~iD~~hdG~i~~~Ei~~~l~~~gi~------l~de~~------~k~~e~~d~~g~~~ 134 (463)
T KOG0036|consen 80 KELELYRIFQSIDLEHDGKIDPNEIWRYLKDLGIQ------LSDEKA------AKFFEHMDKDGKAT 134 (463)
T ss_pred hHHHHHHHHhhhccccCCccCHHHHHHHHHHhCCc------cCHHHH------HHHHHHhccCCCee
Confidence 356789999999 6899999999999999765422 233333 33455567777665
No 61
>PTZ00183 centrin; Provisional
Probab=22.23 E-value=21 Score=32.56 Aligned_cols=27 Identities=19% Similarity=0.303 Sum_probs=18.7
Q ss_pred CccccccchhhhhccccccccceeEeHHHHHHhhce
Q psy1391 534 NWDAVSAHDKETLGVRHLADGEFWISYADFLKAFTH 569 (614)
Q Consensus 534 w~~~f~~~d~d~sG~~~~~~~~~~m~f~dFi~~~~~ 569 (614)
...+|..+|.++.|. |+|++|+.++..
T Consensus 128 ~~~~~~~~d~~~~g~---------i~~~ef~~~~~~ 154 (158)
T PTZ00183 128 LQEMIDEADRNGDGE---------ISEEEFYRIMKK 154 (158)
T ss_pred HHHHHHHhCCCCCCc---------CcHHHHHHHHhc
Confidence 456677777666554 778888888754
No 62
>PF03633 Glyco_hydro_65C: Glycosyl hydrolase family 65, C-terminal domain ; InterPro: IPR005194 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. This family of glycosyl hydrolases (GH65 from CAZY) contains this domain and includes vacuolar acid trehalase and maltose phosphorylases. Maltose phosphorylase (MP) is a dimeric enzyme that catalyzes the conversion of maltose and inorganic phosphate into beta-D-glucose-1-phosphate and glucose. The C-terminal domain forms a two layered jelly roll motif. This domain is situated at the base of the catalytic domain, however its function remains unknown [].; PDB: 1H54_A.
Probab=21.73 E-value=1e+02 Score=22.94 Aligned_cols=22 Identities=32% Similarity=0.678 Sum_probs=18.9
Q ss_pred CccEEEeeccCCeEEEEEeccc
Q psy1391 51 AGVFRFRLWWCGEWVEVLVDDR 72 (614)
Q Consensus 51 ~G~y~~r~~~~G~w~~VvVDD~ 72 (614)
...+.|+|...|.|..|.||..
T Consensus 7 w~~l~F~~~~rg~~l~v~i~~~ 28 (54)
T PF03633_consen 7 WSSLSFRLRYRGHWLEVEITHE 28 (54)
T ss_dssp -SEEEEEEEETTEEEEEEEETT
T ss_pred cCEeEEEEEECCEEEEEEEECC
Confidence 4679999999999999999854
No 63
>cd00585 Peptidase_C1B Peptidase C1B subfamily (MEROPS database nomenclature); composed of eukaryotic bleomycin hydrolases (BH) and bacterial aminopeptidases C (pepC). The proteins of this subfamily contain a large insert relative to the C1A peptidase (papain) subfamily. BH is a cysteine peptidase that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. Bleomycin, a glycopeptide derived from the fungus Streptomyces verticullus, is an effective anticancer drug due to its ability to induce DNA strand breaks. Human BH is the major cause of tumor cell resistance to bleomycin chemotherapy, and is also genetically linked to Alzheimer's disease. In addition to its peptidase activity, the yeast BH (Gal6) binds DNA and acts as a repressor in the Gal4 regulatory system. BH forms a hexameric ring barrel structure w
Probab=21.44 E-value=4.2e+02 Score=29.25 Aligned_cols=29 Identities=21% Similarity=0.146 Sum_probs=21.3
Q ss_pred ccceEeeeeEEEEEccCCceeEEEEeeCCCCC
Q psy1391 198 IGINYRLYALERIETYDGEPVQLVKLRNPLGG 229 (614)
Q Consensus 198 ~~HaY~v~~~~~v~~~~g~~~~LlrlrNPwg~ 229 (614)
..||=.|+++..- .+| ....-+++|-||.
T Consensus 358 ~tHAM~ivGv~~D--~~g-~p~yw~VkNSWG~ 386 (437)
T cd00585 358 MTHAMVLTGVDLD--EDG-KPVKWKVENSWGE 386 (437)
T ss_pred CCeEEEEEEEEec--CCC-CcceEEEEcccCC
Confidence 4699999998642 224 3457889999997
No 64
>PF14658 EF-hand_9: EF-hand domain
Probab=20.78 E-value=47 Score=26.32 Aligned_cols=12 Identities=17% Similarity=0.326 Sum_probs=10.2
Q ss_pred eeEeHHHHHHhh
Q psy1391 556 FWISYADFLKAF 567 (614)
Q Consensus 556 ~~m~f~dFi~~~ 567 (614)
+.|+|++|+..+
T Consensus 51 ~~v~~d~F~~iM 62 (66)
T PF14658_consen 51 GSVNFDTFLAIM 62 (66)
T ss_pred ceEeHHHHHHHH
Confidence 779999998865
No 65
>KOG0041|consensus
Probab=20.56 E-value=29 Score=33.60 Aligned_cols=32 Identities=22% Similarity=0.310 Sum_probs=0.0
Q ss_pred cccccchhhhhcccccccc---------------------------ceeEeHHHHHHhh
Q psy1391 536 DAVSAHDKETLGVRHLADG---------------------------EFWISYADFLKAF 567 (614)
Q Consensus 536 ~~f~~~d~d~sG~~~~~~~---------------------------~~~m~f~dFi~~~ 567 (614)
.+|+.+|.|+.|.|++.+- ++.|+|..|+..|
T Consensus 103 ~~Fk~yDe~rDgfIdl~ELK~mmEKLgapQTHL~lK~mikeVded~dgklSfreflLIf 161 (244)
T KOG0041|consen 103 SMFKQYDEDRDGFIDLMELKRMMEKLGAPQTHLGLKNMIKEVDEDFDGKLSFREFLLIF 161 (244)
T ss_pred HHHHHhcccccccccHHHHHHHHHHhCCchhhHHHHHHHHHhhcccccchhHHHHHHHH
Done!