Query psy11546
Match_columns 1634
No_of_seqs 825 out of 3778
Neff 5.1
Searched_HMMs 46136
Date Fri Aug 16 21:16:02 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy11546.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/11546hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 cd00190 Tryp_SPc Trypsin-like 100.0 3.9E-36 8.4E-41 326.7 23.9 229 1285-1519 1-232 (232)
2 KOG3627|consensus 100.0 9.6E-36 2.1E-40 334.2 24.7 236 1282-1521 10-255 (256)
3 smart00020 Tryp_SPc Trypsin-li 100.0 1.5E-33 3.2E-38 308.0 23.3 226 1284-1516 1-229 (229)
4 PF00089 Trypsin: Trypsin; In 100.0 1.6E-31 3.6E-36 288.7 21.7 217 1285-1516 1-220 (220)
5 COG5640 Secreted trypsin-like 99.9 2.5E-25 5.5E-30 253.4 16.4 242 1281-1531 29-289 (413)
6 PF09342 DUF1986: Domain of un 99.5 1.9E-13 4.1E-18 151.3 13.1 216 1292-1520 12-265 (267)
7 PF01390 SEA: SEA domain; Int 99.5 1.8E-13 3.8E-18 135.7 10.8 104 70-183 2-107 (107)
8 PF03761 DUF316: Domain of unk 99.4 5.9E-12 1.3E-16 145.3 16.3 209 1283-1519 40-278 (282)
9 smart00200 SEA Domain found in 99.2 1.3E-10 2.7E-15 119.5 9.9 92 72-170 3-99 (121)
10 smart00202 SR Scavenger recept 98.7 2.7E-08 5.8E-13 98.5 7.0 90 1157-1257 10-101 (101)
11 PF00057 Ldl_recept_a: Low-den 98.7 1E-08 2.2E-13 84.6 3.0 36 992-1028 2-37 (37)
12 cd00112 LDLa Low Density Lipop 98.7 9E-09 1.9E-13 83.9 2.6 35 993-1028 1-35 (35)
13 cd07888 CRD_corin_2 One of two 98.6 2.2E-08 4.8E-13 103.1 2.3 103 855-978 1-110 (122)
14 cd07441 CRD_SFRP3 Cysteine-ric 98.6 2.5E-08 5.5E-13 103.0 2.2 63 854-932 2-64 (126)
15 cd07454 CRD_LIN_17 Cysteine-ri 98.5 5.1E-08 1.1E-12 100.7 4.2 65 853-933 2-66 (124)
16 cd07462 CRD_FZ10 Cysteine-rich 98.5 3.5E-08 7.6E-13 102.1 2.9 106 853-979 2-112 (127)
17 cd07442 CRD_SFRP4 Cysteine-ric 98.5 3.3E-08 7.1E-13 102.3 2.4 62 855-932 4-65 (127)
18 cd07456 CRD_FZ5_like Cysteine- 98.5 5.4E-08 1.2E-12 100.1 3.7 62 856-933 2-63 (120)
19 cd07445 CRD_corin_1 One of two 98.5 3.6E-08 7.9E-13 102.0 2.4 108 852-980 1-114 (130)
20 cd07466 CRD_FZ7 Cysteine-rich 98.5 7.2E-08 1.6E-12 99.7 4.1 65 853-933 2-66 (125)
21 cd07464 CRD_FZ2 Cysteine-rich 98.5 7.4E-08 1.6E-12 99.8 3.4 65 853-933 2-66 (127)
22 cd07460 CRD_FZ5 Cysteine-rich 98.5 9.5E-08 2.1E-12 99.0 4.0 103 855-978 4-111 (127)
23 KOG1215|consensus 98.5 1.3E-07 2.8E-12 125.7 5.7 38 991-1029 136-175 (877)
24 cd07450 CRD_FZ6 Cysteine-rich 98.5 5.3E-08 1.1E-12 100.8 1.5 101 855-977 4-109 (127)
25 cd07447 CRD_Carboxypeptidase_Z 98.4 7.3E-08 1.6E-12 100.0 2.3 107 854-978 2-113 (128)
26 cd07465 CRD_FZ1 Cysteine-rich 98.4 9.9E-08 2.2E-12 98.9 3.2 65 853-933 2-66 (127)
27 PF00530 SRCR: Scavenger recep 98.4 2.6E-07 5.7E-12 90.2 5.8 90 1157-1257 7-99 (99)
28 cd07463 CRD_FZ9 Cysteine-rich 98.4 1.5E-07 3.3E-12 97.6 4.2 103 854-978 3-111 (127)
29 cd07448 CRD_FZ4 Cysteine-rich 98.4 1.5E-07 3.3E-12 97.5 3.8 81 854-955 2-82 (126)
30 PF00057 Ldl_recept_a: Low-den 98.4 1.9E-07 4.2E-12 77.1 3.5 36 1061-1097 2-37 (37)
31 cd07461 CRD_FZ8 Cysteine-rich 98.4 1.8E-07 3.9E-12 96.9 4.1 104 854-978 3-111 (125)
32 cd07449 CRD_FZ3 Cysteine-rich 98.4 8.5E-08 1.8E-12 99.2 1.6 102 855-978 4-110 (127)
33 cd07457 CRD_FZ9_like Cysteine- 98.4 2E-07 4.3E-12 96.1 3.9 103 854-977 1-108 (121)
34 cd07458 CRD_FZ1_like Cysteine- 98.4 1.9E-07 4.1E-12 96.0 3.4 64 854-933 1-64 (119)
35 cd00112 LDLa Low Density Lipop 98.4 1.8E-07 3.9E-12 76.3 2.5 35 1062-1097 1-35 (35)
36 smart00063 FRI Frizzled. Droso 98.3 3E-07 6.6E-12 93.7 3.4 78 856-955 1-78 (113)
37 cd07452 CRD_sizzled Cysteine-r 98.3 3.4E-07 7.4E-12 96.3 3.3 81 851-955 4-86 (141)
38 smart00192 LDLa Low-density li 98.3 4.1E-07 8.8E-12 73.3 3.0 32 993-1025 2-33 (33)
39 cd07444 CRD_SFRP5 Cysteine-ric 98.3 2.7E-07 6E-12 95.6 2.2 101 853-977 4-111 (127)
40 cd07446 CRD_SFRP2 Cysteine-ric 98.2 8.7E-07 1.9E-11 92.1 4.2 81 854-955 3-85 (128)
41 cd07443 CRD_SFRP1 Cysteine-ric 98.2 5.5E-07 1.2E-11 93.0 2.5 79 853-955 4-84 (124)
42 KOG1215|consensus 98.2 1.2E-06 2.6E-11 116.5 5.8 92 1541-1634 141-294 (877)
43 cd07455 CRD_Collagen_XVIII Cys 98.2 1.1E-06 2.4E-11 90.6 3.3 63 854-932 3-67 (123)
44 PF12999 PRKCSH-like: Glucosid 98.1 1.5E-06 3.2E-11 94.2 3.9 68 996-1094 35-110 (176)
45 PF15494 SRCR_2: Scavenger rec 98.1 6.8E-06 1.5E-10 81.7 7.0 89 1160-1265 5-98 (98)
46 cd07453 CRD_crescent Cysteine- 98.1 1.3E-06 2.9E-11 91.3 2.0 63 855-933 2-66 (135)
47 COG3591 V8-like Glu-specific e 98.0 4.8E-05 1.1E-09 87.1 11.8 197 1292-1521 45-251 (251)
48 smart00192 LDLa Low-density li 97.8 1.1E-05 2.3E-10 65.2 2.8 32 1062-1094 2-33 (33)
49 cd07451 CRD_SMO Cysteine-rich 97.7 6.6E-06 1.4E-10 85.9 0.2 65 854-933 3-69 (132)
50 KOG3577|consensus 97.6 7.1E-05 1.5E-09 95.0 6.9 152 850-1031 37-198 (556)
51 cd07066 CRD_FZ CRD_domain cyst 97.6 3.7E-05 8E-10 78.6 3.4 80 855-955 1-80 (119)
52 PF12999 PRKCSH-like: Glucosid 97.2 0.00024 5.2E-09 77.5 3.6 61 1541-1601 36-111 (176)
53 PF01392 Fz: Fz domain; Inter 97.0 0.00014 3.1E-09 73.2 -0.4 62 856-933 1-62 (116)
54 KOG2397|consensus 96.1 0.005 1.1E-07 75.5 4.1 52 1045-1096 61-115 (480)
55 TIGR02037 degP_htrA_DO peripla 95.6 0.22 4.7E-06 62.1 15.8 170 1307-1522 57-228 (428)
56 TIGR02038 protease_degS peripl 95.1 0.5 1.1E-05 57.6 16.4 168 1308-1522 78-249 (351)
57 PF13365 Trypsin_2: Trypsin-li 94.6 0.059 1.3E-06 53.6 5.8 22 1310-1331 1-23 (120)
58 PRK10898 serine endoprotease; 94.5 0.86 1.9E-05 55.7 16.5 82 1308-1413 78-160 (353)
59 PRK10139 serine endoprotease; 93.0 1.3 2.9E-05 55.9 14.5 169 1308-1522 90-261 (455)
60 KOG3509|consensus 92.2 0.1 2.2E-06 69.6 3.5 86 1549-1634 2-110 (964)
61 PRK10942 serine endoprotease; 91.3 2 4.3E-05 54.6 13.4 83 1308-1413 111-195 (473)
62 PF09272 Hepsin-SRCR: Hepsin, 90.9 0.31 6.8E-06 49.2 4.6 95 1156-1265 9-108 (110)
63 KOG3509|consensus 89.6 0.32 7E-06 65.1 4.5 93 1006-1098 3-111 (964)
64 KOG2397|consensus 89.4 0.25 5.4E-06 61.2 3.0 63 996-1058 42-115 (480)
65 PF02395 Peptidase_S6: Immunog 59.7 6.2 0.00013 52.9 2.8 32 1467-1498 213-245 (769)
66 PLN03160 uncharacterized prote 52.7 20 0.00043 41.4 5.1 16 69-84 65-80 (219)
67 PF07423 DUF1510: Protein of u 48.0 9.8 0.00021 43.8 1.7 28 32-59 2-29 (217)
68 PF14654 Epiglycanin_C: Mucin, 38.3 28 0.00062 35.3 2.9 33 38-70 11-44 (106)
69 PF12273 RCR: Chitin synthesis 34.2 27 0.00058 36.9 2.2 13 44-56 1-13 (130)
70 PF11293 DUF3094: Protein of u 30.7 54 0.0012 30.2 3.1 28 40-67 26-53 (55)
71 PF00548 Peptidase_C3: 3C cyst 29.1 51 0.0011 36.6 3.5 28 1467-1495 144-171 (172)
72 PF11770 GAPT: GRB2-binding ad 26.9 46 0.001 36.3 2.4 27 44-70 9-35 (158)
73 PRK11901 hypothetical protein; 25.6 78 0.0017 38.7 4.3 42 27-68 13-60 (327)
74 PF14155 DUF4307: Domain of un 25.6 81 0.0018 32.9 3.9 42 43-84 3-47 (112)
75 PF06024 DUF912: Nucleopolyhed 24.9 52 0.0011 33.6 2.4 27 42-68 58-84 (101)
76 PF00558 Vpu: Vpu protein; In 24.8 58 0.0013 32.3 2.5 24 45-68 5-28 (81)
77 PF01102 Glycophorin_A: Glycop 23.4 70 0.0015 34.0 3.0 30 39-68 58-87 (122)
78 PRK10697 DNA-binding transcrip 22.9 73 0.0016 33.7 3.0 25 96-122 88-115 (118)
79 PF00863 Peptidase_C4: Peptida 21.9 4.8E+02 0.01 30.9 9.5 153 1314-1516 37-192 (235)
No 1
>cd00190 Tryp_SPc Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. Alignment contains also inactive enzymes that have substitutions of the catalytic triad residues.
Probab=100.00 E-value=3.9e-36 Score=326.70 Aligned_cols=229 Identities=47% Similarity=0.923 Sum_probs=197.5
Q ss_pred EecCeecCCCccceEEEEEeC-CeeeeeceeeCCCccccccccccCCCCCceEEEEeEEecCCCCCCCCceeEeeEEEEe
Q psy11546 1285 IVGGGNARLGSWPWQAALYKE-GEFQCGATLISDQWLLSAGHCFYRAQDDYWVARLGTLRRGTKLPSPYEQLRPISKIIL 1363 (1634)
Q Consensus 1285 IvGG~~a~~GewPW~VsL~~~-g~~~CGGTLIS~rWVLTAAHCV~~~~~~~~~VrLG~~~~~s~~~s~~~Q~~~V~~Iii 1363 (1634)
|+||..+..++|||+|.|+.. +.++|+|+||+++||||||||+.......+.|++|....... ....+.+.|+++++
T Consensus 1 i~~G~~~~~~~~Pw~v~i~~~~~~~~C~GtlIs~~~VLTaAhC~~~~~~~~~~v~~g~~~~~~~--~~~~~~~~v~~~~~ 78 (232)
T cd00190 1 IVGGSEAKIGSFPWQVSLQYTGGRHFCGGSLISPRWVLTAAHCVYSSAPSNYTVRLGSHDLSSN--EGGGQVIKVKKVIV 78 (232)
T ss_pred CcCCeECCCCCCCCEEEEEccCCcEEEEEEEeeCCEEEECHHhcCCCCCccEEEEeCcccccCC--CCceEEEEEEEEEE
Confidence 579999999999999999987 789999999999999999999987666678888887665421 23567889999999
Q ss_pred CCCCCCCCCcCceEEEEeCCC--CCCCceeeecCCCCCCCCCCCeEEEEEecccccCCCcCCCCcEEEEEeeeChhHHhh
Q psy11546 1364 HPQYVDAGFINDISILKMKTP--FSNYVRPICLPHPNTPLTDGTLCTVVGWGQLFEIGRVFPDTLQEVQLPIISTAECRK 1441 (1634)
Q Consensus 1364 HP~Yn~~t~~nDIALLkL~~P--fSd~VqPICLP~~~~~l~~G~~c~VvGWG~t~e~g~~~s~~Lq~v~VpIIs~eeC~~ 1441 (1634)
||.|+.....+|||||+|+++ ++.+++|||||........+..+.++|||..... ...+..++...+.+++...|..
T Consensus 79 hp~y~~~~~~~DiAll~L~~~~~~~~~v~picl~~~~~~~~~~~~~~~~G~g~~~~~-~~~~~~~~~~~~~~~~~~~C~~ 157 (232)
T cd00190 79 HPNYNPSTYDNDIALLKLKRPVTLSDNVRPICLPSSGYNLPAGTTCTVSGWGRTSEG-GPLPDVLQEVNVPIVSNAECKR 157 (232)
T ss_pred CCCCCCCCCcCCEEEEEECCcccCCCcccceECCCccccCCCCCEEEEEeCCcCCCC-CCCCceeeEEEeeeECHHHhhh
Confidence 999999888999999999998 7888999999987656677899999999987554 2346789999999999999998
Q ss_pred hccCCCcccCCCCeEEEeecCCCCCCCCCCCCCeEEEecCCCcEEEEEEEEcCCCCCCCCCCeEEEEcCCchHHHHHh
Q psy11546 1442 RTLFLPLYRVTENMFCAGFERGGRDACLGDSGGPLMCQEPDGRWSLMGVTSNGYGCARANRPGVYTKVSNYIPWLYNN 1519 (1634)
Q Consensus 1442 ~~~~~~~~~It~sm~CAG~~~gg~daC~GDSGGPLVc~~~~GrW~LvGIvS~G~gCg~~~~PgVYTrVs~YlDWI~~v 1519 (1634)
.+.. ...+...++|++......+.|.|||||||++.. +++|+|+||+|+|..|...+.|++||+|+.|++||.++
T Consensus 158 ~~~~--~~~~~~~~~C~~~~~~~~~~c~gdsGgpl~~~~-~~~~~lvGI~s~g~~c~~~~~~~~~t~v~~~~~WI~~~ 232 (232)
T cd00190 158 AYSY--GGTITDNMLCAGGLEGGKDACQGDSGGPLVCND-NGRGVLVGIVSWGSGCARPNYPGVYTRVSSYLDWIQKT 232 (232)
T ss_pred hccC--cccCCCceEeeCCCCCCCccccCCCCCcEEEEe-CCEEEEEEEEehhhccCCCCCCCEEEEcHHhhHHhhcC
Confidence 6542 135788999998765578899999999999987 58999999999999998778999999999999999764
No 2
>KOG3627|consensus
Probab=100.00 E-value=9.6e-36 Score=334.16 Aligned_cols=236 Identities=45% Similarity=0.881 Sum_probs=197.5
Q ss_pred cceEecCeecCCCccceEEEEEeCC--eeeeeceeeCCCccccccccccCC-CCCceEEEEeEEecCCCCCCC-CceeEe
Q psy11546 1282 HSRIVGGGNARLGSWPWQAALYKEG--EFQCGATLISDQWLLSAGHCFYRA-QDDYWVARLGTLRRGTKLPSP-YEQLRP 1357 (1634)
Q Consensus 1282 ~~RIvGG~~a~~GewPW~VsL~~~g--~~~CGGTLIS~rWVLTAAHCV~~~-~~~~~~VrLG~~~~~s~~~s~-~~Q~~~ 1357 (1634)
..||+||.++..++|||+|+|.... .++|||+||+++||||||||+... .. .+.|++|.+......... ..+...
T Consensus 10 ~~~i~~g~~~~~~~~Pw~~~l~~~~~~~~~Cggsli~~~~vltaaHC~~~~~~~-~~~V~~G~~~~~~~~~~~~~~~~~~ 88 (256)
T KOG3627|consen 10 EGRIVGGTEAEPGSFPWQVSLQYGGNGRHLCGGSLISPRWVLTAAHCVKGASAS-LYTVRLGEHDINLSVSEGEEQLVGD 88 (256)
T ss_pred cCCEeCCccCCCCCCCCEEEEEECCCcceeeeeEEeeCCEEEEChhhCCCCCCc-ceEEEECccccccccccCchhhhce
Confidence 5799999999999999999999876 789999999999999999999774 33 778888876544321111 135566
Q ss_pred eEEEEeCCCCCCCCCc-CceEEEEeCCC--CCCCceeeecCCCCC--CCCCCCeEEEEEecccccCCCcCCCCcEEEEEe
Q psy11546 1358 ISKIILHPQYVDAGFI-NDISILKMKTP--FSNYVRPICLPHPNT--PLTDGTLCTVVGWGQLFEIGRVFPDTLQEVQLP 1432 (1634)
Q Consensus 1358 V~~IiiHP~Yn~~t~~-nDIALLkL~~P--fSd~VqPICLP~~~~--~l~~G~~c~VvGWG~t~e~g~~~s~~Lq~v~Vp 1432 (1634)
|.++++||.|+..... ||||||+|..+ |+++|+|||||.... ....+..|+++|||.+.......+..|+++.++
T Consensus 89 v~~~i~H~~y~~~~~~~nDiall~l~~~v~~~~~i~piclp~~~~~~~~~~~~~~~v~GWG~~~~~~~~~~~~L~~~~v~ 168 (256)
T KOG3627|consen 89 VEKIIVHPNYNPRTLENNDIALLRLSEPVTFSSHIQPICLPSSADPYFPPGGTTCLVSGWGRTESGGGPLPDTLQEVDVP 168 (256)
T ss_pred eeEEEECCCCCCCCCCCCCEEEEEECCCcccCCcccccCCCCCcccCCCCCCCEEEEEeCCCcCCCCCCCCceeEEEEEe
Confidence 8899999999998877 99999999997 999999999985543 244558999999998866544567889999999
Q ss_pred eeChhHHhhhccCCCcccCCCCeEEEeecCCCCCCCCCCCCCeEEEecCCCcEEEEEEEEcCCC-CCCCCCCeEEEEcCC
Q psy11546 1433 IISTAECRKRTLFLPLYRVTENMFCAGFERGGRDACLGDSGGPLMCQEPDGRWSLMGVTSNGYG-CARANRPGVYTKVSN 1511 (1634)
Q Consensus 1433 IIs~eeC~~~~~~~~~~~It~sm~CAG~~~gg~daC~GDSGGPLVc~~~~GrW~LvGIvS~G~g-Cg~~~~PgVYTrVs~ 1511 (1634)
+++...|...+.... .++..||||+...++.++|+|||||||++... ++|+++||+|||.+ |+..+.|++||+|+.
T Consensus 169 i~~~~~C~~~~~~~~--~~~~~~~Ca~~~~~~~~~C~GDSGGPLv~~~~-~~~~~~GivS~G~~~C~~~~~P~vyt~V~~ 245 (256)
T KOG3627|consen 169 IISNSECRRAYGGLG--TITDTMLCAGGPEGGKDACQGDSGGPLVCEDN-GRWVLVGIVSWGSGGCGQPNYPGVYTRVSS 245 (256)
T ss_pred EcChhHhcccccCcc--ccCCCEEeeCccCCCCccccCCCCCeEEEeeC-CcEEEEEEEEecCCCCCCCCCCeEEeEhHH
Confidence 999999998654321 46677999998677888999999999999874 48999999999998 999889999999999
Q ss_pred chHHHHHhhc
Q psy11546 1512 YIPWLYNNMA 1521 (1634)
Q Consensus 1512 YlDWI~~vI~ 1521 (1634)
|++||++.+.
T Consensus 246 y~~WI~~~~~ 255 (256)
T KOG3627|consen 246 YLDWIKENIG 255 (256)
T ss_pred hHHHHHHHhc
Confidence 9999999875
No 3
>smart00020 Tryp_SPc Trypsin-like serine protease. Many of these are synthesised as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. A few, however, are active as single chain molecules, and others are inactive due to substitutions of the catalytic triad residues.
Probab=100.00 E-value=1.5e-33 Score=308.02 Aligned_cols=226 Identities=49% Similarity=0.941 Sum_probs=194.6
Q ss_pred eEecCeecCCCccceEEEEEeCC-eeeeeceeeCCCccccccccccCCCCCceEEEEeEEecCCCCCCCCceeEeeEEEE
Q psy11546 1284 RIVGGGNARLGSWPWQAALYKEG-EFQCGATLISDQWLLSAGHCFYRAQDDYWVARLGTLRRGTKLPSPYEQLRPISKII 1362 (1634)
Q Consensus 1284 RIvGG~~a~~GewPW~VsL~~~g-~~~CGGTLIS~rWVLTAAHCV~~~~~~~~~VrLG~~~~~s~~~s~~~Q~~~V~~Ii 1362 (1634)
|++||..+..++|||+|.|+... .+.|+|+||+++||||||||+.......+.|++|....... ...+.+.|..++
T Consensus 1 ~~~~G~~~~~~~~Pw~~~i~~~~~~~~C~GtlIs~~~VLTaahC~~~~~~~~~~v~~g~~~~~~~---~~~~~~~v~~~~ 77 (229)
T smart00020 1 RIVGGSEANIGSFPWQVSLQYRGGRHFCGGSLISPRWVLTAAHCVYGSDPSNIRVRLGSHDLSSG---EEGQVIKVSKVI 77 (229)
T ss_pred CccCCCcCCCCCCCcEEEEEEcCCCcEEEEEEecCCEEEECHHHcCCCCCcceEEEeCcccCCCC---CCceEEeeEEEE
Confidence 68999999999999999999877 89999999999999999999987655678888987654321 223788999999
Q ss_pred eCCCCCCCCCcCceEEEEeCCC--CCCCceeeecCCCCCCCCCCCeEEEEEecccccCCCcCCCCcEEEEEeeeChhHHh
Q psy11546 1363 LHPQYVDAGFINDISILKMKTP--FSNYVRPICLPHPNTPLTDGTLCTVVGWGQLFEIGRVFPDTLQEVQLPIISTAECR 1440 (1634)
Q Consensus 1363 iHP~Yn~~t~~nDIALLkL~~P--fSd~VqPICLP~~~~~l~~G~~c~VvGWG~t~e~g~~~s~~Lq~v~VpIIs~eeC~ 1440 (1634)
+||.|+.....+|||||+|+++ +++.++|||||........+..+.++|||.....+......++...+.+++.+.|.
T Consensus 78 ~~p~~~~~~~~~DiAll~L~~~i~~~~~~~pi~l~~~~~~~~~~~~~~~~g~g~~~~~~~~~~~~~~~~~~~~~~~~~C~ 157 (229)
T smart00020 78 IHPNYNPSTYDNDIALLKLKSPVTLSDNVRPICLPSSNYNVPAGTTCTVSGWGRTSEGAGSLPDTLQEVNVPIVSNATCR 157 (229)
T ss_pred ECCCCCCCCCcCCEEEEEECcccCCCCceeeccCCCcccccCCCCEEEEEeCCCCCCCCCcCCCEeeEEEEEEeCHHHhh
Confidence 9999998888999999999999 78899999999875566678999999999876544445678999999999999999
Q ss_pred hhccCCCcccCCCCeEEEeecCCCCCCCCCCCCCeEEEecCCCcEEEEEEEEcCCCCCCCCCCeEEEEcCCchHHH
Q psy11546 1441 KRTLFLPLYRVTENMFCAGFERGGRDACLGDSGGPLMCQEPDGRWSLMGVTSNGYGCARANRPGVYTKVSNYIPWL 1516 (1634)
Q Consensus 1441 ~~~~~~~~~~It~sm~CAG~~~gg~daC~GDSGGPLVc~~~~GrW~LvGIvS~G~gCg~~~~PgVYTrVs~YlDWI 1516 (1634)
..+... ..+...++|++......+.|.|||||||++.. + +|+|+||+|+|..|...+.|.+|++|++|++||
T Consensus 158 ~~~~~~--~~~~~~~~C~~~~~~~~~~c~gdsG~pl~~~~-~-~~~l~Gi~s~g~~C~~~~~~~~~~~i~~~~~WI 229 (229)
T smart00020 158 RAYSGG--GAITDNMLCAGGLEGGKDACQGDSGGPLVCND-G-RWVLVGIVSWGSGCARPGKPGVYTRVSSYLDWI 229 (229)
T ss_pred hhhccc--cccCCCcEeecCCCCCCcccCCCCCCeeEEEC-C-CEEEEEEEEECCCCCCCCCCCEEEEeccccccC
Confidence 865431 24788999998765578899999999999987 3 999999999999998788999999999999998
No 4
>PF00089 Trypsin: Trypsin; InterPro: IPR001254 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This group of serine proteases belong to the MEROPS peptidase family S1 (chymotrypsin family, clan PA(S))and to peptidase family S6 (Hap serine peptidases). The chymotrypsin family is almost totally confined to animals, although trypsin-like enzymes are found in actinomycetes of the genera Streptomyces and Saccharopolyspora, and in the fungus Fusarium oxysporum []. The enzymes are inherently secreted, being synthesised with a signal peptide that targets them to the secretory pathway. Animal enzymes are either secreted directly, packaged into vesicles for regulated secretion, or are retained in leukocyte granules []. The Hap family, 'Haemophilus adhesion and penetration', are proteins that play a role in the interaction with human epithelial cells. The serine protease activity is localized at the N-terminal domain, whereas the binding domain is in the C-terminal region. ; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis; PDB: 1SPJ_A 1A5I_A 2ZGH_A 2ZKS_A 2ZGJ_A 2ZGC_A 2ODP_A 2I6Q_A 2I6S_A 2ODQ_A ....
Probab=99.98 E-value=1.6e-31 Score=288.73 Aligned_cols=217 Identities=43% Similarity=0.832 Sum_probs=186.8
Q ss_pred EecCeecCCCccceEEEEEeCC-eeeeeceeeCCCccccccccccCCCCCceEEEEeEEecCCCCCCCCceeEeeEEEEe
Q psy11546 1285 IVGGGNARLGSWPWQAALYKEG-EFQCGATLISDQWLLSAGHCFYRAQDDYWVARLGTLRRGTKLPSPYEQLRPISKIIL 1363 (1634)
Q Consensus 1285 IvGG~~a~~GewPW~VsL~~~g-~~~CGGTLIS~rWVLTAAHCV~~~~~~~~~VrLG~~~~~s~~~s~~~Q~~~V~~Iii 1363 (1634)
|+||..+..++|||+|.|+... .++|+|+||+++||||||||+.. ...+.+.+|..... ......+.+.|+++++
T Consensus 1 i~~g~~~~~~~~p~~v~i~~~~~~~~C~G~li~~~~vLTaahC~~~--~~~~~v~~g~~~~~--~~~~~~~~~~v~~~~~ 76 (220)
T PF00089_consen 1 IVGGDPASPGEFPWVVSIRYSNGRFFCTGTLISPRWVLTAAHCVDG--ASDIKVRLGTYSIR--NSDGSEQTIKVSKIII 76 (220)
T ss_dssp SBSSEECGTTSSTTEEEEEETTTEEEEEEEEEETTEEEEEGGGHTS--GGSEEEEESESBTT--STTTTSEEEEEEEEEE
T ss_pred CCCCEECCCCCCCeEEEEeeCCCCeeEeEEeccccccccccccccc--cccccccccccccc--cccccccccccccccc
Confidence 6799999999999999999887 99999999999999999999977 44677788872222 1233368899999999
Q ss_pred CCCCCCCCCcCceEEEEeCCC--CCCCceeeecCCCCCCCCCCCeEEEEEecccccCCCcCCCCcEEEEEeeeChhHHhh
Q psy11546 1364 HPQYVDAGFINDISILKMKTP--FSNYVRPICLPHPNTPLTDGTLCTVVGWGQLFEIGRVFPDTLQEVQLPIISTAECRK 1441 (1634)
Q Consensus 1364 HP~Yn~~t~~nDIALLkL~~P--fSd~VqPICLP~~~~~l~~G~~c~VvGWG~t~e~g~~~s~~Lq~v~VpIIs~eeC~~ 1441 (1634)
||.|+.....+|||||+|+++ +.+.++|+||+........+..+.++|||.....+ ....++...+.+++.+.|..
T Consensus 77 h~~~~~~~~~~DiAll~L~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~G~~~~~~~~--~~~~~~~~~~~~~~~~~c~~ 154 (220)
T PF00089_consen 77 HPKYDPSTYDNDIALLKLDRPITFGDNIQPICLPSAGSDPNVGTSCIVVGWGRTSDNG--YSSNLQSVTVPVVSRKTCRS 154 (220)
T ss_dssp ETTSBTTTTTTSEEEEEESSSSEHBSSBEESBBTSTTHTTTTTSEEEEEESSBSSTTS--BTSBEEEEEEEEEEHHHHHH
T ss_pred cccccccccccccccccccccccccccccccccccccccccccccccccccccccccc--cccccccccccccccccccc
Confidence 999999888999999999999 78999999999865556788999999999865444 45679999999999999998
Q ss_pred hccCCCcccCCCCeEEEeecCCCCCCCCCCCCCeEEEecCCCcEEEEEEEEcCCCCCCCCCCeEEEEcCCchHHH
Q psy11546 1442 RTLFLPLYRVTENMFCAGFERGGRDACLGDSGGPLMCQEPDGRWSLMGVTSNGYGCARANRPGVYTKVSNYIPWL 1516 (1634)
Q Consensus 1442 ~~~~~~~~~It~sm~CAG~~~gg~daC~GDSGGPLVc~~~~GrW~LvGIvS~G~gCg~~~~PgVYTrVs~YlDWI 1516 (1634)
.+. ..+...++|++.. +..+.|.|||||||++... +|+||++++..|...+.|++|++|+.|++||
T Consensus 155 ~~~----~~~~~~~~c~~~~-~~~~~~~g~sG~pl~~~~~----~lvGI~s~~~~c~~~~~~~v~~~v~~~~~WI 220 (220)
T PF00089_consen 155 SYN----DNLTPNMICAGSS-GSGDACQGDSGGPLICNNN----YLVGIVSFGENCGSPNYPGVYTRVSSYLDWI 220 (220)
T ss_dssp HTT----TTSTTTEEEEETT-SSSBGGTTTTTSEEEETTE----EEEEEEEEESSSSBTTSEEEEEEGGGGHHHH
T ss_pred ccc----ccccccccccccc-cccccccccccccccccee----eecceeeecCCCCCCCcCEEEEEHHHhhccC
Confidence 522 2377899999976 6688999999999998762 8999999999999988899999999999998
No 5
>COG5640 Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, chaperones]
Probab=99.93 E-value=2.5e-25 Score=253.41 Aligned_cols=242 Identities=28% Similarity=0.391 Sum_probs=175.9
Q ss_pred CcceEecCeecCCCccceEEEEEeC-----CeeeeeceeeCCCccccccccccCCCCCceEEEEeEEecCCCCCCCCcee
Q psy11546 1281 RHSRIVGGGNARLGSWPWQAALYKE-----GEFQCGATLISDQWLLSAGHCFYRAQDDYWVARLGTLRRGTKLPSPYEQL 1355 (1634)
Q Consensus 1281 ~~~RIvGG~~a~~GewPW~VsL~~~-----g~~~CGGTLIS~rWVLTAAHCV~~~~~~~~~VrLG~~~~~s~~~s~~~Q~ 1355 (1634)
...||+||..+..|+||++|+|... ...+|||++|..|||||||||+........-+..+..... +..+.+.
T Consensus 29 vs~rIigGs~Anag~~P~~VaLv~~isd~~s~tfCGgs~l~~RYvLTAAHC~~~~s~is~d~~~vv~~l~---d~Sq~~r 105 (413)
T COG5640 29 VSSRIIGGSNANAGEYPSLVALVDRISDYVSGTFCGGSKLGGRYVLTAAHCADASSPISSDVNRVVVDLN---DSSQAER 105 (413)
T ss_pred cceeEecCcccccccCchHHHHHhhcccccceeEeccceecceEEeeehhhccCCCCccccceEEEeccc---ccccccC
Confidence 3679999999999999999999643 3468999999999999999999875532222223333332 2345677
Q ss_pred EeeEEEEeCCCCCCCCCcCceEEEEeCCCCC-CCceeeecCCCC----CCCCCCCeEEEEEecccccCCC-cC---CCCc
Q psy11546 1356 RPISKIILHPQYVDAGFINDISILKMKTPFS-NYVRPICLPHPN----TPLTDGTLCTVVGWGQLFEIGR-VF---PDTL 1426 (1634)
Q Consensus 1356 ~~V~~IiiHP~Yn~~t~~nDIALLkL~~PfS-d~VqPICLP~~~----~~l~~G~~c~VvGWG~t~e~g~-~~---s~~L 1426 (1634)
..|..++.|..|.+.++.||||+++|.++.+ ..++ +-+-... ............+||.+.+... .. ...|
T Consensus 106 g~vr~i~~~efY~~~n~~ND~Av~~l~~~a~~pr~k-i~~~~~sdt~l~sv~~~s~~~n~t~~~~~~~~v~~~~p~gt~l 184 (413)
T COG5640 106 GHVRTIYVHEFYSPGNLGNDIAVLELARAASLPRVK-ITSFDASDTFLNSVTTVSPMTNGTFGVTTPSDVPRSSPKGTIL 184 (413)
T ss_pred cceEEEeeecccccccccCcceeeccccccccchhh-eeeccCcccceecccccccccceeeeeeeecCCCCCCCcccee
Confidence 8999999999999999999999999998621 1111 1111110 1122334455667776644321 11 1379
Q ss_pred EEEEEeeeChhHHhhhccC----CCcccCCCCeEEEeecCCCCCCCCCCCCCeEEEecCCCcEEEEEEEEcCCC-CCCCC
Q psy11546 1427 QEVQLPIISTAECRKRTLF----LPLYRVTENMFCAGFERGGRDACLGDSGGPLMCQEPDGRWSLMGVTSNGYG-CARAN 1501 (1634)
Q Consensus 1427 q~v~VpIIs~eeC~~~~~~----~~~~~It~sm~CAG~~~gg~daC~GDSGGPLVc~~~~GrW~LvGIvS~G~g-Cg~~~ 1501 (1634)
+++.+..++...|...+.. .+...++. +|||.. ..++|+||||||++....+ ...++||+|||.+ |+.+.
T Consensus 185 ~e~~v~fv~~stc~~~~g~an~~dg~~~lT~--~cag~~--~~daCqGDSGGPi~~~g~~-G~vQ~GVvSwG~~~Cg~t~ 259 (413)
T COG5640 185 HEVAVLFVPLSTCAQYKGCANASDGATGLTG--FCAGRP--PKDACQGDSGGPIFHKGEE-GRVQRGVVSWGDGGCGGTL 259 (413)
T ss_pred eeeeeeeechHHhhhhccccccCCCCCCccc--eecCCC--CcccccCCCCCceEEeCCC-ccEEEeEEEecCCCCCCCC
Confidence 9999999999999875531 11123444 999954 4899999999999998744 4588999999987 99999
Q ss_pred CCeEEEEcCCchHHHHHhhccccccccccc
Q psy11546 1502 RPGVYTKVSNYIPWLYNNMAASEYNMMRNE 1531 (1634)
Q Consensus 1502 ~PgVYTrVs~YlDWI~~vI~~s~~~~~~~~ 1531 (1634)
.|+|||+|+.|.+||..+|+.-...+.++.
T Consensus 260 ~~gVyT~vsny~~WI~a~~~~l~~~~~rp~ 289 (413)
T COG5640 260 IPGVYTNVSNYQDWIAAMTNGLSYLQFRPL 289 (413)
T ss_pred cceeEEehhHHHHHHHHHhcCCCccccccc
Confidence 999999999999999999988766655443
No 6
>PF09342 DUF1986: Domain of unknown function (DUF1986); InterPro: IPR015420 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain is found in serine endopeptidases belonging to MEROPS peptidase family S1A (clan PA). It is found in unusual mosaic proteins, which are encoded by the Drosophila nudel gene (see P98159 from SWISSPROT). Nudel is involved in defining embryonic dorsoventral polarity. Three proteases; ndl, gd and snk process easter to create active easter. Active easter defines cell identities along the dorsal-ventral continuum by activating the spz ligand for the Tl receptor in the ventral region of the embryo. Nudel, pipe and windbeutel together trigger the protease cascade within the extraembryonic perivitelline compartment which induces dorsoventral polarity of the Drosophila embryo [].
Probab=99.49 E-value=1.9e-13 Score=151.28 Aligned_cols=216 Identities=24% Similarity=0.462 Sum_probs=142.8
Q ss_pred CCCccceEEEEEeCCeeeeeceeeCCCccccccccccCC--CCCceEEEEeEEecCCCCCCCCceeEeeEEEEeCCCCCC
Q psy11546 1292 RLGSWPWQAALYKEGEFQCGATLISDQWLLSAGHCFYRA--QDDYWVARLGTLRRGTKLPSPYEQLRPISKIILHPQYVD 1369 (1634)
Q Consensus 1292 ~~GewPW~VsL~~~g~~~CGGTLIS~rWVLTAAHCV~~~--~~~~~~VrLG~~~~~s~~~s~~~Q~~~V~~IiiHP~Yn~ 1369 (1634)
....|||.|.||.+|.++|+|+||++.|||++..|+.+. ...++.|.+|.........++++|+++|+.+..-|.
T Consensus 12 e~y~WPWlA~IYvdG~~~CsgvLlD~~WlLvsssCl~~I~L~~~YvsallG~~Kt~~~v~Gp~EQI~rVD~~~~V~~--- 88 (267)
T PF09342_consen 12 EDYHWPWLADIYVDGRYWCSGVLLDPHWLLVSSSCLRGISLSHHYVSALLGGGKTYLSVDGPHEQISRVDCFKDVPE--- 88 (267)
T ss_pred ccccCcceeeEEEcCeEEEEEEEeccceEEEeccccCCcccccceEEEEecCcceecccCCChheEEEeeeeeeccc---
Confidence 456799999999999999999999999999999999764 347889999988777667899999999999876654
Q ss_pred CCCcCceEEEEeCCC--CCCCceeeecCCCCCCCCCCCeEEEEEecccccCCCcCCCCcEEEEEeee----ChhHHhhhc
Q psy11546 1370 AGFINDISILKMKTP--FSNYVRPICLPHPNTPLTDGTLCTVVGWGQLFEIGRVFPDTLQEVQLPII----STAECRKRT 1443 (1634)
Q Consensus 1370 ~t~~nDIALLkL~~P--fSd~VqPICLP~~~~~l~~G~~c~VvGWG~t~e~g~~~s~~Lq~v~VpII----s~eeC~~~~ 1443 (1634)
.+++||+|++| |+.+|+|+.||...........|+.+|-..+ |. ..++.-..++.. +...|.+..
T Consensus 89 ----S~v~LLHL~~~~~fTr~VlP~flp~~~~~~~~~~~CVAVg~d~~---g~--~kt~si~l~~n~~nC~s~~~Cy~~~ 159 (267)
T PF09342_consen 89 ----SNVLLLHLEQPANFTRYVLPTFLPETSNENESDDECVAVGHDDT---GR--IKTVSIFLIPNLENCDSCNRCYKLQ 159 (267)
T ss_pred ----cceeeeeecCcccceeeecccccccccCCCCCCCceEEEEcccC---Cc--eeeeEEEeeccccCCCchhhhhhhc
Confidence 78999999999 9999999999986666666679999987541 11 111111111111 112243322
Q ss_pred cCCCcc-----------cCCCCeEEEeecCCC-----CCCCCC-------------CCCCeEEEecCCCcEEEEEEEEcC
Q psy11546 1444 LFLPLY-----------RVTENMFCAGFERGG-----RDACLG-------------DSGGPLMCQEPDGRWSLMGVTSNG 1494 (1634)
Q Consensus 1444 ~~~~~~-----------~It~sm~CAG~~~gg-----~daC~G-------------DSGGPLVc~~~~GrW~LvGIvS~G 1494 (1634)
...+.. .-...++|... .+- -..|.- --+|-|||+.....||.+|+.++.
T Consensus 160 ~~~p~~~~n~e~~~s~s~w~g~i~c~~~-~~~~~~a~F~~c~~~~~kn~~~~~~~~s~~gv~vC~~~~~gw~p~~~~~~~ 238 (267)
T PF09342_consen 160 SKQPYALLNAEDMASISEWVGLISCVSE-KGWYPAAKFTSCDQFGLKNFSDKQHSPSDQGVLVCRDSHSGWYPTAFFNYN 238 (267)
T ss_pred ccCCcccccccccccccccceeEEeecC-CCCcceEEEEecccccccccccCCCCcccCceEEEcCCCCCccchhhhccC
Confidence 211100 11223455431 110 012221 135778898877889999999998
Q ss_pred CC-CCCCCCCeEEEEcCCchHHHHHhh
Q psy11546 1495 YG-CARANRPGVYTKVSNYIPWLYNNM 1520 (1634)
Q Consensus 1495 ~g-Cg~~~~PgVYTrVs~YlDWI~~vI 1520 (1634)
.+ |-.-+.|---+.+....+=|++.+
T Consensus 239 ~~~C~~f~~~~~i~~le~~y~~~q~~~ 265 (267)
T PF09342_consen 239 NGDCQGFNDPFGIRTLEEAYKSLQDSI 265 (267)
T ss_pred CCccccccCeeeEeEHHHhHHHHHHHh
Confidence 75 544444443333333334444443
No 7
>PF01390 SEA: SEA domain; InterPro: IPR000082 SEA is an extracellular domain associated with O-glycosylation []. Proteins found to contain SEA-modules include, agrin, enterokinase, 63 kDa Strongylocentrotus purpuratus (Purple sea urchin) sperm protein, perlecan (heparan sulphate proteoglycan core, mucin 1 and the cell surface antigen, 114/A10, and two functionally uncharacterised, probably extracellular, Caenorhabditis elegans proteins. Despite the functional diversity of these adhesive proteins, a common denominator seems to be their existence in heavily glycosylated environments. In addition, the better characterised proteins all contain O-glycosidic-linked carbohydrates such as heparan sulphate that contribute considerably to their molecular masses. The common module might regulate or assist binding to neighbouring carbohydrate moieties. Enterokinase, the initiator of intestinal digestion, is a mosaic protease composed of a distinctive assortment of domains []. ; PDB: 2ACM_B 1IVZ_A 2E7V_A.
Probab=99.47 E-value=1.8e-13 Score=135.69 Aligned_cols=104 Identities=27% Similarity=0.500 Sum_probs=80.9
Q ss_pred ccEEEEEEEEEecCCccCcccCCcchHhHHHHHHHHHHHHHHHhhcCCcccccceeEEEEeeCCCEEEEEEEEEcccccc
Q psy11546 70 VELIFDSSFRVTAGDSYNPSLENSTSNLYKEKSKRYKSMIEKLYNASVLSPAIKYCGVIGFKNGSLIVFYRIILDRRKIP 149 (1634)
Q Consensus 70 k~~yY~GSFrItn~~~ys~dL~~~sS~~Fk~La~~IE~mV~~iF~~S~L~~~YikS~Vi~F~~gsViV~f~L~Fd~~~~p 149 (1634)
+.++|.++|+|++.. |+++|.+++|.+|+.|+++++.+|+.+|+.+.|.+.|++++|++|++|+|+|+|+|.| ..+
T Consensus 2 ~~~~~~~~~~i~~~~-f~~~l~d~~S~~f~~l~~~i~~~i~~~~~~~~~~~~~~~~~I~~f~~gsViv~~~~~f---~~~ 77 (107)
T PF01390_consen 2 KSQSYSGSFRITNLE-FTPDLADPNSPEFQELARDIEDQINSVFRNSSLSPGFVGVTITSFRPGSVIVDFDVIF---DPP 77 (107)
T ss_dssp ECEEEEEEEEESSSE---GGGG-TTSHHHHHHHHHHHHHHHHHHHTSTTTTTEEEEEEEEEEETEEEEEEEEEE---ETT
T ss_pred ceEEEEEEEEEeCCc-cchhcCCCCCHHHHHHHHHHHHHHHHhhccCccCCCcceEEEEEEECCCEEEEEEEEE---eCC
Confidence 468999999999999 9999999999999999999999999999999999999999999999999999999999 444
Q ss_pred cCc--ccHHHHHHHHHHHHHhhcccCCccceEEcCC
Q psy11546 150 RSI--GNVEEVVKNILVDEITSRKAVAFKNIKVDEN 183 (1634)
Q Consensus 150 ~s~--~~v~~kI~~iL~q~L~~n~s~~l~~L~IDps 183 (1634)
... ..+++.+..+|++ . +...+|.|||+
T Consensus 78 ~~~~~~~~~~~l~~~l~~--~----~~~~~l~i~~~ 107 (107)
T PF01390_consen 78 SSAPPADIEEALQNALQQ--T----SNIGNLTIDPS 107 (107)
T ss_dssp T-S-HHHHHHHHHHHHCC--H----HCCCCCEE-S-
T ss_pred CCCCHHHHHHHHHHHHHh--h----cCcccccCCCC
Confidence 322 3444444444433 1 12337779975
No 8
>PF03761 DUF316: Domain of unknown function (DUF316) ; InterPro: IPR005514 This is a family of uncharacterised proteins from Caenorhabditis elegans.
Probab=99.38 E-value=5.9e-12 Score=145.25 Aligned_cols=209 Identities=24% Similarity=0.393 Sum_probs=135.6
Q ss_pred ceEecCeecCCCccceEEEEEeCC----eeeeeceeeCCCccccccccccCCCCC-----------------ceEEE---
Q psy11546 1283 SRIVGGGNARLGSWPWQAALYKEG----EFQCGATLISDQWLLSAGHCFYRAQDD-----------------YWVAR--- 1338 (1634)
Q Consensus 1283 ~RIvGG~~a~~GewPW~VsL~~~g----~~~CGGTLIS~rWVLTAAHCV~~~~~~-----------------~~~Vr--- 1338 (1634)
.+...|..+..++.||.|.++..+ ..+++|+|||+||||||+||+...... .+.|.
T Consensus 40 ~~~~~g~~~~~~~~pW~v~v~~~~~~~~~~~~~gtlIS~RHiLtss~~~~~~~~~W~~~~~~~~~~C~~~~~~l~vP~~~ 119 (282)
T PF03761_consen 40 SKVFNGTPAESGEAPWAVSVYTKNHNEGNYFSTGTLISPRHILTSSHCVMNDKSKWLNGEEFDNKKCEGNNNHLIVPEEV 119 (282)
T ss_pred ccccCCcccccCCCCCEEEEEeccCcccceecceEEeccCeEEEeeeEEEecccccccCcccccceeeCCCceEEeCHHH
Confidence 456788899999999999998643 356899999999999999999632211 01110
Q ss_pred EeEEec--CCCCCCCCceeEeeEEEEeCCCC----CCCCCcCceEEEEeCCCCCCCceeeecCCCCCCCCCCCeEEEEEe
Q psy11546 1339 LGTLRR--GTKLPSPYEQLRPISKIILHPQY----VDAGFINDISILKMKTPFSNYVRPICLPHPNTPLTDGTLCTVVGW 1412 (1634)
Q Consensus 1339 LG~~~~--~s~~~s~~~Q~~~V~~IiiHP~Y----n~~t~~nDIALLkL~~PfSd~VqPICLP~~~~~l~~G~~c~VvGW 1412 (1634)
+-.+.. .............|.++++--.- ......++++||+|+++++..+.|+|||........+....+.|+
T Consensus 120 l~~~~v~~~~~~~~~~~~~~~v~ka~il~~C~~~~~~~~~~~~~mIlEl~~~~~~~~~~~Cl~~~~~~~~~~~~~~~yg~ 199 (282)
T PF03761_consen 120 LSKIDVRCCNCFSNGKCFSIKVKKAYILNGCKKIKKNFNRPYSPMILELEEDFSKNVSPPCLADSSTNWEKGDEVDVYGF 199 (282)
T ss_pred hccEEEEeecccccCCcccceeEEEEEEecCCCcccccccccceEEEEEcccccccCCCEEeCCCccccccCceEEEeec
Confidence 000000 00001111223456666653221 234456899999999999999999999987776777777777777
Q ss_pred cccccCCCcCCCCcEEEEEeeeChhHHhhhccCCCcccCCCCeEEEeecCCCCCCCCCCCCCeEEEecCCCcEEEEEEEE
Q psy11546 1413 GQLFEIGRVFPDTLQEVQLPIISTAECRKRTLFLPLYRVTENMFCAGFERGGRDACLGDSGGPLMCQEPDGRWSLMGVTS 1492 (1634)
Q Consensus 1413 G~t~e~g~~~s~~Lq~v~VpIIs~eeC~~~~~~~~~~~It~sm~CAG~~~gg~daC~GDSGGPLVc~~~~GrW~LvGIvS 1492 (1634)
. ....+....+.+.....| +.........|.||+||||+-.. +|+|+|+||.+
T Consensus 200 ~--------~~~~~~~~~~~i~~~~~~------------------~~~~~~~~~~~~~d~Gg~lv~~~-~gr~tlIGv~~ 252 (282)
T PF03761_consen 200 N--------STGKLKHRKLKITNCTKC------------------AYSICTKQYSCKGDRGGPLVKNI-NGRWTLIGVGA 252 (282)
T ss_pred C--------CCCeEEEEEEEEEEeecc------------------ceeEecccccCCCCccCeEEEEE-CCCEEEEEEEc
Confidence 1 122355555555443222 11112345789999999999877 89999999999
Q ss_pred cCCCCCCCCCCeEEEEcCCchHHHHHh
Q psy11546 1493 NGYGCARANRPGVYTKVSNYIPWLYNN 1519 (1634)
Q Consensus 1493 ~G~gCg~~~~PgVYTrVs~YlDWI~~v 1519 (1634)
.+..-+.. ....|.+|..|.+=|-+.
T Consensus 253 ~~~~~~~~-~~~~f~~v~~~~~~IC~l 278 (282)
T PF03761_consen 253 SGNYECNK-NNSYFFNVSWYQDEICEL 278 (282)
T ss_pred cCCCcccc-cccEEEEHHHhhhhhccc
Confidence 77632221 267888888887755443
No 9
>smart00200 SEA Domain found in sea urchin sperm protein, enterokinase, agrin. Proposed function of regulating or binding carbohydrate sidechains.
Probab=99.15 E-value=1.3e-10 Score=119.50 Aligned_cols=92 Identities=27% Similarity=0.414 Sum_probs=75.9
Q ss_pred EEEEEEEEEe-----cCCccCcccCCcchHhHHHHHHHHHHHHHHHhhcCCcccccceeEEEEeeCCCEEEEEEEEEccc
Q psy11546 72 LIFDSSFRVT-----AGDSYNPSLENSTSNLYKEKSKRYKSMIEKLYNASVLSPAIKYCGVIGFKNGSLIVFYRIILDRR 146 (1634)
Q Consensus 72 ~yY~GSFrIt-----n~~~ys~dL~~~sS~~Fk~La~~IE~mV~~iF~~S~L~~~YikS~Vi~F~~gsViV~f~L~Fd~~ 146 (1634)
+.+.++|.++ +.. |+++|++++|.+|++|++++|+||+++|++|+|++.|++++|++|++|+|+|++.|.|
T Consensus 3 ~~~~~~~~~~~~~~~~i~-ys~~L~d~sS~~f~eL~~~ie~~l~~~f~~s~l~~~f~~~~V~~~~~gsv~v~~~l~f--- 78 (121)
T smart00200 3 QSFGVSLSVLSVEGENLQ-YSPSLEDPSSEEYQELVRDVEKLLEQIYGKTDLKPDFVGTEVIEFRNGSVVVDLGLLF--- 78 (121)
T ss_pred eEEEEEEEEEEEecccee-eChhhCCcccHHHHHHHHHHHHHHHHHHhcCccccceeEEEEEEEcCCcEEEEEEEEe---
Confidence 4555655555 457 9999999999999999999999999999999999999999999999999999999999
Q ss_pred ccccCcccHHHHHHHHHHHHHhhc
Q psy11546 147 KIPRSIGNVEEVVKNILVDEITSR 170 (1634)
Q Consensus 147 ~~p~s~~~v~~kI~~iL~q~L~~n 170 (1634)
+.+... ...|++.|.+.++..
T Consensus 79 ~~~~~~---~~~V~~~~~~~~~~~ 99 (121)
T smart00200 79 NEGVTN---GQDVEEDLLQVIKQA 99 (121)
T ss_pred ccCCCc---cchhHHHHHHHHHHH
Confidence 555331 345666666665543
No 10
>smart00202 SR Scavenger receptor Cys-rich. The sea ucrhin egg peptide speract contains 4 repeats of SR domains that contain 6 conserved cysteines. May bind bacterial antigens in the protein MARCO.
Probab=98.70 E-value=2.7e-08 Score=98.50 Aligned_cols=90 Identities=17% Similarity=0.279 Sum_probs=71.6
Q ss_pred CcceEEEEecCcccccccCcchhhhhhhchhhhcchhHHhhccCCCCCccceeeeccCC--CCCcceEEEeccccccchh
Q psy11546 1157 DSGFLMIQKQGQWGKLCMNQINNFIMKTLKWKISDLGKAICKSMTFRDLNEIEAVADPS--TDDSIYYELSMNAINSSAA 1234 (1634)
Q Consensus 1157 ~eG~VeV~~~GtWg~VCsd~~~~~~~~~~~~~isdlA~vICRqLG~~~~~sve~v~~~~--~~~~~y~~vscn~~e~s~s 1234 (1634)
.+|+|+++.+|.|++||.++|+.. .|.++||+|||+............ ..+..+..+.|.+.+.++.
T Consensus 10 ~~G~leV~~~~~Wg~VC~~~w~~~-----------~A~vvCrqlG~~~~~~~~~~~~~~~~~~~~~~~~~~C~G~E~sl~ 78 (101)
T smart00202 10 CEGRVEVYHNGQWGTVCDDGWDLR-----------DANVVCRQLGFGGALSASGSAYFGPGSGPIWLDNVRCTGTEASLS 78 (101)
T ss_pred CeEEEEEEECCEEeCeeCCCCChh-----------HHHHHHHHcCCCCcccceeccccCCCCCCeEEEcceEcCCCCCHH
Confidence 789999999999999999999866 789999999998866554321212 2245566889999999888
Q ss_pred cccccccccccCCCCceeeeecc
Q psy11546 1235 TKSALLFQKTKCSQKQVVAVNCS 1257 (1634)
Q Consensus 1235 ~c~~l~~~~~~C~s~~vV~V~C~ 1257 (1634)
.|....+....|.+..++.|.|.
T Consensus 79 ~C~~~~~~~~~C~~~~~~~v~C~ 101 (101)
T smart00202 79 DCPHSGWGSHNCSHGEDAGVVCS 101 (101)
T ss_pred HCCCCCCCCcCCCCCCcEEEEEC
Confidence 88876666666999999999883
No 11
>PF00057 Ldl_recept_a: Low-density lipoprotein receptor domain class A This prints entry is specific to LDL receptor; InterPro: IPR002172 The low-density lipoprotein receptor (LDLR) is the major cholesterol-carrying lipoprotein of plasma, acting to regulate cholesterol homeostasis in mammalian cells. The LDL receptor binds LDL and transports it into cells by acidic endocytosis. In order to be internalized, the receptor-ligand complex must first cluster into clathrin-coated pits. Once inside the cell, the LDLR separates from its ligand, which is degraded in the lysosomes, while the receptor returns to the cell surface []. The internal dissociation of the LDLR with its ligand is mediated by proton pumps within the walls of the endosome that lower the pH. The LDLR is a multi-domain protein, containing: The ligand-binding domain contains seven or eight 40-amino acid LDLR class A (cysteine-rich) repeats, each of which contains a coordinated calcium ion and six cysteine residues involved in disulphide bond formation []. Similar domains have been found in other extracellular and membrane proteins []. The second conserved region contains two EGF repeats, followed by six LDLR class B (YWTD) repeats, and another EGF repeat. The LDLR class B repeats each contain a conserved YWTD motif, and is predicted to form a beta-propeller structure []. This region is critical for ligand release and recycling of the receptor []. The third domain is rich in serine and threonine residues and contains clustered O-linked carbohydrate chains. The fourth domain is the hydrophobic transmembrane region. The fifth domain is the cytoplasmic tail that directs the receptor to clathrin-coated pits. LDLR is closely related in structure to several other receptors, including LRP1, LRP1b, megalin/LRP2, VLDL receptor, lipoprotein receptor, MEGF7/LRP4, and LRP8/apolipoprotein E receptor2); these proteins participate in a wide range of physiological processes, including the regulation of lipid metabolism, protection against atherosclerosis, neurodevelopment, and transport of nutrients and vitamins []. This entry represents the LDLR class A (cyateine-rich) repeat, which contains 6 disulphide-bound cysteines and a highly conserved cluster of negatively charged amino acids, of which many are clustered on one face of the module []. In LDL receptors, the class A domains form the binding site for LDL and calcium. The acidic residues between the fourth and sixth cysteines are important for high-affinity binding of positively charged sequences in LDLR's ligands. The repeat consists of a beta-hairpin structure followed by a series of beta turns. In the absence of calcium, LDL-A domains are unstructured; the bound calcium ion imparts structural integrity. Following these repeats is a 350 residue domain that resembles part of the epidermal growth factor (EGF) precursor. Numerous familial hypercholestorolemia mutations of the LDL receptor alter the calcium coordinating residue of LDL-A domains or other crucial scaffolding residues. ; GO: 0005515 protein binding; PDB: 2I1P_A 3OJY_A 4E0S_B 3T5O_A 4A5W_B 1JRF_A 1K7B_A 1V9U_5 3DPR_E 2KNY_A ....
Probab=98.69 E-value=1e-08 Score=84.61 Aligned_cols=36 Identities=50% Similarity=1.198 Sum_probs=33.8
Q ss_pred CCCCCCceecCCCceecCCcccCCCCCCCCCCCCCCC
Q psy11546 992 ECEMNSSFRCGNGECVSIGSKCNQLVDCADGSDEKNC 1028 (1634)
Q Consensus 992 ~C~~~~~F~C~nG~CIp~~~~CDG~~DC~DgSDE~~C 1028 (1634)
.|. .++|+|.++.||+..|+|||+.||.|||||.+|
T Consensus 2 ~C~-~~~f~C~~~~CI~~~~~CDg~~DC~dgsDE~~C 37 (37)
T PF00057_consen 2 TCP-PGEFRCGNGQCIPKSWVCDGIPDCPDGSDEQNC 37 (37)
T ss_dssp SSS-TTEEEETTSSEEEGGGTTSSSCSSSSSTTTSSH
T ss_pred cCc-CCeeEcCCCCEEChHHcCCCCCCCCCCcccccC
Confidence 577 799999999999999999999999999999876
No 12
>cd00112 LDLa Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about 40 amino acids are present in the N-terminal of this multidomain membrane protein; other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement; the binding of calcium is required for in vitro formation of the native disulfide isomer and is necessary in establishing and maintaining the modular structure
Probab=98.68 E-value=9e-09 Score=83.87 Aligned_cols=35 Identities=49% Similarity=1.106 Sum_probs=32.6
Q ss_pred CCCCCceecCCCceecCCcccCCCCCCCCCCCCCCC
Q psy11546 993 CEMNSSFRCGNGECVSIGSKCNQLVDCADGSDEKNC 1028 (1634)
Q Consensus 993 C~~~~~F~C~nG~CIp~~~~CDG~~DC~DgSDE~~C 1028 (1634)
|. .++|+|.+|.||+..|+|||+.||+|||||.+|
T Consensus 1 C~-~~~f~C~~~~Ci~~~~~CDg~~DC~dgsDE~~C 35 (35)
T cd00112 1 CP-PNEFRCANGRCIPSSWVCDGEDDCGDGSDEENC 35 (35)
T ss_pred CC-CCeEEcCCCCeeCHHHcCCCccCCCCCcccccC
Confidence 45 589999999999999999999999999999876
No 13
>cd07888 CRD_corin_2 One of two cysteine-rich domains of the corin protein, a type II transmembrane serine protease . The cysteine-rich domain (CRD) is an essential component of corin, a type II transmembrane serine protease which functions as the convertase of the pro-atrial natriuretic peptide (pro-ANP) in the heart. Corin contains two CRDs in its extracellular region, which play an important role in recognition of the physiological substrate, pro-ANP. This model characterizes the second (C-terminal) CRD.
Probab=98.57 E-value=2.2e-08 Score=103.07 Aligned_cols=103 Identities=13% Similarity=0.200 Sum_probs=70.1
Q ss_pred cccccchheeecccCCCcccCCCCcCCCCCCcchhhHHHH--HHHHHHHHHhhcccccccCCcchHHHHHHHhhcccccc
Q psy11546 855 EADSIEKKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMA--QNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHLKH 932 (1634)
Q Consensus 855 ~c~~i~~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~--~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~p~ 932 (1634)
+|++|++++|..|+||.|.|||+++|.+|++ |.+. ..+|..+ +. ..|+++|..|+|++|+|.
T Consensus 1 ~CepI~~~~C~~l~Yn~T~~PN~lgH~sq~e-----a~~~~~~~~f~pL------v~-----~~Csp~l~~FlCsl~~P~ 64 (122)
T cd07888 1 QCEPITLELCMNLPYNTTRYPNYLGHRTQKE-----ASISWESSLFPAL------VQ-----TNCYKYLMFFACTILVPK 64 (122)
T ss_pred CCccCCHhhhCCCCcCcccCCcccCCCCHHH-----HHHhhhHHHHHhH------Hh-----cCCChhHHHHHHhccCcc
Confidence 5999999999999999999999999999998 5443 3345544 33 389999999999999995
Q ss_pred cccCCCCCCCCCCCCCCCceeeccC-----CceeeeeeecccccCCCCCcC
Q psy11546 933 THQHNPDTNISTKAIPSSFNFNQVN-----GIPILTKVYTKVSKTNSTSKE 978 (1634)
Q Consensus 933 ~~~~~p~~~i~t~~ip~~f~~~~v~-----g~~~~~~v~~~~~~c~~~s~~ 978 (1634)
...-.. . .--|++-.|..|. -+....-.|+..-.|.++..+
T Consensus 65 C~~~~~-~----~i~PCRslCe~vr~~C~~~m~~fg~~WP~~l~C~~fP~~ 110 (122)
T cd07888 65 CDPVTQ-Q----RIPPCRSLCRNSKERCESVLGIVGLQWPEDTDCAQFPEE 110 (122)
T ss_pred ccCCCC-C----CCCCcHHHHHHHHHHHHHHHHHhCCCCCCcCccccCCCC
Confidence 311000 1 1125666666511 111112247777788877655
No 14
>cd07441 CRD_SFRP3 Cysteine-rich domain of the secreted frizzled-related protein 3 (SFRP3, alias FRZB), a Wnt antagonist. The cysteine-rich domain (CRD) is an essential part of the secreted frizzled-related protein 3 (SFRP3, alias FRZB), which plays important roles in embryogenesis and postnatal development as an antagonist of Wnt proteins, key players in a number of fundamental cellular processes. SFRPs antagonize the activation of Wnt signaling by binding to the CRD domains of frizzled proteins (Fz), thereby preventing Wnt proteins from binding to these receptors. SFRPs are also known to have functions unrelated to Wnt, as enhancers of procollagen cleavage by the TLD proteinases. SFRPs and Fz proteins both contain CRD domains, but SFRPs lack the seven-pass transmembrane domain which is an integral part of Fzs. SFRP3 regulates Wnt signaling activity in bone development and homeostasis. It is also involved in the control of planar cell polarity.
Probab=98.55 E-value=2.5e-08 Score=102.95 Aligned_cols=63 Identities=11% Similarity=0.082 Sum_probs=53.5
Q ss_pred ccccccchheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhcccccc
Q psy11546 854 SEADSIEKKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHLKH 932 (1634)
Q Consensus 854 ~~c~~i~~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~p~ 932 (1634)
..||+|+.++|..|+||.|.|||+++|.+|++ +......|..+ +. +.|+++|..|+|++|+|.
T Consensus 2 a~CepI~~~~C~~l~Yn~T~~PN~lgH~~q~e-----a~~~~~~f~pL-------v~----~~Csp~l~~FLCS~~~P~ 64 (126)
T cd07441 2 ASCEPVRIPMCKSMPWNMTKMPNHLHHSTQAN-----AVLAIEQFEGL-------LG----TQCSPDLLFFLCAMYAPI 64 (126)
T ss_pred CcccccChhhhCCCCcCcccCCcccCCCCHHH-----HHHHHHHHHHH-------hc----CCCCccchhHHhhccCcc
Confidence 35999999999999999999999999999999 66654455444 33 489999999999999994
No 15
>cd07454 CRD_LIN_17 Cysteine-rich domain (CRD) of LIN_17. A cysteine-rich domain (CRD) is an essential component of a number of cell surface receptors, which are involved in multiple signal transduction pathways, particularly in modulating the activity of the Wnt proteins, which play a fundamental role in the early development of metazoans. CRD is also found in secreted frizzled related proteins (SFRPs), which lack the transmembrane segment found in the frizzled protein. The CRD domain is also present in the alpha-1 chain of mouse type XVIII collagen, in carboxypeptidase Z, several receptor tyrosine kinases, and the mosaic transmembrane serine protease corin. The CRD domain is well conserved in metazoans - 10 frizzled proteins have been identified in mammals, 4 in Drosophila and 3 in Caenorhabditis elegans. CRD domains have also been identified in multiple tandem copies in a Dictyostelium discoideum protein. Very little is known about the mechanism by which CRD domains interact with the
Probab=98.55 E-value=5.1e-08 Score=100.69 Aligned_cols=65 Identities=6% Similarity=-0.047 Sum_probs=54.0
Q ss_pred cccccccchheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhcccccc
Q psy11546 853 TSEADSIEKKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHLKH 932 (1634)
Q Consensus 853 t~~c~~i~~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~p~ 932 (1634)
++.|++|++++|..|+||.|.|||+++|.+|++ +......|.. ++. .+|+++|..|+|++|+|.
T Consensus 2 ~~~C~Pi~~~~C~~l~Yn~T~~PN~lgh~~q~e-----a~~~~~~f~p-------Lv~----~~C~p~l~~FlCsl~~P~ 65 (124)
T cd07454 2 KGKCIPIDIELCKDLPYNYTYFPNTILHNDQHT-----LQTHTEHFKP-------LMK----TKCHPHIHFFICSVFAPM 65 (124)
T ss_pred CCccccCChhhhCCCCcCcccCCcccCCCCHHH-----HHHHHHHHHH-------HHh----cCCChhHHHHhhcccccc
Confidence 578999999999999999999999999999988 5544333444 443 489999999999999995
Q ss_pred c
Q psy11546 933 T 933 (1634)
Q Consensus 933 ~ 933 (1634)
.
T Consensus 66 C 66 (124)
T cd07454 66 C 66 (124)
T ss_pred c
Confidence 3
No 16
>cd07462 CRD_FZ10 Cysteine-rich Wnt-binding domain (CRD) of the frizzled 10 (Fz10) receptor. The cysteine-rich domain (CRD) is an essential extracellular portion of the frizzled 10 (Fz10) receptor, and is required for binding Wnt proteins, which play fundamental roles in many aspects of early development, such as cell and tissue polarity, neural synapse formation, and the regulation of proliferation. Fz proteins serve as Wnt receptors for multiple signal transduction pathways, including both beta-catenin dependent and -independent cellular signaling, as well as the planar cell polarity pathway and Ca(2+) modulating signaling pathway. CRD containing Fzs have been found in diverse species from amoebas to mammals. 10 different frizzled proteins are found in vertebrata. The cellular functon of Fz10 is unknown.
Probab=98.54 E-value=3.5e-08 Score=102.15 Aligned_cols=106 Identities=9% Similarity=0.054 Sum_probs=73.3
Q ss_pred cccccccchheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhcccccc
Q psy11546 853 TSEADSIEKKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHLKH 932 (1634)
Q Consensus 853 t~~c~~i~~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~p~ 932 (1634)
++.|++|+.++|..|+||.|.|||+++|.+|++ +.+....|.. ++. ..|+++|..|+|++|+|.
T Consensus 2 ~~~C~pI~~~~C~~l~Yn~T~~PN~lgH~~q~e-----a~~~~~~f~p-------Lv~----~~C~p~l~~FlCsl~~P~ 65 (127)
T cd07462 2 EGRCQPIEIPMCKDIGYNMTRMPNLMGHENQRE-----AAIQLHEFAP-------LVE----YGCHSHLKFFLCSLYAPM 65 (127)
T ss_pred CCeeeecChhhhCCCCcCceeCCcccCCcCHHH-----HHHHHHHHHH-------Hhc----cCCChhhHHHHHhccccc
Confidence 578999999999999999999999999999988 6554334444 333 499999999999999995
Q ss_pred cccCCCCCCCCCCCCCCCceeeccCC-----ceeeeeeecccccCCCCCcCC
Q psy11546 933 THQHNPDTNISTKAIPSSFNFNQVNG-----IPILTKVYTKVSKTNSTSKER 979 (1634)
Q Consensus 933 ~~~~~p~~~i~t~~ip~~f~~~~v~g-----~~~~~~v~~~~~~c~~~s~~~ 979 (1634)
.... .+..--|++-.|+.|.. +....-.|+....|.+.....
T Consensus 66 C~~~-----~~~~i~PCRslCe~vr~~C~~~m~~~g~~WP~~l~C~~fP~~~ 112 (127)
T cd07462 66 CTEQ-----VSTPIPACRVMCEQARLKCSPIMEQFNFKWPDSLDCSKLPNKN 112 (127)
T ss_pred cCCC-----CCCcCCccHHHHHHHHHHHHHHHHHhCCCCCCcCCcccCCCCC
Confidence 3211 11112256666766211 111122477777887776554
No 17
>cd07442 CRD_SFRP4 Cysteine-rich domain of the secreted frizzled-related protein 4 (SFRP4), a Wnt antagonist. The cysteine-rich domain (CRD) is an essential part of the secreted frizzled-related Protein 4 (SFRP4), which regulates the activity of Wnt proteins, key players in a number of fundamental cellular processes such as embryogenesis and postnatal development. SFRPs antagonize the activation of Wnt signaling by binding to the CRDs domains of frizzled (Fz) proteins, thereby preventing Wnt proteins from binding to these receptors. SFRPs are also known to have functions unrelated to Wnt, as enhancers of procollagen cleavage by the TLD proteinases. SFRPs and Fz proteins both contain CRD domains, but SFRPs lack the seven-pass transmembrane domain which is an integral part of Fzs.
Probab=98.53 E-value=3.3e-08 Score=102.28 Aligned_cols=62 Identities=11% Similarity=0.069 Sum_probs=52.8
Q ss_pred cccccchheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhcccccc
Q psy11546 855 EADSIEKKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHLKH 932 (1634)
Q Consensus 855 ~c~~i~~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~p~ 932 (1634)
.||.|+.++|..|+||.|.|||+++|.+|++ |.+....|.. ++. +.|+++|..|+|++|+|.
T Consensus 4 ~CepI~i~~C~~l~Yn~T~~PN~lgH~~q~e-----a~~~~~~f~p-------Lv~----~~Csp~l~~FLCS~~~P~ 65 (127)
T cd07442 4 PCEAVRIPMCRHMPWNITRMPNHLHHSTQEN-----AVLAIEQYEE-------LVD----TGCSPVLPFFLCAMYAPI 65 (127)
T ss_pred cccccChhhhCCCCcCcccCcCccCCCCHHH-----HHHHHHHHHH-------HHc----CCCCcchHHHHHhhcCCc
Confidence 4999999999999999999999999999999 6655444444 443 388999999999999994
No 18
>cd07456 CRD_FZ5_like Cysteine-rich Wnt-binding domain (CRD) of receptors similar to frizzled 5. The cysteine-rich domain (CRD) is an essential extracellular portion of the frizzled 5 (Fz5) and frizzled 8 (Fz8) receptors, and similar proteins. This domain is required for binding Wnt proteins, which play fundamental roles in many aspects of early development, such as cell and tissue polarity, neural synapse formation, and the regulation of proliferation. Fz proteins serve as Wnt receptors for multiple signal transduction pathways, including both beta-catenin dependent and -independent cellular signaling, as well as the planar cell polarity pathway and Ca(2+) modulating signaling pathway. The CRD domain is well conserved in metazoans - 10 frizzled proteins have been identified in mammals, 4 in Drosophila and 3 in Caenorhabditis elegans. Very little is known about the mechanism by which CRD domains interact with their ligands. The domain contains 10 conserved cysteines.
Probab=98.53 E-value=5.4e-08 Score=100.07 Aligned_cols=62 Identities=10% Similarity=0.029 Sum_probs=52.3
Q ss_pred ccccchheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhccccccc
Q psy11546 856 ADSIEKKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHLKHT 933 (1634)
Q Consensus 856 c~~i~~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~p~~ 933 (1634)
||+|+.++|..|+||.|.|||++||.+|++ +.+....+.. ++. +.|+++|..|+|++|+|..
T Consensus 2 Cepi~~~~C~~l~Yn~T~~PN~lgh~~q~e-----a~~~~~~~~p-------Lv~----~~Csp~l~~FlCsl~~P~C 63 (120)
T cd07456 2 CEEITIPMCKGIGYNMTYMPNQFNHDTQEE-----AGLEVHQFWP-------LVE----IQCSPDLKFFLCSMYTPIC 63 (120)
T ss_pred ccccChhhhCCCCCCcccCCCcCCCcCHHH-----HHHHHHHHHH-------HHh----CCCChhHHHHhHhccCccc
Confidence 999999999999999999999999999988 6555444444 333 4899999999999999954
No 19
>cd07445 CRD_corin_1 One of two cysteine-rich domains of the corin protein, a type II transmembrane serine protease . The cysteine-rich domain (CRD) is an essential component of corin, a type II transmembrane serine protease which functions as the convertase of the pro-atrial natriuretic peptide (pro-ANP) in the heart. Corin contains two CRDs in its extracellular region, which play an important role in recognition of the physiological substrate, pro-ANP. This model characterizes the first (N-terminal) CRD.
Probab=98.52 E-value=3.6e-08 Score=101.95 Aligned_cols=108 Identities=7% Similarity=0.038 Sum_probs=70.4
Q ss_pred CcccccccchheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhccccc
Q psy11546 852 NTSEADSIEKKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHLK 931 (1634)
Q Consensus 852 ~t~~c~~i~~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~p 931 (1634)
+|+.||.|+.++|..|+||.|.|||+++|++ ++ +.+...+| .-++ .++|+++|..|+|++|.|
T Consensus 1 ~~~~CepI~i~~C~~l~YN~T~~PN~lgh~~-~e-----a~~~l~~f-------~pLv----~~~Csp~l~~FLCSlyaP 63 (130)
T cd07445 1 NTSACMNITHSQCQMLPYHSTLKPSLLSVKN-ME-----MEKFLKFF-------SYLH----RLSCYQHIMLFGCSLALP 63 (130)
T ss_pred CCCccccCChhhhCCCCCccccCCccccccC-HH-----HHHHHHhh-------cccc----ccCcChhhHHHhhhccCc
Confidence 5899999999999999999999999999954 34 33332222 2233 369999999999999999
Q ss_pred c-cccCCCCCCCCCCCCCCCceeeccC-----CceeeeeeecccccCCCCCcCCC
Q psy11546 932 H-THQHNPDTNISTKAIPSSFNFNQVN-----GIPILTKVYTKVSKTNSTSKERN 980 (1634)
Q Consensus 932 ~-~~~~~p~~~i~t~~ip~~f~~~~v~-----g~~~~~~v~~~~~~c~~~s~~~~ 980 (1634)
. +.+.. ....--|++..|..|. -+....-.|+..-.|.++.+..+
T Consensus 64 ~C~~~~~----~~~~i~PCRslCe~vr~~C~~~m~~fg~~WPe~L~C~~fP~~~~ 114 (130)
T cd07445 64 ECISDGD----DRHGLLPCRSFCEAAKEGCEPVLGMVNASWPDFLRCSQFRNNTE 114 (130)
T ss_pred cccCCCC----CCCcCCCCHHHHHHHHHHHHHHHHHhCCCCCCcCCccccCCCCC
Confidence 4 32211 0111125666666511 11111224888888887765543
No 20
>cd07466 CRD_FZ7 Cysteine-rich Wnt-binding domain (CRD) of the frizzled 7 (Fz7) receptor. The cysteine-rich domain (CRD) is an essential extracellular portion of the frizzled 7 (Fz7) receptor, and is required for binding Wnt proteins, which play fundamental roles in many aspects of early development, such as cell and tissue polarity, neural synapse formation, and the regulation of proliferation. Fz proteins serve as Wnt receptors for multiple signal transduction pathways, including both beta-catenin dependent and -independent cellular signaling, as well as the planar cell polarity pathway and Ca(2+) modulating signaling pathway. CRD containing Fzs have been found in diverse species from amoebas to mammals. 10 different frizzled proteins are found in vertebrata. Xenopus Fz7 is important in Wnt/beta-catenin signaling pathways controlling the transcriptional activation of target genes Siamois and Xnr3 in the animal caps of late blastula.
Probab=98.51 E-value=7.2e-08 Score=99.67 Aligned_cols=65 Identities=11% Similarity=0.042 Sum_probs=54.2
Q ss_pred cccccccchheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhcccccc
Q psy11546 853 TSEADSIEKKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHLKH 932 (1634)
Q Consensus 853 t~~c~~i~~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~p~ 932 (1634)
.+.||+|+.++|..|+||.|.|||+++|.+|++ |......|.. ++. +.|+++|..|+|++|+|.
T Consensus 2 ~~~CepI~~~~C~~l~Yn~T~~PN~lgH~~q~e-----a~~~~~~f~p-------Lv~----~~Csp~l~~FlCsl~~P~ 65 (125)
T cd07466 2 HGFCQPISIPLCTDIAYNQTIMPNLLGHTNQED-----AGLEVHQFYP-------LVK----VQCSPELKFFLCSMYAPV 65 (125)
T ss_pred CCccccCCHHHhCCCCCCcccCCcccCCCCHHH-----HHHHHHHHHH-------HHh----CCCChhHHHHhhhccCCc
Confidence 478999999999999999999999999999999 6544344433 333 489999999999999995
Q ss_pred c
Q psy11546 933 T 933 (1634)
Q Consensus 933 ~ 933 (1634)
.
T Consensus 66 C 66 (125)
T cd07466 66 C 66 (125)
T ss_pred c
Confidence 3
No 21
>cd07464 CRD_FZ2 Cysteine-rich Wnt-binding domain (CRD) of the frizzled 2 (Fz2) receptor. The cysteine-rich domain (CRD) is an essential extracellular portion of the frizzled 2 (Fz2) receptor, and is required for binding Wnt proteins, which play fundamental roles in many aspects of early development, such as cell and tissue polarity, neural synapse formation, and the regulation of proliferation. Fz proteins serve as Wnt receptors for multiple signal transduction pathways, including both beta-catenin dependent and -independent cellular signaling, as well as the planar cell polarity pathway and Ca(2+) modulating signaling pathway. CRD containing Fzs have been found in diverse species from amoebas to mammals. 10 different frizzled proteins are found in vertebrata. Fz2 is involved in the Wnt/beta-catenin signaling pathway and in the activation of protein kinase C and calcium/calmodulin-dependent protein kinase (CaM kinase).
Probab=98.48 E-value=7.4e-08 Score=99.78 Aligned_cols=65 Identities=11% Similarity=0.033 Sum_probs=54.2
Q ss_pred cccccccchheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhcccccc
Q psy11546 853 TSEADSIEKKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHLKH 932 (1634)
Q Consensus 853 t~~c~~i~~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~p~ 932 (1634)
.+.||+|++++|..|+||.|.|||+++|.+|++ +......|.. ++. ..|+++|..|+|++|+|.
T Consensus 2 ~~~CepI~~~~C~~l~Yn~T~~PN~lgH~sq~e-----a~~~~~~f~p-------Lv~----~~Csp~l~~FlCSl~~P~ 65 (127)
T cd07464 2 HGFCQPISIPLCTDIAYNQTIMPNLLGHTNQED-----AGLEVHQFYP-------LVK----VQCSLELRFFLCSMYAPV 65 (127)
T ss_pred CCccccCChhhhCCCCCCcccCCcccCCCCHHH-----HHHHHHHHHH-------HHh----cCCChhHHHHHhhccCCc
Confidence 467999999999999999999999999999998 5544444444 333 489999999999999995
Q ss_pred c
Q psy11546 933 T 933 (1634)
Q Consensus 933 ~ 933 (1634)
+
T Consensus 66 C 66 (127)
T cd07464 66 C 66 (127)
T ss_pred c
Confidence 3
No 22
>cd07460 CRD_FZ5 Cysteine-rich Wnt-binding domain (CRD) of the frizzled 5 (Fz5) receptor.proteins. The cysteine-rich domain (CRD) is an essential extracellular portion of the frizzled 5 (Fz5) receptor, and is required for binding Wnt proteins, which play fundamental roles in many aspects of early development, such as cell and tissue polarity, neural synapse formation, and the regulation of proliferation. Fz proteins serve as Wnt receptors for multiple signal transduction pathways, including both beta-catenin dependent and -independent cellular signaling, as well as the planar cell polarity pathway and Ca(2+) modulating signaling pathway. CRD containing Fzs have been found in diverse species from amoebas to mammals. 10 different frizzled proteins are found in vertebrata. Fz5 plays critical regulating roles in the yolk sac and placental angiogenesis, in the maturation of the Paneth cell phenotype, in governing the neural potential of progenitors in the developing retina, and in neuronal s
Probab=98.47 E-value=9.5e-08 Score=99.00 Aligned_cols=103 Identities=8% Similarity=-0.012 Sum_probs=69.4
Q ss_pred cccccchheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhcccccccc
Q psy11546 855 EADSIEKKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHLKHTH 934 (1634)
Q Consensus 855 ~c~~i~~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~p~~~ 934 (1634)
.||+|+.++|..|+||.|.|||+++|.+|++ +.+....|.. ++. ++|+++|..|+|++|+|...
T Consensus 4 ~CepI~~~~C~~l~Yn~T~~PN~lgh~~q~e-----a~~~~~~f~p-------Lv~----~~Csp~l~~FlCS~~~P~C~ 67 (127)
T cd07460 4 VCQEITVPMCKGIGYNLTYMPNQFNHDTQDE-----AGLEVHQFWP-------LVE----IQCSPDLRFFLCSMYTPICL 67 (127)
T ss_pred EecccChhhhCCCCCCcccCCCccCCCCHHH-----HHHHHHHHHH-------HHh----CCCChhHHHHhhhccCcccc
Confidence 4999999999999999999999999999998 6655444444 343 48999999999999999532
Q ss_pred cCCCCCCCCCCCCCCCceeeccCC-----ceeeeeeecccccCCCCCcC
Q psy11546 935 QHNPDTNISTKAIPSSFNFNQVNG-----IPILTKVYTKVSKTNSTSKE 978 (1634)
Q Consensus 935 ~~~p~~~i~t~~ip~~f~~~~v~g-----~~~~~~v~~~~~~c~~~s~~ 978 (1634)
.- ....-.|++-.|+.|.. +....-.|+..-.|.+....
T Consensus 68 ~~-----~~~~i~PCRslCe~vr~~C~~~m~~fg~~WP~~l~C~~fP~~ 111 (127)
T cd07460 68 PD-----YRKPLPPCRSVCERAKAGCSPLMRQYGFAWPERMNCDRLPVL 111 (127)
T ss_pred CC-----CCCcCCccHHHHHHHHHHHHHHHHHhCCCCCCcCChhhCCCC
Confidence 10 11112256666665110 11112236666677665443
No 23
>KOG1215|consensus
Probab=98.46 E-value=1.3e-07 Score=125.68 Aligned_cols=38 Identities=39% Similarity=0.823 Sum_probs=34.4
Q ss_pred CCCCCCCceecC--CCceecCCcccCCCCCCCCCCCCCCCC
Q psy11546 991 SECEMNSSFRCG--NGECVSIGSKCNQLVDCADGSDEKNCS 1029 (1634)
Q Consensus 991 ~~C~~~~~F~C~--nG~CIp~~~~CDG~~DC~DgSDE~~C~ 1029 (1634)
..|. ..+|.|. +++|||..|+|||..||+||+||..|.
T Consensus 136 ~~~~-~~~~~c~~~~~~Cip~~~~cd~~~~C~dg~de~~~~ 175 (877)
T KOG1215|consen 136 SHCC-LDKFSCRTGSCKCIPGDWLCDGEADCPDGSDELNCA 175 (877)
T ss_pred cccc-CCCCCCcCccccCCCCceeCCCCCccccchhhhccc
Confidence 4555 6889998 899999999999999999999999986
No 24
>cd07450 CRD_FZ6 Cysteine-rich Wnt-binding domain (CRD) of the frizzled 6 (Fz6) receptor. The cysteine-rich domain (CRD) is an essential extracellular portion of the frizzled 6 (Fz6) receptor, and is required for binding Wnt proteins, which play fundamental roles in many aspects of early development, such as cell and tissue polarity, neural synapse formation, and the regulation of proliferation. Fz proteins serve as Wnt receptors for multiple signal transduction pathways, including both beta-catenin dependent and -independent cellular signaling, as well as the planar cell polarity pathway and Ca(2+) modulating signaling pathway. CRD containing Fzs have been found in diverse species from amoebas to mammals. 10 different frizzled proteins are found in vertebrata. Frizzled 6 (Fz6) is expressed in the skin and hair follicles and controls hair patterning in mammals using a Fz-dependent tissue polarity system, which is similar to the one that patterns the Drosophila cuticle.
Probab=98.46 E-value=5.3e-08 Score=100.77 Aligned_cols=101 Identities=10% Similarity=0.054 Sum_probs=70.2
Q ss_pred cccccchheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhcccccccc
Q psy11546 855 EADSIEKKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHLKHTH 934 (1634)
Q Consensus 855 ~c~~i~~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~p~~~ 934 (1634)
.||+|+.++|+.|+||.|.|||+++|.+|++ |...... |.-++. ++|+++|..|+|++|+|.+.
T Consensus 4 ~CepI~~~~C~~l~YN~T~~PN~lgH~~q~e-----a~~~~~~-------f~pLv~----~~Csp~l~~FLCS~~~P~C~ 67 (127)
T cd07450 4 TCEPITVPRCLKMPYNMTFFPNLMGHYDQDI-----AAVEMEP-------FLPLAN----LRCSPNVHTFLCQAFVPTCT 67 (127)
T ss_pred eeeeCChhhcCCCCCCcccCCcccCCcCHHH-----HHHHHHH-------HHhHhc----cCCChhhhhheeeeeccccc
Confidence 5999999999999999999999999999988 5444333 333444 48999999999999999532
Q ss_pred cCCCCCCCCCCCCCCCceeeccCC-----ceeeeeeecccccCCCCCc
Q psy11546 935 QHNPDTNISTKAIPSSFNFNQVNG-----IPILTKVYTKVSKTNSTSK 977 (1634)
Q Consensus 935 ~~~p~~~i~t~~ip~~f~~~~v~g-----~~~~~~v~~~~~~c~~~s~ 977 (1634)
.. .....|++-.|..|.. +....-.|+..-.|.+..+
T Consensus 68 ~~------~~~i~PCRslCe~vr~~C~~~m~~fg~~WP~~L~C~~fP~ 109 (127)
T cd07450 68 EQ------IHVVRPCRELCEKVYSDCKKLIDTFGISWPEELECDRLQY 109 (127)
T ss_pred CC------CCcCCCCHHHHHHHHHHHHHHHHHhCCCCCCcCCcccCCC
Confidence 11 0112466666766211 1222335777777776643
No 25
>cd07447 CRD_Carboxypeptidase_Z Cysteine-rich domain of carboxypeptidase Z, a member of the carboxypeptidase E family. The cysteine-rich-domain (CRD) is an essential part of carboxypeptidase Z, a member of the carboxypeptidase E family of metallocarboxypeptidases. This is a group of Zn-dependent enzymes implicated in the intra- and extracellular processing of proteins. Carboxypeptidase Z removes C-terminal basic amino acid residues from its substrates, particularly arginine. The CRD acts as a ligand-binding domain for Wnts involved in developmental processes. CPZ binds and may process Wnt-4, CPZ has also been found to enhance the induction of the homeobox gene Cdx1. During vertebrate embryogenesis, the CRD of CPZ upregulates Pax3, a Wnt reporter gene essential for patterning of somites and limb development.
Probab=98.45 E-value=7.3e-08 Score=99.96 Aligned_cols=107 Identities=6% Similarity=-0.017 Sum_probs=70.7
Q ss_pred ccccccchheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhccccccc
Q psy11546 854 SEADSIEKKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHLKHT 933 (1634)
Q Consensus 854 ~~c~~i~~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~p~~ 933 (1634)
..|+.|++++|..|+||.|.|||+++|.+|++ +..+... +++..|.-++. ..|+++|..|+|++|+|..
T Consensus 2 ~~CepI~~~~C~~l~Yn~T~~PN~lgH~sq~~-----~~~~~~~--~~l~~f~pLv~----~~C~p~l~~FLCSl~~P~C 70 (128)
T cd07447 2 ATCTDLLLSYCSDVSYTQTTFPNLLGHRSREV-----TEAGAEY--LLLSVLHGLLG----GECNPDIRLLGCSVLAPRC 70 (128)
T ss_pred CcccCCChhhcCCCCCCcccCCcccCCCCHHH-----HHhhhhH--hHHHHHhhhhh----cCcChhHHhHhhccccccc
Confidence 35999999999999999999999999999876 3322111 12344444554 5899999999999999953
Q ss_pred ccCCCCCCCCCCCCCCCceeeccCC-----ceeeeeeecccccCCCCCcC
Q psy11546 934 HQHNPDTNISTKAIPSSFNFNQVNG-----IPILTKVYTKVSKTNSTSKE 978 (1634)
Q Consensus 934 ~~~~p~~~i~t~~ip~~f~~~~v~g-----~~~~~~v~~~~~~c~~~s~~ 978 (1634)
.. +. .-.|++-.|..|.. +....-.|+..-.|.++...
T Consensus 71 ~~---~~----~i~PCRslCe~vr~~C~~~m~~fg~~WP~~L~C~~fP~~ 113 (128)
T cd07447 71 EN---DK----VIKPCRSTCEALRKRCSHAFDAIQMAWPYFLDCDRFFAG 113 (128)
T ss_pred CC---CC----ccCCCHHHHHHHHHHHHHHHHHhCCCCCCcCccccCCCc
Confidence 21 11 12256666666211 11112247777777766544
No 26
>cd07465 CRD_FZ1 Cysteine-rich Wnt-binding domain (CRD) of the frizzled 1 (Fz1) receptor. The cysteine-rich domain (CRD) is an essential extracellular portion of the frizzled 1 (Fz1) receptor, and is required for binding Wnt proteins, which play fundamental roles in many aspects of early development, such as cell and tissue polarity, neural synapse formation, and the regulation of proliferation. Fz proteins serve as Wnt receptors for multiple signal transduction pathways, including both beta-catenin dependent and -independent cellular signaling, as well as the planar cell polarity pathway and Ca(2+) modulating signaling pathway. CRD containing Fzs have been found in diverse species from amoebas to mammals. 10 different frizzled proteins are found in vertebrata.
Probab=98.44 E-value=9.9e-08 Score=98.91 Aligned_cols=65 Identities=11% Similarity=0.049 Sum_probs=54.2
Q ss_pred cccccccchheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhcccccc
Q psy11546 853 TSEADSIEKKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHLKH 932 (1634)
Q Consensus 853 t~~c~~i~~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~p~ 932 (1634)
.+.||+|+.++|..|+||.|.|||+++|.+|++ |....+.+.. ++. ..|+++|..|+|++|+|.
T Consensus 2 ~~~Cepi~~~~C~~l~Yn~T~~PN~lgh~~q~e-----a~~~~~~f~p-------Lv~----~~Csp~l~~FLCSl~~P~ 65 (127)
T cd07465 2 HGYCQPISIPLCTDIAYNQTIMPNLLGHTNQED-----AGLEVHQFYP-------LVK----VQCSAELKFFLCSMYAPV 65 (127)
T ss_pred CCeeeeCCHHHhCCCCCCcccCCcccCCCCHHH-----HHHHHHHHHH-------HHh----CCCChhhHhHhhhccCCc
Confidence 468999999999999999999999999999999 6544444433 333 489999999999999995
Q ss_pred c
Q psy11546 933 T 933 (1634)
Q Consensus 933 ~ 933 (1634)
+
T Consensus 66 C 66 (127)
T cd07465 66 C 66 (127)
T ss_pred c
Confidence 3
No 27
>PF00530 SRCR: Scavenger receptor cysteine-rich domain; InterPro: IPR001190 The egg peptide speract receptor is a transmembrane glycoprotein []. Other members of this family include the macrophage scavenger receptor type I (a membrane glycoprotein implicated in the pathologic deposition of cholesterol in arterial walls during artherogenesis), an enteropeptidase and T-cell surface glycoprotein CD5 (may act as a receptor in regulating T-cell proliferation).; GO: 0005044 scavenger receptor activity, 0016020 membrane; PDB: 2XRC_D 2JP0_A 2JOP_A 2OYA_A 2OY3_A 1BY2_A 2OTT_Y 2JA4_A.
Probab=98.43 E-value=2.6e-07 Score=90.15 Aligned_cols=90 Identities=13% Similarity=0.297 Sum_probs=68.6
Q ss_pred CcceEEEEe-cCcccccccCcchhhhhhhchhhhcchhHHhhccCCCCCccceeeeccC--CCCCcceEEEeccccccch
Q psy11546 1157 DSGFLMIQK-QGQWGKLCMNQINNFIMKTLKWKISDLGKAICKSMTFRDLNEIEAVADP--STDDSIYYELSMNAINSSA 1233 (1634)
Q Consensus 1157 ~eG~VeV~~-~GtWg~VCsd~~~~~~~~~~~~~isdlA~vICRqLG~~~~~sve~v~~~--~~~~~~y~~vscn~~e~s~ 1233 (1634)
++|+|+++. +|.|+++|.++|+.. .|.++||+|||+.+......... ...+..+..+.|.+.+..+
T Consensus 7 ~~G~vev~~~~~~W~~vC~~~w~~~-----------~a~vvCrqLG~~~~~~~~~~~~~~~~~~~~~~~~~~C~g~e~~l 75 (99)
T PF00530_consen 7 CEGRVEVCFYNGSWGTVCDDGWSMN-----------EANVVCRQLGYGGASSTSSSSSFGSGSGPIWLSNVNCSGNETSL 75 (99)
T ss_dssp TEEEEEEEEETTEEEEEBTTT-SHH-----------HHHHHHHHTTSSEEEEEEECTTTTSS-S-ECECCEEEBSSTSSG
T ss_pred eeEEEEEEEECCEEEeEECCCcchh-----------HHHHHhhhhhccccceeeeecccccccccccccCCEecCCCCcH
Confidence 899999965 999999999999876 78999999999766555433332 1334455688899988899
Q ss_pred hcccccccccccCCCCceeeeecc
Q psy11546 1234 ATKSALLFQKTKCSQKQVVAVNCS 1257 (1634)
Q Consensus 1234 s~c~~l~~~~~~C~s~~vV~V~C~ 1257 (1634)
..|....+....|.+...+.|.|+
T Consensus 76 ~~C~~~~~~~~~C~~~~~~~V~Cs 99 (99)
T PF00530_consen 76 SECTHSPWTSSSCSHSEDAGVICS 99 (99)
T ss_dssp GGSBSSECCCSS-SGGGBEEEEEE
T ss_pred hhCCCcCCCCCCCCCCccEEEEEC
Confidence 999887777788988788999884
No 28
>cd07463 CRD_FZ9 Cysteine-rich Wnt-binding domain (CRD) of the frizzled 9 (Fz9) receptor. The cysteine-rich domain (CRD) is an essential extracellular portion of the frizzled 9 (Fz9) receptor, and is required for binding Wnt proteins, which play fundamental roles in many aspects of early development, such as cell and tissue polarity, neural synapse formation, and the regulation of proliferation. Fz proteins serve as Wnt receptors for multiple signal transduction pathways, including both beta-catenin dependent and -independent cellular signaling, as well as the planar cell polarity pathway and Ca(2+) modulating signaling pathway. CRD containing Fzs have been found in diverse species from amoebas to mammals. 10 different frizzled proteins are found in vertebrata. Fz9 may play a signaling role in lymphoid development and maturation, particularly at points where B cells undergo self-renewal prior to further differentiation.
Probab=98.43 E-value=1.5e-07 Score=97.55 Aligned_cols=103 Identities=10% Similarity=0.059 Sum_probs=69.4
Q ss_pred ccccccchheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhccccccc
Q psy11546 854 SEADSIEKKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHLKHT 933 (1634)
Q Consensus 854 ~~c~~i~~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~p~~ 933 (1634)
+.||+|+.++|..|+||.|.|||+|+|.+|++ +....+.+.. ++. .+|+++|..|+|++|+|..
T Consensus 3 ~~CepI~~~~C~~l~Yn~T~~PN~lgh~sq~e-----a~~~~~~f~p-------Lv~----~~Csp~l~~FlCS~~~P~C 66 (127)
T cd07463 3 AKCQPVVIPMCRGIGYNLTRMPNFLGHDSQRE-----AAIKLNEFAP-------LVE----YGCHVHLRFFLCSLYAPMC 66 (127)
T ss_pred CccccCChhhhCCCCcCcccCCcccCCcCHHH-----HHHHHHHHHH-------HHh----cCCChhhHHHhhhcccccc
Confidence 57999999999999999999999999999988 5544333444 343 4899999999999999953
Q ss_pred ccCCCCCCCCCCCC-CCCceeeccCC-----ceeeeeeecccccCCCCCcC
Q psy11546 934 HQHNPDTNISTKAI-PSSFNFNQVNG-----IPILTKVYTKVSKTNSTSKE 978 (1634)
Q Consensus 934 ~~~~p~~~i~t~~i-p~~f~~~~v~g-----~~~~~~v~~~~~~c~~~s~~ 978 (1634)
... .+ +.| |++-.|+.|.. +....-.|+..-.|.+....
T Consensus 67 ~~~-----~~-~~i~PCRslCe~vr~~C~~~m~~fg~~WP~~l~C~~fP~~ 111 (127)
T cd07463 67 TDQ-----VS-TSIPACRPMCEQARQKCSPIMEQFNFGWPESLDCSRLPTR 111 (127)
T ss_pred CCC-----CC-CcCCccHHHHHHHHHHHHHHHHHhCCCCCCcCChhhCCCC
Confidence 211 11 122 56666666111 11112236666666655443
No 29
>cd07448 CRD_FZ4 Cysteine-rich Wnt-binding domain of the frizzled 4 (Fz4) receptor. The cysteine-rich domain (CRD) is an essential extracellular portion of the frizzled 4 (Fz4) receptor, and is required for binding Wnt proteins, which play fundamental roles in many aspects of early development, such as cell and tissue polarity, neural synapse formation, and the regulation of proliferation. Fz proteins serve as Wnt receptors for multiple signal transduction pathways, including both beta-catenin dependent and -independent cellular signaling, as well as the planar cell polarity pathway and the Ca(2+) modulating signaling pathway. CRD containing Fzs have been found in diverse species from amoebas to mammals. 10 different frizzled proteins are found in vertebrata. Frizzled 4 (Fz4) activates the Ca(2+)/calmodulin-dependent protein kinase II and protein kinase C of the Wnt/Ca(2+) signaling pathway during retinal angiogenesis. Mutations in Fz4 lead to familial exudative vitreoretinopathy (FEVR
Probab=98.41 E-value=1.5e-07 Score=97.49 Aligned_cols=81 Identities=11% Similarity=0.096 Sum_probs=60.4
Q ss_pred ccccccchheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhccccccc
Q psy11546 854 SEADSIEKKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHLKHT 933 (1634)
Q Consensus 854 ~~c~~i~~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~p~~ 933 (1634)
..|++|++++|..|+||.|.|||+++|.+|++ +... ++.|..++. +.|+++|..|+|++|+|..
T Consensus 2 ~~Cepi~~~~C~~l~Yn~T~~PN~lgh~sq~e-----a~~~-------l~~~~pLv~----~~Csp~l~~flCsl~~P~C 65 (126)
T cd07448 2 RRCEPIRIEMCQGLGYNVTRMPNLVGHELQTD-----AELQ-------LQTFTPLIQ----YGCSSQLKFFLCSVYVPMC 65 (126)
T ss_pred CeeeeCCHHHhCCCCcccccCCCcCCCcCHHH-----HHHH-------HHHHHHHhh----cCCChhHHhHhhhcccccc
Confidence 46999999999999999999999999999999 5543 333444444 4899999999999999953
Q ss_pred ccCCCCCCCCCCCCCCCceeec
Q psy11546 934 HQHNPDTNISTKAIPSSFNFNQ 955 (1634)
Q Consensus 934 ~~~~p~~~i~t~~ip~~f~~~~ 955 (1634)
..- .+..-.|++-.|..
T Consensus 66 ~~~-----~~~~i~PCRslCe~ 82 (126)
T cd07448 66 TEK-----VPVPIGPCRPLCLS 82 (126)
T ss_pred CCC-----CCCcCCcCHHHHHH
Confidence 211 11122366666666
No 30
>PF00057 Ldl_recept_a: Low-density lipoprotein receptor domain class A This prints entry is specific to LDL receptor; InterPro: IPR002172 The low-density lipoprotein receptor (LDLR) is the major cholesterol-carrying lipoprotein of plasma, acting to regulate cholesterol homeostasis in mammalian cells. The LDL receptor binds LDL and transports it into cells by acidic endocytosis. In order to be internalized, the receptor-ligand complex must first cluster into clathrin-coated pits. Once inside the cell, the LDLR separates from its ligand, which is degraded in the lysosomes, while the receptor returns to the cell surface []. The internal dissociation of the LDLR with its ligand is mediated by proton pumps within the walls of the endosome that lower the pH. The LDLR is a multi-domain protein, containing: The ligand-binding domain contains seven or eight 40-amino acid LDLR class A (cysteine-rich) repeats, each of which contains a coordinated calcium ion and six cysteine residues involved in disulphide bond formation []. Similar domains have been found in other extracellular and membrane proteins []. The second conserved region contains two EGF repeats, followed by six LDLR class B (YWTD) repeats, and another EGF repeat. The LDLR class B repeats each contain a conserved YWTD motif, and is predicted to form a beta-propeller structure []. This region is critical for ligand release and recycling of the receptor []. The third domain is rich in serine and threonine residues and contains clustered O-linked carbohydrate chains. The fourth domain is the hydrophobic transmembrane region. The fifth domain is the cytoplasmic tail that directs the receptor to clathrin-coated pits. LDLR is closely related in structure to several other receptors, including LRP1, LRP1b, megalin/LRP2, VLDL receptor, lipoprotein receptor, MEGF7/LRP4, and LRP8/apolipoprotein E receptor2); these proteins participate in a wide range of physiological processes, including the regulation of lipid metabolism, protection against atherosclerosis, neurodevelopment, and transport of nutrients and vitamins []. This entry represents the LDLR class A (cyateine-rich) repeat, which contains 6 disulphide-bound cysteines and a highly conserved cluster of negatively charged amino acids, of which many are clustered on one face of the module []. In LDL receptors, the class A domains form the binding site for LDL and calcium. The acidic residues between the fourth and sixth cysteines are important for high-affinity binding of positively charged sequences in LDLR's ligands. The repeat consists of a beta-hairpin structure followed by a series of beta turns. In the absence of calcium, LDL-A domains are unstructured; the bound calcium ion imparts structural integrity. Following these repeats is a 350 residue domain that resembles part of the epidermal growth factor (EGF) precursor. Numerous familial hypercholestorolemia mutations of the LDL receptor alter the calcium coordinating residue of LDL-A domains or other crucial scaffolding residues. ; GO: 0005515 protein binding; PDB: 2I1P_A 3OJY_A 4E0S_B 3T5O_A 4A5W_B 1JRF_A 1K7B_A 1V9U_5 3DPR_E 2KNY_A ....
Probab=98.41 E-value=1.9e-07 Score=77.12 Aligned_cols=36 Identities=50% Similarity=1.268 Sum_probs=33.6
Q ss_pred CCCCCceecCCCCeeeccccccCCCccCCCCCCcccc
Q psy11546 1061 WCSPGQYICPNSRVCIERTRLCDGIKDCPLGDDEKQC 1097 (1634)
Q Consensus 1061 ~C~~~~F~C~~~~~CI~~~~~CDG~~DC~DGSde~~c 1097 (1634)
.|++++|.|.++. ||+..++|||+.||.|||||.+|
T Consensus 2 ~C~~~~f~C~~~~-CI~~~~~CDg~~DC~dgsDE~~C 37 (37)
T PF00057_consen 2 TCPPGEFRCGNGQ-CIPKSWVCDGIPDCPDGSDEQNC 37 (37)
T ss_dssp SSSTTEEEETTSS-EEEGGGTTSSSCSSSSSTTTSSH
T ss_pred cCcCCeeEcCCCC-EEChHHcCCCCCCCCCCcccccC
Confidence 5889999998776 99999999999999999999887
No 31
>cd07461 CRD_FZ8 Cysteine-rich Wnt-binding domain (CRD) of the frizzled 8 (Fz8) receptor. The cysteine-rich domain (CRD) is an essential extracellular portion of the frizzled 8 (Fz8) receptor, and is required for binding Wnt proteins, which play fundamental roles in many aspects of early development, such as cell and tissue polarity, neural synapse formation, and the regulation of proliferation. Fz proteins serve as Wnt receptors for multiple signal transduction pathways, including both beta-catenin dependent and -independent cellular signaling, as well as the planar cell polarity pathway and Ca(2+) modulating signaling pathway. CRD containing Fzs have been found in diverse species from amoebas to mammals. 10 different frizzled proteins are found in vertebrata. Xenopus Fz8 is important in Wnt/beta-catenin signaling pathways controlling the transcriptional activation of target genes Siamois and Xnr3 in the animal caps of late blastula.
Probab=98.41 E-value=1.8e-07 Score=96.85 Aligned_cols=104 Identities=8% Similarity=0.021 Sum_probs=69.2
Q ss_pred ccccccchheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhccccccc
Q psy11546 854 SEADSIEKKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHLKHT 933 (1634)
Q Consensus 854 ~~c~~i~~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~p~~ 933 (1634)
..|++|++++|..|+||.|.|||+++|.+|++ +......|..+ +. +.|+++|..|+|++|+|..
T Consensus 3 ~~C~pI~~~~C~~l~Yn~T~~PN~l~H~sq~e-----a~~~~~~f~pL-------v~----~~Csp~l~~FlCSl~~P~C 66 (125)
T cd07461 3 LQCQEITVPLCKGIGYNYTYMPNQFNHDTQDE-----AGLEVHQFWPL-------VE----IQCSPDLKFFLCSMYTPIC 66 (125)
T ss_pred CcccccCHHHhCCCCCCcccCCCccCCCCHHH-----HHHHHHHHHHH-------Hh----CCCChhHHHHhhhccCccc
Confidence 46999999999999999999999999999988 55544444443 33 4899999999999999943
Q ss_pred ccCCCCCCCCCCCCCCCceeeccCC-----ceeeeeeecccccCCCCCcC
Q psy11546 934 HQHNPDTNISTKAIPSSFNFNQVNG-----IPILTKVYTKVSKTNSTSKE 978 (1634)
Q Consensus 934 ~~~~p~~~i~t~~ip~~f~~~~v~g-----~~~~~~v~~~~~~c~~~s~~ 978 (1634)
..- ....-.|++-.|+.|.. +....-.|+..-.|.+....
T Consensus 67 ~~~-----~~~~i~PCRslCe~vr~~C~~~m~~~g~~WP~~l~C~~fP~~ 111 (125)
T cd07461 67 LED-----YKKPLPPCRSVCERAKAGCAPLMRQYGFPWPDRMRCDLLPEQ 111 (125)
T ss_pred CCC-----CCCcCCccHHHHHHHHHHHHHHHHHhCCCCCCcCChhhCCCC
Confidence 210 01112356666666110 11111136666666655443
No 32
>cd07449 CRD_FZ3 Cysteine-rich Wnt-binding domain (CRD) of the frizzled 3 (Fz3) receptor. The cysteine-rich domain (CRD) is an essential extracellular portion of the frizzled 3 (Fz3) receptor, and is required for binding Wnt proteins, which play fundamental roles in many aspects of early development, such as cell and tissue polarity, neural synapse formation, and the regulation of proliferation. Fz proteins serve as Wnt receptors for multiple signal transduction pathways, including both beta-catenin dependent and -independent cellular signaling, as well as the planar cell polarity pathway and Ca(2+) modulating signaling pathway. CRD containing Fzs have been found in diverse species from amoebas to mammals. 10 different frizzled proteins are found in vertebrata. Fz3 plays a vital role in the anterior-posterior guidance of commissural axons. Knockout mice without Fz3 show defects in fiber tracts in the rostral CNS.
Probab=98.40 E-value=8.5e-08 Score=99.18 Aligned_cols=102 Identities=8% Similarity=0.103 Sum_probs=70.6
Q ss_pred cccccchheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhcccccccc
Q psy11546 855 EADSIEKKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHLKHTH 934 (1634)
Q Consensus 855 ~c~~i~~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~p~~~ 934 (1634)
.|++|++++|..|+||.|.|||+++|.+|++ |++. ++.|.-++. +.|+++|..|+|++|+|...
T Consensus 4 ~CepI~~~~C~~l~Yn~T~~PN~lgH~~q~e-----a~~~-------~~~f~pLv~----~~Csp~l~~FLCSly~P~C~ 67 (127)
T cd07449 4 SCEPITLRMCQDLPYNTTFMPNLLNHYDQQT-----AALA-------MEPFHPMVN----LECSRDFRPFLCALYAPVCM 67 (127)
T ss_pred eeeeCChhhhCCCCCCcccCCcccCCcCHHH-----HHHH-------HHhhccccc----CCcChhhHhHhheeeccccC
Confidence 5999999999999999999999999999998 5543 344555555 58999999999999999532
Q ss_pred cCCCCCCCCCCCCCCCceeeccCC-----ceeeeeeecccccCCCCCcC
Q psy11546 935 QHNPDTNISTKAIPSSFNFNQVNG-----IPILTKVYTKVSKTNSTSKE 978 (1634)
Q Consensus 935 ~~~p~~~i~t~~ip~~f~~~~v~g-----~~~~~~v~~~~~~c~~~s~~ 978 (1634)
.. + ..-.|++-.|+.|.. +....-.|+..-.|.+..+.
T Consensus 68 ~~--~----~~i~PCRslCe~vr~~C~~~m~~fg~~WP~~L~C~~fP~~ 110 (127)
T cd07449 68 EY--G----RVTLPCRRLCQRAYSECSKLMEMFGVPWPEDMECSRFPDC 110 (127)
T ss_pred CC--C----CCCCCCHHHHHHHHHHHHHHHHHhCCCCCCcCCcccCCCC
Confidence 10 0 112366666666211 11112247777777766543
No 33
>cd07457 CRD_FZ9_like Cysteine-rich Wnt-binding domain (CRD) of receptors similar to frizzled 9. The cysteine-rich domain (CRD) is an essential extracellular portion of the frizzled 9 (Fz9) and frizzled 10 (Fz10) receptors, and similar proteins. This domain is required for binding Wnt proteins, which play fundamental roles in many aspects of early development, such as cell and tissue polarity, neural synapse formation, and the regulation of proliferation. Fz proteins serve as Wnt receptors for multiple signal transduction pathways, including both beta-catenin dependent and -independent cellular signaling, as well as the planar cell polarity pathway and Ca(2+) modulating signaling pathway. The CRD domain is well conserved in metazoans - 10 frizzled proteins have been identified in mammals, 4 in Drosophila and 3 in Caenorhabditis elegans. Very little is known about the mechanism by which CRD domains interact with their ligands. The domain contains 10 conserved cysteines.
Probab=98.39 E-value=2e-07 Score=96.05 Aligned_cols=103 Identities=8% Similarity=0.085 Sum_probs=68.2
Q ss_pred ccccccchheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhccccccc
Q psy11546 854 SEADSIEKKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHLKHT 933 (1634)
Q Consensus 854 ~~c~~i~~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~p~~ 933 (1634)
+.||+|++++|..|+||.|.|||+++|.+|++ +...-. .|..++. ++|+++|..|+|++|+|..
T Consensus 1 ~~CepI~~~~C~~l~Yn~T~~PN~lgh~~q~e-----a~~~~~-------~~~pLv~----~~Csp~l~~FlCsl~~P~C 64 (121)
T cd07457 1 GKCERITIPMCQGIGYNMTRMPNLLGHESQSE-----AAISIH-------EFAPLVQ----YGCAEHLRFFLCSLYAPMC 64 (121)
T ss_pred CcCccCChHhhCCCCCCceeCCCcCCCcCHHH-----HHHHHH-------HHHHHHh----cCCChhHHHHhhhccCccc
Confidence 46999999999999999999999999999988 544333 3333443 3899999999999999953
Q ss_pred ccCCCCCCCCCCCCCCCceeeccCC-----ceeeeeeecccccCCCCCc
Q psy11546 934 HQHNPDTNISTKAIPSSFNFNQVNG-----IPILTKVYTKVSKTNSTSK 977 (1634)
Q Consensus 934 ~~~~p~~~i~t~~ip~~f~~~~v~g-----~~~~~~v~~~~~~c~~~s~ 977 (1634)
... .+..-.|++-.|+.|.. +....-.|+....|.+...
T Consensus 65 ~~~-----~~~~i~PCRslCe~vr~~C~~~m~~~g~~WP~~l~C~~fP~ 108 (121)
T cd07457 65 TEQ-----VSIPIPACRSMCEQARDKCSPIMEQFSFSWPDSLDCDRLPR 108 (121)
T ss_pred CCC-----CCCcCCccHHHHHHHHHHHHHHHHHhCCCCCCcCCcccCCC
Confidence 211 11112256666666211 1111223666666665543
No 34
>cd07458 CRD_FZ1_like Cysteine-rich Wnt-binding domain (CRD) of receptors similar to frizzled 1. The cysteine-rich domain (CRD) is an essential extracellular portion of the frizzled 1 (Fz1), frizzled 2 (Fz2), and frizzled 7 (Fz7) receptors, and similar proteins. This domain is required for binding Wnt proteins, which play fundamental roles in many aspects of early development, such as cell and tissue polarity, neural synapse formation, and the regulation of proliferation. Fz proteins serve as Wnt receptors for multiple signal transduction pathways, including both beta-catenin dependent and -independent cellular signaling, as well as the planar cell polarity pathway and Ca(2+) modulating signaling pathway. The CRD domain is well conserved in metazoans - 10 frizzled proteins have been identified in mammals, 4 in Drosophila and 3 in Caenorhabditis elegans. Very little is known about the mechanism by which CRD domains interact with their ligands. The domain contains 10 conserved cysteines.
Probab=98.38 E-value=1.9e-07 Score=95.96 Aligned_cols=64 Identities=9% Similarity=0.047 Sum_probs=52.6
Q ss_pred ccccccchheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhccccccc
Q psy11546 854 SEADSIEKKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHLKHT 933 (1634)
Q Consensus 854 ~~c~~i~~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~p~~ 933 (1634)
+.||+|++++|..|+||.|.|||+++|.+|++ +...-..+.. ++. ..|+++|..|+|++|+|..
T Consensus 1 ~~C~pi~~~~C~~l~Yn~T~~PN~lgh~~q~e-----a~~~l~~~~p-------Lv~----~~C~p~l~~FlCsl~~P~C 64 (119)
T cd07458 1 GKCEPITIPLCTDIPYNMTIFPNLLGHTKQED-----AGLEVHQFYP-------LVK----VQCSPDLKFFLCSVYAPVC 64 (119)
T ss_pred CccccCChHHhCCCCCCccCCCCccCCCCHHH-----HHHHHHHHHH-------HHh----CCCChhHHHHhhhcccccc
Confidence 46999999999999999999999999999998 5443333333 333 4899999999999999953
No 35
>cd00112 LDLa Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about 40 amino acids are present in the N-terminal of this multidomain membrane protein; other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement; the binding of calcium is required for in vitro formation of the native disulfide isomer and is necessary in establishing and maintaining the modular structure
Probab=98.37 E-value=1.8e-07 Score=76.31 Aligned_cols=35 Identities=43% Similarity=1.079 Sum_probs=32.4
Q ss_pred CCCCceecCCCCeeeccccccCCCccCCCCCCcccc
Q psy11546 1062 CSPGQYICPNSRVCIERTRLCDGIKDCPLGDDEKQC 1097 (1634)
Q Consensus 1062 C~~~~F~C~~~~~CI~~~~~CDG~~DC~DGSde~~c 1097 (1634)
|++++|+|.+ +.||+..++|||+.||+|||||..|
T Consensus 1 C~~~~f~C~~-~~Ci~~~~~CDg~~DC~dgsDE~~C 35 (35)
T cd00112 1 CPPNEFRCAN-GRCIPSSWVCDGEDDCGDGSDEENC 35 (35)
T ss_pred CCCCeEEcCC-CCeeCHHHcCCCccCCCCCcccccC
Confidence 6789999987 5699999999999999999999977
No 36
>smart00063 FRI Frizzled. Drosophila melanogaster frizzled mediates signalling that polarises a precursor cell along the anteroposterior axis. Homologues of the N-terminal region of frizzled exist either as transmembrane or secreted molecules. Frizzled homologues are reported to be receptors for the Wnt growth factors. (Not yet in MEDLINE: the FRI domain occurs in several receptor tyrosine kinases [Xu, Y.K. and Nusse, Curr. Biol. 8 R405-R406 (1998); Masiakowski, P. and Yanopoulos, G.D., Curr. Biol. 8, R407 (1998)].
Probab=98.33 E-value=3e-07 Score=93.72 Aligned_cols=78 Identities=13% Similarity=0.141 Sum_probs=59.6
Q ss_pred ccccchheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhccccccccc
Q psy11546 856 ADSIEKKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHLKHTHQ 935 (1634)
Q Consensus 856 c~~i~~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~p~~~~ 935 (1634)
|++|++++|..|+||.|.|||+++|.+|++ +... ++.|..++. +.|+++|..|+|++|+|....
T Consensus 1 C~pi~~~~C~~l~Yn~T~~PN~l~h~~q~e-----a~~~-------l~~~~~L~~----~~C~p~l~~flCsl~~P~C~~ 64 (113)
T smart00063 1 CEPITIPLCKDLGYNLTSMPNLLGHTTQEE-----AGLE-------LEQFHPLLN----VQCSPDLRFFLCSVYAPICTE 64 (113)
T ss_pred CcCCCcHhHCCCCcccccCCCccCCCCHHH-----HHHH-------HHHHHhhcc----CCCChhHHHHhHhccCcccCC
Confidence 899999999999999999999999999988 4432 455555665 479999999999999995321
Q ss_pred CCCCCCCCCCCCCCCceeec
Q psy11546 936 HNPDTNISTKAIPSSFNFNQ 955 (1634)
Q Consensus 936 ~~p~~~i~t~~ip~~f~~~~ 955 (1634)
. +. ...|++-.|..
T Consensus 65 ~---~~---~~~PCRs~Ce~ 78 (113)
T smart00063 65 D---LR---PILPCRSLCEA 78 (113)
T ss_pred C---CC---cCCcCHHHHHH
Confidence 1 10 23466666666
No 37
>cd07452 CRD_sizzled Cysteine-rich domain of the sizzled protein. The cysteine-rich domain (CRD) is an essential part of the sizzled protein, which regulates bone morphogenetic protein (Bmp) signaling by stabilizing chordin, and plays a critical role in the patterning of vertebrate and invertebrate embryos. Sizzled also functions in the ventral region as a Wnt inhibitor and modulates canonical Wnt signaling. Sizzled proteins belong to the secreted frizzled-related protein family (SFRP), and have be identified in the genomes of birds, fishes and frogs, but not mammals.
Probab=98.30 E-value=3.4e-07 Score=96.25 Aligned_cols=81 Identities=6% Similarity=-0.059 Sum_probs=61.1
Q ss_pred cCcccccccc--hheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhcc
Q psy11546 851 NNTSEADSIE--KKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQ 928 (1634)
Q Consensus 851 ~~t~~c~~i~--~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~ 928 (1634)
-.++.|+.|. +++|..|+||.|.|||+++|.+|++ +......+.. ++. ..|+++|..|+|++
T Consensus 4 ~~s~~C~pIp~~l~~C~~l~Yn~T~~PN~lgH~sq~e-----a~~~~~~f~p-------Lv~----~~C~p~l~~FlCSl 67 (141)
T cd07452 4 GLSTKCVPIPPEMSMCQDVGYSEMRLPNLLGHTSMAE-----VVPKSADWQT-------LLH----TGCHPHARTFLCSL 67 (141)
T ss_pred CCCCcceeCCCcchhhcCCCCCcccCCcccCCCCHHH-----HHHHHHHHHH-------HHh----cCCCHHHHHHhHhh
Confidence 3577899996 9999999999999999999999998 5443333333 333 48999999999999
Q ss_pred cccccccCCCCCCCCCCCCCCCceeec
Q psy11546 929 HLKHTHQHNPDTNISTKAIPSSFNFNQ 955 (1634)
Q Consensus 929 ~~p~~~~~~p~~~i~t~~ip~~f~~~~ 955 (1634)
|.|.... +.+ .|++-.|..
T Consensus 68 ~~P~C~~----~~i----~PCRslCe~ 86 (141)
T cd07452 68 FAPVCLD----TFI----QPCRSMCVA 86 (141)
T ss_pred cccCCCC----CCC----cCCHHHHHH
Confidence 9995432 111 366666666
No 38
>smart00192 LDLa Low-density lipoprotein receptor domain class A. Cysteine-rich repeat in the low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. The N-terminal type A repeats in LDL receptor bind the lipoproteins. Other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement. Mutations in the LDL receptor gene cause familial hypercholesterolemia.
Probab=98.30 E-value=4.1e-07 Score=73.28 Aligned_cols=32 Identities=44% Similarity=1.098 Sum_probs=29.9
Q ss_pred CCCCCceecCCCceecCCcccCCCCCCCCCCCC
Q psy11546 993 CEMNSSFRCGNGECVSIGSKCNQLVDCADGSDE 1025 (1634)
Q Consensus 993 C~~~~~F~C~nG~CIp~~~~CDG~~DC~DgSDE 1025 (1634)
|. ..+|+|.+|.||+..++|||++||+|||||
T Consensus 2 C~-~~~f~C~~~~Ci~~~~~Cdg~~dC~dgsDE 33 (33)
T smart00192 2 CP-PGEFQCDNGRCIPLSWVCDGVDDCSDGSDE 33 (33)
T ss_pred CC-CCeEECCCCCEECchhhCCCcCcCcCCCCC
Confidence 55 469999999999999999999999999998
No 39
>cd07444 CRD_SFRP5 Cysteine-rich domain of the secreted frizzled-related protein 5 (SFRP5), a regulator of Wnt activity. The cysteine-rich domain (CRD) is an essential part of the secreted frizzled-related Protein 5 (SFRP5), which regulates the activity of Wnt proteins, key players in a number of fundamental cellular processes such as embryogenesis and postnatal development. SFRPs antagonize the activation of Wnt signaling by binding to the CRD domains of frizzled (Fz) proteins, thereby preventing Wnt proteins from binding to these receptors. SFRPs are also known to have functions unrelated to Wnt, as enhancers of procollagen cleavage by the TLD proteinases. SFRPs and Fz proteins both contain CRD domains, but SFRPs lack the seven-pass transmembrane domain which is an integral part of Fzs.
Probab=98.29 E-value=2.7e-07 Score=95.59 Aligned_cols=101 Identities=5% Similarity=-0.058 Sum_probs=69.9
Q ss_pred cccccccc--hheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhcccc
Q psy11546 853 TSEADSIE--KKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHL 930 (1634)
Q Consensus 853 t~~c~~i~--~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~ 930 (1634)
++.|+.|. +++|..|+||.|.|||+++|.+|++ +......+..+ +. ..|+++|..|+|++|.
T Consensus 4 ~~~C~pIp~~l~~C~~l~Yn~T~~PN~lgH~~q~e-----a~~~~~~~~pL------v~-----~~C~p~l~~FlCSl~~ 67 (127)
T cd07444 4 QPQCVDIPADLPLCHNVGYKRMRLPNLLEHESMAE-----VKQQASSWVPL------LA-----KRCHADTQVFLCSLFA 67 (127)
T ss_pred CCCceECCCCchhhcCCCCCceeCCcccCCCCHHH-----HHHHHHHHHHH------Hh-----cccCHHHHHHhhhhcC
Confidence 57899997 9999999999999999999999999 66554444443 23 3899999999999999
Q ss_pred cccccCCCCCCCCCCCCCCCceeeccCCc-----eeeeeeecccccCCCCCc
Q psy11546 931 KHTHQHNPDTNISTKAIPSSFNFNQVNGI-----PILTKVYTKVSKTNSTSK 977 (1634)
Q Consensus 931 p~~~~~~p~~~i~t~~ip~~f~~~~v~g~-----~~~~~v~~~~~~c~~~s~ 977 (1634)
|.... ..-.|++-.|+.|..- ....-.|+..-+|.++..
T Consensus 68 P~C~~--------~~i~PCRslCe~vr~~C~~~m~~~g~~WP~~l~C~~fP~ 111 (127)
T cd07444 68 PVCLD--------RPIYPCRSLCEAVRDSCAPVMESYGFPWPEMLHCHKFPL 111 (127)
T ss_pred CCCCC--------CCCCCcHHHHHHHHHHHHHHHHHcCCCCCCCCChhhCCC
Confidence 95421 1123666667662110 001113666666665543
No 40
>cd07446 CRD_SFRP2 Cysteine-rich domain of the secreted frizzled-related protein 2 (SFRP2), a regulator of Wnt activity. The cysteine-rich-domain (CRD) is an essential part of the secreted frizzled related protein 2 (SFRP2), which regulates the activity of Wnt proteins, key players in a number of fundamental cellular processes such as embryogenesis and postnatal development. SFRPs antagonize the activation of Wnt signaling by binding to CRD domains of frizzled (Fz) proteins, thereby preventing Wnt proteins from binding to these receptors. SFRPs and Fz proteins both contain CRD domains, but SFRPs lack the seven-pass transmembrane domain which is an integral part of Fzs. As a Wnt antagonist, SFRP2 regulates Nkx2.2 expression in the ventral spinal cord and anteroposterior axis elongation. SFRP2 also has a Wnt-independent function as an enhancer of procollagen cleavage by the TLD proteinases. SFRP2 binds both procollagen and TLD, thus facilitating the enzymatic reaction by bringing togeth
Probab=98.23 E-value=8.7e-07 Score=92.12 Aligned_cols=81 Identities=6% Similarity=-0.048 Sum_probs=59.8
Q ss_pred ccccccc--hheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhccccc
Q psy11546 854 SEADSIE--KKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHLK 931 (1634)
Q Consensus 854 ~~c~~i~--~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~p 931 (1634)
+.|+.|. +++|..|+||.|.|||+++|.+|++ |......+..+ +. ..|+++|..|+|++|+|
T Consensus 3 s~C~pI~~~l~~C~~l~Yn~T~~PN~lgH~sq~e-----a~~~~~~~~pL-------v~----~~C~p~l~~FlCSl~~P 66 (128)
T cd07446 3 SNCKPIPANMLLCHGIEYTNMRLPNLLGHETMKE-----VLQQAGSWIPL-------VQ----KQCHPDTKKFLCSLFAP 66 (128)
T ss_pred CCceECCCCchhhcCCCCCcccCCcccCCCCHHH-----HHHHHHHHHHH-------Hh----cCCChhhHHHhhhccCc
Confidence 6799998 8899999999999999999999999 65543334333 22 49999999999999999
Q ss_pred ccccCCCCCCCCCCCCCCCceeec
Q psy11546 932 HTHQHNPDTNISTKAIPSSFNFNQ 955 (1634)
Q Consensus 932 ~~~~~~p~~~i~t~~ip~~f~~~~ 955 (1634)
...... ...-.|++-.|..
T Consensus 67 ~C~~~~-----~~~i~PCRslCe~ 85 (128)
T cd07446 67 VCLDDL-----DEAIQPCRSLCEA 85 (128)
T ss_pred cccCCC-----CCcCCccHHHHHH
Confidence 532211 1122356666666
No 41
>cd07443 CRD_SFRP1 Cysteine-rich domain of the secreted frizzled-related protein 1 (SFRP1), a regulator of Wnt activity. The cysteine-rich domain (CRD) is an essential part of the secreted frizzled-related protein 1 (SFRP1), which regulates the activity of Wnt proteins, key players in a number of fundamental cellular processes such as embryogenesis and postnatal development. SFRPs antagonize the activation of Wnt signaling by binding to the CRDs domains of frizzled (Fz) proteins, thereby preventing Wnt proteins from binding to these receptors. SFRPs are also known to have functions unrelated to Wnt, as enhancers of procollagen cleavage by the TLD proteinases. SFRPs and Fz proteins both contain CRD domains, but SFRPs lack the seven-pass transmembrane domain which is an integral part of Fzs. SFRP1 is expressed in many tissues and is involved in the regulation of Wnt signaling in osteoblasts, leading to enhanced trabecular bone formation in adults; it has also been shown to control the gro
Probab=98.22 E-value=5.5e-07 Score=93.05 Aligned_cols=79 Identities=6% Similarity=-0.015 Sum_probs=59.6
Q ss_pred cccccccc--hheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhcccc
Q psy11546 853 TSEADSIE--KKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHL 930 (1634)
Q Consensus 853 t~~c~~i~--~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~ 930 (1634)
...|+.|. +++|..|+||.|.|||+++|.+|++ +......+..+ +. ..|+++|..|+|++|.
T Consensus 4 ~~~C~pIp~~l~~C~~l~Yn~T~~PN~lgH~sq~e-----a~~~~~~~~pL------v~-----~~C~p~l~~FlCsl~~ 67 (124)
T cd07443 4 PPQCVDIPADLRLCHNVGYKKMVLPNLLDHETMAE-----VKQQASSWVPL------LN-----KNCHKGTQVFLCSLFA 67 (124)
T ss_pred CCCceECCCchhhhcCCCCCceeCccccCCCCHHH-----HHHHHHHHHHH------Hh-----ccccHHHHHHhhheec
Confidence 46799997 8999999999999999999999999 55443333333 22 3899999999999999
Q ss_pred cccccCCCCCCCCCCCCCCCceeec
Q psy11546 931 KHTHQHNPDTNISTKAIPSSFNFNQ 955 (1634)
Q Consensus 931 p~~~~~~p~~~i~t~~ip~~f~~~~ 955 (1634)
|.... + .-.|++-.|+.
T Consensus 68 P~C~~----~----~i~PCRslCe~ 84 (124)
T cd07443 68 PVCLD----R----PVYPCRWLCEA 84 (124)
T ss_pred cCCCC----C----CCCCCHHHHHH
Confidence 95421 1 12366666666
No 42
>KOG1215|consensus
Probab=98.20 E-value=1.2e-06 Score=116.48 Aligned_cols=92 Identities=38% Similarity=0.805 Sum_probs=75.9
Q ss_pred CCCcCC--CCcccCCCcccCCcccCCCCCCCCCcc---------------------------------------------
Q psy11546 1541 NGHRCP--LGECLPKARVCNGYMECSDGKCEMNSS--------------------------------------------- 1573 (1634)
Q Consensus 1541 ~~f~C~--~G~CIp~~~vCDG~~DC~DGsDE~~~~--------------------------------------------- 1573 (1634)
..|.|. .++|+|..|.|||..||.||+||..+.
T Consensus 141 ~~~~c~~~~~~Cip~~~~cd~~~~C~dg~de~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~d~~~~~~~~~~~~~~ 220 (877)
T KOG1215|consen 141 DKFSCRTGSCKCIPGDWLCDGEADCPDGSDELNCAVRRCEPRGASLDCIVAIKVCDIQHDCADDYDESEGRIYWTDDSRI 220 (877)
T ss_pred CCCCCcCccccCCCCceeCCCCCccccchhhhcccccccCccccccccceeeeecCcccccccccccccCcccccCCcce
Confidence 478888 789999999999999999999998752
Q ss_pred --cccCC-CcEecCCCccCCCCCCCCCCCCC--CC----------ccCCcCCCCccCCcccCCCCCCCCCCCCCCC
Q psy11546 1574 --FRCGN-GECVSIGSKCNQLVDCADGSDEK--NC----------SCADFLKSQFLTRKICDGIIDCWDFSDEYEC 1634 (1634)
Q Consensus 1574 --F~C~~-G~CI~~~~~CDG~~DC~DgSDE~--~C----------~C~~~~~~c~~~~~~CDG~~DC~DgSDE~~C 1634 (1634)
++|.. +.||...+.|||..||.+++||. +| .|. ...|++..+.|||..||.||+||.+|
T Consensus 221 ~~~~c~g~~~~i~~~~~~Dg~~dc~~~~de~~~~~~~~~~~~~e~~~~--~~~~~~~~~~~~g~~d~pdg~de~~~ 294 (877)
T KOG1215|consen 221 EVTRCDGSSRCILISEVCDGPRDCVDGPDEGVMNCSDATCEAPEIECA--DGDCSDRQKLCDGDLDCPDGLDEDYC 294 (877)
T ss_pred eEEEecCCCcEEeehhccCCCcccccCCcCceeEeeccccCCcceeec--CCCCccceEEecCccCCCCccccccc
Confidence 34444 48999999999999999999995 22 342 34577778889999999999999876
No 43
>cd07455 CRD_Collagen_XVIII Cysteine-rich domain of the variant 3 of collagen XVIII (V3C18 ). The cysteine-rich domain (CRD) is an essential part of the variant 3 of collagen XVIII (V3C18), which regulates major cellular functions such as the differential epithelial morphogenesis of early lung and kidney development. V3C18 is a 170 kD protein, which is proteolotically processed into the CRD-containing 50 kD glucoprotein precursor that binds Wnt3a through its CRD domain and suppresses the Wnt3a-induced stabilization of beta catenin. Full-length V3C18 is unable to inhibit Wnt signaling.
Probab=98.16 E-value=1.1e-06 Score=90.55 Aligned_cols=63 Identities=6% Similarity=-0.044 Sum_probs=52.2
Q ss_pred cccccc--chheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhccccc
Q psy11546 854 SEADSI--EKKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHLK 931 (1634)
Q Consensus 854 ~~c~~i--~~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~p 931 (1634)
..|+.| ++++|..|+||.|.|||+++|.+|++ +...... |.-++. ..|+++|..|+|++|+|
T Consensus 3 ~~C~pIp~~l~~C~~l~Yn~t~~PN~lgH~sq~e-----a~~~~~~-------f~pLv~----~~Csp~l~~FlCs~~~P 66 (123)
T cd07455 3 PRCLPVPSSLPFCSRLGIRSFWLPNFLNHTSVEE-----VRAVLAE-------WAWLLE----SGCHPSLEWFFCLLLVP 66 (123)
T ss_pred CCCccCCCccccccCCCcccccCCcccCCCCHHH-----HHHHHHH-------HHHHHh----CCCChhhHHHhhhccCC
Confidence 359999 99999999999999999999999999 5544333 333444 48999999999999999
Q ss_pred c
Q psy11546 932 H 932 (1634)
Q Consensus 932 ~ 932 (1634)
.
T Consensus 67 ~ 67 (123)
T cd07455 67 S 67 (123)
T ss_pred c
Confidence 4
No 44
>PF12999 PRKCSH-like: Glucosidase II beta subunit-like
Probab=98.14 E-value=1.5e-06 Score=94.24 Aligned_cols=68 Identities=38% Similarity=0.725 Sum_probs=56.4
Q ss_pred CCceecCCCc-e-ecCCcccCCCCCCCCCCCCCCCCCCCcccCccccccccCCCCCCCCCCCCCCCCCCCCCceecCCCC
Q psy11546 996 NSSFRCGNGE-C-VSIGSKCNQLVDCADGSDEKNCSCADFLKSQFLTRKICDGIIDCWDFSDEYECEWCSPGQYICPNSR 1073 (1634)
Q Consensus 996 ~~~F~C~nG~-C-Ip~~~~CDG~~DC~DgSDE~~C~C~~~~~~~~~~~~~CDG~~DC~D~SDE~~C~~C~~~~F~C~~~~ 1073 (1634)
.+.|.|-+|. = |+..++.|++-||+|||||.+= .-|+.+.|+|.+.+
T Consensus 35 ~~~f~Cl~~~~~~I~~~~iNDdyCDC~DGSDEPGT-------------------------------sAC~~~~FyC~N~g 83 (176)
T PF12999_consen 35 NGKFTCLDGSKIVIPFSQINDDYCDCPDGSDEPGT-------------------------------SACSNGKFYCENKG 83 (176)
T ss_pred CCceEecCCCCceecHHHccCcceeCCCCCCcccc-------------------------------ccCcCceEeeccCC
Confidence 4679998763 3 7999999999888888888431 24667799998775
Q ss_pred ---eeeccccccCCCcc---CCCCCCc
Q psy11546 1074 ---VCIERTRLCDGIKD---CPLGDDE 1094 (1634)
Q Consensus 1074 ---~CI~~~~~CDG~~D---C~DGSde 1094 (1634)
.-|+...|-||+-| |.|||||
T Consensus 84 ~~p~~i~~s~VnDGICDy~~CCDGSDE 110 (176)
T PF12999_consen 84 HIPRYIPSSRVNDGICDYDICCDGSDE 110 (176)
T ss_pred CCCceeehhhhcCCcCcccccCCCCCC
Confidence 68999999999999 9999999
No 45
>PF15494 SRCR_2: Scavenger receptor cysteine-rich domain
Probab=98.08 E-value=6.8e-06 Score=81.65 Aligned_cols=89 Identities=27% Similarity=0.518 Sum_probs=60.6
Q ss_pred eEEEEec--CcccccccCcchhhhhhhchhhhcchhHHhhccCCCCCccceeeeccCCC---CCcceEEEeccccccchh
Q psy11546 1160 FLMIQKQ--GQWGKLCMNQINNFIMKTLKWKISDLGKAICKSMTFRDLNEIEAVADPST---DDSIYYELSMNAINSSAA 1234 (1634)
Q Consensus 1160 ~VeV~~~--GtWg~VCsd~~~~~~~~~~~~~isdlA~vICRqLG~~~~~sve~v~~~~~---~~~~y~~vscn~~e~s~s 1234 (1634)
.|+|+.. +.|.+||+|+|+.. ++..+|++|||........+..... ....|..+..... .
T Consensus 5 ~LQV~~~~~~~W~~VC~d~W~~~-----------~s~~~C~qLGy~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~----~ 69 (98)
T PF15494_consen 5 LLQVYSAASGSWLPVCSDNWNEA-----------LSKAACQQLGYSSSSSSSSVNLSDISPSPSQSFVKLSSNSN----S 69 (98)
T ss_pred EEEEEecCCCCEeeecccccCHH-----------HHHHHHHHhCCCCCCccceeEeccCCcccccCeEEEecCCC----C
Confidence 4555555 89999999999987 7899999999987666555433321 1335666622211 1
Q ss_pred cccccccccccCCCCceeeeeccccccCCCC
Q psy11546 1235 TKSALLFQKTKCSQKQVVAVNCSSLECGIRP 1265 (1634)
Q Consensus 1235 ~c~~l~~~~~~C~s~~vV~V~C~~~~CGirp 1265 (1634)
...........|.++.+|.+.|. +||.|+
T Consensus 70 ~l~~~~~~~~~C~S~~vVsL~C~--~CG~r~ 98 (98)
T PF15494_consen 70 TLQQSLSPSSSCSSGSVVSLQCS--DCGKRT 98 (98)
T ss_pred ceEEEEecCCCCCCCCEEEEECc--ccCCcC
Confidence 11222335679999999999997 899873
No 46
>cd07453 CRD_crescent Cysteine-rich domain of the crescent protein. The cysteine-rich domain (CRD) is an essential part of the crescent protein, a member of the secreted frizzled-related protein (SFRP) family, which regulates convergent extension movements (CEMs) during gastrulation and neurulation. Xenopus laevis crescent efficiently forms inhibitory complexes with Wnt5a and Wnt11, but this effect is cancelled in the presence of another member of the SFRP family, Frzb1. A potential role for Crescent in head formation is to regulate a non-canonical Wnt pathway positively in the adjacent posterior mesoderm, and negatively in the overlying anterior neuroectoderm.
Probab=98.08 E-value=1.3e-06 Score=91.33 Aligned_cols=63 Identities=6% Similarity=-0.071 Sum_probs=49.2
Q ss_pred ccccc--chheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhcccccc
Q psy11546 855 EADSI--EKKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHLKH 932 (1634)
Q Consensus 855 ~c~~i--~~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~p~ 932 (1634)
.|+.| .+++|..|+||.|.|||+++|.+|++ +......+..+ +. ..|+++|..|+|++|+|.
T Consensus 2 ~C~~iP~~l~~C~~l~Yn~T~~PN~lgH~sq~e-----a~~~~~~~~pL------v~-----~~C~p~l~~FLCSl~~P~ 65 (135)
T cd07453 2 PCMRIPKSMALCYDIGYSEMRIPNLLEHETMAE-----VIQQSSSWLPL------LA-----RECHPDARIFLCSLFAPI 65 (135)
T ss_pred CCccCCCcchhhCCCCCCcccCCcccCCCCHHH-----HHHHHHHHHHH------Hh-----cCCChhhHHHhhhhcCcc
Confidence 35544 58899999999999999999999999 65543333333 23 489999999999999995
Q ss_pred c
Q psy11546 933 T 933 (1634)
Q Consensus 933 ~ 933 (1634)
.
T Consensus 66 C 66 (135)
T cd07453 66 C 66 (135)
T ss_pred C
Confidence 4
No 47
>COG3591 V8-like Glu-specific endopeptidase [Amino acid transport and metabolism]
Probab=97.96 E-value=4.8e-05 Score=87.12 Aligned_cols=197 Identities=20% Similarity=0.268 Sum_probs=104.8
Q ss_pred CCCccceEEEEEe---CCeeeeeceeeCCCccccccccccCCCCCceEEEEeEEecCCCCCCCCceeEeeEEEEe--CCC
Q psy11546 1292 RLGSWPWQAALYK---EGEFQCGATLISDQWLLSAGHCFYRAQDDYWVARLGTLRRGTKLPSPYEQLRPISKIIL--HPQ 1366 (1634)
Q Consensus 1292 ~~GewPW~VsL~~---~g~~~CGGTLIS~rWVLTAAHCV~~~~~~~~~VrLG~~~~~s~~~s~~~Q~~~V~~Iii--HP~ 1366 (1634)
....|||-+-... .|++-|+|+||+++.||||+||+....... . .+-.+..+.. ........+..+.. .|.
T Consensus 45 dt~~~Py~av~~~~~~tG~~~~~~~lI~pntvLTa~Hc~~s~~~G~-~-~~~~~p~g~~--~~~~~~~~~~~~~~~~~~g 120 (251)
T COG3591 45 DTTQFPYSAVVQFEAATGRLCTAATLIGPNTVLTAGHCIYSPDYGE-D-DIAAAPPGVN--SDGGPFYGITKIEIRVYPG 120 (251)
T ss_pred cCCCCCcceeEEeecCCCcceeeEEEEcCceEEEeeeEEecCCCCh-h-hhhhcCCccc--CCCCCCCceeeEEEEecCC
Confidence 4567999776643 356777779999999999999997654321 1 1111111111 11122222222222 333
Q ss_pred --CCCCCCcCceEEEEeC--CCCCCCceeeecCCCCCCCCCCCeEEEEEecccccCCCcCCCCcEEEEEeeeChhHHhhh
Q psy11546 1367 --YVDAGFINDISILKMK--TPFSNYVRPICLPHPNTPLTDGTLCTVVGWGQLFEIGRVFPDTLQEVQLPIISTAECRKR 1442 (1634)
Q Consensus 1367 --Yn~~t~~nDIALLkL~--~PfSd~VqPICLP~~~~~l~~G~~c~VvGWG~t~e~g~~~s~~Lq~v~VpIIs~eeC~~~ 1442 (1634)
|.......|+..+.|. .++...+...-++..... ..+....++||-..... .++. .+.|...
T Consensus 121 ~~~~~d~~~~~v~~~~~~~g~~~~~~~~~~~~~~~~~~-~~~d~i~v~GYP~dk~~------~~~~-------~e~t~~v 186 (251)
T COG3591 121 ELYKEDGASYDVGEAALESGINIGDVVNYLKRNTASEA-KANDRITVIGYPGDKPN------IGTM-------WESTGKV 186 (251)
T ss_pred ceeccCCceeeccHHHhccCCCcccccccccccccccc-ccCceeEEEeccCCCCc------ceeE-------eeeccee
Confidence 3444555666666666 235555554444433333 33444888888533221 1110 0011110
Q ss_pred ccCCCcccCCCCeEEEeecCCCCCCCCCCCCCeEEEecCCCcEEEEEEEEcCCCCCCCCCCeEEEEc-CCchHHHHHhhc
Q psy11546 1443 TLFLPLYRVTENMFCAGFERGGRDACLGDSGGPLMCQEPDGRWSLMGVTSNGYGCARANRPGVYTKV-SNYIPWLYNNMA 1521 (1634)
Q Consensus 1443 ~~~~~~~~It~sm~CAG~~~gg~daC~GDSGGPLVc~~~~GrW~LvGIvS~G~gCg~~~~PgVYTrV-s~YlDWI~~vI~ 1521 (1634)
. .+.... ..-..+.+.|+||+|++.... +++||...|.+-.........+++ ..+++||++.++
T Consensus 187 ~------~~~~~~-----l~y~~dT~pG~SGSpv~~~~~----~vigv~~~g~~~~~~~~~n~~vr~t~~~~~~I~~~~~ 251 (251)
T COG3591 187 N------SIKGNK-----LFYDADTLPGSSGSPVLISKD----EVIGVHYNGPGANGGSLANNAVRLTPEILNFIQQNIK 251 (251)
T ss_pred E------EEecce-----EEEEecccCCCCCCceEecCc----eEEEEEecCCCcccccccCcceEecHHHHHHHHHhhC
Confidence 0 011110 112357899999999997652 999999988753322334445555 446899987753
No 48
>smart00192 LDLa Low-density lipoprotein receptor domain class A. Cysteine-rich repeat in the low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. The N-terminal type A repeats in LDL receptor bind the lipoproteins. Other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement. Mutations in the LDL receptor gene cause familial hypercholesterolemia.
Probab=97.84 E-value=1.1e-05 Score=65.17 Aligned_cols=32 Identities=47% Similarity=1.175 Sum_probs=29.1
Q ss_pred CCCCceecCCCCeeeccccccCCCccCCCCCCc
Q psy11546 1062 CSPGQYICPNSRVCIERTRLCDGIKDCPLGDDE 1094 (1634)
Q Consensus 1062 C~~~~F~C~~~~~CI~~~~~CDG~~DC~DGSde 1094 (1634)
|+.++|+|.++ .||+..++|||+.||.|||||
T Consensus 2 C~~~~f~C~~~-~Ci~~~~~Cdg~~dC~dgsDE 33 (33)
T smart00192 2 CPPGEFQCDNG-RCIPLSWVCDGVDDCSDGSDE 33 (33)
T ss_pred CCCCeEECCCC-CEECchhhCCCcCcCcCCCCC
Confidence 67789999865 599999999999999999998
No 49
>cd07451 CRD_SMO Cysteine-rich domain of the smoothened receptor (Smo) integral membrane protein. The cysteine-rich domain (CRD) is part of the smoothened receptor (Smo), an integral membrane protein and one of the key players in the Hedgehog (Hh) signaling pathway, critical for development, cell growth and migration, as well as stem cell maintenance. The CRD of Smo is conserved in vertebrates and can also be identified in invertebrates. The precise function of the CRD in Smo is unknown. Mutations in the Drosophila CRD disrupt Smo activity in vivo, while deletion of the CRD in mammalian cells does not seem to affect the activity of overexpressed Smo.
Probab=97.74 E-value=6.6e-06 Score=85.92 Aligned_cols=65 Identities=5% Similarity=0.061 Sum_probs=52.7
Q ss_pred ccccccchheeec--ccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhccccc
Q psy11546 854 SEADSIEKKVILV--MSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHLK 931 (1634)
Q Consensus 854 ~~c~~i~~~~c~~--l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~p 931 (1634)
..|+.|+..+|.. ||||.|.|||+++|.||++ |......+.. ++.+ .+|+++|..|+|++|.|
T Consensus 3 ~~Cepi~~~~C~g~~lpYn~T~~PN~~~h~tq~e-----a~~~l~~f~p-------Lv~v---p~C~~~l~~FLCSly~P 67 (132)
T cd07451 3 AKCEPLKNTTCLGSKLPYTYTSLDLVPDSTTQEE-----VQEKLHLWSG-------LRNV---PKCWAVIQPLLCALYMP 67 (132)
T ss_pred ceeeECCccccCCCCCCccceecCCccCCcCHHH-----HHHHHHHHHH-------HHhh---cccchhheeeeEeeECC
Confidence 4699999999977 9999999999999999999 5544333333 4442 38999999999999999
Q ss_pred cc
Q psy11546 932 HT 933 (1634)
Q Consensus 932 ~~ 933 (1634)
..
T Consensus 68 ~C 69 (132)
T cd07451 68 KC 69 (132)
T ss_pred cC
Confidence 53
No 50
>KOG3577|consensus
Probab=97.63 E-value=7.1e-05 Score=94.97 Aligned_cols=152 Identities=11% Similarity=0.099 Sum_probs=93.2
Q ss_pred ccCcccccccchheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhccc
Q psy11546 850 ENNTSEADSIEKKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQH 929 (1634)
Q Consensus 850 ~~~t~~c~~i~~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~ 929 (1634)
.+.+..|++|+.++|..++||.|.|||.++|.||++ |.. .++.|.-++ .++|++.|..|+|++|
T Consensus 37 ~~~~~~c~pi~i~~C~~i~Yn~T~~PNll~H~tQ~e-----a~~-------~~~~f~PLv----~~~Cs~~l~~FLCS~y 100 (556)
T KOG3577|consen 37 AAGKARCEPITIPMCQGLGYNMTAMPNLVGHETQEE-----AGL-------KLHQFHPLV----ALRCSPGLRFFLCSLY 100 (556)
T ss_pred ccccccccceeccccCCCCCCcccCCccccchhHHH-----hhh-------cccccCCcc----ccccchhhhhHhhcCC
Confidence 456678999999999999999999999999999998 222 233444434 4799999999999999
Q ss_pred ccccccCCCCCCCCCCCCCCCceeeccCC-----ceeeeeeecccccCCCCCcCC--Ccceeccccc---cCCCCCCCce
Q psy11546 930 LKHTHQHNPDTNISTKAIPSSFNFNQVNG-----IPILTKVYTKVSKTNSTSKER--NQTEFHLSVN---RSECEMNSSF 999 (1634)
Q Consensus 930 ~p~~~~~~p~~~i~t~~ip~~f~~~~v~g-----~~~~~~v~~~~~~c~~~s~~~--~~~~f~~~~~---~~~C~~~~~F 999 (1634)
.|-+.+ .+.....|++-.|..+.. +......|+..-.|....... ...|...... .........+
T Consensus 101 ~P~C~~-----~~d~pi~PCRslCe~ar~~Ce~~m~~~gf~WPe~L~C~~fp~~~~~~~~C~~~~~~~~~~~~~~~~~~~ 175 (556)
T KOG3577|consen 101 APMCTE-----DLDRPILPCRSLCEAAREGCEPLMEMFGFPWPEFLRCDKFPVPDSAGELCMGPPSEGFAISAGSAGVSV 175 (556)
T ss_pred CCcCcc-----ccCcccCccHHHHHHHhcccHHHHHhcCCCCCccccCCCCCCCCCccccccCCccccccCCCCCCCCCC
Confidence 994221 111122255555655221 222344688888888776654 2222222111 0111111112
Q ss_pred ecCCCceecCCcccCCCCCCCCCCCCCCCCCC
Q psy11546 1000 RCGNGECVSIGSKCNQLVDCADGSDEKNCSCA 1031 (1634)
Q Consensus 1000 ~C~nG~CIp~~~~CDG~~DC~DgSDE~~C~C~ 1031 (1634)
.+..|. ++..++...+||. |.+.
T Consensus 176 ~~~~g~--------~~~~~~~~~~~~~-~~~~ 198 (556)
T KOG3577|consen 176 LPDFGG--------PGVVLAKPWSDEL-CQNP 198 (556)
T ss_pred CCCCCC--------CccccCCCCcccc-ccCc
Confidence 222222 6777777778877 6654
No 51
>cd07066 CRD_FZ CRD_domain cysteine-rich domain, also known as Fz (frizzled) domain. CRD_FZ is an essential component of a number of cell surface receptors, which are involved in multiple signal transduction pathways, particularly in modulating the activity of the Wnt proteins, which play a fundamental role in the early development of metazoans. CRD is also found in secreted frizzled related proteins (SFRPs), which lack the transmembrane segment found in the frizzled protein. The CRD domain is also present in the alpha-1 chain of mouse type XVIII collagen, in carboxypeptidase Z, several receptor tyrosine kinases, and the mosaic transmembrane serine protease corin. The CRD domain is well conserved in metazoans - 10 frizzled proteins have been identified in mammals, 4 in Drosophila and 3 in Caenorhabditis elegans. CRD domains have also been identified in multiple tandem copies in a Dictyostelium discoideum protein. Very little is known about the mechanism by which CRD domains interact wit
Probab=97.62 E-value=3.7e-05 Score=78.57 Aligned_cols=80 Identities=9% Similarity=0.046 Sum_probs=57.7
Q ss_pred cccccchheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhcccccccc
Q psy11546 855 EADSIEKKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHLKHTH 934 (1634)
Q Consensus 855 ~c~~i~~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~p~~~ 934 (1634)
.|++|..++|..++||.|.+||+++|.+|++ +......+..+ +. .+|++++..|+|++++|...
T Consensus 1 ~C~pi~~~~C~~~~Yn~t~~pn~~~~~~~~~-----a~~~~~~~~~l------~~-----~~C~~~~~~f~Cs~~~P~C~ 64 (119)
T cd07066 1 KCEPIPLPLCRGLPYNTTRFPNLLGHESQEE-----AEQELESFTPL------VN-----SGCHPDLRFFLCSLYFPECT 64 (119)
T ss_pred CCccCCHHHhCCCCCCceeCCCccCCCCHHH-----HHHHHHHHHHH------Hh-----cCCChhHHHHHHhccCcccc
Confidence 4999999999999999999999999988877 44433333222 23 38999999999999999642
Q ss_pred cCCCCCCCCCCCCCCCceeec
Q psy11546 935 QHNPDTNISTKAIPSSFNFNQ 955 (1634)
Q Consensus 935 ~~~p~~~i~t~~ip~~f~~~~ 955 (1634)
.... ....|++-.|..
T Consensus 65 ~~~~-----~~~~PCRs~Cee 80 (119)
T cd07066 65 PDGD-----RPIPPCRSLCEE 80 (119)
T ss_pred CCCC-----CcCCChHHHHHH
Confidence 2211 234456666666
No 52
>PF12999 PRKCSH-like: Glucosidase II beta subunit-like
Probab=97.21 E-value=0.00024 Score=77.46 Aligned_cols=61 Identities=30% Similarity=0.626 Sum_probs=53.3
Q ss_pred CCCcCCCCc-c-cCCCcccCCcccCCCCCCCCC------cccccCCC----cEecCCCccCCCCC---CCCCCCCC
Q psy11546 1541 NGHRCPLGE-C-LPKARVCNGYMECSDGKCEMN------SSFRCGNG----ECVSIGSKCNQLVD---CADGSDEK 1601 (1634)
Q Consensus 1541 ~~f~C~~G~-C-Ip~~~vCDG~~DC~DGsDE~~------~~F~C~~G----~CI~~~~~CDG~~D---C~DgSDE~ 1601 (1634)
+.|.|-+|. = |+.+++.|++-||+|||||.. ..|.|.|. .-|+.+++=||+=| |=|||||.
T Consensus 36 ~~f~Cl~~~~~~I~~~~iNDdyCDC~DGSDEPGTsAC~~~~FyC~N~g~~p~~i~~s~VnDGICDy~~CCDGSDE~ 111 (176)
T PF12999_consen 36 GKFTCLDGSKIVIPFSQINDDYCDCPDGSDEPGTSACSNGKFYCENKGHIPRYIPSSRVNDGICDYDICCDGSDES 111 (176)
T ss_pred CceEecCCCCceecHHHccCcceeCCCCCCccccccCcCceEeeccCCCCCceeehhhhcCCcCcccccCCCCCCC
Confidence 469999885 3 799999999999999999975 25999983 58899999999999 99999994
No 53
>PF01392 Fz: Fz domain; InterPro: IPR020067 The frizzled (fz) domain is an extracellular domain of about 120 amino acids.It was first identified in the alpha-1 chain of type XVIII collagen and in members of the Frizzled family of seven transmembrane (7TM) proteins which act as receptors for secreted Wingless (Wg)/Wnt glycoproteins []. In addition to these proteins, one or two copies of the fz domain are also found [, , , , ] in: The frizbee (Frzb) family; secreted frizzled-like proteins. Smoothened; another 7TM receptor involved in hedgehog signaling. Carboxpeptidase Z (CPZ). Transmembrane serine protease corin. Two receptor tyrosine kinases (RTKs) subfamilies, the Ror family and the muscle-specific kinase (MuSK) family. As the fz domain contains 10 cysteines which are largely conserved, it has also been called cysteine-rich domain (CRD) []. The fz domain also contains several other highly conserved residues, for example, a basic amino acid follows C6, and a conserved proline residues lies four residues C-terminal to C9 []. The crystal structure of a fz domain shows that it is predominantly alpha-helical with all cysteines forming disulphide bonds. In addition to helical regions, two short beta-strands at the N terminus form a minimal beta-sheet with the second beta sheet passing through a knot created by disulphide bonds []. Several fz domains have been shown to be both necessary and sufficient for Wg/Wnt ligand binding, strongly suggesting that the fz domain is a Wg/Wnt interacting domain [, ].; GO: 0005515 protein binding; PDB: 1IJX_E 3HKL_B 1IJY_B 4F0A_A.
Probab=97.01 E-value=0.00014 Score=73.15 Aligned_cols=62 Identities=13% Similarity=0.190 Sum_probs=45.2
Q ss_pred ccccchheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhccccccc
Q psy11546 856 ADSIEKKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHLKHT 933 (1634)
Q Consensus 856 c~~i~~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~p~~ 933 (1634)
|++|+.++|..++||.|.+||.+++.++++ +.+. +..|..++.. .|++++..|+|+.++|..
T Consensus 1 C~pi~~~~C~~~~y~~t~~pn~~~~~~~~~-----~~~~-------~~~~~~l~~~----~C~~~~~~flC~~~~P~C 62 (116)
T PF01392_consen 1 CEPITLSFCRGLPYNQTSFPNSLGHQTQDE-----ASQQ-------LNAFSTLVNS----GCHPYLRPFLCSLYFPPC 62 (116)
T ss_dssp EEE---GGGTTSS-TEEESSETTTSSSHHH-----HHHH-------HHCCHHHHCC----TSHTTHHHHHHHHHS-BB
T ss_pred CccCChHHhcCCCCCceEeecccCCcCHHH-----HHHH-------HHHHHHHhcc----hhhHHHhhhhhcccCCcC
Confidence 899999999999999999999999988766 3333 3334444442 899999999999999964
No 54
>KOG2397|consensus
Probab=96.05 E-value=0.005 Score=75.46 Aligned_cols=52 Identities=40% Similarity=0.618 Sum_probs=38.2
Q ss_pred CCCCCCCCCCCCCCCCCCCCCceecCCCC---eeeccccccCCCccCCCCCCccc
Q psy11546 1045 DGIIDCWDFSDEYECEWCSPGQYICPNSR---VCIERTRLCDGIKDCPLGDDEKQ 1096 (1634)
Q Consensus 1045 DG~~DC~D~SDE~~C~~C~~~~F~C~~~~---~CI~~~~~CDG~~DC~DGSde~~ 1096 (1634)
|++-||.||+||..=.-|+.+.|+|.+.+ .=|+...+=||+-||.|||||..
T Consensus 61 Dd~CDC~DGsDEPGtsACpngkF~C~N~G~~p~~i~ssrV~DGICDCCDgSDE~~ 115 (480)
T KOG2397|consen 61 DDSCDCLDGSDEPGTSACPNGKFYCVNQGHQPKYIPSSRVNDGICDCCDGSDEYL 115 (480)
T ss_pred cccccCCCCCCCCccccCCCCceeeeecCCCceeeechhccCcccccccCCCCcc
Confidence 33344444444433346889999997654 67999999999999999999953
No 55
>TIGR02037 degP_htrA_DO periplasmic serine protease, Do/DeqQ family. This family consists of a set proteins various designated DegP, heat shock protein HtrA, and protease DO. The ortholog in Pseudomonas aeruginosa is designated MucD and is found in an operon that controls mucoid phenotype. This family also includes the DegQ (HhoA) paralog in E. coli which can rescue a DegP mutant, but not the smaller DegS paralog, which cannot. Members of this family are located in the periplasm and have separable functions as both protease and chaperone. Members have a trypsin domain and two copies of a PDZ domain. This protein protects bacteria from thermal and other stresses and may be important for the survival of bacterial pathogens.// The chaperone function is dominant at low temperatures, whereas the proteolytic activity is turned on at elevated temperatures.
Probab=95.59 E-value=0.22 Score=62.07 Aligned_cols=170 Identities=17% Similarity=0.178 Sum_probs=89.6
Q ss_pred eeeeeceeeCCC-ccccccccccCCCCCceEEEEeEEecCCCCCCCCceeEeeEEEEeCCCCCCCCCcCceEEEEeCCCC
Q psy11546 1307 EFQCGATLISDQ-WLLSAGHCFYRAQDDYWVARLGTLRRGTKLPSPYEQLRPISKIILHPQYVDAGFINDISILKMKTPF 1385 (1634)
Q Consensus 1307 ~~~CGGTLIS~r-WVLTAAHCV~~~~~~~~~VrLG~~~~~s~~~s~~~Q~~~V~~IiiHP~Yn~~t~~nDIALLkL~~Pf 1385 (1634)
...++|.+|++. ||||++|.+.... .+.|.+.. ...+..+-+..++ ..|||||+++.+
T Consensus 57 ~~~GSGfii~~~G~IlTn~Hvv~~~~--~i~V~~~~-----------~~~~~a~vv~~d~-------~~DlAllkv~~~- 115 (428)
T TIGR02037 57 RGLGSGVIISADGYILTNNHVVDGAD--EITVTLSD-----------GREFKAKLVGKDP-------RTDIAVLKIDAK- 115 (428)
T ss_pred cceeeEEEECCCCEEEEcHHHcCCCC--eEEEEeCC-----------CCEEEEEEEEecC-------CCCEEEEEecCC-
Confidence 457899999876 9999999997643 23333221 1234444333444 379999999865
Q ss_pred CCCceeeecCCCCCCCCCCCeEEEEEecccccCCCcCCCCcEEEEEeeeChhHHhhhccCCCcccCCCCeEEEeecCCCC
Q psy11546 1386 SNYVRPICLPHPNTPLTDGTLCTVVGWGQLFEIGRVFPDTLQEVQLPIISTAECRKRTLFLPLYRVTENMFCAGFERGGR 1465 (1634)
Q Consensus 1386 Sd~VqPICLP~~~~~l~~G~~c~VvGWG~t~e~g~~~s~~Lq~v~VpIIs~eeC~~~~~~~~~~~It~sm~CAG~~~gg~ 1465 (1634)
..+.++-|... ..+..|..+++.|+.... ........+.-..... .. .. .. ..++= ...
T Consensus 116 -~~~~~~~l~~~-~~~~~G~~v~aiG~p~g~------~~~~t~G~vs~~~~~~----~~-~~--~~-~~~i~-----tda 174 (428)
T TIGR02037 116 -KNLPVIKLGDS-DKLRVGDWVLAIGNPFGL------GQTVTSGIVSALGRSG----LG-IG--DY-ENFIQ-----TDA 174 (428)
T ss_pred -CCceEEEccCC-CCCCCCCEEEEEECCCcC------CCcEEEEEEEecccCc----cC-CC--Cc-cceEE-----ECC
Confidence 34556666432 346789999999985321 1112222221111000 00 00 00 11111 112
Q ss_pred CCCCCCCCCeEEEecCCCcEEEEEEEEcCCC-CCCCCCCeEEEEcCCchHHHHHhhcc
Q psy11546 1466 DACLGDSGGPLMCQEPDGRWSLMGVTSNGYG-CARANRPGVYTKVSNYIPWLYNNMAA 1522 (1634)
Q Consensus 1466 daC~GDSGGPLVc~~~~GrW~LvGIvS~G~g-Cg~~~~PgVYTrVs~YlDWI~~vI~~ 1522 (1634)
..-.|.|||||+-.. | .++||.+.... -+.....+...-+......+.+.++.
T Consensus 175 ~i~~GnSGGpl~n~~--G--~viGI~~~~~~~~g~~~g~~faiP~~~~~~~~~~l~~~ 228 (428)
T TIGR02037 175 AINPGNSGGPLVNLR--G--EVIGINTAIYSPSGGNVGIGFAIPSNMAKNVVDQLIEG 228 (428)
T ss_pred CCCCCCCCCceECCC--C--eEEEEEeEEEcCCCCccceEEEEEhHHHHHHHHHHHhc
Confidence 345689999998432 3 89999875422 11111223445555555555554443
No 56
>TIGR02038 protease_degS periplasmic serine pepetdase DegS. This family consists of the periplasmic serine protease DegS (HhoB), a shorter paralog of protease DO (HtrA, DegP) and DegQ (HhoA). It is found in E. coli and several other Proteobacteria of the gamma subdivision. It contains a trypsin domain and a single copy of PDZ domain (in contrast to DegP with two copies). A critical role of this DegS is to sense stress in the periplasm and partially degrade an inhibitor of sigma(E).
Probab=95.11 E-value=0.5 Score=57.59 Aligned_cols=168 Identities=14% Similarity=0.185 Sum_probs=86.2
Q ss_pred eeeeceeeCCC-ccccccccccCCCCCceEEEEeEEecCCCCCCCCceeEeeEEEEeCCCCCCCCCcCceEEEEeCCCCC
Q psy11546 1308 FQCGATLISDQ-WLLSAGHCFYRAQDDYWVARLGTLRRGTKLPSPYEQLRPISKIILHPQYVDAGFINDISILKMKTPFS 1386 (1634)
Q Consensus 1308 ~~CGGTLIS~r-WVLTAAHCV~~~~~~~~~VrLG~~~~~s~~~s~~~Q~~~V~~IiiHP~Yn~~t~~nDIALLkL~~PfS 1386 (1634)
...+|.+|++. ||||++|-+.... .+.|.+.. ...+..+-+..+| ..||||||++...
T Consensus 78 ~~GSG~vi~~~G~IlTn~HVV~~~~--~i~V~~~d-----------g~~~~a~vv~~d~-------~~DlAvlkv~~~~- 136 (351)
T TIGR02038 78 GLGSGVIMSKEGYILTNYHVIKKAD--QIVVALQD-----------GRKFEAELVGSDP-------LTDLAVLKIEGDN- 136 (351)
T ss_pred ceEEEEEEeCCeEEEecccEeCCCC--EEEEEECC-----------CCEEEEEEEEecC-------CCCEEEEEecCCC-
Confidence 46889999876 9999999996543 23333211 1233444343443 4799999998652
Q ss_pred CCceeeecCCCCCCCCCCCeEEEEEecccccCCCcCCCCcEEEEEeeeChhHHhhhccCCCcccCCCCeEEEeecCCCCC
Q psy11546 1387 NYVRPICLPHPNTPLTDGTLCTVVGWGQLFEIGRVFPDTLQEVQLPIISTAECRKRTLFLPLYRVTENMFCAGFERGGRD 1466 (1634)
Q Consensus 1387 d~VqPICLP~~~~~l~~G~~c~VvGWG~t~e~g~~~s~~Lq~v~VpIIs~eeC~~~~~~~~~~~It~sm~CAG~~~gg~d 1466 (1634)
+.++-|- ....+..|..+.+.|+.... ........+.-+... . ... .-....+= ....
T Consensus 137 --~~~~~l~-~s~~~~~G~~V~aiG~P~~~------~~s~t~GiIs~~~r~---~---~~~--~~~~~~iq-----tda~ 194 (351)
T TIGR02038 137 --LPTIPVN-LDRPPHVGDVVLAIGNPYNL------GQTITQGIISATGRN---G---LSS--VGRQNFIQ-----TDAA 194 (351)
T ss_pred --CceEecc-CcCccCCCCEEEEEeCCCCC------CCcEEEEEEEeccCc---c---cCC--CCcceEEE-----ECCc
Confidence 2233332 22356789999999985321 111222222111110 0 000 00001111 1234
Q ss_pred CCCCCCCCeEEEecCCCcEEEEEEEEcCCCCCC-C--CCCeEEEEcCCchHHHHHhhcc
Q psy11546 1467 ACLGDSGGPLMCQEPDGRWSLMGVTSNGYGCAR-A--NRPGVYTKVSNYIPWLYNNMAA 1522 (1634)
Q Consensus 1467 aC~GDSGGPLVc~~~~GrW~LvGIvS~G~gCg~-~--~~PgVYTrVs~YlDWI~~vI~~ 1522 (1634)
.-.|.|||||+-.. | .++||.+....... . ...+....+......+.+.++.
T Consensus 195 i~~GnSGGpl~n~~--G--~vIGI~~~~~~~~~~~~~~g~~faIP~~~~~~vl~~l~~~ 249 (351)
T TIGR02038 195 INAGNSGGALINTN--G--ELVGINTASFQKGGDEGGEGINFAIPIKLAHKIMGKIIRD 249 (351)
T ss_pred cCCCCCcceEECCC--C--eEEEEEeeeecccCCCCccceEEEecHHHHHHHHHHHhhc
Confidence 56789999999432 3 79999876432111 1 1123344444455555555443
No 57
>PF13365 Trypsin_2: Trypsin-like peptidase domain; PDB: 1Y8T_A 2Z9I_A 3QO6_A 1L1J_A 1QY6_A 2O8L_A 3OTP_E 2ZLE_I 1KY9_A 3CS0_A ....
Probab=94.64 E-value=0.059 Score=53.56 Aligned_cols=22 Identities=23% Similarity=0.294 Sum_probs=19.5
Q ss_pred eeceeeCCC-ccccccccccCCC
Q psy11546 1310 CGATLISDQ-WLLSAGHCFYRAQ 1331 (1634)
Q Consensus 1310 CGGTLIS~r-WVLTAAHCV~~~~ 1331 (1634)
|+|.+|+++ +|||||||+....
T Consensus 1 GTGf~i~~~g~ilT~~Hvv~~~~ 23 (120)
T PF13365_consen 1 GTGFLIGPDGYILTAAHVVEDWN 23 (120)
T ss_dssp EEEEEEETTTEEEEEHHHHTCCT
T ss_pred CEEEEEcCCceEEEchhheeccc
Confidence 689999999 9999999997643
No 58
>PRK10898 serine endoprotease; Provisional
Probab=94.55 E-value=0.86 Score=55.65 Aligned_cols=82 Identities=12% Similarity=0.188 Sum_probs=50.0
Q ss_pred eeeeceeeCCC-ccccccccccCCCCCceEEEEeEEecCCCCCCCCceeEeeEEEEeCCCCCCCCCcCceEEEEeCCCCC
Q psy11546 1308 FQCGATLISDQ-WLLSAGHCFYRAQDDYWVARLGTLRRGTKLPSPYEQLRPISKIILHPQYVDAGFINDISILKMKTPFS 1386 (1634)
Q Consensus 1308 ~~CGGTLIS~r-WVLTAAHCV~~~~~~~~~VrLG~~~~~s~~~s~~~Q~~~V~~IiiHP~Yn~~t~~nDIALLkL~~PfS 1386 (1634)
..-+|.+|++. +|||+||=+.... .+.|.+.. ...+..+-+..+| ..|||||+++..
T Consensus 78 ~~GSGfvi~~~G~IlTn~HVv~~a~--~i~V~~~d-----------g~~~~a~vv~~d~-------~~DlAvl~v~~~-- 135 (353)
T PRK10898 78 TLGSGVIMDQRGYILTNKHVINDAD--QIIVALQD-----------GRVFEALLVGSDS-------LTDLAVLKINAT-- 135 (353)
T ss_pred ceeeEEEEeCCeEEEecccEeCCCC--EEEEEeCC-----------CCEEEEEEEEEcC-------CCCEEEEEEcCC--
Confidence 46789999875 9999999886532 33333211 1223443344444 389999999864
Q ss_pred CCceeeecCCCCCCCCCCCeEEEEEec
Q psy11546 1387 NYVRPICLPHPNTPLTDGTLCTVVGWG 1413 (1634)
Q Consensus 1387 d~VqPICLP~~~~~l~~G~~c~VvGWG 1413 (1634)
...++-|.. ...+..|..+.+.|+.
T Consensus 136 -~l~~~~l~~-~~~~~~G~~V~aiG~P 160 (353)
T PRK10898 136 -NLPVIPINP-KRVPHIGDVVLAIGNP 160 (353)
T ss_pred -CCCeeeccC-cCcCCCCCEEEEEeCC
Confidence 122333322 2345678888888875
No 59
>PRK10139 serine endoprotease; Provisional
Probab=92.97 E-value=1.3 Score=55.89 Aligned_cols=169 Identities=19% Similarity=0.179 Sum_probs=88.6
Q ss_pred eeeeceeeCC--CccccccccccCCCCCceEEEEeEEecCCCCCCCCceeEeeEEEEeCCCCCCCCCcCceEEEEeCCCC
Q psy11546 1308 FQCGATLISD--QWLLSAGHCFYRAQDDYWVARLGTLRRGTKLPSPYEQLRPISKIILHPQYVDAGFINDISILKMKTPF 1385 (1634)
Q Consensus 1308 ~~CGGTLIS~--rWVLTAAHCV~~~~~~~~~VrLG~~~~~s~~~s~~~Q~~~V~~IiiHP~Yn~~t~~nDIALLkL~~Pf 1385 (1634)
...+|.+|++ .+|||.+|.+.+.. .+.|.+.. ...+..+-+...| ..||||||++.+
T Consensus 90 ~~GSG~ii~~~~g~IlTn~HVv~~a~--~i~V~~~d-----------g~~~~a~vvg~D~-------~~DlAvlkv~~~- 148 (455)
T PRK10139 90 GLGSGVIIDAAKGYVLTNNHVINQAQ--KISIQLND-----------GREFDAKLIGSDD-------QSDIALLQIQNP- 148 (455)
T ss_pred ceEEEEEEECCCCEEEeChHHhCCCC--EEEEEECC-----------CCEEEEEEEEEcC-------CCCEEEEEecCC-
Confidence 4678999974 69999999997643 34444321 1234444343433 479999999854
Q ss_pred CCCceeeecCCCCCCCCCCCeEEEEEecccccCCCcCCCCcEEEEEeeeChhHHhhhccCCCcccCCCCeEEEeecCCCC
Q psy11546 1386 SNYVRPICLPHPNTPLTDGTLCTVVGWGQLFEIGRVFPDTLQEVQLPIISTAECRKRTLFLPLYRVTENMFCAGFERGGR 1465 (1634)
Q Consensus 1386 Sd~VqPICLP~~~~~l~~G~~c~VvGWG~t~e~g~~~s~~Lq~v~VpIIs~eeC~~~~~~~~~~~It~sm~CAG~~~gg~ 1465 (1634)
....++.|... ..+..|..+.+.|+-.- . ... +..-+++... +.. ...... ...+= ...
T Consensus 149 -~~l~~~~lg~s-~~~~~G~~V~aiG~P~g----~--~~t---vt~GivS~~~--r~~--~~~~~~-~~~iq-----tda 207 (455)
T PRK10139 149 -SKLTQIAIADS-DKLRVGDFAVAVGNPFG----L--GQT---ATSGIISALG--RSG--LNLEGL-ENFIQ-----TDA 207 (455)
T ss_pred -CCCceeEecCc-cccCCCCEEEEEecCCC----C--CCc---eEEEEEcccc--ccc--cCCCCc-ceEEE-----ECC
Confidence 22345555432 34667888888887321 1 111 2223333110 000 000000 11111 123
Q ss_pred CCCCCCCCCeEEEecCCCcEEEEEEEEcCCCCC-CCCCCeEEEEcCCchHHHHHhhcc
Q psy11546 1466 DACLGDSGGPLMCQEPDGRWSLMGVTSNGYGCA-RANRPGVYTKVSNYIPWLYNNMAA 1522 (1634)
Q Consensus 1466 daC~GDSGGPLVc~~~~GrW~LvGIvS~G~gCg-~~~~PgVYTrVs~YlDWI~~vI~~ 1522 (1634)
..-.|.|||||+-.. =.++||.+....-+ ....-+...-+......+.+.++.
T Consensus 208 ~in~GnSGGpl~n~~----G~vIGi~~~~~~~~~~~~gigfaIP~~~~~~v~~~l~~~ 261 (455)
T PRK10139 208 SINRGNSGGALLNLN----GELIGINTAILAPGGGSVGIGFAIPSNMARTLAQQLIDF 261 (455)
T ss_pred ccCCCCCcceEECCC----CeEEEEEEEEEcCCCCccceEEEEEhHHHHHHHHHHhhc
Confidence 456799999999432 28999998643211 112234455554444555554443
No 60
>KOG3509|consensus
Probab=92.16 E-value=0.1 Score=69.60 Aligned_cols=86 Identities=28% Similarity=0.502 Sum_probs=67.5
Q ss_pred cccCCCcccCCcccCCCCCCCCCc----------ccccCCCcEecCCCccCCCCCCCCCCCCCCCccCCc-------CCC
Q psy11546 1549 ECLPKARVCNGYMECSDGKCEMNS----------SFRCGNGECVSIGSKCNQLVDCADGSDEKNCSCADF-------LKS 1611 (1634)
Q Consensus 1549 ~CIp~~~vCDG~~DC~DGsDE~~~----------~F~C~~G~CI~~~~~CDG~~DC~DgSDE~~C~C~~~-------~~~ 1611 (1634)
+|......|++..||.+.+|+.++ .|+|.+++|...-+.||...+|++++++.+|.+..- ...
T Consensus 2 ~c~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~~ 81 (964)
T KOG3509|consen 2 ECVKNRYACDRQPDCRDRSDVANDPAIGSACSPNEFKCNNPRCVQPEALLDADSTCGPNSTPSGCNAKPSASDCKPTETQ 81 (964)
T ss_pred chhhhhhhhccchhhHhhcccCCCccccccCCcchhccCCccccCchhhhccccccCCCCCcCCccccccccccCCcccc
Confidence 567778889999999999988774 388999999999999999999999998888843321 111
Q ss_pred ------CccCCcccCCCCCCCCCCCCCCC
Q psy11546 1612 ------QFLTRKICDGIIDCWDFSDEYEC 1634 (1634)
Q Consensus 1612 ------c~~~~~~CDG~~DC~DgSDE~~C 1634 (1634)
|......|||.+||.|+++|.+|
T Consensus 82 c~~~~~~~~~~~~~~g~~~~~~~~~~~~~ 110 (964)
T KOG3509|consen 82 CRDRLRCNPQSFQCDGTNDCKDGSDEVGC 110 (964)
T ss_pred cccchhcCCccccccCCCCCCccchhccc
Confidence 22344579999999999999765
No 61
>PRK10942 serine endoprotease; Provisional
Probab=91.28 E-value=2 Score=54.55 Aligned_cols=83 Identities=14% Similarity=0.243 Sum_probs=51.0
Q ss_pred eeeeceeeCC--CccccccccccCCCCCceEEEEeEEecCCCCCCCCceeEeeEEEEeCCCCCCCCCcCceEEEEeCCCC
Q psy11546 1308 FQCGATLISD--QWLLSAGHCFYRAQDDYWVARLGTLRRGTKLPSPYEQLRPISKIILHPQYVDAGFINDISILKMKTPF 1385 (1634)
Q Consensus 1308 ~~CGGTLIS~--rWVLTAAHCV~~~~~~~~~VrLG~~~~~s~~~s~~~Q~~~V~~IiiHP~Yn~~t~~nDIALLkL~~Pf 1385 (1634)
...+|.+|+. .+|||.+|.+.+.. .+.|.+.. ...+..+-+..+| ..|||||+++.+
T Consensus 111 ~~GSG~ii~~~~G~IlTn~HVv~~a~--~i~V~~~d-----------g~~~~a~vv~~D~-------~~DlAvlki~~~- 169 (473)
T PRK10942 111 ALGSGVIIDADKGYVVTNNHVVDNAT--KIKVQLSD-----------GRKFDAKVVGKDP-------RSDIALIQLQNP- 169 (473)
T ss_pred ceEEEEEEECCCCEEEeChhhcCCCC--EEEEEECC-----------CCEEEEEEEEecC-------CCCEEEEEecCC-
Confidence 4678999984 59999999986543 34444321 1223444343444 379999999743
Q ss_pred CCCceeeecCCCCCCCCCCCeEEEEEec
Q psy11546 1386 SNYVRPICLPHPNTPLTDGTLCTVVGWG 1413 (1634)
Q Consensus 1386 Sd~VqPICLP~~~~~l~~G~~c~VvGWG 1413 (1634)
....++-|-. ...+..|..+++.|+-
T Consensus 170 -~~l~~~~lg~-s~~l~~G~~V~aiG~P 195 (473)
T PRK10942 170 -KNLTAIKMAD-SDALRVGDYTVAIGNP 195 (473)
T ss_pred -CCCceeEecC-ccccCCCCEEEEEcCC
Confidence 1233444432 2346678888888863
No 62
>PF09272 Hepsin-SRCR: Hepsin, SRCR; InterPro: IPR015352 This entry represents the extracellular domain of the serine protease hepsin. The domain is formed primarily by three elements of regular secondary structure: a 12-residue alpha helix, a twisted five-stranded antiparallel beta sheet, and a second, two-stranded, antiparallel sheet. The two beta-sheets lie at roughly right angles to each other, with the helix nestled between the two, adopting an SRCR fold. The exact function of this domain has not been identified, though it probably may serve to orient the protease domain or place it in the vicinity of its substrate []. ; GO: 0004252 serine-type endopeptidase activity, 0070008 serine-type exopeptidase activity; PDB: 3T2N_B 1Z8G_A 1P57_A 1O5E_L 1O5F_L.
Probab=90.94 E-value=0.31 Score=49.19 Aligned_cols=95 Identities=19% Similarity=0.377 Sum_probs=50.0
Q ss_pred cCcceEEEE--ecCcccccccCcchhhhhhhchhhhcchhHHhhccCCCCCccceeeeccCC---CCCcceEEEeccccc
Q psy11546 1156 HDSGFLMIQ--KQGQWGKLCMNQINNFIMKTLKWKISDLGKAICKSMTFRDLNEIEAVADPS---TDDSIYYELSMNAIN 1230 (1634)
Q Consensus 1156 ~~eG~VeV~--~~GtWg~VCsd~~~~~~~~~~~~~isdlA~vICRqLG~~~~~sve~v~~~~---~~~~~y~~vscn~~e 1230 (1634)
....+|+++ .+|+|.-+|+...+.. .|..-|.+|||-.+.......... .+...|+-+......
T Consensus 9 ~aD~rL~vfD~te~~WRllCss~~N~r-----------vA~lsCeemGFlr~~~hs~l~~~~~g~~g~s~fFcv~~~~L~ 77 (110)
T PF09272_consen 9 PADQRLMVFDSTEGTWRLLCSSRSNAR-----------VAGLSCEEMGFLRAVSHSELDVRTAGANGTSGFFCVDEGRLP 77 (110)
T ss_dssp TTT-BEEEEETTTTEEEEEB--TTHHH-----------HHHHHHHHTT-S-EEEEEEEEHHHH--SS--SEEEE-TTTGG
T ss_pred CccceEEEEeccCCEEEEEeecccchh-----------hhcccchhcceEEeeccccccccccccCCCcceEEECcccCc
Confidence 345567665 3578999999877765 677789999998876665554433 345566655443222
Q ss_pred cchhcccccccccccCCCCceeeeeccccccCCCC
Q psy11546 1231 SSAATKSALLFQKTKCSQKQVVAVNCSSLECGIRP 1265 (1634)
Q Consensus 1231 ~s~s~c~~l~~~~~~C~s~~vV~V~C~~~~CGirp 1265 (1634)
. ..........-.|+++....+.|+ +||.|.
T Consensus 78 ~--a~~L~~v~~vCdCp~G~fLat~CQ--DCGRRk 108 (110)
T PF09272_consen 78 Y--ARRLLEVLSVCDCPRGRFLATICQ--DCGRRK 108 (110)
T ss_dssp G---S-GGGGEEEE--TT-EEEEEEE-----S---
T ss_pred H--hHhhhhheeeeeCCCCceeehhhh--hhCccc
Confidence 1 112223344568999999999998 799874
No 63
>KOG3509|consensus
Probab=89.57 E-value=0.32 Score=65.07 Aligned_cols=93 Identities=28% Similarity=0.475 Sum_probs=58.7
Q ss_pred eecCCcccCCCCCCCCCCCCCCCCCCCccc----------CccccccccCCCCCCCCCCCCCCC------CCCCCCceec
Q psy11546 1006 CVSIGSKCNQLVDCADGSDEKNCSCADFLK----------SQFLTRKICDGIIDCWDFSDEYEC------EWCSPGQYIC 1069 (1634)
Q Consensus 1006 CIp~~~~CDG~~DC~DgSDE~~C~C~~~~~----------~~~~~~~~CDG~~DC~D~SDE~~C------~~C~~~~F~C 1069 (1634)
|......|++..||.+.+|+.++.+..... .+...-++||...+|.+++++.+| ..|.+..+.|
T Consensus 3 c~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~~c 82 (964)
T KOG3509|consen 3 CVKNRYACDRQPDCRDRSDVANDPAIGSACSPNEFKCNNPRCVQPEALLDADSTCGPNSTPSGCNAKPSASDCKPTETQC 82 (964)
T ss_pred hhhhhhhhccchhhHhhcccCCCccccccCCcchhccCCccccCchhhhccccccCCCCCcCCccccccccccCCccccc
Confidence 444455566666666666666554321110 112233556666666666655555 2377788888
Q ss_pred CCCCeeeccccccCCCccCCCCCCccccc
Q psy11546 1070 PNSRVCIERTRLCDGIKDCPLGDDEKQCI 1098 (1634)
Q Consensus 1070 ~~~~~CI~~~~~CDG~~DC~DGSde~~c~ 1098 (1634)
.+--+|.+....|||.+||.|+++|..|.
T Consensus 83 ~~~~~~~~~~~~~~g~~~~~~~~~~~~~~ 111 (964)
T KOG3509|consen 83 RDRLRCNPQSFQCDGTNDCKDGSDEVGCK 111 (964)
T ss_pred ccchhcCCccccccCCCCCCccchhcccc
Confidence 76667888888999999999999888764
No 64
>KOG2397|consensus
Probab=89.35 E-value=0.25 Score=61.23 Aligned_cols=63 Identities=38% Similarity=0.734 Sum_probs=48.6
Q ss_pred CCceecCCC-ceecCCcccCCCCCCCCCCCCCC---CC-----CCC--cccCccccccccCCCCCCCCCCCCCC
Q psy11546 996 NSSFRCGNG-ECVSIGSKCNQLVDCADGSDEKN---CS-----CAD--FLKSQFLTRKICDGIIDCWDFSDEYE 1058 (1634)
Q Consensus 996 ~~~F~C~nG-~CIp~~~~CDG~~DC~DgSDE~~---C~-----C~~--~~~~~~~~~~~CDG~~DC~D~SDE~~ 1058 (1634)
...|.|.+| +=|+..+.=|..-||.|||||.+ |. |.. .....++...+=||+.||.|+|||..
T Consensus 42 s~~~~CLdgs~~i~f~qlNDd~CDC~DGsDEPGtsACpngkF~C~N~G~~p~~i~ssrV~DGICDCCDgSDE~~ 115 (480)
T KOG2397|consen 42 SSMFKCLDGSKTISFSQLNDDSCDCLDGSDEPGTSACPNGKFYCVNQGHQPKYIPSSRVNDGICDCCDGSDEYL 115 (480)
T ss_pred ccceeeccCCcccCHHHhccccccCCCCCCCCccccCCCCceeeeecCCCceeeechhccCcccccccCCCCcc
Confidence 357999877 78899999999999999999964 43 211 11223455678899999999999986
No 65
>PF02395 Peptidase_S6: Immunoglobulin A1 protease Serine protease Prosite pattern; InterPro: IPR000710 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This group of serine peptidases belong to the MEROPS peptidase family S6 (clan PA(S)). The type sample being the IgA1-specific serine endopeptidase from Neisseria gonorrhoeae []. These cleave prolyl bonds in the hinge regions of immunoglobulin A heavy chains. Similar specificity is shown by the unrelated family of M26 metalloendopeptidases.; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis; PDB: 3SZE_A 3H09_B 3SYJ_A 1WXR_A 3AK5_B.
Probab=59.73 E-value=6.2 Score=52.92 Aligned_cols=32 Identities=41% Similarity=0.794 Sum_probs=22.9
Q ss_pred CCCCCCCCeEEEec-CCCcEEEEEEEEcCCCCC
Q psy11546 1467 ACLGDSGGPLMCQE-PDGRWSLMGVTSNGYGCA 1498 (1634)
Q Consensus 1467 aC~GDSGGPLVc~~-~~GrW~LvGIvS~G~gCg 1498 (1634)
.=.||||+||+.-+ ..++|+|+|+++.+.+..
T Consensus 213 ~~~GDSGSPlF~YD~~~kKWvl~Gv~~~~~~~~ 245 (769)
T PF02395_consen 213 GSPGDSGSPLFAYDKEKKKWVLVGVLSGGNGYN 245 (769)
T ss_dssp --TT-TT-EEEEEETTTTEEEEEEEEEEECCCC
T ss_pred cccCcCCCceEEEEccCCeEEEEEEEccccccC
Confidence 45799999998754 467999999999886653
No 66
>PLN03160 uncharacterized protein; Provisional
Probab=52.72 E-value=20 Score=41.38 Aligned_cols=16 Identities=19% Similarity=0.002 Sum_probs=13.5
Q ss_pred cccEEEEEEEEEecCC
Q psy11546 69 GVELIFDSSFRVTAGD 84 (1634)
Q Consensus 69 ~k~~yY~GSFrItn~~ 84 (1634)
+.+.+..-+++|++-+
T Consensus 65 k~P~~~v~~v~l~~~~ 80 (219)
T PLN03160 65 KDPVIKMNGVTVTKLE 80 (219)
T ss_pred cCCeEEEEEEEEeeee
Confidence 7788999999998866
No 67
>PF07423 DUF1510: Protein of unknown function (DUF1510); InterPro: IPR009988 This family consists of several hypothetical bacterial proteins of around 200 residues in length. The function of this family is unknown.
Probab=48.03 E-value=9.8 Score=43.82 Aligned_cols=28 Identities=21% Similarity=0.356 Sum_probs=15.5
Q ss_pred cccccCCCCCccchHHHHHHHHHHHHHH
Q psy11546 32 KSQKSKPSQKCCSVLLASVIFLVLLLVI 59 (1634)
Q Consensus 32 k~t~~~~~r~~~~w~i~li~~lvllvLa 59 (1634)
||..+.++|+...+|=++|.+++|||++
T Consensus 2 Rf~~r~KrRK~N~iLNiaI~IV~lLIii 29 (217)
T PF07423_consen 2 RFQQRQKRRKTNKILNIAIGIVSLLIII 29 (217)
T ss_pred chhHHHHhhhhhhhHHHHHHHHHHHHHH
Confidence 5666656666666655555544444333
No 68
>PF14654 Epiglycanin_C: Mucin, catalytic, TM and cytoplasmic tail region
Probab=38.27 E-value=28 Score=35.31 Aligned_cols=33 Identities=15% Similarity=0.081 Sum_probs=25.0
Q ss_pred CCCCccchHHHHHHHHHHHH-HHHHHhhheeccc
Q psy11546 38 PSQKCCSVLLASVIFLVLLL-VITIAALAIYMGV 70 (1634)
Q Consensus 38 ~~r~~~~w~i~li~~lvllv-LaviigLlvyF~k 70 (1634)
.+-++.+|-|+||.++.|++ +++.+||.++.++
T Consensus 11 PsGsL~PWeIfLItLasVvvavGl~aGLfFcvR~ 44 (106)
T PF14654_consen 11 PSGSLKPWEIFLITLASVVVAVGLFAGLFFCVRN 44 (106)
T ss_pred cCCCccchHHHHHHHHHHHHHHHHHHHHHHHhhh
Confidence 44468899999998886554 7888888887654
No 69
>PF12273 RCR: Chitin synthesis regulation, resistance to Congo red; InterPro: IPR020999 RCR proteins are ER membrane proteins that regulate chitin deposition in fungal cell walls. Although chitin, a linear polymer of beta-1,4-linked N-acetylglucosamine, constitutes only 2% of the cell wall it plays a vital role in the overall protection of the cell wall against stress, noxious chemicals and osmotic pressure changes. Congo red is a cell wall-disrupting benzidine-type dye extensively used in many cell wall mutant studies that specifically targets chitin in yeast cells and inhibits growth. RCR proteins render the yeasts resistant to Congo red by diminishing the content of chitin in the cell wall []. RCR proteins are probably regulating chitin synthase III interact directly with ubiquitin ligase Rsp5, and the VPEY motif is necessary for this, via interaction with the WW domains of Rsp5 [].
Probab=34.17 E-value=27 Score=36.95 Aligned_cols=13 Identities=23% Similarity=0.478 Sum_probs=6.4
Q ss_pred chHHHHHHHHHHH
Q psy11546 44 SVLLASVIFLVLL 56 (1634)
Q Consensus 44 ~w~i~li~~lvll 56 (1634)
+|+|++|||++++
T Consensus 1 RW~l~~iii~~i~ 13 (130)
T PF12273_consen 1 RWVLFAIIIVAIL 13 (130)
T ss_pred CeeeHHHHHHHHH
Confidence 3665555444443
No 70
>PF11293 DUF3094: Protein of unknown function (DUF3094); InterPro: IPR021444 This family of proteins with unknown function appears to be restricted to Gammaproteobacteria.
Probab=30.66 E-value=54 Score=30.16 Aligned_cols=28 Identities=14% Similarity=0.069 Sum_probs=22.6
Q ss_pred CCccchHHHHHHHHHHHHHHHHHhhhee
Q psy11546 40 QKCCSVLLASVIFLVLLLVITIAALAIY 67 (1634)
Q Consensus 40 r~~~~w~i~li~~lvllvLaviigLlvy 67 (1634)
+..++|++.++++++++.|+++..|+-+
T Consensus 26 ~PFrP~~Ll~~li~Vv~gl~llS~ll~~ 53 (55)
T PF11293_consen 26 KPFRPWRLLIVLIVVVIGLGLLSRLLSR 53 (55)
T ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3478999999999999988888877643
No 71
>PF00548 Peptidase_C3: 3C cysteine protease (picornain 3C); InterPro: IPR000199 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This signature defines cysteine peptidases belong to MEROPS peptidase family C3 (picornain, clan PA(C)), subfamilies C3A and C3B. The protein fold of this peptidase domain for members of this family resembles that of the serine peptidase, chymotrypsin [], the type example for clan PA. Picornaviral proteins are expressed as a single polyprotein which is cleaved by the viral C3 cysteine protease. The poliovirus polyprotein is selectively cleaved between the Gln-|-Gly bond. In other picornavirus reactions Glu may be substituted for Gln, and Ser or Thr for Gly. ; GO: 0004197 cysteine-type endopeptidase activity, 0006508 proteolysis; PDB: 3SJO_E 2H6M_A 1QA7_C 1HAV_B 2HAL_A 2H9H_A 3QZQ_B 3QZR_A 3R0F_B 3SJ9_A ....
Probab=29.09 E-value=51 Score=36.63 Aligned_cols=28 Identities=25% Similarity=0.491 Sum_probs=21.5
Q ss_pred CCCCCCCCeEEEecCCCcEEEEEEEEcCC
Q psy11546 1467 ACLGDSGGPLMCQEPDGRWSLMGVTSNGY 1495 (1634)
Q Consensus 1467 aC~GDSGGPLVc~~~~GrW~LvGIvS~G~ 1495 (1634)
+-.|+-||||+... .+...++||-..|.
T Consensus 144 t~~G~CG~~l~~~~-~~~~~i~GiHvaG~ 171 (172)
T PF00548_consen 144 TKPGMCGSPLVSRI-GGQGKIIGIHVAGN 171 (172)
T ss_dssp EETTGTTEEEEESC-GGTTEEEEEEEEEE
T ss_pred CCCCccCCeEEEee-ccCccEEEEEeccC
Confidence 34689999999865 45679999987664
No 72
>PF11770 GAPT: GRB2-binding adapter (GAPT); InterPro: IPR021082 This entry represents a family of transmembrane proteins which bind the growth factor receptor-bound protein 2 (GRB2) in B cells []. In contrast to other transmembrane adaptor proteins, GAPT, which this entry represents, is not phosphorylated upon BCR ligation. It associates with GRB2 constitutively through its proline-rich region [].
Probab=26.89 E-value=46 Score=36.34 Aligned_cols=27 Identities=26% Similarity=0.150 Sum_probs=20.4
Q ss_pred chHHHHHHHHHHHHHHHHHhhheeccc
Q psy11546 44 SVLLASVIFLVLLLVITIAALAIYMGV 70 (1634)
Q Consensus 44 ~w~i~li~~lvllvLaviigLlvyF~k 70 (1634)
...+++.++++||+|+..||++||.+.
T Consensus 9 sv~i~igi~Ll~lLl~cgiGcvwhwkh 35 (158)
T PF11770_consen 9 SVAISIGISLLLLLLLCGIGCVWHWKH 35 (158)
T ss_pred hHHHHHHHHHHHHHHHHhcceEEEeec
Confidence 456666677777778889999999844
No 73
>PRK11901 hypothetical protein; Reviewed
Probab=25.64 E-value=78 Score=38.66 Aligned_cols=42 Identities=24% Similarity=0.315 Sum_probs=25.4
Q ss_pred CCCCccccccCCCCCcc------chHHHHHHHHHHHHHHHHHhhheec
Q psy11546 27 PNSSKKSQKSKPSQKCC------SVLLASVIFLVLLLVITIAALAIYM 68 (1634)
Q Consensus 27 ~~s~~k~t~~~~~r~~~------~w~i~li~~lvllvLaviigLlvyF 68 (1634)
+-||+|-+|+++..... --|.+-|=++|||+|++.||-++.-
T Consensus 13 DtSDRrp~Rsr~~~~~pk~~vSRQh~MiGiGilVLlLLIi~IgSALks 60 (327)
T PRK11901 13 DTSDRRPTRSRKSSNGPKLAVSRQHMMIGIGILVLLLLIIAIGSALKS 60 (327)
T ss_pred CcccCCCcccccCCCCCCchHHHHHHHHHHHHHHHHHHHHHHhhhccC
Confidence 44788877776554322 2244455555666677778877654
No 74
>PF14155 DUF4307: Domain of unknown function (DUF4307)
Probab=25.60 E-value=81 Score=32.86 Aligned_cols=42 Identities=14% Similarity=0.017 Sum_probs=24.5
Q ss_pred cchHHHHHHHHHHHHHHHHHhhhee-c--cccEEEEEEEEEecCC
Q psy11546 43 CSVLLASVIFLVLLLVITIAALAIY-M--GVELIFDSSFRVTAGD 84 (1634)
Q Consensus 43 ~~w~i~li~~lvllvLaviigLlvy-F--~k~~yY~GSFrItn~~ 84 (1634)
+.|+++++++++++++++++|+.+. + ..-..=...|++.+..
T Consensus 3 rr~~~~~~~v~~vv~~~~~~w~~~~~~~~~~v~~~~~gf~vv~d~ 47 (112)
T PF14155_consen 3 RRKLVIAGAVLVVVAGAVVAWFGYSQFGSPPVSAEVIGFEVVDDS 47 (112)
T ss_pred cceeEehHHHHHHHHHHHHhHhhhhhccCCCceEEEEEEEECCCC
Confidence 3455566666666667777777765 4 2223344566666554
No 75
>PF06024 DUF912: Nucleopolyhedrovirus protein of unknown function (DUF912); InterPro: IPR009261 This entry is represented by Autographa californica nuclear polyhedrosis virus (AcMNPV), Orf78; it is a family of uncharacterised viral proteins.
Probab=24.92 E-value=52 Score=33.64 Aligned_cols=27 Identities=11% Similarity=0.151 Sum_probs=17.9
Q ss_pred ccchHHHHHHHHHHHHHHHHHhhheec
Q psy11546 42 CCSVLLASVIFLVLLLVITIAALAIYM 68 (1634)
Q Consensus 42 ~~~w~i~li~~lvllvLaviigLlvyF 68 (1634)
...+.++++.+++++++.|++.+++||
T Consensus 58 ~~~~~iili~lls~v~IlVily~IyYF 84 (101)
T PF06024_consen 58 QNNGNIILISLLSFVCILVILYAIYYF 84 (101)
T ss_pred cccccchHHHHHHHHHHHHHHhhheEE
Confidence 334555566666666677777778888
No 76
>PF00558 Vpu: Vpu protein; InterPro: IPR008187 The Human immunodeficiency virus 1 (HIV-1) Vpu protein acts in the degradation of CD4 in the endoplasmic reticulum and in the enhancement of virion release from the plasma membrane of infected cells [].; GO: 0019076 release of virus from host; PDB: 2JPX_A 1PI8_A 2GOH_A 2GOF_A 1PI7_A 1PJE_A 1VPU_A 2K7Y_A.
Probab=24.85 E-value=58 Score=32.33 Aligned_cols=24 Identities=21% Similarity=0.476 Sum_probs=14.9
Q ss_pred hHHHHHHHHHHHHHHHHHhhheec
Q psy11546 45 VLLASVIFLVLLLVITIAALAIYM 68 (1634)
Q Consensus 45 w~i~li~~lvllvLaviigLlvyF 68 (1634)
++++++.+++++++|+++|.++|-
T Consensus 5 ~i~~iialiv~~iiaIvvW~iv~i 28 (81)
T PF00558_consen 5 EILAIIALIVALIIAIVVWTIVYI 28 (81)
T ss_dssp ---HHHHHHHHHHHHHHHHHHH--
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 456666677777788888888775
No 77
>PF01102 Glycophorin_A: Glycophorin A; InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Glycophorin A (PAS-2) and glycophorin B (PAS-3) belong to the MNS blood group system and are associated with antigens that include M/N, S/s, U, He, Mi(a), M(c), Vw, Mur, M(g), Vr, M(e), Mt(a), St(a), Ri(a), Cl(a), Ny(a), Hut, Hil, M(v), Far, Mit, Dantu, Hop, Nob, En(a), ENKT, amongst others. Glycophorin A is the major sialoglycoprotein of the erythrocyte membrane []. Structurally, glycophorin A consists of an N-terminal extracellular domain, heavily glycosylated on serine and threonine residues, followed by a transmembrane region and a C-terminal cytoplasmic domain. Other glycophorins in this entry such as Glycophorin B and Glycophorin E represent minor sialoglycoproteins in the erythrocyte membrane.; GO: 0016021 integral to membrane; PDB: 2KPF_B 1AFO_B 2KPE_A.
Probab=23.40 E-value=70 Score=34.05 Aligned_cols=30 Identities=27% Similarity=0.320 Sum_probs=21.4
Q ss_pred CCCccchHHHHHHHHHHHHHHHHHhhheec
Q psy11546 39 SQKCCSVLLASVIFLVLLLVITIAALAIYM 68 (1634)
Q Consensus 39 ~r~~~~w~i~li~~lvllvLaviigLlvyF 68 (1634)
......=.|++|++.|++.++++|.|++|+
T Consensus 58 ~h~fs~~~i~~Ii~gv~aGvIg~Illi~y~ 87 (122)
T PF01102_consen 58 VHRFSEPAIIGIIFGVMAGVIGIILLISYC 87 (122)
T ss_dssp SSSSS-TCHHHHHHHHHHHHHHHHHHHHHH
T ss_pred ccCccccceeehhHHHHHHHHHHHHHHHHH
Confidence 334444467888888888888888888888
No 78
>PRK10697 DNA-binding transcriptional activator PspC; Provisional
Probab=22.89 E-value=73 Score=33.72 Aligned_cols=25 Identities=20% Similarity=0.290 Sum_probs=13.3
Q ss_pred HhHHHHHHHHHHHHHHHhhcC---Cccccc
Q psy11546 96 NLYKEKSKRYKSMIEKLYNAS---VLSPAI 122 (1634)
Q Consensus 96 ~~Fk~La~~IE~mV~~iF~~S---~L~~~Y 122 (1634)
.+|+.+.+++++| +.|-.| .|+++|
T Consensus 88 ~~~~~~e~Rlr~m--E~yVTS~~f~l~ref 115 (118)
T PRK10697 88 RELAAGEQRLREM--ERYVTSDTFTLRSRF 115 (118)
T ss_pred HHHHHHHHHHHHH--HHHhcCCCcCHHHHH
Confidence 4455555666665 335555 445544
No 79
>PF00863 Peptidase_C4: Peptidase family C4 This family belongs to family C4 of the peptidase classification.; InterPro: IPR001730 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. Nuclear inclusion A (NIA) proteases from potyviruses are cysteine peptidases belong to the MEROPS peptidase family C4 (NIa protease family, clan PA(C)) [, ]. Potyviruses include plant viruses in which the single-stranded RNA encodes a polyprotein with NIA protease activity, where proteolytic cleavage is specific for Gln+Gly sites. The NIA protease acts on the polyprotein, releasing itself by Gln+Gly cleavage at both the N- and C-termini. It further processes the polyprotein by cleavage at five similar sites in the C-terminal half of the sequence. In addition to its C-terminal protease activity, the NIA protease contains an N-terminal domain that has been implicated in the transcription process []. This peptidase is present in the nuclear inclusion protein of potyviruses.; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 3MMG_B 1Q31_B 1LVB_A 1LVM_A.
Probab=21.95 E-value=4.8e+02 Score=30.92 Aligned_cols=153 Identities=18% Similarity=0.300 Sum_probs=64.3
Q ss_pred eeCCCccccccccccCCCCCceEEE--EeEEecCCCCCCCCceeEeeEEEEeCCCCCCCCCcCceEEEEeCCCCCCCcee
Q psy11546 1314 LISDQWLLSAGHCFYRAQDDYWVAR--LGTLRRGTKLPSPYEQLRPISKIILHPQYVDAGFINDISILKMKTPFSNYVRP 1391 (1634)
Q Consensus 1314 LIS~rWVLTAAHCV~~~~~~~~~Vr--LG~~~~~s~~~s~~~Q~~~V~~IiiHP~Yn~~t~~nDIALLkL~~PfSd~VqP 1391 (1634)
|.-..||||-+|-+...... +.+. .|.+.... .. ++.+||- ...||.||+|...|...-+-
T Consensus 37 igyG~~iItn~HLf~~nng~-L~i~s~hG~f~v~n------t~-----~lkv~~i-----~~~DiviirmPkDfpPf~~k 99 (235)
T PF00863_consen 37 IGYGSYIITNAHLFKRNNGE-LTIKSQHGEFTVPN------TT-----QLKVHPI-----EGRDIVIIRMPKDFPPFPQK 99 (235)
T ss_dssp EEETTEEEEEGGGGSSTTCE-EEEEETTEEEEECE------GG-----GSEEEE------TCSSEEEEE--TTS----S-
T ss_pred EeECCEEEEChhhhccCCCe-EEEEeCceEEEcCC------cc-----ccceEEe-----CCccEEEEeCCcccCCcchh
Confidence 44567999999998664432 3332 23332221 11 1222221 24799999999886655454
Q ss_pred eecCCCCCCCCCCCeEEEEEecccccCCCcCCCCcEEEEEeeeChhHHhhhccCCCcccCCCCeEEEeecCCCCCCCCCC
Q psy11546 1392 ICLPHPNTPLTDGTLCTVVGWGQLFEIGRVFPDTLQEVQLPIISTAECRKRTLFLPLYRVTENMFCAGFERGGRDACLGD 1471 (1634)
Q Consensus 1392 ICLP~~~~~l~~G~~c~VvGWG~t~e~g~~~s~~Lq~v~VpIIs~eeC~~~~~~~~~~~It~sm~CAG~~~gg~daC~GD 1471 (1634)
+++- .+..+..+.++|--...... .....+... +.+... ..|=. -...+=.||
T Consensus 100 l~FR----~P~~~e~v~mVg~~fq~k~~---~s~vSesS~-i~p~~~---------------~~fWk----HwIsTk~G~ 152 (235)
T PF00863_consen 100 LKFR----APKEGERVCMVGSNFQEKSI---SSTVSESSW-IYPEEN---------------SHFWK----HWISTKDGD 152 (235)
T ss_dssp --B--------TT-EEEEEEEECSSCCC---EEEEEEEEE-EEEETT---------------TTEEE----E-C---TT-
T ss_pred hhcc----CCCCCCEEEEEEEEEEcCCe---eEEECCceE-EeecCC---------------CCeeE----EEecCCCCc
Confidence 4442 23345667777764321111 111111110 011000 00110 112334688
Q ss_pred CCCeEEEecCCCcEEEEEEEEcCCCCCCCCCCeEEEEcCC-chHHH
Q psy11546 1472 SGGPLMCQEPDGRWSLMGVTSNGYGCARANRPGVYTKVSN-YIPWL 1516 (1634)
Q Consensus 1472 SGGPLVc~~~~GrW~LvGIvS~G~gCg~~~~PgVYTrVs~-YlDWI 1516 (1634)
-|.|||... +| .++||-|.+..-. .-.+|+.+.. +.+-+
T Consensus 153 CG~PlVs~~-Dg--~IVGiHsl~~~~~---~~N~F~~f~~~f~~~~ 192 (235)
T PF00863_consen 153 CGLPLVSTK-DG--KIVGIHSLTSNTS---SRNYFTPFPDDFEEFY 192 (235)
T ss_dssp TT-EEEETT-T----EEEEEEEEETTT---SSEEEEE--TTHHHHH
T ss_pred cCCcEEEcC-CC--cEEEEEcCccCCC---CeEEEEcCCHHHHHHH
Confidence 899999866 33 8999999765332 2347777754 44433
Done!