Query         psy11546
Match_columns 1634
No_of_seqs    825 out of 3778
Neff          5.1 
Searched_HMMs 46136
Date          Fri Aug 16 21:16:02 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy11546.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/11546hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 cd00190 Tryp_SPc Trypsin-like  100.0 3.9E-36 8.4E-41  326.7  23.9  229 1285-1519    1-232 (232)
  2 KOG3627|consensus              100.0 9.6E-36 2.1E-40  334.2  24.7  236 1282-1521   10-255 (256)
  3 smart00020 Tryp_SPc Trypsin-li 100.0 1.5E-33 3.2E-38  308.0  23.3  226 1284-1516    1-229 (229)
  4 PF00089 Trypsin:  Trypsin;  In 100.0 1.6E-31 3.6E-36  288.7  21.7  217 1285-1516    1-220 (220)
  5 COG5640 Secreted trypsin-like   99.9 2.5E-25 5.5E-30  253.4  16.4  242 1281-1531   29-289 (413)
  6 PF09342 DUF1986:  Domain of un  99.5 1.9E-13 4.1E-18  151.3  13.1  216 1292-1520   12-265 (267)
  7 PF01390 SEA:  SEA domain;  Int  99.5 1.8E-13 3.8E-18  135.7  10.8  104   70-183     2-107 (107)
  8 PF03761 DUF316:  Domain of unk  99.4 5.9E-12 1.3E-16  145.3  16.3  209 1283-1519   40-278 (282)
  9 smart00200 SEA Domain found in  99.2 1.3E-10 2.7E-15  119.5   9.9   92   72-170     3-99  (121)
 10 smart00202 SR Scavenger recept  98.7 2.7E-08 5.8E-13   98.5   7.0   90 1157-1257   10-101 (101)
 11 PF00057 Ldl_recept_a:  Low-den  98.7   1E-08 2.2E-13   84.6   3.0   36  992-1028    2-37  (37)
 12 cd00112 LDLa Low Density Lipop  98.7   9E-09 1.9E-13   83.9   2.6   35  993-1028    1-35  (35)
 13 cd07888 CRD_corin_2 One of two  98.6 2.2E-08 4.8E-13  103.1   2.3  103  855-978     1-110 (122)
 14 cd07441 CRD_SFRP3 Cysteine-ric  98.6 2.5E-08 5.5E-13  103.0   2.2   63  854-932     2-64  (126)
 15 cd07454 CRD_LIN_17 Cysteine-ri  98.5 5.1E-08 1.1E-12  100.7   4.2   65  853-933     2-66  (124)
 16 cd07462 CRD_FZ10 Cysteine-rich  98.5 3.5E-08 7.6E-13  102.1   2.9  106  853-979     2-112 (127)
 17 cd07442 CRD_SFRP4 Cysteine-ric  98.5 3.3E-08 7.1E-13  102.3   2.4   62  855-932     4-65  (127)
 18 cd07456 CRD_FZ5_like Cysteine-  98.5 5.4E-08 1.2E-12  100.1   3.7   62  856-933     2-63  (120)
 19 cd07445 CRD_corin_1 One of two  98.5 3.6E-08 7.9E-13  102.0   2.4  108  852-980     1-114 (130)
 20 cd07466 CRD_FZ7 Cysteine-rich   98.5 7.2E-08 1.6E-12   99.7   4.1   65  853-933     2-66  (125)
 21 cd07464 CRD_FZ2 Cysteine-rich   98.5 7.4E-08 1.6E-12   99.8   3.4   65  853-933     2-66  (127)
 22 cd07460 CRD_FZ5 Cysteine-rich   98.5 9.5E-08 2.1E-12   99.0   4.0  103  855-978     4-111 (127)
 23 KOG1215|consensus               98.5 1.3E-07 2.8E-12  125.7   5.7   38  991-1029  136-175 (877)
 24 cd07450 CRD_FZ6 Cysteine-rich   98.5 5.3E-08 1.1E-12  100.8   1.5  101  855-977     4-109 (127)
 25 cd07447 CRD_Carboxypeptidase_Z  98.4 7.3E-08 1.6E-12  100.0   2.3  107  854-978     2-113 (128)
 26 cd07465 CRD_FZ1 Cysteine-rich   98.4 9.9E-08 2.2E-12   98.9   3.2   65  853-933     2-66  (127)
 27 PF00530 SRCR:  Scavenger recep  98.4 2.6E-07 5.7E-12   90.2   5.8   90 1157-1257    7-99  (99)
 28 cd07463 CRD_FZ9 Cysteine-rich   98.4 1.5E-07 3.3E-12   97.6   4.2  103  854-978     3-111 (127)
 29 cd07448 CRD_FZ4 Cysteine-rich   98.4 1.5E-07 3.3E-12   97.5   3.8   81  854-955     2-82  (126)
 30 PF00057 Ldl_recept_a:  Low-den  98.4 1.9E-07 4.2E-12   77.1   3.5   36 1061-1097    2-37  (37)
 31 cd07461 CRD_FZ8 Cysteine-rich   98.4 1.8E-07 3.9E-12   96.9   4.1  104  854-978     3-111 (125)
 32 cd07449 CRD_FZ3 Cysteine-rich   98.4 8.5E-08 1.8E-12   99.2   1.6  102  855-978     4-110 (127)
 33 cd07457 CRD_FZ9_like Cysteine-  98.4   2E-07 4.3E-12   96.1   3.9  103  854-977     1-108 (121)
 34 cd07458 CRD_FZ1_like Cysteine-  98.4 1.9E-07 4.1E-12   96.0   3.4   64  854-933     1-64  (119)
 35 cd00112 LDLa Low Density Lipop  98.4 1.8E-07 3.9E-12   76.3   2.5   35 1062-1097    1-35  (35)
 36 smart00063 FRI Frizzled. Droso  98.3   3E-07 6.6E-12   93.7   3.4   78  856-955     1-78  (113)
 37 cd07452 CRD_sizzled Cysteine-r  98.3 3.4E-07 7.4E-12   96.3   3.3   81  851-955     4-86  (141)
 38 smart00192 LDLa Low-density li  98.3 4.1E-07 8.8E-12   73.3   3.0   32  993-1025    2-33  (33)
 39 cd07444 CRD_SFRP5 Cysteine-ric  98.3 2.7E-07   6E-12   95.6   2.2  101  853-977     4-111 (127)
 40 cd07446 CRD_SFRP2 Cysteine-ric  98.2 8.7E-07 1.9E-11   92.1   4.2   81  854-955     3-85  (128)
 41 cd07443 CRD_SFRP1 Cysteine-ric  98.2 5.5E-07 1.2E-11   93.0   2.5   79  853-955     4-84  (124)
 42 KOG1215|consensus               98.2 1.2E-06 2.6E-11  116.5   5.8   92 1541-1634  141-294 (877)
 43 cd07455 CRD_Collagen_XVIII Cys  98.2 1.1E-06 2.4E-11   90.6   3.3   63  854-932     3-67  (123)
 44 PF12999 PRKCSH-like:  Glucosid  98.1 1.5E-06 3.2E-11   94.2   3.9   68  996-1094   35-110 (176)
 45 PF15494 SRCR_2:  Scavenger rec  98.1 6.8E-06 1.5E-10   81.7   7.0   89 1160-1265    5-98  (98)
 46 cd07453 CRD_crescent Cysteine-  98.1 1.3E-06 2.9E-11   91.3   2.0   63  855-933     2-66  (135)
 47 COG3591 V8-like Glu-specific e  98.0 4.8E-05 1.1E-09   87.1  11.8  197 1292-1521   45-251 (251)
 48 smart00192 LDLa Low-density li  97.8 1.1E-05 2.3E-10   65.2   2.8   32 1062-1094    2-33  (33)
 49 cd07451 CRD_SMO Cysteine-rich   97.7 6.6E-06 1.4E-10   85.9   0.2   65  854-933     3-69  (132)
 50 KOG3577|consensus               97.6 7.1E-05 1.5E-09   95.0   6.9  152  850-1031   37-198 (556)
 51 cd07066 CRD_FZ CRD_domain cyst  97.6 3.7E-05   8E-10   78.6   3.4   80  855-955     1-80  (119)
 52 PF12999 PRKCSH-like:  Glucosid  97.2 0.00024 5.2E-09   77.5   3.6   61 1541-1601   36-111 (176)
 53 PF01392 Fz:  Fz domain;  Inter  97.0 0.00014 3.1E-09   73.2  -0.4   62  856-933     1-62  (116)
 54 KOG2397|consensus               96.1   0.005 1.1E-07   75.5   4.1   52 1045-1096   61-115 (480)
 55 TIGR02037 degP_htrA_DO peripla  95.6    0.22 4.7E-06   62.1  15.8  170 1307-1522   57-228 (428)
 56 TIGR02038 protease_degS peripl  95.1     0.5 1.1E-05   57.6  16.4  168 1308-1522   78-249 (351)
 57 PF13365 Trypsin_2:  Trypsin-li  94.6   0.059 1.3E-06   53.6   5.8   22 1310-1331    1-23  (120)
 58 PRK10898 serine endoprotease;   94.5    0.86 1.9E-05   55.7  16.5   82 1308-1413   78-160 (353)
 59 PRK10139 serine endoprotease;   93.0     1.3 2.9E-05   55.9  14.5  169 1308-1522   90-261 (455)
 60 KOG3509|consensus               92.2     0.1 2.2E-06   69.6   3.5   86 1549-1634    2-110 (964)
 61 PRK10942 serine endoprotease;   91.3       2 4.3E-05   54.6  13.4   83 1308-1413  111-195 (473)
 62 PF09272 Hepsin-SRCR:  Hepsin,   90.9    0.31 6.8E-06   49.2   4.6   95 1156-1265    9-108 (110)
 63 KOG3509|consensus               89.6    0.32   7E-06   65.1   4.5   93 1006-1098    3-111 (964)
 64 KOG2397|consensus               89.4    0.25 5.4E-06   61.2   3.0   63  996-1058   42-115 (480)
 65 PF02395 Peptidase_S6:  Immunog  59.7     6.2 0.00013   52.9   2.8   32 1467-1498  213-245 (769)
 66 PLN03160 uncharacterized prote  52.7      20 0.00043   41.4   5.1   16   69-84     65-80  (219)
 67 PF07423 DUF1510:  Protein of u  48.0     9.8 0.00021   43.8   1.7   28   32-59      2-29  (217)
 68 PF14654 Epiglycanin_C:  Mucin,  38.3      28 0.00062   35.3   2.9   33   38-70     11-44  (106)
 69 PF12273 RCR:  Chitin synthesis  34.2      27 0.00058   36.9   2.2   13   44-56      1-13  (130)
 70 PF11293 DUF3094:  Protein of u  30.7      54  0.0012   30.2   3.1   28   40-67     26-53  (55)
 71 PF00548 Peptidase_C3:  3C cyst  29.1      51  0.0011   36.6   3.5   28 1467-1495  144-171 (172)
 72 PF11770 GAPT:  GRB2-binding ad  26.9      46   0.001   36.3   2.4   27   44-70      9-35  (158)
 73 PRK11901 hypothetical protein;  25.6      78  0.0017   38.7   4.3   42   27-68     13-60  (327)
 74 PF14155 DUF4307:  Domain of un  25.6      81  0.0018   32.9   3.9   42   43-84      3-47  (112)
 75 PF06024 DUF912:  Nucleopolyhed  24.9      52  0.0011   33.6   2.4   27   42-68     58-84  (101)
 76 PF00558 Vpu:  Vpu protein;  In  24.8      58  0.0013   32.3   2.5   24   45-68      5-28  (81)
 77 PF01102 Glycophorin_A:  Glycop  23.4      70  0.0015   34.0   3.0   30   39-68     58-87  (122)
 78 PRK10697 DNA-binding transcrip  22.9      73  0.0016   33.7   3.0   25   96-122    88-115 (118)
 79 PF00863 Peptidase_C4:  Peptida  21.9 4.8E+02    0.01   30.9   9.5  153 1314-1516   37-192 (235)

No 1  
>cd00190 Tryp_SPc Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. Alignment contains also inactive enzymes that have substitutions of the catalytic triad residues.
Probab=100.00  E-value=3.9e-36  Score=326.70  Aligned_cols=229  Identities=47%  Similarity=0.923  Sum_probs=197.5

Q ss_pred             EecCeecCCCccceEEEEEeC-CeeeeeceeeCCCccccccccccCCCCCceEEEEeEEecCCCCCCCCceeEeeEEEEe
Q psy11546       1285 IVGGGNARLGSWPWQAALYKE-GEFQCGATLISDQWLLSAGHCFYRAQDDYWVARLGTLRRGTKLPSPYEQLRPISKIIL 1363 (1634)
Q Consensus      1285 IvGG~~a~~GewPW~VsL~~~-g~~~CGGTLIS~rWVLTAAHCV~~~~~~~~~VrLG~~~~~s~~~s~~~Q~~~V~~Iii 1363 (1634)
                      |+||..+..++|||+|.|+.. +.++|+|+||+++||||||||+.......+.|++|.......  ....+.+.|+++++
T Consensus         1 i~~G~~~~~~~~Pw~v~i~~~~~~~~C~GtlIs~~~VLTaAhC~~~~~~~~~~v~~g~~~~~~~--~~~~~~~~v~~~~~   78 (232)
T cd00190           1 IVGGSEAKIGSFPWQVSLQYTGGRHFCGGSLISPRWVLTAAHCVYSSAPSNYTVRLGSHDLSSN--EGGGQVIKVKKVIV   78 (232)
T ss_pred             CcCCeECCCCCCCCEEEEEccCCcEEEEEEEeeCCEEEECHHhcCCCCCccEEEEeCcccccCC--CCceEEEEEEEEEE
Confidence            579999999999999999987 789999999999999999999987666678888887665421  23567889999999


Q ss_pred             CCCCCCCCCcCceEEEEeCCC--CCCCceeeecCCCCCCCCCCCeEEEEEecccccCCCcCCCCcEEEEEeeeChhHHhh
Q psy11546       1364 HPQYVDAGFINDISILKMKTP--FSNYVRPICLPHPNTPLTDGTLCTVVGWGQLFEIGRVFPDTLQEVQLPIISTAECRK 1441 (1634)
Q Consensus      1364 HP~Yn~~t~~nDIALLkL~~P--fSd~VqPICLP~~~~~l~~G~~c~VvGWG~t~e~g~~~s~~Lq~v~VpIIs~eeC~~ 1441 (1634)
                      ||.|+.....+|||||+|+++  ++.+++|||||........+..+.++|||..... ...+..++...+.+++...|..
T Consensus        79 hp~y~~~~~~~DiAll~L~~~~~~~~~v~picl~~~~~~~~~~~~~~~~G~g~~~~~-~~~~~~~~~~~~~~~~~~~C~~  157 (232)
T cd00190          79 HPNYNPSTYDNDIALLKLKRPVTLSDNVRPICLPSSGYNLPAGTTCTVSGWGRTSEG-GPLPDVLQEVNVPIVSNAECKR  157 (232)
T ss_pred             CCCCCCCCCcCCEEEEEECCcccCCCcccceECCCccccCCCCCEEEEEeCCcCCCC-CCCCceeeEEEeeeECHHHhhh
Confidence            999999888999999999998  7888999999987656677899999999987554 2346789999999999999998


Q ss_pred             hccCCCcccCCCCeEEEeecCCCCCCCCCCCCCeEEEecCCCcEEEEEEEEcCCCCCCCCCCeEEEEcCCchHHHHHh
Q psy11546       1442 RTLFLPLYRVTENMFCAGFERGGRDACLGDSGGPLMCQEPDGRWSLMGVTSNGYGCARANRPGVYTKVSNYIPWLYNN 1519 (1634)
Q Consensus      1442 ~~~~~~~~~It~sm~CAG~~~gg~daC~GDSGGPLVc~~~~GrW~LvGIvS~G~gCg~~~~PgVYTrVs~YlDWI~~v 1519 (1634)
                      .+..  ...+...++|++......+.|.|||||||++.. +++|+|+||+|+|..|...+.|++||+|+.|++||.++
T Consensus       158 ~~~~--~~~~~~~~~C~~~~~~~~~~c~gdsGgpl~~~~-~~~~~lvGI~s~g~~c~~~~~~~~~t~v~~~~~WI~~~  232 (232)
T cd00190         158 AYSY--GGTITDNMLCAGGLEGGKDACQGDSGGPLVCND-NGRGVLVGIVSWGSGCARPNYPGVYTRVSSYLDWIQKT  232 (232)
T ss_pred             hccC--cccCCCceEeeCCCCCCCccccCCCCCcEEEEe-CCEEEEEEEEehhhccCCCCCCCEEEEcHHhhHHhhcC
Confidence            6542  135788999998765578899999999999987 58999999999999998778999999999999999764


No 2  
>KOG3627|consensus
Probab=100.00  E-value=9.6e-36  Score=334.16  Aligned_cols=236  Identities=45%  Similarity=0.881  Sum_probs=197.5

Q ss_pred             cceEecCeecCCCccceEEEEEeCC--eeeeeceeeCCCccccccccccCC-CCCceEEEEeEEecCCCCCCC-CceeEe
Q psy11546       1282 HSRIVGGGNARLGSWPWQAALYKEG--EFQCGATLISDQWLLSAGHCFYRA-QDDYWVARLGTLRRGTKLPSP-YEQLRP 1357 (1634)
Q Consensus      1282 ~~RIvGG~~a~~GewPW~VsL~~~g--~~~CGGTLIS~rWVLTAAHCV~~~-~~~~~~VrLG~~~~~s~~~s~-~~Q~~~ 1357 (1634)
                      ..||+||.++..++|||+|+|....  .++|||+||+++||||||||+... .. .+.|++|.+......... ..+...
T Consensus        10 ~~~i~~g~~~~~~~~Pw~~~l~~~~~~~~~Cggsli~~~~vltaaHC~~~~~~~-~~~V~~G~~~~~~~~~~~~~~~~~~   88 (256)
T KOG3627|consen   10 EGRIVGGTEAEPGSFPWQVSLQYGGNGRHLCGGSLISPRWVLTAAHCVKGASAS-LYTVRLGEHDINLSVSEGEEQLVGD   88 (256)
T ss_pred             cCCEeCCccCCCCCCCCEEEEEECCCcceeeeeEEeeCCEEEEChhhCCCCCCc-ceEEEECccccccccccCchhhhce
Confidence            5799999999999999999999876  789999999999999999999774 33 778888876544321111 135566


Q ss_pred             eEEEEeCCCCCCCCCc-CceEEEEeCCC--CCCCceeeecCCCCC--CCCCCCeEEEEEecccccCCCcCCCCcEEEEEe
Q psy11546       1358 ISKIILHPQYVDAGFI-NDISILKMKTP--FSNYVRPICLPHPNT--PLTDGTLCTVVGWGQLFEIGRVFPDTLQEVQLP 1432 (1634)
Q Consensus      1358 V~~IiiHP~Yn~~t~~-nDIALLkL~~P--fSd~VqPICLP~~~~--~l~~G~~c~VvGWG~t~e~g~~~s~~Lq~v~Vp 1432 (1634)
                      |.++++||.|+..... ||||||+|..+  |+++|+|||||....  ....+..|+++|||.+.......+..|+++.++
T Consensus        89 v~~~i~H~~y~~~~~~~nDiall~l~~~v~~~~~i~piclp~~~~~~~~~~~~~~~v~GWG~~~~~~~~~~~~L~~~~v~  168 (256)
T KOG3627|consen   89 VEKIIVHPNYNPRTLENNDIALLRLSEPVTFSSHIQPICLPSSADPYFPPGGTTCLVSGWGRTESGGGPLPDTLQEVDVP  168 (256)
T ss_pred             eeEEEECCCCCCCCCCCCCEEEEEECCCcccCCcccccCCCCCcccCCCCCCCEEEEEeCCCcCCCCCCCCceeEEEEEe
Confidence            8899999999998877 99999999997  999999999985543  244558999999998866544567889999999


Q ss_pred             eeChhHHhhhccCCCcccCCCCeEEEeecCCCCCCCCCCCCCeEEEecCCCcEEEEEEEEcCCC-CCCCCCCeEEEEcCC
Q psy11546       1433 IISTAECRKRTLFLPLYRVTENMFCAGFERGGRDACLGDSGGPLMCQEPDGRWSLMGVTSNGYG-CARANRPGVYTKVSN 1511 (1634)
Q Consensus      1433 IIs~eeC~~~~~~~~~~~It~sm~CAG~~~gg~daC~GDSGGPLVc~~~~GrW~LvGIvS~G~g-Cg~~~~PgVYTrVs~ 1511 (1634)
                      +++...|...+....  .++..||||+...++.++|+|||||||++... ++|+++||+|||.+ |+..+.|++||+|+.
T Consensus       169 i~~~~~C~~~~~~~~--~~~~~~~Ca~~~~~~~~~C~GDSGGPLv~~~~-~~~~~~GivS~G~~~C~~~~~P~vyt~V~~  245 (256)
T KOG3627|consen  169 IISNSECRRAYGGLG--TITDTMLCAGGPEGGKDACQGDSGGPLVCEDN-GRWVLVGIVSWGSGGCGQPNYPGVYTRVSS  245 (256)
T ss_pred             EcChhHhcccccCcc--ccCCCEEeeCccCCCCccccCCCCCeEEEeeC-CcEEEEEEEEecCCCCCCCCCCeEEeEhHH
Confidence            999999998654321  46677999998677888999999999999874 48999999999998 999889999999999


Q ss_pred             chHHHHHhhc
Q psy11546       1512 YIPWLYNNMA 1521 (1634)
Q Consensus      1512 YlDWI~~vI~ 1521 (1634)
                      |++||++.+.
T Consensus       246 y~~WI~~~~~  255 (256)
T KOG3627|consen  246 YLDWIKENIG  255 (256)
T ss_pred             hHHHHHHHhc
Confidence            9999999875


No 3  
>smart00020 Tryp_SPc Trypsin-like serine protease. Many of these are synthesised as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. A few, however, are active as single chain molecules, and others are inactive due to substitutions of the catalytic triad residues.
Probab=100.00  E-value=1.5e-33  Score=308.02  Aligned_cols=226  Identities=49%  Similarity=0.941  Sum_probs=194.6

Q ss_pred             eEecCeecCCCccceEEEEEeCC-eeeeeceeeCCCccccccccccCCCCCceEEEEeEEecCCCCCCCCceeEeeEEEE
Q psy11546       1284 RIVGGGNARLGSWPWQAALYKEG-EFQCGATLISDQWLLSAGHCFYRAQDDYWVARLGTLRRGTKLPSPYEQLRPISKII 1362 (1634)
Q Consensus      1284 RIvGG~~a~~GewPW~VsL~~~g-~~~CGGTLIS~rWVLTAAHCV~~~~~~~~~VrLG~~~~~s~~~s~~~Q~~~V~~Ii 1362 (1634)
                      |++||..+..++|||+|.|+... .+.|+|+||+++||||||||+.......+.|++|.......   ...+.+.|..++
T Consensus         1 ~~~~G~~~~~~~~Pw~~~i~~~~~~~~C~GtlIs~~~VLTaahC~~~~~~~~~~v~~g~~~~~~~---~~~~~~~v~~~~   77 (229)
T smart00020        1 RIVGGSEANIGSFPWQVSLQYRGGRHFCGGSLISPRWVLTAAHCVYGSDPSNIRVRLGSHDLSSG---EEGQVIKVSKVI   77 (229)
T ss_pred             CccCCCcCCCCCCCcEEEEEEcCCCcEEEEEEecCCEEEECHHHcCCCCCcceEEEeCcccCCCC---CCceEEeeEEEE
Confidence            68999999999999999999877 89999999999999999999987655678888987654321   223788999999


Q ss_pred             eCCCCCCCCCcCceEEEEeCCC--CCCCceeeecCCCCCCCCCCCeEEEEEecccccCCCcCCCCcEEEEEeeeChhHHh
Q psy11546       1363 LHPQYVDAGFINDISILKMKTP--FSNYVRPICLPHPNTPLTDGTLCTVVGWGQLFEIGRVFPDTLQEVQLPIISTAECR 1440 (1634)
Q Consensus      1363 iHP~Yn~~t~~nDIALLkL~~P--fSd~VqPICLP~~~~~l~~G~~c~VvGWG~t~e~g~~~s~~Lq~v~VpIIs~eeC~ 1440 (1634)
                      +||.|+.....+|||||+|+++  +++.++|||||........+..+.++|||.....+......++...+.+++.+.|.
T Consensus        78 ~~p~~~~~~~~~DiAll~L~~~i~~~~~~~pi~l~~~~~~~~~~~~~~~~g~g~~~~~~~~~~~~~~~~~~~~~~~~~C~  157 (229)
T smart00020       78 IHPNYNPSTYDNDIALLKLKSPVTLSDNVRPICLPSSNYNVPAGTTCTVSGWGRTSEGAGSLPDTLQEVNVPIVSNATCR  157 (229)
T ss_pred             ECCCCCCCCCcCCEEEEEECcccCCCCceeeccCCCcccccCCCCEEEEEeCCCCCCCCCcCCCEeeEEEEEEeCHHHhh
Confidence            9999998888999999999999  78899999999875566678999999999876544445678999999999999999


Q ss_pred             hhccCCCcccCCCCeEEEeecCCCCCCCCCCCCCeEEEecCCCcEEEEEEEEcCCCCCCCCCCeEEEEcCCchHHH
Q psy11546       1441 KRTLFLPLYRVTENMFCAGFERGGRDACLGDSGGPLMCQEPDGRWSLMGVTSNGYGCARANRPGVYTKVSNYIPWL 1516 (1634)
Q Consensus      1441 ~~~~~~~~~~It~sm~CAG~~~gg~daC~GDSGGPLVc~~~~GrW~LvGIvS~G~gCg~~~~PgVYTrVs~YlDWI 1516 (1634)
                      ..+...  ..+...++|++......+.|.|||||||++.. + +|+|+||+|+|..|...+.|.+|++|++|++||
T Consensus       158 ~~~~~~--~~~~~~~~C~~~~~~~~~~c~gdsG~pl~~~~-~-~~~l~Gi~s~g~~C~~~~~~~~~~~i~~~~~WI  229 (229)
T smart00020      158 RAYSGG--GAITDNMLCAGGLEGGKDACQGDSGGPLVCND-G-RWVLVGIVSWGSGCARPGKPGVYTRVSSYLDWI  229 (229)
T ss_pred             hhhccc--cccCCCcEeecCCCCCCcccCCCCCCeeEEEC-C-CEEEEEEEEECCCCCCCCCCCEEEEeccccccC
Confidence            865431  24788999998765578899999999999987 3 999999999999998788999999999999998


No 4  
>PF00089 Trypsin:  Trypsin;  InterPro: IPR001254 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This group of serine proteases belong to the MEROPS peptidase family S1 (chymotrypsin family, clan PA(S))and to peptidase family S6 (Hap serine peptidases). The chymotrypsin family is almost totally confined to animals, although trypsin-like enzymes are found in actinomycetes of the genera Streptomyces and Saccharopolyspora, and in the fungus Fusarium oxysporum []. The enzymes are inherently secreted, being synthesised with a signal peptide that targets them to the secretory pathway. Animal enzymes are either secreted directly, packaged into vesicles for regulated secretion, or are retained in leukocyte granules []. The Hap family, 'Haemophilus adhesion and penetration', are proteins that play a role in the interaction with human epithelial cells. The serine protease activity is localized at the N-terminal domain, whereas the binding domain is in the C-terminal region. ; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis; PDB: 1SPJ_A 1A5I_A 2ZGH_A 2ZKS_A 2ZGJ_A 2ZGC_A 2ODP_A 2I6Q_A 2I6S_A 2ODQ_A ....
Probab=99.98  E-value=1.6e-31  Score=288.73  Aligned_cols=217  Identities=43%  Similarity=0.832  Sum_probs=186.8

Q ss_pred             EecCeecCCCccceEEEEEeCC-eeeeeceeeCCCccccccccccCCCCCceEEEEeEEecCCCCCCCCceeEeeEEEEe
Q psy11546       1285 IVGGGNARLGSWPWQAALYKEG-EFQCGATLISDQWLLSAGHCFYRAQDDYWVARLGTLRRGTKLPSPYEQLRPISKIIL 1363 (1634)
Q Consensus      1285 IvGG~~a~~GewPW~VsL~~~g-~~~CGGTLIS~rWVLTAAHCV~~~~~~~~~VrLG~~~~~s~~~s~~~Q~~~V~~Iii 1363 (1634)
                      |+||..+..++|||+|.|+... .++|+|+||+++||||||||+..  ...+.+.+|.....  ......+.+.|+++++
T Consensus         1 i~~g~~~~~~~~p~~v~i~~~~~~~~C~G~li~~~~vLTaahC~~~--~~~~~v~~g~~~~~--~~~~~~~~~~v~~~~~   76 (220)
T PF00089_consen    1 IVGGDPASPGEFPWVVSIRYSNGRFFCTGTLISPRWVLTAAHCVDG--ASDIKVRLGTYSIR--NSDGSEQTIKVSKIII   76 (220)
T ss_dssp             SBSSEECGTTSSTTEEEEEETTTEEEEEEEEEETTEEEEEGGGHTS--GGSEEEEESESBTT--STTTTSEEEEEEEEEE
T ss_pred             CCCCEECCCCCCCeEEEEeeCCCCeeEeEEeccccccccccccccc--cccccccccccccc--cccccccccccccccc
Confidence            6799999999999999999887 99999999999999999999977  44677788872222  1233368899999999


Q ss_pred             CCCCCCCCCcCceEEEEeCCC--CCCCceeeecCCCCCCCCCCCeEEEEEecccccCCCcCCCCcEEEEEeeeChhHHhh
Q psy11546       1364 HPQYVDAGFINDISILKMKTP--FSNYVRPICLPHPNTPLTDGTLCTVVGWGQLFEIGRVFPDTLQEVQLPIISTAECRK 1441 (1634)
Q Consensus      1364 HP~Yn~~t~~nDIALLkL~~P--fSd~VqPICLP~~~~~l~~G~~c~VvGWG~t~e~g~~~s~~Lq~v~VpIIs~eeC~~ 1441 (1634)
                      ||.|+.....+|||||+|+++  +.+.++|+||+........+..+.++|||.....+  ....++...+.+++.+.|..
T Consensus        77 h~~~~~~~~~~DiAll~L~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~G~~~~~~~~--~~~~~~~~~~~~~~~~~c~~  154 (220)
T PF00089_consen   77 HPKYDPSTYDNDIALLKLDRPITFGDNIQPICLPSAGSDPNVGTSCIVVGWGRTSDNG--YSSNLQSVTVPVVSRKTCRS  154 (220)
T ss_dssp             ETTSBTTTTTTSEEEEEESSSSEHBSSBEESBBTSTTHTTTTTSEEEEEESSBSSTTS--BTSBEEEEEEEEEEHHHHHH
T ss_pred             cccccccccccccccccccccccccccccccccccccccccccccccccccccccccc--cccccccccccccccccccc
Confidence            999999888999999999999  78999999999865556788999999999865444  45679999999999999998


Q ss_pred             hccCCCcccCCCCeEEEeecCCCCCCCCCCCCCeEEEecCCCcEEEEEEEEcCCCCCCCCCCeEEEEcCCchHHH
Q psy11546       1442 RTLFLPLYRVTENMFCAGFERGGRDACLGDSGGPLMCQEPDGRWSLMGVTSNGYGCARANRPGVYTKVSNYIPWL 1516 (1634)
Q Consensus      1442 ~~~~~~~~~It~sm~CAG~~~gg~daC~GDSGGPLVc~~~~GrW~LvGIvS~G~gCg~~~~PgVYTrVs~YlDWI 1516 (1634)
                      .+.    ..+...++|++.. +..+.|.|||||||++...    +|+||++++..|...+.|++|++|+.|++||
T Consensus       155 ~~~----~~~~~~~~c~~~~-~~~~~~~g~sG~pl~~~~~----~lvGI~s~~~~c~~~~~~~v~~~v~~~~~WI  220 (220)
T PF00089_consen  155 SYN----DNLTPNMICAGSS-GSGDACQGDSGGPLICNNN----YLVGIVSFGENCGSPNYPGVYTRVSSYLDWI  220 (220)
T ss_dssp             HTT----TTSTTTEEEEETT-SSSBGGTTTTTSEEEETTE----EEEEEEEEESSSSBTTSEEEEEEGGGGHHHH
T ss_pred             ccc----ccccccccccccc-cccccccccccccccccee----eecceeeecCCCCCCCcCEEEEEHHHhhccC
Confidence            522    2377899999976 6688999999999998762    8999999999999988899999999999998


No 5  
>COG5640 Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, chaperones]
Probab=99.93  E-value=2.5e-25  Score=253.41  Aligned_cols=242  Identities=28%  Similarity=0.391  Sum_probs=175.9

Q ss_pred             CcceEecCeecCCCccceEEEEEeC-----CeeeeeceeeCCCccccccccccCCCCCceEEEEeEEecCCCCCCCCcee
Q psy11546       1281 RHSRIVGGGNARLGSWPWQAALYKE-----GEFQCGATLISDQWLLSAGHCFYRAQDDYWVARLGTLRRGTKLPSPYEQL 1355 (1634)
Q Consensus      1281 ~~~RIvGG~~a~~GewPW~VsL~~~-----g~~~CGGTLIS~rWVLTAAHCV~~~~~~~~~VrLG~~~~~s~~~s~~~Q~ 1355 (1634)
                      ...||+||..+..|+||++|+|...     ...+|||++|..|||||||||+........-+..+.....   +..+.+.
T Consensus        29 vs~rIigGs~Anag~~P~~VaLv~~isd~~s~tfCGgs~l~~RYvLTAAHC~~~~s~is~d~~~vv~~l~---d~Sq~~r  105 (413)
T COG5640          29 VSSRIIGGSNANAGEYPSLVALVDRISDYVSGTFCGGSKLGGRYVLTAAHCADASSPISSDVNRVVVDLN---DSSQAER  105 (413)
T ss_pred             cceeEecCcccccccCchHHHHHhhcccccceeEeccceecceEEeeehhhccCCCCccccceEEEeccc---ccccccC
Confidence            3679999999999999999999643     3468999999999999999999875532222223333332   2345677


Q ss_pred             EeeEEEEeCCCCCCCCCcCceEEEEeCCCCC-CCceeeecCCCC----CCCCCCCeEEEEEecccccCCC-cC---CCCc
Q psy11546       1356 RPISKIILHPQYVDAGFINDISILKMKTPFS-NYVRPICLPHPN----TPLTDGTLCTVVGWGQLFEIGR-VF---PDTL 1426 (1634)
Q Consensus      1356 ~~V~~IiiHP~Yn~~t~~nDIALLkL~~PfS-d~VqPICLP~~~----~~l~~G~~c~VvGWG~t~e~g~-~~---s~~L 1426 (1634)
                      ..|..++.|..|.+.++.||||+++|.++.+ ..++ +-+-...    ............+||.+.+... ..   ...|
T Consensus       106 g~vr~i~~~efY~~~n~~ND~Av~~l~~~a~~pr~k-i~~~~~sdt~l~sv~~~s~~~n~t~~~~~~~~v~~~~p~gt~l  184 (413)
T COG5640         106 GHVRTIYVHEFYSPGNLGNDIAVLELARAASLPRVK-ITSFDASDTFLNSVTTVSPMTNGTFGVTTPSDVPRSSPKGTIL  184 (413)
T ss_pred             cceEEEeeecccccccccCcceeeccccccccchhh-eeeccCcccceecccccccccceeeeeeeecCCCCCCCcccee
Confidence            8999999999999999999999999998621 1111 1111110    1122334455667776644321 11   1379


Q ss_pred             EEEEEeeeChhHHhhhccC----CCcccCCCCeEEEeecCCCCCCCCCCCCCeEEEecCCCcEEEEEEEEcCCC-CCCCC
Q psy11546       1427 QEVQLPIISTAECRKRTLF----LPLYRVTENMFCAGFERGGRDACLGDSGGPLMCQEPDGRWSLMGVTSNGYG-CARAN 1501 (1634)
Q Consensus      1427 q~v~VpIIs~eeC~~~~~~----~~~~~It~sm~CAG~~~gg~daC~GDSGGPLVc~~~~GrW~LvGIvS~G~g-Cg~~~ 1501 (1634)
                      +++.+..++...|...+..    .+...++.  +|||..  ..++|+||||||++....+ ...++||+|||.+ |+.+.
T Consensus       185 ~e~~v~fv~~stc~~~~g~an~~dg~~~lT~--~cag~~--~~daCqGDSGGPi~~~g~~-G~vQ~GVvSwG~~~Cg~t~  259 (413)
T COG5640         185 HEVAVLFVPLSTCAQYKGCANASDGATGLTG--FCAGRP--PKDACQGDSGGPIFHKGEE-GRVQRGVVSWGDGGCGGTL  259 (413)
T ss_pred             eeeeeeeechHHhhhhccccccCCCCCCccc--eecCCC--CcccccCCCCCceEEeCCC-ccEEEeEEEecCCCCCCCC
Confidence            9999999999999875531    11123444  999954  4899999999999998744 4588999999987 99999


Q ss_pred             CCeEEEEcCCchHHHHHhhccccccccccc
Q psy11546       1502 RPGVYTKVSNYIPWLYNNMAASEYNMMRNE 1531 (1634)
Q Consensus      1502 ~PgVYTrVs~YlDWI~~vI~~s~~~~~~~~ 1531 (1634)
                      .|+|||+|+.|.+||..+|+.-...+.++.
T Consensus       260 ~~gVyT~vsny~~WI~a~~~~l~~~~~rp~  289 (413)
T COG5640         260 IPGVYTNVSNYQDWIAAMTNGLSYLQFRPL  289 (413)
T ss_pred             cceeEEehhHHHHHHHHHhcCCCccccccc
Confidence            999999999999999999988766655443


No 6  
>PF09342 DUF1986:  Domain of unknown function (DUF1986);  InterPro: IPR015420 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain is found in serine endopeptidases belonging to MEROPS peptidase family S1A (clan PA). It is found in unusual mosaic proteins, which are encoded by the Drosophila nudel gene (see P98159 from SWISSPROT). Nudel is involved in defining embryonic dorsoventral polarity. Three proteases; ndl, gd and snk process easter to create active easter. Active easter defines cell identities along the dorsal-ventral continuum by activating the spz ligand for the Tl receptor in the ventral region of the embryo. Nudel, pipe and windbeutel together trigger the protease cascade within the extraembryonic perivitelline compartment which induces dorsoventral polarity of the Drosophila embryo [].
Probab=99.49  E-value=1.9e-13  Score=151.28  Aligned_cols=216  Identities=24%  Similarity=0.462  Sum_probs=142.8

Q ss_pred             CCCccceEEEEEeCCeeeeeceeeCCCccccccccccCC--CCCceEEEEeEEecCCCCCCCCceeEeeEEEEeCCCCCC
Q psy11546       1292 RLGSWPWQAALYKEGEFQCGATLISDQWLLSAGHCFYRA--QDDYWVARLGTLRRGTKLPSPYEQLRPISKIILHPQYVD 1369 (1634)
Q Consensus      1292 ~~GewPW~VsL~~~g~~~CGGTLIS~rWVLTAAHCV~~~--~~~~~~VrLG~~~~~s~~~s~~~Q~~~V~~IiiHP~Yn~ 1369 (1634)
                      ....|||.|.||.+|.++|+|+||++.|||++..|+.+.  ...++.|.+|.........++++|+++|+.+..-|.   
T Consensus        12 e~y~WPWlA~IYvdG~~~CsgvLlD~~WlLvsssCl~~I~L~~~YvsallG~~Kt~~~v~Gp~EQI~rVD~~~~V~~---   88 (267)
T PF09342_consen   12 EDYHWPWLADIYVDGRYWCSGVLLDPHWLLVSSSCLRGISLSHHYVSALLGGGKTYLSVDGPHEQISRVDCFKDVPE---   88 (267)
T ss_pred             ccccCcceeeEEEcCeEEEEEEEeccceEEEeccccCCcccccceEEEEecCcceecccCCChheEEEeeeeeeccc---
Confidence            456799999999999999999999999999999999764  347889999988777667899999999999876654   


Q ss_pred             CCCcCceEEEEeCCC--CCCCceeeecCCCCCCCCCCCeEEEEEecccccCCCcCCCCcEEEEEeee----ChhHHhhhc
Q psy11546       1370 AGFINDISILKMKTP--FSNYVRPICLPHPNTPLTDGTLCTVVGWGQLFEIGRVFPDTLQEVQLPII----STAECRKRT 1443 (1634)
Q Consensus      1370 ~t~~nDIALLkL~~P--fSd~VqPICLP~~~~~l~~G~~c~VvGWG~t~e~g~~~s~~Lq~v~VpII----s~eeC~~~~ 1443 (1634)
                          .+++||+|++|  |+.+|+|+.||...........|+.+|-..+   |.  ..++.-..++..    +...|.+..
T Consensus        89 ----S~v~LLHL~~~~~fTr~VlP~flp~~~~~~~~~~~CVAVg~d~~---g~--~kt~si~l~~n~~nC~s~~~Cy~~~  159 (267)
T PF09342_consen   89 ----SNVLLLHLEQPANFTRYVLPTFLPETSNENESDDECVAVGHDDT---GR--IKTVSIFLIPNLENCDSCNRCYKLQ  159 (267)
T ss_pred             ----cceeeeeecCcccceeeecccccccccCCCCCCCceEEEEcccC---Cc--eeeeEEEeeccccCCCchhhhhhhc
Confidence                78999999999  9999999999986666666679999987541   11  111111111111    112243322


Q ss_pred             cCCCcc-----------cCCCCeEEEeecCCC-----CCCCCC-------------CCCCeEEEecCCCcEEEEEEEEcC
Q psy11546       1444 LFLPLY-----------RVTENMFCAGFERGG-----RDACLG-------------DSGGPLMCQEPDGRWSLMGVTSNG 1494 (1634)
Q Consensus      1444 ~~~~~~-----------~It~sm~CAG~~~gg-----~daC~G-------------DSGGPLVc~~~~GrW~LvGIvS~G 1494 (1634)
                      ...+..           .-...++|... .+-     -..|.-             --+|-|||+.....||.+|+.++.
T Consensus       160 ~~~p~~~~n~e~~~s~s~w~g~i~c~~~-~~~~~~a~F~~c~~~~~kn~~~~~~~~s~~gv~vC~~~~~gw~p~~~~~~~  238 (267)
T PF09342_consen  160 SKQPYALLNAEDMASISEWVGLISCVSE-KGWYPAAKFTSCDQFGLKNFSDKQHSPSDQGVLVCRDSHSGWYPTAFFNYN  238 (267)
T ss_pred             ccCCcccccccccccccccceeEEeecC-CCCcceEEEEecccccccccccCCCCcccCceEEEcCCCCCccchhhhccC
Confidence            211100           11223455431 110     012221             135778898877889999999998


Q ss_pred             CC-CCCCCCCeEEEEcCCchHHHHHhh
Q psy11546       1495 YG-CARANRPGVYTKVSNYIPWLYNNM 1520 (1634)
Q Consensus      1495 ~g-Cg~~~~PgVYTrVs~YlDWI~~vI 1520 (1634)
                      .+ |-.-+.|---+.+....+=|++.+
T Consensus       239 ~~~C~~f~~~~~i~~le~~y~~~q~~~  265 (267)
T PF09342_consen  239 NGDCQGFNDPFGIRTLEEAYKSLQDSI  265 (267)
T ss_pred             CCccccccCeeeEeEHHHhHHHHHHHh
Confidence            75 544444443333333334444443


No 7  
>PF01390 SEA:  SEA domain;  InterPro: IPR000082 SEA is an extracellular domain associated with O-glycosylation []. Proteins found to contain SEA-modules include, agrin, enterokinase, 63 kDa Strongylocentrotus purpuratus (Purple sea urchin) sperm protein, perlecan (heparan sulphate proteoglycan core, mucin 1 and the cell surface antigen, 114/A10, and two functionally uncharacterised, probably extracellular, Caenorhabditis elegans proteins. Despite the functional diversity of these adhesive proteins, a common denominator seems to be their existence in heavily glycosylated environments. In addition, the better characterised proteins all contain O-glycosidic-linked carbohydrates such as heparan sulphate that contribute considerably to their molecular masses. The common module might regulate or assist binding to neighbouring carbohydrate moieties. Enterokinase, the initiator of intestinal digestion, is a mosaic protease composed of a distinctive assortment of domains []. ; PDB: 2ACM_B 1IVZ_A 2E7V_A.
Probab=99.47  E-value=1.8e-13  Score=135.69  Aligned_cols=104  Identities=27%  Similarity=0.500  Sum_probs=80.9

Q ss_pred             ccEEEEEEEEEecCCccCcccCCcchHhHHHHHHHHHHHHHHHhhcCCcccccceeEEEEeeCCCEEEEEEEEEcccccc
Q psy11546         70 VELIFDSSFRVTAGDSYNPSLENSTSNLYKEKSKRYKSMIEKLYNASVLSPAIKYCGVIGFKNGSLIVFYRIILDRRKIP  149 (1634)
Q Consensus        70 k~~yY~GSFrItn~~~ys~dL~~~sS~~Fk~La~~IE~mV~~iF~~S~L~~~YikS~Vi~F~~gsViV~f~L~Fd~~~~p  149 (1634)
                      +.++|.++|+|++.. |+++|.+++|.+|+.|+++++.+|+.+|+.+.|.+.|++++|++|++|+|+|+|+|.|   ..+
T Consensus         2 ~~~~~~~~~~i~~~~-f~~~l~d~~S~~f~~l~~~i~~~i~~~~~~~~~~~~~~~~~I~~f~~gsViv~~~~~f---~~~   77 (107)
T PF01390_consen    2 KSQSYSGSFRITNLE-FTPDLADPNSPEFQELARDIEDQINSVFRNSSLSPGFVGVTITSFRPGSVIVDFDVIF---DPP   77 (107)
T ss_dssp             ECEEEEEEEEESSSE---GGGG-TTSHHHHHHHHHHHHHHHHHHHTSTTTTTEEEEEEEEEEETEEEEEEEEEE---ETT
T ss_pred             ceEEEEEEEEEeCCc-cchhcCCCCCHHHHHHHHHHHHHHHHhhccCccCCCcceEEEEEEECCCEEEEEEEEE---eCC
Confidence            468999999999999 9999999999999999999999999999999999999999999999999999999999   444


Q ss_pred             cCc--ccHHHHHHHHHHHHHhhcccCCccceEEcCC
Q psy11546        150 RSI--GNVEEVVKNILVDEITSRKAVAFKNIKVDEN  183 (1634)
Q Consensus       150 ~s~--~~v~~kI~~iL~q~L~~n~s~~l~~L~IDps  183 (1634)
                      ...  ..+++.+..+|++  .    +...+|.|||+
T Consensus        78 ~~~~~~~~~~~l~~~l~~--~----~~~~~l~i~~~  107 (107)
T PF01390_consen   78 SSAPPADIEEALQNALQQ--T----SNIGNLTIDPS  107 (107)
T ss_dssp             T-S-HHHHHHHHHHHHCC--H----HCCCCCEE-S-
T ss_pred             CCCCHHHHHHHHHHHHHh--h----cCcccccCCCC
Confidence            322  3444444444433  1    12337779975


No 8  
>PF03761 DUF316:  Domain of unknown function (DUF316) ;  InterPro: IPR005514 This is a family of uncharacterised proteins from Caenorhabditis elegans.
Probab=99.38  E-value=5.9e-12  Score=145.25  Aligned_cols=209  Identities=24%  Similarity=0.393  Sum_probs=135.6

Q ss_pred             ceEecCeecCCCccceEEEEEeCC----eeeeeceeeCCCccccccccccCCCCC-----------------ceEEE---
Q psy11546       1283 SRIVGGGNARLGSWPWQAALYKEG----EFQCGATLISDQWLLSAGHCFYRAQDD-----------------YWVAR--- 1338 (1634)
Q Consensus      1283 ~RIvGG~~a~~GewPW~VsL~~~g----~~~CGGTLIS~rWVLTAAHCV~~~~~~-----------------~~~Vr--- 1338 (1634)
                      .+...|..+..++.||.|.++..+    ..+++|+|||+||||||+||+......                 .+.|.   
T Consensus        40 ~~~~~g~~~~~~~~pW~v~v~~~~~~~~~~~~~gtlIS~RHiLtss~~~~~~~~~W~~~~~~~~~~C~~~~~~l~vP~~~  119 (282)
T PF03761_consen   40 SKVFNGTPAESGEAPWAVSVYTKNHNEGNYFSTGTLISPRHILTSSHCVMNDKSKWLNGEEFDNKKCEGNNNHLIVPEEV  119 (282)
T ss_pred             ccccCCcccccCCCCCEEEEEeccCcccceecceEEeccCeEEEeeeEEEecccccccCcccccceeeCCCceEEeCHHH
Confidence            456788899999999999998643    356899999999999999999632211                 01110   


Q ss_pred             EeEEec--CCCCCCCCceeEeeEEEEeCCCC----CCCCCcCceEEEEeCCCCCCCceeeecCCCCCCCCCCCeEEEEEe
Q psy11546       1339 LGTLRR--GTKLPSPYEQLRPISKIILHPQY----VDAGFINDISILKMKTPFSNYVRPICLPHPNTPLTDGTLCTVVGW 1412 (1634)
Q Consensus      1339 LG~~~~--~s~~~s~~~Q~~~V~~IiiHP~Y----n~~t~~nDIALLkL~~PfSd~VqPICLP~~~~~l~~G~~c~VvGW 1412 (1634)
                      +-.+..  .............|.++++--.-    ......++++||+|+++++..+.|+|||........+....+.|+
T Consensus       120 l~~~~v~~~~~~~~~~~~~~~v~ka~il~~C~~~~~~~~~~~~~mIlEl~~~~~~~~~~~Cl~~~~~~~~~~~~~~~yg~  199 (282)
T PF03761_consen  120 LSKIDVRCCNCFSNGKCFSIKVKKAYILNGCKKIKKNFNRPYSPMILELEEDFSKNVSPPCLADSSTNWEKGDEVDVYGF  199 (282)
T ss_pred             hccEEEEeecccccCCcccceeEEEEEEecCCCcccccccccceEEEEEcccccccCCCEEeCCCccccccCceEEEeec
Confidence            000000  00001111223456666653221    234456899999999999999999999987776777777777777


Q ss_pred             cccccCCCcCCCCcEEEEEeeeChhHHhhhccCCCcccCCCCeEEEeecCCCCCCCCCCCCCeEEEecCCCcEEEEEEEE
Q psy11546       1413 GQLFEIGRVFPDTLQEVQLPIISTAECRKRTLFLPLYRVTENMFCAGFERGGRDACLGDSGGPLMCQEPDGRWSLMGVTS 1492 (1634)
Q Consensus      1413 G~t~e~g~~~s~~Lq~v~VpIIs~eeC~~~~~~~~~~~It~sm~CAG~~~gg~daC~GDSGGPLVc~~~~GrW~LvGIvS 1492 (1634)
                      .        ....+....+.+.....|                  +.........|.||+||||+-.. +|+|+|+||.+
T Consensus       200 ~--------~~~~~~~~~~~i~~~~~~------------------~~~~~~~~~~~~~d~Gg~lv~~~-~gr~tlIGv~~  252 (282)
T PF03761_consen  200 N--------STGKLKHRKLKITNCTKC------------------AYSICTKQYSCKGDRGGPLVKNI-NGRWTLIGVGA  252 (282)
T ss_pred             C--------CCCeEEEEEEEEEEeecc------------------ceeEecccccCCCCccCeEEEEE-CCCEEEEEEEc
Confidence            1        122355555555443222                  11112345789999999999877 89999999999


Q ss_pred             cCCCCCCCCCCeEEEEcCCchHHHHHh
Q psy11546       1493 NGYGCARANRPGVYTKVSNYIPWLYNN 1519 (1634)
Q Consensus      1493 ~G~gCg~~~~PgVYTrVs~YlDWI~~v 1519 (1634)
                      .+..-+.. ....|.+|..|.+=|-+.
T Consensus       253 ~~~~~~~~-~~~~f~~v~~~~~~IC~l  278 (282)
T PF03761_consen  253 SGNYECNK-NNSYFFNVSWYQDEICEL  278 (282)
T ss_pred             cCCCcccc-cccEEEEHHHhhhhhccc
Confidence            77632221 267888888887755443


No 9  
>smart00200 SEA Domain found in sea urchin sperm protein, enterokinase, agrin. Proposed function of regulating or binding carbohydrate sidechains.
Probab=99.15  E-value=1.3e-10  Score=119.50  Aligned_cols=92  Identities=27%  Similarity=0.414  Sum_probs=75.9

Q ss_pred             EEEEEEEEEe-----cCCccCcccCCcchHhHHHHHHHHHHHHHHHhhcCCcccccceeEEEEeeCCCEEEEEEEEEccc
Q psy11546         72 LIFDSSFRVT-----AGDSYNPSLENSTSNLYKEKSKRYKSMIEKLYNASVLSPAIKYCGVIGFKNGSLIVFYRIILDRR  146 (1634)
Q Consensus        72 ~yY~GSFrIt-----n~~~ys~dL~~~sS~~Fk~La~~IE~mV~~iF~~S~L~~~YikS~Vi~F~~gsViV~f~L~Fd~~  146 (1634)
                      +.+.++|.++     +.. |+++|++++|.+|++|++++|+||+++|++|+|++.|++++|++|++|+|+|++.|.|   
T Consensus         3 ~~~~~~~~~~~~~~~~i~-ys~~L~d~sS~~f~eL~~~ie~~l~~~f~~s~l~~~f~~~~V~~~~~gsv~v~~~l~f---   78 (121)
T smart00200        3 QSFGVSLSVLSVEGENLQ-YSPSLEDPSSEEYQELVRDVEKLLEQIYGKTDLKPDFVGTEVIEFRNGSVVVDLGLLF---   78 (121)
T ss_pred             eEEEEEEEEEEEecccee-eChhhCCcccHHHHHHHHHHHHHHHHHHhcCccccceeEEEEEEEcCCcEEEEEEEEe---
Confidence            4555655555     457 9999999999999999999999999999999999999999999999999999999999   


Q ss_pred             ccccCcccHHHHHHHHHHHHHhhc
Q psy11546        147 KIPRSIGNVEEVVKNILVDEITSR  170 (1634)
Q Consensus       147 ~~p~s~~~v~~kI~~iL~q~L~~n  170 (1634)
                      +.+...   ...|++.|.+.++..
T Consensus        79 ~~~~~~---~~~V~~~~~~~~~~~   99 (121)
T smart00200       79 NEGVTN---GQDVEEDLLQVIKQA   99 (121)
T ss_pred             ccCCCc---cchhHHHHHHHHHHH
Confidence            555331   345666666665543


No 10 
>smart00202 SR Scavenger receptor Cys-rich. The sea ucrhin egg peptide speract contains 4 repeats of SR domains that contain 6 conserved cysteines. May bind bacterial antigens in the protein MARCO.
Probab=98.70  E-value=2.7e-08  Score=98.50  Aligned_cols=90  Identities=17%  Similarity=0.279  Sum_probs=71.6

Q ss_pred             CcceEEEEecCcccccccCcchhhhhhhchhhhcchhHHhhccCCCCCccceeeeccCC--CCCcceEEEeccccccchh
Q psy11546       1157 DSGFLMIQKQGQWGKLCMNQINNFIMKTLKWKISDLGKAICKSMTFRDLNEIEAVADPS--TDDSIYYELSMNAINSSAA 1234 (1634)
Q Consensus      1157 ~eG~VeV~~~GtWg~VCsd~~~~~~~~~~~~~isdlA~vICRqLG~~~~~sve~v~~~~--~~~~~y~~vscn~~e~s~s 1234 (1634)
                      .+|+|+++.+|.|++||.++|+..           .|.++||+|||+............  ..+..+..+.|.+.+.++.
T Consensus        10 ~~G~leV~~~~~Wg~VC~~~w~~~-----------~A~vvCrqlG~~~~~~~~~~~~~~~~~~~~~~~~~~C~G~E~sl~   78 (101)
T smart00202       10 CEGRVEVYHNGQWGTVCDDGWDLR-----------DANVVCRQLGFGGALSASGSAYFGPGSGPIWLDNVRCTGTEASLS   78 (101)
T ss_pred             CeEEEEEEECCEEeCeeCCCCChh-----------HHHHHHHHcCCCCcccceeccccCCCCCCeEEEcceEcCCCCCHH
Confidence            789999999999999999999866           789999999998866554321212  2245566889999999888


Q ss_pred             cccccccccccCCCCceeeeecc
Q psy11546       1235 TKSALLFQKTKCSQKQVVAVNCS 1257 (1634)
Q Consensus      1235 ~c~~l~~~~~~C~s~~vV~V~C~ 1257 (1634)
                      .|....+....|.+..++.|.|.
T Consensus        79 ~C~~~~~~~~~C~~~~~~~v~C~  101 (101)
T smart00202       79 DCPHSGWGSHNCSHGEDAGVVCS  101 (101)
T ss_pred             HCCCCCCCCcCCCCCCcEEEEEC
Confidence            88876666666999999999883


No 11 
>PF00057 Ldl_recept_a:  Low-density lipoprotein receptor domain class A This prints entry is specific to LDL receptor;  InterPro: IPR002172  The low-density lipoprotein receptor (LDLR) is the major cholesterol-carrying lipoprotein of plasma, acting to regulate cholesterol homeostasis in mammalian cells. The LDL receptor binds LDL and transports it into cells by acidic endocytosis. In order to be internalized, the receptor-ligand complex must first cluster into clathrin-coated pits. Once inside the cell, the LDLR separates from its ligand, which is degraded in the lysosomes, while the receptor returns to the cell surface []. The internal dissociation of the LDLR with its ligand is mediated by proton pumps within the walls of the endosome that lower the pH. The LDLR is a multi-domain protein, containing:    The ligand-binding domain contains seven or eight 40-amino acid LDLR class A (cysteine-rich) repeats, each of which contains a coordinated calcium ion and six cysteine residues involved in disulphide bond formation []. Similar domains have been found in other extracellular and membrane proteins [].      The second conserved region contains two EGF repeats, followed by six LDLR class B (YWTD) repeats, and another EGF repeat. The LDLR class B repeats each contain a conserved YWTD motif, and is predicted to form a beta-propeller structure []. This region is critical for ligand release and recycling of the receptor [].     The third domain is rich in serine and threonine residues and contains clustered O-linked carbohydrate chains.     The fourth domain is the hydrophobic transmembrane region.     The fifth domain is the cytoplasmic tail that directs the receptor to clathrin-coated pits.   LDLR is closely related in structure to several other receptors, including LRP1, LRP1b, megalin/LRP2, VLDL receptor, lipoprotein receptor, MEGF7/LRP4, and LRP8/apolipoprotein E receptor2); these proteins participate in a wide range of physiological processes, including the regulation of lipid metabolism, protection against atherosclerosis, neurodevelopment, and transport of nutrients and vitamins []. This entry represents the LDLR class A (cyateine-rich) repeat, which contains 6 disulphide-bound cysteines and a highly conserved cluster of negatively charged amino acids, of which many are clustered on one face of the module []. In LDL receptors, the class A domains form the binding site for LDL and calcium. The acidic residues between the fourth and sixth cysteines are important for high-affinity binding of positively charged sequences in LDLR's ligands. The repeat consists of a beta-hairpin structure followed by a series of beta turns. In the absence of calcium, LDL-A domains are unstructured; the bound calcium ion imparts structural integrity. Following these repeats is a 350 residue domain that resembles part of the epidermal growth factor (EGF) precursor. Numerous familial hypercholestorolemia mutations of the LDL receptor alter the calcium coordinating residue of LDL-A domains or other crucial scaffolding residues. ; GO: 0005515 protein binding; PDB: 2I1P_A 3OJY_A 4E0S_B 3T5O_A 4A5W_B 1JRF_A 1K7B_A 1V9U_5 3DPR_E 2KNY_A ....
Probab=98.69  E-value=1e-08  Score=84.61  Aligned_cols=36  Identities=50%  Similarity=1.198  Sum_probs=33.8

Q ss_pred             CCCCCCceecCCCceecCCcccCCCCCCCCCCCCCCC
Q psy11546        992 ECEMNSSFRCGNGECVSIGSKCNQLVDCADGSDEKNC 1028 (1634)
Q Consensus       992 ~C~~~~~F~C~nG~CIp~~~~CDG~~DC~DgSDE~~C 1028 (1634)
                      .|. .++|+|.++.||+..|+|||+.||.|||||.+|
T Consensus         2 ~C~-~~~f~C~~~~CI~~~~~CDg~~DC~dgsDE~~C   37 (37)
T PF00057_consen    2 TCP-PGEFRCGNGQCIPKSWVCDGIPDCPDGSDEQNC   37 (37)
T ss_dssp             SSS-TTEEEETTSSEEEGGGTTSSSCSSSSSTTTSSH
T ss_pred             cCc-CCeeEcCCCCEEChHHcCCCCCCCCCCcccccC
Confidence            577 799999999999999999999999999999876


No 12 
>cd00112 LDLa Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about 40 amino acids are present in the N-terminal of this multidomain membrane protein; other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement; the binding of calcium is required for in vitro formation of the native disulfide isomer and is necessary in establishing and maintaining the modular structure
Probab=98.68  E-value=9e-09  Score=83.87  Aligned_cols=35  Identities=49%  Similarity=1.106  Sum_probs=32.6

Q ss_pred             CCCCCceecCCCceecCCcccCCCCCCCCCCCCCCC
Q psy11546        993 CEMNSSFRCGNGECVSIGSKCNQLVDCADGSDEKNC 1028 (1634)
Q Consensus       993 C~~~~~F~C~nG~CIp~~~~CDG~~DC~DgSDE~~C 1028 (1634)
                      |. .++|+|.+|.||+..|+|||+.||+|||||.+|
T Consensus         1 C~-~~~f~C~~~~Ci~~~~~CDg~~DC~dgsDE~~C   35 (35)
T cd00112           1 CP-PNEFRCANGRCIPSSWVCDGEDDCGDGSDEENC   35 (35)
T ss_pred             CC-CCeEEcCCCCeeCHHHcCCCccCCCCCcccccC
Confidence            45 589999999999999999999999999999876


No 13 
>cd07888 CRD_corin_2 One of two cysteine-rich domains of the corin protein, a type II transmembrane serine protease . The cysteine-rich domain (CRD) is an essential component of corin, a type II transmembrane serine protease which functions as the convertase of the pro-atrial natriuretic peptide (pro-ANP) in the heart. Corin contains two CRDs in its extracellular region, which play an important role in recognition of the physiological substrate, pro-ANP. This model characterizes the second (C-terminal) CRD.
Probab=98.57  E-value=2.2e-08  Score=103.07  Aligned_cols=103  Identities=13%  Similarity=0.200  Sum_probs=70.1

Q ss_pred             cccccchheeecccCCCcccCCCCcCCCCCCcchhhHHHH--HHHHHHHHHhhcccccccCCcchHHHHHHHhhcccccc
Q psy11546        855 EADSIEKKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMA--QNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHLKH  932 (1634)
Q Consensus       855 ~c~~i~~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~--~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~p~  932 (1634)
                      +|++|++++|..|+||.|.|||+++|.+|++     |.+.  ..+|..+      +.     ..|+++|..|+|++|+|.
T Consensus         1 ~CepI~~~~C~~l~Yn~T~~PN~lgH~sq~e-----a~~~~~~~~f~pL------v~-----~~Csp~l~~FlCsl~~P~   64 (122)
T cd07888           1 QCEPITLELCMNLPYNTTRYPNYLGHRTQKE-----ASISWESSLFPAL------VQ-----TNCYKYLMFFACTILVPK   64 (122)
T ss_pred             CCccCCHhhhCCCCcCcccCCcccCCCCHHH-----HHHhhhHHHHHhH------Hh-----cCCChhHHHHHHhccCcc
Confidence            5999999999999999999999999999998     5443  3345544      33     389999999999999995


Q ss_pred             cccCCCCCCCCCCCCCCCceeeccC-----CceeeeeeecccccCCCCCcC
Q psy11546        933 THQHNPDTNISTKAIPSSFNFNQVN-----GIPILTKVYTKVSKTNSTSKE  978 (1634)
Q Consensus       933 ~~~~~p~~~i~t~~ip~~f~~~~v~-----g~~~~~~v~~~~~~c~~~s~~  978 (1634)
                      ...-.. .    .--|++-.|..|.     -+....-.|+..-.|.++..+
T Consensus        65 C~~~~~-~----~i~PCRslCe~vr~~C~~~m~~fg~~WP~~l~C~~fP~~  110 (122)
T cd07888          65 CDPVTQ-Q----RIPPCRSLCRNSKERCESVLGIVGLQWPEDTDCAQFPEE  110 (122)
T ss_pred             ccCCCC-C----CCCCcHHHHHHHHHHHHHHHHHhCCCCCCcCccccCCCC
Confidence            311000 1    1125666666511     111112247777788877655


No 14 
>cd07441 CRD_SFRP3 Cysteine-rich domain of the secreted frizzled-related protein 3 (SFRP3, alias FRZB), a Wnt antagonist. The cysteine-rich domain (CRD) is an essential part of the secreted frizzled-related protein 3 (SFRP3, alias FRZB), which plays important roles in embryogenesis and postnatal development as an antagonist of Wnt proteins, key players in a number of fundamental cellular processes. SFRPs antagonize the activation of Wnt signaling by binding to the CRD domains of frizzled proteins (Fz), thereby preventing Wnt proteins from binding to these receptors. SFRPs are also known to have functions unrelated to Wnt, as enhancers of procollagen cleavage by the TLD proteinases. SFRPs and Fz proteins both contain CRD domains, but SFRPs lack the seven-pass transmembrane domain which is an integral part of Fzs. SFRP3 regulates Wnt signaling activity in bone development and homeostasis. It is also involved in the control of planar cell polarity.
Probab=98.55  E-value=2.5e-08  Score=102.95  Aligned_cols=63  Identities=11%  Similarity=0.082  Sum_probs=53.5

Q ss_pred             ccccccchheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhcccccc
Q psy11546        854 SEADSIEKKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHLKH  932 (1634)
Q Consensus       854 ~~c~~i~~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~p~  932 (1634)
                      ..||+|+.++|..|+||.|.|||+++|.+|++     +......|..+       +.    +.|+++|..|+|++|+|.
T Consensus         2 a~CepI~~~~C~~l~Yn~T~~PN~lgH~~q~e-----a~~~~~~f~pL-------v~----~~Csp~l~~FLCS~~~P~   64 (126)
T cd07441           2 ASCEPVRIPMCKSMPWNMTKMPNHLHHSTQAN-----AVLAIEQFEGL-------LG----TQCSPDLLFFLCAMYAPI   64 (126)
T ss_pred             CcccccChhhhCCCCcCcccCCcccCCCCHHH-----HHHHHHHHHHH-------hc----CCCCccchhHHhhccCcc
Confidence            35999999999999999999999999999999     66654455444       33    489999999999999994


No 15 
>cd07454 CRD_LIN_17 Cysteine-rich domain (CRD) of LIN_17. A cysteine-rich domain (CRD) is an essential component of a number of cell surface receptors, which are involved in multiple signal transduction pathways, particularly in modulating the activity of the Wnt proteins, which play a fundamental role in the early development of metazoans. CRD is also found in secreted frizzled related proteins (SFRPs), which lack the transmembrane segment found in the frizzled protein. The CRD domain is also present in the alpha-1 chain of mouse type XVIII collagen, in carboxypeptidase Z, several receptor tyrosine kinases, and the mosaic transmembrane serine protease corin. The CRD domain is well conserved in metazoans - 10 frizzled proteins have been identified in mammals, 4 in Drosophila and 3 in Caenorhabditis elegans. CRD domains have also been identified in multiple tandem copies in a Dictyostelium discoideum protein. Very little is known about the mechanism by which CRD domains interact with the
Probab=98.55  E-value=5.1e-08  Score=100.69  Aligned_cols=65  Identities=6%  Similarity=-0.047  Sum_probs=54.0

Q ss_pred             cccccccchheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhcccccc
Q psy11546        853 TSEADSIEKKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHLKH  932 (1634)
Q Consensus       853 t~~c~~i~~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~p~  932 (1634)
                      ++.|++|++++|..|+||.|.|||+++|.+|++     +......|..       ++.    .+|+++|..|+|++|+|.
T Consensus         2 ~~~C~Pi~~~~C~~l~Yn~T~~PN~lgh~~q~e-----a~~~~~~f~p-------Lv~----~~C~p~l~~FlCsl~~P~   65 (124)
T cd07454           2 KGKCIPIDIELCKDLPYNYTYFPNTILHNDQHT-----LQTHTEHFKP-------LMK----TKCHPHIHFFICSVFAPM   65 (124)
T ss_pred             CCccccCChhhhCCCCcCcccCCcccCCCCHHH-----HHHHHHHHHH-------HHh----cCCChhHHHHhhcccccc
Confidence            578999999999999999999999999999988     5544333444       443    489999999999999995


Q ss_pred             c
Q psy11546        933 T  933 (1634)
Q Consensus       933 ~  933 (1634)
                      .
T Consensus        66 C   66 (124)
T cd07454          66 C   66 (124)
T ss_pred             c
Confidence            3


No 16 
>cd07462 CRD_FZ10 Cysteine-rich Wnt-binding domain (CRD) of the frizzled 10 (Fz10) receptor. The cysteine-rich domain (CRD) is an essential extracellular portion of the frizzled 10 (Fz10) receptor, and is required for binding Wnt proteins, which play fundamental roles in many aspects of early development, such as cell and tissue polarity, neural synapse formation, and the regulation of proliferation. Fz proteins serve as Wnt receptors for multiple signal transduction pathways, including both beta-catenin dependent and -independent cellular signaling, as well as the planar cell polarity pathway and Ca(2+) modulating signaling pathway. CRD containing Fzs have been found in diverse species from amoebas to mammals. 10 different frizzled proteins are found in vertebrata. The cellular functon of Fz10 is unknown.
Probab=98.54  E-value=3.5e-08  Score=102.15  Aligned_cols=106  Identities=9%  Similarity=0.054  Sum_probs=73.3

Q ss_pred             cccccccchheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhcccccc
Q psy11546        853 TSEADSIEKKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHLKH  932 (1634)
Q Consensus       853 t~~c~~i~~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~p~  932 (1634)
                      ++.|++|+.++|..|+||.|.|||+++|.+|++     +.+....|..       ++.    ..|+++|..|+|++|+|.
T Consensus         2 ~~~C~pI~~~~C~~l~Yn~T~~PN~lgH~~q~e-----a~~~~~~f~p-------Lv~----~~C~p~l~~FlCsl~~P~   65 (127)
T cd07462           2 EGRCQPIEIPMCKDIGYNMTRMPNLMGHENQRE-----AAIQLHEFAP-------LVE----YGCHSHLKFFLCSLYAPM   65 (127)
T ss_pred             CCeeeecChhhhCCCCcCceeCCcccCCcCHHH-----HHHHHHHHHH-------Hhc----cCCChhhHHHHHhccccc
Confidence            578999999999999999999999999999988     6554334444       333    499999999999999995


Q ss_pred             cccCCCCCCCCCCCCCCCceeeccCC-----ceeeeeeecccccCCCCCcCC
Q psy11546        933 THQHNPDTNISTKAIPSSFNFNQVNG-----IPILTKVYTKVSKTNSTSKER  979 (1634)
Q Consensus       933 ~~~~~p~~~i~t~~ip~~f~~~~v~g-----~~~~~~v~~~~~~c~~~s~~~  979 (1634)
                      ....     .+..--|++-.|+.|..     +....-.|+....|.+.....
T Consensus        66 C~~~-----~~~~i~PCRslCe~vr~~C~~~m~~~g~~WP~~l~C~~fP~~~  112 (127)
T cd07462          66 CTEQ-----VSTPIPACRVMCEQARLKCSPIMEQFNFKWPDSLDCSKLPNKN  112 (127)
T ss_pred             cCCC-----CCCcCCccHHHHHHHHHHHHHHHHHhCCCCCCcCCcccCCCCC
Confidence            3211     11112256666766211     111122477777887776554


No 17 
>cd07442 CRD_SFRP4 Cysteine-rich domain of the secreted frizzled-related protein 4 (SFRP4), a Wnt antagonist. The cysteine-rich domain (CRD) is an essential part of the secreted frizzled-related Protein 4 (SFRP4), which regulates the activity of Wnt proteins, key players in a number of fundamental cellular processes such as embryogenesis and postnatal development. SFRPs antagonize the activation of Wnt signaling by binding to the CRDs domains of frizzled (Fz) proteins, thereby preventing Wnt proteins from binding to these receptors. SFRPs are also known to have functions unrelated to Wnt, as enhancers of procollagen cleavage by the TLD proteinases. SFRPs and Fz proteins both contain CRD domains, but SFRPs lack the seven-pass transmembrane domain which is an integral part of Fzs.
Probab=98.53  E-value=3.3e-08  Score=102.28  Aligned_cols=62  Identities=11%  Similarity=0.069  Sum_probs=52.8

Q ss_pred             cccccchheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhcccccc
Q psy11546        855 EADSIEKKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHLKH  932 (1634)
Q Consensus       855 ~c~~i~~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~p~  932 (1634)
                      .||.|+.++|..|+||.|.|||+++|.+|++     |.+....|..       ++.    +.|+++|..|+|++|+|.
T Consensus         4 ~CepI~i~~C~~l~Yn~T~~PN~lgH~~q~e-----a~~~~~~f~p-------Lv~----~~Csp~l~~FLCS~~~P~   65 (127)
T cd07442           4 PCEAVRIPMCRHMPWNITRMPNHLHHSTQEN-----AVLAIEQYEE-------LVD----TGCSPVLPFFLCAMYAPI   65 (127)
T ss_pred             cccccChhhhCCCCcCcccCcCccCCCCHHH-----HHHHHHHHHH-------HHc----CCCCcchHHHHHhhcCCc
Confidence            4999999999999999999999999999999     6655444444       443    388999999999999994


No 18 
>cd07456 CRD_FZ5_like Cysteine-rich Wnt-binding domain (CRD) of receptors similar to frizzled 5. The cysteine-rich domain (CRD) is an essential extracellular portion of the frizzled 5 (Fz5) and frizzled 8 (Fz8) receptors, and similar proteins. This domain is required for binding Wnt proteins, which play fundamental roles in many aspects of early development, such as cell and tissue polarity, neural synapse formation, and the regulation of proliferation. Fz proteins serve as Wnt receptors for multiple signal transduction pathways, including both beta-catenin dependent and -independent cellular signaling, as well as the planar cell polarity pathway and Ca(2+) modulating signaling pathway. The CRD domain is well conserved in metazoans - 10 frizzled proteins have been identified in mammals, 4 in Drosophila and 3 in Caenorhabditis elegans. Very little is known about the mechanism by which CRD domains interact with their ligands. The domain contains 10 conserved cysteines.
Probab=98.53  E-value=5.4e-08  Score=100.07  Aligned_cols=62  Identities=10%  Similarity=0.029  Sum_probs=52.3

Q ss_pred             ccccchheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhccccccc
Q psy11546        856 ADSIEKKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHLKHT  933 (1634)
Q Consensus       856 c~~i~~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~p~~  933 (1634)
                      ||+|+.++|..|+||.|.|||++||.+|++     +.+....+..       ++.    +.|+++|..|+|++|+|..
T Consensus         2 Cepi~~~~C~~l~Yn~T~~PN~lgh~~q~e-----a~~~~~~~~p-------Lv~----~~Csp~l~~FlCsl~~P~C   63 (120)
T cd07456           2 CEEITIPMCKGIGYNMTYMPNQFNHDTQEE-----AGLEVHQFWP-------LVE----IQCSPDLKFFLCSMYTPIC   63 (120)
T ss_pred             ccccChhhhCCCCCCcccCCCcCCCcCHHH-----HHHHHHHHHH-------HHh----CCCChhHHHHhHhccCccc
Confidence            999999999999999999999999999988     6555444444       333    4899999999999999954


No 19 
>cd07445 CRD_corin_1 One of two cysteine-rich domains of the corin protein, a type II transmembrane serine protease . The cysteine-rich domain (CRD) is an essential component of corin, a type II transmembrane serine protease which functions as the convertase of the pro-atrial natriuretic peptide (pro-ANP) in the heart. Corin contains two CRDs in its extracellular region, which play an important role in recognition of the physiological substrate, pro-ANP. This model characterizes the first (N-terminal) CRD.
Probab=98.52  E-value=3.6e-08  Score=101.95  Aligned_cols=108  Identities=7%  Similarity=0.038  Sum_probs=70.4

Q ss_pred             CcccccccchheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhccccc
Q psy11546        852 NTSEADSIEKKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHLK  931 (1634)
Q Consensus       852 ~t~~c~~i~~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~p  931 (1634)
                      +|+.||.|+.++|..|+||.|.|||+++|++ ++     +.+...+|       .-++    .++|+++|..|+|++|.|
T Consensus         1 ~~~~CepI~i~~C~~l~YN~T~~PN~lgh~~-~e-----a~~~l~~f-------~pLv----~~~Csp~l~~FLCSlyaP   63 (130)
T cd07445           1 NTSACMNITHSQCQMLPYHSTLKPSLLSVKN-ME-----MEKFLKFF-------SYLH----RLSCYQHIMLFGCSLALP   63 (130)
T ss_pred             CCCccccCChhhhCCCCCccccCCccccccC-HH-----HHHHHHhh-------cccc----ccCcChhhHHHhhhccCc
Confidence            5899999999999999999999999999954 34     33332222       2233    369999999999999999


Q ss_pred             c-cccCCCCCCCCCCCCCCCceeeccC-----CceeeeeeecccccCCCCCcCCC
Q psy11546        932 H-THQHNPDTNISTKAIPSSFNFNQVN-----GIPILTKVYTKVSKTNSTSKERN  980 (1634)
Q Consensus       932 ~-~~~~~p~~~i~t~~ip~~f~~~~v~-----g~~~~~~v~~~~~~c~~~s~~~~  980 (1634)
                      . +.+..    ....--|++..|..|.     -+....-.|+..-.|.++.+..+
T Consensus        64 ~C~~~~~----~~~~i~PCRslCe~vr~~C~~~m~~fg~~WPe~L~C~~fP~~~~  114 (130)
T cd07445          64 ECISDGD----DRHGLLPCRSFCEAAKEGCEPVLGMVNASWPDFLRCSQFRNNTE  114 (130)
T ss_pred             cccCCCC----CCCcCCCCHHHHHHHHHHHHHHHHHhCCCCCCcCCccccCCCCC
Confidence            4 32211    0111125666666511     11111224888888887765543


No 20 
>cd07466 CRD_FZ7 Cysteine-rich Wnt-binding domain (CRD) of the frizzled 7 (Fz7) receptor. The cysteine-rich domain (CRD) is an essential extracellular portion of the frizzled 7 (Fz7) receptor, and is required for binding Wnt proteins, which play fundamental roles in many aspects of early development, such as cell and tissue polarity, neural synapse formation, and the regulation of proliferation. Fz proteins serve as Wnt receptors for multiple signal transduction pathways, including both beta-catenin dependent and -independent cellular signaling, as well as the planar cell polarity pathway and Ca(2+) modulating signaling pathway. CRD containing Fzs have been found in diverse species from amoebas to mammals. 10 different frizzled proteins are found in vertebrata. Xenopus Fz7 is important in Wnt/beta-catenin signaling pathways controlling the transcriptional activation of target genes Siamois and Xnr3 in the animal caps of late blastula.
Probab=98.51  E-value=7.2e-08  Score=99.67  Aligned_cols=65  Identities=11%  Similarity=0.042  Sum_probs=54.2

Q ss_pred             cccccccchheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhcccccc
Q psy11546        853 TSEADSIEKKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHLKH  932 (1634)
Q Consensus       853 t~~c~~i~~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~p~  932 (1634)
                      .+.||+|+.++|..|+||.|.|||+++|.+|++     |......|..       ++.    +.|+++|..|+|++|+|.
T Consensus         2 ~~~CepI~~~~C~~l~Yn~T~~PN~lgH~~q~e-----a~~~~~~f~p-------Lv~----~~Csp~l~~FlCsl~~P~   65 (125)
T cd07466           2 HGFCQPISIPLCTDIAYNQTIMPNLLGHTNQED-----AGLEVHQFYP-------LVK----VQCSPELKFFLCSMYAPV   65 (125)
T ss_pred             CCccccCCHHHhCCCCCCcccCCcccCCCCHHH-----HHHHHHHHHH-------HHh----CCCChhHHHHhhhccCCc
Confidence            478999999999999999999999999999999     6544344433       333    489999999999999995


Q ss_pred             c
Q psy11546        933 T  933 (1634)
Q Consensus       933 ~  933 (1634)
                      .
T Consensus        66 C   66 (125)
T cd07466          66 C   66 (125)
T ss_pred             c
Confidence            3


No 21 
>cd07464 CRD_FZ2 Cysteine-rich Wnt-binding domain (CRD) of the frizzled 2 (Fz2) receptor. The cysteine-rich domain (CRD) is an essential extracellular portion of the frizzled 2 (Fz2) receptor, and is required for binding Wnt proteins, which play fundamental roles in many aspects of early development, such as cell and tissue polarity, neural synapse formation, and the regulation of proliferation. Fz proteins serve as Wnt receptors for multiple signal transduction pathways, including both beta-catenin dependent and -independent cellular signaling, as well as the planar cell polarity pathway and Ca(2+) modulating signaling pathway. CRD containing Fzs have been found in diverse species from amoebas to mammals. 10 different frizzled proteins are found in vertebrata. Fz2 is involved in the Wnt/beta-catenin signaling pathway and in the activation of protein kinase C and calcium/calmodulin-dependent protein kinase (CaM kinase).
Probab=98.48  E-value=7.4e-08  Score=99.78  Aligned_cols=65  Identities=11%  Similarity=0.033  Sum_probs=54.2

Q ss_pred             cccccccchheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhcccccc
Q psy11546        853 TSEADSIEKKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHLKH  932 (1634)
Q Consensus       853 t~~c~~i~~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~p~  932 (1634)
                      .+.||+|++++|..|+||.|.|||+++|.+|++     +......|..       ++.    ..|+++|..|+|++|+|.
T Consensus         2 ~~~CepI~~~~C~~l~Yn~T~~PN~lgH~sq~e-----a~~~~~~f~p-------Lv~----~~Csp~l~~FlCSl~~P~   65 (127)
T cd07464           2 HGFCQPISIPLCTDIAYNQTIMPNLLGHTNQED-----AGLEVHQFYP-------LVK----VQCSLELRFFLCSMYAPV   65 (127)
T ss_pred             CCccccCChhhhCCCCCCcccCCcccCCCCHHH-----HHHHHHHHHH-------HHh----cCCChhHHHHHhhccCCc
Confidence            467999999999999999999999999999998     5544444444       333    489999999999999995


Q ss_pred             c
Q psy11546        933 T  933 (1634)
Q Consensus       933 ~  933 (1634)
                      +
T Consensus        66 C   66 (127)
T cd07464          66 C   66 (127)
T ss_pred             c
Confidence            3


No 22 
>cd07460 CRD_FZ5 Cysteine-rich Wnt-binding domain (CRD) of the frizzled 5 (Fz5) receptor.proteins. The cysteine-rich domain (CRD) is an essential extracellular portion of the frizzled 5 (Fz5) receptor, and is required for binding Wnt proteins, which play fundamental roles in many aspects of early development, such as cell and tissue polarity, neural synapse formation, and the regulation of proliferation. Fz proteins serve as Wnt receptors for multiple signal transduction pathways, including both beta-catenin dependent and -independent cellular signaling, as well as the planar cell polarity pathway and Ca(2+) modulating signaling pathway. CRD containing Fzs have been found in diverse species from amoebas to mammals. 10 different frizzled proteins are found in vertebrata. Fz5 plays critical regulating roles in the yolk sac and placental angiogenesis, in the maturation of the Paneth cell phenotype, in governing the neural potential of progenitors in the developing retina, and in neuronal s
Probab=98.47  E-value=9.5e-08  Score=99.00  Aligned_cols=103  Identities=8%  Similarity=-0.012  Sum_probs=69.4

Q ss_pred             cccccchheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhcccccccc
Q psy11546        855 EADSIEKKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHLKHTH  934 (1634)
Q Consensus       855 ~c~~i~~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~p~~~  934 (1634)
                      .||+|+.++|..|+||.|.|||+++|.+|++     +.+....|..       ++.    ++|+++|..|+|++|+|...
T Consensus         4 ~CepI~~~~C~~l~Yn~T~~PN~lgh~~q~e-----a~~~~~~f~p-------Lv~----~~Csp~l~~FlCS~~~P~C~   67 (127)
T cd07460           4 VCQEITVPMCKGIGYNLTYMPNQFNHDTQDE-----AGLEVHQFWP-------LVE----IQCSPDLRFFLCSMYTPICL   67 (127)
T ss_pred             EecccChhhhCCCCCCcccCCCccCCCCHHH-----HHHHHHHHHH-------HHh----CCCChhHHHHhhhccCcccc
Confidence            4999999999999999999999999999998     6655444444       343    48999999999999999532


Q ss_pred             cCCCCCCCCCCCCCCCceeeccCC-----ceeeeeeecccccCCCCCcC
Q psy11546        935 QHNPDTNISTKAIPSSFNFNQVNG-----IPILTKVYTKVSKTNSTSKE  978 (1634)
Q Consensus       935 ~~~p~~~i~t~~ip~~f~~~~v~g-----~~~~~~v~~~~~~c~~~s~~  978 (1634)
                      .-     ....-.|++-.|+.|..     +....-.|+..-.|.+....
T Consensus        68 ~~-----~~~~i~PCRslCe~vr~~C~~~m~~fg~~WP~~l~C~~fP~~  111 (127)
T cd07460          68 PD-----YRKPLPPCRSVCERAKAGCSPLMRQYGFAWPERMNCDRLPVL  111 (127)
T ss_pred             CC-----CCCcCCccHHHHHHHHHHHHHHHHHhCCCCCCcCChhhCCCC
Confidence            10     11112256666665110     11112236666677665443


No 23 
>KOG1215|consensus
Probab=98.46  E-value=1.3e-07  Score=125.68  Aligned_cols=38  Identities=39%  Similarity=0.823  Sum_probs=34.4

Q ss_pred             CCCCCCCceecC--CCceecCCcccCCCCCCCCCCCCCCCC
Q psy11546        991 SECEMNSSFRCG--NGECVSIGSKCNQLVDCADGSDEKNCS 1029 (1634)
Q Consensus       991 ~~C~~~~~F~C~--nG~CIp~~~~CDG~~DC~DgSDE~~C~ 1029 (1634)
                      ..|. ..+|.|.  +++|||..|+|||..||+||+||..|.
T Consensus       136 ~~~~-~~~~~c~~~~~~Cip~~~~cd~~~~C~dg~de~~~~  175 (877)
T KOG1215|consen  136 SHCC-LDKFSCRTGSCKCIPGDWLCDGEADCPDGSDELNCA  175 (877)
T ss_pred             cccc-CCCCCCcCccccCCCCceeCCCCCccccchhhhccc
Confidence            4555 6889998  899999999999999999999999986


No 24 
>cd07450 CRD_FZ6 Cysteine-rich Wnt-binding domain (CRD) of the frizzled 6 (Fz6) receptor. The cysteine-rich domain (CRD) is an essential extracellular portion of the frizzled 6 (Fz6) receptor, and is required for binding Wnt proteins, which play fundamental roles in many aspects of early development, such as cell and tissue polarity, neural synapse formation, and the regulation of proliferation. Fz proteins serve as Wnt receptors for multiple signal transduction pathways, including both beta-catenin dependent and -independent cellular signaling, as well as the planar cell polarity pathway and Ca(2+) modulating signaling pathway. CRD containing Fzs have been found in diverse species from amoebas to mammals. 10 different frizzled proteins are found in vertebrata. Frizzled 6 (Fz6) is expressed in the skin and hair follicles and controls hair patterning in mammals using a Fz-dependent tissue polarity system, which is similar to the one that patterns the Drosophila cuticle.
Probab=98.46  E-value=5.3e-08  Score=100.77  Aligned_cols=101  Identities=10%  Similarity=0.054  Sum_probs=70.2

Q ss_pred             cccccchheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhcccccccc
Q psy11546        855 EADSIEKKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHLKHTH  934 (1634)
Q Consensus       855 ~c~~i~~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~p~~~  934 (1634)
                      .||+|+.++|+.|+||.|.|||+++|.+|++     |......       |.-++.    ++|+++|..|+|++|+|.+.
T Consensus         4 ~CepI~~~~C~~l~YN~T~~PN~lgH~~q~e-----a~~~~~~-------f~pLv~----~~Csp~l~~FLCS~~~P~C~   67 (127)
T cd07450           4 TCEPITVPRCLKMPYNMTFFPNLMGHYDQDI-----AAVEMEP-------FLPLAN----LRCSPNVHTFLCQAFVPTCT   67 (127)
T ss_pred             eeeeCChhhcCCCCCCcccCCcccCCcCHHH-----HHHHHHH-------HHhHhc----cCCChhhhhheeeeeccccc
Confidence            5999999999999999999999999999988     5444333       333444    48999999999999999532


Q ss_pred             cCCCCCCCCCCCCCCCceeeccCC-----ceeeeeeecccccCCCCCc
Q psy11546        935 QHNPDTNISTKAIPSSFNFNQVNG-----IPILTKVYTKVSKTNSTSK  977 (1634)
Q Consensus       935 ~~~p~~~i~t~~ip~~f~~~~v~g-----~~~~~~v~~~~~~c~~~s~  977 (1634)
                      ..      .....|++-.|..|..     +....-.|+..-.|.+..+
T Consensus        68 ~~------~~~i~PCRslCe~vr~~C~~~m~~fg~~WP~~L~C~~fP~  109 (127)
T cd07450          68 EQ------IHVVRPCRELCEKVYSDCKKLIDTFGISWPEELECDRLQY  109 (127)
T ss_pred             CC------CCcCCCCHHHHHHHHHHHHHHHHHhCCCCCCcCCcccCCC
Confidence            11      0112466666766211     1222335777777776643


No 25 
>cd07447 CRD_Carboxypeptidase_Z Cysteine-rich domain of carboxypeptidase Z, a member of the carboxypeptidase E family. The cysteine-rich-domain (CRD) is an essential part of carboxypeptidase Z, a member of the carboxypeptidase E family of metallocarboxypeptidases. This is a group of Zn-dependent enzymes implicated in the intra- and extracellular processing of proteins. Carboxypeptidase Z removes C-terminal basic amino acid residues from its substrates, particularly arginine. The CRD acts as a ligand-binding domain for Wnts involved in developmental processes. CPZ binds and may process Wnt-4, CPZ has also been found to enhance the induction of the homeobox gene Cdx1. During vertebrate embryogenesis, the CRD of CPZ upregulates Pax3, a Wnt reporter gene essential for patterning of somites and limb development.
Probab=98.45  E-value=7.3e-08  Score=99.96  Aligned_cols=107  Identities=6%  Similarity=-0.017  Sum_probs=70.7

Q ss_pred             ccccccchheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhccccccc
Q psy11546        854 SEADSIEKKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHLKHT  933 (1634)
Q Consensus       854 ~~c~~i~~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~p~~  933 (1634)
                      ..|+.|++++|..|+||.|.|||+++|.+|++     +..+...  +++..|.-++.    ..|+++|..|+|++|+|..
T Consensus         2 ~~CepI~~~~C~~l~Yn~T~~PN~lgH~sq~~-----~~~~~~~--~~l~~f~pLv~----~~C~p~l~~FLCSl~~P~C   70 (128)
T cd07447           2 ATCTDLLLSYCSDVSYTQTTFPNLLGHRSREV-----TEAGAEY--LLLSVLHGLLG----GECNPDIRLLGCSVLAPRC   70 (128)
T ss_pred             CcccCCChhhcCCCCCCcccCCcccCCCCHHH-----HHhhhhH--hHHHHHhhhhh----cCcChhHHhHhhccccccc
Confidence            35999999999999999999999999999876     3322111  12344444554    5899999999999999953


Q ss_pred             ccCCCCCCCCCCCCCCCceeeccCC-----ceeeeeeecccccCCCCCcC
Q psy11546        934 HQHNPDTNISTKAIPSSFNFNQVNG-----IPILTKVYTKVSKTNSTSKE  978 (1634)
Q Consensus       934 ~~~~p~~~i~t~~ip~~f~~~~v~g-----~~~~~~v~~~~~~c~~~s~~  978 (1634)
                      ..   +.    .-.|++-.|..|..     +....-.|+..-.|.++...
T Consensus        71 ~~---~~----~i~PCRslCe~vr~~C~~~m~~fg~~WP~~L~C~~fP~~  113 (128)
T cd07447          71 EN---DK----VIKPCRSTCEALRKRCSHAFDAIQMAWPYFLDCDRFFAG  113 (128)
T ss_pred             CC---CC----ccCCCHHHHHHHHHHHHHHHHHhCCCCCCcCccccCCCc
Confidence            21   11    12256666666211     11112247777777766544


No 26 
>cd07465 CRD_FZ1 Cysteine-rich Wnt-binding domain (CRD) of the frizzled 1 (Fz1) receptor. The cysteine-rich domain (CRD) is an essential extracellular portion of the frizzled 1 (Fz1) receptor, and is required for binding Wnt proteins, which play fundamental roles in many aspects of early development, such as cell and tissue polarity, neural synapse formation, and the regulation of proliferation. Fz proteins serve as Wnt receptors for multiple signal transduction pathways, including both beta-catenin dependent and -independent cellular signaling, as well as the planar cell polarity pathway and Ca(2+) modulating signaling pathway. CRD containing Fzs have been found in diverse species from amoebas to mammals. 10 different frizzled proteins are found in vertebrata.
Probab=98.44  E-value=9.9e-08  Score=98.91  Aligned_cols=65  Identities=11%  Similarity=0.049  Sum_probs=54.2

Q ss_pred             cccccccchheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhcccccc
Q psy11546        853 TSEADSIEKKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHLKH  932 (1634)
Q Consensus       853 t~~c~~i~~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~p~  932 (1634)
                      .+.||+|+.++|..|+||.|.|||+++|.+|++     |....+.+..       ++.    ..|+++|..|+|++|+|.
T Consensus         2 ~~~Cepi~~~~C~~l~Yn~T~~PN~lgh~~q~e-----a~~~~~~f~p-------Lv~----~~Csp~l~~FLCSl~~P~   65 (127)
T cd07465           2 HGYCQPISIPLCTDIAYNQTIMPNLLGHTNQED-----AGLEVHQFYP-------LVK----VQCSAELKFFLCSMYAPV   65 (127)
T ss_pred             CCeeeeCCHHHhCCCCCCcccCCcccCCCCHHH-----HHHHHHHHHH-------HHh----CCCChhhHhHhhhccCCc
Confidence            468999999999999999999999999999999     6544444433       333    489999999999999995


Q ss_pred             c
Q psy11546        933 T  933 (1634)
Q Consensus       933 ~  933 (1634)
                      +
T Consensus        66 C   66 (127)
T cd07465          66 C   66 (127)
T ss_pred             c
Confidence            3


No 27 
>PF00530 SRCR:  Scavenger receptor cysteine-rich domain;  InterPro: IPR001190 The egg peptide speract receptor is a transmembrane glycoprotein []. Other members of this family include the macrophage scavenger receptor type I (a membrane glycoprotein implicated in the pathologic deposition of cholesterol in arterial walls during artherogenesis), an enteropeptidase and T-cell surface glycoprotein CD5 (may act as a receptor in regulating T-cell proliferation).; GO: 0005044 scavenger receptor activity, 0016020 membrane; PDB: 2XRC_D 2JP0_A 2JOP_A 2OYA_A 2OY3_A 1BY2_A 2OTT_Y 2JA4_A.
Probab=98.43  E-value=2.6e-07  Score=90.15  Aligned_cols=90  Identities=13%  Similarity=0.297  Sum_probs=68.6

Q ss_pred             CcceEEEEe-cCcccccccCcchhhhhhhchhhhcchhHHhhccCCCCCccceeeeccC--CCCCcceEEEeccccccch
Q psy11546       1157 DSGFLMIQK-QGQWGKLCMNQINNFIMKTLKWKISDLGKAICKSMTFRDLNEIEAVADP--STDDSIYYELSMNAINSSA 1233 (1634)
Q Consensus      1157 ~eG~VeV~~-~GtWg~VCsd~~~~~~~~~~~~~isdlA~vICRqLG~~~~~sve~v~~~--~~~~~~y~~vscn~~e~s~ 1233 (1634)
                      ++|+|+++. +|.|+++|.++|+..           .|.++||+|||+.+.........  ...+..+..+.|.+.+..+
T Consensus         7 ~~G~vev~~~~~~W~~vC~~~w~~~-----------~a~vvCrqLG~~~~~~~~~~~~~~~~~~~~~~~~~~C~g~e~~l   75 (99)
T PF00530_consen    7 CEGRVEVCFYNGSWGTVCDDGWSMN-----------EANVVCRQLGYGGASSTSSSSSFGSGSGPIWLSNVNCSGNETSL   75 (99)
T ss_dssp             TEEEEEEEEETTEEEEEBTTT-SHH-----------HHHHHHHHTTSSEEEEEEECTTTTSS-S-ECECCEEEBSSTSSG
T ss_pred             eeEEEEEEEECCEEEeEECCCcchh-----------HHHHHhhhhhccccceeeeecccccccccccccCCEecCCCCcH
Confidence            899999965 999999999999876           78999999999766555433332  1334455688899988899


Q ss_pred             hcccccccccccCCCCceeeeecc
Q psy11546       1234 ATKSALLFQKTKCSQKQVVAVNCS 1257 (1634)
Q Consensus      1234 s~c~~l~~~~~~C~s~~vV~V~C~ 1257 (1634)
                      ..|....+....|.+...+.|.|+
T Consensus        76 ~~C~~~~~~~~~C~~~~~~~V~Cs   99 (99)
T PF00530_consen   76 SECTHSPWTSSSCSHSEDAGVICS   99 (99)
T ss_dssp             GGSBSSECCCSS-SGGGBEEEEEE
T ss_pred             hhCCCcCCCCCCCCCCccEEEEEC
Confidence            999887777788988788999884


No 28 
>cd07463 CRD_FZ9 Cysteine-rich Wnt-binding domain (CRD) of the frizzled 9 (Fz9) receptor. The cysteine-rich domain (CRD) is an essential extracellular portion of the frizzled 9 (Fz9) receptor, and is required for binding Wnt proteins, which play fundamental roles in many aspects of early development, such as cell and tissue polarity, neural synapse formation, and the regulation of proliferation. Fz proteins serve as Wnt receptors for multiple signal transduction pathways, including both beta-catenin dependent and -independent cellular signaling, as well as the planar cell polarity pathway and Ca(2+) modulating signaling pathway. CRD containing Fzs have been found in diverse species from amoebas to mammals. 10 different frizzled proteins are found in vertebrata. Fz9 may play a signaling role in lymphoid development and maturation, particularly at points where B cells undergo self-renewal prior to further differentiation.
Probab=98.43  E-value=1.5e-07  Score=97.55  Aligned_cols=103  Identities=10%  Similarity=0.059  Sum_probs=69.4

Q ss_pred             ccccccchheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhccccccc
Q psy11546        854 SEADSIEKKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHLKHT  933 (1634)
Q Consensus       854 ~~c~~i~~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~p~~  933 (1634)
                      +.||+|+.++|..|+||.|.|||+|+|.+|++     +....+.+..       ++.    .+|+++|..|+|++|+|..
T Consensus         3 ~~CepI~~~~C~~l~Yn~T~~PN~lgh~sq~e-----a~~~~~~f~p-------Lv~----~~Csp~l~~FlCS~~~P~C   66 (127)
T cd07463           3 AKCQPVVIPMCRGIGYNLTRMPNFLGHDSQRE-----AAIKLNEFAP-------LVE----YGCHVHLRFFLCSLYAPMC   66 (127)
T ss_pred             CccccCChhhhCCCCcCcccCCcccCCcCHHH-----HHHHHHHHHH-------HHh----cCCChhhHHHhhhcccccc
Confidence            57999999999999999999999999999988     5544333444       343    4899999999999999953


Q ss_pred             ccCCCCCCCCCCCC-CCCceeeccCC-----ceeeeeeecccccCCCCCcC
Q psy11546        934 HQHNPDTNISTKAI-PSSFNFNQVNG-----IPILTKVYTKVSKTNSTSKE  978 (1634)
Q Consensus       934 ~~~~p~~~i~t~~i-p~~f~~~~v~g-----~~~~~~v~~~~~~c~~~s~~  978 (1634)
                      ...     .+ +.| |++-.|+.|..     +....-.|+..-.|.+....
T Consensus        67 ~~~-----~~-~~i~PCRslCe~vr~~C~~~m~~fg~~WP~~l~C~~fP~~  111 (127)
T cd07463          67 TDQ-----VS-TSIPACRPMCEQARQKCSPIMEQFNFGWPESLDCSRLPTR  111 (127)
T ss_pred             CCC-----CC-CcCCccHHHHHHHHHHHHHHHHHhCCCCCCcCChhhCCCC
Confidence            211     11 122 56666666111     11112236666666655443


No 29 
>cd07448 CRD_FZ4 Cysteine-rich Wnt-binding domain of the frizzled 4 (Fz4) receptor. The cysteine-rich domain (CRD) is an essential extracellular portion of the frizzled 4 (Fz4) receptor, and is required for binding Wnt proteins, which play fundamental roles in many aspects of early development, such as cell and tissue polarity, neural synapse formation, and the regulation of proliferation. Fz proteins serve as Wnt receptors for multiple signal transduction pathways, including  both beta-catenin dependent and -independent cellular signaling, as well as the planar cell polarity pathway and the Ca(2+) modulating signaling pathway. CRD containing Fzs have been found in diverse species from amoebas to mammals. 10 different frizzled proteins are found in vertebrata. Frizzled 4 (Fz4) activates the Ca(2+)/calmodulin-dependent protein kinase II and protein kinase C of the Wnt/Ca(2+) signaling pathway during retinal angiogenesis. Mutations in Fz4 lead to familial exudative vitreoretinopathy (FEVR
Probab=98.41  E-value=1.5e-07  Score=97.49  Aligned_cols=81  Identities=11%  Similarity=0.096  Sum_probs=60.4

Q ss_pred             ccccccchheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhccccccc
Q psy11546        854 SEADSIEKKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHLKHT  933 (1634)
Q Consensus       854 ~~c~~i~~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~p~~  933 (1634)
                      ..|++|++++|..|+||.|.|||+++|.+|++     +...       ++.|..++.    +.|+++|..|+|++|+|..
T Consensus         2 ~~Cepi~~~~C~~l~Yn~T~~PN~lgh~sq~e-----a~~~-------l~~~~pLv~----~~Csp~l~~flCsl~~P~C   65 (126)
T cd07448           2 RRCEPIRIEMCQGLGYNVTRMPNLVGHELQTD-----AELQ-------LQTFTPLIQ----YGCSSQLKFFLCSVYVPMC   65 (126)
T ss_pred             CeeeeCCHHHhCCCCcccccCCCcCCCcCHHH-----HHHH-------HHHHHHHhh----cCCChhHHhHhhhcccccc
Confidence            46999999999999999999999999999999     5543       333444444    4899999999999999953


Q ss_pred             ccCCCCCCCCCCCCCCCceeec
Q psy11546        934 HQHNPDTNISTKAIPSSFNFNQ  955 (1634)
Q Consensus       934 ~~~~p~~~i~t~~ip~~f~~~~  955 (1634)
                      ..-     .+..-.|++-.|..
T Consensus        66 ~~~-----~~~~i~PCRslCe~   82 (126)
T cd07448          66 TEK-----VPVPIGPCRPLCLS   82 (126)
T ss_pred             CCC-----CCCcCCcCHHHHHH
Confidence            211     11122366666666


No 30 
>PF00057 Ldl_recept_a:  Low-density lipoprotein receptor domain class A This prints entry is specific to LDL receptor;  InterPro: IPR002172  The low-density lipoprotein receptor (LDLR) is the major cholesterol-carrying lipoprotein of plasma, acting to regulate cholesterol homeostasis in mammalian cells. The LDL receptor binds LDL and transports it into cells by acidic endocytosis. In order to be internalized, the receptor-ligand complex must first cluster into clathrin-coated pits. Once inside the cell, the LDLR separates from its ligand, which is degraded in the lysosomes, while the receptor returns to the cell surface []. The internal dissociation of the LDLR with its ligand is mediated by proton pumps within the walls of the endosome that lower the pH. The LDLR is a multi-domain protein, containing:    The ligand-binding domain contains seven or eight 40-amino acid LDLR class A (cysteine-rich) repeats, each of which contains a coordinated calcium ion and six cysteine residues involved in disulphide bond formation []. Similar domains have been found in other extracellular and membrane proteins [].      The second conserved region contains two EGF repeats, followed by six LDLR class B (YWTD) repeats, and another EGF repeat. The LDLR class B repeats each contain a conserved YWTD motif, and is predicted to form a beta-propeller structure []. This region is critical for ligand release and recycling of the receptor [].     The third domain is rich in serine and threonine residues and contains clustered O-linked carbohydrate chains.     The fourth domain is the hydrophobic transmembrane region.     The fifth domain is the cytoplasmic tail that directs the receptor to clathrin-coated pits.   LDLR is closely related in structure to several other receptors, including LRP1, LRP1b, megalin/LRP2, VLDL receptor, lipoprotein receptor, MEGF7/LRP4, and LRP8/apolipoprotein E receptor2); these proteins participate in a wide range of physiological processes, including the regulation of lipid metabolism, protection against atherosclerosis, neurodevelopment, and transport of nutrients and vitamins []. This entry represents the LDLR class A (cyateine-rich) repeat, which contains 6 disulphide-bound cysteines and a highly conserved cluster of negatively charged amino acids, of which many are clustered on one face of the module []. In LDL receptors, the class A domains form the binding site for LDL and calcium. The acidic residues between the fourth and sixth cysteines are important for high-affinity binding of positively charged sequences in LDLR's ligands. The repeat consists of a beta-hairpin structure followed by a series of beta turns. In the absence of calcium, LDL-A domains are unstructured; the bound calcium ion imparts structural integrity. Following these repeats is a 350 residue domain that resembles part of the epidermal growth factor (EGF) precursor. Numerous familial hypercholestorolemia mutations of the LDL receptor alter the calcium coordinating residue of LDL-A domains or other crucial scaffolding residues. ; GO: 0005515 protein binding; PDB: 2I1P_A 3OJY_A 4E0S_B 3T5O_A 4A5W_B 1JRF_A 1K7B_A 1V9U_5 3DPR_E 2KNY_A ....
Probab=98.41  E-value=1.9e-07  Score=77.12  Aligned_cols=36  Identities=50%  Similarity=1.268  Sum_probs=33.6

Q ss_pred             CCCCCceecCCCCeeeccccccCCCccCCCCCCcccc
Q psy11546       1061 WCSPGQYICPNSRVCIERTRLCDGIKDCPLGDDEKQC 1097 (1634)
Q Consensus      1061 ~C~~~~F~C~~~~~CI~~~~~CDG~~DC~DGSde~~c 1097 (1634)
                      .|++++|.|.++. ||+..++|||+.||.|||||.+|
T Consensus         2 ~C~~~~f~C~~~~-CI~~~~~CDg~~DC~dgsDE~~C   37 (37)
T PF00057_consen    2 TCPPGEFRCGNGQ-CIPKSWVCDGIPDCPDGSDEQNC   37 (37)
T ss_dssp             SSSTTEEEETTSS-EEEGGGTTSSSCSSSSSTTTSSH
T ss_pred             cCcCCeeEcCCCC-EEChHHcCCCCCCCCCCcccccC
Confidence            5889999998776 99999999999999999999887


No 31 
>cd07461 CRD_FZ8 Cysteine-rich Wnt-binding domain (CRD) of the frizzled 8 (Fz8) receptor. The cysteine-rich domain (CRD) is an essential extracellular portion of the frizzled 8 (Fz8) receptor, and is required for binding Wnt proteins, which play fundamental roles in many aspects of early development, such as cell and tissue polarity, neural synapse formation, and the regulation of proliferation. Fz proteins serve as Wnt receptors for multiple signal transduction pathways, including both beta-catenin dependent and -independent cellular signaling, as well as the planar cell polarity pathway and Ca(2+) modulating signaling pathway. CRD containing Fzs have been found in diverse species from amoebas to mammals. 10 different frizzled proteins are found in vertebrata. Xenopus Fz8 is important in Wnt/beta-catenin signaling pathways controlling the transcriptional activation of target genes Siamois and Xnr3 in the animal caps of late blastula.
Probab=98.41  E-value=1.8e-07  Score=96.85  Aligned_cols=104  Identities=8%  Similarity=0.021  Sum_probs=69.2

Q ss_pred             ccccccchheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhccccccc
Q psy11546        854 SEADSIEKKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHLKHT  933 (1634)
Q Consensus       854 ~~c~~i~~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~p~~  933 (1634)
                      ..|++|++++|..|+||.|.|||+++|.+|++     +......|..+       +.    +.|+++|..|+|++|+|..
T Consensus         3 ~~C~pI~~~~C~~l~Yn~T~~PN~l~H~sq~e-----a~~~~~~f~pL-------v~----~~Csp~l~~FlCSl~~P~C   66 (125)
T cd07461           3 LQCQEITVPLCKGIGYNYTYMPNQFNHDTQDE-----AGLEVHQFWPL-------VE----IQCSPDLKFFLCSMYTPIC   66 (125)
T ss_pred             CcccccCHHHhCCCCCCcccCCCccCCCCHHH-----HHHHHHHHHHH-------Hh----CCCChhHHHHhhhccCccc
Confidence            46999999999999999999999999999988     55544444443       33    4899999999999999943


Q ss_pred             ccCCCCCCCCCCCCCCCceeeccCC-----ceeeeeeecccccCCCCCcC
Q psy11546        934 HQHNPDTNISTKAIPSSFNFNQVNG-----IPILTKVYTKVSKTNSTSKE  978 (1634)
Q Consensus       934 ~~~~p~~~i~t~~ip~~f~~~~v~g-----~~~~~~v~~~~~~c~~~s~~  978 (1634)
                      ..-     ....-.|++-.|+.|..     +....-.|+..-.|.+....
T Consensus        67 ~~~-----~~~~i~PCRslCe~vr~~C~~~m~~~g~~WP~~l~C~~fP~~  111 (125)
T cd07461          67 LED-----YKKPLPPCRSVCERAKAGCAPLMRQYGFPWPDRMRCDLLPEQ  111 (125)
T ss_pred             CCC-----CCCcCCccHHHHHHHHHHHHHHHHHhCCCCCCcCChhhCCCC
Confidence            210     01112356666666110     11111136666666655443


No 32 
>cd07449 CRD_FZ3 Cysteine-rich Wnt-binding domain (CRD) of the frizzled 3 (Fz3) receptor. The cysteine-rich domain (CRD) is an essential extracellular portion of the frizzled 3 (Fz3) receptor, and is required for binding Wnt proteins, which play fundamental roles in many aspects of early development, such as cell and tissue polarity, neural synapse formation, and the regulation of proliferation. Fz proteins serve as Wnt receptors for multiple signal transduction pathways, including both beta-catenin dependent and -independent cellular signaling, as well as the planar cell polarity pathway and Ca(2+) modulating signaling pathway. CRD containing Fzs have been found in diverse species from amoebas to mammals. 10 different frizzled proteins are found in vertebrata. Fz3 plays a vital role in the anterior-posterior guidance of commissural axons. Knockout mice without Fz3 show defects in fiber tracts in the rostral CNS.
Probab=98.40  E-value=8.5e-08  Score=99.18  Aligned_cols=102  Identities=8%  Similarity=0.103  Sum_probs=70.6

Q ss_pred             cccccchheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhcccccccc
Q psy11546        855 EADSIEKKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHLKHTH  934 (1634)
Q Consensus       855 ~c~~i~~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~p~~~  934 (1634)
                      .|++|++++|..|+||.|.|||+++|.+|++     |++.       ++.|.-++.    +.|+++|..|+|++|+|...
T Consensus         4 ~CepI~~~~C~~l~Yn~T~~PN~lgH~~q~e-----a~~~-------~~~f~pLv~----~~Csp~l~~FLCSly~P~C~   67 (127)
T cd07449           4 SCEPITLRMCQDLPYNTTFMPNLLNHYDQQT-----AALA-------MEPFHPMVN----LECSRDFRPFLCALYAPVCM   67 (127)
T ss_pred             eeeeCChhhhCCCCCCcccCCcccCCcCHHH-----HHHH-------HHhhccccc----CCcChhhHhHhheeeccccC
Confidence            5999999999999999999999999999998     5543       344555555    58999999999999999532


Q ss_pred             cCCCCCCCCCCCCCCCceeeccCC-----ceeeeeeecccccCCCCCcC
Q psy11546        935 QHNPDTNISTKAIPSSFNFNQVNG-----IPILTKVYTKVSKTNSTSKE  978 (1634)
Q Consensus       935 ~~~p~~~i~t~~ip~~f~~~~v~g-----~~~~~~v~~~~~~c~~~s~~  978 (1634)
                      ..  +    ..-.|++-.|+.|..     +....-.|+..-.|.+..+.
T Consensus        68 ~~--~----~~i~PCRslCe~vr~~C~~~m~~fg~~WP~~L~C~~fP~~  110 (127)
T cd07449          68 EY--G----RVTLPCRRLCQRAYSECSKLMEMFGVPWPEDMECSRFPDC  110 (127)
T ss_pred             CC--C----CCCCCCHHHHHHHHHHHHHHHHHhCCCCCCcCCcccCCCC
Confidence            10  0    112366666666211     11112247777777766543


No 33 
>cd07457 CRD_FZ9_like Cysteine-rich Wnt-binding domain (CRD) of receptors similar to frizzled 9. The cysteine-rich domain (CRD) is an essential extracellular portion of the frizzled 9 (Fz9) and frizzled 10 (Fz10) receptors, and similar proteins. This domain is required for binding Wnt proteins, which play fundamental roles in many aspects of early development, such as cell and tissue polarity, neural synapse formation, and the regulation of proliferation. Fz proteins serve as Wnt receptors for multiple signal transduction pathways, including both beta-catenin dependent and -independent cellular signaling, as well as the planar cell polarity pathway and Ca(2+) modulating signaling pathway. The CRD domain is well conserved in metazoans - 10 frizzled proteins have been identified in mammals, 4 in Drosophila and 3 in Caenorhabditis elegans. Very little is known about the mechanism by which CRD domains interact with their ligands. The domain contains 10 conserved cysteines.
Probab=98.39  E-value=2e-07  Score=96.05  Aligned_cols=103  Identities=8%  Similarity=0.085  Sum_probs=68.2

Q ss_pred             ccccccchheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhccccccc
Q psy11546        854 SEADSIEKKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHLKHT  933 (1634)
Q Consensus       854 ~~c~~i~~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~p~~  933 (1634)
                      +.||+|++++|..|+||.|.|||+++|.+|++     +...-.       .|..++.    ++|+++|..|+|++|+|..
T Consensus         1 ~~CepI~~~~C~~l~Yn~T~~PN~lgh~~q~e-----a~~~~~-------~~~pLv~----~~Csp~l~~FlCsl~~P~C   64 (121)
T cd07457           1 GKCERITIPMCQGIGYNMTRMPNLLGHESQSE-----AAISIH-------EFAPLVQ----YGCAEHLRFFLCSLYAPMC   64 (121)
T ss_pred             CcCccCChHhhCCCCCCceeCCCcCCCcCHHH-----HHHHHH-------HHHHHHh----cCCChhHHHHhhhccCccc
Confidence            46999999999999999999999999999988     544333       3333443    3899999999999999953


Q ss_pred             ccCCCCCCCCCCCCCCCceeeccCC-----ceeeeeeecccccCCCCCc
Q psy11546        934 HQHNPDTNISTKAIPSSFNFNQVNG-----IPILTKVYTKVSKTNSTSK  977 (1634)
Q Consensus       934 ~~~~p~~~i~t~~ip~~f~~~~v~g-----~~~~~~v~~~~~~c~~~s~  977 (1634)
                      ...     .+..-.|++-.|+.|..     +....-.|+....|.+...
T Consensus        65 ~~~-----~~~~i~PCRslCe~vr~~C~~~m~~~g~~WP~~l~C~~fP~  108 (121)
T cd07457          65 TEQ-----VSIPIPACRSMCEQARDKCSPIMEQFSFSWPDSLDCDRLPR  108 (121)
T ss_pred             CCC-----CCCcCCccHHHHHHHHHHHHHHHHHhCCCCCCcCCcccCCC
Confidence            211     11112256666666211     1111223666666665543


No 34 
>cd07458 CRD_FZ1_like Cysteine-rich Wnt-binding domain (CRD) of receptors similar to frizzled 1. The cysteine-rich domain (CRD) is an essential extracellular portion of the frizzled 1 (Fz1), frizzled 2 (Fz2), and frizzled 7 (Fz7) receptors, and similar proteins. This domain is required for binding Wnt proteins, which play fundamental roles in many aspects of early development, such as cell and tissue polarity, neural synapse formation, and the regulation of proliferation. Fz proteins serve as Wnt receptors for multiple signal transduction pathways, including both beta-catenin dependent and -independent cellular signaling, as well as the planar cell polarity pathway and Ca(2+) modulating signaling pathway. The CRD domain is well conserved in metazoans - 10 frizzled proteins have been identified in mammals, 4 in Drosophila and 3 in Caenorhabditis elegans. Very little is known about the mechanism by which CRD domains interact with their ligands. The domain contains 10 conserved cysteines.
Probab=98.38  E-value=1.9e-07  Score=95.96  Aligned_cols=64  Identities=9%  Similarity=0.047  Sum_probs=52.6

Q ss_pred             ccccccchheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhccccccc
Q psy11546        854 SEADSIEKKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHLKHT  933 (1634)
Q Consensus       854 ~~c~~i~~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~p~~  933 (1634)
                      +.||+|++++|..|+||.|.|||+++|.+|++     +...-..+..       ++.    ..|+++|..|+|++|+|..
T Consensus         1 ~~C~pi~~~~C~~l~Yn~T~~PN~lgh~~q~e-----a~~~l~~~~p-------Lv~----~~C~p~l~~FlCsl~~P~C   64 (119)
T cd07458           1 GKCEPITIPLCTDIPYNMTIFPNLLGHTKQED-----AGLEVHQFYP-------LVK----VQCSPDLKFFLCSVYAPVC   64 (119)
T ss_pred             CccccCChHHhCCCCCCccCCCCccCCCCHHH-----HHHHHHHHHH-------HHh----CCCChhHHHHhhhcccccc
Confidence            46999999999999999999999999999998     5443333333       333    4899999999999999953


No 35 
>cd00112 LDLa Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about 40 amino acids are present in the N-terminal of this multidomain membrane protein; other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement; the binding of calcium is required for in vitro formation of the native disulfide isomer and is necessary in establishing and maintaining the modular structure
Probab=98.37  E-value=1.8e-07  Score=76.31  Aligned_cols=35  Identities=43%  Similarity=1.079  Sum_probs=32.4

Q ss_pred             CCCCceecCCCCeeeccccccCCCccCCCCCCcccc
Q psy11546       1062 CSPGQYICPNSRVCIERTRLCDGIKDCPLGDDEKQC 1097 (1634)
Q Consensus      1062 C~~~~F~C~~~~~CI~~~~~CDG~~DC~DGSde~~c 1097 (1634)
                      |++++|+|.+ +.||+..++|||+.||+|||||..|
T Consensus         1 C~~~~f~C~~-~~Ci~~~~~CDg~~DC~dgsDE~~C   35 (35)
T cd00112           1 CPPNEFRCAN-GRCIPSSWVCDGEDDCGDGSDEENC   35 (35)
T ss_pred             CCCCeEEcCC-CCeeCHHHcCCCccCCCCCcccccC
Confidence            6789999987 5699999999999999999999977


No 36 
>smart00063 FRI Frizzled. Drosophila melanogaster frizzled mediates signalling that polarises a precursor cell along the anteroposterior axis. Homologues of the  N-terminal region of frizzled exist either as transmembrane or secreted molecules. Frizzled homologues are reported to be receptors for the Wnt growth factors. (Not yet in MEDLINE: the FRI domain occurs in several receptor tyrosine kinases [Xu, Y.K. and Nusse, Curr. Biol. 8 R405-R406 (1998); Masiakowski, P. and Yanopoulos, G.D., Curr. Biol. 8, R407 (1998)].
Probab=98.33  E-value=3e-07  Score=93.72  Aligned_cols=78  Identities=13%  Similarity=0.141  Sum_probs=59.6

Q ss_pred             ccccchheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhccccccccc
Q psy11546        856 ADSIEKKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHLKHTHQ  935 (1634)
Q Consensus       856 c~~i~~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~p~~~~  935 (1634)
                      |++|++++|..|+||.|.|||+++|.+|++     +...       ++.|..++.    +.|+++|..|+|++|+|....
T Consensus         1 C~pi~~~~C~~l~Yn~T~~PN~l~h~~q~e-----a~~~-------l~~~~~L~~----~~C~p~l~~flCsl~~P~C~~   64 (113)
T smart00063        1 CEPITIPLCKDLGYNLTSMPNLLGHTTQEE-----AGLE-------LEQFHPLLN----VQCSPDLRFFLCSVYAPICTE   64 (113)
T ss_pred             CcCCCcHhHCCCCcccccCCCccCCCCHHH-----HHHH-------HHHHHhhcc----CCCChhHHHHhHhccCcccCC
Confidence            899999999999999999999999999988     4432       455555665    479999999999999995321


Q ss_pred             CCCCCCCCCCCCCCCceeec
Q psy11546        936 HNPDTNISTKAIPSSFNFNQ  955 (1634)
Q Consensus       936 ~~p~~~i~t~~ip~~f~~~~  955 (1634)
                      .   +.   ...|++-.|..
T Consensus        65 ~---~~---~~~PCRs~Ce~   78 (113)
T smart00063       65 D---LR---PILPCRSLCEA   78 (113)
T ss_pred             C---CC---cCCcCHHHHHH
Confidence            1   10   23466666666


No 37 
>cd07452 CRD_sizzled Cysteine-rich domain of the sizzled protein. The cysteine-rich domain (CRD) is an essential part of the sizzled protein, which regulates bone morphogenetic protein (Bmp) signaling by stabilizing chordin, and plays a critical role in the patterning of vertebrate and invertebrate embryos. Sizzled also functions in the ventral region as a Wnt inhibitor and modulates canonical Wnt signaling. Sizzled proteins belong to the secreted frizzled-related protein family (SFRP), and have be identified in the genomes of birds, fishes and frogs, but not mammals.
Probab=98.30  E-value=3.4e-07  Score=96.25  Aligned_cols=81  Identities=6%  Similarity=-0.059  Sum_probs=61.1

Q ss_pred             cCcccccccc--hheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhcc
Q psy11546        851 NNTSEADSIE--KKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQ  928 (1634)
Q Consensus       851 ~~t~~c~~i~--~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~  928 (1634)
                      -.++.|+.|.  +++|..|+||.|.|||+++|.+|++     +......+..       ++.    ..|+++|..|+|++
T Consensus         4 ~~s~~C~pIp~~l~~C~~l~Yn~T~~PN~lgH~sq~e-----a~~~~~~f~p-------Lv~----~~C~p~l~~FlCSl   67 (141)
T cd07452           4 GLSTKCVPIPPEMSMCQDVGYSEMRLPNLLGHTSMAE-----VVPKSADWQT-------LLH----TGCHPHARTFLCSL   67 (141)
T ss_pred             CCCCcceeCCCcchhhcCCCCCcccCCcccCCCCHHH-----HHHHHHHHHH-------HHh----cCCCHHHHHHhHhh
Confidence            3577899996  9999999999999999999999998     5443333333       333    48999999999999


Q ss_pred             cccccccCCCCCCCCCCCCCCCceeec
Q psy11546        929 HLKHTHQHNPDTNISTKAIPSSFNFNQ  955 (1634)
Q Consensus       929 ~~p~~~~~~p~~~i~t~~ip~~f~~~~  955 (1634)
                      |.|....    +.+    .|++-.|..
T Consensus        68 ~~P~C~~----~~i----~PCRslCe~   86 (141)
T cd07452          68 FAPVCLD----TFI----QPCRSMCVA   86 (141)
T ss_pred             cccCCCC----CCC----cCCHHHHHH
Confidence            9995432    111    366666666


No 38 
>smart00192 LDLa Low-density lipoprotein receptor domain class A. Cysteine-rich repeat in the low-density lipoprotein (LDL) receptor that  plays a central role in mammalian cholesterol metabolism. The N-terminal  type A repeats in LDL receptor bind the lipoproteins. Other homologous  domains occur in related receptors, including the very low-density  lipoprotein receptor and the LDL receptor-related protein/alpha  2-macroglobulin receptor, and in proteins which are functionally  unrelated, such as the C9 component of complement. Mutations in the LDL receptor gene cause familial hypercholesterolemia.
Probab=98.30  E-value=4.1e-07  Score=73.28  Aligned_cols=32  Identities=44%  Similarity=1.098  Sum_probs=29.9

Q ss_pred             CCCCCceecCCCceecCCcccCCCCCCCCCCCC
Q psy11546        993 CEMNSSFRCGNGECVSIGSKCNQLVDCADGSDE 1025 (1634)
Q Consensus       993 C~~~~~F~C~nG~CIp~~~~CDG~~DC~DgSDE 1025 (1634)
                      |. ..+|+|.+|.||+..++|||++||+|||||
T Consensus         2 C~-~~~f~C~~~~Ci~~~~~Cdg~~dC~dgsDE   33 (33)
T smart00192        2 CP-PGEFQCDNGRCIPLSWVCDGVDDCSDGSDE   33 (33)
T ss_pred             CC-CCeEECCCCCEECchhhCCCcCcCcCCCCC
Confidence            55 469999999999999999999999999998


No 39 
>cd07444 CRD_SFRP5 Cysteine-rich domain of the secreted frizzled-related protein 5 (SFRP5), a regulator of Wnt activity. The cysteine-rich domain (CRD) is an essential part of the secreted frizzled-related Protein 5 (SFRP5), which regulates the activity of Wnt proteins, key players in a number of fundamental cellular processes such as embryogenesis and postnatal development. SFRPs antagonize the activation of Wnt signaling by binding to the CRD domains of frizzled (Fz) proteins, thereby preventing Wnt proteins from binding to these receptors. SFRPs are also known to have functions unrelated to Wnt, as enhancers of procollagen cleavage by the TLD proteinases. SFRPs and Fz proteins both contain CRD domains, but SFRPs lack the seven-pass transmembrane domain which is an integral part of Fzs.
Probab=98.29  E-value=2.7e-07  Score=95.59  Aligned_cols=101  Identities=5%  Similarity=-0.058  Sum_probs=69.9

Q ss_pred             cccccccc--hheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhcccc
Q psy11546        853 TSEADSIE--KKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHL  930 (1634)
Q Consensus       853 t~~c~~i~--~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~  930 (1634)
                      ++.|+.|.  +++|..|+||.|.|||+++|.+|++     +......+..+      +.     ..|+++|..|+|++|.
T Consensus         4 ~~~C~pIp~~l~~C~~l~Yn~T~~PN~lgH~~q~e-----a~~~~~~~~pL------v~-----~~C~p~l~~FlCSl~~   67 (127)
T cd07444           4 QPQCVDIPADLPLCHNVGYKRMRLPNLLEHESMAE-----VKQQASSWVPL------LA-----KRCHADTQVFLCSLFA   67 (127)
T ss_pred             CCCceECCCCchhhcCCCCCceeCCcccCCCCHHH-----HHHHHHHHHHH------Hh-----cccCHHHHHHhhhhcC
Confidence            57899997  9999999999999999999999999     66554444443      23     3899999999999999


Q ss_pred             cccccCCCCCCCCCCCCCCCceeeccCCc-----eeeeeeecccccCCCCCc
Q psy11546        931 KHTHQHNPDTNISTKAIPSSFNFNQVNGI-----PILTKVYTKVSKTNSTSK  977 (1634)
Q Consensus       931 p~~~~~~p~~~i~t~~ip~~f~~~~v~g~-----~~~~~v~~~~~~c~~~s~  977 (1634)
                      |....        ..-.|++-.|+.|..-     ....-.|+..-+|.++..
T Consensus        68 P~C~~--------~~i~PCRslCe~vr~~C~~~m~~~g~~WP~~l~C~~fP~  111 (127)
T cd07444          68 PVCLD--------RPIYPCRSLCEAVRDSCAPVMESYGFPWPEMLHCHKFPL  111 (127)
T ss_pred             CCCCC--------CCCCCcHHHHHHHHHHHHHHHHHcCCCCCCCCChhhCCC
Confidence            95421        1123666667662110     001113666666665543


No 40 
>cd07446 CRD_SFRP2 Cysteine-rich domain of the secreted frizzled-related protein 2 (SFRP2), a regulator of Wnt activity. The cysteine-rich-domain (CRD) is an essential part of the secreted frizzled related protein 2 (SFRP2), which regulates the activity of Wnt  proteins, key players in a number of fundamental cellular processes such as embryogenesis and postnatal development. SFRPs antagonize the activation of Wnt signaling by binding to CRD domains of frizzled (Fz) proteins, thereby preventing Wnt proteins from binding to these receptors. SFRPs and Fz proteins both contain CRD domains, but SFRPs lack the seven-pass transmembrane domain which is an integral part of Fzs. As a Wnt antagonist, SFRP2 regulates Nkx2.2  expression in the ventral spinal cord and anteroposterior axis elongation. SFRP2 also has a Wnt-independent function as an enhancer of procollagen cleavage by the TLD proteinases. SFRP2 binds both procollagen and TLD, thus facilitating the enzymatic reaction by bringing togeth
Probab=98.23  E-value=8.7e-07  Score=92.12  Aligned_cols=81  Identities=6%  Similarity=-0.048  Sum_probs=59.8

Q ss_pred             ccccccc--hheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhccccc
Q psy11546        854 SEADSIE--KKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHLK  931 (1634)
Q Consensus       854 ~~c~~i~--~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~p  931 (1634)
                      +.|+.|.  +++|..|+||.|.|||+++|.+|++     |......+..+       +.    ..|+++|..|+|++|+|
T Consensus         3 s~C~pI~~~l~~C~~l~Yn~T~~PN~lgH~sq~e-----a~~~~~~~~pL-------v~----~~C~p~l~~FlCSl~~P   66 (128)
T cd07446           3 SNCKPIPANMLLCHGIEYTNMRLPNLLGHETMKE-----VLQQAGSWIPL-------VQ----KQCHPDTKKFLCSLFAP   66 (128)
T ss_pred             CCceECCCCchhhcCCCCCcccCCcccCCCCHHH-----HHHHHHHHHHH-------Hh----cCCChhhHHHhhhccCc
Confidence            6799998  8899999999999999999999999     65543334333       22    49999999999999999


Q ss_pred             ccccCCCCCCCCCCCCCCCceeec
Q psy11546        932 HTHQHNPDTNISTKAIPSSFNFNQ  955 (1634)
Q Consensus       932 ~~~~~~p~~~i~t~~ip~~f~~~~  955 (1634)
                      ......     ...-.|++-.|..
T Consensus        67 ~C~~~~-----~~~i~PCRslCe~   85 (128)
T cd07446          67 VCLDDL-----DEAIQPCRSLCEA   85 (128)
T ss_pred             cccCCC-----CCcCCccHHHHHH
Confidence            532211     1122356666666


No 41 
>cd07443 CRD_SFRP1 Cysteine-rich domain of the secreted frizzled-related protein 1 (SFRP1), a regulator of Wnt activity. The cysteine-rich domain (CRD) is an essential part of the secreted frizzled-related protein 1 (SFRP1), which regulates the activity of Wnt proteins, key players in a number of fundamental cellular processes such as embryogenesis and postnatal development. SFRPs antagonize the activation of Wnt signaling by binding to the CRDs domains of frizzled (Fz) proteins, thereby preventing Wnt proteins from binding to these receptors. SFRPs are also known to have functions unrelated to Wnt, as enhancers of procollagen cleavage by the TLD proteinases. SFRPs and Fz proteins both contain CRD domains, but SFRPs lack the seven-pass transmembrane domain which is an integral part of Fzs. SFRP1 is expressed in many tissues and is involved in the regulation of Wnt signaling in osteoblasts, leading to enhanced trabecular bone formation in adults; it has also been shown to control the gro
Probab=98.22  E-value=5.5e-07  Score=93.05  Aligned_cols=79  Identities=6%  Similarity=-0.015  Sum_probs=59.6

Q ss_pred             cccccccc--hheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhcccc
Q psy11546        853 TSEADSIE--KKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHL  930 (1634)
Q Consensus       853 t~~c~~i~--~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~  930 (1634)
                      ...|+.|.  +++|..|+||.|.|||+++|.+|++     +......+..+      +.     ..|+++|..|+|++|.
T Consensus         4 ~~~C~pIp~~l~~C~~l~Yn~T~~PN~lgH~sq~e-----a~~~~~~~~pL------v~-----~~C~p~l~~FlCsl~~   67 (124)
T cd07443           4 PPQCVDIPADLRLCHNVGYKKMVLPNLLDHETMAE-----VKQQASSWVPL------LN-----KNCHKGTQVFLCSLFA   67 (124)
T ss_pred             CCCceECCCchhhhcCCCCCceeCccccCCCCHHH-----HHHHHHHHHHH------Hh-----ccccHHHHHHhhheec
Confidence            46799997  8999999999999999999999999     55443333333      22     3899999999999999


Q ss_pred             cccccCCCCCCCCCCCCCCCceeec
Q psy11546        931 KHTHQHNPDTNISTKAIPSSFNFNQ  955 (1634)
Q Consensus       931 p~~~~~~p~~~i~t~~ip~~f~~~~  955 (1634)
                      |....    +    .-.|++-.|+.
T Consensus        68 P~C~~----~----~i~PCRslCe~   84 (124)
T cd07443          68 PVCLD----R----PVYPCRWLCEA   84 (124)
T ss_pred             cCCCC----C----CCCCCHHHHHH
Confidence            95421    1    12366666666


No 42 
>KOG1215|consensus
Probab=98.20  E-value=1.2e-06  Score=116.48  Aligned_cols=92  Identities=38%  Similarity=0.805  Sum_probs=75.9

Q ss_pred             CCCcCC--CCcccCCCcccCCcccCCCCCCCCCcc---------------------------------------------
Q psy11546       1541 NGHRCP--LGECLPKARVCNGYMECSDGKCEMNSS--------------------------------------------- 1573 (1634)
Q Consensus      1541 ~~f~C~--~G~CIp~~~vCDG~~DC~DGsDE~~~~--------------------------------------------- 1573 (1634)
                      ..|.|.  .++|+|..|.|||..||.||+||..+.                                             
T Consensus       141 ~~~~c~~~~~~Cip~~~~cd~~~~C~dg~de~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~d~~~~~~~~~~~~~~  220 (877)
T KOG1215|consen  141 DKFSCRTGSCKCIPGDWLCDGEADCPDGSDELNCAVRRCEPRGASLDCIVAIKVCDIQHDCADDYDESEGRIYWTDDSRI  220 (877)
T ss_pred             CCCCCcCccccCCCCceeCCCCCccccchhhhcccccccCccccccccceeeeecCcccccccccccccCcccccCCcce
Confidence            478888  789999999999999999999998752                                             


Q ss_pred             --cccCC-CcEecCCCccCCCCCCCCCCCCC--CC----------ccCCcCCCCccCCcccCCCCCCCCCCCCCCC
Q psy11546       1574 --FRCGN-GECVSIGSKCNQLVDCADGSDEK--NC----------SCADFLKSQFLTRKICDGIIDCWDFSDEYEC 1634 (1634)
Q Consensus      1574 --F~C~~-G~CI~~~~~CDG~~DC~DgSDE~--~C----------~C~~~~~~c~~~~~~CDG~~DC~DgSDE~~C 1634 (1634)
                        ++|.. +.||...+.|||..||.+++||.  +|          .|.  ...|++..+.|||..||.||+||.+|
T Consensus       221 ~~~~c~g~~~~i~~~~~~Dg~~dc~~~~de~~~~~~~~~~~~~e~~~~--~~~~~~~~~~~~g~~d~pdg~de~~~  294 (877)
T KOG1215|consen  221 EVTRCDGSSRCILISEVCDGPRDCVDGPDEGVMNCSDATCEAPEIECA--DGDCSDRQKLCDGDLDCPDGLDEDYC  294 (877)
T ss_pred             eEEEecCCCcEEeehhccCCCcccccCCcCceeEeeccccCCcceeec--CCCCccceEEecCccCCCCccccccc
Confidence              34444 48999999999999999999995  22          342  34577778889999999999999876


No 43 
>cd07455 CRD_Collagen_XVIII Cysteine-rich domain of the variant 3 of collagen XVIII (V3C18 ). The cysteine-rich domain (CRD) is an essential part of the variant 3 of collagen XVIII (V3C18), which regulates major cellular functions such as the differential epithelial morphogenesis of early lung and kidney development. V3C18 is a 170 kD protein, which is proteolotically processed into the CRD-containing 50 kD glucoprotein precursor that binds Wnt3a through its CRD domain and suppresses the Wnt3a-induced stabilization of beta catenin. Full-length V3C18 is unable to inhibit Wnt signaling.
Probab=98.16  E-value=1.1e-06  Score=90.55  Aligned_cols=63  Identities=6%  Similarity=-0.044  Sum_probs=52.2

Q ss_pred             cccccc--chheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhccccc
Q psy11546        854 SEADSI--EKKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHLK  931 (1634)
Q Consensus       854 ~~c~~i--~~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~p  931 (1634)
                      ..|+.|  ++++|..|+||.|.|||+++|.+|++     +......       |.-++.    ..|+++|..|+|++|+|
T Consensus         3 ~~C~pIp~~l~~C~~l~Yn~t~~PN~lgH~sq~e-----a~~~~~~-------f~pLv~----~~Csp~l~~FlCs~~~P   66 (123)
T cd07455           3 PRCLPVPSSLPFCSRLGIRSFWLPNFLNHTSVEE-----VRAVLAE-------WAWLLE----SGCHPSLEWFFCLLLVP   66 (123)
T ss_pred             CCCccCCCccccccCCCcccccCCcccCCCCHHH-----HHHHHHH-------HHHHHh----CCCChhhHHHhhhccCC
Confidence            359999  99999999999999999999999999     5544333       333444    48999999999999999


Q ss_pred             c
Q psy11546        932 H  932 (1634)
Q Consensus       932 ~  932 (1634)
                      .
T Consensus        67 ~   67 (123)
T cd07455          67 S   67 (123)
T ss_pred             c
Confidence            4


No 44 
>PF12999 PRKCSH-like:  Glucosidase II beta subunit-like
Probab=98.14  E-value=1.5e-06  Score=94.24  Aligned_cols=68  Identities=38%  Similarity=0.725  Sum_probs=56.4

Q ss_pred             CCceecCCCc-e-ecCCcccCCCCCCCCCCCCCCCCCCCcccCccccccccCCCCCCCCCCCCCCCCCCCCCceecCCCC
Q psy11546        996 NSSFRCGNGE-C-VSIGSKCNQLVDCADGSDEKNCSCADFLKSQFLTRKICDGIIDCWDFSDEYECEWCSPGQYICPNSR 1073 (1634)
Q Consensus       996 ~~~F~C~nG~-C-Ip~~~~CDG~~DC~DgSDE~~C~C~~~~~~~~~~~~~CDG~~DC~D~SDE~~C~~C~~~~F~C~~~~ 1073 (1634)
                      .+.|.|-+|. = |+..++.|++-||+|||||.+=                               .-|+.+.|+|.+.+
T Consensus        35 ~~~f~Cl~~~~~~I~~~~iNDdyCDC~DGSDEPGT-------------------------------sAC~~~~FyC~N~g   83 (176)
T PF12999_consen   35 NGKFTCLDGSKIVIPFSQINDDYCDCPDGSDEPGT-------------------------------SACSNGKFYCENKG   83 (176)
T ss_pred             CCceEecCCCCceecHHHccCcceeCCCCCCcccc-------------------------------ccCcCceEeeccCC
Confidence            4679998763 3 7999999999888888888431                               24667799998775


Q ss_pred             ---eeeccccccCCCcc---CCCCCCc
Q psy11546       1074 ---VCIERTRLCDGIKD---CPLGDDE 1094 (1634)
Q Consensus      1074 ---~CI~~~~~CDG~~D---C~DGSde 1094 (1634)
                         .-|+...|-||+-|   |.|||||
T Consensus        84 ~~p~~i~~s~VnDGICDy~~CCDGSDE  110 (176)
T PF12999_consen   84 HIPRYIPSSRVNDGICDYDICCDGSDE  110 (176)
T ss_pred             CCCceeehhhhcCCcCcccccCCCCCC
Confidence               68999999999999   9999999


No 45 
>PF15494 SRCR_2:  Scavenger receptor cysteine-rich domain
Probab=98.08  E-value=6.8e-06  Score=81.65  Aligned_cols=89  Identities=27%  Similarity=0.518  Sum_probs=60.6

Q ss_pred             eEEEEec--CcccccccCcchhhhhhhchhhhcchhHHhhccCCCCCccceeeeccCCC---CCcceEEEeccccccchh
Q psy11546       1160 FLMIQKQ--GQWGKLCMNQINNFIMKTLKWKISDLGKAICKSMTFRDLNEIEAVADPST---DDSIYYELSMNAINSSAA 1234 (1634)
Q Consensus      1160 ~VeV~~~--GtWg~VCsd~~~~~~~~~~~~~isdlA~vICRqLG~~~~~sve~v~~~~~---~~~~y~~vscn~~e~s~s 1234 (1634)
                      .|+|+..  +.|.+||+|+|+..           ++..+|++|||........+.....   ....|..+.....    .
T Consensus         5 ~LQV~~~~~~~W~~VC~d~W~~~-----------~s~~~C~qLGy~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~----~   69 (98)
T PF15494_consen    5 LLQVYSAASGSWLPVCSDNWNEA-----------LSKAACQQLGYSSSSSSSSVNLSDISPSPSQSFVKLSSNSN----S   69 (98)
T ss_pred             EEEEEecCCCCEeeecccccCHH-----------HHHHHHHHhCCCCCCccceeEeccCCcccccCeEEEecCCC----C
Confidence            4555555  89999999999987           7899999999987666555433321   1335666622211    1


Q ss_pred             cccccccccccCCCCceeeeeccccccCCCC
Q psy11546       1235 TKSALLFQKTKCSQKQVVAVNCSSLECGIRP 1265 (1634)
Q Consensus      1235 ~c~~l~~~~~~C~s~~vV~V~C~~~~CGirp 1265 (1634)
                      ...........|.++.+|.+.|.  +||.|+
T Consensus        70 ~l~~~~~~~~~C~S~~vVsL~C~--~CG~r~   98 (98)
T PF15494_consen   70 TLQQSLSPSSSCSSGSVVSLQCS--DCGKRT   98 (98)
T ss_pred             ceEEEEecCCCCCCCCEEEEECc--ccCCcC
Confidence            11222335679999999999997  899873


No 46 
>cd07453 CRD_crescent Cysteine-rich domain of the crescent protein. The cysteine-rich domain (CRD) is an essential part of the crescent protein, a member of the secreted frizzled-related protein (SFRP) family, which regulates convergent extension movements (CEMs) during gastrulation and neurulation. Xenopus laevis crescent efficiently forms inhibitory complexes with Wnt5a and Wnt11, but this effect is cancelled in the presence of another member of the SFRP family, Frzb1. A potential role for Crescent in head formation is to regulate a non-canonical Wnt pathway positively in the adjacent posterior mesoderm, and negatively in the overlying anterior neuroectoderm.
Probab=98.08  E-value=1.3e-06  Score=91.33  Aligned_cols=63  Identities=6%  Similarity=-0.071  Sum_probs=49.2

Q ss_pred             ccccc--chheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhcccccc
Q psy11546        855 EADSI--EKKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHLKH  932 (1634)
Q Consensus       855 ~c~~i--~~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~p~  932 (1634)
                      .|+.|  .+++|..|+||.|.|||+++|.+|++     +......+..+      +.     ..|+++|..|+|++|+|.
T Consensus         2 ~C~~iP~~l~~C~~l~Yn~T~~PN~lgH~sq~e-----a~~~~~~~~pL------v~-----~~C~p~l~~FLCSl~~P~   65 (135)
T cd07453           2 PCMRIPKSMALCYDIGYSEMRIPNLLEHETMAE-----VIQQSSSWLPL------LA-----RECHPDARIFLCSLFAPI   65 (135)
T ss_pred             CCccCCCcchhhCCCCCCcccCCcccCCCCHHH-----HHHHHHHHHHH------Hh-----cCCChhhHHHhhhhcCcc
Confidence            35544  58899999999999999999999999     65543333333      23     489999999999999995


Q ss_pred             c
Q psy11546        933 T  933 (1634)
Q Consensus       933 ~  933 (1634)
                      .
T Consensus        66 C   66 (135)
T cd07453          66 C   66 (135)
T ss_pred             C
Confidence            4


No 47 
>COG3591 V8-like Glu-specific endopeptidase [Amino acid transport and metabolism]
Probab=97.96  E-value=4.8e-05  Score=87.12  Aligned_cols=197  Identities=20%  Similarity=0.268  Sum_probs=104.8

Q ss_pred             CCCccceEEEEEe---CCeeeeeceeeCCCccccccccccCCCCCceEEEEeEEecCCCCCCCCceeEeeEEEEe--CCC
Q psy11546       1292 RLGSWPWQAALYK---EGEFQCGATLISDQWLLSAGHCFYRAQDDYWVARLGTLRRGTKLPSPYEQLRPISKIIL--HPQ 1366 (1634)
Q Consensus      1292 ~~GewPW~VsL~~---~g~~~CGGTLIS~rWVLTAAHCV~~~~~~~~~VrLG~~~~~s~~~s~~~Q~~~V~~Iii--HP~ 1366 (1634)
                      ....|||-+-...   .|++-|+|+||+++.||||+||+....... . .+-.+..+..  ........+..+..  .|.
T Consensus        45 dt~~~Py~av~~~~~~tG~~~~~~~lI~pntvLTa~Hc~~s~~~G~-~-~~~~~p~g~~--~~~~~~~~~~~~~~~~~~g  120 (251)
T COG3591          45 DTTQFPYSAVVQFEAATGRLCTAATLIGPNTVLTAGHCIYSPDYGE-D-DIAAAPPGVN--SDGGPFYGITKIEIRVYPG  120 (251)
T ss_pred             cCCCCCcceeEEeecCCCcceeeEEEEcCceEEEeeeEEecCCCCh-h-hhhhcCCccc--CCCCCCCceeeEEEEecCC
Confidence            4567999776643   356777779999999999999997654321 1 1111111111  11122222222222  333


Q ss_pred             --CCCCCCcCceEEEEeC--CCCCCCceeeecCCCCCCCCCCCeEEEEEecccccCCCcCCCCcEEEEEeeeChhHHhhh
Q psy11546       1367 --YVDAGFINDISILKMK--TPFSNYVRPICLPHPNTPLTDGTLCTVVGWGQLFEIGRVFPDTLQEVQLPIISTAECRKR 1442 (1634)
Q Consensus      1367 --Yn~~t~~nDIALLkL~--~PfSd~VqPICLP~~~~~l~~G~~c~VvGWG~t~e~g~~~s~~Lq~v~VpIIs~eeC~~~ 1442 (1634)
                        |.......|+..+.|.  .++...+...-++..... ..+....++||-.....      .++.       .+.|...
T Consensus       121 ~~~~~d~~~~~v~~~~~~~g~~~~~~~~~~~~~~~~~~-~~~d~i~v~GYP~dk~~------~~~~-------~e~t~~v  186 (251)
T COG3591         121 ELYKEDGASYDVGEAALESGINIGDVVNYLKRNTASEA-KANDRITVIGYPGDKPN------IGTM-------WESTGKV  186 (251)
T ss_pred             ceeccCCceeeccHHHhccCCCcccccccccccccccc-ccCceeEEEeccCCCCc------ceeE-------eeeccee
Confidence              3444555666666666  235555554444433333 33444888888533221      1110       0011110


Q ss_pred             ccCCCcccCCCCeEEEeecCCCCCCCCCCCCCeEEEecCCCcEEEEEEEEcCCCCCCCCCCeEEEEc-CCchHHHHHhhc
Q psy11546       1443 TLFLPLYRVTENMFCAGFERGGRDACLGDSGGPLMCQEPDGRWSLMGVTSNGYGCARANRPGVYTKV-SNYIPWLYNNMA 1521 (1634)
Q Consensus      1443 ~~~~~~~~It~sm~CAG~~~gg~daC~GDSGGPLVc~~~~GrW~LvGIvS~G~gCg~~~~PgVYTrV-s~YlDWI~~vI~ 1521 (1634)
                      .      .+....     ..-..+.+.|+||+|++....    +++||...|.+-.........+++ ..+++||++.++
T Consensus       187 ~------~~~~~~-----l~y~~dT~pG~SGSpv~~~~~----~vigv~~~g~~~~~~~~~n~~vr~t~~~~~~I~~~~~  251 (251)
T COG3591         187 N------SIKGNK-----LFYDADTLPGSSGSPVLISKD----EVIGVHYNGPGANGGSLANNAVRLTPEILNFIQQNIK  251 (251)
T ss_pred             E------EEecce-----EEEEecccCCCCCCceEecCc----eEEEEEecCCCcccccccCcceEecHHHHHHHHHhhC
Confidence            0      011110     112357899999999997652    999999988753322334445555 446899987753


No 48 
>smart00192 LDLa Low-density lipoprotein receptor domain class A. Cysteine-rich repeat in the low-density lipoprotein (LDL) receptor that  plays a central role in mammalian cholesterol metabolism. The N-terminal  type A repeats in LDL receptor bind the lipoproteins. Other homologous  domains occur in related receptors, including the very low-density  lipoprotein receptor and the LDL receptor-related protein/alpha  2-macroglobulin receptor, and in proteins which are functionally  unrelated, such as the C9 component of complement. Mutations in the LDL receptor gene cause familial hypercholesterolemia.
Probab=97.84  E-value=1.1e-05  Score=65.17  Aligned_cols=32  Identities=47%  Similarity=1.175  Sum_probs=29.1

Q ss_pred             CCCCceecCCCCeeeccccccCCCccCCCCCCc
Q psy11546       1062 CSPGQYICPNSRVCIERTRLCDGIKDCPLGDDE 1094 (1634)
Q Consensus      1062 C~~~~F~C~~~~~CI~~~~~CDG~~DC~DGSde 1094 (1634)
                      |+.++|+|.++ .||+..++|||+.||.|||||
T Consensus         2 C~~~~f~C~~~-~Ci~~~~~Cdg~~dC~dgsDE   33 (33)
T smart00192        2 CPPGEFQCDNG-RCIPLSWVCDGVDDCSDGSDE   33 (33)
T ss_pred             CCCCeEECCCC-CEECchhhCCCcCcCcCCCCC
Confidence            67789999865 599999999999999999998


No 49 
>cd07451 CRD_SMO Cysteine-rich domain of the smoothened receptor (Smo) integral membrane protein. The cysteine-rich domain (CRD) is part of the smoothened receptor (Smo), an integral membrane protein and one of the key players in the Hedgehog (Hh) signaling pathway, critical for development, cell growth and migration, as well as stem cell maintenance. The CRD of Smo is conserved in vertebrates and can also be identified in invertebrates. The precise function of the CRD in Smo is unknown. Mutations in the Drosophila CRD disrupt Smo activity in vivo, while deletion of the CRD in mammalian cells does not seem to affect the activity of overexpressed Smo.
Probab=97.74  E-value=6.6e-06  Score=85.92  Aligned_cols=65  Identities=5%  Similarity=0.061  Sum_probs=52.7

Q ss_pred             ccccccchheeec--ccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhccccc
Q psy11546        854 SEADSIEKKVILV--MSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHLK  931 (1634)
Q Consensus       854 ~~c~~i~~~~c~~--l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~p  931 (1634)
                      ..|+.|+..+|..  ||||.|.|||+++|.||++     |......+..       ++.+   .+|+++|..|+|++|.|
T Consensus         3 ~~Cepi~~~~C~g~~lpYn~T~~PN~~~h~tq~e-----a~~~l~~f~p-------Lv~v---p~C~~~l~~FLCSly~P   67 (132)
T cd07451           3 AKCEPLKNTTCLGSKLPYTYTSLDLVPDSTTQEE-----VQEKLHLWSG-------LRNV---PKCWAVIQPLLCALYMP   67 (132)
T ss_pred             ceeeECCccccCCCCCCccceecCCccCCcCHHH-----HHHHHHHHHH-------HHhh---cccchhheeeeEeeECC
Confidence            4699999999977  9999999999999999999     5544333333       4442   38999999999999999


Q ss_pred             cc
Q psy11546        932 HT  933 (1634)
Q Consensus       932 ~~  933 (1634)
                      ..
T Consensus        68 ~C   69 (132)
T cd07451          68 KC   69 (132)
T ss_pred             cC
Confidence            53


No 50 
>KOG3577|consensus
Probab=97.63  E-value=7.1e-05  Score=94.97  Aligned_cols=152  Identities=11%  Similarity=0.099  Sum_probs=93.2

Q ss_pred             ccCcccccccchheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhccc
Q psy11546        850 ENNTSEADSIEKKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQH  929 (1634)
Q Consensus       850 ~~~t~~c~~i~~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~  929 (1634)
                      .+.+..|++|+.++|..++||.|.|||.++|.||++     |..       .++.|.-++    .++|++.|..|+|++|
T Consensus        37 ~~~~~~c~pi~i~~C~~i~Yn~T~~PNll~H~tQ~e-----a~~-------~~~~f~PLv----~~~Cs~~l~~FLCS~y  100 (556)
T KOG3577|consen   37 AAGKARCEPITIPMCQGLGYNMTAMPNLVGHETQEE-----AGL-------KLHQFHPLV----ALRCSPGLRFFLCSLY  100 (556)
T ss_pred             ccccccccceeccccCCCCCCcccCCccccchhHHH-----hhh-------cccccCCcc----ccccchhhhhHhhcCC
Confidence            456678999999999999999999999999999998     222       233444434    4799999999999999


Q ss_pred             ccccccCCCCCCCCCCCCCCCceeeccCC-----ceeeeeeecccccCCCCCcCC--Ccceeccccc---cCCCCCCCce
Q psy11546        930 LKHTHQHNPDTNISTKAIPSSFNFNQVNG-----IPILTKVYTKVSKTNSTSKER--NQTEFHLSVN---RSECEMNSSF  999 (1634)
Q Consensus       930 ~p~~~~~~p~~~i~t~~ip~~f~~~~v~g-----~~~~~~v~~~~~~c~~~s~~~--~~~~f~~~~~---~~~C~~~~~F  999 (1634)
                      .|-+.+     .+.....|++-.|..+..     +......|+..-.|.......  ...|......   .........+
T Consensus       101 ~P~C~~-----~~d~pi~PCRslCe~ar~~Ce~~m~~~gf~WPe~L~C~~fp~~~~~~~~C~~~~~~~~~~~~~~~~~~~  175 (556)
T KOG3577|consen  101 APMCTE-----DLDRPILPCRSLCEAAREGCEPLMEMFGFPWPEFLRCDKFPVPDSAGELCMGPPSEGFAISAGSAGVSV  175 (556)
T ss_pred             CCcCcc-----ccCcccCccHHHHHHHhcccHHHHHhcCCCCCccccCCCCCCCCCccccccCCccccccCCCCCCCCCC
Confidence            994221     111122255555655221     222344688888888776654  2222222111   0111111112


Q ss_pred             ecCCCceecCCcccCCCCCCCCCCCCCCCCCC
Q psy11546       1000 RCGNGECVSIGSKCNQLVDCADGSDEKNCSCA 1031 (1634)
Q Consensus      1000 ~C~nG~CIp~~~~CDG~~DC~DgSDE~~C~C~ 1031 (1634)
                      .+..|.        ++..++...+||. |.+.
T Consensus       176 ~~~~g~--------~~~~~~~~~~~~~-~~~~  198 (556)
T KOG3577|consen  176 LPDFGG--------PGVVLAKPWSDEL-CQNP  198 (556)
T ss_pred             CCCCCC--------CccccCCCCcccc-ccCc
Confidence            222222        6777777778877 6654


No 51 
>cd07066 CRD_FZ CRD_domain cysteine-rich domain, also known as Fz (frizzled) domain. CRD_FZ is an essential component of a number of cell surface receptors, which are involved in multiple signal transduction pathways, particularly in modulating the activity of the Wnt proteins, which play a fundamental role in the early development of metazoans. CRD is also found in secreted frizzled related proteins (SFRPs), which lack the transmembrane segment found in the frizzled protein. The CRD domain is also present in the alpha-1 chain of mouse type XVIII collagen, in carboxypeptidase Z, several receptor tyrosine kinases, and the mosaic transmembrane serine protease corin. The CRD domain is well conserved in metazoans - 10 frizzled proteins have been identified in mammals, 4 in Drosophila and 3 in Caenorhabditis elegans. CRD domains have also been identified in multiple tandem copies in a Dictyostelium discoideum protein. Very little is known about the mechanism by which CRD domains interact wit
Probab=97.62  E-value=3.7e-05  Score=78.57  Aligned_cols=80  Identities=9%  Similarity=0.046  Sum_probs=57.7

Q ss_pred             cccccchheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhcccccccc
Q psy11546        855 EADSIEKKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHLKHTH  934 (1634)
Q Consensus       855 ~c~~i~~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~p~~~  934 (1634)
                      .|++|..++|..++||.|.+||+++|.+|++     +......+..+      +.     .+|++++..|+|++++|...
T Consensus         1 ~C~pi~~~~C~~~~Yn~t~~pn~~~~~~~~~-----a~~~~~~~~~l------~~-----~~C~~~~~~f~Cs~~~P~C~   64 (119)
T cd07066           1 KCEPIPLPLCRGLPYNTTRFPNLLGHESQEE-----AEQELESFTPL------VN-----SGCHPDLRFFLCSLYFPECT   64 (119)
T ss_pred             CCccCCHHHhCCCCCCceeCCCccCCCCHHH-----HHHHHHHHHHH------Hh-----cCCChhHHHHHHhccCcccc
Confidence            4999999999999999999999999988877     44433333222      23     38999999999999999642


Q ss_pred             cCCCCCCCCCCCCCCCceeec
Q psy11546        935 QHNPDTNISTKAIPSSFNFNQ  955 (1634)
Q Consensus       935 ~~~p~~~i~t~~ip~~f~~~~  955 (1634)
                      ....     ....|++-.|..
T Consensus        65 ~~~~-----~~~~PCRs~Cee   80 (119)
T cd07066          65 PDGD-----RPIPPCRSLCEE   80 (119)
T ss_pred             CCCC-----CcCCChHHHHHH
Confidence            2211     234456666666


No 52 
>PF12999 PRKCSH-like:  Glucosidase II beta subunit-like
Probab=97.21  E-value=0.00024  Score=77.46  Aligned_cols=61  Identities=30%  Similarity=0.626  Sum_probs=53.3

Q ss_pred             CCCcCCCCc-c-cCCCcccCCcccCCCCCCCCC------cccccCCC----cEecCCCccCCCCC---CCCCCCCC
Q psy11546       1541 NGHRCPLGE-C-LPKARVCNGYMECSDGKCEMN------SSFRCGNG----ECVSIGSKCNQLVD---CADGSDEK 1601 (1634)
Q Consensus      1541 ~~f~C~~G~-C-Ip~~~vCDG~~DC~DGsDE~~------~~F~C~~G----~CI~~~~~CDG~~D---C~DgSDE~ 1601 (1634)
                      +.|.|-+|. = |+.+++.|++-||+|||||..      ..|.|.|.    .-|+.+++=||+=|   |=|||||.
T Consensus        36 ~~f~Cl~~~~~~I~~~~iNDdyCDC~DGSDEPGTsAC~~~~FyC~N~g~~p~~i~~s~VnDGICDy~~CCDGSDE~  111 (176)
T PF12999_consen   36 GKFTCLDGSKIVIPFSQINDDYCDCPDGSDEPGTSACSNGKFYCENKGHIPRYIPSSRVNDGICDYDICCDGSDES  111 (176)
T ss_pred             CceEecCCCCceecHHHccCcceeCCCCCCccccccCcCceEeeccCCCCCceeehhhhcCCcCcccccCCCCCCC
Confidence            469999885 3 799999999999999999975      25999983    58899999999999   99999994


No 53 
>PF01392 Fz:  Fz domain;  InterPro: IPR020067 The frizzled (fz) domain is an extracellular domain of about 120 amino acids.It was first identified in the alpha-1 chain of type XVIII collagen and in members of the Frizzled family of seven transmembrane (7TM) proteins which act as receptors for secreted Wingless (Wg)/Wnt glycoproteins []. In addition to these proteins, one or two copies of the fz domain are also found [, , , , ] in:  The frizbee (Frzb) family; secreted frizzled-like proteins. Smoothened; another 7TM receptor involved in hedgehog signaling. Carboxpeptidase Z (CPZ). Transmembrane serine protease corin. Two receptor tyrosine kinases (RTKs) subfamilies, the Ror family and the muscle-specific kinase (MuSK) family.   As the fz domain contains 10 cysteines which are largely conserved, it has also been called cysteine-rich domain (CRD) []. The fz domain also contains several other highly conserved residues, for example, a basic amino acid follows C6, and a conserved proline residues lies four residues C-terminal to C9 []. The crystal structure of a fz domain shows that it is predominantly alpha-helical with all cysteines forming disulphide bonds. In addition to helical regions, two short beta-strands at the N terminus form a minimal beta-sheet with the second beta sheet passing through a knot created by disulphide bonds []. Several fz domains have been shown to be both necessary and sufficient for Wg/Wnt ligand binding, strongly suggesting that the fz domain is a Wg/Wnt interacting domain [, ].; GO: 0005515 protein binding; PDB: 1IJX_E 3HKL_B 1IJY_B 4F0A_A.
Probab=97.01  E-value=0.00014  Score=73.15  Aligned_cols=62  Identities=13%  Similarity=0.190  Sum_probs=45.2

Q ss_pred             ccccchheeecccCCCcccCCCCcCCCCCCcchhhHHHHHHHHHHHHHhhcccccccCCcchHHHHHHHhhccccccc
Q psy11546        856 ADSIEKKVILVMSSNSSNMLNFNENRTSDDNDNKNKAMAQNLLTQMLEKYNRVITNDSSVSSLKYLIDQISHQHLKHT  933 (1634)
Q Consensus       856 c~~i~~~~c~~l~yn~t~~~n~l~~~t~~~~~~~~~~~~~~~~~~~~~~y~~~~t~~~~~~c~~~i~~f~c~~~~p~~  933 (1634)
                      |++|+.++|..++||.|.+||.+++.++++     +.+.       +..|..++..    .|++++..|+|+.++|..
T Consensus         1 C~pi~~~~C~~~~y~~t~~pn~~~~~~~~~-----~~~~-------~~~~~~l~~~----~C~~~~~~flC~~~~P~C   62 (116)
T PF01392_consen    1 CEPITLSFCRGLPYNQTSFPNSLGHQTQDE-----ASQQ-------LNAFSTLVNS----GCHPYLRPFLCSLYFPPC   62 (116)
T ss_dssp             EEE---GGGTTSS-TEEESSETTTSSSHHH-----HHHH-------HHCCHHHHCC----TSHTTHHHHHHHHHS-BB
T ss_pred             CccCChHHhcCCCCCceEeecccCCcCHHH-----HHHH-------HHHHHHHhcc----hhhHHHhhhhhcccCCcC
Confidence            899999999999999999999999988766     3333       3334444442    899999999999999964


No 54 
>KOG2397|consensus
Probab=96.05  E-value=0.005  Score=75.46  Aligned_cols=52  Identities=40%  Similarity=0.618  Sum_probs=38.2

Q ss_pred             CCCCCCCCCCCCCCCCCCCCCceecCCCC---eeeccccccCCCccCCCCCCccc
Q psy11546       1045 DGIIDCWDFSDEYECEWCSPGQYICPNSR---VCIERTRLCDGIKDCPLGDDEKQ 1096 (1634)
Q Consensus      1045 DG~~DC~D~SDE~~C~~C~~~~F~C~~~~---~CI~~~~~CDG~~DC~DGSde~~ 1096 (1634)
                      |++-||.||+||..=.-|+.+.|+|.+.+   .=|+...+=||+-||.|||||..
T Consensus        61 Dd~CDC~DGsDEPGtsACpngkF~C~N~G~~p~~i~ssrV~DGICDCCDgSDE~~  115 (480)
T KOG2397|consen   61 DDSCDCLDGSDEPGTSACPNGKFYCVNQGHQPKYIPSSRVNDGICDCCDGSDEYL  115 (480)
T ss_pred             cccccCCCCCCCCccccCCCCceeeeecCCCceeeechhccCcccccccCCCCcc
Confidence            33344444444433346889999997654   67999999999999999999953


No 55 
>TIGR02037 degP_htrA_DO periplasmic serine protease, Do/DeqQ family. This family consists of a set proteins various designated DegP, heat shock protein HtrA, and protease DO. The ortholog in Pseudomonas aeruginosa is designated MucD and is found in an operon that controls mucoid phenotype. This family also includes the DegQ (HhoA) paralog in E. coli which can rescue a DegP mutant, but not the smaller DegS paralog, which cannot. Members of this family are located in the periplasm and have separable functions as both protease and chaperone. Members have a trypsin domain and two copies of a PDZ domain. This protein protects bacteria from thermal and other stresses and may be important for the survival of bacterial pathogens.// The chaperone function is dominant at low temperatures, whereas the proteolytic activity is turned on at elevated temperatures.
Probab=95.59  E-value=0.22  Score=62.07  Aligned_cols=170  Identities=17%  Similarity=0.178  Sum_probs=89.6

Q ss_pred             eeeeeceeeCCC-ccccccccccCCCCCceEEEEeEEecCCCCCCCCceeEeeEEEEeCCCCCCCCCcCceEEEEeCCCC
Q psy11546       1307 EFQCGATLISDQ-WLLSAGHCFYRAQDDYWVARLGTLRRGTKLPSPYEQLRPISKIILHPQYVDAGFINDISILKMKTPF 1385 (1634)
Q Consensus      1307 ~~~CGGTLIS~r-WVLTAAHCV~~~~~~~~~VrLG~~~~~s~~~s~~~Q~~~V~~IiiHP~Yn~~t~~nDIALLkL~~Pf 1385 (1634)
                      ...++|.+|++. ||||++|.+....  .+.|.+..           ...+..+-+..++       ..|||||+++.+ 
T Consensus        57 ~~~GSGfii~~~G~IlTn~Hvv~~~~--~i~V~~~~-----------~~~~~a~vv~~d~-------~~DlAllkv~~~-  115 (428)
T TIGR02037        57 RGLGSGVIISADGYILTNNHVVDGAD--EITVTLSD-----------GREFKAKLVGKDP-------RTDIAVLKIDAK-  115 (428)
T ss_pred             cceeeEEEECCCCEEEEcHHHcCCCC--eEEEEeCC-----------CCEEEEEEEEecC-------CCCEEEEEecCC-
Confidence            457899999876 9999999997643  23333221           1234444333444       379999999865 


Q ss_pred             CCCceeeecCCCCCCCCCCCeEEEEEecccccCCCcCCCCcEEEEEeeeChhHHhhhccCCCcccCCCCeEEEeecCCCC
Q psy11546       1386 SNYVRPICLPHPNTPLTDGTLCTVVGWGQLFEIGRVFPDTLQEVQLPIISTAECRKRTLFLPLYRVTENMFCAGFERGGR 1465 (1634)
Q Consensus      1386 Sd~VqPICLP~~~~~l~~G~~c~VvGWG~t~e~g~~~s~~Lq~v~VpIIs~eeC~~~~~~~~~~~It~sm~CAG~~~gg~ 1465 (1634)
                       ..+.++-|... ..+..|..+++.|+....      ........+.-.....    .. ..  .. ..++=     ...
T Consensus       116 -~~~~~~~l~~~-~~~~~G~~v~aiG~p~g~------~~~~t~G~vs~~~~~~----~~-~~--~~-~~~i~-----tda  174 (428)
T TIGR02037       116 -KNLPVIKLGDS-DKLRVGDWVLAIGNPFGL------GQTVTSGIVSALGRSG----LG-IG--DY-ENFIQ-----TDA  174 (428)
T ss_pred             -CCceEEEccCC-CCCCCCCEEEEEECCCcC------CCcEEEEEEEecccCc----cC-CC--Cc-cceEE-----ECC
Confidence             34556666432 346789999999985321      1112222221111000    00 00  00 11111     112


Q ss_pred             CCCCCCCCCeEEEecCCCcEEEEEEEEcCCC-CCCCCCCeEEEEcCCchHHHHHhhcc
Q psy11546       1466 DACLGDSGGPLMCQEPDGRWSLMGVTSNGYG-CARANRPGVYTKVSNYIPWLYNNMAA 1522 (1634)
Q Consensus      1466 daC~GDSGGPLVc~~~~GrW~LvGIvS~G~g-Cg~~~~PgVYTrVs~YlDWI~~vI~~ 1522 (1634)
                      ..-.|.|||||+-..  |  .++||.+.... -+.....+...-+......+.+.++.
T Consensus       175 ~i~~GnSGGpl~n~~--G--~viGI~~~~~~~~g~~~g~~faiP~~~~~~~~~~l~~~  228 (428)
T TIGR02037       175 AINPGNSGGPLVNLR--G--EVIGINTAIYSPSGGNVGIGFAIPSNMAKNVVDQLIEG  228 (428)
T ss_pred             CCCCCCCCCceECCC--C--eEEEEEeEEEcCCCCccceEEEEEhHHHHHHHHHHHhc
Confidence            345689999998432  3  89999875422 11111223445555555555554443


No 56 
>TIGR02038 protease_degS periplasmic serine pepetdase DegS. This family consists of the periplasmic serine protease DegS (HhoB), a shorter paralog of protease DO (HtrA, DegP) and DegQ (HhoA). It is found in E. coli and several other Proteobacteria of the gamma subdivision. It contains a trypsin domain and a single copy of PDZ domain (in contrast to DegP with two copies). A critical role of this DegS is to sense stress in the periplasm and partially degrade an inhibitor of sigma(E).
Probab=95.11  E-value=0.5  Score=57.59  Aligned_cols=168  Identities=14%  Similarity=0.185  Sum_probs=86.2

Q ss_pred             eeeeceeeCCC-ccccccccccCCCCCceEEEEeEEecCCCCCCCCceeEeeEEEEeCCCCCCCCCcCceEEEEeCCCCC
Q psy11546       1308 FQCGATLISDQ-WLLSAGHCFYRAQDDYWVARLGTLRRGTKLPSPYEQLRPISKIILHPQYVDAGFINDISILKMKTPFS 1386 (1634)
Q Consensus      1308 ~~CGGTLIS~r-WVLTAAHCV~~~~~~~~~VrLG~~~~~s~~~s~~~Q~~~V~~IiiHP~Yn~~t~~nDIALLkL~~PfS 1386 (1634)
                      ...+|.+|++. ||||++|-+....  .+.|.+..           ...+..+-+..+|       ..||||||++... 
T Consensus        78 ~~GSG~vi~~~G~IlTn~HVV~~~~--~i~V~~~d-----------g~~~~a~vv~~d~-------~~DlAvlkv~~~~-  136 (351)
T TIGR02038        78 GLGSGVIMSKEGYILTNYHVIKKAD--QIVVALQD-----------GRKFEAELVGSDP-------LTDLAVLKIEGDN-  136 (351)
T ss_pred             ceEEEEEEeCCeEEEecccEeCCCC--EEEEEECC-----------CCEEEEEEEEecC-------CCCEEEEEecCCC-
Confidence            46889999876 9999999996543  23333211           1233444343443       4799999998652 


Q ss_pred             CCceeeecCCCCCCCCCCCeEEEEEecccccCCCcCCCCcEEEEEeeeChhHHhhhccCCCcccCCCCeEEEeecCCCCC
Q psy11546       1387 NYVRPICLPHPNTPLTDGTLCTVVGWGQLFEIGRVFPDTLQEVQLPIISTAECRKRTLFLPLYRVTENMFCAGFERGGRD 1466 (1634)
Q Consensus      1387 d~VqPICLP~~~~~l~~G~~c~VvGWG~t~e~g~~~s~~Lq~v~VpIIs~eeC~~~~~~~~~~~It~sm~CAG~~~gg~d 1466 (1634)
                        +.++-|- ....+..|..+.+.|+....      ........+.-+...   .   ...  .-....+=     ....
T Consensus       137 --~~~~~l~-~s~~~~~G~~V~aiG~P~~~------~~s~t~GiIs~~~r~---~---~~~--~~~~~~iq-----tda~  194 (351)
T TIGR02038       137 --LPTIPVN-LDRPPHVGDVVLAIGNPYNL------GQTITQGIISATGRN---G---LSS--VGRQNFIQ-----TDAA  194 (351)
T ss_pred             --CceEecc-CcCccCCCCEEEEEeCCCCC------CCcEEEEEEEeccCc---c---cCC--CCcceEEE-----ECCc
Confidence              2233332 22356789999999985321      111222222111110   0   000  00001111     1234


Q ss_pred             CCCCCCCCeEEEecCCCcEEEEEEEEcCCCCCC-C--CCCeEEEEcCCchHHHHHhhcc
Q psy11546       1467 ACLGDSGGPLMCQEPDGRWSLMGVTSNGYGCAR-A--NRPGVYTKVSNYIPWLYNNMAA 1522 (1634)
Q Consensus      1467 aC~GDSGGPLVc~~~~GrW~LvGIvS~G~gCg~-~--~~PgVYTrVs~YlDWI~~vI~~ 1522 (1634)
                      .-.|.|||||+-..  |  .++||.+....... .  ...+....+......+.+.++.
T Consensus       195 i~~GnSGGpl~n~~--G--~vIGI~~~~~~~~~~~~~~g~~faIP~~~~~~vl~~l~~~  249 (351)
T TIGR02038       195 INAGNSGGALINTN--G--ELVGINTASFQKGGDEGGEGINFAIPIKLAHKIMGKIIRD  249 (351)
T ss_pred             cCCCCCcceEECCC--C--eEEEEEeeeecccCCCCccceEEEecHHHHHHHHHHHhhc
Confidence            56789999999432  3  79999876432111 1  1123344444455555555443


No 57 
>PF13365 Trypsin_2:  Trypsin-like peptidase domain; PDB: 1Y8T_A 2Z9I_A 3QO6_A 1L1J_A 1QY6_A 2O8L_A 3OTP_E 2ZLE_I 1KY9_A 3CS0_A ....
Probab=94.64  E-value=0.059  Score=53.56  Aligned_cols=22  Identities=23%  Similarity=0.294  Sum_probs=19.5

Q ss_pred             eeceeeCCC-ccccccccccCCC
Q psy11546       1310 CGATLISDQ-WLLSAGHCFYRAQ 1331 (1634)
Q Consensus      1310 CGGTLIS~r-WVLTAAHCV~~~~ 1331 (1634)
                      |+|.+|+++ +|||||||+....
T Consensus         1 GTGf~i~~~g~ilT~~Hvv~~~~   23 (120)
T PF13365_consen    1 GTGFLIGPDGYILTAAHVVEDWN   23 (120)
T ss_dssp             EEEEEEETTTEEEEEHHHHTCCT
T ss_pred             CEEEEEcCCceEEEchhheeccc
Confidence            689999999 9999999997643


No 58 
>PRK10898 serine endoprotease; Provisional
Probab=94.55  E-value=0.86  Score=55.65  Aligned_cols=82  Identities=12%  Similarity=0.188  Sum_probs=50.0

Q ss_pred             eeeeceeeCCC-ccccccccccCCCCCceEEEEeEEecCCCCCCCCceeEeeEEEEeCCCCCCCCCcCceEEEEeCCCCC
Q psy11546       1308 FQCGATLISDQ-WLLSAGHCFYRAQDDYWVARLGTLRRGTKLPSPYEQLRPISKIILHPQYVDAGFINDISILKMKTPFS 1386 (1634)
Q Consensus      1308 ~~CGGTLIS~r-WVLTAAHCV~~~~~~~~~VrLG~~~~~s~~~s~~~Q~~~V~~IiiHP~Yn~~t~~nDIALLkL~~PfS 1386 (1634)
                      ..-+|.+|++. +|||+||=+....  .+.|.+..           ...+..+-+..+|       ..|||||+++..  
T Consensus        78 ~~GSGfvi~~~G~IlTn~HVv~~a~--~i~V~~~d-----------g~~~~a~vv~~d~-------~~DlAvl~v~~~--  135 (353)
T PRK10898         78 TLGSGVIMDQRGYILTNKHVINDAD--QIIVALQD-----------GRVFEALLVGSDS-------LTDLAVLKINAT--  135 (353)
T ss_pred             ceeeEEEEeCCeEEEecccEeCCCC--EEEEEeCC-----------CCEEEEEEEEEcC-------CCCEEEEEEcCC--
Confidence            46789999875 9999999886532  33333211           1223443344444       389999999864  


Q ss_pred             CCceeeecCCCCCCCCCCCeEEEEEec
Q psy11546       1387 NYVRPICLPHPNTPLTDGTLCTVVGWG 1413 (1634)
Q Consensus      1387 d~VqPICLP~~~~~l~~G~~c~VvGWG 1413 (1634)
                       ...++-|.. ...+..|..+.+.|+.
T Consensus       136 -~l~~~~l~~-~~~~~~G~~V~aiG~P  160 (353)
T PRK10898        136 -NLPVIPINP-KRVPHIGDVVLAIGNP  160 (353)
T ss_pred             -CCCeeeccC-cCcCCCCCEEEEEeCC
Confidence             122333322 2345678888888875


No 59 
>PRK10139 serine endoprotease; Provisional
Probab=92.97  E-value=1.3  Score=55.89  Aligned_cols=169  Identities=19%  Similarity=0.179  Sum_probs=88.6

Q ss_pred             eeeeceeeCC--CccccccccccCCCCCceEEEEeEEecCCCCCCCCceeEeeEEEEeCCCCCCCCCcCceEEEEeCCCC
Q psy11546       1308 FQCGATLISD--QWLLSAGHCFYRAQDDYWVARLGTLRRGTKLPSPYEQLRPISKIILHPQYVDAGFINDISILKMKTPF 1385 (1634)
Q Consensus      1308 ~~CGGTLIS~--rWVLTAAHCV~~~~~~~~~VrLG~~~~~s~~~s~~~Q~~~V~~IiiHP~Yn~~t~~nDIALLkL~~Pf 1385 (1634)
                      ...+|.+|++  .+|||.+|.+.+..  .+.|.+..           ...+..+-+...|       ..||||||++.+ 
T Consensus        90 ~~GSG~ii~~~~g~IlTn~HVv~~a~--~i~V~~~d-----------g~~~~a~vvg~D~-------~~DlAvlkv~~~-  148 (455)
T PRK10139         90 GLGSGVIIDAAKGYVLTNNHVINQAQ--KISIQLND-----------GREFDAKLIGSDD-------QSDIALLQIQNP-  148 (455)
T ss_pred             ceEEEEEEECCCCEEEeChHHhCCCC--EEEEEECC-----------CCEEEEEEEEEcC-------CCCEEEEEecCC-
Confidence            4678999974  69999999997643  34444321           1234444343433       479999999854 


Q ss_pred             CCCceeeecCCCCCCCCCCCeEEEEEecccccCCCcCCCCcEEEEEeeeChhHHhhhccCCCcccCCCCeEEEeecCCCC
Q psy11546       1386 SNYVRPICLPHPNTPLTDGTLCTVVGWGQLFEIGRVFPDTLQEVQLPIISTAECRKRTLFLPLYRVTENMFCAGFERGGR 1465 (1634)
Q Consensus      1386 Sd~VqPICLP~~~~~l~~G~~c~VvGWG~t~e~g~~~s~~Lq~v~VpIIs~eeC~~~~~~~~~~~It~sm~CAG~~~gg~ 1465 (1634)
                       ....++.|... ..+..|..+.+.|+-.-    .  ...   +..-+++...  +..  ...... ...+=     ...
T Consensus       149 -~~l~~~~lg~s-~~~~~G~~V~aiG~P~g----~--~~t---vt~GivS~~~--r~~--~~~~~~-~~~iq-----tda  207 (455)
T PRK10139        149 -SKLTQIAIADS-DKLRVGDFAVAVGNPFG----L--GQT---ATSGIISALG--RSG--LNLEGL-ENFIQ-----TDA  207 (455)
T ss_pred             -CCCceeEecCc-cccCCCCEEEEEecCCC----C--CCc---eEEEEEcccc--ccc--cCCCCc-ceEEE-----ECC
Confidence             22345555432 34667888888887321    1  111   2223333110  000  000000 11111     123


Q ss_pred             CCCCCCCCCeEEEecCCCcEEEEEEEEcCCCCC-CCCCCeEEEEcCCchHHHHHhhcc
Q psy11546       1466 DACLGDSGGPLMCQEPDGRWSLMGVTSNGYGCA-RANRPGVYTKVSNYIPWLYNNMAA 1522 (1634)
Q Consensus      1466 daC~GDSGGPLVc~~~~GrW~LvGIvS~G~gCg-~~~~PgVYTrVs~YlDWI~~vI~~ 1522 (1634)
                      ..-.|.|||||+-..    =.++||.+....-+ ....-+...-+......+.+.++.
T Consensus       208 ~in~GnSGGpl~n~~----G~vIGi~~~~~~~~~~~~gigfaIP~~~~~~v~~~l~~~  261 (455)
T PRK10139        208 SINRGNSGGALLNLN----GELIGINTAILAPGGGSVGIGFAIPSNMARTLAQQLIDF  261 (455)
T ss_pred             ccCCCCCcceEECCC----CeEEEEEEEEEcCCCCccceEEEEEhHHHHHHHHHHhhc
Confidence            456799999999432    28999998643211 112234455554444555554443


No 60 
>KOG3509|consensus
Probab=92.16  E-value=0.1  Score=69.60  Aligned_cols=86  Identities=28%  Similarity=0.502  Sum_probs=67.5

Q ss_pred             cccCCCcccCCcccCCCCCCCCCc----------ccccCCCcEecCCCccCCCCCCCCCCCCCCCccCCc-------CCC
Q psy11546       1549 ECLPKARVCNGYMECSDGKCEMNS----------SFRCGNGECVSIGSKCNQLVDCADGSDEKNCSCADF-------LKS 1611 (1634)
Q Consensus      1549 ~CIp~~~vCDG~~DC~DGsDE~~~----------~F~C~~G~CI~~~~~CDG~~DC~DgSDE~~C~C~~~-------~~~ 1611 (1634)
                      +|......|++..||.+.+|+.++          .|+|.+++|...-+.||...+|++++++.+|.+..-       ...
T Consensus         2 ~c~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~~   81 (964)
T KOG3509|consen    2 ECVKNRYACDRQPDCRDRSDVANDPAIGSACSPNEFKCNNPRCVQPEALLDADSTCGPNSTPSGCNAKPSASDCKPTETQ   81 (964)
T ss_pred             chhhhhhhhccchhhHhhcccCCCccccccCCcchhccCCccccCchhhhccccccCCCCCcCCccccccccccCCcccc
Confidence            567778889999999999988774          388999999999999999999999998888843321       111


Q ss_pred             ------CccCCcccCCCCCCCCCCCCCCC
Q psy11546       1612 ------QFLTRKICDGIIDCWDFSDEYEC 1634 (1634)
Q Consensus      1612 ------c~~~~~~CDG~~DC~DgSDE~~C 1634 (1634)
                            |......|||.+||.|+++|.+|
T Consensus        82 c~~~~~~~~~~~~~~g~~~~~~~~~~~~~  110 (964)
T KOG3509|consen   82 CRDRLRCNPQSFQCDGTNDCKDGSDEVGC  110 (964)
T ss_pred             cccchhcCCccccccCCCCCCccchhccc
Confidence                  22344579999999999999765


No 61 
>PRK10942 serine endoprotease; Provisional
Probab=91.28  E-value=2  Score=54.55  Aligned_cols=83  Identities=14%  Similarity=0.243  Sum_probs=51.0

Q ss_pred             eeeeceeeCC--CccccccccccCCCCCceEEEEeEEecCCCCCCCCceeEeeEEEEeCCCCCCCCCcCceEEEEeCCCC
Q psy11546       1308 FQCGATLISD--QWLLSAGHCFYRAQDDYWVARLGTLRRGTKLPSPYEQLRPISKIILHPQYVDAGFINDISILKMKTPF 1385 (1634)
Q Consensus      1308 ~~CGGTLIS~--rWVLTAAHCV~~~~~~~~~VrLG~~~~~s~~~s~~~Q~~~V~~IiiHP~Yn~~t~~nDIALLkL~~Pf 1385 (1634)
                      ...+|.+|+.  .+|||.+|.+.+..  .+.|.+..           ...+..+-+..+|       ..|||||+++.+ 
T Consensus       111 ~~GSG~ii~~~~G~IlTn~HVv~~a~--~i~V~~~d-----------g~~~~a~vv~~D~-------~~DlAvlki~~~-  169 (473)
T PRK10942        111 ALGSGVIIDADKGYVVTNNHVVDNAT--KIKVQLSD-----------GRKFDAKVVGKDP-------RSDIALIQLQNP-  169 (473)
T ss_pred             ceEEEEEEECCCCEEEeChhhcCCCC--EEEEEECC-----------CCEEEEEEEEecC-------CCCEEEEEecCC-
Confidence            4678999984  59999999986543  34444321           1223444343444       379999999743 


Q ss_pred             CCCceeeecCCCCCCCCCCCeEEEEEec
Q psy11546       1386 SNYVRPICLPHPNTPLTDGTLCTVVGWG 1413 (1634)
Q Consensus      1386 Sd~VqPICLP~~~~~l~~G~~c~VvGWG 1413 (1634)
                       ....++-|-. ...+..|..+++.|+-
T Consensus       170 -~~l~~~~lg~-s~~l~~G~~V~aiG~P  195 (473)
T PRK10942        170 -KNLTAIKMAD-SDALRVGDYTVAIGNP  195 (473)
T ss_pred             -CCCceeEecC-ccccCCCCEEEEEcCC
Confidence             1233444432 2346678888888863


No 62 
>PF09272 Hepsin-SRCR:  Hepsin, SRCR;  InterPro: IPR015352 This entry represents the extracellular domain of the serine protease hepsin. The domain is formed primarily by three elements of regular secondary structure: a 12-residue alpha helix, a twisted five-stranded antiparallel beta sheet, and a second, two-stranded, antiparallel sheet. The two beta-sheets lie at roughly right angles to each other, with the helix nestled between the two, adopting an SRCR fold. The exact function of this domain has not been identified, though it probably may serve to orient the protease domain or place it in the vicinity of its substrate []. ; GO: 0004252 serine-type endopeptidase activity, 0070008 serine-type exopeptidase activity; PDB: 3T2N_B 1Z8G_A 1P57_A 1O5E_L 1O5F_L.
Probab=90.94  E-value=0.31  Score=49.19  Aligned_cols=95  Identities=19%  Similarity=0.377  Sum_probs=50.0

Q ss_pred             cCcceEEEE--ecCcccccccCcchhhhhhhchhhhcchhHHhhccCCCCCccceeeeccCC---CCCcceEEEeccccc
Q psy11546       1156 HDSGFLMIQ--KQGQWGKLCMNQINNFIMKTLKWKISDLGKAICKSMTFRDLNEIEAVADPS---TDDSIYYELSMNAIN 1230 (1634)
Q Consensus      1156 ~~eG~VeV~--~~GtWg~VCsd~~~~~~~~~~~~~isdlA~vICRqLG~~~~~sve~v~~~~---~~~~~y~~vscn~~e 1230 (1634)
                      ....+|+++  .+|+|.-+|+...+..           .|..-|.+|||-.+..........   .+...|+-+......
T Consensus         9 ~aD~rL~vfD~te~~WRllCss~~N~r-----------vA~lsCeemGFlr~~~hs~l~~~~~g~~g~s~fFcv~~~~L~   77 (110)
T PF09272_consen    9 PADQRLMVFDSTEGTWRLLCSSRSNAR-----------VAGLSCEEMGFLRAVSHSELDVRTAGANGTSGFFCVDEGRLP   77 (110)
T ss_dssp             TTT-BEEEEETTTTEEEEEB--TTHHH-----------HHHHHHHHTT-S-EEEEEEEEHHHH--SS--SEEEE-TTTGG
T ss_pred             CccceEEEEeccCCEEEEEeecccchh-----------hhcccchhcceEEeeccccccccccccCCCcceEEECcccCc
Confidence            345567665  3578999999877765           677789999998876665554433   345566655443222


Q ss_pred             cchhcccccccccccCCCCceeeeeccccccCCCC
Q psy11546       1231 SSAATKSALLFQKTKCSQKQVVAVNCSSLECGIRP 1265 (1634)
Q Consensus      1231 ~s~s~c~~l~~~~~~C~s~~vV~V~C~~~~CGirp 1265 (1634)
                      .  ..........-.|+++....+.|+  +||.|.
T Consensus        78 ~--a~~L~~v~~vCdCp~G~fLat~CQ--DCGRRk  108 (110)
T PF09272_consen   78 Y--ARRLLEVLSVCDCPRGRFLATICQ--DCGRRK  108 (110)
T ss_dssp             G---S-GGGGEEEE--TT-EEEEEEE-----S---
T ss_pred             H--hHhhhhheeeeeCCCCceeehhhh--hhCccc
Confidence            1  112223344568999999999998  799874


No 63 
>KOG3509|consensus
Probab=89.57  E-value=0.32  Score=65.07  Aligned_cols=93  Identities=28%  Similarity=0.475  Sum_probs=58.7

Q ss_pred             eecCCcccCCCCCCCCCCCCCCCCCCCccc----------CccccccccCCCCCCCCCCCCCCC------CCCCCCceec
Q psy11546       1006 CVSIGSKCNQLVDCADGSDEKNCSCADFLK----------SQFLTRKICDGIIDCWDFSDEYEC------EWCSPGQYIC 1069 (1634)
Q Consensus      1006 CIp~~~~CDG~~DC~DgSDE~~C~C~~~~~----------~~~~~~~~CDG~~DC~D~SDE~~C------~~C~~~~F~C 1069 (1634)
                      |......|++..||.+.+|+.++.+.....          .+...-++||...+|.+++++.+|      ..|.+..+.|
T Consensus         3 c~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~~c   82 (964)
T KOG3509|consen    3 CVKNRYACDRQPDCRDRSDVANDPAIGSACSPNEFKCNNPRCVQPEALLDADSTCGPNSTPSGCNAKPSASDCKPTETQC   82 (964)
T ss_pred             hhhhhhhhccchhhHhhcccCCCccccccCCcchhccCCccccCchhhhccccccCCCCCcCCccccccccccCCccccc
Confidence            444455566666666666666554321110          112233556666666666655555      2377788888


Q ss_pred             CCCCeeeccccccCCCccCCCCCCccccc
Q psy11546       1070 PNSRVCIERTRLCDGIKDCPLGDDEKQCI 1098 (1634)
Q Consensus      1070 ~~~~~CI~~~~~CDG~~DC~DGSde~~c~ 1098 (1634)
                      .+--+|.+....|||.+||.|+++|..|.
T Consensus        83 ~~~~~~~~~~~~~~g~~~~~~~~~~~~~~  111 (964)
T KOG3509|consen   83 RDRLRCNPQSFQCDGTNDCKDGSDEVGCK  111 (964)
T ss_pred             ccchhcCCccccccCCCCCCccchhcccc
Confidence            76667888888999999999999888764


No 64 
>KOG2397|consensus
Probab=89.35  E-value=0.25  Score=61.23  Aligned_cols=63  Identities=38%  Similarity=0.734  Sum_probs=48.6

Q ss_pred             CCceecCCC-ceecCCcccCCCCCCCCCCCCCC---CC-----CCC--cccCccccccccCCCCCCCCCCCCCC
Q psy11546        996 NSSFRCGNG-ECVSIGSKCNQLVDCADGSDEKN---CS-----CAD--FLKSQFLTRKICDGIIDCWDFSDEYE 1058 (1634)
Q Consensus       996 ~~~F~C~nG-~CIp~~~~CDG~~DC~DgSDE~~---C~-----C~~--~~~~~~~~~~~CDG~~DC~D~SDE~~ 1058 (1634)
                      ...|.|.+| +=|+..+.=|..-||.|||||.+   |.     |..  .....++...+=||+.||.|+|||..
T Consensus        42 s~~~~CLdgs~~i~f~qlNDd~CDC~DGsDEPGtsACpngkF~C~N~G~~p~~i~ssrV~DGICDCCDgSDE~~  115 (480)
T KOG2397|consen   42 SSMFKCLDGSKTISFSQLNDDSCDCLDGSDEPGTSACPNGKFYCVNQGHQPKYIPSSRVNDGICDCCDGSDEYL  115 (480)
T ss_pred             ccceeeccCCcccCHHHhccccccCCCCCCCCccccCCCCceeeeecCCCceeeechhccCcccccccCCCCcc
Confidence            357999877 78899999999999999999964   43     211  11223455678899999999999986


No 65 
>PF02395 Peptidase_S6:  Immunoglobulin A1 protease Serine protease Prosite pattern;  InterPro: IPR000710 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This group of serine peptidases belong to the MEROPS peptidase family S6 (clan PA(S)). The type sample being the IgA1-specific serine endopeptidase from Neisseria gonorrhoeae []. These cleave prolyl bonds in the hinge regions of immunoglobulin A heavy chains. Similar specificity is shown by the unrelated family of M26 metalloendopeptidases.; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis; PDB: 3SZE_A 3H09_B 3SYJ_A 1WXR_A 3AK5_B.
Probab=59.73  E-value=6.2  Score=52.92  Aligned_cols=32  Identities=41%  Similarity=0.794  Sum_probs=22.9

Q ss_pred             CCCCCCCCeEEEec-CCCcEEEEEEEEcCCCCC
Q psy11546       1467 ACLGDSGGPLMCQE-PDGRWSLMGVTSNGYGCA 1498 (1634)
Q Consensus      1467 aC~GDSGGPLVc~~-~~GrW~LvGIvS~G~gCg 1498 (1634)
                      .=.||||+||+.-+ ..++|+|+|+++.+.+..
T Consensus       213 ~~~GDSGSPlF~YD~~~kKWvl~Gv~~~~~~~~  245 (769)
T PF02395_consen  213 GSPGDSGSPLFAYDKEKKKWVLVGVLSGGNGYN  245 (769)
T ss_dssp             --TT-TT-EEEEEETTTTEEEEEEEEEEECCCC
T ss_pred             cccCcCCCceEEEEccCCeEEEEEEEccccccC
Confidence            45799999998754 467999999999886653


No 66 
>PLN03160 uncharacterized protein; Provisional
Probab=52.72  E-value=20  Score=41.38  Aligned_cols=16  Identities=19%  Similarity=0.002  Sum_probs=13.5

Q ss_pred             cccEEEEEEEEEecCC
Q psy11546         69 GVELIFDSSFRVTAGD   84 (1634)
Q Consensus        69 ~k~~yY~GSFrItn~~   84 (1634)
                      +.+.+..-+++|++-+
T Consensus        65 k~P~~~v~~v~l~~~~   80 (219)
T PLN03160         65 KDPVIKMNGVTVTKLE   80 (219)
T ss_pred             cCCeEEEEEEEEeeee
Confidence            7788999999998866


No 67 
>PF07423 DUF1510:  Protein of unknown function (DUF1510);  InterPro: IPR009988 This family consists of several hypothetical bacterial proteins of around 200 residues in length. The function of this family is unknown.
Probab=48.03  E-value=9.8  Score=43.82  Aligned_cols=28  Identities=21%  Similarity=0.356  Sum_probs=15.5

Q ss_pred             cccccCCCCCccchHHHHHHHHHHHHHH
Q psy11546         32 KSQKSKPSQKCCSVLLASVIFLVLLLVI   59 (1634)
Q Consensus        32 k~t~~~~~r~~~~w~i~li~~lvllvLa   59 (1634)
                      ||..+.++|+...+|=++|.+++|||++
T Consensus         2 Rf~~r~KrRK~N~iLNiaI~IV~lLIii   29 (217)
T PF07423_consen    2 RFQQRQKRRKTNKILNIAIGIVSLLIII   29 (217)
T ss_pred             chhHHHHhhhhhhhHHHHHHHHHHHHHH
Confidence            5666656666666655555544444333


No 68 
>PF14654 Epiglycanin_C:  Mucin, catalytic, TM and cytoplasmic tail region
Probab=38.27  E-value=28  Score=35.31  Aligned_cols=33  Identities=15%  Similarity=0.081  Sum_probs=25.0

Q ss_pred             CCCCccchHHHHHHHHHHHH-HHHHHhhheeccc
Q psy11546         38 PSQKCCSVLLASVIFLVLLL-VITIAALAIYMGV   70 (1634)
Q Consensus        38 ~~r~~~~w~i~li~~lvllv-LaviigLlvyF~k   70 (1634)
                      .+-++.+|-|+||.++.|++ +++.+||.++.++
T Consensus        11 PsGsL~PWeIfLItLasVvvavGl~aGLfFcvR~   44 (106)
T PF14654_consen   11 PSGSLKPWEIFLITLASVVVAVGLFAGLFFCVRN   44 (106)
T ss_pred             cCCCccchHHHHHHHHHHHHHHHHHHHHHHHhhh
Confidence            44468899999998886554 7888888887654


No 69 
>PF12273 RCR:  Chitin synthesis regulation, resistance to Congo red;  InterPro: IPR020999  RCR proteins are ER membrane proteins that regulate chitin deposition in fungal cell walls. Although chitin, a linear polymer of beta-1,4-linked N-acetylglucosamine, constitutes only 2% of the cell wall it plays a vital role in the overall protection of the cell wall against stress, noxious chemicals and osmotic pressure changes. Congo red is a cell wall-disrupting benzidine-type dye extensively used in many cell wall mutant studies that specifically targets chitin in yeast cells and inhibits growth. RCR proteins render the yeasts resistant to Congo red by diminishing the content of chitin in the cell wall []. RCR proteins are probably regulating chitin synthase III interact directly with ubiquitin ligase Rsp5, and the VPEY motif is necessary for this, via interaction with the WW domains of Rsp5 []. 
Probab=34.17  E-value=27  Score=36.95  Aligned_cols=13  Identities=23%  Similarity=0.478  Sum_probs=6.4

Q ss_pred             chHHHHHHHHHHH
Q psy11546         44 SVLLASVIFLVLL   56 (1634)
Q Consensus        44 ~w~i~li~~lvll   56 (1634)
                      +|+|++|||++++
T Consensus         1 RW~l~~iii~~i~   13 (130)
T PF12273_consen    1 RWVLFAIIIVAIL   13 (130)
T ss_pred             CeeeHHHHHHHHH
Confidence            3665555444443


No 70 
>PF11293 DUF3094:  Protein of unknown function (DUF3094);  InterPro: IPR021444  This family of proteins with unknown function appears to be restricted to Gammaproteobacteria. 
Probab=30.66  E-value=54  Score=30.16  Aligned_cols=28  Identities=14%  Similarity=0.069  Sum_probs=22.6

Q ss_pred             CCccchHHHHHHHHHHHHHHHHHhhhee
Q psy11546         40 QKCCSVLLASVIFLVLLLVITIAALAIY   67 (1634)
Q Consensus        40 r~~~~w~i~li~~lvllvLaviigLlvy   67 (1634)
                      +..++|++.++++++++.|+++..|+-+
T Consensus        26 ~PFrP~~Ll~~li~Vv~gl~llS~ll~~   53 (55)
T PF11293_consen   26 KPFRPWRLLIVLIVVVIGLGLLSRLLSR   53 (55)
T ss_pred             CCcchHHHHHHHHHHHHHHHHHHHHHHH
Confidence            3478999999999999988888877643


No 71 
>PF00548 Peptidase_C3:  3C cysteine protease (picornain 3C);  InterPro: IPR000199 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad [].  This signature defines cysteine peptidases belong to MEROPS peptidase family C3 (picornain, clan PA(C)), subfamilies C3A and C3B. The protein fold of this peptidase domain for members of this family resembles that of the serine peptidase, chymotrypsin [], the type example for clan PA. Picornaviral proteins are expressed as a single polyprotein which is cleaved by the viral C3 cysteine protease. The poliovirus polyprotein is selectively cleaved between the Gln-|-Gly bond. In other picornavirus reactions Glu may be substituted for Gln, and Ser or Thr for Gly. ; GO: 0004197 cysteine-type endopeptidase activity, 0006508 proteolysis; PDB: 3SJO_E 2H6M_A 1QA7_C 1HAV_B 2HAL_A 2H9H_A 3QZQ_B 3QZR_A 3R0F_B 3SJ9_A ....
Probab=29.09  E-value=51  Score=36.63  Aligned_cols=28  Identities=25%  Similarity=0.491  Sum_probs=21.5

Q ss_pred             CCCCCCCCeEEEecCCCcEEEEEEEEcCC
Q psy11546       1467 ACLGDSGGPLMCQEPDGRWSLMGVTSNGY 1495 (1634)
Q Consensus      1467 aC~GDSGGPLVc~~~~GrW~LvGIvS~G~ 1495 (1634)
                      +-.|+-||||+... .+...++||-..|.
T Consensus       144 t~~G~CG~~l~~~~-~~~~~i~GiHvaG~  171 (172)
T PF00548_consen  144 TKPGMCGSPLVSRI-GGQGKIIGIHVAGN  171 (172)
T ss_dssp             EETTGTTEEEEESC-GGTTEEEEEEEEEE
T ss_pred             CCCCccCCeEEEee-ccCccEEEEEeccC
Confidence            34689999999865 45679999987664


No 72 
>PF11770 GAPT:  GRB2-binding adapter (GAPT);  InterPro: IPR021082  This entry represents a family of transmembrane proteins which bind the growth factor receptor-bound protein 2 (GRB2) in B cells []. In contrast to other transmembrane adaptor proteins, GAPT, which this entry represents, is not phosphorylated upon BCR ligation. It associates with GRB2 constitutively through its proline-rich region []. 
Probab=26.89  E-value=46  Score=36.34  Aligned_cols=27  Identities=26%  Similarity=0.150  Sum_probs=20.4

Q ss_pred             chHHHHHHHHHHHHHHHHHhhheeccc
Q psy11546         44 SVLLASVIFLVLLLVITIAALAIYMGV   70 (1634)
Q Consensus        44 ~w~i~li~~lvllvLaviigLlvyF~k   70 (1634)
                      ...+++.++++||+|+..||++||.+.
T Consensus         9 sv~i~igi~Ll~lLl~cgiGcvwhwkh   35 (158)
T PF11770_consen    9 SVAISIGISLLLLLLLCGIGCVWHWKH   35 (158)
T ss_pred             hHHHHHHHHHHHHHHHHhcceEEEeec
Confidence            456666677777778889999999844


No 73 
>PRK11901 hypothetical protein; Reviewed
Probab=25.64  E-value=78  Score=38.66  Aligned_cols=42  Identities=24%  Similarity=0.315  Sum_probs=25.4

Q ss_pred             CCCCccccccCCCCCcc------chHHHHHHHHHHHHHHHHHhhheec
Q psy11546         27 PNSSKKSQKSKPSQKCC------SVLLASVIFLVLLLVITIAALAIYM   68 (1634)
Q Consensus        27 ~~s~~k~t~~~~~r~~~------~w~i~li~~lvllvLaviigLlvyF   68 (1634)
                      +-||+|-+|+++.....      --|.+-|=++|||+|++.||-++.-
T Consensus        13 DtSDRrp~Rsr~~~~~pk~~vSRQh~MiGiGilVLlLLIi~IgSALks   60 (327)
T PRK11901         13 DTSDRRPTRSRKSSNGPKLAVSRQHMMIGIGILVLLLLIIAIGSALKS   60 (327)
T ss_pred             CcccCCCcccccCCCCCCchHHHHHHHHHHHHHHHHHHHHHHhhhccC
Confidence            44788877776554322      2244455555666677778877654


No 74 
>PF14155 DUF4307:  Domain of unknown function (DUF4307)
Probab=25.60  E-value=81  Score=32.86  Aligned_cols=42  Identities=14%  Similarity=0.017  Sum_probs=24.5

Q ss_pred             cchHHHHHHHHHHHHHHHHHhhhee-c--cccEEEEEEEEEecCC
Q psy11546         43 CSVLLASVIFLVLLLVITIAALAIY-M--GVELIFDSSFRVTAGD   84 (1634)
Q Consensus        43 ~~w~i~li~~lvllvLaviigLlvy-F--~k~~yY~GSFrItn~~   84 (1634)
                      +.|+++++++++++++++++|+.+. +  ..-..=...|++.+..
T Consensus         3 rr~~~~~~~v~~vv~~~~~~w~~~~~~~~~~v~~~~~gf~vv~d~   47 (112)
T PF14155_consen    3 RRKLVIAGAVLVVVAGAVVAWFGYSQFGSPPVSAEVIGFEVVDDS   47 (112)
T ss_pred             cceeEehHHHHHHHHHHHHhHhhhhhccCCCceEEEEEEEECCCC
Confidence            3455566666666667777777765 4  2223344566666554


No 75 
>PF06024 DUF912:  Nucleopolyhedrovirus protein of unknown function (DUF912);  InterPro: IPR009261 This entry is represented by Autographa californica nuclear polyhedrosis virus (AcMNPV), Orf78; it is a family of uncharacterised viral proteins.
Probab=24.92  E-value=52  Score=33.64  Aligned_cols=27  Identities=11%  Similarity=0.151  Sum_probs=17.9

Q ss_pred             ccchHHHHHHHHHHHHHHHHHhhheec
Q psy11546         42 CCSVLLASVIFLVLLLVITIAALAIYM   68 (1634)
Q Consensus        42 ~~~w~i~li~~lvllvLaviigLlvyF   68 (1634)
                      ...+.++++.+++++++.|++.+++||
T Consensus        58 ~~~~~iili~lls~v~IlVily~IyYF   84 (101)
T PF06024_consen   58 QNNGNIILISLLSFVCILVILYAIYYF   84 (101)
T ss_pred             cccccchHHHHHHHHHHHHHHhhheEE
Confidence            334555566666666677777778888


No 76 
>PF00558 Vpu:  Vpu protein;  InterPro: IPR008187 The Human immunodeficiency virus 1 (HIV-1) Vpu protein acts in the degradation of CD4 in the endoplasmic reticulum and in the enhancement of virion release from the plasma membrane of infected cells [].; GO: 0019076 release of virus from host; PDB: 2JPX_A 1PI8_A 2GOH_A 2GOF_A 1PI7_A 1PJE_A 1VPU_A 2K7Y_A.
Probab=24.85  E-value=58  Score=32.33  Aligned_cols=24  Identities=21%  Similarity=0.476  Sum_probs=14.9

Q ss_pred             hHHHHHHHHHHHHHHHHHhhheec
Q psy11546         45 VLLASVIFLVLLLVITIAALAIYM   68 (1634)
Q Consensus        45 w~i~li~~lvllvLaviigLlvyF   68 (1634)
                      ++++++.+++++++|+++|.++|-
T Consensus         5 ~i~~iialiv~~iiaIvvW~iv~i   28 (81)
T PF00558_consen    5 EILAIIALIVALIIAIVVWTIVYI   28 (81)
T ss_dssp             ---HHHHHHHHHHHHHHHHHHH--
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHH
Confidence            456666677777788888888775


No 77 
>PF01102 Glycophorin_A:  Glycophorin A;  InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Glycophorin A (PAS-2) and glycophorin B (PAS-3) belong to the MNS blood group system and are associated with antigens that include M/N, S/s, U, He, Mi(a), M(c), Vw, Mur, M(g), Vr, M(e), Mt(a), St(a), Ri(a), Cl(a), Ny(a), Hut, Hil, M(v), Far, Mit, Dantu, Hop, Nob, En(a), ENKT, amongst others. Glycophorin A is the major sialoglycoprotein of the erythrocyte membrane []. Structurally, glycophorin A consists of an N-terminal extracellular domain, heavily glycosylated on serine and threonine residues, followed by a transmembrane region and a C-terminal cytoplasmic domain. Other glycophorins in this entry such as Glycophorin B and Glycophorin E represent minor sialoglycoproteins in the erythrocyte membrane.; GO: 0016021 integral to membrane; PDB: 2KPF_B 1AFO_B 2KPE_A.
Probab=23.40  E-value=70  Score=34.05  Aligned_cols=30  Identities=27%  Similarity=0.320  Sum_probs=21.4

Q ss_pred             CCCccchHHHHHHHHHHHHHHHHHhhheec
Q psy11546         39 SQKCCSVLLASVIFLVLLLVITIAALAIYM   68 (1634)
Q Consensus        39 ~r~~~~w~i~li~~lvllvLaviigLlvyF   68 (1634)
                      ......=.|++|++.|++.++++|.|++|+
T Consensus        58 ~h~fs~~~i~~Ii~gv~aGvIg~Illi~y~   87 (122)
T PF01102_consen   58 VHRFSEPAIIGIIFGVMAGVIGIILLISYC   87 (122)
T ss_dssp             SSSSS-TCHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             ccCccccceeehhHHHHHHHHHHHHHHHHH
Confidence            334444467888888888888888888888


No 78 
>PRK10697 DNA-binding transcriptional activator PspC; Provisional
Probab=22.89  E-value=73  Score=33.72  Aligned_cols=25  Identities=20%  Similarity=0.290  Sum_probs=13.3

Q ss_pred             HhHHHHHHHHHHHHHHHhhcC---Cccccc
Q psy11546         96 NLYKEKSKRYKSMIEKLYNAS---VLSPAI  122 (1634)
Q Consensus        96 ~~Fk~La~~IE~mV~~iF~~S---~L~~~Y  122 (1634)
                      .+|+.+.+++++|  +.|-.|   .|+++|
T Consensus        88 ~~~~~~e~Rlr~m--E~yVTS~~f~l~ref  115 (118)
T PRK10697         88 RELAAGEQRLREM--ERYVTSDTFTLRSRF  115 (118)
T ss_pred             HHHHHHHHHHHHH--HHHhcCCCcCHHHHH
Confidence            4455555666665  335555   445544


No 79 
>PF00863 Peptidase_C4:  Peptidase family C4 This family belongs to family C4 of the peptidase classification.;  InterPro: IPR001730 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad [].  Nuclear inclusion A (NIA) proteases from potyviruses are cysteine peptidases belong to the MEROPS peptidase family C4 (NIa protease family, clan PA(C)) [, ].  Potyviruses include plant viruses in which the single-stranded RNA encodes a polyprotein with NIA protease activity, where proteolytic cleavage is specific for Gln+Gly sites. The NIA protease acts on the polyprotein, releasing itself by Gln+Gly cleavage at both the N- and C-termini. It further processes the polyprotein by cleavage at five similar sites in the C-terminal half of the sequence. In addition to its C-terminal protease activity, the NIA protease contains an N-terminal domain that has been implicated in the transcription process []. This peptidase is present in the nuclear inclusion protein of potyviruses.; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 3MMG_B 1Q31_B 1LVB_A 1LVM_A.
Probab=21.95  E-value=4.8e+02  Score=30.92  Aligned_cols=153  Identities=18%  Similarity=0.300  Sum_probs=64.3

Q ss_pred             eeCCCccccccccccCCCCCceEEE--EeEEecCCCCCCCCceeEeeEEEEeCCCCCCCCCcCceEEEEeCCCCCCCcee
Q psy11546       1314 LISDQWLLSAGHCFYRAQDDYWVAR--LGTLRRGTKLPSPYEQLRPISKIILHPQYVDAGFINDISILKMKTPFSNYVRP 1391 (1634)
Q Consensus      1314 LIS~rWVLTAAHCV~~~~~~~~~Vr--LG~~~~~s~~~s~~~Q~~~V~~IiiHP~Yn~~t~~nDIALLkL~~PfSd~VqP 1391 (1634)
                      |.-..||||-+|-+...... +.+.  .|.+....      ..     ++.+||-     ...||.||+|...|...-+-
T Consensus        37 igyG~~iItn~HLf~~nng~-L~i~s~hG~f~v~n------t~-----~lkv~~i-----~~~DiviirmPkDfpPf~~k   99 (235)
T PF00863_consen   37 IGYGSYIITNAHLFKRNNGE-LTIKSQHGEFTVPN------TT-----QLKVHPI-----EGRDIVIIRMPKDFPPFPQK   99 (235)
T ss_dssp             EEETTEEEEEGGGGSSTTCE-EEEEETTEEEEECE------GG-----GSEEEE------TCSSEEEEE--TTS----S-
T ss_pred             EeECCEEEEChhhhccCCCe-EEEEeCceEEEcCC------cc-----ccceEEe-----CCccEEEEeCCcccCCcchh
Confidence            44567999999998664432 3332  23332221      11     1222221     24799999999886655454


Q ss_pred             eecCCCCCCCCCCCeEEEEEecccccCCCcCCCCcEEEEEeeeChhHHhhhccCCCcccCCCCeEEEeecCCCCCCCCCC
Q psy11546       1392 ICLPHPNTPLTDGTLCTVVGWGQLFEIGRVFPDTLQEVQLPIISTAECRKRTLFLPLYRVTENMFCAGFERGGRDACLGD 1471 (1634)
Q Consensus      1392 ICLP~~~~~l~~G~~c~VvGWG~t~e~g~~~s~~Lq~v~VpIIs~eeC~~~~~~~~~~~It~sm~CAG~~~gg~daC~GD 1471 (1634)
                      +++-    .+..+..+.++|--......   .....+... +.+...               ..|=.    -...+=.||
T Consensus       100 l~FR----~P~~~e~v~mVg~~fq~k~~---~s~vSesS~-i~p~~~---------------~~fWk----HwIsTk~G~  152 (235)
T PF00863_consen  100 LKFR----APKEGERVCMVGSNFQEKSI---SSTVSESSW-IYPEEN---------------SHFWK----HWISTKDGD  152 (235)
T ss_dssp             --B--------TT-EEEEEEEECSSCCC---EEEEEEEEE-EEEETT---------------TTEEE----E-C---TT-
T ss_pred             hhcc----CCCCCCEEEEEEEEEEcCCe---eEEECCceE-EeecCC---------------CCeeE----EEecCCCCc
Confidence            4442    23345667777764321111   111111110 011000               00110    112334688


Q ss_pred             CCCeEEEecCCCcEEEEEEEEcCCCCCCCCCCeEEEEcCC-chHHH
Q psy11546       1472 SGGPLMCQEPDGRWSLMGVTSNGYGCARANRPGVYTKVSN-YIPWL 1516 (1634)
Q Consensus      1472 SGGPLVc~~~~GrW~LvGIvS~G~gCg~~~~PgVYTrVs~-YlDWI 1516 (1634)
                      -|.|||... +|  .++||-|.+..-.   .-.+|+.+.. +.+-+
T Consensus       153 CG~PlVs~~-Dg--~IVGiHsl~~~~~---~~N~F~~f~~~f~~~~  192 (235)
T PF00863_consen  153 CGLPLVSTK-DG--KIVGIHSLTSNTS---SRNYFTPFPDDFEEFY  192 (235)
T ss_dssp             TT-EEEETT-T----EEEEEEEEETTT---SSEEEEE--TTHHHHH
T ss_pred             cCCcEEEcC-CC--cEEEEEcCccCCC---CeEEEEcCCHHHHHHH
Confidence            899999866 33  8999999765332   2347777754 44433


Done!