Query         psy1807
Match_columns 618
No_of_seqs    648 out of 3229
Neff          8.7 
Searched_HMMs 46136
Date          Fri Aug 16 21:50:42 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy1807.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/1807hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PHA02927 secreted complement-b 100.0 1.7E-41 3.7E-46  338.8  22.4  234  272-512    15-262 (263)
  2 PHA02927 secreted complement-b 100.0 1.3E-38 2.8E-43  318.0  23.2  236  209-454    12-262 (263)
  3 PHA02954 EEV membrane glycopro 100.0 4.8E-35   1E-39  293.4  21.5  225  270-514    13-240 (317)
  4 PHA02639 EEV host range protei 100.0 5.2E-33 1.1E-37  277.8  21.8  181  332-515    16-209 (295)
  5 PHA02954 EEV membrane glycopro 100.0 1.4E-32   3E-37  275.6  21.8  216  218-456    19-240 (317)
  6 PHA02639 EEV host range protei 100.0 1.1E-30 2.3E-35  261.1  18.3  180  273-455    17-207 (295)
  7 PHA02831 EEV host range protei 100.0 2.4E-30 5.3E-35  252.6  18.4  161  352-521    38-205 (268)
  8 PHA02831 EEV host range protei 100.0 6.5E-28 1.4E-32  235.6  16.4  160  235-400    38-204 (268)
  9 cd03597 CLECT_attractin_like C  99.9   9E-26   2E-30  201.7  13.3  125   84-215     3-129 (129)
 10 cd03588 CLECT_CSPGs C-type lec  99.9 1.5E-25 3.3E-30  199.2  12.2  120   84-215     3-123 (124)
 11 cd03589 CLECT_CEL-1_like C-typ  99.9 2.2E-25 4.7E-30  202.0  12.6  128   84-215     3-137 (137)
 12 cd03599 CLECT_DGCR2_like C-typ  99.9 1.7E-24 3.8E-29  196.4  12.9  128   84-215     3-153 (153)
 13 cd03590 CLECT_DC-SIGN_like C-t  99.9 6.3E-24 1.4E-28  189.5  12.8  121   84-215     3-126 (126)
 14 cd03594 CLECT_REG-1_like C-typ  99.9 1.3E-23 2.8E-28  188.3  12.3  124   84-215     3-129 (129)
 15 cd03596 CLECT_tetranectin_like  99.9   2E-23 4.3E-28  187.0  12.9  124   84-215     3-129 (129)
 16 PHA02817 EEV Host range protei  99.9 1.2E-22 2.6E-27  193.3  14.8  123  389-514    17-148 (225)
 17 cd03593 CLECT_NK_receptors_lik  99.9 1.1E-22 2.4E-27  178.6  11.7  114   84-215     3-116 (116)
 18 cd03598 CLECT_EMBP_like C-type  99.9 1.5E-22 3.3E-27  178.0  11.6  116   91-215     1-117 (117)
 19 cd03591 CLECT_collectin_like C  99.9 2.4E-22 5.1E-27  175.9  11.4  114   91-215     1-114 (114)
 20 cd03603 CLECT_VCBS A bacterial  99.9 4.4E-22 9.6E-27  175.1  12.0  113   92-214     1-117 (118)
 21 cd03601 CLECT_TC14_like C-type  99.9 1.7E-21 3.6E-26  171.7  11.5  114   94-215     3-119 (119)
 22 cd03592 CLECT_selectins_like C  99.9 1.5E-21 3.4E-26  171.1  10.8  113   94-215     3-115 (115)
 23 PHA02817 EEV Host range protei  99.9 2.9E-21 6.2E-26  183.9  12.9  122  272-394    18-148 (225)
 24 PHA02642 C-type lectin-like pr  99.9 3.6E-21 7.8E-26  181.2  11.5  116   75-217    87-202 (216)
 25 cd03600 CLECT_thrombomodulin_l  99.8 6.9E-21 1.5E-25  173.2  11.8  124   90-215     3-139 (141)
 26 smart00034 CLECT C-type lectin  99.8 9.6E-21 2.1E-25  168.2  11.8  123   84-214     3-126 (126)
 27 cd03595 CLECT_chondrolectin_li  99.8 1.1E-20 2.4E-25  173.2  12.0  122   89-215     8-149 (149)
 28 PHA02953 IEV and EEV membrane   99.8 1.7E-20 3.7E-25  173.0  10.1  116   74-216    45-167 (170)
 29 cd03602 CLECT_1 C-type lectin   99.8 4.8E-20   1E-24  159.7  10.9  108   92-215     1-108 (108)
 30 PHA03097 C-type lectin-like pr  99.8 1.8E-19 3.9E-24  165.0  11.0  111   75-216    45-155 (157)
 31 TIGR00864 PCC polycystin catio  99.8 1.2E-19 2.7E-24  217.6  12.5  129   74-219   316-450 (2740)
 32 PHA02867 C-type lectin protein  99.8 4.5E-18 9.7E-23  154.6  12.0  109   74-216    47-155 (167)
 33 cd00037 CLECT C-type lectin (C  99.6 3.2E-15   7E-20  129.9  11.3  115   92-215     1-116 (116)
 34 PF00059 Lectin_C:  Lectin C-ty  99.6 5.2E-15 1.1E-19  126.8   8.7  100  101-215     2-105 (105)
 35 PHA02911 C-type lectin-like pr  99.3 5.9E-12 1.3E-16  114.6   8.1  106   81-217   102-211 (213)
 36 cd00033 CCP Complement control  99.0 9.8E-10 2.1E-14   82.8   6.0   55  458-512     1-57  (57)
 37 PF00084 Sushi:  Sushi domain (  98.9 8.8E-10 1.9E-14   82.8   4.5   56  396-453     1-56  (56)
 38 smart00032 CCP Domain abundant  98.9 2.1E-09 4.6E-14   80.9   6.1   54  458-511     1-57  (57)
 39 cd00033 CCP Complement control  98.9 2.5E-09 5.5E-14   80.5   5.7   57  396-454     1-57  (57)
 40 smart00032 CCP Domain abundant  98.9 2.7E-09   6E-14   80.2   5.9   56  396-453     1-57  (57)
 41 PF00084 Sushi:  Sushi domain (  98.9 1.6E-09 3.5E-14   81.3   4.1   56  278-333     1-56  (56)
 42 PF05473 Herpes_UL45:  UL45 pro  98.2 3.7E-06 7.9E-11   80.1   8.4   55   74-134    83-137 (200)
 43 PHA02672 ORF110 EEV glycoprote  96.4  0.0037 7.9E-08   54.6   4.0   83   71-170    45-127 (166)
 44 KOG4646|consensus               95.5  0.0035 7.5E-08   54.6   0.0   41    1-42     89-129 (173)
 45 PHA03093 EEV glycoprotein; Pro  95.5   0.027 5.8E-07   51.5   5.6   47   87-136   104-150 (185)
 46 PHA02673 ORF109 EEV glycoprote  95.4    0.02 4.3E-07   51.0   4.2   54   73-136    74-127 (161)
 47 PF05966 Chordopox_A33R:  Chord  94.1    0.03 6.5E-07   51.8   2.1   33   87-120   108-140 (190)
 48 PF12191 stn_TNFRSF12A:  Tumour  92.0   0.074 1.6E-06   45.5   1.3   19  495-513    33-51  (129)
 49 cd03519 Link_domain_HAPLN_modu  90.4    0.28 6.1E-06   39.9   3.1   23   99-121     8-30  (91)
 50 cd03520 Link_domain_CSPGs_modu  88.2    0.48   1E-05   39.1   3.0   23   99-121     8-30  (96)
 51 KOG4564|consensus               86.9    0.77 1.7E-05   49.4   4.5   35  476-510    79-118 (473)
 52 PTZ00382 Variant-specific surf  85.6    0.49 1.1E-05   39.5   1.8   11  483-493    41-51  (96)
 53 cd01102 Link_Domain The link d  84.9    0.68 1.5E-05   38.1   2.2   21  101-121    13-33  (92)
 54 smart00445 LINK Link (Hyaluron  84.6    0.72 1.6E-05   38.1   2.3   21  101-121    14-34  (94)
 55 cd03518 Link_domain_HAPLN_modu  81.9       1 2.2E-05   37.1   2.2   21  101-121    13-33  (95)
 56 cd03515 Link_domain_TSG_6_like  81.7    0.99 2.1E-05   37.1   2.0   21  101-121    13-33  (93)
 57 PF14704 DERM:  Dermatopontin    81.6     4.8  0.0001   36.8   6.7  138  239-392     3-149 (152)
 58 PF00193 Xlink:  Extracellular   80.9     1.6 3.4E-05   36.1   2.9   21  101-121    13-33  (92)
 59 PF07204 Orthoreo_P10:  Orthore  80.2    0.88 1.9E-05   36.8   1.2   26  537-562    47-72  (98)
 60 cd03521 Link_domain_KIAA0527_l  79.9     1.4   3E-05   35.7   2.2   21  101-121    13-33  (95)
 61 KOG4289|consensus               78.6     7.4 0.00016   46.7   8.2  154  239-449  1737-1902(2531)
 62 cd03517 Link_domain_CSPGs_modu  77.6     1.5 3.2E-05   36.3   1.8   21  101-121    13-33  (95)
 63 PF01034 Syndecan:  Syndecan do  74.8    0.97 2.1E-05   34.1   0.0   23  542-564    22-44  (64)
 64 smart00185 ARM Armadillo/beta-  74.3     1.3 2.9E-05   29.8   0.6   38    2-40      3-40  (41)
 65 PF15050 SCIMP:  SCIMP protein   73.7     1.2 2.6E-05   37.8   0.3   14  541-554    16-29  (133)
 66 PF02009 Rifin_STEVOR:  Rifin/s  72.3     1.4 2.9E-05   44.9   0.4   18  547-564   273-290 (299)
 67 PF14575 EphA2_TM:  Ephrin type  70.8     1.5 3.3E-05   34.7   0.3   15  601-615    56-70  (75)
 68 TIGR01478 STEVOR variant surfa  70.2     2.8 6.1E-05   41.5   2.0   32  533-564   259-292 (295)
 69 cd03516 Link_domain_CD44_like   69.8     3.3 7.2E-05   36.9   2.2   21  101-121    18-38  (144)
 70 PTZ00370 STEVOR; Provisional    69.5     2.6 5.6E-05   41.9   1.6   31  534-564   256-288 (296)
 71 PF08693 SKG6:  Transmembrane a  69.0     3.4 7.3E-05   28.2   1.6   23  531-553    11-33  (40)
 72 PF00514 Arm:  Armadillo/beta-c  68.7     1.1 2.4E-05   30.6  -0.8   38    2-40      3-40  (41)
 73 PF08374 Protocadherin:  Protoc  68.6     6.2 0.00013   37.5   3.8   29  529-557    35-64  (221)
 74 PF01102 Glycophorin_A:  Glycop  67.1       5 0.00011   34.9   2.7   15  542-556    75-89  (122)
 75 KOG1219|consensus               65.4      44 0.00096   42.9  10.6   14  594-607  4038-4051(4289)
 76 PF14704 DERM:  Dermatopontin    64.9      17 0.00038   33.1   6.0  127  297-447     2-144 (152)
 77 PF02793 HRM:  Hormone receptor  64.6     7.4 0.00016   29.8   3.0   36  414-449    17-55  (66)
 78 smart00008 HormR Domain presen  60.8      10 0.00022   29.5   3.2   34  474-507    18-54  (70)
 79 smart00008 HormR Domain presen  59.0     9.8 0.00021   29.6   2.8   32  237-268    18-53  (70)
 80 KOG4297|consensus               57.7      30 0.00066   29.0   6.3   94   91-194    99-192 (207)
 81 PF12273 RCR:  Chitin synthesis  56.0     8.7 0.00019   34.0   2.3   26  538-563     4-29  (130)
 82 PF04478 Mid2:  Mid2 like cell   55.7     3.7 8.1E-05   36.9  -0.1   15  532-546    51-65  (154)
 83 PF15102 TMEM154:  TMEM154 prot  55.0       4 8.7E-05   36.4   0.0   21  530-550    58-78  (146)
 84 PTZ00046 rifin; Provisional     53.4     7.6 0.00016   40.2   1.7   18  547-564   332-349 (358)
 85 PF05454 DAG1:  Dystroglycan (D  53.2     4.5 9.7E-05   40.9   0.0   29  534-562   150-178 (290)
 86 PF02793 HRM:  Hormone receptor  52.2      12 0.00026   28.6   2.3   34  474-507    17-55  (66)
 87 TIGR01477 RIFIN variant surfac  52.0     8.3 0.00018   39.8   1.7   18  547-564   327-344 (353)
 88 PF15330 SIT:  SHP2-interacting  51.5      22 0.00048   30.2   3.9    7  594-600    47-53  (107)
 89 PF02439 Adeno_E3_CR2:  Adenovi  49.9      16 0.00034   24.6   2.1   16  546-561    19-34  (38)
 90 KOG0994|consensus               49.1      44 0.00096   39.7   6.9  123  255-398   840-971 (1758)
 91 PF15069 FAM163:  FAM163 family  48.7      36 0.00077   30.3   4.8   26  532-557     4-31  (143)
 92 PF12768 Rax2:  Cortical protei  47.6      30 0.00064   35.1   4.9   36  528-563   223-261 (281)
 93 PF11980 DUF3481:  Domain of un  47.2      22 0.00048   28.4   3.0   31  534-564    19-49  (87)
 94 PF12662 cEGF:  Complement Clr-  45.3      14 0.00031   22.2   1.3   19  481-500     3-21  (24)
 95 smart00607 FTP eel-Fucolectin   44.9      20 0.00044   32.6   2.9   26   42-67    121-146 (151)
 96 KOG1025|consensus               44.8      37  0.0008   39.3   5.4   30  235-264   274-304 (1177)
 97 TIGR03370 PEPCTERM_Roseo varia  44.1      37 0.00081   20.8   2.9   20  540-559     6-25  (26)
 98 KOG3653|consensus               43.5      98  0.0021   33.5   7.9   29  434-464    78-109 (534)
 99 PF05393 Hum_adeno_E3A:  Human   43.4      42 0.00092   27.1   4.0   20  542-561    42-61  (94)
100 PF03302 VSP:  Giardia variant-  41.9      20 0.00044   38.3   2.9   31  482-513   304-337 (397)
101 PF02038 ATP1G1_PLM_MAT8:  ATP1  41.3      13 0.00029   26.6   0.9   26  538-563    20-45  (50)
102 KOG1219|consensus               40.9 1.5E+02  0.0033   38.6   9.8   26  302-328  3742-3767(4289)
103 KOG4289|consensus               39.7   1E+02  0.0022   37.9   7.9  139  361-513  1740-1909(2531)
104 PF02009 Rifin_STEVOR:  Rifin/s  39.3      16 0.00034   37.3   1.4   27  538-564   261-287 (299)
105 TIGR03778 VPDSG_CTERM VPDSG-CT  37.9      46   0.001   20.4   2.7   19  539-557     6-24  (26)
106 PF06084 Cytomega_TRL10:  Cytom  35.5      16 0.00034   30.8   0.6   17  593-609   115-135 (150)
107 TIGR02595 PEP_exosort PEP-CTER  34.4      48   0.001   20.3   2.5   12  543-554     9-20  (26)
108 PF07010 Endomucin:  Endomucin;  33.1 1.2E+02  0.0026   29.3   6.0   49  545-594   203-251 (259)
109 KOG4564|consensus               32.6      45 0.00097   36.3   3.6   38  415-452    78-118 (473)
110 PF08114 PMP1_2:  ATPase proteo  32.1      46   0.001   22.7   2.2   15  547-561    24-38  (43)
111 PHA03265 envelope glycoprotein  30.9      42 0.00092   34.4   2.9    8  528-535   347-354 (402)
112 PHA03242 envelope glycoprotein  28.2      52  0.0011   35.2   3.1   27  589-617   402-428 (428)
113 PF12606 RELT:  Tumour necrosis  27.9 1.3E+02  0.0028   21.8   4.1   24  541-564     8-31  (50)
114 PF12877 DUF3827:  Domain of un  27.7      91   0.002   34.9   4.9   40  528-567   264-305 (684)
115 PF07979 Intimin_C:  Intimin C-  27.7      42 0.00091   28.1   1.8   31  100-132    11-41  (101)
116 PF15345 TMEM51:  Transmembrane  27.5 1.1E+02  0.0024   29.7   4.9   25  537-561    65-89  (233)
117 PF06523 DUF1106:  Protein of u  26.3      18 0.00038   27.8  -0.5   22  594-615    69-90  (91)
118 PF04689 S1FA:  DNA binding pro  26.2 1.7E+02  0.0037   22.2   4.5   24  536-559    18-41  (69)
119 PF01708 Gemini_mov:  Geminivir  26.1 1.2E+02  0.0026   24.7   4.1   12  558-569    67-78  (91)
120 PF13268 DUF4059:  Protein of u  25.8      55  0.0012   25.4   2.0   25  539-563    15-39  (72)
121 PF08374 Protocadherin:  Protoc  25.1      90  0.0019   29.9   3.7   25  530-554    40-64  (221)
122 PF05545 FixQ:  Cbb3-type cytoc  24.9      84  0.0018   22.4   2.8   15  546-560    20-34  (49)
123 PF06247 Plasmod_Pvs28:  Plasmo  24.8      54  0.0012   30.7   2.2   52  300-369    68-119 (197)
124 PTZ00046 rifin; Provisional     24.8      48   0.001   34.5   2.1   23  542-564   324-346 (358)
125 PF01299 Lamp:  Lysosome-associ  24.8      24 0.00051   36.3  -0.1   13  547-559   288-300 (306)
126 TIGR01477 RIFIN variant surfac  24.4      49  0.0011   34.3   2.1   23  542-564   319-341 (353)
127 KOG0196|consensus               24.3 1.2E+02  0.0025   35.2   5.0   29  305-333   231-261 (996)
128 PTZ00382 Variant-specific surf  24.3      50  0.0011   27.5   1.8    7  424-430     9-15  (96)
129 PF14851 FAM176:  FAM176 family  24.2      20 0.00042   32.6  -0.7   27  528-554    21-47  (153)
130 PF08391 Ly49:  Ly49-like prote  24.0      44 0.00095   29.0   1.4   13   84-96    107-119 (119)
131 PF07589 VPEP:  PEP-CTERM motif  23.3   1E+02  0.0022   18.7   2.5   12  542-553     9-20  (25)
132 PF10577 UPF0560:  Uncharacteri  21.5      77  0.0017   36.5   3.0   18  236-253    46-64  (807)
133 PRK11056 hypothetical protein;  20.9 1.2E+02  0.0026   26.1   3.4   26  538-563    93-118 (120)
134 KOG0994|consensus               20.2 2.4E+02  0.0052   34.0   6.5   68  241-311   932-1008(1758)

No 1  
>PHA02927 secreted complement-binding protein; Provisional
Probab=100.00  E-value=1.7e-41  Score=338.80  Aligned_cols=234  Identities=29%  Similarity=0.637  Sum_probs=210.8

Q ss_pred             ceeeecCCCCCccCCeeEEec-----CCCcccCCEEEEEeCCCceec--CCCeEEeCCCCCcCCCCCcccccccCCCCCc
Q psy1807         272 TCQYVDCGKLEHIEHGTVTLE-----TTRTTHGAVAIYACHENYTLI--GETRRVCGDGGKWNGTEPQCLFDWCAEPPQI  344 (618)
Q Consensus       272 ~C~~~~C~~~~~~~ng~~~~~-----~~~~~~g~~~~~~C~~Gy~l~--G~~~~~C~~~G~Ws~~~p~C~~~~C~~p~~~  344 (618)
                      .|.++.|+.|..+.||.+..+     ...|.+|++++|+|++||.+.  |...++|+.+| |+. .|.|+.+.|+.|+.+
T Consensus        15 ~c~~~~c~~~~~~~~~~~~~~~~~~~~~~y~~g~~v~y~C~~Gy~~~~~g~~~~~C~~~g-Ws~-~p~C~~~~C~~p~~i   92 (263)
T PHA02927         15 GCVLSCCTIPSRPINMKFKNSVETDANANYNIGDTIEYLCLPGYRKQKMGPIYAKCTGTG-WTL-FNQCIKRRCPSPRDI   92 (263)
T ss_pred             HHHhccCCCCCcccceeeccccccccCceeCCCCEEEEEeCCCceecCCCccEEEecCCC-CCC-CCcEEeCCCcCCCCC
Confidence            477889999999999998732     346889999999999999975  77889999888 995 799999999999999


Q ss_pred             cccEEEecccccCceeeecCCCCcEEcCCceEEEccCC----eecCCCCceeeecCCCCCccCCCceEeecCcccccceE
Q psy1807         345 SGGIVTTSGRRTGSVATYSCEPGFILFGSNELKCGLGG----DWSGKAPTCKYVDCGSPANIDNGHYSLLNGTTTHGSVA  420 (618)
Q Consensus       345 ~~g~~~~~~~~~g~~~~~~C~~Gy~l~G~~~~~C~~~G----~Ws~~~p~C~~~~C~~p~~~~~g~~~~~~~~~~~g~~~  420 (618)
                      .||.+....+.+|++|+|+|++||.|.|...++|+.+|    +|++..|.|+.+.|+.|+.+.||.+......|.+|++|
T Consensus        93 ~NG~~~~~~~~~G~~v~y~C~~Gy~l~G~~~~~C~~~~~g~~~Ws~~~P~C~~~~C~~P~~~~nG~~~~~~~~y~~g~~v  172 (263)
T PHA02927         93 DNGQLDIGGVDFGSSITYSCNSGYQLIGESKSYCELGSTGSMVWNPEAPICESVKCQSPPSISNGRHNGYEDFYTDGSVV  172 (263)
T ss_pred             CCCEEeCCCccCCCEEEEECCCCCEEcCCCeeEEEeCCCCcceECCCCCccccccCCCCCCCCCcEEcCCcccccCCCEE
Confidence            99999888889999999999999999999999999753    89999999999999999999999987655568899999


Q ss_pred             EEEcCCCceeccCCcceEEeccCCeeeCCCCceeeecCCCCCcCCCcEEEc---ccCCCCcEEEEEcCCCCeeeCCceeE
Q psy1807         421 EYTCDDDYWLEAGAMRRQVCMRDGKWSGDTPSCELITCPEPEVPKGGYVVG---YDLNVHSTLEFHCEPGHLLVGQASLK  497 (618)
Q Consensus       421 ~~~C~~Gy~l~~~G~~~~~C~~~G~Ws~~~p~C~~~~C~~p~~~~~g~~~~---~~~~~g~~v~~~C~~Gy~l~G~~~~~  497 (618)
                      +|+|++||.|.  |...++|. +|+|+. +|.|+.+.|+.|.++ ||.+..   ..|.+|++|+|+|++||+|.|++.++
T Consensus       173 ~y~C~~Gy~l~--G~~~~~C~-~G~Ws~-~P~C~~v~C~~P~i~-ng~~~~~~k~~y~~g~~v~y~C~~Gy~l~G~~~~~  247 (263)
T PHA02927        173 TYSCNSGYSLI--GNSGVLCS-GGEWSD-PPTCQIVKCPHPTIS-NGYLSSGFKRSYSYNDNVDFKCKYGYKLSGSSSST  247 (263)
T ss_pred             EEECCCCCEEC--CCCeeEEC-CCccCC-CCeEeEeECcCCCCC-CCEEecCCCCccccCCEEEEECCCCCeEcCCCCeE
Confidence            99999999999  99999997 899997 799999999998765 565543   36899999999999999999999999


Q ss_pred             eCcCCcccCCCCceE
Q psy1807         498 CTRQGDWDNMSPQFI  512 (618)
Q Consensus       498 C~~~G~Ws~~~P~C~  512 (618)
                      |+++|+|++++|+|+
T Consensus       248 C~~~g~Ws~~~P~C~  262 (263)
T PHA02927        248 CSPGNTWQPELPKCV  262 (263)
T ss_pred             ECCCCEECCCCCeec
Confidence            999999999999996


No 2  
>PHA02927 secreted complement-binding protein; Provisional
Probab=100.00  E-value=1.3e-38  Score=318.03  Aligned_cols=236  Identities=31%  Similarity=0.643  Sum_probs=207.5

Q ss_pred             ccccccCCCCCCCCCCcccceeeec------ccccccCCEEEEEcCCCceec--CCceeEecCCcccCCCCceeeecCCC
Q psy1807         209 LHWICQHNPANCGSPDRHLNTTFVG------TVNTKLGSTISYACPEGNMLV--GSATRTCKEGFWTGVAPTCQYVDCGK  280 (618)
Q Consensus       209 ~~~iC~~~~~~C~~p~~~~n~~~~~------~~~~~~G~~~~y~C~~Gy~l~--g~~~~~C~~g~Ws~~~p~C~~~~C~~  280 (618)
                      +.+-|.  +..|+.|..+.|+.+..      ...|.+|++|+|+|++||.+.  |...++|+.+.|+. .|.|+.+.|+.
T Consensus        12 ~~~~c~--~~~c~~~~~~~~~~~~~~~~~~~~~~y~~g~~v~y~C~~Gy~~~~~g~~~~~C~~~gWs~-~p~C~~~~C~~   88 (263)
T PHA02927         12 LGIGCV--LSCCTIPSRPINMKFKNSVETDANANYNIGDTIEYLCLPGYRKQKMGPIYAKCTGTGWTL-FNQCIKRRCPS   88 (263)
T ss_pred             HHHHHH--hccCCCCCcccceeeccccccccCceeCCCCEEEEEeCCCceecCCCccEEEecCCCCCC-CCcEEeCCCcC
Confidence            345565  47799999999988752      247899999999999999874  78889999555994 79999999999


Q ss_pred             CCccCCeeEEecCCCcccCCEEEEEeCCCceecCCCeEEeCCCC----CcCCCCCcccccccCCCCCccccEEEec--cc
Q psy1807         281 LEHIEHGTVTLETTRTTHGAVAIYACHENYTLIGETRRVCGDGG----KWNGTEPQCLFDWCAEPPQISGGIVTTS--GR  354 (618)
Q Consensus       281 ~~~~~ng~~~~~~~~~~~g~~~~~~C~~Gy~l~G~~~~~C~~~G----~Ws~~~p~C~~~~C~~p~~~~~g~~~~~--~~  354 (618)
                      |+.+.||.+...  .+.+|++++|+|++||+|.|...++|+.+|    +|++..|.|+.+.|+.|+.+.||.+...  .+
T Consensus        89 p~~i~NG~~~~~--~~~~G~~v~y~C~~Gy~l~G~~~~~C~~~~~g~~~Ws~~~P~C~~~~C~~P~~~~nG~~~~~~~~y  166 (263)
T PHA02927         89 PRDIDNGQLDIG--GVDFGSSITYSCNSGYQLIGESKSYCELGSTGSMVWNPEAPICESVKCQSPPSISNGRHNGYEDFY  166 (263)
T ss_pred             CCCCCCCEEeCC--CccCCCEEEEECCCCCEEcCCCeeEEEeCCCCcceECCCCCccccccCCCCCCCCCcEEcCCcccc
Confidence            999999998753  478999999999999999999999998753    7999999999999999999999988653  36


Q ss_pred             ccCceeeecCCCCcEEcCCceEEEccCCeecCCCCceeeecCCCCCccCCCceEee-cCcccccceEEEEcCCCceeccC
Q psy1807         355 RTGSVATYSCEPGFILFGSNELKCGLGGDWSGKAPTCKYVDCGSPANIDNGHYSLL-NGTTTHGSVAEYTCDDDYWLEAG  433 (618)
Q Consensus       355 ~~g~~~~~~C~~Gy~l~G~~~~~C~~~G~Ws~~~p~C~~~~C~~p~~~~~g~~~~~-~~~~~~g~~~~~~C~~Gy~l~~~  433 (618)
                      .+|++|+|+|++||.|.|...++|+ +|+|+. +|.|+.+.|+.|. +.||.+... ...|.+|++|+|+|++||.|.  
T Consensus       167 ~~g~~v~y~C~~Gy~l~G~~~~~C~-~G~Ws~-~P~C~~v~C~~P~-i~ng~~~~~~k~~y~~g~~v~y~C~~Gy~l~--  241 (263)
T PHA02927        167 TDGSVVTYSCNSGYSLIGNSGVLCS-GGEWSD-PPTCQIVKCPHPT-ISNGYLSSGFKRSYSYNDNVDFKCKYGYKLS--  241 (263)
T ss_pred             cCCCEEEEECCCCCEECCCCeeEEC-CCccCC-CCeEeEeECcCCC-CCCCEEecCCCCccccCCEEEEECCCCCeEc--
Confidence            7899999999999999999999997 899997 7999999999875 778887643 356899999999999999999  


Q ss_pred             CcceEEeccCCeeeCCCCcee
Q psy1807         434 AMRRQVCMRDGKWSGDTPSCE  454 (618)
Q Consensus       434 G~~~~~C~~~G~Ws~~~p~C~  454 (618)
                      |++.++|+.+|+|+++.|+|.
T Consensus       242 G~~~~~C~~~g~Ws~~~P~C~  262 (263)
T PHA02927        242 GSSSSTCSPGNTWQPELPKCV  262 (263)
T ss_pred             CCCCeEECCCCEECCCCCeec
Confidence            999999999999999999995


No 3  
>PHA02954 EEV membrane glycoprotein; Provisional
Probab=100.00  E-value=4.8e-35  Score=293.40  Aligned_cols=225  Identities=20%  Similarity=0.487  Sum_probs=190.0

Q ss_pred             CCceeeecCCCCCccCCeeEEecCCCcccCCEEEEEeCCCceecCCCeEEeCCCCCcCCCCCcccccccCCCCCccccEE
Q psy1807         270 APTCQYVDCGKLEHIEHGTVTLETTRTTHGAVAIYACHENYTLIGETRRVCGDGGKWNGTEPQCLFDWCAEPPQISGGIV  349 (618)
Q Consensus       270 ~p~C~~~~C~~~~~~~ng~~~~~~~~~~~g~~~~~~C~~Gy~l~G~~~~~C~~~G~Ws~~~p~C~~~~C~~p~~~~~g~~  349 (618)
                      .+.|....|+.|. +.||.+......|.+|++++|.|++||.+.|. .++|..+ .|++..| |. ..|+.|..+.++..
T Consensus        13 ~~~~v~~~C~~P~-~~ng~~~~~~~~Y~~gd~V~y~C~~GY~l~~~-~itC~~~-~WS~~~p-C~-k~C~~p~~~~~~~~   87 (317)
T PHA02954         13 LPAVVYSTCTVPT-MNNAKLTSTETSFNDKQKVTFTCDSGYYSLDP-NAVCETD-KWKYENP-CK-KMCTVSDYVSELYD   87 (317)
T ss_pred             HHHHHhCcCCCCC-CCcCEEcCCcccccCCCEEEEEcCCCcccCCC-CeEECCC-cccCCCC-CC-CcCCCCcccccccc
Confidence            4567788999864 57888776545788899999999999998875 6999765 6998777 97 57988766665543


Q ss_pred             EecccccCceeeecCCCCcEEcCCceEEEcc---CCeecCCCCceeeecCCCCCccCCCceEeecCcccccceEEEEcCC
Q psy1807         350 TTSGRRTGSVATYSCEPGFILFGSNELKCGL---GGDWSGKAPTCKYVDCGSPANIDNGHYSLLNGTTTHGSVAEYTCDD  426 (618)
Q Consensus       350 ~~~~~~~g~~~~~~C~~Gy~l~G~~~~~C~~---~G~Ws~~~p~C~~~~C~~p~~~~~g~~~~~~~~~~~g~~~~~~C~~  426 (618)
                       ...|.+|++|+|+|++||.     .++|+.   ++.|+. .|.|+.+.|++| .+.||.+......|.+|++|+|+|++
T Consensus        88 -~~~y~~G~~V~ysC~~Gy~-----~~~C~~~g~~~~WS~-~ptC~~i~C~pP-~i~NG~~~~~~~~Y~yGd~VtysC~~  159 (317)
T PHA02954         88 -KPLYEVNSTITLICKDETK-----YFRCEEKNGNTSWND-TVTCPNAECQPL-QLEHGSCQPVKEKYSFGEHITINCDV  159 (317)
T ss_pred             -CCCccCCCEEEEEECCCCc-----EEEeCCCCCCCccCC-CCeecceECCCC-CCCCCeeccCCCceeCCCEEEEEcCC
Confidence             3468899999999999994     578975   468996 679999999755 58899987656679999999999999


Q ss_pred             CceeccCCcceEEeccCCeeeCCCCceeeecCCCCCcCCCcEEEcccCCCCcEEEEEcCCCCeeeCCceeEeCcCCcccC
Q psy1807         427 DYWLEAGAMRRQVCMRDGKWSGDTPSCELITCPEPEVPKGGYVVGYDLNVHSTLEFHCEPGHLLVGQASLKCTRQGDWDN  506 (618)
Q Consensus       427 Gy~l~~~G~~~~~C~~~G~Ws~~~p~C~~~~C~~p~~~~~g~~~~~~~~~g~~v~~~C~~Gy~l~G~~~~~C~~~G~Ws~  506 (618)
                      ||.|.  |++.++|+.+ .|+ +.|.|+. .|+.|. +.||.+.+..|.+|++|+|+|++||.|.|+..++|+ +|+|++
T Consensus       160 GY~L~--Gs~~i~C~~~-~Ws-~~P~C~~-~C~~P~-i~NG~~sg~~y~~Gd~Vtf~C~~Gy~L~Gs~~itC~-~g~Ws~  232 (317)
T PHA02954        160 GYEVI--GASYISCTAN-SWN-VIPSCQQ-KCDIPS-LSNGLISGSTFSIGGVIHLSCKSGFTLTGSPSSTCI-DGKWNP  232 (317)
T ss_pred             CCEEC--cCCeeEECCC-cCC-CCCcccc-ccCCCC-CCCCeEcCCCcccCCEEEEECCCCCeECCCcceEEC-CCeEeC
Confidence            99999  9999999977 698 6799986 698764 567877777799999999999999999999999998 999999


Q ss_pred             CCCceEEe
Q psy1807         507 MSPQFISF  514 (618)
Q Consensus       507 ~~P~C~~~  514 (618)
                      ++|+|...
T Consensus       233 ~~P~C~~~  240 (317)
T PHA02954        233 VLPICVRS  240 (317)
T ss_pred             CCCceeCC
Confidence            99999986


No 4  
>PHA02639 EEV host range protein; Provisional
Probab=100.00  E-value=5.2e-33  Score=277.76  Aligned_cols=181  Identities=27%  Similarity=0.592  Sum_probs=162.4

Q ss_pred             cccccccCCCCCccccEEEe--cccccCceeeecCCCCcEEcCCceEEEccC---CeecCCCCceeeecCCCCCccCCCc
Q psy1807         332 QCLFDWCAEPPQISGGIVTT--SGRRTGSVATYSCEPGFILFGSNELKCGLG---GDWSGKAPTCKYVDCGSPANIDNGH  406 (618)
Q Consensus       332 ~C~~~~C~~p~~~~~g~~~~--~~~~~g~~~~~~C~~Gy~l~G~~~~~C~~~---G~Ws~~~p~C~~~~C~~p~~~~~g~  406 (618)
                      .++.+.|+.|+.+.||.+..  ..|.+|++++|+|++||.|.|...++|+++   |+|++..|.|+.+.|+.|+.+.||.
T Consensus        16 ~~k~i~C~~P~~i~NG~v~~~~~~y~~G~~V~Y~Cn~GY~L~G~~~~~C~~d~~nG~WS~~~P~C~~~~C~~Pp~~~nG~   95 (295)
T PHA02639         16 GVKSIYCDKPDDISNGFITELMEKYEIGKLIEYTCNTDYALIGDRFRTCIKDKNNAIWSNKAPFCMLKECNDPPSIINGK   95 (295)
T ss_pred             CccccCCcCCCCCCCCEEeccCCcccCCCEEEEEeCCCCEECCCCeEEEeCCCCCCEECCCCCEEeeccCcCCCCCCCcE
Confidence            35567899999999998875  358899999999999999999999999864   6999999999999999999999999


Q ss_pred             eEeecCcccccceEEEEcCC----CceeccCCcceEEeccCCeeeCCCCceeeecCCCCCcCCCcEEEc----ccCCCCc
Q psy1807         407 YSLLNGTTTHGSVAEYTCDD----DYWLEAGAMRRQVCMRDGKWSGDTPSCELITCPEPEVPKGGYVVG----YDLNVHS  478 (618)
Q Consensus       407 ~~~~~~~~~~g~~~~~~C~~----Gy~l~~~G~~~~~C~~~G~Ws~~~p~C~~~~C~~p~~~~~g~~~~----~~~~~g~  478 (618)
                      +......|.+|++++|+|++    ||.|.  |...++|+.+|+|+++.|+|+.+.|+.|.++ +|.+..    ..|.+|+
T Consensus        96 i~~~~~~y~~G~~V~y~C~~g~~~gY~L~--G~~~~~C~~dG~WS~~~P~C~~i~C~~P~i~-nG~v~~~~~~~~~~yg~  172 (295)
T PHA02639         96 IYNKREMYKVGDEIYYVCNEHKGVQYSLV--GNEKITCIQDKSWKPDPPICKMINCRFPALQ-NGYINGIPSNKKFYYKT  172 (295)
T ss_pred             EecCCCceECCCEEEEEeCCCCCCceEEC--CCCeEEECCCCeECCCCCeeeeEEeCCCCCC-CCceEcccCCCceecCC
Confidence            88766779999999999999    59999  9999999999999999999999999988654 555543    3578999


Q ss_pred             EEEEEcCCCCeeeCCceeEeCcCCcccCCCCceEEec
Q psy1807         479 TLEFHCEPGHLLVGQASLKCTRQGDWDNMSPQFISFY  515 (618)
Q Consensus       479 ~v~~~C~~Gy~l~G~~~~~C~~~G~Ws~~~P~C~~~~  515 (618)
                      +|+|+|++||.|.|+..++|+.+|+|++++|+|+...
T Consensus       173 ~V~fsC~~GY~L~Gs~~~tC~~nG~Ws~~~P~C~~~~  209 (295)
T PHA02639        173 RVGFSCKSGFDLVGEKYSTCNINATWFPSIPTCVRNK  209 (295)
T ss_pred             EEEEEcCCCCeEcCCCcEEECCCCeECCCCCeEeccC
Confidence            9999999999999999999999999999999999753


No 5  
>PHA02954 EEV membrane glycoprotein; Provisional
Probab=100.00  E-value=1.4e-32  Score=275.64  Aligned_cols=216  Identities=21%  Similarity=0.490  Sum_probs=181.7

Q ss_pred             CCCCCCCcccceeeecc-cccccCCEEEEEcCCCceecCCceeEecCCcccCCCCceeeecCCCCCccCCeeEEecCCCc
Q psy1807         218 ANCGSPDRHLNTTFVGT-VNTKLGSTISYACPEGNMLVGSATRTCKEGFWTGVAPTCQYVDCGKLEHIEHGTVTLETTRT  296 (618)
Q Consensus       218 ~~C~~p~~~~n~~~~~~-~~~~~G~~~~y~C~~Gy~l~g~~~~~C~~g~Ws~~~p~C~~~~C~~~~~~~ng~~~~~~~~~  296 (618)
                      ..|+.|.. .|+.+.+. ..|.+|++++|+|++||.+.+. .++|+++.|+...| |. +.|..|..+.++..   ...|
T Consensus        19 ~~C~~P~~-~ng~~~~~~~~Y~~gd~V~y~C~~GY~l~~~-~itC~~~~WS~~~p-C~-k~C~~p~~~~~~~~---~~~y   91 (317)
T PHA02954         19 STCTVPTM-NNAKLTSTETSFNDKQKVTFTCDSGYYSLDP-NAVCETDKWKYENP-CK-KMCTVSDYVSELYD---KPLY   91 (317)
T ss_pred             CcCCCCCC-CcCEEcCCcccccCCCEEEEEcCCCcccCCC-CeEECCCcccCCCC-CC-CcCCCCcccccccc---CCCc
Confidence            67988765 68876654 5799999999999999998875 68999888998777 97 57998766665533   2358


Q ss_pred             ccCCEEEEEeCCCceecCCCeEEeCC---CCCcCCCCCcccccccCCCCCccccEEEec--ccccCceeeecCCCCcEEc
Q psy1807         297 THGAVAIYACHENYTLIGETRRVCGD---GGKWNGTEPQCLFDWCAEPPQISGGIVTTS--GRRTGSVATYSCEPGFILF  371 (618)
Q Consensus       297 ~~g~~~~~~C~~Gy~l~G~~~~~C~~---~G~Ws~~~p~C~~~~C~~p~~~~~g~~~~~--~~~~g~~~~~~C~~Gy~l~  371 (618)
                      .+|++++|+|++||.     .++|..   ++.|+. .|+|+.+.|+ ++.+.||.+...  .|.+|+.|+|+|++||.|.
T Consensus        92 ~~G~~V~ysC~~Gy~-----~~~C~~~g~~~~WS~-~ptC~~i~C~-pP~i~NG~~~~~~~~Y~yGd~VtysC~~GY~L~  164 (317)
T PHA02954         92 EVNSTITLICKDETK-----YFRCEEKNGNTSWND-TVTCPNAECQ-PLQLEHGSCQPVKEKYSFGEHITINCDVGYEVI  164 (317)
T ss_pred             cCCCEEEEEECCCCc-----EEEeCCCCCCCccCC-CCeecceECC-CCCCCCCeeccCCCceeCCCEEEEEcCCCCEEC
Confidence            899999999999995     478975   468997 5799999997 456789987653  4899999999999999999


Q ss_pred             CCceEEEccCCeecCCCCceeeecCCCCCccCCCceEeecCcccccceEEEEcCCCceeccCCcceEEeccCCeeeCCCC
Q psy1807         372 GSNELKCGLGGDWSGKAPTCKYVDCGSPANIDNGHYSLLNGTTTHGSVAEYTCDDDYWLEAGAMRRQVCMRDGKWSGDTP  451 (618)
Q Consensus       372 G~~~~~C~~~G~Ws~~~p~C~~~~C~~p~~~~~g~~~~~~~~~~~g~~~~~~C~~Gy~l~~~G~~~~~C~~~G~Ws~~~p  451 (618)
                      |+..++|+++ .|+ ..|.|+. .|+.| .+.||.+..  ..|.+|++|+|+|++||.|.  |++.++|. +|+|+++.|
T Consensus       165 Gs~~i~C~~~-~Ws-~~P~C~~-~C~~P-~i~NG~~sg--~~y~~Gd~Vtf~C~~Gy~L~--Gs~~itC~-~g~Ws~~~P  235 (317)
T PHA02954        165 GASYISCTAN-SWN-VIPSCQQ-KCDIP-SLSNGLISG--STFSIGGVIHLSCKSGFTLT--GSPSSTCI-DGKWNPVLP  235 (317)
T ss_pred             cCCeeEECCC-cCC-CCCcccc-ccCCC-CCCCCeEcC--CCcccCCEEEEECCCCCeEC--CCcceEEC-CCeEeCCCC
Confidence            9999999987 698 6899985 69865 588998753  45889999999999999999  99999997 999999999


Q ss_pred             ceeee
Q psy1807         452 SCELI  456 (618)
Q Consensus       452 ~C~~~  456 (618)
                      .|...
T Consensus       236 ~C~~~  240 (317)
T PHA02954        236 ICVRS  240 (317)
T ss_pred             ceeCC
Confidence            99875


No 6  
>PHA02639 EEV host range protein; Provisional
Probab=99.97  E-value=1.1e-30  Score=261.14  Aligned_cols=180  Identities=24%  Similarity=0.578  Sum_probs=162.6

Q ss_pred             eeeecCCCCCccCCeeEEecCCCcccCCEEEEEeCCCceecCCCeEEeCCC---CCcCCCCCcccccccCCCCCccccEE
Q psy1807         273 CQYVDCGKLEHIEHGTVTLETTRTTHGAVAIYACHENYTLIGETRRVCGDG---GKWNGTEPQCLFDWCAEPPQISGGIV  349 (618)
Q Consensus       273 C~~~~C~~~~~~~ng~~~~~~~~~~~g~~~~~~C~~Gy~l~G~~~~~C~~~---G~Ws~~~p~C~~~~C~~p~~~~~g~~  349 (618)
                      ++.+.|+.|+.+.||.+......|.+|+++.|+|++||+|.|...++|+.+   |+|++..|.|+.+.|+.|+.+.||.+
T Consensus        17 ~k~i~C~~P~~i~NG~v~~~~~~y~~G~~V~Y~Cn~GY~L~G~~~~~C~~d~~nG~WS~~~P~C~~~~C~~Pp~~~nG~i   96 (295)
T PHA02639         17 VKSIYCDKPDDISNGFITELMEKYEIGKLIEYTCNTDYALIGDRFRTCIKDKNNAIWSNKAPFCMLKECNDPPSIINGKI   96 (295)
T ss_pred             ccccCCcCCCCCCCCEEeccCCcccCCCEEEEEeCCCCEECCCCeEEEeCCCCCCEECCCCCEEeeccCcCCCCCCCcEE
Confidence            566789999999999988666679999999999999999999999999864   69999999999999999999999988


Q ss_pred             Eec--ccccCceeeecCCC----CcEEcCCceEEEccCCeecCCCCceeeecCCCCCccCCCceEe--ecCcccccceEE
Q psy1807         350 TTS--GRRTGSVATYSCEP----GFILFGSNELKCGLGGDWSGKAPTCKYVDCGSPANIDNGHYSL--LNGTTTHGSVAE  421 (618)
Q Consensus       350 ~~~--~~~~g~~~~~~C~~----Gy~l~G~~~~~C~~~G~Ws~~~p~C~~~~C~~p~~~~~g~~~~--~~~~~~~g~~~~  421 (618)
                      ...  .|.+|++++|+|++    ||.|.|...++|+.+|+|++..|.|+.+.|+.|. +.||.+..  ....+.+|++|+
T Consensus        97 ~~~~~~y~~G~~V~y~C~~g~~~gY~L~G~~~~~C~~dG~WS~~~P~C~~i~C~~P~-i~nG~v~~~~~~~~~~yg~~V~  175 (295)
T PHA02639         97 YNKREMYKVGDEIYYVCNEHKGVQYSLVGNEKITCIQDKSWKPDPPICKMINCRFPA-LQNGYINGIPSNKKFYYKTRVG  175 (295)
T ss_pred             ecCCCceECCCEEEEEeCCCCCCceEECCCCeEEECCCCeECCCCCeeeeEEeCCCC-CCCCceEcccCCCceecCCEEE
Confidence            764  47889999999999    5999999999999999999999999999999774 77887653  234678999999


Q ss_pred             EEcCCCceeccCCcceEEeccCCeeeCCCCceee
Q psy1807         422 YTCDDDYWLEAGAMRRQVCMRDGKWSGDTPSCEL  455 (618)
Q Consensus       422 ~~C~~Gy~l~~~G~~~~~C~~~G~Ws~~~p~C~~  455 (618)
                      |+|++||.|.  |+..++|+.+|+|++..|+|..
T Consensus       176 fsC~~GY~L~--Gs~~~tC~~nG~Ws~~~P~C~~  207 (295)
T PHA02639        176 FSCKSGFDLV--GEKYSTCNINATWFPSIPTCVR  207 (295)
T ss_pred             EEcCCCCeEc--CCCcEEECCCCeECCCCCeEec
Confidence            9999999999  9999999999999999999986


No 7  
>PHA02831 EEV host range protein; Provisional
Probab=99.97  E-value=2.4e-30  Score=252.64  Aligned_cols=161  Identities=21%  Similarity=0.477  Sum_probs=142.6

Q ss_pred             cccccCceeeecCCCCcEEcCCceEEEccCCeecCCCCcee-eecCCCCCccCCCceEeecCcccccceEEEEcC----C
Q psy1807         352 SGRRTGSVATYSCEPGFILFGSNELKCGLGGDWSGKAPTCK-YVDCGSPANIDNGHYSLLNGTTTHGSVAEYTCD----D  426 (618)
Q Consensus       352 ~~~~~g~~~~~~C~~Gy~l~G~~~~~C~~~G~Ws~~~p~C~-~~~C~~p~~~~~g~~~~~~~~~~~g~~~~~~C~----~  426 (618)
                      ..|.+|++|+|+|++||.+   ..++|. +|+|++ .|.|. .+.|+.|+.+.||.+......|.+|++|+|+|+    +
T Consensus        38 ~~Y~~Gs~V~Y~C~~GY~~---~~itC~-dG~WS~-~P~C~~~~~C~~Pp~i~NG~i~~~~~~Y~~Gd~VtYsC~~g~~~  112 (268)
T PHA02831         38 KVYEENENLEYKCNNNFDK---VFVTCN-NGSWST-KNMCIGKRNCKDPVTILNGYIKNKKDQYSFGDSVTYACKVNKLE  112 (268)
T ss_pred             cccCCCCEEEEECCCCCEe---eeEEeC-CCccCC-CCcccccccCcCCCCCCCCEEeccCCceeCCCEEEEECCCCCCC
Confidence            3588999999999999985   378895 999996 57785 468999999999999876677999999999999    6


Q ss_pred             CceeccCCcceEEeccCCeeeCCCCceeeecCCCCCcCCCcEEEc--ccCCCCcEEEEEcCCCCeeeCCceeEeCcCCcc
Q psy1807         427 DYWLEAGAMRRQVCMRDGKWSGDTPSCELITCPEPEVPKGGYVVG--YDLNVHSTLEFHCEPGHLLVGQASLKCTRQGDW  504 (618)
Q Consensus       427 Gy~l~~~G~~~~~C~~~G~Ws~~~p~C~~~~C~~p~~~~~g~~~~--~~~~~g~~v~~~C~~Gy~l~G~~~~~C~~~G~W  504 (618)
                      ||.|.  |.+.++|. +|+|+++.|.|+.+.|+.|. ..||.+..  ..|.+|++|+|+|++||.|.|+..++|+.+|+|
T Consensus       113 GY~Lv--G~s~~tC~-dG~WS~~~P~C~~i~C~~P~-i~NG~i~~~~~~y~~G~~Vty~C~~GY~L~Gss~~tC~~nG~W  188 (268)
T PHA02831        113 KYSIV--GNETVKCI-NKQWVPKYPVCKLIRCKYPA-LQNGFLNVFEKKFYYGDIVNFKCKKGFILLGSSVSTCDINSIW  188 (268)
T ss_pred             CeeEc--CCceeEeC-CCcCCCCCCeeeEeeCCCCC-CCCCccccCCCceecCCEEEEEcCCCCEECCCccEEECCCCeE
Confidence            99999  99999995 99999999999999999986 45776653  468899999999999999999999999999999


Q ss_pred             cCCCCceEEeccCCCcc
Q psy1807         505 DNMSPQFISFYFPPSET  521 (618)
Q Consensus       505 s~~~P~C~~~~c~p~~~  521 (618)
                      ++++|+|+...|+++..
T Consensus       189 sp~~P~C~~~~~~~~~~  205 (268)
T PHA02831        189 YPGIPKCVKDKVHNEIQ  205 (268)
T ss_pred             CCCCCCccCcccCCCCC
Confidence            99999999988886543


No 8  
>PHA02831 EEV host range protein; Provisional
Probab=99.95  E-value=6.5e-28  Score=235.62  Aligned_cols=160  Identities=23%  Similarity=0.517  Sum_probs=142.9

Q ss_pred             cccccCCEEEEEcCCCceecCCceeEecCCcccCCCCcee-eecCCCCCccCCeeEEecCCCcccCCEEEEEeC----CC
Q psy1807         235 VNTKLGSTISYACPEGNMLVGSATRTCKEGFWTGVAPTCQ-YVDCGKLEHIEHGTVTLETTRTTHGAVAIYACH----EN  309 (618)
Q Consensus       235 ~~~~~G~~~~y~C~~Gy~l~g~~~~~C~~g~Ws~~~p~C~-~~~C~~~~~~~ng~~~~~~~~~~~g~~~~~~C~----~G  309 (618)
                      ..|.+|++|+|+|++||.+   ..++|.+|.|++ .|.|. .+.|+.|+.+.||.+......|.+|+.++|+|+    +|
T Consensus        38 ~~Y~~Gs~V~Y~C~~GY~~---~~itC~dG~WS~-~P~C~~~~~C~~Pp~i~NG~i~~~~~~Y~~Gd~VtYsC~~g~~~G  113 (268)
T PHA02831         38 KVYEENENLEYKCNNNFDK---VFVTCNNGSWST-KNMCIGKRNCKDPVTILNGYIKNKKDQYSFGDSVTYACKVNKLEK  113 (268)
T ss_pred             cccCCCCEEEEECCCCCEe---eeEEeCCCccCC-CCcccccccCcCCCCCCCCEEeccCCceeCCCEEEEECCCCCCCC
Confidence            4799999999999999985   478999999996 47775 468999999999999876667999999999999    69


Q ss_pred             ceecCCCeEEeCCCCCcCCCCCcccccccCCCCCccccEEEe--cccccCceeeecCCCCcEEcCCceEEEccCCeecCC
Q psy1807         310 YTLIGETRRVCGDGGKWNGTEPQCLFDWCAEPPQISGGIVTT--SGRRTGSVATYSCEPGFILFGSNELKCGLGGDWSGK  387 (618)
Q Consensus       310 y~l~G~~~~~C~~~G~Ws~~~p~C~~~~C~~p~~~~~g~~~~--~~~~~g~~~~~~C~~Gy~l~G~~~~~C~~~G~Ws~~  387 (618)
                      |.|.|+..++|. +|+|+++.|.|+.+.|+.| .+.||.+..  ..+.+|++|+|+|++||.|.|+.+++|+.+|+|++.
T Consensus       114 Y~LvG~s~~tC~-dG~WS~~~P~C~~i~C~~P-~i~NG~i~~~~~~y~~G~~Vty~C~~GY~L~Gss~~tC~~nG~Wsp~  191 (268)
T PHA02831        114 YSIVGNETVKCI-NKQWVPKYPVCKLIRCKYP-ALQNGFLNVFEKKFYYGDIVNFKCKKGFILLGSSVSTCDINSIWYPG  191 (268)
T ss_pred             eeEcCCceeEeC-CCcCCCCCCeeeEeeCCCC-CCCCCccccCCCceecCCEEEEEcCCCCEECCCccEEECCCCeECCC
Confidence            999999999996 9999999999999999876 578888764  357899999999999999999999999999999999


Q ss_pred             CCceeeecCCCCC
Q psy1807         388 APTCKYVDCGSPA  400 (618)
Q Consensus       388 ~p~C~~~~C~~p~  400 (618)
                      .|.|+...|+.|.
T Consensus       192 ~P~C~~~~~~~~~  204 (268)
T PHA02831        192 IPKCVKDKVHNEI  204 (268)
T ss_pred             CCCccCcccCCCC
Confidence            9999988887654


No 9  
>cd03597 CLECT_attractin_like C-type lectin-like domain (CTLD) of the type found in human and mouse attractin (AtrN) and attractin-like protein (ALP). CLECT_attractin_like: C-type lectin-like domain (CTLD) of the type found in human and mouse attractin (AtrN) and attractin-like protein (ALP).  CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins.  Mouse AtrN (the product of the mahogany gene) has been shown to bind Agouti protein and to function in agouti-induced pigmentation and obesity.  Mutations in AtrN have also been shown to cause spongiform encephalopathy and hypomyelination in rats and hamsters.  The cytoplasmic region of mouse ALP has been shown to binds to melanocortin receptor (MCR4).  Signaling through MCR4 plays a role in appetite suppression.  Attractin may have therapeutic potential in the treatment of obesity.  Human attractin (hAtrN) has been shown to be expressed on activated T cells and released extracellularly.  The 
Probab=99.93  E-value=9e-26  Score=201.66  Aligned_cols=125  Identities=19%  Similarity=0.314  Sum_probs=105.7

Q ss_pred             ccceeecceEEEEecCCCCCHHHHHHHHHHcCCeecccCChHHHHHHHHHhhhcccCCCCccEEEeEEeCCCCCcceeEE
Q psy1807          84 AEITSFDRNCYEFNIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQKDPGITARTWKW  163 (618)
Q Consensus        84 ~~w~~~~~~CY~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~fl~~~l~~~~~~~~~~~~WIGl~~~~~~~~~~w~W  163 (618)
                      .||..|+++||+|+.. .++|.+|+.+|+++||+||+|++++|++||.+++...........+||||++..   .+.|+|
T Consensus         3 ~gw~~~~~~Cy~~~~~-~~tw~~A~~~C~~~g~~La~i~~~~E~~fi~~~~~~~~~~~~~~~~WIGl~d~~---~g~w~W   78 (129)
T cd03597           3 EGWHLVGNSCLKINTA-RESYDNAKLYCRNLNAVLASLTTQKKVEFVLKELQKHQMTKQKLTPWVGLRKIN---VSYWCW   78 (129)
T ss_pred             CcCEEcCCEEEEEEcC-CCCHHHHHHHHHHcCCEEcCCCCHHHHHHHHHHHHhhcccCCCCceEEeeecCC---CCceEE
Confidence            9999999999999765 699999999999999999999999999999998876432112246899999864   578999


Q ss_pred             eCCccc--ccCCCCCCCCCCCCCccceEEecCCCCceeeccccccccccccccC
Q psy1807         164 VDGEVV--TKPSWGKDQPNNYNGEQNCVVLDGGRGWLWNDVGCKLDYLHWICQH  215 (618)
Q Consensus       164 ~DGs~~--~~~~W~~~~P~~~~~~~~Cv~l~~~~~~~W~d~~C~~~~~~~iC~~  215 (618)
                      +||+++  .|++|.++||++.   ++||.|..+...+|+|.+|.....+||||+
T Consensus        79 ~Dgs~~~~~~~~W~~geP~~~---~~C~~~~~~~~~~w~d~~C~~~~~~~iCe~  129 (129)
T cd03597          79 EDMSPFTNTTLQWLPGEPSDA---GFCGYLEEPAVSGLKANPCTNPVNGSVCER  129 (129)
T ss_pred             CCCCCCCCccccCCCCCCCCc---ccEEEEcccccCccccCCcCCCCcceeecC
Confidence            999988  4789999999973   689999766666899999998435899985


No 10 
>cd03588 CLECT_CSPGs C-type lectin-like domain (CTLD) of the type found in chondroitin sulfate proteoglycan core proteins. CLECT_CSPGs: C-type lectin-like domain (CTLD) of the type found in chondroitin sulfate proteoglycan core proteins (CSPGs) in human and chicken aggrecan, frog brevican, and zebra fish dermacan.  CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins.  In cartilage, aggrecan forms cartilage link protein stabilized aggregates with hyaluronan (HA).  These aggregates contribute to the tissue's load bearing properties.  Aggregates having other CSPGs substituting for aggrecan may contribute to the structural integrity of many different tissues.  Xenopus brevican is expressed in the notochord and the brain during early embryogenesis.  Zebra fish dermacan is expressed in dermal bones and may play a role in dermal bone development.  CSPGs do contain LINK domain(s) which bind HA.  These LINK domains are considered by one classif
Probab=99.93  E-value=1.5e-25  Score=199.20  Aligned_cols=120  Identities=25%  Similarity=0.562  Sum_probs=104.9

Q ss_pred             ccceeecceEEEEecCCCCCHHHHHHHHHHcCCeecccCChHHHHHHHHHhhhcccCCCCccEEEeEEeCCCCCcceeEE
Q psy1807          84 AEITSFDRNCYEFNIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQKDPGITARTWKW  163 (618)
Q Consensus        84 ~~w~~~~~~CY~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~fl~~~l~~~~~~~~~~~~WIGl~~~~~~~~~~w~W  163 (618)
                      +||+.|+++||+|+.. .++|.+|++.|+.+||+||+|+|++|++||..++.        ..+||||++..  .++.|+|
T Consensus         3 ~gw~~~~~~Cy~~~~~-~~sw~~A~~~C~~~gg~La~i~s~~e~~fl~~~~~--------~~~WIGl~~~~--~~~~~~W   71 (124)
T cd03588           3 EGWDKFQGHCYRHFPD-RETWEDAERRCREQQGHLSSIVTPEEQEFVNNNAQ--------DYQWIGLNDRT--IEGDFRW   71 (124)
T ss_pred             ccceecCCEEEEEECC-ccCHHHHHHHHHhcCCEEeccCCHHHHHHHHHhcc--------CcEEecceecC--CCCceEe
Confidence            9999999999999765 69999999999999999999999999999988653        24899999765  3789999


Q ss_pred             eCCcccccCCCCCCCCCCC-CCccceEEecCCCCceeeccccccccccccccC
Q psy1807         164 VDGEVVTKPSWGKDQPNNY-NGEQNCVVLDGGRGWLWNDVGCKLDYLHWICQH  215 (618)
Q Consensus       164 ~DGs~~~~~~W~~~~P~~~-~~~~~Cv~l~~~~~~~W~d~~C~~~~~~~iC~~  215 (618)
                      +||+++.|.+|.+++|++. .+.++||.|......+|+|..|.. ..+||||+
T Consensus        72 ~dg~~~~~~~W~~~~p~~~~~~~~~Cv~~~~~~~~~W~d~~C~~-~~~fICe~  123 (124)
T cd03588          72 SDGHPLQFENWRPNQPDNFFATGEDCVVMIWHEEGEWNDVPCNY-HLPFTCKK  123 (124)
T ss_pred             CCCCcccccCcCCCCCCCCCCCCCCeEEEecCCCCeEcCCCCCC-CCeeeeeC
Confidence            9999999999999999875 345799999644445999999997 67999986


No 11 
>cd03589 CLECT_CEL-1_like C-type lectin-like domain (CTLD) of the type found in CEL-1 from Cucumaria echinata and Echinoidin from Anthocidaris crassispina. CLECT_CEL-1_like: C-type lectin-like domain (CTLD) of the type found in CEL-1 from Cucumaria echinata and Echinoidin from Anthocidaris crassispina.  CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins.  The CEL-1 CTLD binds three calcium ions and has a high specificity for N-acteylgalactosamine (GalNAc).  CEL-1 exhibits strong cytotoxicity which is inhibited by GalNAc.  This protein may play a role as a toxin defending against predation.  Echinoidin is found in the coelomic fluid of the sea urchin and is specific for GalBeta1-3GalNAc.  Echinoidin has a cell adhesive activity towards human cancer cells which is not mediated through the CTLD.  Both CEL-1 and Echinoidin are multimeric proteins comprised of multiple dimers linked by disulfide bonds.
Probab=99.93  E-value=2.2e-25  Score=202.03  Aligned_cols=128  Identities=27%  Similarity=0.574  Sum_probs=110.4

Q ss_pred             ccceeecceEEEEecCCCCCHHHHHHHHHHc-----CCeecccCChHHHHHHHHHhhhcccCCCCccEEEeEEeCCCCCc
Q psy1807          84 AEITSFDRNCYEFNIDKGASFADARSKCQAS-----GGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQKDPGITA  158 (618)
Q Consensus        84 ~~w~~~~~~CY~~~~~~~~tw~~A~~~C~~~-----g~~L~~i~s~~e~~fl~~~l~~~~~~~~~~~~WIGl~~~~~~~~  158 (618)
                      .||+.|+++||+|+.. .++|.+|+++|+++     +|+||+|+|++|++||..++...........|||||++..  .+
T Consensus         3 ~gw~~~~~~Cy~~~~~-~~~w~~A~~~C~~~~~~g~~~~La~i~s~~e~~~l~~~~~~~~~~~~~~~~WiGl~~~~--~~   79 (137)
T cd03589           3 TFWTAFGGYCYRFFGD-RLTWEEAELRCRSFSIPGLIAHLVSIHSQEENDFVYDLFESSRGPDTPYGLWIGLHDRT--SE   79 (137)
T ss_pred             cchhhhCCEEEEEeCC-CcCHHHHHHHHHhhcCCCCCceEcccCCHHHHHHHHHHHhhccccCCCCcEEEeeecCC--cc
Confidence            9999999999999765 69999999999997     6999999999999999999876421222357999999765  38


Q ss_pred             ceeEEeCCcccccCCCCCCCCCCCCCccceEEecCCC--CceeeccccccccccccccC
Q psy1807         159 RTWKWVDGEVVTKPSWGKDQPNNYNGEQNCVVLDGGR--GWLWNDVGCKLDYLHWICQH  215 (618)
Q Consensus       159 ~~w~W~DGs~~~~~~W~~~~P~~~~~~~~Cv~l~~~~--~~~W~d~~C~~~~~~~iC~~  215 (618)
                      +.|+|+||+++.|++|.+++|++..+.++|+.|....  ...|++..|+. ..+||||.
T Consensus        80 ~~~~W~dG~~~~~~~W~~~~P~~~~~~~~C~~~~~~~~~~~~W~d~~C~~-~~~fIC~~  137 (137)
T cd03589          80 GPFEWTDGSPVDFTKWAGGQPDNYGGNEDCVQMWRRGDAGQSWNDMPCDA-VFPYICKM  137 (137)
T ss_pred             CceEeCCcCcCCcCCcCCCCCCCCCCCCCceeeecCCCCCCeecCCCCCC-CcceeeeC
Confidence            8999999999999999999999877678999997543  55899999997 68999984


No 12 
>cd03599 CLECT_DGCR2_like C-type lectin-like domain (CTLD) of the type found in DGCR2, an integral membrane protein deleted in DiGeorge Syndrome (DGS). CLECT_DGCR2_like: C-type lectin-like domain (CTLD) of the type found in DGCR2, an integral membrane protein deleted in DiGeorge Syndrome (DGS).  CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins.  DGS is also known velo-cardio-facial syndrome (VCFS).  DGS is a genetic abnormality that results in malformations of the heart, face, and limbs and is associated with schizophrenia and depressive disorders.  DGCR2 is a candidate for involvement in the pathogenesis of DGS since the DGCR2 gene lies within the minimal DGS critical region (MDGRC) of 22q11, which when deleted gives rise to DGS, and the DGCR2 gene is in close proximity to the balanced translocation breakpoint in a DGS patient having a balanced translocation.
Probab=99.92  E-value=1.7e-24  Score=196.43  Aligned_cols=128  Identities=18%  Similarity=0.350  Sum_probs=103.0

Q ss_pred             ccceeecc--eEEEEecCCCCCHHHHHHHHHHcCCeecccCChHHHHHHHHHhhhcccC---C-CCccEEEeEEeC----
Q psy1807          84 AEITSFDR--NCYEFNIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDR---L-KTLLVWIGAQKD----  153 (618)
Q Consensus        84 ~~w~~~~~--~CY~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~fl~~~l~~~~~~---~-~~~~~WIGl~~~----  153 (618)
                      .||..|++  +||+++.. .++|.+|+++|+++||+||+|++.+|++||..++......   . ....+||||++.    
T Consensus         3 ~gw~~~~~~~~CYk~~~~-~~tw~dA~~~C~~~Gg~Lasi~s~~e~~fl~~l~~~~~~~~~~~~~~~~~WIGL~~~~~~~   81 (153)
T cd03599           3 SGWHHYEGTASCYKVYLS-GENYWDAVQTCQKVNGSLATFTTDQELQFILAQEWDFDERVFGRKDQCKFWVGYQYVITNR   81 (153)
T ss_pred             CchhccCCCCeEEEEeCC-cCCHHHHHHHHHHcCCEEcCCCCHHHHHHHHHHHHhhcccccccccCCCEEEeeccccccc
Confidence            99999998  89999765 6999999999999999999999999999999998654321   0 114599999522    


Q ss_pred             CCCCcceeEEe-CCcccccCC----CCCCCCCCCCCccceEEecC--------CCCceeeccccccccccccccC
Q psy1807         154 PGITARTWKWV-DGEVVTKPS----WGKDQPNNYNGEQNCVVLDG--------GRGWLWNDVGCKLDYLHWICQH  215 (618)
Q Consensus       154 ~~~~~~~w~W~-DGs~~~~~~----W~~~~P~~~~~~~~Cv~l~~--------~~~~~W~d~~C~~~~~~~iC~~  215 (618)
                      +...++.|+|+ ||+++.|.+    |.+++|++  ++++|+.|..        ....+|+|..|+. ...||||+
T Consensus        82 ~~~~eg~w~WsddGs~~~y~~w~~~w~~gePn~--~~e~C~~~~~~~~~~~~~~~~~~W~d~~C~~-~~~fiCq~  153 (153)
T cd03599          82 NHSLEGRWEVAYKGSMEVFLPPEPIFATGMSTN--DNVFCAQLQCFQIPSLRERGLHSWHAENCYE-KSSFLCKR  153 (153)
T ss_pred             CcccCCeEEEecCCccceecCccccCCCCCCCC--CCCCCeEEEeeccccccccccCeeeCccCCC-CCcceeCC
Confidence            22358899999 599998664    57889986  4579999962        2334899999998 67999985


No 13 
>cd03590 CLECT_DC-SIGN_like C-type lectin-like domain (CTLD) of the type found in human dendritic cell (DC)-specific intercellular adhesion molecule 3-grabbing non-integrin (DC-SIGN) and the related receptor, DC-SIGN receptor (DC-SIGNR). CLECT_DC-SIGN_like: C-type lectin-like domain (CTLD) of the type found in human dendritic cell (DC)-specific intercellular adhesion molecule 3-grabbing non-integrin (DC-SIGN) and the related receptor, DC-SIGN receptor (DC-SIGNR).  This group also contains proteins similar to hepatic asialoglycoprotein receptor (ASGP-R) and langerin in human.  These proteins are type II membrane proteins with a CTLD ectodomain.  CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins.  DC-SIGN is thought to mediate the initial contact between dendritic cells and resting T cells, and may also mediate the rolling of DCs on epithelium.  DC-SIGN and DC-SIGNR bind to oligosaccharides present on human tissues, as well as, on path
Probab=99.91  E-value=6.3e-24  Score=189.49  Aligned_cols=121  Identities=33%  Similarity=0.688  Sum_probs=106.8

Q ss_pred             ccceeecceEEEEecCCCCCHHHHHHHHHHcCCeecccCChHHHHHHHHHhhhcccCCCCccEEEeEEeCCCCCcceeEE
Q psy1807          84 AEITSFDRNCYEFNIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQKDPGITARTWKW  163 (618)
Q Consensus        84 ~~w~~~~~~CY~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~fl~~~l~~~~~~~~~~~~WIGl~~~~~~~~~~w~W  163 (618)
                      .||+.|+++||+|+.. .++|.+|+..|+.+||+||+|+|++|++||.+++..      ...|||||++..  .++.|+|
T Consensus         3 ~~w~~~~~~Cy~~~~~-~~tw~~A~~~C~~~g~~La~i~s~~e~~~l~~~~~~------~~~~WiGl~~~~--~~~~~~W   73 (126)
T cd03590           3 TNWKSFQSSCYFFSTE-KKSWEESRQFCEDMGAHLVIINSQEEQEFISKILSG------NRSYWIGLSDEE--TEGEWKW   73 (126)
T ss_pred             CCceEeCCEEEEEeCC-CcCHHHHHHHHHhCCCEEEeeCCHHHHHHHHHHhCC------CCCEEEeeecCC--CcCCeEe
Confidence            9999999999999765 699999999999999999999999999999998862      245999999865  3789999


Q ss_pred             eCCccc--ccCCCCCCCCCCCC-CccceEEecCCCCceeeccccccccccccccC
Q psy1807         164 VDGEVV--TKPSWGKDQPNNYN-GEQNCVVLDGGRGWLWNDVGCKLDYLHWICQH  215 (618)
Q Consensus       164 ~DGs~~--~~~~W~~~~P~~~~-~~~~Cv~l~~~~~~~W~d~~C~~~~~~~iC~~  215 (618)
                      +||+++  .|.+|.+++|++.. +.++||.|.... +.|.+..|.. ...||||+
T Consensus        74 ~dg~~~~~~~~~W~~~~p~~~~~~~~~C~~~~~~~-~~w~~~~C~~-~~~fiCek  126 (126)
T cd03590          74 VDGTPLNSSKTFWHPGEPNNWGGGGEDCAELVYDS-GGWNDVPCNL-EYRWICEK  126 (126)
T ss_pred             cCCCCCCCccCCcCCCcCCCCCCCCCCCEEEECCC-CcEeCcCCCC-CEeeeeeC
Confidence            999998  89999999999864 467999997654 5899999998 67999984


No 14 
>cd03594 CLECT_REG-1_like C-type lectin-like domain (CTLD) of the type found in Human REG-1 (lithostathine), REG-4, and avian eggshell-specific proteins: ansocalcin, structhiocalcin-1(SCA-1), and -2(SCA-2). CLECT_REG-1_like: C-type lectin-like domain (CTLD) of the type found in Human REG-1 (lithostathine), REG-4, and avian eggshell-specific proteins: ansocalcin, structhiocalcin-1(SCA-1), and -2(SCA-2).  CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins.  REG-1 is a proliferating factor which participates in various kinds of tissue regeneration including pancreatic beta-cell regeneration, regeneration of intestinal mucosa, regeneration of motor neurons, and perhaps in tissue regeneration of damaged heart.  REG-1 may play a role on the pathophysiology of Alzheimer's disease and in the development of gastric cancers.  Its expression is correlated with reduced survival from early-stage colorectal cancer.  REG-1 also binds and aggregates 
Probab=99.90  E-value=1.3e-23  Score=188.27  Aligned_cols=124  Identities=25%  Similarity=0.523  Sum_probs=105.4

Q ss_pred             ccceeecceEEEEecCCCCCHHHHHHHHHHc--CCeecccCChHHHHHHHHHhhhcccCCCCccEEEeEEeCCCCCccee
Q psy1807          84 AEITSFDRNCYEFNIDKGASFADARSKCQAS--GGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQKDPGITARTW  161 (618)
Q Consensus        84 ~~w~~~~~~CY~~~~~~~~tw~~A~~~C~~~--g~~L~~i~s~~e~~fl~~~l~~~~~~~~~~~~WIGl~~~~~~~~~~w  161 (618)
                      .||+.|+++||+|+.. .++|.+|++.|+.+  ||+||+|++++|++||..++.....  ....|||||++...  ++.|
T Consensus         3 ~gw~~~~~~Cy~~~~~-~~tw~~A~~~C~~~~~g~~La~i~s~~e~~~l~~~~~~~~~--~~~~~WiGl~~~~~--~~~~   77 (129)
T cd03594           3 KGWLPYKGNCYGYFRQ-PLSWSDAELFCQKYGPGAHLASIHSPAEAAAIASLISSYQK--AYQPVWIGLHDPQQ--SRGW   77 (129)
T ss_pred             CCCEecCCEeeeEecc-CcCHHHHHHHHHhcCCCceEcccCCHHHHHHHHHHHHhhcc--CCccEEEeeccCCC--CCce
Confidence            9999999999999755 69999999999999  5999999999999999998876431  12469999998653  6799


Q ss_pred             EEeCCcccccCCCCCCCCCCCCCccceEEecCCCC-ceeeccccccccccccccC
Q psy1807         162 KWVDGEVVTKPSWGKDQPNNYNGEQNCVVLDGGRG-WLWNDVGCKLDYLHWICQH  215 (618)
Q Consensus       162 ~W~DGs~~~~~~W~~~~P~~~~~~~~Cv~l~~~~~-~~W~d~~C~~~~~~~iC~~  215 (618)
                      +|+||+++.|.+|++++|.  ...++||.|....+ ..|++..|.. ..+||||+
T Consensus        78 ~W~dg~~~~~~~W~~~~p~--~~~~~Cv~~~~~~~~~~W~~~~C~~-~~~fICe~  129 (129)
T cd03594          78 EWSDGSKLDYRSWDRNPPY--ARGGYCAELSRSTGFLKWNDANCEE-RNPFICKY  129 (129)
T ss_pred             EeCCCCcceecccCCCCCC--CCCCCceEEEecCCCCeEECCCCCC-CceeeeeC
Confidence            9999999999999999993  33579999975443 3799999998 67999985


No 15 
>cd03596 CLECT_tetranectin_like C-type lectin-like domain (CTLD) of the type found in the tetranectin (TN), cartilage derived C-type lectin (CLECSF1), and stem cell growth factor (SCGF). CLECT_tetranectin_like: C-type lectin-like domain (CTLD) of the type found in the tetranectin (TN), cartilage derived C-type lectin (CLECSF1), and stem cell growth factor (SCGF).  CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins.  TN binds to plasminogen and stimulates activation of plasminogen, playing a key role in the regulation of proteolytic processes.  The TN CTLD binds two calcium ions.  Its calcium free form binds to various kringle-like protein ligands.  Two residues involved in the coordination of calcium are critical for the binding of TN to the fourth kringle (K4) domain of plasminogen (Plg K4).  TN binds the kringle 1-4 form of angiostatin (AST K1-4).  AST K1-4 is a fragment of Plg, commonly found in cancer tissues.  TN inhibits the bin
Probab=99.90  E-value=2e-23  Score=187.01  Aligned_cols=124  Identities=22%  Similarity=0.445  Sum_probs=105.2

Q ss_pred             ccceeecceEEEEecCCCCCHHHHHHHHHHcCCeecccCChHHHHHHHHHhhhcccCCCCccEEEeEEeCCCCCcceeEE
Q psy1807          84 AEITSFDRNCYEFNIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQKDPGITARTWKW  163 (618)
Q Consensus        84 ~~w~~~~~~CY~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~fl~~~l~~~~~~~~~~~~WIGl~~~~~~~~~~w~W  163 (618)
                      .||.. +++||+|+.. .++|.+|++.|+++||+||+|+|.+|++||..++.....  ....|||||++..  .++.|+|
T Consensus         3 ~g~~~-~~~CY~~~~~-~~~w~~A~~~C~~~g~~La~i~s~~e~~~l~~~~~~~~~--~~~~~WiGl~~~~--~~~~w~w   76 (129)
T cd03596           3 KGTKI-HKKCYLVSEE-TKHYHEASEDCIARGGTLATPRDSDENDALRDYVKASVP--GNWEVWLGINDMV--AEGKWVD   76 (129)
T ss_pred             CCcEE-CCEEEEEecc-cCCHHHHHHHHHhcCCeEecCCCHHHHHHHHHHHHhccC--CCCcEEEeccccC--ccCeEEe
Confidence            88755 7899999755 699999999999999999999999999999998865432  2346999999865  3889999


Q ss_pred             eCCcccccCCCCC---CCCCCCCCccceEEecCCCCceeeccccccccccccccC
Q psy1807         164 VDGEVVTKPSWGK---DQPNNYNGEQNCVVLDGGRGWLWNDVGCKLDYLHWICQH  215 (618)
Q Consensus       164 ~DGs~~~~~~W~~---~~P~~~~~~~~Cv~l~~~~~~~W~d~~C~~~~~~~iC~~  215 (618)
                      +||+++.|.+|++   ++|++. ..++||.|.....++|+|..|.. .++||||+
T Consensus        77 ~dG~~~~~~~W~~~~~~~p~~~-~~~~Cv~l~~~~~~~W~d~~C~~-~~~fICe~  129 (129)
T cd03596          77 VNGSPISYFNWEREITAQPDGG-KRENCVALSSSAQGKWFDEDCRR-EKPYVCEF  129 (129)
T ss_pred             CCCCCccccccCCCCCCCCCCC-CCCCCEEEccCCCCcCcCccCCC-CCceeccC
Confidence            9999999999997   899863 45799999765455899999997 67999985


No 16 
>PHA02817 EEV Host range protein; Provisional
Probab=99.89  E-value=1.2e-22  Score=193.29  Aligned_cols=123  Identities=28%  Similarity=0.614  Sum_probs=110.8

Q ss_pred             CceeeecCCCCCccCCCceEeecCcccccceEEEEcCCC-----ceeccCCcceEEeccCCeeeCCCCceeeecCCCCCc
Q psy1807         389 PTCKYVDCGSPANIDNGHYSLLNGTTTHGSVAEYTCDDD-----YWLEAGAMRRQVCMRDGKWSGDTPSCELITCPEPEV  463 (618)
Q Consensus       389 p~C~~~~C~~p~~~~~g~~~~~~~~~~~g~~~~~~C~~G-----y~l~~~G~~~~~C~~~G~Ws~~~p~C~~~~C~~p~~  463 (618)
                      -.|+.+.|+.|+.+.||.+......|.+|++|+|+|++|     |.|.  |...++|+.+|+|+++.|.|+.+.|+.|.+
T Consensus        17 ~~C~~~~C~~PP~i~NG~i~~~~~~Y~~Gd~Vty~C~~G~~~~gY~L~--G~~~i~C~~dG~Ws~~~P~C~~v~C~~P~i   94 (225)
T PHA02817         17 SLCDLNKCCYPPSIKNGYIYNKKTEYNIGSNVTFFCGNNTRGVRYTLV--GEKNIICEKDGKWNKEFPVCKIIRCRFPAL   94 (225)
T ss_pred             CEecCcCCCCCCCCCCCEEecCCCceeCCCEEEEEeCCCCCCCCEEEC--CCCeEEECCCCcCCCCCCeeeeeECCCCCC
Confidence            458889999999999999876666799999999999997     8999  999999999999999999999999999874


Q ss_pred             CCCcEEEc----ccCCCCcEEEEEcCCCCeeeCCceeEeCcCCcccCCCCceEEe
Q psy1807         464 PKGGYVVG----YDLNVHSTLEFHCEPGHLLVGQASLKCTRQGDWDNMSPQFISF  514 (618)
Q Consensus       464 ~~~g~~~~----~~~~~g~~v~~~C~~Gy~l~G~~~~~C~~~G~Ws~~~P~C~~~  514 (618)
                       .||.+..    ..|.+|++|+|+|++||.|.|+..++|+++|+|+++.|+|...
T Consensus        95 -~NG~v~~~~~~~~y~yg~~Vty~C~~Gy~L~G~~~~tC~~~G~WSp~~P~C~~~  148 (225)
T PHA02817         95 -QNGFVNGIPDSKKFYYESEVSFSCKPGFVLIGTKYSVCGINSSWIPKVPICSRD  148 (225)
T ss_pred             -cCceeEccccCCceEcCCEEEEEcCCCCEEcCCCceEECCCCeECCCCCEeecC
Confidence             4676653    3688999999999999999999999999999999999999874


No 17 
>cd03593 CLECT_NK_receptors_like C-type lectin-like domain (CTLD) of the type found in natural killer cell receptors (NKRs). CLECT_NK_receptors_like: C-type lectin-like domain (CTLD) of the type found in natural killer cell receptors (NKRs), including proteins similar to oxidized low density lipoprotein (OxLDL) receptor (LOX-1), CD94, CD69, NKG2-A and -D, osteoclast inhibitory lectin (OCIL), dendritic cell-associated C-type lectin-1 (dectin-1),  human myeloid inhibitory C-type lectin-like receptor (MICL), mast cell-associated functional antigen (MAFA), killer cell lectin-like receptors: subfamily F, member 1 (KLRF1) and subfamily B, member 1 (KLRB1), and lys49 receptors.  CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins.  NKRs are variously associated with activation or inhibition of natural killer (NK) cells. Activating NKRs stimulate cytolysis by NK cells of virally infected or transformed cells; inhibitory NKRs block cytolysis up
Probab=99.88  E-value=1.1e-22  Score=178.61  Aligned_cols=114  Identities=19%  Similarity=0.519  Sum_probs=100.0

Q ss_pred             ccceeecceEEEEecCCCCCHHHHHHHHHHcCCeecccCChHHHHHHHHHhhhcccCCCCccEEEeEEeCCCCCcceeEE
Q psy1807          84 AEITSFDRNCYEFNIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQKDPGITARTWKW  163 (618)
Q Consensus        84 ~~w~~~~~~CY~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~fl~~~l~~~~~~~~~~~~WIGl~~~~~~~~~~w~W  163 (618)
                      .||+.|+++||+|+.. .++|.+|+..|+.+||+||+|+|++|++||..++..       ..+||||++..  .++.|+|
T Consensus         3 ~gw~~~~~~Cy~~~~~-~~~w~~A~~~C~~~g~~La~i~s~~e~~~l~~~~~~-------~~~WiGl~~~~--~~~~~~W   72 (116)
T cd03593           3 KDWICYGNKCYYFSME-KKTWNESKEACSSKNSSLLKIDDEEELEFLQSQIGS-------SSYWIGLSREK--SEKPWKW   72 (116)
T ss_pred             CcCEEeCCEEEEEEcC-CCCHHHHHHHHHhCCCcEEEECCHHHHHHHHHhcCC-------CceEEEEeecC--CCCCeEc
Confidence            9999999999999765 599999999999999999999999999999998721       35899999875  3789999


Q ss_pred             eCCcccccCCCCCCCCCCCCCccceEEecCCCCceeeccccccccccccccC
Q psy1807         164 VDGEVVTKPSWGKDQPNNYNGEQNCVVLDGGRGWLWNDVGCKLDYLHWICQH  215 (618)
Q Consensus       164 ~DGs~~~~~~W~~~~P~~~~~~~~Cv~l~~~~~~~W~d~~C~~~~~~~iC~~  215 (618)
                      +||+++.  +|...+|++  ..++|++|...   .|.+..|.. ..+||||+
T Consensus        73 ~dg~~~~--~~~~~~~~~--~~~~C~~~~~~---~w~~~~C~~-~~~~IC~k  116 (116)
T cd03593          73 IDGSPLN--NLFNIRGST--KSGNCAYLSST---GIYSEDCST-KKRWICEK  116 (116)
T ss_pred             cCCCccc--ccccccCCC--CCCCceEEcCC---cEEcccCCc-CceeeeeC
Confidence            9999987  788888765  46799999765   699999997 67999985


No 18 
>cd03598 CLECT_EMBP_like C-type lectin-like domain (CTLD) of the type found in the human proteins, eosinophil major basic protein (EMBP) and prepro major basic protein homolog (MBPH). CLECT_EMBP_like: C-type lectin-like domain (CTLD) of the type found in the human proteins, eosinophil major basic protein (EMBP) and prepro major basic protein homolog (MBPH).  CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins.  Eosinophils and basophils carry out various functions in allergic, parasitic, and inflammatory diseases.  EMBP is stored in eosinophil crystalloid granules and is released upon degranulation.  EMBP is also expressed in basophils.  The proform of EMBP is expressed in placental X cells and breast tissue and increases significantly during human pregnancy.  EMBP has cytotoxic properties and damages bacteria and mammalian cells, in vitro, as well as, helminth parasites.  EMBP deposition has been observed in the inflamed tissue of all
Probab=99.88  E-value=1.5e-22  Score=177.97  Aligned_cols=116  Identities=29%  Similarity=0.510  Sum_probs=98.4

Q ss_pred             ceEEEEecCCCCCHHHHHHHHHHc-CCeecccCChHHHHHHHHHhhhcccCCCCccEEEeEEeCCCCCcceeEEeCCccc
Q psy1807          91 RNCYEFNIDKGASFADARSKCQAS-GGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQKDPGITARTWKWVDGEVV  169 (618)
Q Consensus        91 ~~CY~~~~~~~~tw~~A~~~C~~~-g~~L~~i~s~~e~~fl~~~l~~~~~~~~~~~~WIGl~~~~~~~~~~w~W~DGs~~  169 (618)
                      ++||+|+.. .++|.+|+..|+++ ||+||+|+|++|++||.+++....    ...+||||++.....++.|.|+||+++
T Consensus         1 ~~Cy~~~~~-~~t~~~A~~~C~~~~g~~La~i~s~~e~~~l~~~~~~~~----~~~~WiGl~~~~~~~~~~~~W~dg~~~   75 (117)
T cd03598           1 GRCYRFVKS-PRTFRDAQVICRRCYRGNLASIHSFAFNYRVQRLVSTLN----QAQVWIGGIITGKGRCRRFSWVDGSVW   75 (117)
T ss_pred             CceEEEecC-CCCHHHHHHHhhcCCCceEeeecChhHhHHHHHHHhCCC----CCCEEEeeEcCCCCcCCeeEeCCCCcc
Confidence            589999765 69999999999995 999999999999999999885422    246999999862123789999999999


Q ss_pred             ccCCCCCCCCCCCCCccceEEecCCCCceeeccccccccccccccC
Q psy1807         170 TKPSWGKDQPNNYNGEQNCVVLDGGRGWLWNDVGCKLDYLHWICQH  215 (618)
Q Consensus       170 ~~~~W~~~~P~~~~~~~~Cv~l~~~~~~~W~d~~C~~~~~~~iC~~  215 (618)
                      .|.+|.+++|++  +.++|+.|....+ +|++..|.. .++|||++
T Consensus        76 ~y~~W~~g~p~~--~~~~Cv~~~~~~g-~W~~~~C~~-~~~fiC~~  117 (117)
T cd03598          76 NYAYWAPGQPGN--RRGHCVELCTRGG-HWRRAHCKL-RRPFICSY  117 (117)
T ss_pred             CcCCCCCCCCCC--CCCCcEEEeCCCC-eECCCcCCC-CceeeecC
Confidence            999999999998  4578999975443 899999997 78999984


No 19 
>cd03591 CLECT_collectin_like C-type lectin-like domain (CTLD) of the type found in human collectins including lung surfactant proteins A and D, mannose- or mannan binding lectin (MBL), and CL-L1 (collectin liver 1). CLECT_collectin_like: C-type lectin-like domain (CTLD) of the type found in human collectins including lung surfactant proteins A and D, mannose- or mannan binding lectin (MBL), and CL-L1 (collectin liver 1).  CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. The CTLDs of these collectins bind carbohydrates on surfaces (e.g. pathogens, allergens, necrotic, or apoptotic cells) and mediate functions associated with killing and phagocytosis.  MBPs recognize high mannose oligosaccharides in a calcium dependent manner, bind to a broad range of pathogens, and trigger cell killing by activating the complement pathway.  MBP also acts directly as an opsonin.  SP-A and SP-D in addition to functioning as host defense components, a
Probab=99.88  E-value=2.4e-22  Score=175.93  Aligned_cols=114  Identities=25%  Similarity=0.525  Sum_probs=99.5

Q ss_pred             ceEEEEecCCCCCHHHHHHHHHHcCCeecccCChHHHHHHHHHhhhcccCCCCccEEEeEEeCCCCCcceeEEeCCcccc
Q psy1807          91 RNCYEFNIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQKDPGITARTWKWVDGEVVT  170 (618)
Q Consensus        91 ~~CY~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~fl~~~l~~~~~~~~~~~~WIGl~~~~~~~~~~w~W~DGs~~~  170 (618)
                      .+||.++ .+.++|.+|+.+|+.+||+||+|+|.+|++||..++.. .    ...+||||++..  .++.|+|+||+++.
T Consensus         1 ~k~y~~~-~~~~~w~~A~~~C~~~g~~La~i~s~~e~~~l~~~~~~-~----~~~~WiGl~~~~--~~~~~~w~dg~~~~   72 (114)
T cd03591           1 EKIFVTN-GEEKNFDDAQKLCSEAGGTLAMPRNAAENAAIASYVKK-G----NTYAFIGITDLE--TEGQFVYLDGGPLT   72 (114)
T ss_pred             CcEEEeC-CceeCHHHHHHHHhhcCCEEecCCCHHHHHHHHHHHhc-C----CccEEEecccCC--cCCcEEeCCCCCcc
Confidence            3799996 45799999999999999999999999999999999876 1    246999999765  38899999999999


Q ss_pred             cCCCCCCCCCCCCCccceEEecCCCCceeeccccccccccccccC
Q psy1807         171 KPSWGKDQPNNYNGEQNCVVLDGGRGWLWNDVGCKLDYLHWICQH  215 (618)
Q Consensus       171 ~~~W~~~~P~~~~~~~~Cv~l~~~~~~~W~d~~C~~~~~~~iC~~  215 (618)
                      |++|.+++|++..+.++||.|...  ..|++.+|+. ..+||||+
T Consensus        73 y~~W~~~ep~~~~~~~~Cv~~~~~--~~W~~~~C~~-~~~fICe~  114 (114)
T cd03591          73 YTNWKPGEPNNAGGGEDCVEMYTS--GKWNDVACNL-TRLFVCEF  114 (114)
T ss_pred             cCCcCCCCCCCCCCCCCeEEECCC--CcCcCccCCC-CeeEEeeC
Confidence            999999999987667899999743  3899999998 68999984


No 20 
>cd03603 CLECT_VCBS A bacterial subgroup of the C-type lectin-like (CTLD) domain; a subgroup of bacterial protein domains homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. CLECT_VCBS: A bacterial subgroup of the C-type lectin-like (CTLD) domain; a subgroup of bacterial protein domains homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins.  Many CTLDs are calcium-dependent carbohydrate binding modules; other CTLDs bind protein ligands, lipids, and inorganic surfaces including CaCO3 and ice.  Bacterial CTLDs within this group are functionally uncharacterized.  Animal C-type lectins are involved in such functions as extracellular matrix organization, endocytosis, complement activation, pathogen recognition, and cell-cell interactions.  CTLDs may bind a variety of carbohydrate ligands including mannose, N-acetylglucosamine, galactose, N-acetylgalactosamine, and fucose.  CTLDs associate with each other through several different surface
Probab=99.87  E-value=4.4e-22  Score=175.07  Aligned_cols=113  Identities=30%  Similarity=0.543  Sum_probs=97.2

Q ss_pred             eEEEEecCCCCCHHHHHHHHHHcCCeecccCChHHHHHHHHHhhhcccCCCCccEEEeEEeCCCCCcceeEEeCCccccc
Q psy1807          92 NCYEFNIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQKDPGITARTWKWVDGEVVTK  171 (618)
Q Consensus        92 ~CY~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~fl~~~l~~~~~~~~~~~~WIGl~~~~~~~~~~w~W~DGs~~~~  171 (618)
                      +||+|+. +.++|.+|+..|+++||+||+|+|++|++||..++..      ...+||||++..  .++.|+|+||+++.|
T Consensus         1 ~~Y~~~~-~~~sw~~A~~~C~~~g~~La~I~s~~E~~fv~~~~~~------~~~~WiG~~~~~--~~~~w~W~dg~~~~~   71 (118)
T cd03603           1 HFYKFVD-GGMTWEAAQTLAESLGGHLVTINSAEENDWLLSNFGG------YGASWIGASDAA--TEGTWKWSDGEESTY   71 (118)
T ss_pred             CeEEEeC-CCcCHHHHHHHHHHcCCEEcccCCHHHHHHHHHHhcc------CCCEEEeeecCC--CCCceEeCCCCcCCC
Confidence            5899965 4699999999999999999999999999999998861      235899999865  388999999999999


Q ss_pred             CCCCCCCC-CCCCCccceEEecCC--CCceeeccccc-ccccccccc
Q psy1807         172 PSWGKDQP-NNYNGEQNCVVLDGG--RGWLWNDVGCK-LDYLHWICQ  214 (618)
Q Consensus       172 ~~W~~~~P-~~~~~~~~Cv~l~~~--~~~~W~d~~C~-~~~~~~iC~  214 (618)
                      .+|.+++| ++..+.++||.|...  ..++|+|..|+ . +..||||
T Consensus        72 ~~W~~~eP~~~~~~~~~Cv~~~~~~~~~~~W~d~~C~~~-~~~~iCe  117 (118)
T cd03603          72 TNWGSGEPHNNGGGNEDYAAINHFPGISGKWNDLANSYN-TLGYVIE  117 (118)
T ss_pred             CCcCCCCCCCCCCCCcCeEEeecCCCCCCcCccCCCCcc-ccceEEe
Confidence            99999999 766667899998642  23489999999 7 6799997


No 21 
>cd03601 CLECT_TC14_like C-type lectin-like domain (CTLD) of the type found in lectins TC14, TC14-2, TC14-3, and TC14-4 from the budding tunicate Polyandrocarpa misakiensis and PfG6 from the Acorn worm. CLECT_TC14_like: C-type lectin-like domain (CTLD) of the type found in lectins TC14, TC14-2, TC14-3, and TC14-4 from the budding tunicate Polyandrocarpa misakiensis and PfG6 from the Acorn worm.  CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins.  TC14 is homodimeric.  The CTLD of TC14 binds D-galactose and D-fucose.  TC14 is expressed constitutively by multipotent epithelial and mesenchymal cells and plays in role during budding, in inducing the aggregation of undifferentiated mesenchymal cells to give rise to epithelial forming tissue.   TC14-2 and TC14-3 shows calcium-dependent galactose binding activity.  TC14-3 is a cytostatic factor which blocks cell growth and dedifferentiation of the atrial epithelium during asexual reproducti
Probab=99.86  E-value=1.7e-21  Score=171.73  Aligned_cols=114  Identities=22%  Similarity=0.311  Sum_probs=90.5

Q ss_pred             EEEecCCCCCHHHHHHHHHHcCCeecccCChHHHHHHHHHhhhcccCCCCccEEEeEEeC-CCCCcceeEEeCCccc--c
Q psy1807          94 YEFNIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQKD-PGITARTWKWVDGEVV--T  170 (618)
Q Consensus        94 Y~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~fl~~~l~~~~~~~~~~~~WIGl~~~-~~~~~~~w~W~DGs~~--~  170 (618)
                      |+|+ .+.++|.+|+.+|+.+||+||+|+|++| .++. .+..... .....|||||++. .  .++.|+|+||+++  .
T Consensus         3 ~~~~-~~~~~w~~A~~~C~~~G~~La~i~s~~e-~~~~-~i~~~~~-~~~~~~WIGl~d~~~--~~g~~~W~dG~~~~~~   76 (119)
T cd03601           3 ILCS-DETMNYAKAGAFCRSRGMRLASLAMRDS-EMRD-AILAFTL-VKGHGYWVGADNLQD--GEYDFLWNDGVSLPTD   76 (119)
T ss_pred             EEEc-CCcCCHHHHHHHHHhcCCEEeeecCHHH-HHHH-HHHhccc-cCCccEEEEeccCCC--CCCCeEeCCCCCcCCC
Confidence            6675 4469999999999999999999999988 3333 3332211 1124699999986 5  3889999999998  6


Q ss_pred             cCCCCCCCCCCCCCccceEEecCCCCceeeccccccccccccccC
Q psy1807         171 KPSWGKDQPNNYNGEQNCVVLDGGRGWLWNDVGCKLDYLHWICQH  215 (618)
Q Consensus       171 ~~~W~~~~P~~~~~~~~Cv~l~~~~~~~W~d~~C~~~~~~~iC~~  215 (618)
                      |++|+++||++..+.++||.|....+ +|+|..|.. ..+||||+
T Consensus        77 y~~W~~geP~~~~~~e~Cv~~~~~~~-~W~d~~C~~-~~~fICek  119 (119)
T cd03601          77 SDLWAPNEPSNPQSRQLCVQLWSKYN-LLDDEYCGR-AKRVICEK  119 (119)
T ss_pred             CCccCCCcCcCcCCCcCCeEEeCCCC-CEeCccCCC-CceeeecC
Confidence            99999999998766789999965433 899999997 67999985


No 22 
>cd03592 CLECT_selectins_like C-type lectin-like domain (CTLD) of the type found in the type 1 transmembrane proteins:  P(platlet)-, E(endothelial)-, and L(leukocyte)- selectins (sels). CLECT_selectins_like: C-type lectin-like domain (CTLD) of the type found in the type 1 transmembrane proteins:  P(platlet)-, E(endothelial)-, and L(leukocyte)- selectins (sels).  CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins.  P- E- and L-sels are cell adhesion receptors that mediate the initial attachment, tethering, and rolling of lymphocytes on inflamed vascular walls enabling subsequent lymphocyte adhesion and transmigration.  L- sel is expressed constitutively on most leukocytes.  P-sel is stored in the Weibel-Palade bodies of endothelial cells and in the alpha granules of platlets.  E- sels are present on endothelial cells.  Following platelet and/or endothelial cell activation P- sel is rapidly translocated to the cell surface and E-sel exp
Probab=99.86  E-value=1.5e-21  Score=171.08  Aligned_cols=113  Identities=25%  Similarity=0.414  Sum_probs=96.4

Q ss_pred             EEEecCCCCCHHHHHHHHHHcCCeecccCChHHHHHHHHHhhhcccCCCCccEEEeEEeCCCCCcceeEEeCCcccccCC
Q psy1807          94 YEFNIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQKDPGITARTWKWVDGEVVTKPS  173 (618)
Q Consensus        94 Y~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~fl~~~l~~~~~~~~~~~~WIGl~~~~~~~~~~w~W~DGs~~~~~~  173 (618)
                      |+|+. +.++|++|+.+|+.+||+||+|+|++|++||..++....    ...|||||++..  ..+.|.|.||+++.|.+
T Consensus         3 Y~~~~-~~~~w~~A~~~C~~~g~~La~i~s~~e~~~i~~~~~~~~----~~~~WiG~~~~~--~~~~W~~~dg~~~~y~~   75 (115)
T cd03592           3 YHYST-EKMTFNEAVKYCKSRGTDLVAIQNAEENALLNGFALKYN----LGYYWIDGNDIN--NEGTWVDTDKKELEYKN   75 (115)
T ss_pred             EEEcC-CccCHHHHHHHHHHcCCeEeecCCHHHHHHHHHHHHhcC----CCCEEEeCccCC--ccCeEEeCCCCcccccc
Confidence            77754 469999999999999999999999999999999875432    236999999865  37889999999999999


Q ss_pred             CCCCCCCCCCCccceEEecCCCCceeeccccccccccccccC
Q psy1807         174 WGKDQPNNYNGEQNCVVLDGGRGWLWNDVGCKLDYLHWICQH  215 (618)
Q Consensus       174 W~~~~P~~~~~~~~Cv~l~~~~~~~W~d~~C~~~~~~~iC~~  215 (618)
                      |++++|++.. .++|+.|......+|+|.+|.. ..+||||+
T Consensus        76 W~~geP~~~~-~~~Cv~~~~~~~g~W~d~~C~~-~~~fICe~  115 (115)
T cd03592          76 WAPGEPNNGR-NENCLEIYIKDNGKWNDEPCSK-KKSAICYT  115 (115)
T ss_pred             cCCCCCCCCC-CCCceEEccCCCCCCcCcCCCC-CccceeCC
Confidence            9999999865 6799999764344899999997 67999985


No 23 
>PHA02817 EEV Host range protein; Provisional
Probab=99.86  E-value=2.9e-21  Score=183.90  Aligned_cols=122  Identities=31%  Similarity=0.667  Sum_probs=110.3

Q ss_pred             ceeeecCCCCCccCCeeEEecCCCcccCCEEEEEeCCC-----ceecCCCeEEeCCCCCcCCCCCcccccccCCCCCccc
Q psy1807         272 TCQYVDCGKLEHIEHGTVTLETTRTTHGAVAIYACHEN-----YTLIGETRRVCGDGGKWNGTEPQCLFDWCAEPPQISG  346 (618)
Q Consensus       272 ~C~~~~C~~~~~~~ng~~~~~~~~~~~g~~~~~~C~~G-----y~l~G~~~~~C~~~G~Ws~~~p~C~~~~C~~p~~~~~  346 (618)
                      .|+.+.|+.|+.+.||.+......|.+|+.++|+|++|     |+|.|+..++|+.+|+|+++.|.|+.+.|+.|. +.|
T Consensus        18 ~C~~~~C~~PP~i~NG~i~~~~~~Y~~Gd~Vty~C~~G~~~~gY~L~G~~~i~C~~dG~Ws~~~P~C~~v~C~~P~-i~N   96 (225)
T PHA02817         18 LCDLNKCCYPPSIKNGYIYNKKTEYNIGSNVTFFCGNNTRGVRYTLVGEKNIICEKDGKWNKEFPVCKIIRCRFPA-LQN   96 (225)
T ss_pred             EecCcCCCCCCCCCCCEEecCCCceeCCCEEEEEeCCCCCCCCEEECCCCeEEECCCCcCCCCCCeeeeeECCCCC-CcC
Confidence            58889999999999999876666799999999999997     899999999999999999989999999999764 778


Q ss_pred             cEEEe----cccccCceeeecCCCCcEEcCCceEEEccCCeecCCCCceeee
Q psy1807         347 GIVTT----SGRRTGSVATYSCEPGFILFGSNELKCGLGGDWSGKAPTCKYV  394 (618)
Q Consensus       347 g~~~~----~~~~~g~~~~~~C~~Gy~l~G~~~~~C~~~G~Ws~~~p~C~~~  394 (618)
                      |.+..    ..+.+|++|+|+|++||.|.|+..++|+++|+|+++.|.|+..
T Consensus        97 G~v~~~~~~~~y~yg~~Vty~C~~Gy~L~G~~~~tC~~~G~WSp~~P~C~~~  148 (225)
T PHA02817         97 GFVNGIPDSKKFYYESEVSFSCKPGFVLIGTKYSVCGINSSWIPKVPICSRD  148 (225)
T ss_pred             ceeEccccCCceEcCCEEEEEcCCCCEEcCCCceEECCCCeECCCCCEeecC
Confidence            87753    3578999999999999999999999999999999999999853


No 24 
>PHA02642 C-type lectin-like protein; Provisional
Probab=99.85  E-value=3.6e-21  Score=181.16  Aligned_cols=116  Identities=20%  Similarity=0.381  Sum_probs=100.9

Q ss_pred             ccccccccCccceeecceEEEEecCCCCCHHHHHHHHHHcCCeecccCChHHHHHHHHHhhhcccCCCCccEEEeEEeCC
Q psy1807          75 RCASEKAASAEITSFDRNCYEFNIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQKDP  154 (618)
Q Consensus        75 ~Cp~~~~~~~~w~~~~~~CY~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~fl~~~l~~~~~~~~~~~~WIGl~~~~  154 (618)
                      .||      .||+.|+++||+|+.. .++|.+|++.|+++||+||+|++++|++||+++...       ..|||||++..
T Consensus        87 ~CP------~gW~~~~~kCYyfs~~-~ksW~eA~~~C~s~ga~La~I~seeE~~FL~~~~~~-------~~yWIGLsd~~  152 (216)
T PHA02642         87 TCP------KGWIGFGYKCFYFSED-SKNWTFGNTFCTSLGATLVKVETEEELNFLKRYKDS-------SDHWIGLNRES  152 (216)
T ss_pred             CCC------CcCEEECCEEEEEeCc-ccCHHHHHHHHhhCCCeEeeECCHHHHHHHHHhhcC-------CeEEEEeEeCC
Confidence            688      9999999999999754 699999999999999999999999999999986432       24999999976


Q ss_pred             CCCcceeEEeCCcccccCCCCCCCCCCCCCccceEEecCCCCceeeccccccccccccccCCC
Q psy1807         155 GITARTWKWVDGEVVTKPSWGKDQPNNYNGEQNCVVLDGGRGWLWNDVGCKLDYLHWICQHNP  217 (618)
Q Consensus       155 ~~~~~~w~W~DGs~~~~~~W~~~~P~~~~~~~~Cv~l~~~~~~~W~d~~C~~~~~~~iC~~~~  217 (618)
                      .  ++.|+|+||+++.+.+|..|       .++||+++..   .+.+..|.. ...+||++..
T Consensus       153 ~--e~~W~WvDGS~~n~~~~i~G-------~g~CAyLs~~---~i~s~~C~~-~~~wIC~K~l  202 (216)
T PHA02642        153 S--NHPWKWADNSNYNASFVITG-------TGECAYLNDI---RISSSRVYA-NRKWICSKTY  202 (216)
T ss_pred             C--CCceEECCCCccCcceeccC-------CCceEEEeCC---ceEccCcCC-CceEEeeeec
Confidence            3  88999999999998888653       4689999777   789999998 5799999853


No 25 
>cd03600 CLECT_thrombomodulin_like C-type lectin-like domain (CTLD) of the type found in human thrombomodulin(TM), Endosialin, C14orf27, and C1qR. CLECT_thrombomodulin_like: C-type lectin-like domain (CTLD) of the type found in human thrombomodulin(TM), Endosialin, C14orf27, and C1qR.  CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins.  In these thrombomodulin-like proteins the residues involved in coordinating Ca2+ in the classical MBP-A CTLD are not conserved.  TM exerts anti-fibrinolytic and anti-inflammatory activity.  TM also regulates blood coagulation in the anticoagulant protein C pathway.  In this pathway, the procoagulant properties of thrombin (T) are lost when it binds TM.  TM also plays a key role in tumor biology.  It is expressed on endothelial cells and on several type of tumor cell including squamous cell carcinoma.  Loss of TM expression correlates with advanced stage and poor prognosis.  Loss of function of TM func
Probab=99.85  E-value=6.9e-21  Score=173.22  Aligned_cols=124  Identities=24%  Similarity=0.531  Sum_probs=99.3

Q ss_pred             cceEEEEecCCCCCHHHHHHHHHHcCCeecccCChHHHHHHHHHhhhcccC--CCCccEEEeEEeCCC------CCccee
Q psy1807          90 DRNCYEFNIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDR--LKTLLVWIGAQKDPG------ITARTW  161 (618)
Q Consensus        90 ~~~CY~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~fl~~~l~~~~~~--~~~~~~WIGl~~~~~------~~~~~w  161 (618)
                      +++||+++.. .++|.+|+.+|+++||+||+|+|.+|++||..++......  .....+||||++...      ..++.|
T Consensus         3 ~~~Cy~~~~~-~~sw~~A~~~C~~~gg~La~i~s~~E~~~v~~~l~~~~~~~~~~~~~~WIGl~~~~~~~~~~~~~~~~f   81 (141)
T cd03600           3 SDACYTLHPQ-KLTFLEAQRSCIELGGNLATVRSGEEADVVSLLLAAGPGRHGRGSLRLWIGLQREPRQCSDPSLPLRGF   81 (141)
T ss_pred             CCceEEEeCC-ccCHHHHHHHHHhhCCEeeecCCHHHHHHHHHHHhhccccccCCCccEEEeEecCcccCccccccCCcc
Confidence            6799999765 6999999999999999999999999999999998764210  013469999997421      125789


Q ss_pred             EEeCCc-ccccCCCCCCCCCCCCCccceEEecCCC----CceeeccccccccccccccC
Q psy1807         162 KWVDGE-VVTKPSWGKDQPNNYNGEQNCVVLDGGR----GWLWNDVGCKLDYLHWICQH  215 (618)
Q Consensus       162 ~W~DGs-~~~~~~W~~~~P~~~~~~~~Cv~l~~~~----~~~W~d~~C~~~~~~~iC~~  215 (618)
                      +|+||. .+.|++|+++ |++..+.++||.|....    .++|+|.+|+.....||||+
T Consensus        82 ~W~d~~~~~~y~~W~~~-p~n~~~~~~Cv~l~~~~~~~~~~~W~d~~C~~~~~~fIC~~  139 (141)
T cd03600          82 SWVTGDQDTDFSNWLQE-PAGTCTSPRCVALSAAGSTPDNLKWKDGPCSARADGYLCKF  139 (141)
T ss_pred             EECCCCCCCCccccccC-CCCCCCCCccEEEEccCCCCCCCccccCCcCCCCCCeEEee
Confidence            999976 5789999998 77765678999996543    56999999998435799996


No 26 
>smart00034 CLECT C-type lectin (CTL) or carbohydrate-recognition domain (CRD). Many of these domains function as calcium-dependent carbohydrate binding modules.
Probab=99.84  E-value=9.6e-21  Score=168.17  Aligned_cols=123  Identities=30%  Similarity=0.658  Sum_probs=104.7

Q ss_pred             ccceeecceEEEEecCCCCCHHHHHHHHHHcCCeecccCChHHHHHHHHHhhhcccCCCCccEEEeEEeCCCCCcceeEE
Q psy1807          84 AEITSFDRNCYEFNIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQKDPGITARTWKW  163 (618)
Q Consensus        84 ~~w~~~~~~CY~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~fl~~~l~~~~~~~~~~~~WIGl~~~~~~~~~~w~W  163 (618)
                      .||+.++++||+++.. .++|.+|+.+|+.++|+||+|++++|++||..++....  .....|||||++..  ..+.|+|
T Consensus         3 ~~w~~~~~~Cy~~~~~-~~~~~~A~~~C~~~~~~La~i~~~~e~~~i~~~~~~~~--~~~~~~WiG~~~~~--~~~~~~W   77 (126)
T smart00034        3 SGWISYGGKCYKFSTE-KKTWADAQAFCQSLGAHLASIHSEAENDFVASLLKNSG--SNSDYYWIGLSDPD--SNGSWQW   77 (126)
T ss_pred             cchhhhCCEEEEEECC-ccCHHHHHHHHHhcCCEEcccCCHHHHHHHHHHHHhhc--CCCCCEEEecCccC--cCCCeEE
Confidence            9999999999999765 59999999999999999999999999999999987742  12346999999743  3789999


Q ss_pred             eCCcc-cccCCCCCCCCCCCCCccceEEecCCCCceeecccccccccccccc
Q psy1807         164 VDGEV-VTKPSWGKDQPNNYNGEQNCVVLDGGRGWLWNDVGCKLDYLHWICQ  214 (618)
Q Consensus       164 ~DGs~-~~~~~W~~~~P~~~~~~~~Cv~l~~~~~~~W~d~~C~~~~~~~iC~  214 (618)
                      +||++ +.|.+|.++ ++. ...++|+.|.....+.|.+.+|.. ..+||||
T Consensus        78 ~dg~~~~~~~~w~~~-~~~-~~~~~C~~~~~~~~~~w~~~~C~~-~~~~ICe  126 (126)
T smart00034       78 SDGSGPVNYSNWAPG-EPN-GGSGDCVVLSTSGGGKWNDVSCTS-KLPFVCE  126 (126)
T ss_pred             CCCCCCCCccccCCC-CCC-CCCCCCEEEecCCCCcccCCCCCC-CcccccC
Confidence            99998 789999998 433 346799999866546899999997 7799997


No 27 
>cd03595 CLECT_chondrolectin_like C-type lectin-like domain (CTLD) of the type found in the human type-1A transmembrane proteins chondrolectin (CHODL) and layilin. CLECT_chondrolectin_like: C-type lectin-like domain (CTLD) of the type found in the human type-1A transmembrane proteins chondrolectin (CHODL) and layilin.  CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins.  CHODL is predominantly expressed in muscle cells and is associated with T-cell maturation.  Various alternatively spliced isoforms have been of CHODL have been identified.  The transmembrane form of CHODL is localized in the ER-Golgi apparatus.  Layilin is widely expressed in different cell types.  The extracellular CTLD of layilin binds hyaluronan (HA), a major constituent of the extracellular matrix (ECM).  The cytoplasmic tail of layilin binds various members of the band 4.1/ERM superfamily (talin, radixin, and merlin).  The ERM proteins are cytoskeleton-membrane l
Probab=99.84  E-value=1.1e-20  Score=173.24  Aligned_cols=122  Identities=24%  Similarity=0.554  Sum_probs=98.2

Q ss_pred             ecceEEEEe----cCCCCCHHHHHHHHHHcCCeecccCChHHHHHHHHHhhhcccCCCCccEEEeEEeCCCC------Cc
Q psy1807          89 FDRNCYEFN----IDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQKDPGI------TA  158 (618)
Q Consensus        89 ~~~~CY~~~----~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~fl~~~l~~~~~~~~~~~~WIGl~~~~~~------~~  158 (618)
                      |+++||+++    ..+.++|.+|+..|+.+||+||+|+|++|++||..++.....  ....+||||++....      ..
T Consensus         8 ~~~~Cy~~~~~~~~~~~~tw~~A~~~C~~~g~~LasI~s~~E~~~i~~~i~~~~~--~~~~~WIGl~~~~~~~~~~~~~~   85 (149)
T cd03595           8 TEKPCYKIAYFQDSRRRLNFEEARQACREDGGELLSIESENEQKLIERFIQTLRA--SDGDFWIGLRRSSQYNVTSSACS   85 (149)
T ss_pred             CCCccEEEEEEeccccccCHHHHHHHHHHcCCEECccCCHHHHHHHHHHHHhhcC--CCCcEEEEeECCCCccccccccC
Confidence            558999864    245799999999999999999999999999999998865322  234699999975421      13


Q ss_pred             ceeEEeCCcccccCCCCCCCCCCCCCccceEEecCC----------CCceeeccccccccccccccC
Q psy1807         159 RTWKWVDGEVVTKPSWGKDQPNNYNGEQNCVVLDGG----------RGWLWNDVGCKLDYLHWICQH  215 (618)
Q Consensus       159 ~~w~W~DGs~~~~~~W~~~~P~~~~~~~~Cv~l~~~----------~~~~W~d~~C~~~~~~~iC~~  215 (618)
                      +.|+|+||+++.|.+|.+++|++  +.++|+.|...          ..++|+|..|+. ..+||||+
T Consensus        86 ~~~~W~dG~~~~y~~W~~~eP~~--~~~~Cv~l~~~~~~~~~~~~~~~~~W~d~~C~~-~~~fICe~  149 (149)
T cd03595          86 SLYYWLDGSISTFRNWYVDEPSC--GSEVCVVMYHQPSAPAGQGGPYLFQWNDDNCNM-KNNFICKY  149 (149)
T ss_pred             CccEEcCCCccCccCCCCCCCCC--cccCCEEEEecCCCCcCcCcccCCCccCCCCCC-CcccccCC
Confidence            46999999999999999999986  35799998531          134899999998 67999984


No 28 
>PHA02953 IEV and EEV membrane glycoprotein; Provisional
Probab=99.83  E-value=1.7e-20  Score=172.98  Aligned_cols=116  Identities=17%  Similarity=0.258  Sum_probs=93.0

Q ss_pred             CccccccccCccceeecceEEEEecCCCCCHHHHHHHHHHcCCeecccCChHHHHHHHHHhhhcccCCCCccEEEeEEeC
Q psy1807          74 DRCASEKAASAEITSFDRNCYEFNIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQKD  153 (618)
Q Consensus        74 ~~Cp~~~~~~~~w~~~~~~CY~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~fl~~~l~~~~~~~~~~~~WIGl~~~  153 (618)
                      ..||      .||..|+++||+++.. .++|++|+.+|+.+||+|++|+  +|+.||..+...       ..+||||++ 
T Consensus        45 ~~CP------~GW~~~~~~CYk~f~~-~~tW~~A~~~C~~~Gg~L~~~~--~e~~fv~~~~~~-------~~~WIGL~d-  107 (170)
T PHA02953         45 SACA------KGWIPYDNYCYLDTNI-QLSTYGAVYLCNKYRARLPKPN--FRHLKVLSLTYG-------KDFWVSLKK-  107 (170)
T ss_pred             CcCc------ccChhhCCEEEEEECC-cCCHHHHHHHHHhcCCCCCCCc--HHHHHHHHhccC-------CCEEEeEEC-
Confidence            4799      9999999999999755 6999999999999999998877  667888765431       248999997 


Q ss_pred             CCCCcceeEEeCC-cccccCC------CCCCCCCCCCCccceEEecCCCCceeeccccccccccccccCC
Q psy1807         154 PGITARTWKWVDG-EVVTKPS------WGKDQPNNYNGEQNCVVLDGGRGWLWNDVGCKLDYLHWICQHN  216 (618)
Q Consensus       154 ~~~~~~~w~W~DG-s~~~~~~------W~~~~P~~~~~~~~Cv~l~~~~~~~W~d~~C~~~~~~~iC~~~  216 (618)
                      .   ++.|+|.|| +.++|.+      |. ++|++  +.++|+.+..+   +|+|..|.. ..+||||+.
T Consensus       108 ~---eg~W~w~Dggs~~~y~~~~~~~~w~-~~~~~--~~e~C~~~~~~---~W~d~~C~~-~~~fICqk~  167 (170)
T PHA02953        108 K---NNRWLDINTNKTVDMNKNTELKKIK-SKTKN--DNEACYIYKSG---ELKETVCNS-VNYIICVKR  167 (170)
T ss_pred             C---CCceEeCCCCeeecccccccccccc-CCCCC--CCCCceEEeCC---eEEeccCCC-CcEEEEEEe
Confidence            2   779999996 5666554      66 45544  46799998653   799999998 569999873


No 29 
>cd03602 CLECT_1 C-type lectin (CTL)/C-type lectin-like (CTLD) domain subgroup 1; a subgroup of protein domains homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. CLECT_1: C-type lectin (CTL)/C-type lectin-like (CTLD) domain subgroup 1; a subgroup of protein domains homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins.  Many CTLDs are calcium-dependent carbohydrate binding modules; other CTLDs bind protein ligands, lipids, and inorganic surfaces including CaCO3 and ice.  Animal C-type lectins are involved in such functions as extracellular matrix organization, endocytosis, complement activation, pathogen recognition, and cell-cell interactions.  CTLDs may bind a variety of carbohydrate ligands including mannose, N-acetylglucosamine, galactose, N-acetylgalactosamine, and fucose. CTLDs associate with each other through several different surfaces to form dimers, trimers, or tetramers from which ligand-binding sites project in differe
Probab=99.82  E-value=4.8e-20  Score=159.68  Aligned_cols=108  Identities=20%  Similarity=0.453  Sum_probs=92.9

Q ss_pred             eEEEEecCCCCCHHHHHHHHHHcCCeecccCChHHHHHHHHHhhhcccCCCCccEEEeEEeCCCCCcceeEEeCCccccc
Q psy1807          92 NCYEFNIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQKDPGITARTWKWVDGEVVTK  171 (618)
Q Consensus        92 ~CY~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~fl~~~l~~~~~~~~~~~~WIGl~~~~~~~~~~w~W~DGs~~~~  171 (618)
                      ++|+|+. +.++|.+|+++|+.+||+||+|+|++|++||..++..     ....|||||++.    ++.|.|+||+++.|
T Consensus         1 ~~y~~~~-~~~~w~~A~~~C~~~g~~La~i~s~~e~~~l~~~~~~-----~~~~~WiGl~~~----~~~~~W~dg~~~~~   70 (108)
T cd03602           1 RTFYLVN-ESKTWSEAQQYCRENYTDLATVQNQEDNALLSNLSRV-----SNSAAWIGLYRD----VDSWRWSDGSESSF   70 (108)
T ss_pred             CceEEec-cccCHHHHHHHHHHHCCccCeecCHHHHHHHHHHHhc-----cCCcEEEEEECC----CCceEEcCCCCCcc
Confidence            4788864 4699999999999999999999999999999998862     124699999975    67899999999999


Q ss_pred             CCCCCCCCCCCCCccceEEecCCCCceeeccccccccccccccC
Q psy1807         172 PSWGKDQPNNYNGEQNCVVLDGGRGWLWNDVGCKLDYLHWICQH  215 (618)
Q Consensus       172 ~~W~~~~P~~~~~~~~Cv~l~~~~~~~W~d~~C~~~~~~~iC~~  215 (618)
                      .+|.+++|..   .++||.|...  .+|++..|.. ..+|||+.
T Consensus        71 ~~w~~~~~~~---~~~C~~~~~~--~~w~~~~C~~-~~~fIC~~  108 (108)
T cd03602          71 RNWNTFQPFG---QGDCATMYSS--GRWYAALCSA-LKPFICYD  108 (108)
T ss_pred             CccCCCCCCC---CCCeeEECcC--CeECcccCCC-CcCEeccC
Confidence            9999998753   5799999764  3899999997 67999984


No 30 
>PHA03097 C-type lectin-like protein; Provisional
Probab=99.80  E-value=1.8e-19  Score=164.95  Aligned_cols=111  Identities=21%  Similarity=0.309  Sum_probs=91.8

Q ss_pred             ccccccccCccceeecceEEEEecCCCCCHHHHHHHHHHcCCeecccCChHHHHHHHHHhhhcccCCCCccEEEeEEeCC
Q psy1807          75 RCASEKAASAEITSFDRNCYEFNIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQKDP  154 (618)
Q Consensus        75 ~Cp~~~~~~~~w~~~~~~CY~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~fl~~~l~~~~~~~~~~~~WIGl~~~~  154 (618)
                      .||      .||+.|+++||+|+.. .++|.+|+..|+++||+||+|++++|++||.+++..       ..|||||++..
T Consensus        45 ~CP------~gW~~~~~~CY~~s~~-~~sW~~A~~~C~~~g~~La~I~~~~E~~fi~~~~~~-------~~~WIGL~d~~  110 (157)
T PHA03097         45 NCR------SGWVGYNNKCYTFSEN-ITNKHLAIERCADMDGILTLIDDQKEVLFVSRYKGG-------QDLWIGIEKKK  110 (157)
T ss_pred             CCC------CCceeeCCEEEEEecC-CCcHHHHHHHHHhCCCEEeeeCCHHHHHHHHHhcCC-------CCEEEeeecCC
Confidence            688      9999999999999755 599999999999999999999999999999987632       35999999863


Q ss_pred             CCCcceeEEeCCcccccCCCCCCCCCCCCCccceEEecCCCCceeeccccccccccccccCC
Q psy1807         155 GITARTWKWVDGEVVTKPSWGKDQPNNYNGEQNCVVLDGGRGWLWNDVGCKLDYLHWICQHN  216 (618)
Q Consensus       155 ~~~~~~w~W~DGs~~~~~~W~~~~P~~~~~~~~Cv~l~~~~~~~W~d~~C~~~~~~~iC~~~  216 (618)
                         .    |.||+++.. +|.+  +   .+.++||++...   .|+|..|.. ...||||+.
T Consensus       111 ---~----W~dgs~~~~-~~~~--~---~~~e~Cv~i~~~---~w~d~~C~~-~~~~ICek~  155 (157)
T PHA03097        111 ---G----DDDDREVLD-KVVK--P---PKSGKCAYLKDK---TIISSNCNA-TKGWICFDR  155 (157)
T ss_pred             ---C----ccCCCcccc-cccC--C---CCCCCEEEEECC---cEEeCCCCC-CeeEEEeec
Confidence               2    999987643 4432  2   235789999765   699999998 679999974


No 31 
>TIGR00864 PCC polycystin cation channel protein. Note: this model has been restricted to the amino half because for technical reasons.
Probab=99.80  E-value=1.2e-19  Score=217.60  Aligned_cols=129  Identities=22%  Similarity=0.366  Sum_probs=108.3

Q ss_pred             CccccccccCccceee--cceEEEEecCCCCCHHHHHHHHHHc-CCeecccCChHHHHHHHHHhhhcccCCCCccEEEeE
Q psy1807          74 DRCASEKAASAEITSF--DRNCYEFNIDKGASFADARSKCQAS-GGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGA  150 (618)
Q Consensus        74 ~~Cp~~~~~~~~w~~~--~~~CY~~~~~~~~tw~~A~~~C~~~-g~~L~~i~s~~e~~fl~~~l~~~~~~~~~~~~WIGl  150 (618)
                      ..||      .||..|  +++||+|+.. .++|.+|+..|+++ ||+||+|+|++|++||.+++....    ...+||||
T Consensus       316 ~~CP------~GW~~f~~~g~CYk~~~e-~~TW~dAe~~C~s~GGAhLAsI~S~eEn~FL~~lv~~s~----~~~vWIGL  384 (2740)
T TIGR00864       316 PHCP------KDGEIFEENGHCFQIVPE-EAAWLDAQEQCLARAGAALAIVDNDALQNFLARKVTHSL----DRGVWIGF  384 (2740)
T ss_pred             CCCC------CCCeecCCCCEEEEEeCC-ccCHHHHHHHhhccCCeEEecCCCHHHHHHHHHHhhccC----CccEEEee
Confidence            3799      999765  8999999765 69999999999999 599999999999999999886532    23599999


Q ss_pred             EeCCCCCcc--eeEEeCCccc-ccCCCCCCCCCCCCCccceEEecCCCCceeeccccccccccccccCCCCC
Q psy1807         151 QKDPGITAR--TWKWVDGEVV-TKPSWGKDQPNNYNGEQNCVVLDGGRGWLWNDVGCKLDYLHWICQHNPAN  219 (618)
Q Consensus       151 ~~~~~~~~~--~w~W~DGs~~-~~~~W~~~~P~~~~~~~~Cv~l~~~~~~~W~d~~C~~~~~~~iC~~~~~~  219 (618)
                      ++...  ++  .|+|.||+++ .|++|.++||++.. .++|+.|... + .|+|..|.. ...||||+.+..
T Consensus       385 sD~~~--EG~g~WvWsDGS~l~~YtnW~pGEPNn~~-~EdCV~l~~~-g-~WND~~Cs~-~~~FICE~~~~~  450 (2740)
T TIGR00864       385 SDVNG--AEKGPAHQGEAFEAEECEEGLAGEPHPAR-AEHCVRLDPR-G-QCNSDLCNA-PHAYVCELNPGG  450 (2740)
T ss_pred             eCCCC--CCceeeEECCCCcccCccCcCCCCCCCCC-CCCCEEEcCC-C-CEEccCCCC-CeeEEeEECCCC
Confidence            98653  44  4999999998 79999999999864 5899998654 2 699999998 679999986543


No 32 
>PHA02867 C-type lectin protein; Provisional
Probab=99.76  E-value=4.5e-18  Score=154.56  Aligned_cols=109  Identities=17%  Similarity=0.308  Sum_probs=88.3

Q ss_pred             CccccccccCccceeecceEEEEecCCCCCHHHHHHHHHHcCCeecccCChHHHHHHHHHhhhcccCCCCccEEEeEEeC
Q psy1807          74 DRCASEKAASAEITSFDRNCYEFNIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQKD  153 (618)
Q Consensus        74 ~~Cp~~~~~~~~w~~~~~~CY~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~fl~~~l~~~~~~~~~~~~WIGl~~~  153 (618)
                      ..||      .||+.|+++||+|+.+ .++|.+|+..|.++||+||+|++++|++||..+..        ..|||||++.
T Consensus        47 ~~CP------~gWi~~~~~CY~fs~~-~~tW~~A~~~C~~~ga~La~I~s~eE~~Fl~~~~~--------~~~WIGLs~~  111 (167)
T PHA02867         47 KVCP------DEWIGYNSKCYYFTIN-ETNWNDSKKLCDVMDSSLIRFDNIETLNFVSRYGK--------GSYWIDINQN  111 (167)
T ss_pred             CCCC------CCCEEECCEEEEEecc-ccCHHHHHHHHhhCCCEECCcCCHHHHHHHHHcCC--------CCEEEEEEeC
Confidence            3688      9999999999999654 69999999999999999999999999999987532        2499999986


Q ss_pred             CCCCcceeEEeCCcccccCCCCCCCCCCCCCccceEEecCCCCceeeccccccccccccccCC
Q psy1807         154 PGITARTWKWVDGEVVTKPSWGKDQPNNYNGEQNCVVLDGGRGWLWNDVGCKLDYLHWICQHN  216 (618)
Q Consensus       154 ~~~~~~~w~W~DGs~~~~~~W~~~~P~~~~~~~~Cv~l~~~~~~~W~d~~C~~~~~~~iC~~~  216 (618)
                      .     .|.|.+++.+     ..     ..++++|++++.+   ...+..|.. .+.+||++.
T Consensus       112 ~-----~~~~~~~s~~-----~~-----~~~~~~Ca~i~~~---~i~s~~C~~-~~~wIC~K~  155 (167)
T PHA02867        112 R-----KIPGINFSLY-----YE-----QGVNDICLLFDTS---NIIEMSCIF-HERTICVKE  155 (167)
T ss_pred             C-----CCCCccCcee-----ee-----cCCCCcEEEEeCC---eEEeecccC-CcEEEEEcc
Confidence            4     3456555433     11     2346789999887   678999998 679999984


No 33 
>cd00037 CLECT C-type lectin (CTL)/C-type lectin-like (CTLD) domain. CLECT: C-type lectin (CTL)/C-type lectin-like (CTLD) domain; protein domains homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins.  This group is chiefly comprised of eukaryotic CTLDs, but contains some, as yet functionally uncharacterized, bacterial CTLDs.  Many CTLDs are calcium-dependent carbohydrate binding modules; other CTLDs bind protein ligands, lipids, and inorganic surfaces, including CaCO3 and ice.  Animal C-type lectins are involved in such functions as extracellular matrix organization, endocytosis, complement activation, pathogen recognition, and cell-cell interactions. For example: mannose-binding lectin and lung surfactant proteins A and D bind carbohydrates on surfaces (e.g. pathogens, allergens, necrotic, and apoptotic cells) and mediate functions associated with killing and phagocytosis;  P (platlet)-, E (endothelial)-, and L (leukocyte)- selectins (sels) mediate the initia
Probab=99.62  E-value=3.2e-15  Score=129.86  Aligned_cols=115  Identities=34%  Similarity=0.742  Sum_probs=96.5

Q ss_pred             eEEEEecCCCCCHHHHHHHHHHcCCeecccCChHHHHHHHHHhhhcccCCCCccEEEeEEeCCCCCcceeEEeCCcc-cc
Q psy1807          92 NCYEFNIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQKDPGITARTWKWVDGEV-VT  170 (618)
Q Consensus        92 ~CY~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~fl~~~l~~~~~~~~~~~~WIGl~~~~~~~~~~w~W~DGs~-~~  170 (618)
                      .||+++.. .++|++|++.|+.+||+|++|++.+|.+||..++..    .....+|||+++..  ..+.|.|.||+. ..
T Consensus         1 ~Cy~~~~~-~~~~~~A~~~C~~~~~~L~~~~~~~e~~~i~~~~~~----~~~~~~wvg~~~~~--~~~~~~~~~~~~~~~   73 (116)
T cd00037           1 SCYKFSTE-KLTWEEAQEYCRSLGGHLASIHSEEENDFLASLLKK----SSSSDVWIGLNDLS--SEGTWKWSDGSPLVD   73 (116)
T ss_pred             CCEEEcCC-ccCHHHHHHHHHHcCCEEcccCCHHHHHHHHHHHhC----CCCCCEEEcccccC--cCCCeEecCCCcccc
Confidence            49999655 799999999999999999999999999999999874    12346999999875  378999999998 78


Q ss_pred             cCCCCCCCCCCCCCccceEEecCCCCceeeccccccccccccccC
Q psy1807         171 KPSWGKDQPNNYNGEQNCVVLDGGRGWLWNDVGCKLDYLHWICQH  215 (618)
Q Consensus       171 ~~~W~~~~P~~~~~~~~Cv~l~~~~~~~W~d~~C~~~~~~~iC~~  215 (618)
                      +.+|.+++|.. ....+|+.+.......|.+.+|.. ..+|||++
T Consensus        74 ~~~w~~~~~~~-~~~~~C~~~~~~~~~~~~~~~C~~-~~~~iC~~  116 (116)
T cd00037          74 YTNWAPGEPNP-GGSEDCVVLSSSSDGKWNDVSCSS-KLPFICEK  116 (116)
T ss_pred             ccCCCCCCcCC-CCCCCeeEEccCCCCCccCCCCCC-CceeeecC
Confidence            89999999832 336799999765445899999996 67999974


No 34 
>PF00059 Lectin_C:  Lectin C-type domain;  InterPro: IPR001304 Lectins occur in plants, animals, bacteria and viruses. Initially described for their carbohydrate-binding activity [], they are now recognised as a more diverse group of proteins, some of which are involved in protein-protein, protein-lipid or protein-nucleic acid interactions []. There are at least twelve structural families of lectins:   C-type lectins, which are Ca+-dependent.  S-type (galectins), a widespread family of glycan-binding proteins []. I-type, which have an immunoglobulin-like fold and can recognise sialic acids, other sugars and glycosaminoglycans []. P-type, which bind phosphomannosyl receptors []. Pentraxins []. (Trout) egg lectins. Calreticulin and calnexin, which act as molecular chaperones of the endoplasmic reticulum []. ERGIC-53 and VIP-36 []. Discoidins []. Eel aggutinins (fucolectins) []. Annexin lectins []. Fibrinogen-type lectins, which includes ficolins, tachylectins 5A and 5B, and Limax flavus (Spotted garden slug) agglutinin (these proteins have clear distinctions from one another, but they share a homologous fibrinogen-like domain used for carbohydrate binding). Also unclassified orphan lectins, including amphoterin, Cel-II, complement factor H, thrombospondin, sailic acid-binding lectins, adherence lectin, and cytokins (such as tumour necrosis factor and several interleukins).   C-type lectins can be further divided into seven subgroups based on additional non-lectin domains and gene structure: (I) hyalectans, (II) asialoglycoprotein receptors, (III) collectins, (IV) selectins, (V) NK group transmembrane receptors, (VI) macrophage mannose receptors, and (VII) simple (single domain) lectins []. Therefore, lectins are a diverse group of proteins, both in terms of structure and activity. Carbohydrate binding ability may have evolved independently and sporadically in numerous unrelated families, where each evolved a structure that was conserved to fulfil some other activity and function. In general, animal lectins act as recognition molecules within the immune system, their functions involving defence against pathogens, cell trafficking, immune regulation and the prevention of autoimmunity [].; GO: 0005488 binding; PDB: 1T8D_A 2H2T_B 1T8C_A 2H2R_A 1TN3_A 1RJH_A 1HTN_A 3G8K_B 2E3X_B 1UMR_D ....
Probab=99.58  E-value=5.2e-15  Score=126.76  Aligned_cols=100  Identities=28%  Similarity=0.588  Sum_probs=81.0

Q ss_pred             CCCHHHHHHHHHHcCCeecccCChHHHHHHHHHhhhcccCCCCccEEEeEEeCCCCCcceeEEeCCcccc----cCCCCC
Q psy1807         101 GASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQKDPGITARTWKWVDGEVVT----KPSWGK  176 (618)
Q Consensus       101 ~~tw~~A~~~C~~~g~~L~~i~s~~e~~fl~~~l~~~~~~~~~~~~WIGl~~~~~~~~~~w~W~DGs~~~----~~~W~~  176 (618)
                      +++|.+|+++|+.+||+|+.|.+.+|++||..++..      ...+|||| +..  ..+.|+|.||++..    |.+|  
T Consensus         2 ~~~~~~A~~~C~~~~~~L~~i~~~~e~~~i~~~~~~------~~~~Wig~-~~~--~~~~~~w~~~~~~~~~~~~~~w--   70 (105)
T PF00059_consen    2 PMTWEEAQQYCQSMGAHLASINSEEENDFIQSQLKS------NESYWIGL-DSD--NNGTWKWIDGSPNSPENFYTNW--   70 (105)
T ss_dssp             EEEHHHHHHHHHHTTSEEB-GSSHHHHHHHHHHHHS------SSEEEEEE-ESS--STSEEEETTSSBSSSTTBSGCB--
T ss_pred             CCCHHHHHHHHhcCCCEEeEeCCHHHhhhhhhcccc------cceeeeee-ecc--ccceeccccCCCcccccccccc--
Confidence            479999999999999999999999999999999871      24699999 333  38899999998765    5678  


Q ss_pred             CCCCCCCCccceEEecCCCCceeeccccccccccccccC
Q psy1807         177 DQPNNYNGEQNCVVLDGGRGWLWNDVGCKLDYLHWICQH  215 (618)
Q Consensus       177 ~~P~~~~~~~~Cv~l~~~~~~~W~d~~C~~~~~~~iC~~  215 (618)
                       .  ......+|++|.....+.|.+.+|.. ..+|||++
T Consensus        71 -~--~~~~~~~C~~~~~~~~~~w~~~~C~~-~~~fiCek  105 (105)
T PF00059_consen   71 -N--PPNDSENCAYIYYSSSGKWNDVPCSE-KYPFICEK  105 (105)
T ss_dssp             -S--SGGSSEEEEEEGCSTTTEEEEEETTS-EEEEEEEE
T ss_pred             -c--cCCCCCCeEEEEEcCCCeEEeeCCCC-CeeEEeeC
Confidence             1  22335799999555556999999998 78999985


No 35 
>PHA02911 C-type lectin-like protein; Provisional
Probab=99.30  E-value=5.9e-12  Score=114.60  Aligned_cols=106  Identities=15%  Similarity=0.264  Sum_probs=75.0

Q ss_pred             ccCccceeecceEEEEecCC-CCCHHHHHHHHHHcCCeecccCChHHHHHHHHHhhhcccCCCCccEEEeEEeCCCCCcc
Q psy1807          81 AASAEITSFDRNCYEFNIDK-GASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQKDPGITAR  159 (618)
Q Consensus        81 ~~~~~w~~~~~~CY~~~~~~-~~tw~~A~~~C~~~g~~L~~i~s~~e~~fl~~~l~~~~~~~~~~~~WIGl~~~~~~~~~  159 (618)
                      ||+.+|..|+++||+|+..+ ..+|++|+.+|..+||+|+. .+.+|++||+.             +|.|+.       +
T Consensus       102 pC~~nW~~H~nkCYlFS~~s~sWsW~~Srr~C~~kGATLLk-~sdEELeFIqn-------------~wkGks-------~  160 (213)
T PHA02911        102 ICSRDWINHMGICLLSLGEEVGFRMEIAKRFCEKKDADLIG-KIDEEKKALEN-------------IWTGND-------H  160 (213)
T ss_pred             CCchhhHhhCCEEEEEeccccccchhHHHHHHHhcCCEecc-CcHHHHHHHHH-------------HHhccc-------c
Confidence            47799999999999996542 34559999999999999999 77799999996             355542       3


Q ss_pred             eeEEeCCcccccCCCCCCCCCCCCCccceEEecCCC---CceeeccccccccccccccCCC
Q psy1807         160 TWKWVDGEVVTKPSWGKDQPNNYNGEQNCVVLDGGR---GWLWNDVGCKLDYLHWICQHNP  217 (618)
Q Consensus       160 ~w~W~DGs~~~~~~W~~~~P~~~~~~~~Cv~l~~~~---~~~W~d~~C~~~~~~~iC~~~~  217 (618)
                      .|.|+||.- ..+-        .+...+||.+...+   -.+-....|.. .+.+|||+..
T Consensus       161 ~~fWID~r~-ngSt--------~n~s~sCA~Is~~~~~~~~~V~sesCs~-~~~wICqK~~  211 (213)
T PHA02911        161 SRFWIDNRA-AAST--------FDPVNECAIGTQNHIPEVPEVLKSPCDE-RHSFICIKKD  211 (213)
T ss_pred             ceEEecCcc-ccce--------ecCCCCeEEEEcccccCCCceEccccCC-CceEEEEecc
Confidence            566666631 0000        11145799997630   01456678998 5799999853


No 36 
>cd00033 CCP Complement control protein (CCP) modules (aka short consensus repeats SCRs or SUSHI repeats) have been identified in several proteins of the complement system. SUSHI repeats (short complement-like repeat, SCR) are abundant in complement control proteins. The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. Typically, 2 to 4 modules contribute to a binding site, implying that the orientation of the modules to each other is critical for function.
Probab=98.98  E-value=9.8e-10  Score=82.79  Aligned_cols=55  Identities=36%  Similarity=0.797  Sum_probs=50.7

Q ss_pred             CCCCCcCCCcEEEc--ccCCCCcEEEEEcCCCCeeeCCceeEeCcCCcccCCCCceE
Q psy1807         458 CPEPEVPKGGYVVG--YDLNVHSTLEFHCEPGHLLVGQASLKCTRQGDWDNMSPQFI  512 (618)
Q Consensus       458 C~~p~~~~~g~~~~--~~~~~g~~v~~~C~~Gy~l~G~~~~~C~~~G~Ws~~~P~C~  512 (618)
                      |+.|..+.||.+..  ..|.+|+.|+|.|++||.|.|+..++|+.+|+|++..|.|+
T Consensus         1 C~~p~~~~~g~~~~~~~~~~~g~~~~~~C~~Gy~~~g~~~~~C~~~g~W~~~~p~C~   57 (57)
T cd00033           1 CPPPPVPENGTVTGSKGSYSYGSTVTYSCNEGYTLVGSSTITCTENGGWSPPPPTCE   57 (57)
T ss_pred             CcCCCCCCCcEEecCCCcccCCCEEEEECCCCCeEeCCCeeEECCCCeECCCCcccC
Confidence            78888888998887  57999999999999999999999999999999999999984


No 37 
>PF00084 Sushi:  Sushi domain (SCR repeat);  InterPro: IPR000436 Sushi domains are also known as Complement control protein (CCP) modules, or short consensus repeats (SCR), exist in a wide variety of complement and adhesion proteins. The structure is known for this domain, it is based on a beta-sandwich arrangement; one face made up of three beta-strands hydrogen-bonded to form a triple-stranded region at its centre and the other face formed from two separate beta-strands []. CD21 (also called C3d receptor, CR2, Epstein Barr virus receptor or EBV-R) is the receptor for EBV and for C3d, C3dg and iC3b. Complement components may activate B cells through CD21. CD21 is part of a large signal-transduction complex that also involves CD19, CD81, and Leu13.  Some of the proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Complement decay-accelerating factor (Antigen CD55) belongs to the Cromer blood group system and is associated with Cr(a), Dr(a), Es(a), Tc(a/b/c), Wd(a), WES(a/b), IFC and UMC antigens. Complement receptor type 1 (C3b/C4b receptor) (Antigen CD35) belongs to the Knops blood group system and is associated with Kn(a/b), McC(a), Sl(a) and Yk(a) antigens. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; PDB: 1HFH_A 1HFI_A 1E5G_A 2UWN_A 2V8E_A 2XWB_H 3HS0_I 2OK5_A 2XWJ_J 3HRZ_D ....
Probab=98.94  E-value=8.8e-10  Score=82.81  Aligned_cols=56  Identities=32%  Similarity=0.712  Sum_probs=50.5

Q ss_pred             CCCCCccCCCceEeecCcccccceEEEEcCCCceeccCCcceEEeccCCeeeCCCCce
Q psy1807         396 CGSPANIDNGHYSLLNGTTTHGSVAEYTCDDDYWLEAGAMRRQVCMRDGKWSGDTPSC  453 (618)
Q Consensus       396 C~~p~~~~~g~~~~~~~~~~~g~~~~~~C~~Gy~l~~~G~~~~~C~~~G~Ws~~~p~C  453 (618)
                      |+.|+.+.||.+...+..|.+|++|+|+|++||.+.  |...++|+.+|+|++..|.|
T Consensus         1 C~~P~~~~~~~~~~~~~~~~~g~~~~~~C~~Gy~~~--g~~~~~C~~~g~W~~~~p~C   56 (56)
T PF00084_consen    1 CPPPPPIPNGSISCTNNPYSYGSTVTFSCNPGYELS--GSSTITCQSNGQWSPPPPTC   56 (56)
T ss_dssp             EESESCBTTEEEESSSSSEETTEEEEEEESTTEEEE--SSSEEEEETTSEESSTTSEE
T ss_pred             CcCCCCCCCcEEecCCCCccCCCEEEEcCCCCCEec--CCCEEEECCCCEECCCCCCC
Confidence            667888999977666678999999999999999999  99999999999999988887


No 38 
>smart00032 CCP Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR). The complement control protein (CCP) modules (also known as short  consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino  acid residues and have been identified in several proteins of the  complement system. A missense mutation in seventh CCP domain causes  deficiency of the b subunit of factor XIII.
Probab=98.93  E-value=2.1e-09  Score=80.85  Aligned_cols=54  Identities=28%  Similarity=0.744  Sum_probs=48.8

Q ss_pred             CCCCCcCCCcEEEcc---cCCCCcEEEEEcCCCCeeeCCceeEeCcCCcccCCCCce
Q psy1807         458 CPEPEVPKGGYVVGY---DLNVHSTLEFHCEPGHLLVGQASLKCTRQGDWDNMSPQF  511 (618)
Q Consensus       458 C~~p~~~~~g~~~~~---~~~~g~~v~~~C~~Gy~l~G~~~~~C~~~G~Ws~~~P~C  511 (618)
                      |+.|..+.||.+...   .|.+|++++|.|++||.|.|...++|+.+|+|++..|.|
T Consensus         1 C~~p~~~~~g~~~~~~~~~~~~g~~~~~~C~~Gy~l~g~~~~~C~~~g~Ws~~~p~C   57 (57)
T smart00032        1 CPPPPDIENGTITSSNKGTYSVGDTVTYSCNPGYTLIGSSTITCLEDGTWSPPAPTC   57 (57)
T ss_pred             CcCCCCCCCCEEecCCCCeecCCCEEEEEcCCCCEEcCCCeeEECCCCEECCCCCcC
Confidence            778888888888764   589999999999999999999999999999999998887


No 39 
>cd00033 CCP Complement control protein (CCP) modules (aka short consensus repeats SCRs or SUSHI repeats) have been identified in several proteins of the complement system. SUSHI repeats (short complement-like repeat, SCR) are abundant in complement control proteins. The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. Typically, 2 to 4 modules contribute to a binding site, implying that the orientation of the modules to each other is critical for function.
Probab=98.90  E-value=2.5e-09  Score=80.51  Aligned_cols=57  Identities=32%  Similarity=0.711  Sum_probs=52.0

Q ss_pred             CCCCCccCCCceEeecCcccccceEEEEcCCCceeccCCcceEEeccCCeeeCCCCcee
Q psy1807         396 CGSPANIDNGHYSLLNGTTTHGSVAEYTCDDDYWLEAGAMRRQVCMRDGKWSGDTPSCE  454 (618)
Q Consensus       396 C~~p~~~~~g~~~~~~~~~~~g~~~~~~C~~Gy~l~~~G~~~~~C~~~G~Ws~~~p~C~  454 (618)
                      |+.|+.+.||.+......|.+|+.++|.|++||.|.  |...++|+.+|.|++..|.|+
T Consensus         1 C~~p~~~~~g~~~~~~~~~~~g~~~~~~C~~Gy~~~--g~~~~~C~~~g~W~~~~p~C~   57 (57)
T cd00033           1 CPPPPVPENGTVTGSKGSYSYGSTVTYSCNEGYTLV--GSSTITCTENGGWSPPPPTCE   57 (57)
T ss_pred             CcCCCCCCCcEEecCCCcccCCCEEEEECCCCCeEe--CCCeeEECCCCeECCCCcccC
Confidence            788888889999876667999999999999999999  999999999999999888884


No 40 
>smart00032 CCP Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR). The complement control protein (CCP) modules (also known as short  consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino  acid residues and have been identified in several proteins of the  complement system. A missense mutation in seventh CCP domain causes  deficiency of the b subunit of factor XIII.
Probab=98.90  E-value=2.7e-09  Score=80.21  Aligned_cols=56  Identities=36%  Similarity=0.819  Sum_probs=50.7

Q ss_pred             CCCCCccCCCceEeec-CcccccceEEEEcCCCceeccCCcceEEeccCCeeeCCCCce
Q psy1807         396 CGSPANIDNGHYSLLN-GTTTHGSVAEYTCDDDYWLEAGAMRRQVCMRDGKWSGDTPSC  453 (618)
Q Consensus       396 C~~p~~~~~g~~~~~~-~~~~~g~~~~~~C~~Gy~l~~~G~~~~~C~~~G~Ws~~~p~C  453 (618)
                      |+.|+.+.||.+.... ..|.+|++++|.|++||.|.  |...++|+.+|+|+...|.|
T Consensus         1 C~~p~~~~~g~~~~~~~~~~~~g~~~~~~C~~Gy~l~--g~~~~~C~~~g~Ws~~~p~C   57 (57)
T smart00032        1 CPPPPDIENGTITSSNKGTYSVGDTVTYSCNPGYTLI--GSSTITCLEDGTWSPPAPTC   57 (57)
T ss_pred             CcCCCCCCCCEEecCCCCeecCCCEEEEEcCCCCEEc--CCCeeEECCCCEECCCCCcC
Confidence            7788889999988765 67899999999999999999  99999999999999988877


No 41 
>PF00084 Sushi:  Sushi domain (SCR repeat);  InterPro: IPR000436 Sushi domains are also known as Complement control protein (CCP) modules, or short consensus repeats (SCR), exist in a wide variety of complement and adhesion proteins. The structure is known for this domain, it is based on a beta-sandwich arrangement; one face made up of three beta-strands hydrogen-bonded to form a triple-stranded region at its centre and the other face formed from two separate beta-strands []. CD21 (also called C3d receptor, CR2, Epstein Barr virus receptor or EBV-R) is the receptor for EBV and for C3d, C3dg and iC3b. Complement components may activate B cells through CD21. CD21 is part of a large signal-transduction complex that also involves CD19, CD81, and Leu13.  Some of the proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Complement decay-accelerating factor (Antigen CD55) belongs to the Cromer blood group system and is associated with Cr(a), Dr(a), Es(a), Tc(a/b/c), Wd(a), WES(a/b), IFC and UMC antigens. Complement receptor type 1 (C3b/C4b receptor) (Antigen CD35) belongs to the Knops blood group system and is associated with Kn(a/b), McC(a), Sl(a) and Yk(a) antigens. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; PDB: 1HFH_A 1HFI_A 1E5G_A 2UWN_A 2V8E_A 2XWB_H 3HS0_I 2OK5_A 2XWJ_J 3HRZ_D ....
Probab=98.88  E-value=1.6e-09  Score=81.32  Aligned_cols=56  Identities=23%  Similarity=0.667  Sum_probs=50.5

Q ss_pred             CCCCCccCCeeEEecCCCcccCCEEEEEeCCCceecCCCeEEeCCCCCcCCCCCcc
Q psy1807         278 CGKLEHIEHGTVTLETTRTTHGAVAIYACHENYTLIGETRRVCGDGGKWNGTEPQC  333 (618)
Q Consensus       278 C~~~~~~~ng~~~~~~~~~~~g~~~~~~C~~Gy~l~G~~~~~C~~~G~Ws~~~p~C  333 (618)
                      |+.|+.+.||.+......+.+|++++|+|++||++.|...++|+.+|+|++..|.|
T Consensus         1 C~~P~~~~~~~~~~~~~~~~~g~~~~~~C~~Gy~~~g~~~~~C~~~g~W~~~~p~C   56 (56)
T PF00084_consen    1 CPPPPPIPNGSISCTNNPYSYGSTVTFSCNPGYELSGSSTITCQSNGQWSPPPPTC   56 (56)
T ss_dssp             EESESCBTTEEEESSSSSEETTEEEEEEESTTEEEESSSEEEEETTSEESSTTSEE
T ss_pred             CcCCCCCCCcEEecCCCCccCCCEEEEcCCCCCEecCCCEEEECCCCEECCCCCCC
Confidence            56778899998777667899999999999999999999999999999999988877


No 42 
>PF05473 Herpes_UL45:  UL45 protein;  InterPro: IPR008646 This family consists several UL45 proteins and homologues found in the herpes simplex virus family. The herpes simplex virus UL45 gene encodes an 18 kDa virion envelope protein whose function remains unknown. It has been suggested that the 18 kDa UL45 gene product is required for efficient growth in the central nervous system at low doses and may play an important role under the conditions of a naturally acquired infection []. The Equine herpesvirus 1 UL45 protein represents a type II membrane glycoprotein which has found to be non-essential for EHV-1 growth in vitro but deletion reduces the viruses' replication efficiency [].
Probab=98.24  E-value=3.7e-06  Score=80.10  Aligned_cols=55  Identities=22%  Similarity=0.407  Sum_probs=45.5

Q ss_pred             CccccccccCccceeecceEEEEecCCCCCHHHHHHHHHHcCCeecccCChHHHHHHHHHh
Q psy1807          74 DRCASEKAASAEITSFDRNCYEFNIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGEL  134 (618)
Q Consensus        74 ~~Cp~~~~~~~~w~~~~~~CY~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~fl~~~l  134 (618)
                      ..||      ++|+.|++.||+|+....++|+||+..|++++++|+.+.+......+...+
T Consensus        83 ~~CP------~~Wi~~~~~Cy~~~~~~~~t~~eA~~~C~~~~s~L~~~~~~~~L~~ll~~~  137 (200)
T PF05473_consen   83 GPCP------KGWIGYNNSCYRFSNSPKKTWEEARNICAAYNSTLANVNNAKSLLELLDVL  137 (200)
T ss_pred             CCCC------ccceeeCCEEEEEeCCCCcCHHHHHHHHHhcCCcCCCchhHHHHHHHHHHh
Confidence            4788      999999999999976543499999999999999999987776665555544


No 43 
>PHA02672 ORF110 EEV glycoprotein; Provisional
Probab=96.42  E-value=0.0037  Score=54.59  Aligned_cols=83  Identities=11%  Similarity=0.111  Sum_probs=58.5

Q ss_pred             CCcCccccccccCccceeecceEEEEecCCCCCHHHHHHHHHHcCCeecccCChHHHHHHHHHhhhcccCCCCccEEEeE
Q psy1807          71 FSTDRCASEKAASAEITSFDRNCYEFNIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGA  150 (618)
Q Consensus        71 ~~~~~Cp~~~~~~~~w~~~~~~CY~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~fl~~~l~~~~~~~~~~~~WIGl  150 (618)
                      +-.+.|.      .||+.|.+.||.- +.-..+-..|...|+.+.|+|..|-|..-++=|..+       .....||+|-
T Consensus        45 f~~~AC~------~GWvp~dd~CyLn-T~~q~s~~~A~~iC~~~~a~lP~~pn~~hLKgVm~l-------t~~r~FW~th  110 (166)
T PHA02672         45 FLKEACE------NGWVPIKDLCVLN-THVITNVTLAHDICASMDGDPPATPNTMLLKGIMFL-------TGERSFWMTH  110 (166)
T ss_pred             hhhhhhh------cCCeeccCEEEEe-cceeecccHHHHHHHhccCCCCCCCCHhHhhheeeE-------ecccEEEEEc
Confidence            3344788      9999999999976 444588999999999999999998775533322121       1234699998


Q ss_pred             EeCCCCCcceeEEeCCcccc
Q psy1807         151 QKDPGITARTWKWVDGEVVT  170 (618)
Q Consensus       151 ~~~~~~~~~~w~W~DGs~~~  170 (618)
                      ++.-   ...+.=.+|..+.
T Consensus       111 h~~y---~~vy~~~~~~~~~  127 (166)
T PHA02672        111 HDAY---TSVYEPKRGSYLP  127 (166)
T ss_pred             cccc---eeeeecCCCCccc
Confidence            8653   4456555666554


No 44 
>KOG4646|consensus
Probab=95.54  E-value=0.0035  Score=54.60  Aligned_cols=41  Identities=27%  Similarity=0.208  Sum_probs=37.2

Q ss_pred             CcccceeeccCCchhhhhhcccCCCchhHHHHHHhhhhcccC
Q psy1807           1 KDNKEYILSVGGIKPVTSLLLNSNHEQTVLNSITTLMYLVTP   42 (618)
Q Consensus         1 ~~~~~~i~~~~~~~~~~~~~~~~~~~~~~l~~i~~l~~l~~~   42 (618)
                      |+|+++|+..+|+++|..+++ ++.|++|.++++.+|+|..+
T Consensus        89 ~~n~~~I~ea~g~plii~~ls-sp~e~tv~sa~~~l~~l~~~  129 (173)
T KOG4646|consen   89 KTNAKFIREALGLPLIIFVLS-SPPEITVHSAALFLQLLEFG  129 (173)
T ss_pred             hHHHHHHHHhcCCceEEeecC-CChHHHHHHHHHHHHHhcCc
Confidence            579999999999999999988 99999999999999888743


No 45 
>PHA03093 EEV glycoprotein; Provisional
Probab=95.52  E-value=0.027  Score=51.49  Aligned_cols=47  Identities=30%  Similarity=0.582  Sum_probs=37.3

Q ss_pred             eeecceEEEEecCCCCCHHHHHHHHHHcCCeecccCChHHHHHHHHHhhh
Q psy1807          87 TSFDRNCYEFNIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELER  136 (618)
Q Consensus        87 ~~~~~~CY~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~fl~~~l~~  136 (618)
                      +.|+++||.|+.+ +++|+||...|...|+.|.+. +..+ .+|..+|..
T Consensus       104 I~~~~~C~~~~~e-pkTf~dA~~~C~~~g~~LPs~-~l~~-~WL~dYLeg  150 (185)
T PHA03093        104 IVYDGSCYIFHSE-PKTFSDAKADCAKKSSTLPNS-NLMT-TWLSDYLED  150 (185)
T ss_pred             eecCCEeEEecCC-CcCHHHHHHHHHhcCCcCCCc-chHH-HHHHHHhhc
Confidence            5889999999865 699999999999999999983 3222 377766644


No 46 
>PHA02673 ORF109 EEV glycoprotein; Provisional
Probab=95.39  E-value=0.02  Score=51.04  Aligned_cols=54  Identities=15%  Similarity=0.293  Sum_probs=40.3

Q ss_pred             cCccccccccCccceeecceEEEEecCCCCCHHHHHHHHHHcCCeecccCChHHHHHHHHHhhh
Q psy1807          73 TDRCASEKAASAEITSFDRNCYEFNIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELER  136 (618)
Q Consensus        73 ~~~Cp~~~~~~~~w~~~~~~CY~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~fl~~~l~~  136 (618)
                      ...|.      .  +.|+++||.++ . +.+|+||...|...|+.|.+..-.+...+|..+|..
T Consensus        74 ~~tC~------G--I~~~~~Cltl~-~-p~tf~eAn~~C~~~g~~LPs~~l~~~~pWL~dYL~g  127 (161)
T PHA02673         74 NRTCD------G--INAGNKCLTLK-Y-PDTWTNANERCKELGQRLPSPSINKKYPWLTDYLVG  127 (161)
T ss_pred             cCcCC------C--cccCCeeEEeC-C-CCcHHHHHHHHHhcCCcCCCCcccccCcHHHHHhhc
Confidence            45787      3  57899999997 4 589999999999999999985433333455555543


No 47 
>PF05966 Chordopox_A33R:  Chordopoxvirus A33R protein;  InterPro: IPR009238 This family consists of several Chordopoxvirus A33R proteins. A33R plays a role in promoting Ab-resistant cell-to-cell spread of virus [] and interacts with A36R to incorporate the protein into the outer membrane of intracellular enveloped virions (IEV) [].; PDB: 3K7B_A.
Probab=94.08  E-value=0.03  Score=51.75  Aligned_cols=33  Identities=27%  Similarity=0.631  Sum_probs=26.3

Q ss_pred             eeecceEEEEecCCCCCHHHHHHHHHHcCCeecc
Q psy1807          87 TSFDRNCYEFNIDKGASFADARSKCQASGGDLVH  120 (618)
Q Consensus        87 ~~~~~~CY~~~~~~~~tw~~A~~~C~~~g~~L~~  120 (618)
                      +.|+++||.++.+ +++|+||...|...|+.|.+
T Consensus       108 I~~~n~Cl~~~~~-p~T~~~A~~~C~~~g~~LPs  140 (190)
T PF05966_consen  108 IVYDNKCLTLNYE-PKTFDEANSDCNNKGQTLPS  140 (190)
T ss_dssp             EEETTEEEEEEEE-EEEHHHHHHHHHHTT-B---
T ss_pred             cccCCEEEEecCC-CCCHHHHHHHHHhcCCcCCC
Confidence            7789999999765 58999999999999999997


No 48 
>PF12191 stn_TNFRSF12A:  Tumour necrosis factor receptor stn_TNFRSF12A_TNFR domain;  InterPro: IPR022316 The tumour necrosis factor (TNF) receptor (TNFR) superfamily comprises more than 20 type-I transmembrane proteins. Family members are defined based on similarity in their extracellular domain - a region that contains many cysteine residues arranged in a specific repetitive pattern []. The cysteines allow formation of an extended rod-like structure, responsible for ligand binding []. Upon receptor activation, different intracellular signalling complexes are assembled for different members of the TNFR superfamily, depending on their intracellular domains and sequences []. Activation of TNFRs can therefore induce a range of disparate effects, including cell proliferation, differentiation, survival, or apoptotic cell death, depending upon the receptor involved []. TNFRs are widely distributed and play important roles in many crucial biological processes, such as lymphoid and neuronal development, innate and adaptive immunity, and maintenance of cellular homeostasis []. Drugs that manipulate their signalling have potential roles in the prevention and treatment of many diseases, such as viral infections, coronary heart disease, transplant rejection, and immune disease []. TNF receptor 12 (also known as TWEAK receptor, and fibroblast growth factor-inducible-14 (Fn14)) has been implicated in endothelial cell growth and migration []. The receptor may also play a role in cell-matrix interactions [].; PDB: 2KN0_A 2RPJ_A 2KMZ_A 2EQP_A.
Probab=91.97  E-value=0.074  Score=45.54  Aligned_cols=19  Identities=21%  Similarity=0.524  Sum_probs=13.8

Q ss_pred             eeEeCcCCcccCCCCceEE
Q psy1807         495 SLKCTRQGDWDNMSPQFIS  513 (618)
Q Consensus       495 ~~~C~~~G~Ws~~~P~C~~  513 (618)
                      +..|...-.|+...-.|.+
T Consensus        33 ~~~Cp~G~sWs~dLdkCm~   51 (129)
T PF12191_consen   33 TTPCPRGSSWSADLDKCMS   51 (129)
T ss_dssp             CSSS-SSEEEETTTTEEEE
T ss_pred             CCCCCCCCcCccccccccc
Confidence            4567766679988888887


No 49 
>cd03519 Link_domain_HAPLN_module_2 Link_domain_HAPLN_module_2; this link domain is found in the second link module of proteins similar to the vertebrate HAPLN (hyaluronan/HA and proteoglycan binding link) protein family which includes cartilage link protein. The link domain is a HA-binding domain. HAPLNs contain two contiguous link modules. Both link modules of cartilage link protein are involved in interaction with HA. In cartilage, a chondroitin sulfate proteoglycan core protein (CSPG) aggrecan forms cartilage link protein stabilized aggregates with HA. These aggregates contribute to the tissue's load bearing properties. Aggregates with other CSPGs substituting for aggregan may contribute to the structural integrity of many different tissues. Members of the vertebrate HAPLN gene family are physically linked adjacent to CSPG genes.
Probab=90.37  E-value=0.28  Score=39.94  Aligned_cols=23  Identities=22%  Similarity=0.459  Sum_probs=20.7

Q ss_pred             CCCCCHHHHHHHHHHcCCeeccc
Q psy1807          99 DKGASFADARSKCQASGGDLVHG  121 (618)
Q Consensus        99 ~~~~tw~~A~~~C~~~g~~L~~i  121 (618)
                      ..+++|+||++.|+..|++||+.
T Consensus         8 ~~~l~f~eA~~aC~~~ga~lAs~   30 (91)
T cd03519           8 PGKLTFSEAVAACQRDGAQIAKV   30 (91)
T ss_pred             ccccCHHHHHHHHHHcCCEeCCH
Confidence            34699999999999999999985


No 50 
>cd03520 Link_domain_CSPGs_modules_2_4 Link_domain_CSPGs_modules_2_4; this link domain is found in the second and fourth link modules of the chondroitin sulfate proteoglycan core protein (CSPG) aggrecan and, in the second link module of three other CSPGs: versican, neurocan, and brevican. The link domain is a hyaluronan (HA)-binding domain. CSPGs are characterized by an N-terminal globular domain (G1 domain) containing two contiguous link modules (modules 1 and 2). Both link modules of the G1 domain of aggrecan are involved in interaction with HA. Aggrecan in addition contains a second globular domain (G2) having link modules 3 and 4 which lack HA-binding activity. In cartilage, aggrecan forms cartilage link protein stabilized aggregates with HA. These aggregates contribute to the tissue's load bearing properties. Aggregates having other CSPGs substituting for aggregan may contribute to the structural integrity of many different tissues. Members of the vertebrate HPLN (hyaluronan/HA and
Probab=88.19  E-value=0.48  Score=39.09  Aligned_cols=23  Identities=26%  Similarity=0.528  Sum_probs=20.7

Q ss_pred             CCCCCHHHHHHHHHHcCCeeccc
Q psy1807          99 DKGASFADARSKCQASGGDLVHG  121 (618)
Q Consensus        99 ~~~~tw~~A~~~C~~~g~~L~~i  121 (618)
                      ..+++++||++.|++.|++||+.
T Consensus         8 ~~~l~f~eA~~aC~~~ga~lAs~   30 (96)
T cd03520           8 PEKFTFQEARAECRSLGAVLATT   30 (96)
T ss_pred             CCCcCHHHHHHHHHHcCCEeCCH
Confidence            34689999999999999999984


No 51 
>KOG4564|consensus
Probab=86.87  E-value=0.77  Score=49.42  Aligned_cols=35  Identities=20%  Similarity=0.497  Sum_probs=25.4

Q ss_pred             CCcEEEEEcCCC---C--eeeCCceeEeCcCCcccCCCCc
Q psy1807         476 VHSTLEFHCEPG---H--LLVGQASLKCTRQGDWDNMSPQ  510 (618)
Q Consensus       476 ~g~~v~~~C~~G---y--~l~G~~~~~C~~~G~Ws~~~P~  510 (618)
                      .|+.+...|.+.   +  .-.|.....|.+||+|+..+|.
T Consensus        79 ~G~~v~~~CP~yf~~~~~~~~~~v~R~C~~~G~W~~~~~~  118 (473)
T KOG4564|consen   79 AGTLVTVPCPDYFPGFSYSTSGNVTRNCTSNGTWSERPPS  118 (473)
T ss_pred             CCceEEecCccccCCCcccCCCceEeecCCCCccCCCCCc
Confidence            356677777753   2  2457778999999999987664


No 52 
>PTZ00382 Variant-specific surface protein (VSP); Provisional
Probab=85.63  E-value=0.49  Score=39.48  Aligned_cols=11  Identities=27%  Similarity=0.549  Sum_probs=8.4

Q ss_pred             EcCCCCeeeCC
Q psy1807         483 HCEPGHLLVGQ  493 (618)
Q Consensus       483 ~C~~Gy~l~G~  493 (618)
                      .|..||.+.+.
T Consensus        41 ~C~~GY~~~~~   51 (96)
T PTZ00382         41 ECNSGFSLDNG   51 (96)
T ss_pred             cCcCCcccCCC
Confidence            58999998643


No 53 
>cd01102 Link_Domain The link domain is a hyaluronan (HA)-binding domain. It functions to mediate adhesive interactions during inflammatory leukocyte homing and tumor metastasis. It is found in the CD44 receptor and in human TSG-6. TSG-6 is the protein product of the tumor necrosis factor-stimulated gene-6. TSG-6 has a strong anti-inflammatory effect in models of acute inflammation and autoimmune arthritis and plays an essential role in female fertility. This group also contains the link domains of the chondroitin sulfate proteoglycan core proteins (CSPG) including aggrecan, versican, neurocan, and brevican and the link domains of the vertebrate HAPLN (HA and proteoglycan binding link) protein family. In cartilage, aggrecan forms cartilage link protein stabilized aggregates with HA. These aggregates contribute to the tissue's load bearing properties. Aggregates in which other CSPGs substitute for aggregan might contribute to the structural integrity of many different tissues. Members of
Probab=84.91  E-value=0.68  Score=38.08  Aligned_cols=21  Identities=33%  Similarity=0.564  Sum_probs=19.7

Q ss_pred             CCCHHHHHHHHHHcCCeeccc
Q psy1807         101 GASFADARSKCQASGGDLVHG  121 (618)
Q Consensus       101 ~~tw~~A~~~C~~~g~~L~~i  121 (618)
                      .+++.||++.|+++|++||+.
T Consensus        13 ~l~f~eA~~aC~~~ga~lAs~   33 (92)
T cd01102          13 KLTFAEAALACKARGAHLATP   33 (92)
T ss_pred             ccCHHHHHHHHHHcCCEeCCH
Confidence            689999999999999999985


No 54 
>smart00445 LINK Link (Hyaluronan-binding).
Probab=84.62  E-value=0.72  Score=38.07  Aligned_cols=21  Identities=38%  Similarity=0.657  Sum_probs=19.7

Q ss_pred             CCCHHHHHHHHHHcCCeeccc
Q psy1807         101 GASFADARSKCQASGGDLVHG  121 (618)
Q Consensus       101 ~~tw~~A~~~C~~~g~~L~~i  121 (618)
                      .++++||++.|+++|++||+.
T Consensus        14 ~l~f~eA~~aC~~~ga~lAs~   34 (94)
T smart00445       14 KLTFAEAREACRAQGATLATV   34 (94)
T ss_pred             ccCHHHHHHHHHHcCCEeCCH
Confidence            589999999999999999985


No 55 
>cd03518 Link_domain_HAPLN_module_1 Link_domain_HAPLN_module_1; this link domain is found in the first link module of proteins similar to the vertebrate HAPLN (hyaluronan/HA and proteoglycan binding link) protein family which includes cartilage link protein. The link domain is a HA-binding domain. HAPLNs contain two contiguous link modules. Both link modules of cartilage link protein are involved in interaction with HA. In cartilage, a chondroitin sulfate proteoglycan core protein (CSPG) aggrecan forms cartilage link protein stabilized aggregates with HA. These aggregates contribute to the tissue's load bearing properties. Aggregates with other CSPGs substituting for aggregan may contribute to the structural integrity of many different tissues. Members of the vertebrate HAPLN gene family are physically linked adjacent to CSPG genes.
Probab=81.93  E-value=1  Score=37.14  Aligned_cols=21  Identities=19%  Similarity=0.404  Sum_probs=19.5

Q ss_pred             CCCHHHHHHHHHHcCCeeccc
Q psy1807         101 GASFADARSKCQASGGDLVHG  121 (618)
Q Consensus       101 ~~tw~~A~~~C~~~g~~L~~i  121 (618)
                      .+++.||++.|+.+|++||+.
T Consensus        13 ~l~f~eA~~aC~~~ga~lAs~   33 (95)
T cd03518          13 NLNFHEAQQACEEQDATLASF   33 (95)
T ss_pred             ccCHHHHHHHHHHcCCeeCCH
Confidence            579999999999999999985


No 56 
>cd03515 Link_domain_TSG_6_like This is the extracellular link domain of the type found in human TSG-6. The link domain is a hyaluronan (HA)-binding domain. TSG-6 is the protein product of tumor necrosis factor-stimulated gene-6. TSG-6 is up-regulated in inflammatory lesions and in the ovary during ovulation. It has a strong anti-inflammatory and chondroprotective effect in models of acute inflammation and autoimmune arthritis and plays an essential role in female fertility. Also included in this group are the stabilins: stabilin-1 (FEEL-1, CLEVER-1) and stabilin-2 (FEEL-2). Stabilin-2 functions as the major liver and lymph node-scavenging receptor for HA and related glycosaminoglycans. Stabilin-2 is a scavenger receptor with a broad range of ligands including advanced glycation end (AGE) products, acetylated low density lipoprotein and procollagen peptides. In contrast, stabilin-1 does not bind HA, but binds acetylated low density lipoprotein and AGEs with lower affinity. As AGEs accum
Probab=81.66  E-value=0.99  Score=37.09  Aligned_cols=21  Identities=24%  Similarity=0.580  Sum_probs=19.6

Q ss_pred             CCCHHHHHHHHHHcCCeeccc
Q psy1807         101 GASFADARSKCQASGGDLVHG  121 (618)
Q Consensus       101 ~~tw~~A~~~C~~~g~~L~~i  121 (618)
                      .+++.||++.|+++|++||+.
T Consensus        13 ~l~f~eA~~aC~~~ga~lAs~   33 (93)
T cd03515          13 KLTYTEAKAACEAEGAHLATY   33 (93)
T ss_pred             ccCHHHHHHHHHHcCCccCCH
Confidence            589999999999999999985


No 57 
>PF14704 DERM:  Dermatopontin
Probab=81.60  E-value=4.8  Score=36.75  Aligned_cols=138  Identities=16%  Similarity=0.329  Sum_probs=79.6

Q ss_pred             cCCEEEEEcCCCceecCCceeEec---CCcccCCCCceeeecCCCCCc-cCCeeEEecCCCcccCCEEEEEeCCCceecC
Q psy1807         239 LGSTISYACPEGNMLVGSATRTCK---EGFWTGVAPTCQYVDCGKLEH-IEHGTVTLETTRTTHGAVAIYACHENYTLIG  314 (618)
Q Consensus       239 ~G~~~~y~C~~Gy~l~g~~~~~C~---~g~Ws~~~p~C~~~~C~~~~~-~~ng~~~~~~~~~~~g~~~~~~C~~Gy~l~G  314 (618)
                      ++..+.|+|+.|..|.+-.+....   +..|.   -.|..+     +. +..+.-........++....|+|..|+.|.|
T Consensus         3 ~d~~f~f~Cp~gq~i~~v~S~~~n~~eDR~W~---F~C~~~-----~~~~~~~~~~~~~~vN~~d~~~~~tCp~n~~i~G   74 (152)
T PF14704_consen    3 YDQPFSFQCPSGQGISGVRSYHSNNREDRRWS---FGCSPV-----GSGPTQGSSCWWSYVNNYDQPFNFTCPSNYYITG   74 (152)
T ss_pred             CCCCEEEECCCCCEEEEEEEEeCCCCCcccEE---EEEEeC-----CCCCCccceEcchhhhccCCcEEEECCCCCEEEE
Confidence            567789999999999888777776   66886   235443     22 1111111111345689999999999999988


Q ss_pred             CCeEEeC--CCCCcCCCCCcccccccCCCCCccccE-EEecccccCceeeecCCCCcEEcCCceE--EEccCCeecCCCC
Q psy1807         315 ETRRVCG--DGGKWNGTEPQCLFDWCAEPPQISGGI-VTTSGRRTGSVATYSCEPGFILFGSNEL--KCGLGGDWSGKAP  389 (618)
Q Consensus       315 ~~~~~C~--~~G~Ws~~~p~C~~~~C~~p~~~~~g~-~~~~~~~~g~~~~~~C~~Gy~l~G~~~~--~C~~~G~Ws~~~p  389 (618)
                      -.+..=.  +|-.|.   ..|....   ...+.+=. .+.-...+...+.|.-..|+.|.|-.+.  .-..|-.|.  .-
T Consensus        75 ~~S~h~n~~eDR~w~---f~C~~~~---~~~~~~C~~~t~~~N~~d~~~~~~~~~~~~i~G~~S~~~~~~~DR~wk--~~  146 (152)
T PF14704_consen   75 MQSYHDNSKEDRRWQ---FQCCRIN---GATLYNCYWTTGYVNNYDGPLDFTVPSGYVITGVSSYHDNSNEDRRWK--FR  146 (152)
T ss_pred             EEEEecCCcccCceE---EEEEecC---CceeecceecccccccCCCcEEEEcCCCcEEEeEeeecCCCCcCceEE--EE
Confidence            6544322  345563   2332210   00011100 0001124677788888899999885432  234566775  34


Q ss_pred             cee
Q psy1807         390 TCK  392 (618)
Q Consensus       390 ~C~  392 (618)
                      .|+
T Consensus       147 ~C~  149 (152)
T PF14704_consen  147 TCR  149 (152)
T ss_pred             EEe
Confidence            565


No 58 
>PF00193 Xlink:  Extracellular link domain;  InterPro: IPR000538 The link domain [] is a hyaluronan(HA)-binding region found in proteins of vertebrates that are involved in the assembly of extracellular matrix, cell adhesion, and migration. The structure has been shown [] to consist of two alpha helices and two antiparallel beta sheets arranged around a large hydrophobic core similar to that of C-type lectin. This domain contains four conserved cysteines involved in two disulphide bonds. The link domain has also been termed HABM [] (HA binding module) and PTR [] (proteoglycan tandem repeat). Proteins with such a domain include the proteoglycans aggrecan, brevican, neurocan and versican, which are expressed in the CNS; the cartilage link protein (LP), a proteoglycan that together with HA and aggrecan forms multimolecular aggregates; Tumour necrosis factor-inducible protein TSG-6, which may be involved in cell-cell and cell-matrix interactions during inflammation and tumourgenesis; and CD44 antigen, the main cell surface receptor for HA.; GO: 0005540 hyaluronic acid binding, 0007155 cell adhesion; PDB: 1O7B_T 2PF5_C 1O7C_T 2JCQ_A 2JCR_A 2JCP_A 1UUH_B 1POZ_A 2I83_A.
Probab=80.91  E-value=1.6  Score=36.15  Aligned_cols=21  Identities=29%  Similarity=0.566  Sum_probs=16.8

Q ss_pred             CCCHHHHHHHHHHcCCeeccc
Q psy1807         101 GASFADARSKCQASGGDLVHG  121 (618)
Q Consensus       101 ~~tw~~A~~~C~~~g~~L~~i  121 (618)
                      .+++.+|++.|++.|+.||+.
T Consensus        13 ~l~f~eA~~~C~~~ga~LAs~   33 (92)
T PF00193_consen   13 KLTFTEAQQACRALGARLASP   33 (92)
T ss_dssp             SB-HHHHHHHHHHTTCBE--H
T ss_pred             cCcHHHHHHHHHHcCCeeCCH
Confidence            579999999999999999984


No 59 
>PF07204 Orthoreo_P10:  Orthoreovirus membrane fusion protein p10;  InterPro: IPR009854 This family consists of several Orthoreovirus membrane fusion protein p10 sequences. p10 is thought to be a multifunctional protein that plays a key role in virus-host interaction [].
Probab=80.23  E-value=0.88  Score=36.84  Aligned_cols=26  Identities=27%  Similarity=0.594  Sum_probs=12.9

Q ss_pred             ehHHHHHHHHHHHHHHHHHHHhcCCC
Q psy1807         537 VGAGVVLFLLILLAIIYARLRKATPV  562 (618)
Q Consensus       537 ~~~~~~l~l~vl~~~~~~~~r~k~~~  562 (618)
                      .+++++|+++++++++..|.|+|...
T Consensus        47 ~GGG~iLilIii~Lv~CC~~K~K~~~   72 (98)
T PF07204_consen   47 AGGGLILILIIIALVCCCRAKHKTSA   72 (98)
T ss_pred             ccchhhhHHHHHHHHHHhhhhhhhHh
Confidence            34444444445444555565555443


No 60 
>cd03521 Link_domain_KIAA0527_like Link_domain_KIAA0527_like; this domain is found in the human protein KIAA0527. Sequence-wise, it is highly similar to the link domain. The link domain is a hyaluronan-binding (HA) domain. KIAA0527 contains a single link module. The KIAA0527 gene was originally cloned from human brain tissue.
Probab=79.94  E-value=1.4  Score=35.66  Aligned_cols=21  Identities=29%  Similarity=0.569  Sum_probs=19.6

Q ss_pred             CCCHHHHHHHHHHcCCeeccc
Q psy1807         101 GASFADARSKCQASGGDLVHG  121 (618)
Q Consensus       101 ~~tw~~A~~~C~~~g~~L~~i  121 (618)
                      .++++||++.|+++|++||+.
T Consensus        13 ~l~f~eA~~AC~~~gA~lAs~   33 (95)
T cd03521          13 GLGLRAARQSCASLGARLASA   33 (95)
T ss_pred             ccCHHHHHHHHHHcCCEeccH
Confidence            579999999999999999984


No 61 
>KOG4289|consensus
Probab=78.62  E-value=7.4  Score=46.72  Aligned_cols=154  Identities=26%  Similarity=0.545  Sum_probs=88.0

Q ss_pred             cCCEEEEEcCCCceecCC-----ceeEecCCcccCCCCceeeecCCCCC------ccCCeeEEecCCCccc-CCEEEEEe
Q psy1807         239 LGSTISYACPEGNMLVGS-----ATRTCKEGFWTGVAPTCQYVDCGKLE------HIEHGTVTLETTRTTH-GAVAIYAC  306 (618)
Q Consensus       239 ~G~~~~y~C~~Gy~l~g~-----~~~~C~~g~Ws~~~p~C~~~~C~~~~------~~~ng~~~~~~~~~~~-g~~~~~~C  306 (618)
                      .|-+|  .|++||.  |.     ....|-.|=|.  .|+|.+-.|..-.      .-.+|...|....|.. |+  -..|
T Consensus      1737 ~GY~C--~C~~g~~--G~~Ce~~~dq~CPrGWWG--~P~CgpC~CavsKgfdp~CnKt~G~CqCKe~hy~~~~~--Cl~C 1808 (2531)
T KOG4289|consen 1737 HGYTC--ECPPGYT--GPYCELRADQPCPRGWWG--FPTCGPCNCAVSKGFDPDCNKTNGQCQCKENHYRPIGS--CLPC 1808 (2531)
T ss_pred             CceeE--ECCCccc--CcchhhhccCCCCCcccC--CCCccCccccccCCCCCCccccCcceeeccccccCCCc--ceee
Confidence            44444  4889984  33     23456677775  7899876665321      2356777776655543 44  3335


Q ss_pred             CCCceecCCCeEEeCCCCCcCCCCCcccccccCCCCCccccEEEecccccCceeeecCCCCcEEcCCceEEEccCCeecC
Q psy1807         307 HENYTLIGETRRVCGDGGKWNGTEPQCLFDWCAEPPQISGGIVTTSGRRTGSVATYSCEPGFILFGSNELKCGLGGDWSG  386 (618)
Q Consensus       307 ~~Gy~l~G~~~~~C~~~G~Ws~~~p~C~~~~C~~p~~~~~g~~~~~~~~~g~~~~~~C~~Gy~l~G~~~~~C~~~G~Ws~  386 (618)
                      +-+   .|+.++.|.++|+-          .|. |     |.+       |-+| =+|+.-|.               ..
T Consensus      1809 dC~---~Gs~Sr~C~adGqC----------~C~-p-----gai-------GRqC-drCd~pfa---------------ev 1846 (2531)
T KOG4289|consen 1809 DCY---FGSDSRECDADGQC----------PCK-P-----GAI-------GRQC-DRCDNPFA---------------EV 1846 (2531)
T ss_pred             ccc---cCCCcccccCCCcC----------CCC-C-----ccc-------cccc-cccCChhh---------------hc
Confidence            444   47888888877731          121 1     000       1100 02332221               11


Q ss_pred             CCCceeeecCCCCCccCCCceEeecCcccccceEEEEcCCCceeccCCcceEEeccCCeeeCC
Q psy1807         387 KAPTCKYVDCGSPANIDNGHYSLLNGTTTHGSVAEYTCDDDYWLEAGAMRRQVCMRDGKWSGD  449 (618)
Q Consensus       387 ~~p~C~~~~C~~p~~~~~g~~~~~~~~~~~g~~~~~~C~~Gy~l~~~G~~~~~C~~~G~Ws~~  449 (618)
                      ....|+.+.=.-|..++.|....   ..++|+.+...|..|-.    |...+.|...|.|..+
T Consensus      1847 ttlgCrv~y~~CPk~ie~giwWP---~tkfG~~a~~pCP~G~~----G~AvR~C~e~~gWl~p 1902 (2531)
T KOG4289|consen 1847 TTLGCRVVYDKCPKAIEAGIWWP---QTKFGSPAAVPCPKGSS----GLAVRHCDETGGWLPP 1902 (2531)
T ss_pred             cccCcEEccccCcHHHhhccccc---ccccCCcccccCCCCcc----hhhhhhhhccCCCCCc
Confidence            23345544322345566665543   24789999999999943    8899999999999864


No 62 
>cd03517 Link_domain_CSPGs_modules_1_3 Link_domain_CSPGs_modules_1_3; this extracellular link domain is found in the first and third link modules of the chondroitin sulfate proteoglycan core protein (CSPG) aggrecan. In addition, it is found in the first link module of three other CSPGs: versican, neurocan, and brevican. The link domain is a hyaluronan (HA)-binding domain. CSPGs are characterized by an N-terminal globular domain (G1 domain) containing two contiguous link modules (modules 1 and 2). Both link modules of the G1 domain of aggrecan are involved in interaction with HA. In addition, aggrecan contains a second globular domain (G2) which contains link modules 3 and 4. G2 appears to lack HA-binding activity. In cartilage, aggrecan forms cartilage link protein stabilized aggregates with HA. These aggregates contribute to the tissue's load bearing properties. Aggregates having other CSPGs substituting for aggrecan may contribute to the structural integrity of many different tissues.
Probab=77.64  E-value=1.5  Score=36.30  Aligned_cols=21  Identities=14%  Similarity=0.346  Sum_probs=19.4

Q ss_pred             CCCHHHHHHHHHHcCCeeccc
Q psy1807         101 GASFADARSKCQASGGDLVHG  121 (618)
Q Consensus       101 ~~tw~~A~~~C~~~g~~L~~i  121 (618)
                      .+++.||++.|+++|+.||+.
T Consensus        13 ~l~f~eA~~aC~~~ga~lAs~   33 (95)
T cd03517          13 ALTFPRAQRACLDISAQIATP   33 (95)
T ss_pred             eECHHHHHHHHHHcCCEeCCH
Confidence            478999999999999999985


No 63 
>PF01034 Syndecan:  Syndecan domain;  InterPro: IPR001050 The syndecans are transmembrane proteoglycans which are involved in the organisation of cytoskeleton and/or actin microfilaments, and have important roles as cell surface receptors during cell-cell and/or cell-matrix interactions [, ]. Structurally, these proteins consist of four separate domains:   A signal sequence; An extracellular domain (ectodomain) of variable length whose sequence is not evolutionary conserved in the various forms of syndecans. The ectodomain contains the sites of attachment of the heparan sulphate glycosaminoglycan side chains;  A transmembrane region;  A highly conserved cytoplasmic domain of about 30 to 35 residues, which could interact with cytoskeletal proteins.    The proteins known to belong to this family are:    Syndecan 1.  Syndecan 2 or fibroglycan.  Syndecan 3 or neuroglycan or N-syndecan.  Syndecan 4 or amphiglycan or ryudocan.  Drosophila syndecan.   Caenorhabditis elegans probable syndecan (F57C7.3).    Syndecan-4, a transmembrane heparan sulphate proteoglycan, is a coreceptor with integrins in cell adhesion. It has been suggested to form a ternary signalling complex with protein kinase Calpha and phosphatidylinositol 4,5-bisphosphate (PIP2). Structural studies have demonstrated that the cytoplasmic domain undergoes a conformational transition and forms a symmetric dimer in the presence of phospholipid activator PIP2, and whose overall structure in solution exhibits a twisted clamp shape having a cavity in the centre of dimeric interface. In addition, it has been observed that the syndecan-4 variable domain interacts, strongly, not only with fatty acyl groups but also the anionic head group of PIP2. These findings indicate that PIP2 promotes oligomerisation of the syndecan-4 cytoplasmic domain for transmembrane signalling and cell-matrix adhesion [, ].; GO: 0008092 cytoskeletal protein binding, 0016020 membrane; PDB: 1EJQ_B 1EJP_B 1YBO_C 1OBY_Q.
Probab=74.82  E-value=0.97  Score=34.06  Aligned_cols=23  Identities=26%  Similarity=0.488  Sum_probs=1.3

Q ss_pred             HHHHHHHHHHHHHHHHhcCCCCC
Q psy1807         542 VLFLLILLAIIYARLRKATPVKN  564 (618)
Q Consensus       542 ~l~l~vl~~~~~~~~r~k~~~~~  564 (618)
                      ++++++|++++..|+|||.+...
T Consensus        22 ll~ailLIlf~iyR~rkkdEGSY   44 (64)
T PF01034_consen   22 LLFAILLILFLIYRMRKKDEGSY   44 (64)
T ss_dssp             ---------------S------S
T ss_pred             HHHHHHHHHHHHHHHHhcCCCCc
Confidence            33444555555567777777654


No 64 
>smart00185 ARM Armadillo/beta-catenin-like repeats. Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.
Probab=74.29  E-value=1.3  Score=29.79  Aligned_cols=38  Identities=26%  Similarity=0.421  Sum_probs=32.1

Q ss_pred             cccceeeccCCchhhhhhcccCCCchhHHHHHHhhhhcc
Q psy1807           2 DNKEYILSVGGIKPVTSLLLNSNHEQTVLNSITTLMYLV   40 (618)
Q Consensus         2 ~~~~~i~~~~~~~~~~~~~~~~~~~~~~l~~i~~l~~l~   40 (618)
                      .++++|+..++++.+..+|. +++++++..++.+|.+|.
T Consensus         3 ~~~~~i~~~g~i~~L~~ll~-~~~~~i~~~a~~aL~nl~   40 (41)
T smart00185        3 EQKQAVVDAGGLPALVELLK-SEDEEVVKEAAWALSNLS   40 (41)
T ss_pred             HHHHHHHHCCCHHHHHHHHc-CCCHHHHHHHHHHHHHHc
Confidence            46788888999999999988 778889999988887764


No 65 
>PF15050 SCIMP:  SCIMP protein
Probab=73.74  E-value=1.2  Score=37.76  Aligned_cols=14  Identities=29%  Similarity=0.634  Sum_probs=6.6

Q ss_pred             HHHHHHHHHHHHHH
Q psy1807         541 VVLFLLILLAIIYA  554 (618)
Q Consensus       541 ~~l~l~vl~~~~~~  554 (618)
                      ++++-++|.+++|+
T Consensus        16 II~vS~~lglIlyC   29 (133)
T PF15050_consen   16 IILVSVVLGLILYC   29 (133)
T ss_pred             HHHHHHHHHHHHHH
Confidence            33333455555553


No 66 
>PF02009 Rifin_STEVOR:  Rifin/stevor family;  InterPro: IPR002858 Malaria is still a major cause of mortality in many areas of the world. Plasmodium falciparum causes the most severe human form of the disease and is responsible for most fatalities. Severe cases of malaria can occur when the parasite invades and then proliferates within red blood cell erythrocytes. The parasite produces many variant antigenic proteins, encoded by multigene families, which are present on the surface of the infected erythrocyte and play important roles in virulence. A crucial survival mechanism for the malaria parasite is its ability to evade the immune response by switching these variant surface antigens. The high virulence of P. falciparum relative to other malarial parasites is in large part due to the fact that in this organism many of these surface antigens mediate the binding of infected erythrocytes to the vascular endothelium (cytoadherence) and non-infected erythrocytes (rosetting). This can lead to the accumulation of infected cells in the vasculature of a variety of organs, blocking the blood flow and reducing the oxygen supply. Clinical symptoms of severe infection can include fever, progressive anaemia, multi-organ dysfunction and coma. For more information see []. Several multicopy gene families have been described in Plasmodium falciparum, including the stevor family of subtelomeric open reading frames and the rif interspersed repetitive elements. Both families contain three predicted transmembrane segments. It has been proposed that stevor and rif are members of a larger superfamily that code for variant surface antigens [].
Probab=72.30  E-value=1.4  Score=44.87  Aligned_cols=18  Identities=22%  Similarity=0.301  Sum_probs=9.2

Q ss_pred             HHHHHHHHHHHhcCCCCC
Q psy1807         547 ILLAIIYARLRKATPVKN  564 (618)
Q Consensus       547 vl~~~~~~~~r~k~~~~~  564 (618)
                      +++.+|+|.||||+-+|+
T Consensus       273 vIIYLILRYRRKKKmkKK  290 (299)
T PF02009_consen  273 VIIYLILRYRRKKKMKKK  290 (299)
T ss_pred             HHHHHHHHHHHHhhhhHH
Confidence            344455556665555443


No 67 
>PF14575 EphA2_TM:  Ephrin type-A receptor 2 transmembrane domain; PDB: 3KUL_A 2XVD_A 2VX1_A 2VWV_A 2VX0_A 2VWY_A 2VWZ_A 2VWW_A 2VWU_A 2VWX_A ....
Probab=70.85  E-value=1.5  Score=34.69  Aligned_cols=15  Identities=33%  Similarity=0.705  Sum_probs=13.6

Q ss_pred             hhccCChhhhhhhhc
Q psy1807         601 ENTYDNPYETALAFA  615 (618)
Q Consensus       601 ~~~~~~~~~~~~~~~  615 (618)
                      -.||..|...-.|||
T Consensus        56 P~TYEDP~qAV~eFA   70 (75)
T PF14575_consen   56 PHTYEDPNQAVREFA   70 (75)
T ss_dssp             GGGSSSHHHHHHHCS
T ss_pred             cccccCHHHHHHHHH
Confidence            689999999999997


No 68 
>TIGR01478 STEVOR variant surface antigen, stevor family. This model represents the stevor branch of the rifin/stevor family (pfam02009) of predicted variant surface antigens as found in Plasmodium falciparum. This model is based on a set of stevor sequences kindly provided by Matt Berriman from the Sanger Center. This is a global model and assesses a penalty for incomplete sequence. Additional fragmentary sequences may be found with the fragment model and a cutoff of 8 bits.
Probab=70.22  E-value=2.8  Score=41.54  Aligned_cols=32  Identities=22%  Similarity=0.352  Sum_probs=14.9

Q ss_pred             eEEeehHHHHHHHHHHHHHH--HHHHHhcCCCCC
Q psy1807         533 YMISVGAGVVLFLLILLAII--YARLRKATPVKN  564 (618)
Q Consensus       533 ~~~~~~~~~~l~l~vl~~~~--~~~~r~k~~~~~  564 (618)
                      +-.|+++.|+|+|.|+|+++  ..+||||..||.
T Consensus       259 ~Pcgiaalvllil~vvliiLYiWlyrrRK~swkh  292 (295)
T TIGR01478       259 LPYGIAALVLIILTVVLIILYIWLYRRRKKSWKH  292 (295)
T ss_pred             cccHHHHHHHHHHHHHHHHHHHHHHHhhcccccc
Confidence            33343444444443333333  236666766654


No 69 
>cd03516 Link_domain_CD44_like This domain is a hyaluronan (HA)-binding domain. It is found in CD44 receptor and mediates adhesive interactions during inflammatory leukocyte homing and tumor metastasis. It also plays an important role in arteriogenesis. The functional HA-binding domain of CD44 is an extended domain comprised of a single link module flanked with N-and C- extensions. These extensions are essential for folding and for functional activity. This group also contains the cell surface retention sequence (CRS) binding protein-1 (CRSBP-1) and lymph vessel endothelial receptor-1 (LYVE-1). CRSBP-1 is a cell surface binding protein for the CRS motif of PDGF-BB (platelet-derived growth factor-BB) and is responsible for the cell surface retention of PDGF-BB in SSV-transformed cells. CRSBP-1 may play a role in autocrine regulation of cell growth mediated by CRS containing growth regulators. LYVE-1 is preferentially expressed on the lymphatic endothelium and is used as a molecular marke
Probab=69.84  E-value=3.3  Score=36.90  Aligned_cols=21  Identities=29%  Similarity=0.534  Sum_probs=19.5

Q ss_pred             CCCHHHHHHHHHHcCCeeccc
Q psy1807         101 GASFADARSKCQASGGDLVHG  121 (618)
Q Consensus       101 ~~tw~~A~~~C~~~g~~L~~i  121 (618)
                      .+++.||++.|+++|+.||+.
T Consensus        18 ~lnf~eA~~aC~~~ga~lAs~   38 (144)
T cd03516          18 SLNFTEAKEACRALGLTLASK   38 (144)
T ss_pred             cCCHHHHHHHHHHcCCeeCCH
Confidence            479999999999999999984


No 70 
>PTZ00370 STEVOR; Provisional
Probab=69.50  E-value=2.6  Score=41.89  Aligned_cols=31  Identities=29%  Similarity=0.421  Sum_probs=14.0

Q ss_pred             EEeehHHHHHHHHHHHHHHH--HHHHhcCCCCC
Q psy1807         534 MISVGAGVVLFLLILLAIIY--ARLRKATPVKN  564 (618)
Q Consensus       534 ~~~~~~~~~l~l~vl~~~~~--~~~r~k~~~~~  564 (618)
                      -.|+++.|+|+|.|+|+++|  .+||||..||.
T Consensus       256 Pygiaalvllil~vvliilYiwlyrrRK~swkh  288 (296)
T PTZ00370        256 PYGIAALVLLILAVVLIILYIWLYRRRKNSWKH  288 (296)
T ss_pred             ccHHHHHHHHHHHHHHHHHHHHHHHhhcchhHH
Confidence            33334444444433333332  36666666653


No 71 
>PF08693 SKG6:  Transmembrane alpha-helix domain;  InterPro: IPR014805 SKG6 and AXL2 are membrane proteins that show polarised intracellular localisation [, ]. This entry represents the highly conserved transmembrane alpha-helical domain found in these proteins [, ]. The full-length AXL2 protein has a negative regulatory function in cytokinesis [].
Probab=69.03  E-value=3.4  Score=28.17  Aligned_cols=23  Identities=17%  Similarity=0.371  Sum_probs=10.5

Q ss_pred             cceEEeehHHHHHHHHHHHHHHH
Q psy1807         531 LGYMISVGAGVVLFLLILLAIIY  553 (618)
Q Consensus       531 ~~~~~~~~~~~~l~l~vl~~~~~  553 (618)
                      .+..++++.-+..++++|+++++
T Consensus        11 vaIa~~VvVPV~vI~~vl~~~l~   33 (40)
T PF08693_consen   11 VAIAVGVVVPVGVIIIVLGAFLF   33 (40)
T ss_pred             EEEEEEEEechHHHHHHHHHHhh
Confidence            44555555544444444433333


No 72 
>PF00514 Arm:  Armadillo/beta-catenin-like repeat;  InterPro: IPR000225 The armadillo (Arm) repeat is an approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila melanogaster segment polarity gene armadillo involved in signal transduction through wingless. Animal Arm-repeat proteins function in various processes, including intracellular signalling and cytoskeletal regulation, and include such proteins as beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumour suppressor protein, and the nuclear transport factor importin-alpha, amongst others []. A subset of these proteins is conserved across eukaryotic kingdoms. In higher plants, some Arm-repeat proteins function in intracellular signalling like their mammalian counterparts, while others have novel functions []. The 3-dimensional fold of an armadillo repeat is known from the crystal structure of beta-catenin, where the 12 repeats form a superhelix of alpha helices with three helices per unit []. The cylindrical structure features a positively charged grove, which presumably interacts with the acidic surfaces of the known interaction partners of beta-catenin.; GO: 0005515 protein binding; PDB: 2Z6G_A 1IQ1_C 3RZX_A 2C1M_A 3BTR_C 3OQS_A 3TPO_A 1IAL_A 1Q1S_C 1PJM_B ....
Probab=68.70  E-value=1.1  Score=30.59  Aligned_cols=38  Identities=32%  Similarity=0.418  Sum_probs=32.3

Q ss_pred             cccceeeccCCchhhhhhcccCCCchhHHHHHHhhhhcc
Q psy1807           2 DNKEYILSVGGIKPVTSLLLNSNHEQTVLNSITTLMYLV   40 (618)
Q Consensus         2 ~~~~~i~~~~~~~~~~~~~~~~~~~~~~l~~i~~l~~l~   40 (618)
                      ++++.|+..++++.+..+|+ +.+.++...++.+|-+|.
T Consensus         3 ~~~~~i~~~g~i~~Lv~ll~-~~~~~v~~~a~~al~nl~   40 (41)
T PF00514_consen    3 ENKQAIVEAGGIPPLVQLLK-SPDPEVQEEAAWALGNLA   40 (41)
T ss_dssp             HHHHHHHHTTHHHHHHHHTT-SSSHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHcccHHHHHHHHc-CCCHHHHHHHHHHHHHHh
Confidence            56778899999999999998 888889988888876653


No 73 
>PF08374 Protocadherin:  Protocadherin;  InterPro: IPR013585 The structure of protocadherins is similar to that of classic cadherins (IPR002126 from INTERPRO), but they also have some unique features associated with the cytoplasmic domains. They are expressed in a variety of organisms and are found in high concentrations in the brain where they seem to be localised mainly at cell-cell contact sites. Their expression seems to be developmentally regulated []. 
Probab=68.63  E-value=6.2  Score=37.54  Aligned_cols=29  Identities=24%  Similarity=0.484  Sum_probs=14.6

Q ss_pred             cccceEEeehHHHHHHHH-HHHHHHHHHHH
Q psy1807         529 QTLGYMISVGAGVVLFLL-ILLAIIYARLR  557 (618)
Q Consensus       529 ~~~~~~~~~~~~~~l~l~-vl~~~~~~~~r  557 (618)
                      ..+.++++++++++.++| ++++++.|++|
T Consensus        35 d~~~I~iaiVAG~~tVILVI~i~v~vR~CR   64 (221)
T PF08374_consen   35 DYVKIMIAIVAGIMTVILVIFIVVLVRYCR   64 (221)
T ss_pred             cceeeeeeeecchhhhHHHHHHHHHHHHHh
Confidence            345566665555544444 44444445344


No 74 
>PF01102 Glycophorin_A:  Glycophorin A;  InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Glycophorin A (PAS-2) and glycophorin B (PAS-3) belong to the MNS blood group system and are associated with antigens that include M/N, S/s, U, He, Mi(a), M(c), Vw, Mur, M(g), Vr, M(e), Mt(a), St(a), Ri(a), Cl(a), Ny(a), Hut, Hil, M(v), Far, Mit, Dantu, Hop, Nob, En(a), ENKT, amongst others. Glycophorin A is the major sialoglycoprotein of the erythrocyte membrane []. Structurally, glycophorin A consists of an N-terminal extracellular domain, heavily glycosylated on serine and threonine residues, followed by a transmembrane region and a C-terminal cytoplasmic domain. Other glycophorins in this entry such as Glycophorin B and Glycophorin E represent minor sialoglycoproteins in the erythrocyte membrane.; GO: 0016021 integral to membrane; PDB: 2KPF_B 1AFO_B 2KPE_A.
Probab=67.14  E-value=5  Score=34.95  Aligned_cols=15  Identities=27%  Similarity=0.217  Sum_probs=6.9

Q ss_pred             HHHHHHHHHHHHHHH
Q psy1807         542 VLFLLILLAIIYARL  556 (618)
Q Consensus       542 ~l~l~vl~~~~~~~~  556 (618)
                      +.+++++|+++|..|
T Consensus        75 aGvIg~Illi~y~ir   89 (122)
T PF01102_consen   75 AGVIGIILLISYCIR   89 (122)
T ss_dssp             HHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHH
Confidence            345555554444433


No 75 
>KOG1219|consensus
Probab=65.44  E-value=44  Score=42.89  Aligned_cols=14  Identities=29%  Similarity=0.392  Sum_probs=8.4

Q ss_pred             chhhchhhhccCCh
Q psy1807         594 NIYENVAENTYDNP  607 (618)
Q Consensus       594 ~~~~~~~~~~~~~~  607 (618)
                      |+|.++.-|.+-.|
T Consensus      4038 nVy~~v~PqvplrP 4051 (4289)
T KOG1219|consen 4038 NVYVPVVPQVPLRP 4051 (4289)
T ss_pred             CccccCCCCCCcCC
Confidence            77776665555444


No 76 
>PF14704 DERM:  Dermatopontin
Probab=64.88  E-value=17  Score=33.06  Aligned_cols=127  Identities=11%  Similarity=0.247  Sum_probs=73.9

Q ss_pred             ccCCEEEEEeCCCceecCCCeEEeC--CCCCcCCCCCcccccccCCCCC-ccccE--EEecccccCceeeecCCCCcEEc
Q psy1807         297 THGAVAIYACHENYTLIGETRRVCG--DGGKWNGTEPQCLFDWCAEPPQ-ISGGI--VTTSGRRTGSVATYSCEPGFILF  371 (618)
Q Consensus       297 ~~g~~~~~~C~~Gy~l~G~~~~~C~--~~G~Ws~~~p~C~~~~C~~p~~-~~~g~--~~~~~~~~g~~~~~~C~~Gy~l~  371 (618)
                      .++....|+|..|..|.+-.+....  +|-.|.-   .|.     ..+. +..+.  ....-..++..+.|+|+.|+.|.
T Consensus         2 ~~d~~f~f~Cp~gq~i~~v~S~~~n~~eDR~W~F---~C~-----~~~~~~~~~~~~~~~~vN~~d~~~~~tCp~n~~i~   73 (152)
T PF14704_consen    2 NYDQPFSFQCPSGQGISGVRSYHSNNREDRRWSF---GCS-----PVGSGPTQGSSCWWSYVNNYDQPFNFTCPSNYYIT   73 (152)
T ss_pred             CCCCCEEEECCCCCEEEEEEEEeCCCCCcccEEE---EEE-----eCCCCCCccceEcchhhhccCCcEEEECCCCCEEE
Confidence            3567789999999988876666654  6668854   344     3332 11111  11112367899999999999998


Q ss_pred             CCceEEEc--cCCeecCCCCceee------ecCC-CCCccCCCceEeecCcccccceEEEEcCCCceeccCCcceE--Ee
Q psy1807         372 GSNELKCG--LGGDWSGKAPTCKY------VDCG-SPANIDNGHYSLLNGTTTHGSVAEYTCDDDYWLEAGAMRRQ--VC  440 (618)
Q Consensus       372 G~~~~~C~--~~G~Ws~~~p~C~~------~~C~-~p~~~~~g~~~~~~~~~~~g~~~~~~C~~Gy~l~~~G~~~~--~C  440 (618)
                      |-.+..=.  +|-.|.   ..|..      ..|. .-.           -.-.+...+.|+-..||.|.  |..+.  .-
T Consensus        74 G~~S~h~n~~eDR~w~---f~C~~~~~~~~~~C~~~t~-----------~~N~~d~~~~~~~~~~~~i~--G~~S~~~~~  137 (152)
T PF14704_consen   74 GMQSYHDNSKEDRRWQ---FQCCRINGATLYNCYWTTG-----------YVNNYDGPLDFTVPSGYVIT--GVSSYHDNS  137 (152)
T ss_pred             EEEEEecCCcccCceE---EEEEecCCceeecceeccc-----------ccccCCCcEEEEcCCCcEEE--eEeeecCCC
Confidence            75544322  345564   23322      1222 110           01245677888888889888  76432  22


Q ss_pred             ccCCeee
Q psy1807         441 MRDGKWS  447 (618)
Q Consensus       441 ~~~G~Ws  447 (618)
                      ..|-.|.
T Consensus       138 ~~DR~wk  144 (152)
T PF14704_consen  138 NEDRRWK  144 (152)
T ss_pred             CcCceEE
Confidence            4455675


No 77 
>PF02793 HRM:  Hormone receptor domain;  InterPro: IPR001879 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The secretin-like GPCRs include secretin [], calcitonin [], parathyroid hormone/parathyroid hormone-related peptides [] and vasoactive intestinal peptide [], all of which activate adenylyl cyclase and the phosphatidyl-inositol-calcium pathway. These receptors contain seven transmembrane regions, in a manner reminiscent of the rhodopsins and other receptors believed to interact with G-proteins (however there is no significant sequence identity between these families, the secretin-like receptors thus bear their own unique '7TM' signature). Their N terminus is probably located on the extracellular side of the membrane and potentially glycosylated. This N-terminal region contains a long conserved region which allow the binding of large peptidic ligand such as glucagon, secretin, VIP and PACAP; this region contains five conserved cysteines residues which could be involved in disulphide bond. The C-terminal region of these receptor is probably cytoplasmic. Every receptor gene in this family is encoded on multiple exons, and several of these genes are alternatively spliced to yield functionally distinct products.  This domain is found in the extracellular part of some of the secretin-like (family 2) GPCRs including the calcitonin receptor; corticotropin releasing factor receptor 1; diuretic hormone receptor; glucagon-like peptide 1 receptor; and parathyroid hormone peptide receptor.; GO: 0004930 G-protein coupled receptor activity, 0016020 membrane; PDB: 3EHT_A 3EHU_A 2L27_A 3EHS_A 4DLO_B 2QKH_A 3C5T_A 3C59_A 3IOL_A 3N7R_B ....
Probab=64.61  E-value=7.4  Score=29.79  Aligned_cols=36  Identities=19%  Similarity=0.582  Sum_probs=27.9

Q ss_pred             ccccceEEEEcCCCce---eccCCcceEEeccCCeeeCC
Q psy1807         414 TTHGSVAEYTCDDDYW---LEAGAMRRQVCMRDGKWSGD  449 (618)
Q Consensus       414 ~~~g~~~~~~C~~Gy~---l~~~G~~~~~C~~~G~Ws~~  449 (618)
                      ...|.++...|..++.   ....|..++.|..+|.|...
T Consensus        17 T~~G~~a~~~CP~~~~~~~~~~~g~a~R~C~~nG~W~~~   55 (66)
T PF02793_consen   17 TPAGETASQPCPSGFYGFGSDTSGNATRNCTENGTWEEH   55 (66)
T ss_dssp             BETTEEEEEECSTTSTCCTSSTTSEEEEEEETTSSBECS
T ss_pred             CCCCCEEEeeCcCCccccccccceeEEEEcCCCCcCCCc
Confidence            4678999999998873   22226789999999999863


No 78 
>smart00008 HormR Domain present in hormone receptors.
Probab=60.78  E-value=10  Score=29.51  Aligned_cols=34  Identities=24%  Similarity=0.342  Sum_probs=26.1

Q ss_pred             CCCCcEEEEEcCCCCeeeCC---ceeEeCcCCcccCC
Q psy1807         474 LNVHSTLEFHCEPGHLLVGQ---ASLKCTRQGDWDNM  507 (618)
Q Consensus       474 ~~~g~~v~~~C~~Gy~l~G~---~~~~C~~~G~Ws~~  507 (618)
                      ...|.++...|..++.....   ....|..+|.|+..
T Consensus        18 t~~G~~~~~~CP~~~~g~~~~~~a~R~C~~~G~W~~~   54 (70)
T smart00008       18 TPAGQLVEVPCPDYFSGFSYKTGASRNCTENGGWSPP   54 (70)
T ss_pred             CCCCCEEEecCCCcccCcCCCCCEEEEcCCCCCCccC
Confidence            45688999999987654322   67899999999864


No 79 
>smart00008 HormR Domain present in hormone receptors.
Probab=59.02  E-value=9.8  Score=29.61  Aligned_cols=32  Identities=28%  Similarity=0.556  Sum_probs=25.2

Q ss_pred             cccCCEEEEEcCCCceecCC---ceeEec-CCcccC
Q psy1807         237 TKLGSTISYACPEGNMLVGS---ATRTCK-EGFWTG  268 (618)
Q Consensus       237 ~~~G~~~~y~C~~Gy~l~g~---~~~~C~-~g~Ws~  268 (618)
                      -..|.++...|+.++.....   ..+.|. +|.|+.
T Consensus        18 t~~G~~~~~~CP~~~~g~~~~~~a~R~C~~~G~W~~   53 (70)
T smart00008       18 TPAGQLVEVPCPDYFSGFSYKTGASRNCTENGGWSP   53 (70)
T ss_pred             CCCCCEEEecCCCcccCcCCCCCEEEEcCCCCCCcc
Confidence            45788999999998654422   789999 889985


No 80 
>KOG4297|consensus
Probab=57.69  E-value=30  Score=29.04  Aligned_cols=94  Identities=20%  Similarity=0.393  Sum_probs=55.0

Q ss_pred             ceEEEEecCCCCCHHHHHHHHHHcCCeecccCChHHHHHHHHHhhhcccCCCCccEEEeEEeCCCCCcceeEEeCCcccc
Q psy1807          91 RNCYEFNIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQKDPGITARTWKWVDGEVVT  170 (618)
Q Consensus        91 ~~CY~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~fl~~~l~~~~~~~~~~~~WIGl~~~~~~~~~~w~W~DGs~~~  170 (618)
                      ..|+..+.. ..+|..+...|...+++|+.+.+..++.++.......     ....|++.....  ....+.|.++..  
T Consensus        99 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~--~~~~~~~~~~~~--  168 (207)
T KOG4297|consen   99 CTCYTSFEW-TDGWSTTGTGCGGQGANLVGVLSVQENNFITSNFRLF-----LGCLWLGGDSSF--YFTYPTWGDGTT--  168 (207)
T ss_pred             ccccCchhh-ccCHHHHHHHHHHhCCCcCeeCCHHHHHHHHHhHHhe-----eeeEEecCCccc--ccccccccCCCC--
Confidence            345544322 2489999999999999999999999999998751110     012344332221  133455555554  


Q ss_pred             cCCCCCCCCCCCCCccceEEecCC
Q psy1807         171 KPSWGKDQPNNYNGEQNCVVLDGG  194 (618)
Q Consensus       171 ~~~W~~~~P~~~~~~~~Cv~l~~~  194 (618)
                      +..|....+........|+.+...
T Consensus       169 ~~~~~~~~~~~~~~~~~C~~~~~~  192 (207)
T KOG4297|consen  169 YSNWALGDPNPPGDDENCLSLSTS  192 (207)
T ss_pred             cccccccccCCCCCCCceEEEECC
Confidence            334444333322335678887544


No 81 
>PF12273 RCR:  Chitin synthesis regulation, resistance to Congo red;  InterPro: IPR020999  RCR proteins are ER membrane proteins that regulate chitin deposition in fungal cell walls. Although chitin, a linear polymer of beta-1,4-linked N-acetylglucosamine, constitutes only 2% of the cell wall it plays a vital role in the overall protection of the cell wall against stress, noxious chemicals and osmotic pressure changes. Congo red is a cell wall-disrupting benzidine-type dye extensively used in many cell wall mutant studies that specifically targets chitin in yeast cells and inhibits growth. RCR proteins render the yeasts resistant to Congo red by diminishing the content of chitin in the cell wall []. RCR proteins are probably regulating chitin synthase III interact directly with ubiquitin ligase Rsp5, and the VPEY motif is necessary for this, via interaction with the WW domains of Rsp5 []. 
Probab=55.95  E-value=8.7  Score=33.99  Aligned_cols=26  Identities=4%  Similarity=0.216  Sum_probs=16.6

Q ss_pred             hHHHHHHHHHHHHHHHHHHHhcCCCC
Q psy1807         538 GAGVVLFLLILLAIIYARLRKATPVK  563 (618)
Q Consensus       538 ~~~~~l~l~vl~~~~~~~~r~k~~~~  563 (618)
                      ++.+++++++|+++++.++++|++++
T Consensus         4 l~~iii~~i~l~~~~~~~~~rRR~r~   29 (130)
T PF12273_consen    4 LFAIIIVAILLFLFLFYCHNRRRRRR   29 (130)
T ss_pred             eHHHHHHHHHHHHHHHHHHHHHHhhc
Confidence            34455555566666677887777765


No 82 
>PF04478 Mid2:  Mid2 like cell wall stress sensor;  InterPro: IPR007567 This family represents a region near the C terminus of Mid2, which contains a transmembrane region. The remainder of the protein sequence is serine-rich and of low complexity, and is therefore impossible to align accurately. Mid2 is thought to act as a mechanosensor of cell wall stress. The C-terminal cytoplasmic region of Mid2 is known to interact with Rom2, a guanine nucleotide exchange factor (GEF) for Rho1, which is part of the cell wall integrity signalling pathway [].
Probab=55.72  E-value=3.7  Score=36.87  Aligned_cols=15  Identities=27%  Similarity=0.237  Sum_probs=6.5

Q ss_pred             ceEEeehHHHHHHHH
Q psy1807         532 GYMISVGAGVVLFLL  546 (618)
Q Consensus       532 ~~~~~~~~~~~l~l~  546 (618)
                      .+++.|+.++.++|+
T Consensus        51 VIGvVVGVGg~ill~   65 (154)
T PF04478_consen   51 VIGVVVGVGGPILLG   65 (154)
T ss_pred             EEEEEecccHHHHHH
Confidence            344444444444443


No 83 
>PF15102 TMEM154:  TMEM154 protein family
Probab=54.97  E-value=4  Score=36.41  Aligned_cols=21  Identities=24%  Similarity=0.233  Sum_probs=8.9

Q ss_pred             ccceEEeehHHHHHHHHHHHH
Q psy1807         530 TLGYMISVGAGVVLFLLILLA  550 (618)
Q Consensus       530 ~~~~~~~~~~~~~l~l~vl~~  550 (618)
                      .+-+.|-.+..++|+|+|+++
T Consensus        58 iLmIlIP~VLLvlLLl~vV~l   78 (146)
T PF15102_consen   58 ILMILIPLVLLVLLLLSVVCL   78 (146)
T ss_pred             EEEEeHHHHHHHHHHHHHHHh
Confidence            455555433333333333333


No 84 
>PTZ00046 rifin; Provisional
Probab=53.45  E-value=7.6  Score=40.22  Aligned_cols=18  Identities=22%  Similarity=0.301  Sum_probs=10.4

Q ss_pred             HHHHHHHHHHHhcCCCCC
Q psy1807         547 ILLAIIYARLRKATPVKN  564 (618)
Q Consensus       547 vl~~~~~~~~r~k~~~~~  564 (618)
                      |++-+++|.||||+-+|+
T Consensus       332 vIIYLILRYRRKKKMkKK  349 (358)
T PTZ00046        332 VIIYLILRYRRKKKMKKK  349 (358)
T ss_pred             HHHHHHHHhhhcchhHHH
Confidence            444466666666665554


No 85 
>PF05454 DAG1:  Dystroglycan (Dystrophin-associated glycoprotein 1);  InterPro: IPR008465 Dystroglycan is one of the dystrophin-associated glycoproteins, which is encoded by a 5.5 kb transcript in Homo sapiens. The protein product is cleaved into two non-covalently associated subunits, [alpha] (N-terminal) and [beta] (C-terminal). In skeletal muscle the dystroglycan complex works as a transmembrane linkage between the extracellular matrix and the cytoskeleton [alpha]-dystroglycan is extracellular and binds to merosin ([alpha]-2 laminin) in the basement membrane, while [beta]-dystroglycan is a transmembrane protein and binds to dystrophin, which is a large rod-like cytoskeletal protein, absent in Duchenne muscular dystrophy patients. Dystrophin binds to intracellular actin cables. In this way, the dystroglycan complex, which links the extracellular matrix to the intracellular actin cables, is thought to provide structural integrity in muscle tissues. The dystroglycan complex is also known to serve as an agrin receptor in muscle, where it may regulate agrin-induced acetylcholine receptor clustering at the neuromuscular junction. There is also evidence which suggests the function of dystroglycan as a part of the signal transduction pathway because it is shown that Grb2, a mediator of the Ras-related signal pathway, can interact with the cytoplasmic domain of dystroglycan. In general, aberrant expression of dystrophin-associated protein complex underlies the pathogenesis of Duchenne muscular dystrophy, Becker muscular dystrophy and severe childhood autosomal recessive muscular dystrophy. Interestingly, no genetic disease has been described for either [alpha]- or [beta]-dystroglycan. Dystroglycan is widely distributed in non-muscle tissues as well as in muscle tissues. During epithelial morphogenesis of kidney, the dystroglycan complex is shown to act as a receptor for the basement membrane. Dystroglycan expression in Mus musculus brain and neural retina has also been reported. However, the physiological role of dystroglycan in non-muscle tissues has remained unclear [].; PDB: 1EG4_P.
Probab=53.17  E-value=4.5  Score=40.89  Aligned_cols=29  Identities=17%  Similarity=0.263  Sum_probs=0.0

Q ss_pred             EEeehHHHHHHHHHHHHHHHHHHHhcCCC
Q psy1807         534 MISVGAGVVLFLLILLAIIYARLRKATPV  562 (618)
Q Consensus       534 ~~~~~~~~~l~l~vl~~~~~~~~r~k~~~  562 (618)
                      ..+|++.++||++++++.+.+|+||+++.
T Consensus       150 IpaVVI~~iLLIA~iIa~icyrrkR~GK~  178 (290)
T PF05454_consen  150 IPAVVIAAILLIAGIIACICYRRKRKGKM  178 (290)
T ss_dssp             -----------------------------
T ss_pred             HHHHHHHHHHHHHHHHHHHhhhhhhcccc
Confidence            33444444444444444444454444443


No 86 
>PF02793 HRM:  Hormone receptor domain;  InterPro: IPR001879 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The secretin-like GPCRs include secretin [], calcitonin [], parathyroid hormone/parathyroid hormone-related peptides [] and vasoactive intestinal peptide [], all of which activate adenylyl cyclase and the phosphatidyl-inositol-calcium pathway. These receptors contain seven transmembrane regions, in a manner reminiscent of the rhodopsins and other receptors believed to interact with G-proteins (however there is no significant sequence identity between these families, the secretin-like receptors thus bear their own unique '7TM' signature). Their N terminus is probably located on the extracellular side of the membrane and potentially glycosylated. This N-terminal region contains a long conserved region which allow the binding of large peptidic ligand such as glucagon, secretin, VIP and PACAP; this region contains five conserved cysteines residues which could be involved in disulphide bond. The C-terminal region of these receptor is probably cytoplasmic. Every receptor gene in this family is encoded on multiple exons, and several of these genes are alternatively spliced to yield functionally distinct products.  This domain is found in the extracellular part of some of the secretin-like (family 2) GPCRs including the calcitonin receptor; corticotropin releasing factor receptor 1; diuretic hormone receptor; glucagon-like peptide 1 receptor; and parathyroid hormone peptide receptor.; GO: 0004930 G-protein coupled receptor activity, 0016020 membrane; PDB: 3EHT_A 3EHU_A 2L27_A 3EHS_A 4DLO_B 2QKH_A 3C5T_A 3C59_A 3IOL_A 3N7R_B ....
Probab=52.17  E-value=12  Score=28.57  Aligned_cols=34  Identities=26%  Similarity=0.570  Sum_probs=26.7

Q ss_pred             CCCCcEEEEEcCCCCe-----eeCCceeEeCcCCcccCC
Q psy1807         474 LNVHSTLEFHCEPGHL-----LVGQASLKCTRQGDWDNM  507 (618)
Q Consensus       474 ~~~g~~v~~~C~~Gy~-----l~G~~~~~C~~~G~Ws~~  507 (618)
                      ...|.++...|..++.     ..|.....|..+|+|...
T Consensus        17 T~~G~~a~~~CP~~~~~~~~~~~g~a~R~C~~nG~W~~~   55 (66)
T PF02793_consen   17 TPAGETASQPCPSGFYGFGSDTSGNATRNCTENGTWEEH   55 (66)
T ss_dssp             BETTEEEEEECSTTSTCCTSSTTSEEEEEEETTSSBECS
T ss_pred             CCCCCEEEeeCcCCccccccccceeEEEEcCCCCcCCCc
Confidence            4568889999998863     136678999999999864


No 87 
>TIGR01477 RIFIN variant surface antigen, rifin family. This model represents the rifin branch of the rifin/stevor family (pfam02009) of predicted variant surface antigens as found in Plasmodium falciparum. This model is based on a set of rifin sequences kindly provided by Matt Berriman from the Sanger Center. This is a global model and assesses a penalty for incomplete sequence. Additional fragmentary sequences may be found with the fragment model and a cutoff of 20 bits.
Probab=51.99  E-value=8.3  Score=39.79  Aligned_cols=18  Identities=22%  Similarity=0.301  Sum_probs=10.2

Q ss_pred             HHHHHHHHHHHhcCCCCC
Q psy1807         547 ILLAIIYARLRKATPVKN  564 (618)
Q Consensus       547 vl~~~~~~~~r~k~~~~~  564 (618)
                      |++-+++|.||||+-+|+
T Consensus       327 vIIYLILRYRRKKKMkKK  344 (353)
T TIGR01477       327 VIIYLILRYRRKKKMKKK  344 (353)
T ss_pred             HHHHHHHHhhhcchhHHH
Confidence            444466666666665554


No 88 
>PF15330 SIT:  SHP2-interacting transmembrane adaptor protein, SIT
Probab=51.46  E-value=22  Score=30.24  Aligned_cols=7  Identities=29%  Similarity=0.506  Sum_probs=2.8

Q ss_pred             chhhchh
Q psy1807         594 NIYENVA  600 (618)
Q Consensus       594 ~~~~~~~  600 (618)
                      -||-++.
T Consensus        47 p~YgNL~   53 (107)
T PF15330_consen   47 PCYGNLE   53 (107)
T ss_pred             ccccccc
Confidence            3444443


No 89 
>PF02439 Adeno_E3_CR2:  Adenovirus E3 region protein CR2;  InterPro: IPR003470 Early region 3 (E3) of human adenoviruses (Ads) codes for proteins that appear to control viral interactions with the host []. This region called CR1 (conserved region 1) [] is found three times in Human adenovirus 19 (a subgroup D adenovirus) 49 kDa protein in the E3 region. CR1 is also found in the 20.1 Kd protein of subgroup B adenoviruses. The function of this 80 amino acid region is unknown. This region is probably a divergent immunoglobulin domain.
Probab=49.92  E-value=16  Score=24.59  Aligned_cols=16  Identities=25%  Similarity=0.293  Sum_probs=8.2

Q ss_pred             HHHHHHHHHHHHhcCC
Q psy1807         546 LILLAIIYARLRKATP  561 (618)
Q Consensus       546 ~vl~~~~~~~~r~k~~  561 (618)
                      +++.+.+|..++||.+
T Consensus        19 iii~~~~YaCcykk~~   34 (38)
T PF02439_consen   19 IIICMFYYACCYKKHR   34 (38)
T ss_pred             HHHHHHHHHHHHcccc
Confidence            3444555555555554


No 90 
>KOG0994|consensus
Probab=49.10  E-value=44  Score=39.66  Aligned_cols=123  Identities=22%  Similarity=0.503  Sum_probs=61.5

Q ss_pred             CCceeEecCCcccCCCCceeeecCCCCC---ccCCee-EEecCCCcccCCEEEEEeCCCceecCCCeEEeCCCCCcCCCC
Q psy1807         255 GSATRTCKEGFWTGVAPTCQYVDCGKLE---HIEHGT-VTLETTRTTHGAVAIYACHENYTLIGETRRVCGDGGKWNGTE  330 (618)
Q Consensus       255 g~~~~~C~~g~Ws~~~p~C~~~~C~~~~---~~~ng~-~~~~~~~~~~g~~~~~~C~~Gy~l~G~~~~~C~~~G~Ws~~~  330 (618)
                      |-..-.|+-|.|.  .|.|..-.|..-.   .+..|. +.+.+  +.-|..+. +|..||.  |+..+         +.-
T Consensus       840 grqCnqCqpG~Wg--FPeCr~CqCNgHA~~Cd~~tGaCi~CqD--~T~G~~Cd-rCl~Gyy--GdP~l---------g~g  903 (1758)
T KOG0994|consen  840 GRQCNQCQPGYWG--FPECRPCQCNGHADTCDPITGACIDCQD--STTGHSCD-RCLDGYY--GDPRL---------GSG  903 (1758)
T ss_pred             hhhccccCCCccC--CCcCccccccCcccccCccccccccccc--cccccchh-hhhcccc--CCccc---------CCC
Confidence            4455566789996  6888876664211   111121 22211  22222222 2555663  32211         012


Q ss_pred             CcccccccCCCCCccccEEEe---cccccCceeeecCCCCcEEcCCceEEEccCCeecCC--CCceeeecCCC
Q psy1807         331 PQCLFDWCAEPPQISGGIVTT---SGRRTGSVATYSCEPGFILFGSNELKCGLGGDWSGK--APTCKYVDCGS  398 (618)
Q Consensus       331 p~C~~~~C~~p~~~~~g~~~~---~~~~~g~~~~~~C~~Gy~l~G~~~~~C~~~G~Ws~~--~p~C~~~~C~~  398 (618)
                      -.|.+-.|  |..+..|.-..   .-..+...+...|++||  .|+.--.|. +|.|-.+  --+|+.-.|..
T Consensus       904 ~~CrPCpC--P~gp~Sg~~~A~sC~~d~~t~~ivC~C~~GY--~G~RCe~CA-~~~fGnP~~GGtCq~CeC~~  971 (1758)
T KOG0994|consen  904 IGCRPCPC--PDGPASGRQHADSCYLDTRTQQIVCHCQEGY--SGSRCEICA-DNHFGNPSEGGTCQKCECSN  971 (1758)
T ss_pred             CCCCCCCC--CCCCccchhccccccccccccceeeecccCc--cccchhhhc-ccccCCcccCCccccccccC
Confidence            34655344  44444443222   12245567778999999  577666774 7778532  24576555554


No 91 
>PF15069 FAM163:  FAM163 family
Probab=48.74  E-value=36  Score=30.34  Aligned_cols=26  Identities=27%  Similarity=0.529  Sum_probs=13.8

Q ss_pred             ceEEeehHHHHH--HHHHHHHHHHHHHH
Q psy1807         532 GYMISVGAGVVL--FLLILLAIIYARLR  557 (618)
Q Consensus       532 ~~~~~~~~~~~l--~l~vl~~~~~~~~r  557 (618)
                      +..++.++++..  ||.|++++.+.||+
T Consensus         4 GTvVItGgILAtVILLcIIaVLCYCRLQ   31 (143)
T PF15069_consen    4 GTVVITGGILATVILLCIIAVLCYCRLQ   31 (143)
T ss_pred             eeEEEechHHHHHHHHHHHHHHHHHhhH
Confidence            444555554433  33366666666765


No 92 
>PF12768 Rax2:  Cortical protein marker for cell polarity
Probab=47.65  E-value=30  Score=35.09  Aligned_cols=36  Identities=25%  Similarity=0.425  Sum_probs=19.5

Q ss_pred             ccccceEEeehH---HHHHHHHHHHHHHHHHHHhcCCCC
Q psy1807         528 AQTLGYMISVGA---GVVLFLLILLAIIYARLRKATPVK  563 (618)
Q Consensus       528 ~~~~~~~~~~~~---~~~l~l~vl~~~~~~~~r~k~~~~  563 (618)
                      ...+++.+.+++   ..+++|++++.+|+.+.|||+++.
T Consensus       223 ~l~~G~VVlIslAiALG~v~ll~l~Gii~~~~~r~~~~~  261 (281)
T PF12768_consen  223 KLSRGFVVLISLAIALGTVFLLVLIGIILAYIRRRRQGY  261 (281)
T ss_pred             cccceEEEEEehHHHHHHHHHHHHHHHHHHHHHhhhccC
Confidence            344555443322   224466677777777776665543


No 93 
>PF11980 DUF3481:  Domain of unknown function (DUF3481);  InterPro: IPR022579  This domain of unknown function is located in the C terminus of the eukaryotic neuropilin receptor family of proteins. It is found in association with PF00754 from PFAM, PF00431 from PFAM and PF00629 from PFAM. There are two completely conserved residues (Y and E) that may be functionally important.
Probab=47.19  E-value=22  Score=28.40  Aligned_cols=31  Identities=19%  Similarity=0.372  Sum_probs=12.8

Q ss_pred             EEeehHHHHHHHHHHHHHHHHHHHhcCCCCC
Q psy1807         534 MISVGAGVVLFLLILLAIIYARLRKATPVKN  564 (618)
Q Consensus       534 ~~~~~~~~~l~l~vl~~~~~~~~r~k~~~~~  564 (618)
                      +++.++.++++..|.+++++..-|.+-..|+
T Consensus        19 iiA~gga~llL~~v~l~vvL~C~r~~~a~kk   49 (87)
T PF11980_consen   19 IIAMGGALLLLVAVCLGVVLYCHRFHWAAKK   49 (87)
T ss_pred             HHhhccHHHHHHHHHHHHHHhhhhhcccccc
Confidence            3333333333333444444444444444444


No 94 
>PF12662 cEGF:  Complement Clr-like EGF-like
Probab=45.33  E-value=14  Score=22.16  Aligned_cols=19  Identities=26%  Similarity=0.562  Sum_probs=13.4

Q ss_pred             EEEcCCCCeeeCCceeEeCc
Q psy1807         481 EFHCEPGHLLVGQASLKCTR  500 (618)
Q Consensus       481 ~~~C~~Gy~l~G~~~~~C~~  500 (618)
                      +.+|.+||+|... ..+|.+
T Consensus         3 ~C~C~~Gy~l~~d-~~~C~D   21 (24)
T PF12662_consen    3 TCSCPPGYQLSPD-GRSCED   21 (24)
T ss_pred             EeeCCCCCcCCCC-CCcccc
Confidence            5679999998643 366763


No 95 
>smart00607 FTP eel-Fucolectin Tachylectin-4 Pentaxrin-1 Domain.
Probab=44.94  E-value=20  Score=32.61  Aligned_cols=26  Identities=35%  Similarity=0.386  Sum_probs=23.8

Q ss_pred             CCCceEEEEEeecCCceeeeEEEeee
Q psy1807          42 PQSKSVFIQLVGVEGSLSLCEVEIFT   67 (618)
Q Consensus        42 ~~~~~~~~~~~~~~~~l~~C~~~~~~   67 (618)
                      ..+|+|.++|.+.+..|+|||+.+|.
T Consensus       121 ~~GRYV~i~Lpg~~~~L~LCEVeV~~  146 (151)
T smart00607      121 MIGRYVTVYLPKPNESLILCEVEVNA  146 (151)
T ss_pred             CeeEEEEEEecCCCccEEEeEEEEcc
Confidence            38999999999988899999999986


No 96 
>KOG1025|consensus
Probab=44.78  E-value=37  Score=39.26  Aligned_cols=30  Identities=27%  Similarity=0.564  Sum_probs=23.6

Q ss_pred             cccccCCEEEEEcCCCcee-cCCceeEecCC
Q psy1807         235 VNTKLGSTISYACPEGNML-VGSATRTCKEG  264 (618)
Q Consensus       235 ~~~~~G~~~~y~C~~Gy~l-~g~~~~~C~~g  264 (618)
                      ..|.||.++.=.|+..|.. .|+-.+.|..+
T Consensus       274 gkY~~Ga~CVk~CP~n~lvD~~sCVRaCp~~  304 (1177)
T KOG1025|consen  274 GKYQYGATCVKECPHNLLVDNGSCVRACPDD  304 (1177)
T ss_pred             ccccccCeehhhCCchhEecCCceeeecCCc
Confidence            5699999999999999876 45566778743


No 97 
>TIGR03370 PEPCTERM_Roseo variant PEP-CTERM putative exosortase signal, Roseobacter type. A probable protein export sorting signal, PEP-CTERM, was described by Haft, et al. (PubMed:16930487). It is predicted to interact with a putative transpeptidase we designate exosortase. Most examples of this signal are recognized by model TIGR02595, but some unusual clades require different models. This model describes a variant with conserved motif VPLPA, rather than VPEP. This variant is found prominently in two members of the Rhodobacterales, namely Jannaschia sp. CCS1 and Roseobacter denitrificans OCh 114. One interesting member protein has a full-length duplication and therefore two copies of this putative sorting domain.
Probab=44.13  E-value=37  Score=20.83  Aligned_cols=20  Identities=30%  Similarity=0.295  Sum_probs=10.8

Q ss_pred             HHHHHHHHHHHHHHHHHHhc
Q psy1807         540 GVVLFLLILLAIIYARLRKA  559 (618)
Q Consensus       540 ~~~l~l~vl~~~~~~~~r~k  559 (618)
                      ++.+++..|..+-..|||+|
T Consensus         6 ~~~LLl~gLggl~~~rRRrk   25 (26)
T TIGR03370         6 GALLLLAGLGGLGAMRRRRR   25 (26)
T ss_pred             hHHHHHHHHHHHHHHHHhhc
Confidence            44555555555555555554


No 98 
>KOG3653|consensus
Probab=43.45  E-value=98  Score=33.50  Aligned_cols=29  Identities=31%  Similarity=0.518  Sum_probs=16.0

Q ss_pred             CcceEEeccCCeee--CCCCcee-eecCCCCCcC
Q psy1807         434 AMRRQVCMRDGKWS--GDTPSCE-LITCPEPEVP  464 (618)
Q Consensus       434 G~~~~~C~~~G~Ws--~~~p~C~-~~~C~~p~~~  464 (618)
                      |...+.+  .|-|+  .+.+.|. ...|..-..+
T Consensus        78 ~~~~~v~--qGC~~~~~D~~~c~~~~eCv~s~~~  109 (534)
T KOG3653|consen   78 GTIEVVK--QGCWSHITDDINCEDSSECVVSAEP  109 (534)
T ss_pred             CceeEEe--ecCccccCCccccccccccccCCCC
Confidence            4445555  47775  4566676 4466554333


No 99 
>PF05393 Hum_adeno_E3A:  Human adenovirus early E3A glycoprotein;  InterPro: IPR008652 This family consists of several early glycoproteins (E3A), from human adenovirus type 2.; GO: 0016021 integral to membrane
Probab=43.41  E-value=42  Score=27.05  Aligned_cols=20  Identities=10%  Similarity=0.318  Sum_probs=9.8

Q ss_pred             HHHHHHHHHHHHHHHHhcCC
Q psy1807         542 VLFLLILLAIIYARLRKATP  561 (618)
Q Consensus       542 ~l~l~vl~~~~~~~~r~k~~  561 (618)
                      ++|++++++++.+.++||+.
T Consensus        42 ~iFil~VilwfvCC~kRkrs   61 (94)
T PF05393_consen   42 GIFILLVILWFVCCKKRKRS   61 (94)
T ss_pred             HHHHHHHHHHHHHHHHhhhc
Confidence            34455555555554444333


No 100
>PF03302 VSP:  Giardia variant-specific surface protein;  InterPro: IPR005127 During infection, the intestinal protozoan parasite Giardia lamblia virus undergoes continuous antigenic variation which is determined by diversification of the parasite's major surface antigen, named VSP (variant surface protein).
Probab=41.90  E-value=20  Score=38.28  Aligned_cols=31  Identities=23%  Similarity=0.390  Sum_probs=19.4

Q ss_pred             EEcCCCCeeeCCceeEeCc---CCcccCCCCceEE
Q psy1807         482 FHCEPGHLLVGQASLKCTR---QGDWDNMSPQFIS  513 (618)
Q Consensus       482 ~~C~~Gy~l~G~~~~~C~~---~G~Ws~~~P~C~~  513 (618)
                      ..|.+||.+.+..-+.|..   +..|.+ .+.|..
T Consensus       304 t~C~~Gyy~~~~~c~~C~~c~~~~~v~G-v~~C~~  337 (397)
T PF03302_consen  304 TACADGYYKSGSGCVKCTSCSSNNGVTG-VKGCVS  337 (397)
T ss_pred             cchhccccccccCCccccccCCCCcCcC-CCcCcc
Confidence            5788888877776667764   223554 455654


No 101
>PF02038 ATP1G1_PLM_MAT8:  ATP1G1/PLM/MAT8 family;  InterPro: IPR000272  The FXYD protein family contains at least seven members in mammals []. Two other family members that are not obvious orthologs of any identified mammalian FXYD protein exist in zebrafish. All these proteins share a signature sequence of six conserved amino acids comprising the FXYD motif in the NH2-terminus, and two glycines and one serine residue in the transmembrane domain. FXYD proteins are widely distributed in mammalian tissues with prominent expression in tissues that perform fluid and solute transport or that are electrically excitable.   Initial functional characterisation suggested that FXYD proteins act as channels or as modulators of ion channels however studies have revealed that most FXYD proteins have another specific function and act as tissue-specific regulatory subunits of the Na,K-ATPase. Each of these auxiliary subunits produces a distinct functional effect on the transport characteristics of the Na,K-ATPase that is adjusted to the specific functional demands of the tissue in which the FXYD protein is expressed. FXYD proteins appear to preferentially associate with Na,K-ATPase alpha1-beta isozymes, and affect their function in a way that render them operationally complementary or supplementary to coexisting isozymes.; GO: 0005216 ion channel activity, 0006811 ion transport, 0016020 membrane; PDB: 2JO1_A 2JP3_A 2ZXE_G 3A3Y_G 3N23_E 3B8E_H 3KDP_G 3N2F_E.
Probab=41.31  E-value=13  Score=26.64  Aligned_cols=26  Identities=27%  Similarity=0.230  Sum_probs=13.6

Q ss_pred             hHHHHHHHHHHHHHHHHHHHhcCCCC
Q psy1807         538 GAGVVLFLLILLAIIYARLRKATPVK  563 (618)
Q Consensus       538 ~~~~~l~l~vl~~~~~~~~r~k~~~~  563 (618)
                      +..++|+++.+++++-.|.|.|.+.+
T Consensus        20 i~A~vlfi~Gi~iils~kckCk~~qk   45 (50)
T PF02038_consen   20 IFAGVLFILGILIILSGKCKCKFNQK   45 (50)
T ss_dssp             HHHHHHHHHHHHHHCTTHHHHHHSTT
T ss_pred             HHHHHHHHHHHHHHHcCccccCCCCC
Confidence            33445555555555555666544443


No 102
>KOG1219|consensus
Probab=40.90  E-value=1.5e+02  Score=38.57  Aligned_cols=26  Identities=19%  Similarity=0.643  Sum_probs=17.7

Q ss_pred             EEEEeCCCceecCCCeEEeCCCCCcCC
Q psy1807         302 AIYACHENYTLIGETRRVCGDGGKWNG  328 (618)
Q Consensus       302 ~~~~C~~Gy~l~G~~~~~C~~~G~Ws~  328 (618)
                      ..+.|..|....|...++-. +|+|..
T Consensus      3742 l~~~cgsG~Givg~q~~~Vn-DgqWHs 3767 (4289)
T KOG1219|consen 3742 LLADCGSGPGIVGSQKRTVN-DGQWHS 3767 (4289)
T ss_pred             EEEecCCCCCcccccceEee-cCceeE
Confidence            35578888866676655544 778865


No 103
>KOG4289|consensus
Probab=39.68  E-value=1e+02  Score=37.87  Aligned_cols=139  Identities=26%  Similarity=0.443  Sum_probs=83.0

Q ss_pred             eecCCCCcEEcC---CceEEEccCCeecCCCCceeeecCCCC------CccCCCceEeecCccc-ccceEEEEcCCCcee
Q psy1807         361 TYSCEPGFILFG---SNELKCGLGGDWSGKAPTCKYVDCGSP------ANIDNGHYSLLNGTTT-HGSVAEYTCDDDYWL  430 (618)
Q Consensus       361 ~~~C~~Gy~l~G---~~~~~C~~~G~Ws~~~p~C~~~~C~~p------~~~~~g~~~~~~~~~~-~g~~~~~~C~~Gy~l  430 (618)
                      +..|.+||.=.-   .....| ..| |-+ .|+|.+-.|...      ..-++|.-.|....|. .|+  =+.|+=+   
T Consensus      1740 ~C~C~~g~~G~~Ce~~~dq~C-PrG-WWG-~P~CgpC~CavsKgfdp~CnKt~G~CqCKe~hy~~~~~--Cl~CdC~--- 1811 (2531)
T KOG4289|consen 1740 TCECPPGYTGPYCELRADQPC-PRG-WWG-FPTCGPCNCAVSKGFDPDCNKTNGQCQCKENHYRPIGS--CLPCDCY--- 1811 (2531)
T ss_pred             eEECCCcccCcchhhhccCCC-CCc-ccC-CCCccCccccccCCCCCCccccCcceeeccccccCCCc--ceeeccc---
Confidence            457888874210   123456 356 555 899987666543      2345777777655553 455  3445555   


Q ss_pred             ccCCcceEEeccCCeeeCCC----CceeeecCCCCCc--CCCcEEE----c----------ccCCCCcEEEEEcCCCCee
Q psy1807         431 EAGAMRRQVCMRDGKWSGDT----PSCELITCPEPEV--PKGGYVV----G----------YDLNVHSTLEFHCEPGHLL  490 (618)
Q Consensus       431 ~~~G~~~~~C~~~G~Ws~~~----p~C~~~~C~~p~~--~~~g~~~----~----------~~~~~g~~v~~~C~~Gy~l  490 (618)
                      .  |+.++.|.++|+--=.+    -.|.  .|..|-.  -..|-..    +          ...++|..+...|..|-  
T Consensus      1812 ~--Gs~Sr~C~adGqC~C~pgaiGRqCd--rCd~pfaevttlgCrv~y~~CPk~ie~giwWP~tkfG~~a~~pCP~G~-- 1885 (2531)
T KOG4289|consen 1812 F--GSDSRECDADGQCPCKPGAIGRQCD--RCDNPFAEVTTLGCRVVYDKCPKAIEAGIWWPQTKFGSPAAVPCPKGS-- 1885 (2531)
T ss_pred             c--CCCcccccCCCcCCCCCcccccccc--ccCChhhhccccCcEEccccCcHHHhhcccccccccCCcccccCCCCc--
Confidence            3  88899998888642100    0121  3444421  1122111    1          13578999999999995  


Q ss_pred             eCCceeEeCcCCcccCC-CCceEE
Q psy1807         491 VGQASLKCTRQGDWDNM-SPQFIS  513 (618)
Q Consensus       491 ~G~~~~~C~~~G~Ws~~-~P~C~~  513 (618)
                      .|.+...|.+.|.|.++ +-.|+.
T Consensus      1886 ~G~AvR~C~e~~gWl~pnlfNCTs 1909 (2531)
T KOG4289|consen 1886 SGLAVRHCDETGGWLPPNLFNCTS 1909 (2531)
T ss_pred             chhhhhhhhccCCCCCchhcccCC
Confidence            48889999999999974 445665


No 104
>PF02009 Rifin_STEVOR:  Rifin/stevor family;  InterPro: IPR002858 Malaria is still a major cause of mortality in many areas of the world. Plasmodium falciparum causes the most severe human form of the disease and is responsible for most fatalities. Severe cases of malaria can occur when the parasite invades and then proliferates within red blood cell erythrocytes. The parasite produces many variant antigenic proteins, encoded by multigene families, which are present on the surface of the infected erythrocyte and play important roles in virulence. A crucial survival mechanism for the malaria parasite is its ability to evade the immune response by switching these variant surface antigens. The high virulence of P. falciparum relative to other malarial parasites is in large part due to the fact that in this organism many of these surface antigens mediate the binding of infected erythrocytes to the vascular endothelium (cytoadherence) and non-infected erythrocytes (rosetting). This can lead to the accumulation of infected cells in the vasculature of a variety of organs, blocking the blood flow and reducing the oxygen supply. Clinical symptoms of severe infection can include fever, progressive anaemia, multi-organ dysfunction and coma. For more information see []. Several multicopy gene families have been described in Plasmodium falciparum, including the stevor family of subtelomeric open reading frames and the rif interspersed repetitive elements. Both families contain three predicted transmembrane segments. It has been proposed that stevor and rif are members of a larger superfamily that code for variant surface antigens [].
Probab=39.26  E-value=16  Score=37.33  Aligned_cols=27  Identities=26%  Similarity=0.410  Sum_probs=19.0

Q ss_pred             hHHHHHHHHHHHHHHHHHHHhcCCCCC
Q psy1807         538 GAGVVLFLLILLAIIYARLRKATPVKN  564 (618)
Q Consensus       538 ~~~~~l~l~vl~~~~~~~~r~k~~~~~  564 (618)
                      .+.++|++++++++||..||-||++|-
T Consensus       261 SiiaIliIVLIMvIIYLILRYRRKKKm  287 (299)
T PF02009_consen  261 SIIAILIIVLIMVIIYLILRYRRKKKM  287 (299)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHhhh
Confidence            334455555667788889998888765


No 105
>TIGR03778 VPDSG_CTERM VPDSG-CTERM exosortase interaction domain. Through in silico analysis, we previously described the PEP-CTERM/exosortase system (PubMed:16930487). This model describes a PEP-CTERM-like variant C-terminal protein sorting signal, as found at the C-terminus of twenty otherwise unrelated proteins in Verrucomicrobiae bacterium DG1235. The variant motif, VPDSG, seems an intermediate between the VPEP motif (TIGR02595) of typical exosortase systems and the classical LPXTG of sortase in Gram-positive bacteria.
Probab=37.86  E-value=46  Score=20.42  Aligned_cols=19  Identities=32%  Similarity=0.336  Sum_probs=10.3

Q ss_pred             HHHHHHHHHHHHHHHHHHH
Q psy1807         539 AGVVLFLLILLAIIYARLR  557 (618)
Q Consensus       539 ~~~~l~l~vl~~~~~~~~r  557 (618)
                      ..++|+.+.+++++..|||
T Consensus         6 ST~~Ll~~~l~~l~~~rRr   24 (26)
T TIGR03778         6 STLALLGLGLLGLLGLRRR   24 (26)
T ss_pred             hHHHHHHHHHHHHHHHhhc
Confidence            3445555555555555554


No 106
>PF06084 Cytomega_TRL10:  Cytomegalovirus TRL10 protein;  InterPro: IPR009284 This family consists of several Cytomegalovirus TRL10 proteins. TRL10 represents a structural component of the virus particle and like the other HCMV envelope glycoproteins, is present in a disulphide-linked complex [].
Probab=35.50  E-value=16  Score=30.82  Aligned_cols=17  Identities=29%  Similarity=0.540  Sum_probs=12.1

Q ss_pred             cchhhchh----hhccCChhh
Q psy1807         593 DNIYENVA----ENTYDNPYE  609 (618)
Q Consensus       593 ~~~~~~~~----~~~~~~~~~  609 (618)
                      .||||.||    .|--|+|-.
T Consensus       115 qniyeripyrp~rq~d~~p~~  135 (150)
T PF06084_consen  115 QNIYERIPYRPCRQNDNSPPI  135 (150)
T ss_pred             cchhhhcCCCcccccCCCCcc
Confidence            38999998    566666643


No 107
>TIGR02595 PEP_exosort PEP-CTERM putative exosortase interaction domain. This model describes a 25-residue domain that includes a near-invariant Pro-Glu-Pro (PEP) motif, a thirteen residue strongly hydrophobic sequence likely to span the membrane, and a five-residue strongly basic motif that often contains four Arg residues. In nearly every case, this motif is found within nine residues, and usually within five residues, of the extreme C-terminus of the protein. Proteins with this motif typically have signal sequences at the N-terminus. This region appears many times per genome or not at all, and co-occurs in genomes with a proposed protein-sorting integral membrane protein we designate exosortase (see TIGR02602). PEP-CTERM proteins frequently are poorly conserved, Ser/Thr-rich proteins and may become extensively modified proteinaceous constituents of extracellular material in bacterial biofilms.
Probab=34.42  E-value=48  Score=20.28  Aligned_cols=12  Identities=33%  Similarity=0.329  Sum_probs=4.3

Q ss_pred             HHHHHHHHHHHH
Q psy1807         543 LFLLILLAIIYA  554 (618)
Q Consensus       543 l~l~vl~~~~~~  554 (618)
                      +++++++++..+
T Consensus         9 ll~~g~~~~~~r   20 (26)
T TIGR02595         9 LLLLGLGFLLLR   20 (26)
T ss_pred             HHHHHHHHHHHh
Confidence            333333333333


No 108
>PF07010 Endomucin:  Endomucin;  InterPro: IPR010740 This family consists of several mammalian endomucin proteins. Endomucin is an early endothelial-specific antigen that is also expressed on putative hematopoietic progenitor cells.
Probab=33.13  E-value=1.2e+02  Score=29.27  Aligned_cols=49  Identities=16%  Similarity=0.297  Sum_probs=31.5

Q ss_pred             HHHHHHHHHHHHHhcCCCCCCCccccccccceeceeEeeeeeccCCCCcc
Q psy1807         545 LLILLAIIYARLRKATPVKNTENIEGAVRKEEQTAAVMSYASLSDGGGDN  594 (618)
Q Consensus       545 l~vl~~~~~~~~r~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  594 (618)
                      ++++|+-+|+-.||+... .+.+....+..+.++..|++--.++-+-||.
T Consensus       203 ~vf~LvgLyr~C~k~dPg-~p~~g~~qpqsdke~vklltvkt~s~e~ge~  251 (259)
T PF07010_consen  203 SVFTLVGLYRMCWKTDPG-TPENGPDQPQSDKESVKLLTVKTISHESGEH  251 (259)
T ss_pred             HHHHHHHHHHHhhcCCCC-CcccCCCCCCccccceeEEEEEecccCCccc
Confidence            335555666666676665 3455555555567777888887777777753


No 109
>KOG4564|consensus
Probab=32.64  E-value=45  Score=36.25  Aligned_cols=38  Identities=24%  Similarity=0.510  Sum_probs=27.0

Q ss_pred             cccceEEEEcCCCcee---ccCCcceEEeccCCeeeCCCCc
Q psy1807         415 THGSVAEYTCDDDYWL---EAGAMRRQVCMRDGKWSGDTPS  452 (618)
Q Consensus       415 ~~g~~~~~~C~~Gy~l---~~~G~~~~~C~~~G~Ws~~~p~  452 (618)
                      ..|+.++..|.+.+..   .+.|...+.|..||+|+..++.
T Consensus        78 ~~G~~v~~~CP~yf~~~~~~~~~~v~R~C~~~G~W~~~~~~  118 (473)
T KOG4564|consen   78 PAGTLVTVPCPDYFPGFSYSTSGNVTRNCTSNGTWSERPPS  118 (473)
T ss_pred             CCCceEEecCccccCCCcccCCCceEeecCCCCccCCCCCc
Confidence            3467777788755432   2348889999999999976553


No 110
>PF08114 PMP1_2:  ATPase proteolipid family;  InterPro: IPR012589 This family consists of small proteolipids associated with the plasma membrane H+ ATPase. Two proteolipids (PMP1 and PMP2) are associated with the ATPase and both genes are similarly expressed in the wild-type strain of yeast. No modification of the level of transcription of one PMP gene is detected in a strain deleted of the other. Though both proteolipids show similarity with other small proteolipids associated with other cation -transporting ATPases, their functions remain unclear [].
Probab=32.05  E-value=46  Score=22.69  Aligned_cols=15  Identities=20%  Similarity=0.386  Sum_probs=7.6

Q ss_pred             HHHHHHHHHHHhcCC
Q psy1807         547 ILLAIIYARLRKATP  561 (618)
Q Consensus       547 vl~~~~~~~~r~k~~  561 (618)
                      ++.+++|+++..|++
T Consensus        24 iva~~iYRKw~aRkr   38 (43)
T PF08114_consen   24 IVALFIYRKWQARKR   38 (43)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            444455665554444


No 111
>PHA03265 envelope glycoprotein D; Provisional
Probab=30.91  E-value=42  Score=34.42  Aligned_cols=8  Identities=13%  Similarity=0.347  Sum_probs=3.1

Q ss_pred             ccccceEE
Q psy1807         528 AQTLGYMI  535 (618)
Q Consensus       528 ~~~~~~~~  535 (618)
                      ++..+..+
T Consensus       347 ~~~~g~~i  354 (402)
T PHA03265        347 STFVGISV  354 (402)
T ss_pred             CcccceEE
Confidence            33334333


No 112
>PHA03242 envelope glycoprotein M; Provisional
Probab=28.16  E-value=52  Score=35.15  Aligned_cols=27  Identities=33%  Similarity=0.590  Sum_probs=16.7

Q ss_pred             CCCCcchhhchhhhccCChhhhhhhhccc
Q psy1807         589 DGGGDNIYENVAENTYDNPYETALAFAEW  617 (618)
Q Consensus       589 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  617 (618)
                      ++.+|-|||.+++..-++.|  +-+.++|
T Consensus       402 ~~~~e~iYd~v~~~~~~~~~--~~~~~~~  428 (428)
T PHA03242        402 STDGEPIYDEVAPDEDEVLY--ARIQHEW  428 (428)
T ss_pred             ccCCCccccccCCCcchhHH--HhhccCC
Confidence            36677999999854444444  4444455


No 113
>PF12606 RELT:  Tumour necrosis factor receptor superfamily member 19;  InterPro: IPR022248 The members of tumor necrosis factor receptor (TNFR) superfamily have been designated as the "guardians of the immune system" due to their roles in immune cell proliferation, differentiation, activation, and death (apoptosis).  RELT (receptor expressed in lymphoid tissues) is a member of the TNFR superfamily. The messenger RNA of RELT is especially abundant in hematologic tissues such as spleen, lymph node, and peripheral blood leukocytes as well as in leukemias and lymphomas. RELT is able to activate the NF-kappaB pathway and selectively binds tumor necrosis factor receptor-associated factor 1 []. RELT like proteins 1 and 2 (RELL1 and RELL2) are two RELT homologues that bind to RELT. The expression of RELL1 at the mRNA level is ubiquitous, whereas expression of RELL2 mRNA is more restricted to particular tissues [].
Probab=27.86  E-value=1.3e+02  Score=21.81  Aligned_cols=24  Identities=21%  Similarity=0.227  Sum_probs=12.9

Q ss_pred             HHHHHHHHHHHHHHHHHhcCCCCC
Q psy1807         541 VVLFLLILLAIIYARLRKATPVKN  564 (618)
Q Consensus       541 ~~l~l~vl~~~~~~~~r~k~~~~~  564 (618)
                      .+++++++|++..+...||..-|.
T Consensus         8 ~i~iv~~lLg~~I~~~~K~ygYkh   31 (50)
T PF12606_consen    8 SIFIVMGLLGLSICTTLKAYGYKH   31 (50)
T ss_pred             HHHHHHHHHHHHHHHHhhcccccc
Confidence            344455555555565555555544


No 114
>PF12877 DUF3827:  Domain of unknown function (DUF3827);  InterPro: IPR024606 The function of the proteins in this entry is not currently known, but one of the human proteins (Q9HCM3 from SWISSPROT) has been implicated in pilocytic astrocytomas [, , ]. In the majority of cases of pilocytic astrocytomas a tandem duplication produces an in-frame fusion of the gene encoding this protein and the BRAF oncogene. The resulting fusion protein has constitutive BRAF kinase activity and is capable of transforming cells. 
Probab=27.70  E-value=91  Score=34.93  Aligned_cols=40  Identities=25%  Similarity=0.310  Sum_probs=17.3

Q ss_pred             ccccceEEeehHHH-HHHHHHHHHHHHH-HHHhcCCCCCCCc
Q psy1807         528 AQTLGYMISVGAGV-VLFLLILLAIIYA-RLRKATPVKNTEN  567 (618)
Q Consensus       528 ~~~~~~~~~~~~~~-~l~l~vl~~~~~~-~~r~k~~~~~~~~  567 (618)
                      ......||+++..+ +++++++++++++ -.|+++..+++..
T Consensus       264 s~~~NlWII~gVlvPv~vV~~Iiiil~~~LCRk~K~eFqpDa  305 (684)
T PF12877_consen  264 SPPNNLWIIAGVLVPVLVVLLIIIILYWKLCRKNKLEFQPDA  305 (684)
T ss_pred             CCCCCeEEEehHhHHHHHHHHHHHHHHHHHhcccccCCCchh
Confidence            33344555544433 3333333334443 4455555555443


No 115
>PF07979 Intimin_C:  Intimin C-type lectin domain;  InterPro: IPR013117 This domain is found at the C terminus of intimin. Its structure has been solved and shown to have a C-lectin type of structure []. Intimin is a bacterial adhesion molecule involved in intimate attachment of enteropathogenic and enterohemorrhagic Escherichia coli to mammalian host cells. Intimin targets the translocated intimin receptor (Tir), which is exported by the bacteria and integrated into the host cell plasma membrane.; GO: 0005488 binding, 0009405 pathogenesis, 0009986 cell surface; PDB: 1CWV_A 2ZQK_B 2ZWK_C 1F02_I 1E5U_I 1F00_I 3NCX_B 3NCW_D.
Probab=27.67  E-value=42  Score=28.07  Aligned_cols=31  Identities=19%  Similarity=0.306  Sum_probs=24.1

Q ss_pred             CCCCHHHHHHHHHHcCCeecccCChHHHHHHHH
Q psy1807         100 KGASFADARSKCQASGGDLVHGFRGITNTFLLG  132 (618)
Q Consensus       100 ~~~tw~~A~~~C~~~g~~L~~i~s~~e~~fl~~  132 (618)
                      .+++|.+|...|+..+|.|.+-  ++|+.-|..
T Consensus        11 ~~~~Y~~A~~~C~~~s~~LpsS--~~~L~~vy~   41 (101)
T PF07979_consen   11 SRVTYSEAESICQNNSGRLPSS--QSELKDVYN   41 (101)
T ss_dssp             CEETHHHHHHHTTTTCCESBSS--HHHHHHHHH
T ss_pred             ceEeHHHHHHHHHhccccCccc--HHHHHHHHH
Confidence            3689999999999999999974  455544443


No 116
>PF15345 TMEM51:  Transmembrane protein 51
Probab=27.49  E-value=1.1e+02  Score=29.73  Aligned_cols=25  Identities=36%  Similarity=0.525  Sum_probs=14.4

Q ss_pred             ehHHHHHHHHHHHHHHHHHHHhcCC
Q psy1807         537 VGAGVVLFLLILLAIIYARLRKATP  561 (618)
Q Consensus       537 ~~~~~~l~l~vl~~~~~~~~r~k~~  561 (618)
                      |+++++|+||.+++-|.-|+|++..
T Consensus        65 VG~Gv~LLLLSICL~IR~KRr~rq~   89 (233)
T PF15345_consen   65 VGSGVALLLLSICLSIRDKRRRRQG   89 (233)
T ss_pred             hhHHHHHHHHHHHHHHHHHHHHhhc
Confidence            4556666666666665554444443


No 117
>PF06523 DUF1106:  Protein of unknown function (DUF1106);  InterPro: IPR009490 This family consists of several hypothetical bacterial proteins found in Escherichia coli and Citrobacter rodentium. The function of this family is unknown.
Probab=26.29  E-value=18  Score=27.83  Aligned_cols=22  Identities=18%  Similarity=0.360  Sum_probs=19.4

Q ss_pred             chhhchhhhccCChhhhhhhhc
Q psy1807         594 NIYENVAENTYDNPYETALAFA  615 (618)
Q Consensus       594 ~~~~~~~~~~~~~~~~~~~~~~  615 (618)
                      -|+|.++|.++|+|-|+++|..
T Consensus        69 yi~dklkeifpde~iefsieye   90 (91)
T PF06523_consen   69 YIFDKLKEIFPDEAIEFSIEYE   90 (91)
T ss_pred             hHHHHHHHhCCCCceEEEEEec
Confidence            6899999999999999998864


No 118
>PF04689 S1FA:  DNA binding protein S1FA;  InterPro: IPR006779  S1FA is an unusual small plant peptide of only 70 amino acids with a basic domain which contains a nuclear localization signal and a putative DNA binding helix. S1FA is highly conserved between dicotyledonous and monocotyledonous plants and may be a DNA-binding protein that specifically recognises the negative promoter element S1F [].; GO: 0003677 DNA binding, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=26.19  E-value=1.7e+02  Score=22.17  Aligned_cols=24  Identities=29%  Similarity=0.422  Sum_probs=13.1

Q ss_pred             eehHHHHHHHHHHHHHHHHHHHhc
Q psy1807         536 SVGAGVVLFLLILLAIIYARLRKA  559 (618)
Q Consensus       536 ~~~~~~~l~l~vl~~~~~~~~r~k  559 (618)
                      .++++++++.++..+++|...++.
T Consensus        18 lvV~g~ll~flvGnyvlY~Yaqk~   41 (69)
T PF04689_consen   18 LVVAGLLLVFLVGNYVLYVYAQKT   41 (69)
T ss_pred             ehHHHHHHHHHHHHHHHHHHHhhc
Confidence            355555555555555555554443


No 119
>PF01708 Gemini_mov:  Geminivirus putative movement protein ;  InterPro: IPR002621 This family consists of putative movement proteins from Maize streak virus and Wheat dwarf virus [].; GO: 0046740 spread of virus in host, cell to cell, 0016021 integral to membrane
Probab=26.13  E-value=1.2e+02  Score=24.72  Aligned_cols=12  Identities=42%  Similarity=0.476  Sum_probs=5.6

Q ss_pred             hcCCCCCCCccc
Q psy1807         558 KATPVKNTENIE  569 (618)
Q Consensus       558 ~k~~~~~~~~~~  569 (618)
                      |.++.++|.++.
T Consensus        67 KAkrqrsTeEig   78 (91)
T PF01708_consen   67 KAKRQRSTEEIG   78 (91)
T ss_pred             eeccCCceeeee
Confidence            444444455443


No 120
>PF13268 DUF4059:  Protein of unknown function (DUF4059)
Probab=25.81  E-value=55  Score=25.37  Aligned_cols=25  Identities=20%  Similarity=0.125  Sum_probs=16.1

Q ss_pred             HHHHHHHHHHHHHHHHHHHhcCCCC
Q psy1807         539 AGVVLFLLILLAIIYARLRKATPVK  563 (618)
Q Consensus       539 ~~~~l~l~vl~~~~~~~~r~k~~~~  563 (618)
                      .+.++++++.++|+.+|.++|+.+.
T Consensus        15 ls~i~V~~~~~~wi~~Ra~~~~DKT   39 (72)
T PF13268_consen   15 LSSILVLLVSGIWILWRALRKKDKT   39 (72)
T ss_pred             HHHHHHHHHHHHHHHHHHHHcCCCc
Confidence            3445555677777777777766653


No 121
>PF08374 Protocadherin:  Protocadherin;  InterPro: IPR013585 The structure of protocadherins is similar to that of classic cadherins (IPR002126 from INTERPRO), but they also have some unique features associated with the cytoplasmic domains. They are expressed in a variety of organisms and are found in high concentrations in the brain where they seem to be localised mainly at cell-cell contact sites. Their expression seems to be developmentally regulated []. 
Probab=25.06  E-value=90  Score=29.94  Aligned_cols=25  Identities=4%  Similarity=0.234  Sum_probs=14.5

Q ss_pred             ccceEEeehHHHHHHHHHHHHHHHH
Q psy1807         530 TLGYMISVGAGVVLFLLILLAIIYA  554 (618)
Q Consensus       530 ~~~~~~~~~~~~~l~l~vl~~~~~~  554 (618)
                      ..+.+.++++++|+++++.++...+
T Consensus        40 ~iaiVAG~~tVILVI~i~v~vR~CR   64 (221)
T PF08374_consen   40 MIAIVAGIMTVILVIFIVVLVRYCR   64 (221)
T ss_pred             eeeeecchhhhHHHHHHHHHHHHHh
Confidence            3445555666666666666666444


No 122
>PF05545 FixQ:  Cbb3-type cytochrome oxidase component FixQ;  InterPro: IPR008621 This family consists of several Cbb3-type cytochrome oxidase components (FixQ/CcoQ). FixQ is found in nitrogen fixing bacteria. Since nitrogen fixation is an energy-consuming process, effective symbioses depend on operation of a respiratory chain with a high affinity for O2, closely coupled to ATP production. This requirement is fulfilled by a special three-subunit terminal oxidase (cytochrome terminal oxidase cbb3), which was first identified in Bradyrhizobium japonicum as the product of the fixNOQP operon [].
Probab=24.90  E-value=84  Score=22.36  Aligned_cols=15  Identities=13%  Similarity=0.319  Sum_probs=6.6

Q ss_pred             HHHHHHHHHHHHhcC
Q psy1807         546 LILLAIIYARLRKAT  560 (618)
Q Consensus       546 ~vl~~~~~~~~r~k~  560 (618)
                      +++++++++-.|+|+
T Consensus        20 ~~F~gi~~w~~~~~~   34 (49)
T PF05545_consen   20 VFFIGIVIWAYRPRN   34 (49)
T ss_pred             HHHHHHHHHHHcccc
Confidence            344444444444443


No 123
>PF06247 Plasmod_Pvs28:  Plasmodium ookinete surface protein Pvs28;  InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=24.84  E-value=54  Score=30.72  Aligned_cols=52  Identities=23%  Similarity=0.377  Sum_probs=28.4

Q ss_pred             CEEEEEeCCCceecCCCeEEeCCCCCcCCCCCcccccccCCCCCccccEEEecccccCceeeecCCCCcE
Q psy1807         300 AVAIYACHENYTLIGETRRVCGDGGKWNGTEPQCLFDWCAEPPQISGGIVTTSGRRTGSVATYSCEPGFI  369 (618)
Q Consensus       300 ~~~~~~C~~Gy~l~G~~~~~C~~~G~Ws~~~p~C~~~~C~~p~~~~~g~~~~~~~~~g~~~~~~C~~Gy~  369 (618)
                      ......|..||.+...   .|.        +..|....|+      +|.-........+. ..+|+-|+.
T Consensus        68 ~~~~C~C~~gY~~~~~---vCv--------p~~C~~~~Cg------~GKCI~d~~~~~~~-~CSC~IGkV  119 (197)
T PF06247_consen   68 RAYKCDCINGYILKQG---VCV--------PNKCNNKDCG------SGKCILDPDNPNNP-TCSCNIGKV  119 (197)
T ss_dssp             TSEEEEE-TTEEESSS---SEE--------EGGGSS---T------TEEEEEEEGGGSEE-EEEE-TEEE
T ss_pred             eeEEEecccCceeeCC---eEc--------hhhcCceecC------CCeEEecCCCCCCc-eeEeeeceE
Confidence            4566778999988422   333        3455555554      35555444444444 789999987


No 124
>PTZ00046 rifin; Provisional
Probab=24.76  E-value=48  Score=34.48  Aligned_cols=23  Identities=26%  Similarity=0.505  Sum_probs=16.0

Q ss_pred             HHHHHHHHHHHHHHHHhcCCCCC
Q psy1807         542 VLFLLILLAIIYARLRKATPVKN  564 (618)
Q Consensus       542 ~l~l~vl~~~~~~~~r~k~~~~~  564 (618)
                      +++++.+.+++|..||-||++|-
T Consensus       324 IvVIVLIMvIIYLILRYRRKKKM  346 (358)
T PTZ00046        324 IVVIVLIMVIIYLILRYRRKKKM  346 (358)
T ss_pred             HHHHHHHHHHHHHHHHhhhcchh
Confidence            34444555678889999988764


No 125
>PF01299 Lamp:  Lysosome-associated membrane glycoprotein (Lamp);  InterPro: IPR002000 Lysosome-associated membrane glycoproteins (lamp) [] are integral membrane proteins, specific to lysosomes, and whose exact biological function is not yet clear. Structurally, the lamp proteins consist of two internally homologous lysosome-luminal domains separated by a proline-rich hinge region; at the C-terminal extremity there is a transmembrane region (TM) followed by a very short cytoplasmic tail (C). In each of the duplicated domains, there are two conserved disulphide bonds. This structure is schematically represented in the figure below.   +-----+ +-----+ +-----+ +-----+ | | | | | | | | xCxxxxxCxxxxxxxxxxxxCxxxxxCxxxxxxxxxCxxxxxCxxxxxxxxxxxxCxxxxxCxxxxxxxx +--------------------------++Hinge++--------------------------++TM++C+  In mammals, there are two closely related types of lamp: lamp-1 and lamp-2, which form major components of the lysosome membrane. In chicken lamp-1 is known as LEP100.  Also included in this entry is the macrophage protein CD68 (or macrosialin) [] is a heavily glycosylated integral membrane protein whose structure consists of a mucin-like domain followed by a proline-rich hinge; a single lamp-like domain; a transmembrane region and a short cytoplasmic tail.   Similar to CD68, mammalian lamp-3, which is expressed in lymphoid organs, dendritic cells and in lung, contains all the C-terminal regions but lacks the N-terminal lamp-like region []. In a lamp-family protein from nematodes [] only the part C-terminal to the hinge is conserved. ; GO: 0016020 membrane
Probab=24.75  E-value=24  Score=36.33  Aligned_cols=13  Identities=31%  Similarity=0.536  Sum_probs=5.1

Q ss_pred             HHHHHHHHHHHhc
Q psy1807         547 ILLAIIYARLRKA  559 (618)
Q Consensus       547 vl~~~~~~~~r~k  559 (618)
                      ||+..++.|||.+
T Consensus       288 vLiaYli~Rrr~~  300 (306)
T PF01299_consen  288 VLIAYLIGRRRSR  300 (306)
T ss_pred             HHHhheeEecccc
Confidence            3333333344433


No 126
>TIGR01477 RIFIN variant surface antigen, rifin family. This model represents the rifin branch of the rifin/stevor family (pfam02009) of predicted variant surface antigens as found in Plasmodium falciparum. This model is based on a set of rifin sequences kindly provided by Matt Berriman from the Sanger Center. This is a global model and assesses a penalty for incomplete sequence. Additional fragmentary sequences may be found with the fragment model and a cutoff of 20 bits.
Probab=24.44  E-value=49  Score=34.33  Aligned_cols=23  Identities=30%  Similarity=0.550  Sum_probs=16.1

Q ss_pred             HHHHHHHHHHHHHHHHhcCCCCC
Q psy1807         542 VLFLLILLAIIYARLRKATPVKN  564 (618)
Q Consensus       542 ~l~l~vl~~~~~~~~r~k~~~~~  564 (618)
                      +++++.+.+++|..||-||++|-
T Consensus       319 IvvIVLIMvIIYLILRYRRKKKM  341 (353)
T TIGR01477       319 ILIIVLIMVIIYLILRYRRKKKM  341 (353)
T ss_pred             HHHHHHHHHHHHHHHHhhhcchh
Confidence            34444555688889999998764


No 127
>KOG0196|consensus
Probab=24.31  E-value=1.2e+02  Score=35.19  Aligned_cols=29  Identities=24%  Similarity=0.446  Sum_probs=18.4

Q ss_pred             EeCCCceec--CCCeEEeCCCCCcCCCCCcc
Q psy1807         305 ACHENYTLI--GETRRVCGDGGKWNGTEPQC  333 (618)
Q Consensus       305 ~C~~Gy~l~--G~~~~~C~~~G~Ws~~~p~C  333 (618)
                      +|-..-.-.  +.....|..+|.|--+.-.|
T Consensus       231 tCV~nA~~~~~~~p~m~C~~dGeWlvpiG~C  261 (996)
T KOG0196|consen  231 SCVPNAEEEPVPVPRMYCSGDGEWLVPIGGC  261 (996)
T ss_pred             cccCCCccCCCCCceeEEcCCCcEEEEcCce
Confidence            465543333  44678899999997655544


No 128
>PTZ00382 Variant-specific surface protein (VSP); Provisional
Probab=24.28  E-value=50  Score=27.51  Aligned_cols=7  Identities=43%  Similarity=0.644  Sum_probs=3.4

Q ss_pred             cCCCcee
Q psy1807         424 CDDDYWL  430 (618)
Q Consensus       424 C~~Gy~l  430 (618)
                      |.+||..
T Consensus         9 C~~g~~~   15 (96)
T PTZ00382          9 CDSDKKP   15 (96)
T ss_pred             CCCCCcc
Confidence            4455543


No 129
>PF14851 FAM176:  FAM176 family
Probab=24.17  E-value=20  Score=32.62  Aligned_cols=27  Identities=30%  Similarity=0.359  Sum_probs=15.7

Q ss_pred             ccccceEEeehHHHHHHHHHHHHHHHH
Q psy1807         528 AQTLGYMISVGAGVVLFLLILLAIIYA  554 (618)
Q Consensus       528 ~~~~~~~~~~~~~~~l~l~vl~~~~~~  554 (618)
                      ...|.+.++|-++++|.|.+|.+-+.+
T Consensus        21 ~~aLYFv~gVC~GLlLtLcllV~risc   47 (153)
T PF14851_consen   21 RFALYFVSGVCAGLLLTLCLLVIRISC   47 (153)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHhhhee
Confidence            444555566666666666566555555


No 130
>PF08391 Ly49:  Ly49-like protein, N-terminal region;  InterPro: IPR013600 The sequences making up this entry are annotated as, or are similar to, Ly49 receptors (e.g. P20937 from SWISSPROT). These are type II transmembrane receptors expressed by mouse natural killer (NK) cells. They are classified as being activating (e.g.Ly49D and H) or inhibitory (e.g. Ly49A and G), depending on their effect on NK cell function []. They are members of the C-type lectin receptor superfamily [], and in fact in many family members this region is found immediately N-terminal to a lectin C-type domain (IPR001304 from INTERPRO). ; PDB: 1QO3_D 3C8J_D 1P4L_D 3C8K_D 3G8K_B 1JA3_B 3CAD_A 3G8L_A.
Probab=24.01  E-value=44  Score=29.01  Aligned_cols=13  Identities=23%  Similarity=0.608  Sum_probs=12.4

Q ss_pred             ccceeecceEEEE
Q psy1807          84 AEITSFDRNCYEF   96 (618)
Q Consensus        84 ~~w~~~~~~CY~~   96 (618)
                      .-|..||-+||+|
T Consensus       107 ~~W~cyg~kCYYF  119 (119)
T PF08391_consen  107 KHWFCYGVKCYYF  119 (119)
T ss_dssp             CEEEEETTEEEEE
T ss_pred             CceeeeCceeeeC
Confidence            8999999999998


No 131
>PF07589 VPEP:  PEP-CTERM motif;  InterPro: IPR013424  This entry describes a 25-residue region including an invariant Pro-Glu-Pro (PEP) motif, a thirteen residue strongly hydrophobic sequence likely to span the membrane, and a five-residue strongly basic motif that often contains four Arg residues. In most cases, this motif is found within nine residues of the C-terminal end of the protein. Proteins containing this motif typically have signal sequences at the N terminus [].
Probab=23.31  E-value=1e+02  Score=18.68  Aligned_cols=12  Identities=33%  Similarity=0.467  Sum_probs=4.8

Q ss_pred             HHHHHHHHHHHH
Q psy1807         542 VLFLLILLAIIY  553 (618)
Q Consensus       542 ~l~l~vl~~~~~  553 (618)
                      .++++.++++..
T Consensus         9 ~l~~~gl~~l~~   20 (25)
T PF07589_consen    9 ALLGLGLLGLAF   20 (25)
T ss_pred             HHHHHHHHHHHH
Confidence            334444444333


No 132
>PF10577 UPF0560:  Uncharacterised protein family UPF0560;  InterPro: IPR018890  This family of proteins has no known function. 
Probab=21.49  E-value=77  Score=36.50  Aligned_cols=18  Identities=17%  Similarity=0.300  Sum_probs=9.6

Q ss_pred             ccccCCEEEEEc-CCCcee
Q psy1807         236 NTKLGSTISYAC-PEGNML  253 (618)
Q Consensus       236 ~~~~G~~~~y~C-~~Gy~l  253 (618)
                      .|..|..+.++= ..||..
T Consensus        46 ~yrlg~~l~v~a~k~gyv~   64 (807)
T PF10577_consen   46 PYRLGTWLIVTASKDGYVP   64 (807)
T ss_pred             eccCCCeEEEEEecCCccC
Confidence            566666655543 345543


No 133
>PRK11056 hypothetical protein; Provisional
Probab=20.92  E-value=1.2e+02  Score=26.10  Aligned_cols=26  Identities=19%  Similarity=0.098  Sum_probs=17.9

Q ss_pred             hHHHHHHHHHHHHHHHHHHHhcCCCC
Q psy1807         538 GAGVVLFLLILLAIIYARLRKATPVK  563 (618)
Q Consensus       538 ~~~~~l~l~vl~~~~~~~~r~k~~~~  563 (618)
                      -...+++.++|++|+.+|++.++++.
T Consensus        93 NF~p~il~~~L~~Wi~~kl~~~~~~~  118 (120)
T PRK11056         93 NFFPAVLSVILVFWIGRKLRNRKKKV  118 (120)
T ss_pred             chHHHHHHHHHHHHHHHHHhcccccc
Confidence            33445666788889998887766543


No 134
>KOG0994|consensus
Probab=20.23  E-value=2.4e+02  Score=34.04  Aligned_cols=68  Identities=24%  Similarity=0.478  Sum_probs=40.3

Q ss_pred             CEEEEEcCCCceecCCceeEecCCcccCC--CCceeeecCCCCCccCC-eeEEecCC------CcccCCEEEEEeCCCce
Q psy1807         241 STISYACPEGNMLVGSATRTCKEGFWTGV--APTCQYVDCGKLEHIEH-GTVTLETT------RTTHGAVAIYACHENYT  311 (618)
Q Consensus       241 ~~~~y~C~~Gy~l~g~~~~~C~~g~Ws~~--~p~C~~~~C~~~~~~~n-g~~~~~~~------~~~~g~~~~~~C~~Gy~  311 (618)
                      ..+.-.|.+||  .|+..-.|-++.|..+  ..+|++-.|...-.+.. |......+      ....|+.+.+ |.+||.
T Consensus       932 ~~ivC~C~~GY--~G~RCe~CA~~~fGnP~~GGtCq~CeC~~NiD~~d~~aCD~~TG~CLkCL~hTeG~hCe~-Ck~Gf~ 1008 (1758)
T KOG0994|consen  932 QQIVCHCQEGY--SGSRCEICADNHFGNPSEGGTCQKCECSNNIDLYDPGACDVATGACLKCLYHTEGDHCEH-CKDGFY 1008 (1758)
T ss_pred             cceeeecccCc--cccchhhhcccccCCcccCCccccccccCCcCccCCCccchhhchhhhhhhcccccchhh-ccccch
Confidence            44667899999  5777778888888643  35788777764321111 11111001      2345677766 888885


Done!