Query         psy592
Match_columns 339
No_of_seqs    279 out of 1723
Neff          7.8 
Searched_HMMs 46136
Date          Fri Aug 16 21:49:32 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy592.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/592hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PF01049 Cadherin_C:  Cadherin  100.0   7E-35 1.5E-39  246.1   2.8  111  228-338    13-126 (149)
  2 KOG3514|consensus               99.8 2.2E-20 4.8E-25  190.0  12.5  166    1-174   587-762 (1591)
  3 KOG4289|consensus               99.8 2.3E-20 5.1E-25  193.5  10.6  152    1-162  1469-1621(2531)
  4 KOG3516|consensus               99.7 1.2E-16 2.7E-21  165.5  14.3  151    1-162   925-1097(1306)
  5 KOG4289|consensus               99.6   4E-16 8.7E-21  162.5  10.0  126   27-162  1233-1401(2531)
  6 KOG1219|consensus               99.6   2E-14 4.3E-19  154.7  17.8  137   13-162  3621-3766(4289)
  7 KOG3514|consensus               99.6 2.3E-14   5E-19  146.7  11.8  167    2-175   985-1164(1591)
  8 KOG3516|consensus               99.2 3.9E-10 8.5E-15  117.9  14.3   75   98-175   810-890 (1306)
  9 cd00110 LamG Laminin G domain;  99.1 5.7E-10 1.2E-14   93.6  11.0   81   78-162     2-83  (151)
 10 PF00054 Laminin_G_1:  Laminin   98.9 3.9E-09 8.5E-14   87.4   8.7   68  105-175     1-73  (131)
 11 smart00282 LamG Laminin G doma  98.8 3.4E-08 7.4E-13   81.7   9.7   62   98-162     2-65  (135)
 12 KOG1219|consensus               98.7 1.9E-08   4E-13  110.1   5.6   77    1-81   3825-3909(4289)
 13 PF02210 Laminin_G_2:  Laminin   98.5   6E-07 1.3E-11   72.7   8.9   55  105-162     1-57  (128)
 14 PF00008 EGF:  EGF-like domain   98.4 1.1E-07 2.4E-12   59.0   0.9   31   36-70      1-32  (32)
 15 smart00179 EGF_CA Calcium-bind  98.0 9.7E-06 2.1E-10   51.9   3.9   35   34-72      3-39  (39)
 16 cd00054 EGF_CA Calcium-binding  97.8 3.2E-05 6.9E-10   48.9   4.0   35   34-72      3-38  (38)
 17 cd00053 EGF Epidermal growth f  97.6 8.3E-05 1.8E-09   46.2   3.8   29   44-72      7-36  (36)
 18 smart00181 EGF Epidermal growt  97.5 0.00014   3E-09   45.5   3.7   28   44-72      7-35  (35)
 19 PF07974 EGF_2:  EGF-like domai  97.2 0.00035 7.6E-09   43.1   3.0   26   44-71      7-32  (32)
 20 PF07645 EGF_CA:  Calcium-bindi  97.2 0.00017 3.7E-09   47.4   1.5   33   33-67      2-34  (42)
 21 PF12661 hEGF:  Human growth fa  97.0 0.00021 4.6E-09   34.9   0.3   13   59-71      1-13  (13)
 22 PF12947 EGF_3:  EGF domain;  I  96.0  0.0027 5.9E-08   40.3   1.0   27   43-69      6-32  (36)
 23 smart00210 TSPN Thrombospondin  95.9   0.071 1.5E-06   46.6   9.8   73   97-175    52-128 (184)
 24 PHA02887 EGF-like protein; Pro  95.5   0.011 2.3E-07   47.2   2.6   30   44-74     93-124 (126)
 25 PHA03099 epidermal growth fact  95.2   0.021 4.6E-07   46.3   3.4   30   44-74     52-83  (139)
 26 KOG3509|consensus               93.8   0.051 1.1E-06   58.2   3.4   70    1-75    375-444 (964)
 27 PF12955 DUF3844:  Domain of un  93.3   0.044 9.6E-07   43.1   1.6   45   29-75      1-63  (103)
 28 PF12662 cEGF:  Complement Clr-  91.4    0.14   3E-06   29.3   1.6   16   57-72      1-20  (24)
 29 PF12273 RCR:  Chitin synthesis  87.5    0.61 1.3E-05   38.4   3.3   11  248-258    72-82  (130)
 30 KOG1214|consensus               87.3    0.49 1.1E-05   49.7   3.1   35   34-72    828-864 (1289)
 31 PF00354 Pentaxin:  Pentaxin fa  87.0       5 0.00011   35.4   9.1   74   83-162    11-87  (195)
 32 PF14670 FXa_inhibition:  Coagu  85.3    0.46 9.9E-06   30.0   1.1   19   49-67     10-28  (36)
 33 KOG4260|consensus               84.9    0.62 1.3E-05   42.8   2.2   33   43-75    150-185 (350)
 34 smart00159 PTX Pentraxin / C-r  84.6      12 0.00026   33.1  10.4   78   78-162     9-93  (206)
 35 PF00053 Laminin_EGF:  Laminin   82.5    0.82 1.8E-05   30.7   1.5   24   50-75     12-35  (49)
 36 PF04863 EGF_alliinase:  Alliin  82.4     0.4 8.7E-06   33.0  -0.0   32   44-75     18-53  (56)
 37 cd01475 vWA_Matrilin VWA_Matri  82.1     1.3 2.9E-05   39.6   3.3   37   27-68    182-218 (224)
 38 KOG1217|consensus               81.4     1.4   3E-05   43.2   3.4   36   33-71    271-306 (487)
 39 PF05568 ASFV_J13L:  African sw  81.3     4.2   9E-05   33.9   5.5   51  196-249    32-82  (189)
 40 PF12946 EGF_MSP1_1:  MSP1 EGF   80.1    0.77 1.7E-05   29.1   0.7   24   44-67      6-30  (37)
 41 PF00954 S_locus_glycop:  S-loc  79.7       2 4.3E-05   34.1   3.1   32   33-68     77-108 (110)
 42 PF06679 DUF1180:  Protein of u  78.8     3.2   7E-05   35.6   4.2   18  243-260   145-162 (163)
 43 KOG1225|consensus               78.4     1.6 3.5E-05   44.2   2.7   27   44-74    317-343 (525)
 44 PF15330 SIT:  SHP2-interacting  77.2     5.9 0.00013   31.5   5.1   15  208-222    13-27  (107)
 45 PF01102 Glycophorin_A:  Glycop  77.0     3.8 8.3E-05   33.4   4.0    9  213-221    85-93  (122)
 46 cd00152 PTX Pentraxins are pla  76.5      35 0.00076   29.9  10.5   80   77-162     8-93  (201)
 47 TIGR01478 STEVOR variant surfa  75.9     3.3 7.1E-05   38.5   3.7   33  191-224   258-290 (295)
 48 PF13385 Laminin_G_3:  Concanav  75.9      26 0.00057   28.0   9.0   78   80-162     6-88  (157)
 49 KOG1217|consensus               75.5     2.8   6E-05   41.1   3.5   39   34-74    170-208 (487)
 50 cd00055 EGF_Lam Laminin-type e  75.5       3 6.4E-05   28.1   2.6   19   56-74     17-35  (50)
 51 PTZ00370 STEVOR; Provisional    73.9       4 8.6E-05   38.0   3.7   31  192-223   255-285 (296)
 52 PF05399 EVI2A:  Ectropic viral  72.7     6.6 0.00014   34.9   4.6   14  196-209   128-141 (227)
 53 smart00051 DSL delta serrate l  72.3     2.9 6.3E-05   29.9   2.0   24   45-71     40-63  (63)
 54 PF02009 Rifin_STEVOR:  Rifin/s  72.1     4.1 8.8E-05   38.5   3.5   14  208-221   270-283 (299)
 55 PF15102 TMEM154:  TMEM154 prot  72.1     1.2 2.6E-05   37.2  -0.0   14  288-301   129-142 (146)
 56 KOG1226|consensus               70.4     4.1   9E-05   42.7   3.4   27   44-74    556-582 (783)
 57 PF01034 Syndecan:  Syndecan do  70.0     1.5 3.3E-05   31.3   0.1   15  209-223    26-40  (64)
 58 KOG1225|consensus               66.3     7.4 0.00016   39.6   4.1   27   45-75    256-282 (525)
 59 smart00180 EGF_Lam Laminin-typ  65.0     4.8  0.0001   26.7   1.8   18   57-74     17-34  (46)
 60 PF08693 SKG6:  Transmembrane a  62.6     1.6 3.4E-05   28.3  -0.9    9  211-219    30-38  (40)
 61 PF14575 EphA2_TM:  Ephrin type  59.7     4.5 9.7E-05   30.0   0.9   13  208-220    14-26  (75)
 62 KOG4260|consensus               57.1     6.9 0.00015   36.2   1.8   39   27-68    231-269 (350)
 63 KOG3509|consensus               56.9      13 0.00029   40.4   4.2  117   43-162   178-317 (964)
 64 PF05923 APC_crr:  APC cysteine  56.0     4.1   9E-05   23.7   0.2   21  310-330     2-25  (26)
 65 PF15050 SCIMP:  SCIMP protein   55.0      20 0.00044   28.9   4.0   11  230-240    53-63  (133)
 66 KOG1214|consensus               53.9     7.7 0.00017   41.2   1.8   36   36-73    417-454 (1289)
 67 PTZ00046 rifin; Provisional     51.6      14  0.0003   35.7   3.0   15  208-222   329-343 (358)
 68 KOG1836|consensus               51.1      12 0.00027   43.3   3.0   32   44-75    781-815 (1705)
 69 PF02973 Sialidase:  Sialidase,  50.0 1.2E+02  0.0026   26.7   8.5   46   98-143    34-82  (190)
 70 TIGR01477 RIFIN variant surfac  49.6      15 0.00033   35.3   3.0   15  208-222   324-338 (353)
 71 PF15176 LRR19-TM:  Leucine-ric  49.4      63  0.0014   25.3   5.7   10  196-205    17-26  (102)
 72 PF08374 Protocadherin:  Protoc  48.9      11 0.00024   33.7   1.7   10  196-205    41-50  (221)
 73 PF00558 Vpu:  Vpu protein;  In  44.5      33 0.00072   25.8   3.5    9  210-218    20-28  (81)
 74 PF14991 MLANA:  Protein melan-  42.9       7 0.00015   31.2  -0.4   12  240-251    72-83  (118)
 75 PF12191 stn_TNFRSF12A:  Tumour  42.0     8.5 0.00018   31.4   0.0   14  210-223    94-107 (129)
 76 PF01299 Lamp:  Lysosome-associ  41.9      13 0.00029   35.1   1.3   14  208-221   285-299 (306)
 77 PF15330 SIT:  SHP2-interacting  41.7      56  0.0012   26.0   4.6   21  203-223    11-31  (107)
 78 PF14851 FAM176:  FAM176 family  41.3      36 0.00078   28.9   3.7   11  301-311   138-148 (153)
 79 PF02439 Adeno_E3_CR2:  Adenovi  40.8      45 0.00098   21.3   3.1    6  197-202     7-12  (38)
 80 PF05393 Hum_adeno_E3A:  Human   40.3      73  0.0016   24.3   4.7   14  209-222    46-60  (94)
 81 PF05454 DAG1:  Dystroglycan (D  40.0     9.5 0.00021   35.8   0.0   16  128-143    43-58  (290)
 82 PF15347 PAG:  Phosphoprotein a  39.5      35 0.00076   33.1   3.6   15  288-302   176-190 (428)
 83 PHA03049 IMV membrane protein;  39.3      46   0.001   23.9   3.3   23  197-223     7-29  (68)
 84 PF10731 Anophelin:  Thrombin i  38.6      57  0.0012   23.0   3.6    9  240-248    32-40  (65)
 85 KOG3607|consensus               37.0      25 0.00055   37.3   2.5   28   44-74    631-658 (716)
 86 PF04272 Phospholamban:  Phosph  36.9      31 0.00068   22.8   2.0    9  197-205    37-45  (52)
 87 PF09064 Tme5_EGF_like:  Thromb  36.3      31 0.00068   21.4   1.8   17   50-67     11-27  (34)
 88 PRK00523 hypothetical protein;  35.4      76  0.0016   23.3   4.0   23  195-218     5-27  (72)
 89 PF06024 DUF912:  Nucleopolyhed  35.4     6.9 0.00015   30.7  -1.5    8  214-221    84-91  (101)
 90 TIGR00847 ccoS cytochrome oxid  35.1 1.1E+02  0.0024   20.8   4.6   11  211-221    18-28  (51)
 91 PHA02872 EFc gene family prote  34.8      23 0.00051   28.1   1.4   30  308-337    58-87  (124)
 92 PF02699 YajC:  Preprotein tran  34.7      33 0.00071   25.7   2.2   10  211-220    14-23  (82)
 93 PHA03164 hypothetical protein;  34.6      38 0.00082   25.1   2.4   23  196-218    60-82  (88)
 94 PF06365 CD34_antigen:  CD34/Po  34.2      39 0.00085   30.0   2.9   20  239-260   141-160 (202)
 95 PF02468 PsbN:  Photosystem II   33.8      42  0.0009   22.1   2.2   15  208-222    15-29  (43)
 96 PF03597 CcoS:  Cytochrome oxid  33.6 1.3E+02  0.0028   19.9   4.7   12  210-221    16-27  (45)
 97 PF14316 DUF4381:  Domain of un  33.5      56  0.0012   27.2   3.7    9  289-297    98-106 (146)
 98 TIGR01294 P_lamban phospholamb  33.0      39 0.00085   22.3   2.0    9  197-205    37-45  (52)
 99 PF01414 DSL:  Delta serrate li  32.7      14 0.00031   26.3  -0.0   12   60-71     52-63  (63)
100 TIGR00739 yajC preprotein tran  32.3      25 0.00054   26.6   1.2    8  211-218    15-22  (84)
101 PF10717 ODV-E18:  Occlusion-de  31.4      52  0.0011   24.8   2.7   10  196-205    28-37  (85)
102 PF10577 UPF0560:  Uncharacteri  31.4      49  0.0011   35.3   3.5   10  196-205   276-285 (807)
103 TIGR02595 PEP_exosort PEP-CTER  30.4      56  0.0012   18.8   2.2    9  213-221    15-23  (26)
104 PF11884 DUF3404:  Domain of un  29.0      83  0.0018   29.1   4.2   16  208-223   243-258 (262)
105 PRK05585 yajC preprotein trans  28.8      38 0.00082   26.9   1.7    7  212-218    31-37  (106)
106 PF12606 RELT:  Tumour necrosis  28.1 1.2E+02  0.0026   20.7   3.8    7  199-205     6-12  (50)
107 PHA03286 envelope glycoprotein  27.9      91   0.002   31.1   4.5    9  214-222   412-420 (492)
108 PRK03427 cell division protein  27.6      64  0.0014   30.9   3.3    8  242-249    59-66  (333)
109 PF05961 Chordopox_A13L:  Chord  26.1      87  0.0019   22.6   3.0   14  210-223    16-29  (68)
110 PF06667 PspB:  Phage shock pro  26.0 1.1E+02  0.0024   22.7   3.7    8  214-221    23-30  (75)
111 KOG3054|consensus               26.0      75  0.0016   29.1   3.3   15  208-222    14-28  (299)
112 KOG0994|consensus               25.9      73  0.0016   35.6   3.7   26   50-75    924-951 (1758)
113 COG4367 Uncharacterized protei  25.7      57  0.0012   24.9   2.1   24  287-312    54-77  (97)
114 COG3763 Uncharacterized protei  25.4 1.7E+02  0.0037   21.4   4.4   21  196-217     5-25  (71)
115 PF04689 S1FA:  DNA binding pro  25.1      63  0.0014   23.1   2.1   31  192-222    10-40  (69)
116 PF10078 DUF2316:  Uncharacteri  25.0      70  0.0015   24.6   2.5   26  285-312    53-78  (89)
117 PF00558 Vpu:  Vpu protein;  In  24.9      42 0.00091   25.3   1.3   18  206-223    19-36  (81)
118 PRK13183 psbN photosystem II r  24.8      61  0.0013   21.6   1.8   15  208-222    18-32  (46)
119 KOG1094|consensus               24.7      99  0.0021   32.3   4.2    8  211-218   407-414 (807)
120 COG4736 CcoQ Cbb3-type cytochr  24.1      89  0.0019   22.1   2.7   15  208-222    21-35  (60)
121 PRK14584 hmsS hemin storage sy  23.4   2E+02  0.0043   24.5   5.1   25  196-222    63-88  (153)
122 PF02009 Rifin_STEVOR:  Rifin/s  23.4      96  0.0021   29.4   3.7   25  196-222   263-287 (299)
123 PHA02681 ORF089 virion membran  23.1 1.8E+02  0.0038   22.0   4.2   14  208-221    15-28  (92)
124 PF07213 DAP10:  DAP10 membrane  22.9 1.3E+02  0.0029   22.5   3.6   13  209-221    49-61  (79)
125 PF03908 Sec20:  Sec20;  InterP  22.8      95  0.0021   23.6   3.0   12  208-219    80-91  (92)
126 PF06716 DUF1201:  Protein of u  22.8 1.6E+02  0.0036   19.6   3.6   23  194-218    11-33  (54)
127 TIGR03521 GldG gliding-associa  22.7      92   0.002   32.1   3.7   14  208-221   535-548 (552)
128 PRK01844 hypothetical protein;  22.7 1.8E+02  0.0039   21.4   4.1   23  196-219     5-27  (72)
129 PRK05886 yajC preprotein trans  22.7      94   0.002   24.8   3.0   10  211-220    16-25  (109)
130 PHA02902 putative IMV membrane  22.0 1.4E+02  0.0031   21.3   3.4   14  208-221    15-28  (70)
131 PF12768 Rax2:  Cortical protei  21.8 1.1E+02  0.0023   28.7   3.7   24  196-221   234-257 (281)
132 PF15431 TMEM190:  Transmembran  21.7 2.2E+02  0.0047   22.8   4.7   45  177-222    36-88  (134)
133 KOG3653|consensus               21.6 1.4E+02  0.0031   30.1   4.6   16  209-224   170-185 (534)
134 PF06247 Plasmod_Pvs28:  Plasmo  21.5      28 0.00062   30.5  -0.2   57   10-67     10-79  (197)
135 PF05545 FixQ:  Cbb3-type cytoc  21.1 1.1E+02  0.0024   20.3   2.7    7  214-220    27-33  (49)
136 KOG1834|consensus               21.0 4.3E+02  0.0094   28.0   7.8   66   97-162   366-445 (952)
137 PF11770 GAPT:  GRB2-binding ad  20.9      24 0.00052   29.7  -0.8   10  196-205    10-19  (158)
138 KOG0994|consensus               20.9      66  0.0014   35.9   2.2   16   59-74   1085-1100(1758)
139 PF04478 Mid2:  Mid2 like cell   20.5      15 0.00031   31.1  -2.2    8  214-221    71-78  (154)
140 PF11669 WBP-1:  WW domain-bind  20.4 2.8E+02  0.0062   21.6   5.3   13  235-247    57-69  (102)
141 PRK00269 zipA cell division pr  20.1 2.1E+02  0.0046   26.9   5.1   12  212-223    20-31  (293)

No 1  
>PF01049 Cadherin_C:  Cadherin cytoplasmic region;  InterPro: IPR000233 Cadherins are transmembrane glycoproteins vital in calcium-dependent cell-cell adhesion during tissue differentiation []. Cadherins cluster to form foci of homophilic binding units. A key determinant to the strength of the binding that it is mediated by cadherins is the juxtamembrane region of the cadherin. This region induces clustering and also binds to the protein p120ctn []. The cytoplasmic region is highly conserved in sequence and has been shown experimentally to regulate the cell-cell binding function of the extracellular domain of E-cadherin, possibly through interaction with the cytoskeleton []. This domain is found upstream of the cadherin domain IPR002126 from INTERPRO.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2KOH_B 3IFQ_D 1I7X_D 1I7W_D 3L6Y_F 3L6X_B.
Probab=100.00  E-value=7e-35  Score=246.11  Aligned_cols=111  Identities=50%  Similarity=0.801  Sum_probs=34.8

Q ss_pred             CCCCchhhcccccccCCCCCccCCCcccccccCCCCCCCC-CCccCCC-C-CCCCCCCCCCCCcHHHHHHHHHHHhcCCC
Q psy592          228 PGPDDDVRENIINYDDEGGGEDDMTAFDITPLQIPIGGPH-PEMNNKI-P-YGLGPMPMGVEPNVGIFIEEHKKRADADP  304 (339)
Q Consensus       228 ~~~~~d~reni~~Y~eEggGE~d~~~yd~~~L~~~~~~~~-~~~~~~~-~-~~~~~~~~~~~~~~~~~i~~~~~~ad~d~  304 (339)
                      ..++||||||||+|||||||||||+||||++|++|...+. ..+.|.. + +...+.+.....+|++||++||++||.||
T Consensus        13 ~~~~ediReNii~YddEGGGEeD~~aydi~~L~~~~~~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~i~~kl~~ad~d~   92 (149)
T PF01049_consen   13 PDEEEDIRENIINYDDEGGGEEDTEAYDISALQNPSQPRSRRDILPLIRALPRYPPPPPPDSPDIGDFISQKLQEADEDP   92 (149)
T ss_dssp             -----------------TT---------GGG--------EEEEE--TTSS---------SSGGGHHHHHHHHHHHHHT-T
T ss_pred             CCcHHHHhcCceecccCcCCcCCCCccchhhhhCccccccccccccccccccccccccCCCchHHHHHHHHHHHHcCCcc
Confidence            5688999999999999999999999999999999877510 0011100 0 00011122456799999999999999999


Q ss_pred             CCCCCCcceeeeecCCCCCcccccccccCCCCCC
Q psy592          305 NAPPFDDLRNYAYEGGGSTAGSLSSLASGSSIKN  338 (339)
Q Consensus       305 ~~~p~Dsl~~y~~EG~gS~agSlssl~s~~~~~~  338 (339)
                      ++||||+|++|+|||.||+|||||||+|.+++++
T Consensus        93 ~~~p~Dsl~~Y~~EG~gS~agSLSSL~S~s~~~~  126 (149)
T PF01049_consen   93 SAPPYDSLQVYAYEGQGSSAGSLSSLESCSSDEE  126 (149)
T ss_dssp             TSSS-EEEEE----------TT------------
T ss_pred             ccccccceeeeccCCCCCcCCCcccccCCCcccc
Confidence            9999999999999999999999999999886543


No 2  
>KOG3514|consensus
Probab=99.83  E-value=2.2e-20  Score=189.96  Aligned_cols=166  Identities=17%  Similarity=0.391  Sum_probs=143.8

Q ss_pred             CCccceeeECCEEeecCCCccCCCC---CCCCCCCCc-ccCCCCCCCCCCCCCeeecCCCCceeeCC-CCCCCCCCCCCC
Q psy592            1 MENILNIVTITYLYDLANPGLHKNS---VPGCPATEA-VCNAHSRLHCYPEHGVCVGSMHTAKCQCL-PGWSGTGCVTPT   75 (339)
Q Consensus         1 ~GCIrnl~vn~~~~dl~~~~~~~~s---~~gC~~~~~-~C~~~~~~~pC~~~g~Cv~~~~~~~C~C~-~g~~G~~Ce~~~   75 (339)
                      +||||+|++||+..|+...++..++   .++|..... .|..+    ||+|+|+|...|+.|.|+|. .+|.|+.||.+.
T Consensus       587 vGCirdl~i~G~s~di~q~ae~q~sagvkpsCs~~~~~~C~~n----PC~N~g~C~egwNrfiCDCs~T~~~G~~CerE~  662 (1591)
T KOG3514|consen  587 VGCIRDLFIDGVSTDIRQEAEAQNSAGVKPSCSLSNEKICESN----PCQNGGKCSEGWNRFICDCSGTGFEGRTCEREA  662 (1591)
T ss_pred             hheehhheecceehhhHHHhhhccccccCcccchhhccccCCC----cccCCCCccccccccccccccCcccCcccccee
Confidence            6999999999999999988877664   689987655 89999    99999999999999999999 589999999998


Q ss_pred             CCccccCcceEEEeccCCCCccccEEEEEEeecCCceEEEEEecCCCCceEEEEEECCEEEEEEecCCCcccceeEEe-c
Q psy592           76 SPVTFKPQSYIKYALSFEPDKYTTQLQLKFRTREEFGELFRLSDQHNREYAILEIKDSKLRFRYNLNNLKTDEKDIWL-N  154 (339)
Q Consensus        76 ~~~~F~~~s~v~y~~~~~~~~~~~~i~l~frT~~~~GlL~~~~~~~~~~~i~L~L~~G~l~~~~~lG~~~~~~~~l~~-~  154 (339)
                      ..++|+|.-|+...++.....+..+++++|||..+.|+||..+.....|-+.++|..|.+.+.+++.+    +..+.. .
T Consensus       663 t~ls~nGs~~m~i~L~~~~~tq~E~v~iRF~t~r~~Gll~~Tta~~s~D~l~l~L~~g~vkl~v~ls~----~~nlfag~  738 (1591)
T KOG3514|consen  663 TALSYNGSMSMKIVLPHTMHTQAEDVSIRFRTQRAYGLLFATTARGSADTLRLELDAGQVKLFVNLSG----PENLFAGQ  738 (1591)
T ss_pred             eeEEEcCeeeEEEEecccceeecceEEEEEEecccceeEEEeccCCCCceEEEEEecceEEEEEecCC----Ccceeccc
Confidence            89999999999998886666667899999999999999999888777899999999999999999874    445554 3


Q ss_pred             ccccCccc----ccccCCcceeec
Q psy592          155 AIKDSKLR----FRYNLNNLKTDE  174 (339)
Q Consensus       155 ~v~dG~WH----~r~~~~~~l~~~  174 (339)
                      .++|.+||    .|.+++..|.++
T Consensus       739 ~LnDN~WHtvrv~Rrg~~L~L~vD  762 (1591)
T KOG3514|consen  739 SLNDNEWHTVRVVRRGKSLLLYVD  762 (1591)
T ss_pred             cccCCcceEEEEEEcccceEEEec
Confidence            78999999    566665555554


No 3  
>KOG4289|consensus
Probab=99.82  E-value=2.3e-20  Score=193.46  Aligned_cols=152  Identities=20%  Similarity=0.344  Sum_probs=131.8

Q ss_pred             CCccceeeECCEEeecCCCccCCCCCCCCCCCCcccCCCCCCCCCCCCCeeecCCCCceeeCCCCCCCCCCCCCC-CCcc
Q psy592            1 MENILNIVTITYLYDLANPGLHKNSVPGCPATEAVCNAHSRLHCYPEHGVCVGSMHTAKCQCLPGWSGTGCVTPT-SPVT   79 (339)
Q Consensus         1 ~GCIrnl~vn~~~~dl~~~~~~~~s~~gC~~~~~~C~~~~~~~pC~~~g~Cv~~~~~~~C~C~~g~~G~~Ce~~~-~~~~   79 (339)
                      +||||||.|+|+.+|++......+..+||+...+.|...    +|.|+|+|++.|++|.|.||.+|.|..|...+ .+-.
T Consensus      1469 vGCmrdLsvD~~~VDma~fianngt~eGC~ark~fCdsg----~C~n~g~CvnrWg~~~C~CP~~fggk~c~~~m~~pq~ 1544 (2531)
T KOG4289|consen 1469 VGCMRDLSVDGRDVDMATFIANNGTHEGCKARKNFCDSG----QCSNGGTCVNRWGGFSCECPLGFGGKGCCQGMAHPQH 1544 (2531)
T ss_pred             hhhhhhcccccccccHHHHHhhcCcccCchhhhcccCCC----ccCCCCeeecccCcEeecCccccCCcchhhccCCchh
Confidence            599999999999999999999989999999999999998    99999999999999999999999999999887 5567


Q ss_pred             ccCcceEEEeccCCCCccccEEEEEEeecCCceEEEEEecCCCCceEEEEEECCEEEEEEecCCCcccceeEEecccccC
Q psy592           80 FKPQSYIKYALSFEPDKYTTQLQLKFRTREEFGELFRLSDQHNREYAILEIKDSKLRFRYNLNNLKTDEKDIWLNAIKDS  159 (339)
Q Consensus        80 F~~~s~v~y~~~~~~~~~~~~i~l~frT~~~~GlL~~~~~~~~~~~i~L~L~~G~l~~~~~lG~~~~~~~~l~~~~v~dG  159 (339)
                      |.|+|.+.+.-.+......+.++|+|||+..+|+||.+...+ ..-+.|+|.+|++.+.+  +.   ....+..+.++||
T Consensus      1545 frG~sl~sw~~~~~~vSvPwylsl~FRTr~ad~vl~~~~~~~-rst~~lqld~g~l~~~v--~~---s~v~L~~~~vtdg 1618 (2531)
T KOG4289|consen 1545 FRGHSLVSWEGLPSQVSVPWYLSLMFRTRRADGVLMQAEFGG-RSTYNLQLDDGTLKYNV--GD---SSVELPAPRVTDG 1618 (2531)
T ss_pred             ccccceeeecCCCcceecceEEEEEEEeeccccEEEEEEeCC-CceEEEEEcCCEEEEEe--cC---ceEEccCccccCC
Confidence            999999998855444555789999999999999999887653 45588999999998875  33   1344444688999


Q ss_pred             ccc
Q psy592          160 KLR  162 (339)
Q Consensus       160 ~WH  162 (339)
                      +||
T Consensus      1619 ~Wh 1621 (2531)
T KOG4289|consen 1619 HWH 1621 (2531)
T ss_pred             chh
Confidence            999


No 4  
>KOG3516|consensus
Probab=99.70  E-value=1.2e-16  Score=165.46  Aligned_cols=151  Identities=19%  Similarity=0.324  Sum_probs=122.9

Q ss_pred             CCccceeeECCEEeecCCCcc-CCCCCCCCCCCCcccCCCCCCCCCCCCCeeecCCCCceeeCC-CCCCCCCCCCCCCCc
Q psy592            1 MENILNIVTITYLYDLANPGL-HKNSVPGCPATEAVCNAHSRLHCYPEHGVCVGSMHTAKCQCL-PGWSGTGCVTPTSPV   78 (339)
Q Consensus         1 ~GCIrnl~vn~~~~dl~~~~~-~~~s~~gC~~~~~~C~~~~~~~pC~~~g~Cv~~~~~~~C~C~-~g~~G~~Ce~~~~~~   78 (339)
                      .||||+|.+||+++||..-+. ..++.+||.   ..|++.    +|.|||+|+..|.+|+|+|. ..|+|+.|..++ .+
T Consensus       925 ~GCIRsl~LNGv~ldLe~ra~~~~gv~~GC~---GhCss~----~C~NGG~Cvery~gytCDCs~Tay~Gp~Cs~ei-g~  996 (1306)
T KOG3516|consen  925 LGCIRSLQLNGVMLDLEYRAYGTAGVSPGCE---GHCSSY----PCLNGGHCVERYDGYTCDCSRTAYDGPFCSKEI-GV  996 (1306)
T ss_pred             eeeeeeeeecceeeeehhhhccCCcccCCCc---cccccc----cccCCCEEEEecCceeeccccCcCCCCcccccc-ce
Confidence            499999999999999944443 346789997   599998    99999999999999999999 589999999996 88


Q ss_pred             cccCcceEEEeccCCC-----------------CccccEEEEEEeecCCceEEEEEecCCCCceEEEEE-ECCEEEEEEe
Q psy592           79 TFKPQSYIKYALSFEP-----------------DKYTTQLQLKFRTREEFGELFRLSDQHNREYAILEI-KDSKLRFRYN  140 (339)
Q Consensus        79 ~F~~~s~v~y~~~~~~-----------------~~~~~~i~l~frT~~~~GlL~~~~~~~~~~~i~L~L-~~G~l~~~~~  140 (339)
                      .|+.+++++|.+...-                 ......|.|.|+|....++|+|+++. ..+|+.+.+ .+|.+++++.
T Consensus       997 ~fe~gs~i~y~fq~~~~~a~~~~~~~~~~~~~~~~~~e~i~~sftTt~~ps~LLfvssF-~~~y~~V~v~~nGsLq~ry~ 1075 (1306)
T KOG3516|consen  997 FFERGSSIRYNFQKPMRSAVFESSRVKQKLEIEINPNEEINFSFTTTRAPSDLLFVSSF-TDDYLAVLVKDNGSLQTRYM 1075 (1306)
T ss_pred             EecCCceEEEeccchHHHhhhhhhhhhhccccccCccceEEEEEEeccCceEEEEeecc-ccceEEEEEeCCCceEEEEe
Confidence            9999999999864321                 01246799999999999999999886 366766555 6899999999


Q ss_pred             cCCCcccceeEEe--cccccCccc
Q psy592          141 LNNLKTDEKDIWL--NAIKDSKLR  162 (339)
Q Consensus       141 lG~~~~~~~~l~~--~~v~dG~WH  162 (339)
                      +|..  .+.....  .++.+|+.|
T Consensus      1076 lg~~--e~~~~~~~~kn~~~gq~H 1097 (1306)
T KOG3516|consen 1076 LGFR--EPFEYQFKDKNIALGQPH 1097 (1306)
T ss_pred             cCCc--CceEEecccccccCCCce
Confidence            9982  3344443  377789999


No 5  
>KOG4289|consensus
Probab=99.65  E-value=4e-16  Score=162.54  Aligned_cols=126  Identities=19%  Similarity=0.456  Sum_probs=104.6

Q ss_pred             CCCCCCCcccCCCCCCCCCCCCCeeecCCCCceeeCCCCCCCCCCCCCC-------------------------------
Q psy592           27 PGCPATEAVCNAHSRLHCYPEHGVCVGSMHTAKCQCLPGWSGTGCVTPT-------------------------------   75 (339)
Q Consensus        27 ~gC~~~~~~C~~~~~~~pC~~~g~Cv~~~~~~~C~C~~g~~G~~Ce~~~-------------------------------   75 (339)
                      .-|....+.|.+.    ||+|+|+|....++|+|.|.+||+|.+||...                               
T Consensus      1233 d~CeTeiDlCYs~----pC~nng~C~srEggYtCeCrpg~tGehCEvs~~agrCvpGvC~nggtC~~~~nggf~c~Cp~g 1308 (2531)
T KOG4289|consen 1233 DYCETEIDLCYSG----PCGNNGRCRSREGGYTCECRPGFTGEHCEVSARAGRCVPGVCKNGGTCVNLLNGGFCCHCPYG 1308 (2531)
T ss_pred             ccccchhHhhhcC----CCCCCCceEEecCceeEEecCCccccceeeecccCccccceecCCCEEeecCCCceeccCCCc
Confidence            4577778999988    99999999999999999999999999999642                               


Q ss_pred             ---------CCccccCcceEEEeccCCCCccccEEEEEEeecCCceEEEEEecCCCCceEEEEEECCEEEEEEecCCCcc
Q psy592           76 ---------SPVTFKPQSYIKYALSFEPDKYTTQLQLKFRTREEFGELFRLSDQHNREYAILEIKDSKLRFRYNLNNLKT  146 (339)
Q Consensus        76 ---------~~~~F~~~s~v~y~~~~~~~~~~~~i~l~frT~~~~GlL~~~~~~~~~~~i~L~L~~G~l~~~~~lG~~~~  146 (339)
                               .+.+|.+.||+.|....  .+.++.++|+|.|.+.+|+|+|.+++ ..||+.|++.++.+++.|..|..  
T Consensus      1309 e~e~prC~v~trSFp~~sfv~frglr--qRfh~TlslsfaT~~~nGlL~ynGne-khDFvalevVd~qvqltfS~Ges-- 1383 (2531)
T KOG4289|consen 1309 EFEDPRCEVTTRSFPPESFVTFRGLR--QRFHFTLSLSFATIERNGLLLYNGNE-KHDFVALEVVDEQVQLTFSAGES-- 1383 (2531)
T ss_pred             ccCCCceEEEeeccCchheEEEeccc--cceEEEEEEEEEEeeecceEEecCCc-ccceEeeeeeeeeEEEEEecccc--
Confidence                     24568889999887532  34567788889999999999999965 68999999999999999999863  


Q ss_pred             cceeEEe---cccccCccc
Q psy592          147 DEKDIWL---NAIKDSKLR  162 (339)
Q Consensus       147 ~~~~l~~---~~v~dG~WH  162 (339)
                       ..++.-   ..++||+||
T Consensus      1384 -~t~v~p~Vp~gvsDGqWH 1401 (2531)
T KOG4289|consen 1384 -TTTVSPDVPGGVSDGQWH 1401 (2531)
T ss_pred             -cceecCCCCCCcccCcee
Confidence             233332   268999999


No 6  
>KOG1219|consensus
Probab=99.61  E-value=2e-14  Score=154.72  Aligned_cols=137  Identities=20%  Similarity=0.347  Sum_probs=112.6

Q ss_pred             EeecCCCccCCCCC-----CCCCCCCcccCCCCCCCCCCCCCeeecC-CC-CceeeCCCCCCCCCCCCCCCCccccCcce
Q psy592           13 LYDLANPGLHKNSV-----PGCPATEAVCNAHSRLHCYPEHGVCVGS-MH-TAKCQCLPGWSGTGCVTPTSPVTFKPQSY   85 (339)
Q Consensus        13 ~~dl~~~~~~~~s~-----~gC~~~~~~C~~~~~~~pC~~~g~Cv~~-~~-~~~C~C~~g~~G~~Ce~~~~~~~F~~~s~   85 (339)
                      .+.+.+|.++++..     .-||...+.|...    ||..+..|+.. |+ .|+|.||+|.-| .|... .++++.|+||
T Consensus      3621 rlSfvtPrh~r~a~ClC~~G~Cp~~~~~C~~~----pcp~~~~Cvs~~~~~~~~cVcP~gr~g-~C~g~-~elS~tGnSY 3694 (4289)
T KOG1219|consen 3621 RLSFVTPRHHRTAACLCNRGFCPVETNQCAKS----PCPAGNLCVSSVHNSTYTCVCPIGRFG-FCQGD-FELSSTGNSY 3694 (4289)
T ss_pred             heeeecchhhccceeeecCCcCCcccCccccC----CCcccCcccccccccceeEeccCcccc-cCCCc-ceEeecCcee
Confidence            34566676666643     4588888999998    99999999987 44 699999999665 58877 4889999999


Q ss_pred             EEEeccCCCCccccEEEEEEeecCCceEEEEEecCCCCceEEEEEECCEEEEEEecCCCcccceeEEe--cccccCccc
Q psy592           86 IKYALSFEPDKYTTQLQLKFRTREEFGELFRLSDQHNREYAILEIKDSKLRFRYNLNNLKTDEKDIWL--NAIKDSKLR  162 (339)
Q Consensus        86 v~y~~~~~~~~~~~~i~l~frT~~~~GlL~~~~~~~~~~~i~L~L~~G~l~~~~~lG~~~~~~~~l~~--~~v~dG~WH  162 (339)
                      ++|+++..... +..+.|++||.+++|++||...   .++.+|.|.+|++++.++.|+++   ..+..  ..++||+||
T Consensus      3695 veyrlse~~n~-~~kl~frLkT~~sngIiM~tr~---~d~~iLkLv~G~~~l~~~cgsG~---Givg~q~~~VnDgqWH 3766 (4289)
T KOG1219|consen 3695 VEYRLSENQNT-RMKLGFRLKTLQSNGIIMYTRK---TDLAILKLVGGSPQLLADCGSGP---GIVGSQKRTVNDGQWH 3766 (4289)
T ss_pred             EEEEccccccc-ceEEEEEEEecccCcEEEEEcC---CceEEEEecCCcEEEEEecCCCC---CcccccceEeecCcee
Confidence            99999876544 3889999999999999999973   68999999999999999999874   34433  478999999


No 7  
>KOG3514|consensus
Probab=99.55  E-value=2.3e-14  Score=146.68  Aligned_cols=167  Identities=21%  Similarity=0.343  Sum_probs=135.0

Q ss_pred             CccceeeECCEEeecCCCccC--CCCCCCCCCCCcccCCCCCCCCCCCCCeeecCCCCceeeCCC-CCCCCCCCCCCCCc
Q psy592            2 ENILNIVTITYLYDLANPGLH--KNSVPGCPATEAVCNAHSRLHCYPEHGVCVGSMHTAKCQCLP-GWSGTGCVTPTSPV   78 (339)
Q Consensus         2 GCIrnl~vn~~~~dl~~~~~~--~~s~~gC~~~~~~C~~~~~~~pC~~~g~Cv~~~~~~~C~C~~-g~~G~~Ce~~~~~~   78 (339)
                      ||.-.+-+++.+-+|......  .....+|.-....|..+    .|.|+|.|+..|+++.|+|.. .|+|+.|..+-.++
T Consensus       985 g~~~s~dl~~r~p~L~~~a~~~s~lv~~~~sgpst~c~~~----acanhG~c~q~w~~~~c~csmtS~~Gp~C~d~gtTY 1060 (1591)
T KOG3514|consen  985 GCLASLDLGGRLPDLISDALFESGLVEVGCSGPSTTCSED----ACANHGVCIQQWNGIACDCSMTSYSGPRCNDPGTTY 1060 (1591)
T ss_pred             CCcCccCccccchhHHHHhhhhccceeeeccCCCcccchh----hhhccceeeeeecceeeeccccccCCCccCCCceEE
Confidence            666667777776666555433  33457898877889888    899999999999999999996 79999999987788


Q ss_pred             cccC-cceEEEeccCCC--CccccEEEEEEeecCCceEEEEEecCC-CCceEEEEEECCEEEEEEecCCCcccceeEEe-
Q psy592           79 TFKP-QSYIKYALSFEP--DKYTTQLQLKFRTREEFGELFRLSDQH-NREYAILEIKDSKLRFRYNLNNLKTDEKDIWL-  153 (339)
Q Consensus        79 ~F~~-~s~v~y~~~~~~--~~~~~~i~l~frT~~~~GlL~~~~~~~-~~~~i~L~L~~G~l~~~~~lG~~~~~~~~l~~-  153 (339)
                      .|+. ++.|.|..+...  .....++.+.|+|.+++|+|+.+.+.. ..||+.|+|..|++-+.||+|.-   ...+.. 
T Consensus      1061 iFgk~gglI~YtwPpNdRpsTr~DrlAvGFsTtq~daVLvRVdSAsglgDYlqLhI~qG~igvvfNiGt~---Diti~E~ 1137 (1591)
T KOG3514|consen 1061 IFGKSGGLITYTWPPNDRPSTRKDRLAVGFSTTQPDAVLVRVDSASGLGDYLQLHINQGKIGVVFNIGTD---DITISEH 1137 (1591)
T ss_pred             EECCCCceEEEecCCCCCCCcccceEEEEEEeccCceEEEEEeccCCCCceEEEEEeccEEEEEEeccCc---ccccccc
Confidence            8975 789999987653  334678999999999999999998765 57999999999999999999973   444443 


Q ss_pred             -cccccCccc----ccccCCcceeecC
Q psy592          154 -NAIKDSKLR----FRYNLNNLKTDEK  175 (339)
Q Consensus       154 -~~v~dG~WH----~r~~~~~~l~~~~  175 (339)
                       ..|+||++|    .|-+-+.|++++.
T Consensus      1138 ~~ivNDgkYHVVRFtR~GGNATLQVD~ 1164 (1591)
T KOG3514|consen 1138 NAIVNDGKYHVVRFTRSGGNATLQVDS 1164 (1591)
T ss_pred             cccccCCceEEEEEEecCCceEEEecc
Confidence             378999999    6666667777765


No 8  
>KOG3516|consensus
Probab=99.16  E-value=3.9e-10  Score=117.94  Aligned_cols=75  Identities=13%  Similarity=0.244  Sum_probs=61.4

Q ss_pred             ccEEEEEEeecCCceEEEEEecCCCCceEEEEEECCE-EEEEEecCCCcccceeEEec-ccccCccc----ccccCCcce
Q psy592           98 TTQLQLKFRTREEFGELFRLSDQHNREYAILEIKDSK-LRFRYNLNNLKTDEKDIWLN-AIKDSKLR----FRYNLNNLK  171 (339)
Q Consensus        98 ~~~i~l~frT~~~~GlL~~~~~~~~~~~i~L~L~~G~-l~~~~~lG~~~~~~~~l~~~-~v~dG~WH----~r~~~~~~l  171 (339)
                      ...|+|.|||..++|++|...+.  .||+.++|..+. +.|.++.|+++. +.+|..+ .++|++||    +|..++..|
T Consensus       810 saDIsf~FrTt~~~gvflen~g~--~dfir~eL~~~~~vtf~~dvgnGp~-~~~V~s~t~~nD~qWH~V~~Ern~K~a~L  886 (1306)
T KOG3516|consen  810 SADISFFFRTTASSGVFLENHGI--NDFIRLELSSPVEVTFAFDVGNGPS-QLTVRSPTELNDNQWHQVRAERNSKEASL  886 (1306)
T ss_pred             cccEEEEEEecCCceEeeeccCC--CceEEEEEcCCCceEEEEEcCCCce-eEEEcCCcccCCCceEEEEEEeccccceE
Confidence            35699999999999999988764  799999998665 889999998753 4555555 78999999    888888888


Q ss_pred             eecC
Q psy592          172 TDEK  175 (339)
Q Consensus       172 ~~~~  175 (339)
                      ++|+
T Consensus       887 qVD~  890 (1306)
T KOG3516|consen  887 QVDG  890 (1306)
T ss_pred             EEcC
Confidence            8877


No 9  
>cd00110 LamG Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.
Probab=99.12  E-value=5.7e-10  Score=93.65  Aligned_cols=81  Identities=23%  Similarity=0.439  Sum_probs=67.9

Q ss_pred             ccccCcceEEEeccCCCCccccEEEEEEeecCCceEEEEEecCCCCceEEEEEECCEEEEEEecCCCcccceeEEec-cc
Q psy592           78 VTFKPQSYIKYALSFEPDKYTTQLQLKFRTREEFGELFRLSDQHNREYAILEIKDSKLRFRYNLNNLKTDEKDIWLN-AI  156 (339)
Q Consensus        78 ~~F~~~s~v~y~~~~~~~~~~~~i~l~frT~~~~GlL~~~~~~~~~~~i~L~L~~G~l~~~~~lG~~~~~~~~l~~~-~v  156 (339)
                      +.|.+.+|+.|...... .....|+++|||.+++|+||+.+.....+|+.++|.+|++.+.++.|.   ....+... .+
T Consensus         2 ~~F~g~~~i~~~~~~~~-~~~~~i~~~frt~~~~g~l~~~~~~~~~~~~~l~l~~g~l~~~~~~g~---~~~~~~~~~~v   77 (151)
T cd00110           2 VSFSGSSYVRLPTLPAP-RTRLSISFSFRTTSPNGLLLYAGSQNGGDFLALELEDGRLVLRYDLGS---GSLVLSSKTPL   77 (151)
T ss_pred             eEeCCCceEEecCCCCC-cceeEEEEEEEeCCCCeEEEEecCCCCCCEEEEEEECCEEEEEEcCCc---ccEEEEccCcc
Confidence            67999999999876543 557899999999999999999988656899999999999999999984   24555544 59


Q ss_pred             ccCccc
Q psy592          157 KDSKLR  162 (339)
Q Consensus       157 ~dG~WH  162 (339)
                      +||+||
T Consensus        78 ~dg~Wh   83 (151)
T cd00110          78 NDGQWH   83 (151)
T ss_pred             CCCCEE
Confidence            999999


No 10 
>PF00054 Laminin_G_1:  Laminin G domain;  InterPro: IPR012679 Laminins are large heterotrimeric glycoproteins involved in basement membrane function []. The laminin globular (G) domain can be found in one to several copies in various laminin family members, which includes a large number of extracellular proteins. The C terminus of laminin alpha chain contains a tandem repeat of five laminin G domains, which are critical for heparin-binding and cell attachment activity []. Laminin alpha4 is distributed in a variety of tissues including peripheral nerves, dorsal root ganglion, skeletal muscle and capillaries; in the neuromuscular junction, it is required for synaptic specialisation []. The structure of the laminin-G domain has been predicted to resemble that of pentraxin [].  Laminin G domains can vary in their function, and a variety of binding functions has been ascribed to different LamG modules. For example, the laminin alpha1 and alpha2 chains each has five C-teminal laminin G domains, where only domains LG4 and LG5 contain binding sites for heparin, sulphatides and the cell surface receptor dystroglycan []. Laminin G-containing proteins appear to have a wide variety of roles in cell adhesion, signalling, migration, assembly and differentiation. This entry represents one subtype of laminin G domains, which is sometimes found in association with thrombospondin-type laminin G domains (IPR012680 from INTERPRO).; PDB: 1OKQ_A 1DYK_A 2C5D_A 1H30_A 1LHW_A 1KDK_A 1LHU_A 1KDM_A 1LHO_A 1D2S_A ....
Probab=98.94  E-value=3.9e-09  Score=87.44  Aligned_cols=68  Identities=31%  Similarity=0.484  Sum_probs=53.7

Q ss_pred             EeecCCceEEEEEecCCCCceEEEEEECCEEEEEEecCCCcccceeEEe-cccccCccc----ccccCCcceeecC
Q psy592          105 FRTREEFGELFRLSDQHNREYAILEIKDSKLRFRYNLNNLKTDEKDIWL-NAIKDSKLR----FRYNLNNLKTDEK  175 (339)
Q Consensus       105 frT~~~~GlL~~~~~~~~~~~i~L~L~~G~l~~~~~lG~~~~~~~~l~~-~~v~dG~WH----~r~~~~~~l~~~~  175 (339)
                      |||.+++|+|||.+.....||++|+|.+|++++++++|.+   +..+.. ..++||+||    .|..+...+++++
T Consensus         1 frT~~~~Gllly~g~~~~~dfial~L~~G~l~~~~~~G~~---~~~~~~~~~i~dg~wh~v~~~r~~~~~~L~Vd~   73 (131)
T PF00054_consen    1 FRTSEPNGLLLYLGSKDGKDFIALELRDGRLEFRYNLGSG---PASLRSPQKINDGKWHTVSVSRNGRNGSLSVDG   73 (131)
T ss_dssp             EEESSSSEEEEEEESSTTSSEEEEEEETTEEEEEEESSSE---EEEEEESSETTSSSEEEEEEEEETTEEEEEETT
T ss_pred             CccCCCCceEEECCcCCCCCEEEEEEECCEEEEEEeCCCc---cceecCCCccCCCcceEEEEEEcCcEEEEEECC
Confidence            8999999999999987777999999999999999999985   445543 478999999    4444433444433


No 11 
>smart00282 LamG Laminin G domain.
Probab=98.80  E-value=3.4e-08  Score=81.68  Aligned_cols=62  Identities=23%  Similarity=0.432  Sum_probs=52.7

Q ss_pred             ccEEEEEEeecCCceEEEEEecCCCCceEEEEEECCEEEEEEecCCCcccceeEEe--cccccCccc
Q psy592           98 TTQLQLKFRTREEFGELFRLSDQHNREYAILEIKDSKLRFRYNLNNLKTDEKDIWL--NAIKDSKLR  162 (339)
Q Consensus        98 ~~~i~l~frT~~~~GlL~~~~~~~~~~~i~L~L~~G~l~~~~~lG~~~~~~~~l~~--~~v~dG~WH  162 (339)
                      ..+|+|.|||.+++|+||+..+....+|+.++|.+|++.+.++.|+.   +..+..  ..++||+||
T Consensus         2 ~~~i~~~frt~~~~g~l~~~~~~~~~~~l~l~l~~g~l~~~~~~g~~---~~~~~~~~~~~~dg~WH   65 (135)
T smart00282        2 RLSISFSFRTTSPNGLLLYAGSKNGGDYLALELRDGRLVLRYDLGSG---PARLTSDPTPLNDGQWH   65 (135)
T ss_pred             ceEEEEEEEeCCCCEEEEEeCCCCCCCEEEEEEECCEEEEEEECCCC---CEEEEECCeEeCCCCEE
Confidence            46899999999999999999875457999999999999999999874   445554  478899999


No 12 
>KOG1219|consensus
Probab=98.68  E-value=1.9e-08  Score=110.13  Aligned_cols=77  Identities=22%  Similarity=0.382  Sum_probs=66.2

Q ss_pred             CCccceeeECCEEeecCCCccC-------CCCCCCCCCCCcccCCCCCCCCCCCCCeeecCC-CCceeeCCCCCCCCCCC
Q psy592            1 MENILNIVTITYLYDLANPGLH-------KNSVPGCPATEAVCNAHSRLHCYPEHGVCVGSM-HTAKCQCLPGWSGTGCV   72 (339)
Q Consensus         1 ~GCIrnl~vn~~~~dl~~~~~~-------~~s~~gC~~~~~~C~~~~~~~pC~~~g~Cv~~~-~~~~C~C~~g~~G~~Ce   72 (339)
                      .||++.+++||..++|.++..+       ....+||....+.|..+    ||+|+|+|.... ++|+|.|++.|+|.+||
T Consensus      3825 ~GCldsiyLng~el~l~~k~~s~a~~~el~~l~pgC~l~~d~C~~n----pCqhgG~C~~~~~ggy~CkCpsqysG~~CE 3900 (4289)
T KOG1219|consen 3825 DGCLDSIYLNGMELPLTRKGKSVAGLMELFGLQPGCSLLTDPCNDN----PCQHGGTCISQPKGGYKCKCPSQYSGNHCE 3900 (4289)
T ss_pred             cceeeeEEEccccccccCCCchhhhhhhhhcccccccccccccccC----cccCCCEecCCCCCceEEeCcccccCcccc
Confidence            4999999999999999887643       24569999888999999    999999999985 58999999999999999


Q ss_pred             CCCCCcccc
Q psy592           73 TPTSPVTFK   81 (339)
Q Consensus        73 ~~~~~~~F~   81 (339)
                      ..+.+-.+.
T Consensus      3901 i~~epC~sn 3909 (4289)
T KOG1219|consen 3901 IDLEPCASN 3909 (4289)
T ss_pred             cccccccCC
Confidence            987555543


No 13 
>PF02210 Laminin_G_2:  Laminin G domain;  InterPro: IPR012680 Laminins are large heterotrimeric glycoproteins involved in basement membrane function []. The laminin globular (G) domain can be found in one to several copies in various laminin family members, including a large number of extracellular proteins. The C terminus of the laminin alpha chain contains a tandem repeat of five laminin G domains, which are critical for heparin-binding and cell attachment activity []. Laminin alpha4 is distributed in a variety of tissues including peripheral nerves, dorsal root ganglion, skeletal muscle and capillaries; in the neuromuscular junction, it is required for synaptic specialisation []. The structure of the laminin-G domain has been predicted to resemble that of pentraxin [].  Laminin G domains can vary in their function, and a variety of binding functions have been ascribed to different LamG modules. For example, the laminin alpha1 and alpha2 chains each have five C-teminal laminin G domains, where only domains LG4 and LG5 contain binding sites for heparin, sulphatides and the cell surface receptor dystroglycan []. Laminin G-containing proteins appear to have a wide variety of roles in cell adhesion, signalling, migration, assembly and differentiation. This entry represents one subtype of laminin G domains, which is sometimes found in association with thrombospondin-type laminin G domains (IPR012679 from INTERPRO).; PDB: 3POY_A 3QCW_B 3R05_B 3ASI_A 3MW4_B 3MW3_A 1QU0_D 1DYK_A 1OKQ_A 3SH4_A ....
Probab=98.51  E-value=6e-07  Score=72.70  Aligned_cols=55  Identities=31%  Similarity=0.496  Sum_probs=45.9

Q ss_pred             EeecCCceEEEEEecCCCCceEEEEEECCEEEEEEecCCCcccceeEEe--cccccCccc
Q psy592          105 FRTREEFGELFRLSDQHNREYAILEIKDSKLRFRYNLNNLKTDEKDIWL--NAIKDSKLR  162 (339)
Q Consensus       105 frT~~~~GlL~~~~~~~~~~~i~L~L~~G~l~~~~~lG~~~~~~~~l~~--~~v~dG~WH  162 (339)
                      |||++++|+|||.+.....+|+.++|.+|++++.+++|..   ...+..  .++++|+||
T Consensus         1 Frt~~~~g~Ll~~~~~~~~~~l~l~l~~g~l~~~~~~g~~---~~~~~~~~~~~~dg~wh   57 (128)
T PF02210_consen    1 FRTRSPNGLLLYIGSEDNGDFLSLELVDGRLVVRYNLGGS---EIVTTFSNSNLNDGQWH   57 (128)
T ss_dssp             EEESSSSEEEEEEEESTTSEEEEEEEETTEEEEEEESSSS---EEEEEECSSSSTSSSEE
T ss_pred             CccCCCCEeEEEEcCCCCCEEEEEEEECCEEEEEEEcccc---ceeeeccCcccccccee
Confidence            8999999999999987556899999999999999999942   233332  378899999


No 14 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=98.37  E-value=1.1e-07  Score=58.97  Aligned_cols=31  Identities=26%  Similarity=0.758  Sum_probs=27.6

Q ss_pred             cCCCCCCCCCCCCCeeecCC-CCceeeCCCCCCCCC
Q psy592           36 CNAHSRLHCYPEHGVCVGSM-HTAKCQCLPGWSGTG   70 (339)
Q Consensus        36 C~~~~~~~pC~~~g~Cv~~~-~~~~C~C~~g~~G~~   70 (339)
                      |.++    ||.|+|+|+... ++|+|.|++||+|++
T Consensus         1 C~~~----~C~n~g~C~~~~~~~y~C~C~~G~~G~~   32 (32)
T PF00008_consen    1 CSSN----PCQNGGTCIDLPGGGYTCECPPGYTGKR   32 (32)
T ss_dssp             TTTT----SSTTTEEEEEESTSEEEEEEBTTEESTT
T ss_pred             CCCC----cCCCCeEEEeCCCCCEEeECCCCCccCC
Confidence            4555    999999999998 899999999999964


No 15 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.98  E-value=9.7e-06  Score=51.92  Aligned_cols=35  Identities=29%  Similarity=0.868  Sum_probs=30.8

Q ss_pred             cccCC-CCCCCCCCCCCeeecCCCCceeeCCCCCC-CCCCC
Q psy592           34 AVCNA-HSRLHCYPEHGVCVGSMHTAKCQCLPGWS-GTGCV   72 (339)
Q Consensus        34 ~~C~~-~~~~~pC~~~g~Cv~~~~~~~C~C~~g~~-G~~Ce   72 (339)
                      +.|.. .    +|.++|+|++..++|.|.|++||. |..|+
T Consensus         3 ~~C~~~~----~C~~~~~C~~~~g~~~C~C~~g~~~g~~C~   39 (39)
T smart00179        3 DECASGN----PCQNGGTCVNTVGSYRCECPPGYTDGRNCE   39 (39)
T ss_pred             ccCcCCC----CcCCCCEeECCCCCeEeECCCCCccCCcCC
Confidence            45665 5    899999999999999999999999 99885


No 16 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.80  E-value=3.2e-05  Score=48.89  Aligned_cols=35  Identities=29%  Similarity=0.854  Sum_probs=30.5

Q ss_pred             cccCC-CCCCCCCCCCCeeecCCCCceeeCCCCCCCCCCC
Q psy592           34 AVCNA-HSRLHCYPEHGVCVGSMHTAKCQCLPGWSGTGCV   72 (339)
Q Consensus        34 ~~C~~-~~~~~pC~~~g~Cv~~~~~~~C~C~~g~~G~~Ce   72 (339)
                      +.|.. .    +|.+++.|+...++|+|.|++||.|..|+
T Consensus         3 ~~C~~~~----~C~~~~~C~~~~~~~~C~C~~g~~g~~C~   38 (38)
T cd00054           3 DECASGN----PCQNGGTCVNTVGSYRCSCPPGYTGRNCE   38 (38)
T ss_pred             ccCCCCC----CcCCCCEeECCCCCeEeECCCCCcCCcCC
Confidence            45665 4    89999999999999999999999999885


No 17 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=97.62  E-value=8.3e-05  Score=46.18  Aligned_cols=29  Identities=31%  Similarity=0.844  Sum_probs=26.8

Q ss_pred             CCCCCCeeecCCCCceeeCCCCCCCC-CCC
Q psy592           44 CYPEHGVCVGSMHTAKCQCLPGWSGT-GCV   72 (339)
Q Consensus        44 pC~~~g~Cv~~~~~~~C~C~~g~~G~-~Ce   72 (339)
                      +|.+++.|+..+++|+|.|+.||.|. .|+
T Consensus         7 ~C~~~~~C~~~~~~~~C~C~~g~~g~~~C~   36 (36)
T cd00053           7 PCSNGGTCVNTPGSYRCVCPPGYTGDRSCE   36 (36)
T ss_pred             CCCCCCEEecCCCCeEeECCCCCcccCCcC
Confidence            89999999999999999999999998 774


No 18 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=97.51  E-value=0.00014  Score=45.51  Aligned_cols=28  Identities=25%  Similarity=0.766  Sum_probs=25.9

Q ss_pred             CCCCCCeeecCCCCceeeCCCCCCC-CCCC
Q psy592           44 CYPEHGVCVGSMHTAKCQCLPGWSG-TGCV   72 (339)
Q Consensus        44 pC~~~g~Cv~~~~~~~C~C~~g~~G-~~Ce   72 (339)
                      +|.++ +|++.+++|+|.|++||.| ..|+
T Consensus         7 ~C~~~-~C~~~~~~~~C~C~~g~~g~~~C~   35 (35)
T smart00181        7 PCSNG-TCINTPGSYTCSCPPGYTGDKRCE   35 (35)
T ss_pred             CCCCC-EEECCCCCeEeECCCCCccCCccC
Confidence            89888 9999999999999999999 8774


No 19 
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=97.23  E-value=0.00035  Score=43.11  Aligned_cols=26  Identities=35%  Similarity=0.992  Sum_probs=23.7

Q ss_pred             CCCCCCeeecCCCCceeeCCCCCCCCCC
Q psy592           44 CYPEHGVCVGSMHTAKCQCLPGWSGTGC   71 (339)
Q Consensus        44 pC~~~g~Cv~~~~~~~C~C~~g~~G~~C   71 (339)
                      .|.++|+|+..  ..+|.|.+||+|+.|
T Consensus         7 ~C~~~G~C~~~--~g~C~C~~g~~G~~C   32 (32)
T PF07974_consen    7 ICSGHGTCVSP--CGRCVCDSGYTGPDC   32 (32)
T ss_pred             ccCCCCEEeCC--CCEEECCCCCcCCCC
Confidence            69999999987  678999999999987


No 20 
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.19  E-value=0.00017  Score=47.41  Aligned_cols=33  Identities=33%  Similarity=0.783  Sum_probs=27.7

Q ss_pred             CcccCCCCCCCCCCCCCeeecCCCCceeeCCCCCC
Q psy592           33 EAVCNAHSRLHCYPEHGVCVGSMHTAKCQCLPGWS   67 (339)
Q Consensus        33 ~~~C~~~~~~~pC~~~g~Cv~~~~~~~C~C~~g~~   67 (339)
                      ++.|...  .+.|..++.|++..++|+|.|++||.
T Consensus         2 idEC~~~--~~~C~~~~~C~N~~Gsy~C~C~~Gy~   34 (42)
T PF07645_consen    2 IDECAEG--PHNCPENGTCVNTEGSYSCSCPPGYE   34 (42)
T ss_dssp             SSTTTTT--SSSSSTTSEEEEETTEEEEEESTTEE
T ss_pred             ccccCCC--CCcCCCCCEEEcCCCCEEeeCCCCcE
Confidence            3567654  35798899999999999999999996


No 21 
>PF12661 hEGF:  Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=96.98  E-value=0.00021  Score=34.86  Aligned_cols=13  Identities=62%  Similarity=1.842  Sum_probs=11.1

Q ss_pred             eeeCCCCCCCCCC
Q psy592           59 KCQCLPGWSGTGC   71 (339)
Q Consensus        59 ~C~C~~g~~G~~C   71 (339)
                      +|.|++||+|++|
T Consensus         1 ~C~C~~G~~G~~C   13 (13)
T PF12661_consen    1 TCQCPPGWTGPNC   13 (13)
T ss_dssp             EEEE-TTEETTTT
T ss_pred             CccCcCCCcCCCC
Confidence            5999999999988


No 22 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=96.03  E-value=0.0027  Score=40.26  Aligned_cols=27  Identities=22%  Similarity=0.619  Sum_probs=21.3

Q ss_pred             CCCCCCCeeecCCCCceeeCCCCCCCC
Q psy592           43 HCYPEHGVCVGSMHTAKCQCLPGWSGT   69 (339)
Q Consensus        43 ~pC~~~g~Cv~~~~~~~C~C~~g~~G~   69 (339)
                      ..|..++.|+....+|+|.|.+||.|.
T Consensus         6 ~~C~~nA~C~~~~~~~~C~C~~Gy~Gd   32 (36)
T PF12947_consen    6 GGCHPNATCTNTGGSYTCTCKPGYEGD   32 (36)
T ss_dssp             GGS-TTCEEEE-TTSEEEEE-CEEECC
T ss_pred             CCCCCCcEeecCCCCEEeECCCCCccC
Confidence            368888999999999999999999874


No 23 
>smart00210 TSPN Thrombospondin N-terminal -like domains. Heparin-binding and cell adhesion domain of thrombospondin
Probab=95.92  E-value=0.071  Score=46.58  Aligned_cols=73  Identities=15%  Similarity=0.118  Sum_probs=48.1

Q ss_pred             cccEEEEEEeec-CCceEEEEEecCCCCceEEEEEECCEEEEEEec-CCCcccceeEEe--cccccCcccccccCCccee
Q psy592           97 YTTQLQLKFRTR-EEFGELFRLSDQHNREYAILEIKDSKLRFRYNL-NNLKTDEKDIWL--NAIKDSKLRFRYNLNNLKT  172 (339)
Q Consensus        97 ~~~~i~l~frT~-~~~GlL~~~~~~~~~~~i~L~L~~G~l~~~~~l-G~~~~~~~~l~~--~~v~dG~WH~r~~~~~~l~  172 (339)
                      ....|.+.||+. ...|.||.+.+.++..++.+.+..+...+.+.. +.. .....+..  .++.||+||     .+.++
T Consensus        52 ~~fsi~~~~r~~~~~~g~L~si~~~~~~~~l~v~l~g~~~~~~~~~~~~~-g~~~~~~f~~~~l~dg~WH-----~lal~  125 (184)
T smart00210       52 EDFSLLTTFRQTPKSRGVLFAIYDAQNVRQFGLEVDGRANTLLLRYQGVD-GKQHTVSFRNLPLADGQWH-----KLALS  125 (184)
T ss_pred             CCeEEEEEEEeCCCCCeEEEEEEcCCCcEEEEEEEeCCccEEEEEECCCC-CcEEEEeecCCccccCCce-----EEEEE
Confidence            467888889987 667999999876567788888877665555543 221 11233332  367889999     55555


Q ss_pred             ecC
Q psy592          173 DEK  175 (339)
Q Consensus       173 ~~~  175 (339)
                      +.+
T Consensus       126 V~~  128 (184)
T smart00210      126 VSG  128 (184)
T ss_pred             EeC
Confidence            554


No 24 
>PHA02887 EGF-like protein; Provisional
Probab=95.51  E-value=0.011  Score=47.16  Aligned_cols=30  Identities=23%  Similarity=0.631  Sum_probs=24.6

Q ss_pred             CCCCCCeeec--CCCCceeeCCCCCCCCCCCCC
Q psy592           44 CYPEHGVCVG--SMHTAKCQCLPGWSGTGCVTP   74 (339)
Q Consensus        44 pC~~~g~Cv~--~~~~~~C~C~~g~~G~~Ce~~   74 (339)
                      -|- +|+|.-  +...+.|.|+.||+|.+|+..
T Consensus        93 YCi-HG~C~yI~dL~epsCrC~~GYtG~RCE~v  124 (126)
T PHA02887         93 FCI-NGECMNIIDLDEKFCICNKGYTGIRCDEV  124 (126)
T ss_pred             Eee-CCEEEccccCCCceeECCCCcccCCCCcc
Confidence            565 579965  456899999999999999864


No 25 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=95.18  E-value=0.021  Score=46.25  Aligned_cols=30  Identities=20%  Similarity=0.577  Sum_probs=25.1

Q ss_pred             CCCCCCeeec--CCCCceeeCCCCCCCCCCCCC
Q psy592           44 CYPEHGVCVG--SMHTAKCQCLPGWSGTGCVTP   74 (339)
Q Consensus        44 pC~~~g~Cv~--~~~~~~C~C~~g~~G~~Ce~~   74 (339)
                      -|-|| +|.-  +...+.|.|..||+|.+||..
T Consensus        52 YClHG-~C~yI~dl~~~~CrC~~GYtGeRCEh~   83 (139)
T PHA03099         52 YCLHG-DCIHARDIDGMYCRCSHGYTGIRCQHV   83 (139)
T ss_pred             EeECC-EEEeeccCCCceeECCCCcccccccce
Confidence            68764 9965  457899999999999999865


No 26 
>KOG3509|consensus
Probab=93.78  E-value=0.051  Score=58.20  Aligned_cols=70  Identities=19%  Similarity=0.347  Sum_probs=54.2

Q ss_pred             CCccceeeECCEEeecCCCccCCCCCCCCCCCCcccCCCCCCCCCCCCCeeecCCCCceeeCCCCCCCCCCCCCC
Q psy592            1 MENILNIVTITYLYDLANPGLHKNSVPGCPATEAVCNAHSRLHCYPEHGVCVGSMHTAKCQCLPGWSGTGCVTPT   75 (339)
Q Consensus         1 ~GCIrnl~vn~~~~dl~~~~~~~~s~~gC~~~~~~C~~~~~~~pC~~~g~Cv~~~~~~~C~C~~g~~G~~Ce~~~   75 (339)
                      .|||+.++++++.++...-....-...+| ...+.|...    ||...+.|.+..-...|.|+++|+|..|+..+
T Consensus       375 ~gci~~~~~~~k~l~~~~~~~~~v~~~~~-c~g~~c~~~----p~~~~g~c~p~~~~~~c~c~~g~~G~~c~d~~  444 (964)
T KOG3509|consen  375 AGCIRDLVMNLKDLRVTLQRASYVAAQGT-CLGDVCWRI----PCQHDGPCLQTLEGKQCLCPPGYTGDSCEDCM  444 (964)
T ss_pred             cceehhhhhhccccccccccccccccccc-cCCCccccc----cCCCCccccccccccceeccccccCchhhccC
Confidence            48999999999988765443322222222 234688888    99999999999999999999999999999765


No 27 
>PF12955 DUF3844:  Domain of unknown function (DUF3844);  InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=93.33  E-value=0.044  Score=43.09  Aligned_cols=45  Identities=27%  Similarity=0.742  Sum_probs=32.6

Q ss_pred             CCCCCcccCCCCCCCCCCCCCeeecCCC-----CceeeCCC-------------CCCCCCCCCCC
Q psy592           29 CPATEAVCNAHSRLHCYPEHGVCVGSMH-----TAKCQCLP-------------GWSGTGCVTPT   75 (339)
Q Consensus        29 C~~~~~~C~~~~~~~pC~~~g~Cv~~~~-----~~~C~C~~-------------g~~G~~Ce~~~   75 (339)
                      |-.+.+.|...  .+.|..||.|+....     =|.|.|.+             .|.|+.|++.-
T Consensus         1 Cf~S~~aC~~~--Tn~CsgHG~C~~~~~~~~~~C~~C~C~~T~~~~~~~~~ktt~W~G~aCqKkD   63 (103)
T PF12955_consen    1 CFSSNDACENA--TNNCSGHGSCVKKYGSGGGDCFACKCKPTVVKTGSGKGKTTHWGGPACQKKD   63 (103)
T ss_pred             CcCCHHHHHHh--ccCCCCCceEeeccCCCccceEEEEeeccccccccccCceeeeccccccccc
Confidence            33445667644  447999999999742     38999987             57888888763


No 28 
>PF12662 cEGF:  Complement Clr-like EGF-like
Probab=91.40  E-value=0.14  Score=29.34  Aligned_cols=16  Identities=38%  Similarity=1.149  Sum_probs=12.5

Q ss_pred             CceeeCCCCCC----CCCCC
Q psy592           57 TAKCQCLPGWS----GTGCV   72 (339)
Q Consensus        57 ~~~C~C~~g~~----G~~Ce   72 (339)
                      +|+|.|++||.    |..|.
T Consensus         1 sy~C~C~~Gy~l~~d~~~C~   20 (24)
T PF12662_consen    1 SYTCSCPPGYQLSPDGRSCE   20 (24)
T ss_pred             CEEeeCCCCCcCCCCCCccc
Confidence            68999999985    55664


No 29 
>PF12273 RCR:  Chitin synthesis regulation, resistance to Congo red;  InterPro: IPR020999  RCR proteins are ER membrane proteins that regulate chitin deposition in fungal cell walls. Although chitin, a linear polymer of beta-1,4-linked N-acetylglucosamine, constitutes only 2% of the cell wall it plays a vital role in the overall protection of the cell wall against stress, noxious chemicals and osmotic pressure changes. Congo red is a cell wall-disrupting benzidine-type dye extensively used in many cell wall mutant studies that specifically targets chitin in yeast cells and inhibits growth. RCR proteins render the yeasts resistant to Congo red by diminishing the content of chitin in the cell wall []. RCR proteins are probably regulating chitin synthase III interact directly with ubiquitin ligase Rsp5, and the VPEY motif is necessary for this, via interaction with the WW domains of Rsp5 []. 
Probab=87.45  E-value=0.61  Score=38.36  Aligned_cols=11  Identities=18%  Similarity=0.311  Sum_probs=6.4

Q ss_pred             ccCCCcccccc
Q psy592          248 EDDMTAFDITP  258 (339)
Q Consensus       248 E~d~~~yd~~~  258 (339)
                      +.|..-||-..
T Consensus        72 ~~~~g~Yd~~g   82 (130)
T PF12273_consen   72 DNDPGYYDQQG   82 (130)
T ss_pred             CCCCCCCCCCC
Confidence            55666677543


No 30 
>KOG1214|consensus
Probab=87.26  E-value=0.49  Score=49.68  Aligned_cols=35  Identities=26%  Similarity=0.698  Sum_probs=30.9

Q ss_pred             cccCCCCCCCCCCCCCeeecCCCCceeeCCCCCCCC--CCC
Q psy592           34 AVCNAHSRLHCYPEHGVCVGSMHTAKCQCLPGWSGT--GCV   72 (339)
Q Consensus        34 ~~C~~~~~~~pC~~~g~Cv~~~~~~~C~C~~g~~G~--~Ce   72 (339)
                      +.|..+    .|...++|++..++|+|.|.+||.|.  +|-
T Consensus       828 DeC~ps----rChp~A~CyntpgsfsC~C~pGy~GDGf~CV  864 (1289)
T KOG1214|consen  828 DECSPS----RCHPAATCYNTPGSFSCRCQPGYYGDGFQCV  864 (1289)
T ss_pred             cccCcc----ccCCCceEecCCCcceeecccCccCCCceec
Confidence            788877    89999999999999999999999865  553


No 31 
>PF00354 Pentaxin:  Pentaxin family;  InterPro: IPR001759 Pentaxins (or pentraxins) [, ] are a family of proteins which show, under electron microscopy, a discoid arrangement of five noncovalently bound subunits. Proteins of the pentaxin family are involved in acute immunological responses []. Three of the principal members of the pentaxin family are serum proteins: namely, C-reactive protein (CRP) [], serum amyloid P component protein (SAP) [], and female protein (FP) []. CRP is expressed during acute phase response to tissue injury or inflammation in mammals. The protein resembles antibody and performs several functions associated with host defence: it promotes agglutination, bacterial capsular swelling and phagocytosis, and activates the classical complement pathway through its calcium-dependent binding to phosphocholine. CRPs have also been sequenced in an invertebrate, Limulus polyphemus (Atlantic horseshoe crab), where they are a normal constituent of the hemolymph. SAP is a vertebrate protein that is a precursor of amyloid component P. It is found in all types of amyloid deposits, in glomerular basement menbrane and in elastic fibres in blood vessels. SAP binds to various lipoprotein ligands in a calcium-dependent manner, and it has been suggested that, in mammals, this may have important implications in atherosclerosis and amyloidosis. FP is a SAP homologue found in Mesocricetus auratus (Golden hamster). The concentration of this plasma protein is altered by sex steroids and stimuli that elicit an acute phase response. Pentaxin proteins expressed in the nervous system are neural pentaxin I (NPI) and II (NPII) []. NPI and NPII are homologous and can exist within one species. It is suggested that both proteins mediate the uptake of synaptic macromolecules and play a role in synaptic plasticity. Apexin, a sperm acrosomal protein, is a homologue of NPII found in Cavia porcellus (Guinea pig) []. PTX3 (or TSG-14) protein is a cytokine-induced protein that is homologous to CRPs and SAPs, but its function is not yet known.; PDB: 2A3W_F 3KQR_C 3D5O_D 2A3X_G 1SAC_D 2W08_B 1GYK_B 1LGN_A 2A3Y_A 1B09_D ....
Probab=86.96  E-value=5  Score=35.37  Aligned_cols=74  Identities=15%  Similarity=0.246  Sum_probs=38.3

Q ss_pred             cceEEEeccCCCCccccEEEEEEeecCC--ceEEEEEecCCC-CceEEEEEECCEEEEEEecCCCcccceeEEecccccC
Q psy592           83 QSYIKYALSFEPDKYTTQLQLKFRTREE--FGELFRLSDQHN-REYAILEIKDSKLRFRYNLNNLKTDEKDIWLNAIKDS  159 (339)
Q Consensus        83 ~s~v~y~~~~~~~~~~~~i~l~frT~~~--~GlL~~~~~~~~-~~~i~L~L~~G~l~~~~~lG~~~~~~~~l~~~~v~dG  159 (339)
                      ..|+++............+-+++||...  .+.||..+.... .+++.+.-..+.+.+.  +++.   ...+. ..+.+|
T Consensus        11 ~~yv~l~~~~~~pL~~fTvC~w~k~~~~~~~~tifSYat~~~~nell~~~~~~~~~~l~--i~~~---~~~~~-~~~~~~   84 (195)
T PF00354_consen   11 TDYVRLKPSVPLPLSAFTVCFWVKTDDSSNDGTIFSYATSSQDNELLLFGSSSGSLRLY--INGS---SVSFS-GPIRDG   84 (195)
T ss_dssp             SBEEEEEESS-S-BSEEEEEEEEEESGSGS-EEEEEEEETTEEEEEEEEEETTTEEEEE--ETTE---EEEEE-ECS-TS
T ss_pred             cceEEEecCCCCCcccEEEEEEEEeccCCCceEEEEEccCCCCccEEEEEeCCceEEEE--ECCe---EeEec-cccCCC
Confidence            4677776543222446777888888765  688886654322 2343333334555444  4431   12222 356789


Q ss_pred             ccc
Q psy592          160 KLR  162 (339)
Q Consensus       160 ~WH  162 (339)
                      +||
T Consensus        85 ~Wh   87 (195)
T PF00354_consen   85 QWH   87 (195)
T ss_dssp             S-E
T ss_pred             CcE
Confidence            999


No 32 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=85.29  E-value=0.46  Score=30.05  Aligned_cols=19  Identities=32%  Similarity=0.981  Sum_probs=15.8

Q ss_pred             CeeecCCCCceeeCCCCCC
Q psy592           49 GVCVGSMHTAKCQCLPGWS   67 (339)
Q Consensus        49 g~Cv~~~~~~~C~C~~g~~   67 (339)
                      ..|++..++|+|.|++||.
T Consensus        10 h~C~~~~g~~~C~C~~Gy~   28 (36)
T PF14670_consen   10 HICVNTPGSYRCSCPPGYK   28 (36)
T ss_dssp             SEEEEETTSEEEE-STTEE
T ss_pred             CCCccCCCceEeECCCCCE
Confidence            3799999999999999984


No 33 
>KOG4260|consensus
Probab=84.87  E-value=0.62  Score=42.83  Aligned_cols=33  Identities=27%  Similarity=0.564  Sum_probs=28.3

Q ss_pred             CCCCCCCeeecCC---CCceeeCCCCCCCCCCCCCC
Q psy592           43 HCYPEHGVCVGSM---HTAKCQCLPGWSGTGCVTPT   75 (339)
Q Consensus        43 ~pC~~~g~Cv~~~---~~~~C~C~~g~~G~~Ce~~~   75 (339)
                      .||..+|.|..+.   ++-+|.|.+||+|+.|.+-.
T Consensus       150 r~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~Cg  185 (350)
T KOG4260|consen  150 RPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRYCG  185 (350)
T ss_pred             CCcCCCCcccCCCCCCCCCcccccCCCCCccccccc
Confidence            4898899999874   57899999999999998764


No 34 
>smart00159 PTX Pentraxin / C-reactive protein / pentaxin family. This family form a doscoid pentameric structure. Human serum amyloid P demonstrates calcium-mediated ligand-binding.
Probab=84.64  E-value=12  Score=33.14  Aligned_cols=78  Identities=15%  Similarity=0.186  Sum_probs=42.1

Q ss_pred             ccccC---cceEEEeccCCCCccccEEEEEEeecC--CceEEEEEecCCC-CceEEEEEECCEEEEEEecCCCcccceeE
Q psy592           78 VTFKP---QSYIKYALSFEPDKYTTQLQLKFRTRE--EFGELFRLSDQHN-REYAILEIKDSKLRFRYNLNNLKTDEKDI  151 (339)
Q Consensus        78 ~~F~~---~s~v~y~~~~~~~~~~~~i~l~frT~~--~~GlL~~~~~~~~-~~~i~L~L~~G~l~~~~~lG~~~~~~~~l  151 (339)
                      +.|..   ..|+.+............+.+++|+..  ..+.||..+.... .+++...-.+|.+.+.  +++    . .+
T Consensus         9 ~~fp~~s~~~yv~l~~~~~~~l~~fTvc~W~k~~~~~~~~~ifSy~~~~~~ne~~~~~~~~~~~~l~--i~g----~-~~   81 (206)
T smart00159        9 FVFPKESDTSYVKLKPELPKPLQAFTVCLWFYSDLSPRGYSLFSYATKGQDNELLLYKEKQGEYSLY--IGG----K-KV   81 (206)
T ss_pred             EECCCCCCCCeEEEccCCCCChhHEEEEEEEEecCCCCceEEEEEeCCCCCCeEEEEEcCCcEEEEE--EcC----e-EE
Confidence            44543   466666544332344677888888864  4567775544332 3333333344555444  333    1 22


Q ss_pred             Ee-cccccCccc
Q psy592          152 WL-NAIKDSKLR  162 (339)
Q Consensus       152 ~~-~~v~dG~WH  162 (339)
                      .. ..+.+|+||
T Consensus        82 ~~~~~~~~g~W~   93 (206)
T smart00159       82 QFPVPESDGKWH   93 (206)
T ss_pred             EecccccCCceE
Confidence            22 367789999


No 35 
>PF00053 Laminin_EGF:  Laminin EGF-like (Domains III and V);  InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below.  +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain  In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=82.46  E-value=0.82  Score=30.68  Aligned_cols=24  Identities=33%  Similarity=0.753  Sum_probs=18.3

Q ss_pred             eeecCCCCceeeCCCCCCCCCCCCCC
Q psy592           50 VCVGSMHTAKCQCLPGWSGTGCVTPT   75 (339)
Q Consensus        50 ~Cv~~~~~~~C~C~~g~~G~~Ce~~~   75 (339)
                      .|..  ....|.|.++|+|++|+.-.
T Consensus        12 ~C~~--~~G~C~C~~~~~G~~C~~C~   35 (49)
T PF00053_consen   12 TCDP--STGQCVCKPGTTGPRCDQCK   35 (49)
T ss_dssp             SEEE--TCEEESBSTTEESTTS-EE-
T ss_pred             cccC--CCCEEeccccccCCcCcCCC
Confidence            5665  55689999999999999764


No 36 
>PF04863 EGF_alliinase:  Alliinase EGF-like domain;  InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system [].; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=82.38  E-value=0.4  Score=33.03  Aligned_cols=32  Identities=22%  Similarity=0.456  Sum_probs=18.6

Q ss_pred             CCCCCCeeecC----CCCceeeCCCCCCCCCCCCCC
Q psy592           44 CYPEHGVCVGS----MHTAKCQCLPGWSGTGCVTPT   75 (339)
Q Consensus        44 pC~~~g~Cv~~----~~~~~C~C~~g~~G~~Ce~~~   75 (339)
                      +|..||+...+    .+...|.|..-|+|+.|++.+
T Consensus        18 ~CSGHGr~flDg~~~dG~p~CECn~Cy~GpdCS~~~   53 (56)
T PF04863_consen   18 SCSGHGRAFLDGLIADGSPVCECNSCYGGPDCSTLI   53 (56)
T ss_dssp             --TTSEE--TTS-EETTEE--EE-TTEESTTS-EE-
T ss_pred             CcCCCCeeeeccccccCCccccccCCcCCCCcccCC
Confidence            78888988754    345889999999999998654


No 37 
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=82.07  E-value=1.3  Score=39.61  Aligned_cols=37  Identities=22%  Similarity=0.536  Sum_probs=27.2

Q ss_pred             CCCCCCCcccCCCCCCCCCCCCCeeecCCCCceeeCCCCCCC
Q psy592           27 PGCPATEAVCNAHSRLHCYPEHGVCVGSMHTAKCQCLPGWSG   68 (339)
Q Consensus        27 ~gC~~~~~~C~~~~~~~pC~~~g~Cv~~~~~~~C~C~~g~~G   68 (339)
                      .-|. ..+.|...  .++|.  ..|....++|.|.|++||+.
T Consensus       182 ~~C~-~~~~C~~~--~~~c~--~~C~~~~g~~~c~c~~g~~~  218 (224)
T cd01475         182 KICV-VPDLCATL--SHVCQ--QVCISTPGSYLCACTEGYAL  218 (224)
T ss_pred             ccCc-CchhhcCC--CCCcc--ceEEcCCCCEEeECCCCccC
Confidence            4575 34677643  23675  47999999999999999974


No 38 
>KOG1217|consensus
Probab=81.44  E-value=1.4  Score=43.25  Aligned_cols=36  Identities=31%  Similarity=0.766  Sum_probs=30.3

Q ss_pred             CcccCCCCCCCCCCCCCeeecCCCCceeeCCCCCCCCCC
Q psy592           33 EAVCNAHSRLHCYPEHGVCVGSMHTAKCQCLPGWSGTGC   71 (339)
Q Consensus        33 ~~~C~~~~~~~pC~~~g~Cv~~~~~~~C~C~~g~~G~~C   71 (339)
                      .+.|...  . +|.++++|+...+.|.|.|++||+|..|
T Consensus       271 ~~~C~~~--~-~c~~~~~C~~~~~~~~C~C~~g~~g~~~  306 (487)
T KOG1217|consen  271 VDSCALI--A-SCPNGGTCVNVPGSYRCTCPPGFTGRLC  306 (487)
T ss_pred             ccccCCC--C-ccCCCCeeecCCCcceeeCCCCCCCCCC
Confidence            3556554  1 3999999999999999999999999998


No 39 
>PF05568 ASFV_J13L:  African swine fever virus J13L protein;  InterPro: IPR008385 This family consists of several African swine fever virus (ASFV) j13L proteins [, , ].
Probab=81.27  E-value=4.2  Score=33.91  Aligned_cols=51  Identities=12%  Similarity=0.280  Sum_probs=23.3

Q ss_pred             hHHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCchhhcccccccCCCCCcc
Q psy592          196 SALFVTLLISPNQVLVLVFVVYSRRREAHIKYPGPDDDVRENIINYDDEGGGED  249 (339)
Q Consensus       196 ~~il~~~~~l~~l~~~l~~~~~~~~~~~~~~~~~~~~d~reni~~Y~eEggGE~  249 (339)
                      +.||+.|++++ +++++++.++.+||++.. -...++|++ =|=.|+|..-+|-
T Consensus        32 ~tILiaIvVli-iiiivli~lcssRKkKaa-AAi~eediQ-finpyqDqqw~~v   82 (189)
T PF05568_consen   32 YTILIAIVVLI-IIIIVLIYLCSSRKKKAA-AAIEEEDIQ-FINPYQDQQWAGV   82 (189)
T ss_pred             HHHHHHHHHHH-HHHHHHHHHHhhhhHHHH-hhhhhhccc-ccCcccchhhccC
Confidence            56676666651 233444445555554321 123344553 2334666433333


No 40 
>PF12946 EGF_MSP1_1:  MSP1 EGF domain 1;  InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=80.14  E-value=0.77  Score=29.13  Aligned_cols=24  Identities=25%  Similarity=0.649  Sum_probs=17.7

Q ss_pred             CCCCCCeeecCC-CCceeeCCCCCC
Q psy592           44 CYPEHGVCVGSM-HTAKCQCLPGWS   67 (339)
Q Consensus        44 pC~~~g~Cv~~~-~~~~C~C~~g~~   67 (339)
                      +|..++.|+... +.+.|.|.+||.
T Consensus         6 ~cP~NA~C~~~~dG~eecrCllgyk   30 (37)
T PF12946_consen    6 KCPANAGCFRYDDGSEECRCLLGYK   30 (37)
T ss_dssp             ---TTEEEEEETTSEEEEEE-TTEE
T ss_pred             cCCCCcccEEcCCCCEEEEeeCCcc
Confidence            888899999987 679999999984


No 41 
>PF00954 S_locus_glycop:  S-locus glycoprotein family;  InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=79.70  E-value=2  Score=34.07  Aligned_cols=32  Identities=28%  Similarity=0.807  Sum_probs=23.9

Q ss_pred             CcccCCCCCCCCCCCCCeeecCCCCceeeCCCCCCC
Q psy592           33 EAVCNAHSRLHCYPEHGVCVGSMHTAKCQCLPGWSG   68 (339)
Q Consensus        33 ~~~C~~~~~~~pC~~~g~Cv~~~~~~~C~C~~g~~G   68 (339)
                      .+.|...   ..|+..|.|... ...+|.|.+||.-
T Consensus        77 ~d~Cd~y---~~CG~~g~C~~~-~~~~C~Cl~GF~P  108 (110)
T PF00954_consen   77 KDQCDVY---GFCGPNGICNSN-NSPKCSCLPGFEP  108 (110)
T ss_pred             ccCCCCc---cccCCccEeCCC-CCCceECCCCcCC
Confidence            3577653   389999999654 5667999999953


No 42 
>PF06679 DUF1180:  Protein of unknown function (DUF1180);  InterPro: IPR009565 This entry consists of several hypothetical eukaryotic proteins thought to be membrane proteins. Their function is unknown.
Probab=78.77  E-value=3.2  Score=35.58  Aligned_cols=18  Identities=28%  Similarity=0.492  Sum_probs=12.7

Q ss_pred             CCCCCccCCCcccccccC
Q psy592          243 DEGGGEDDMTAFDITPLQ  260 (339)
Q Consensus       243 eEggGE~d~~~yd~~~L~  260 (339)
                      ||...|||++=||...+|
T Consensus       145 ~~ddedeD~TvFd~~~~r  162 (163)
T PF06679_consen  145 EEDDEDEDSTVFDANHPR  162 (163)
T ss_pred             CCCccccccceeeecccc
Confidence            444567777889987765


No 43 
>KOG1225|consensus
Probab=78.44  E-value=1.6  Score=44.25  Aligned_cols=27  Identities=30%  Similarity=0.839  Sum_probs=23.8

Q ss_pred             CCCCCCeeecCCCCceeeCCCCCCCCCCCCC
Q psy592           44 CYPEHGVCVGSMHTAKCQCLPGWSGTGCVTP   74 (339)
Q Consensus        44 pC~~~g~Cv~~~~~~~C~C~~g~~G~~Ce~~   74 (339)
                      .|.++|+|+    .-+|.|.+||+|..|++.
T Consensus       317 dC~g~G~Ci----~G~C~C~~Gy~G~~C~~~  343 (525)
T KOG1225|consen  317 DCSGHGKCI----DGECLCDEGYTGELCIQR  343 (525)
T ss_pred             cCCCCCccc----CCceEeCCCCcCCccccc
Confidence            798999999    347999999999999985


No 44 
>PF15330 SIT:  SHP2-interacting transmembrane adaptor protein, SIT
Probab=77.18  E-value=5.9  Score=31.53  Aligned_cols=15  Identities=7%  Similarity=0.224  Sum_probs=7.0

Q ss_pred             HHHHHHHHHHHHHhh
Q psy592          208 QVLVLVFVVYSRRRE  222 (339)
Q Consensus       208 l~~~l~~~~~~~~~~  222 (339)
                      |++++-++.|+..++
T Consensus        13 l~l~asl~~wr~~~r   27 (107)
T PF15330_consen   13 LSLAASLLAWRMKQR   27 (107)
T ss_pred             HHHHHHHHHHHHHhh
Confidence            334444455554443


No 45 
>PF01102 Glycophorin_A:  Glycophorin A;  InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Glycophorin A (PAS-2) and glycophorin B (PAS-3) belong to the MNS blood group system and are associated with antigens that include M/N, S/s, U, He, Mi(a), M(c), Vw, Mur, M(g), Vr, M(e), Mt(a), St(a), Ri(a), Cl(a), Ny(a), Hut, Hil, M(v), Far, Mit, Dantu, Hop, Nob, En(a), ENKT, amongst others. Glycophorin A is the major sialoglycoprotein of the erythrocyte membrane []. Structurally, glycophorin A consists of an N-terminal extracellular domain, heavily glycosylated on serine and threonine residues, followed by a transmembrane region and a C-terminal cytoplasmic domain. Other glycophorins in this entry such as Glycophorin B and Glycophorin E represent minor sialoglycoproteins in the erythrocyte membrane.; GO: 0016021 integral to membrane; PDB: 2KPF_B 1AFO_B 2KPE_A.
Probab=76.95  E-value=3.8  Score=33.38  Aligned_cols=9  Identities=11%  Similarity=0.386  Sum_probs=3.0

Q ss_pred             HHHHHHHHh
Q psy592          213 VFVVYSRRR  221 (339)
Q Consensus       213 ~~~~~~~~~  221 (339)
                      .|+++|+||
T Consensus        85 ~y~irR~~K   93 (122)
T PF01102_consen   85 SYCIRRLRK   93 (122)
T ss_dssp             HHHHHHHS-
T ss_pred             HHHHHHHhc
Confidence            333333333


No 46 
>cd00152 PTX Pentraxins are plasma proteins characterized by their pentameric discoid assembly and their Ca2+ dependent ligand binding, such as Serum amyloid P component (SAP) and C-reactive Protein (CRP), which are cytokine-inducible acute-phase proteins implicated in innate immunity. CRP binds to ligands containing phosphocholine, SAP binds to amyloid fibrils, DNA, chromatin, fibronectin, C4-binding proteins and glycosaminoglycans. "Long" pentraxins have N-terminal extensions to the common pentraxin domain; one group, the neuronal pentraxins, may be involved in synapse formation and remodeling, and they may also be able to form heteromultimers.
Probab=76.46  E-value=35  Score=29.94  Aligned_cols=80  Identities=18%  Similarity=0.110  Sum_probs=42.8

Q ss_pred             CccccC---cceEEEeccCCCCccccEEEEEEeecC--CceEEEEEecCCCC-ceEEEEEECCEEEEEEecCCCccccee
Q psy592           77 PVTFKP---QSYIKYALSFEPDKYTTQLQLKFRTRE--EFGELFRLSDQHNR-EYAILEIKDSKLRFRYNLNNLKTDEKD  150 (339)
Q Consensus        77 ~~~F~~---~s~v~y~~~~~~~~~~~~i~l~frT~~--~~GlL~~~~~~~~~-~~i~L~L~~G~l~~~~~lG~~~~~~~~  150 (339)
                      .+.|..   ..|+.+............+.+++|+..  ..+.||........ +++...-..|.+.+.+  ++.   ...
T Consensus         8 ~l~f~~~s~~~yv~l~~~~~~~l~~fTv~~Wv~~~~~~~~~~ifSy~~~~~~~~~~l~~~~~g~~~~~i--~~~---~~~   82 (201)
T cd00152           8 VFVFPKESDTSYVKLKPELPKPLQAFTLCLWVYTDLSTREYSLFSYATKGQDNELLLYKEKDGGYSLYI--GGK---EVT   82 (201)
T ss_pred             EEECCCCCCCceEEEccCCCCChhhEEEEEEEEecCCCCCeEEEEEeCCCCCCeEEEEEcCCCeEEEEE--cCE---EEE
Confidence            345544   356666544332344677888888764  45667755443223 3333333456666654  321   222


Q ss_pred             EEecccccCccc
Q psy592          151 IWLNAIKDSKLR  162 (339)
Q Consensus       151 l~~~~v~dG~WH  162 (339)
                      +. .+..+|+||
T Consensus        83 ~~-~~~~~g~W~   93 (201)
T cd00152          83 FK-VPESDGAWH   93 (201)
T ss_pred             Ee-ccCCCCCEE
Confidence            22 345889999


No 47 
>TIGR01478 STEVOR variant surface antigen, stevor family. This model represents the stevor branch of the rifin/stevor family (pfam02009) of predicted variant surface antigens as found in Plasmodium falciparum. This model is based on a set of stevor sequences kindly provided by Matt Berriman from the Sanger Center. This is a global model and assesses a penalty for incomplete sequence. Additional fragmentary sequences may be found with the fragment model and a cutoff of 8 bits.
Probab=75.95  E-value=3.3  Score=38.48  Aligned_cols=33  Identities=24%  Similarity=0.552  Sum_probs=15.9

Q ss_pred             ccccchHHHHHHHHHHHHHHHHHHHHHHHHhhhc
Q psy592          191 FVKTGSALFVTLLISPNQVLVLVFVVYSRRREAH  224 (339)
Q Consensus       191 ~~~~g~~il~~~~~l~~l~~~l~~~~~~~~~~~~  224 (339)
                      +.+-|||+|+.|++. +++++|-+++|||||+.+
T Consensus       258 F~Pcgiaalvllil~-vvliiLYiWlyrrRK~sw  290 (295)
T TIGR01478       258 FLPYGIAALVLIILT-VVLIILYIWLYRRRKKSW  290 (295)
T ss_pred             hcccHHHHHHHHHHH-HHHHHHHHHHHHhhcccc
Confidence            344456666554443 123333445566665443


No 48 
>PF13385 Laminin_G_3:  Concanavalin A-like lectin/glucanases superfamily; PDB: 4DQA_A 1N1Y_A 1MZ6_A 1MZ5_A 1N1S_A 2A75_A 1WCS_A 1N1T_A 1N1V_A 2FHR_A ....
Probab=75.94  E-value=26  Score=28.02  Aligned_cols=78  Identities=17%  Similarity=0.092  Sum_probs=37.8

Q ss_pred             ccCcceEEEeccCCCCccccEEEEEEeecCCce---EEEEEecCCCCceEEEEEE-CCEEEEEEecCCCcccceeEE-ec
Q psy592           80 FKPQSYIKYALSFEPDKYTTQLQLKFRTREEFG---ELFRLSDQHNREYAILEIK-DSKLRFRYNLNNLKTDEKDIW-LN  154 (339)
Q Consensus        80 F~~~s~v~y~~~~~~~~~~~~i~l~frT~~~~G---lL~~~~~~~~~~~i~L~L~-~G~l~~~~~lG~~~~~~~~l~-~~  154 (339)
                      |++++|+.+.....+ .....|++.||......   .++.  .....+...+.+. +|.+.+.+..+..  ....+. ..
T Consensus         6 ~g~~~~i~~~~~~~~-~~~fTi~~w~~~~~~~~~~~~~~~--~~~~~~~~~l~~~~~~~l~~~~~~~~~--~~~~~~~~~   80 (157)
T PF13385_consen    6 DGSNDYISIPNSDFP-SGSFTISFWVKPDSPSSSQSFVFM--DSSGSGGFGLFINNNGRLRFYIGNGGG--GNYSFSSDS   80 (157)
T ss_dssp             -STT-EEEEESGGGG-GTEEEEEEEEEESS--SSEEEEEE--SSSSSEEEEEEEETTSEEEEEETTSEE--ESS-EE-BS
T ss_pred             CCCCCEEEECCcCCC-CCCEEEEEEEEeCCCCCCceEEEE--ecCCCCEEEEEEECCCEEEEEEeCCCc--eeEEEecCc
Confidence            345788888753222 44678888888765432   3333  2112345555555 5767766554431  112232 34


Q ss_pred             ccccCccc
Q psy592          155 AIKDSKLR  162 (339)
Q Consensus       155 ~v~dG~WH  162 (339)
                      .+..++||
T Consensus        81 ~~~~~~W~   88 (157)
T PF13385_consen   81 NLPDNKWH   88 (157)
T ss_dssp             ---TT-EE
T ss_pred             ccCCCCEE
Confidence            67779999


No 49 
>KOG1217|consensus
Probab=75.50  E-value=2.8  Score=41.14  Aligned_cols=39  Identities=28%  Similarity=0.625  Sum_probs=32.8

Q ss_pred             cccCCCCCCCCCCCCCeeecCCCCceeeCCCCCCCCCCCCC
Q psy592           34 AVCNAHSRLHCYPEHGVCVGSMHTAKCQCLPGWSGTGCVTP   74 (339)
Q Consensus        34 ~~C~~~~~~~pC~~~g~Cv~~~~~~~C~C~~g~~G~~Ce~~   74 (339)
                      +.|...  ..+|.+++.|.+...+|.|.|+++|+|..|+..
T Consensus       170 ~~C~~~--~~~c~~~~~C~~~~~~~~C~c~~~~~~~~~~~~  208 (487)
T KOG1217|consen  170 DECIQY--SSPCQNGGTCVNTGGSYLCSCPPGYTGSTCETT  208 (487)
T ss_pred             cccccC--CCCcCCCcccccCCCCeeEeCCCCccCCcCcCC
Confidence            466633  237999999999999999999999999999865


No 50 
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=75.45  E-value=3  Score=28.12  Aligned_cols=19  Identities=32%  Similarity=0.744  Sum_probs=16.1

Q ss_pred             CCceeeCCCCCCCCCCCCC
Q psy592           56 HTAKCQCLPGWSGTGCVTP   74 (339)
Q Consensus        56 ~~~~C~C~~g~~G~~Ce~~   74 (339)
                      ..-+|.|.++++|..|++-
T Consensus        17 ~~G~C~C~~~~~G~~C~~C   35 (50)
T cd00055          17 GTGQCECKPNTTGRRCDRC   35 (50)
T ss_pred             CCCEEeCCCcCCCCCCCCC
Confidence            3457999999999999976


No 51 
>PTZ00370 STEVOR; Provisional
Probab=73.86  E-value=4  Score=38.03  Aligned_cols=31  Identities=26%  Similarity=0.495  Sum_probs=14.9

Q ss_pred             cccchHHHHHHHHHHHHHHHHHHHHHHHHhhh
Q psy592          192 VKTGSALFVTLLISPNQVLVLVFVVYSRRREA  223 (339)
Q Consensus       192 ~~~g~~il~~~~~l~~l~~~l~~~~~~~~~~~  223 (339)
                      .+-|||+|+.+++. +++++|-+++||||++.
T Consensus       255 ~Pygiaalvllil~-vvliilYiwlyrrRK~s  285 (296)
T PTZ00370        255 YPYGIAALVLLILA-VVLIILYIWLYRRRKNS  285 (296)
T ss_pred             cccHHHHHHHHHHH-HHHHHHHHHHHHhhcch
Confidence            44456655544333 13334444566666553


No 52 
>PF05399 EVI2A:  Ectropic viral integration site 2A protein (EVI2A);  InterPro: IPR008608 This family contains several mammalian ectropic viral integration site 2A (EVI2A) proteins. The function of this protein is unknown although it is thought to be a membrane protein and may function as an oncogene in retrovirus induced myeloid tumours [, ].; GO: 0016021 integral to membrane
Probab=72.69  E-value=6.6  Score=34.87  Aligned_cols=14  Identities=21%  Similarity=0.335  Sum_probs=9.0

Q ss_pred             hHHHHHHHHHHHHH
Q psy592          196 SALFVTLLISPNQV  209 (339)
Q Consensus       196 ~~il~~~~~l~~l~  209 (339)
                      .|+|+|++++|+|+
T Consensus       128 ~amLIClIIIAVLf  141 (227)
T PF05399_consen  128 MAMLICLIIIAVLF  141 (227)
T ss_pred             hhHHHHHHHHHHHH
Confidence            47777777764443


No 53 
>smart00051 DSL delta serrate ligand.
Probab=72.32  E-value=2.9  Score=29.91  Aligned_cols=24  Identities=33%  Similarity=0.790  Sum_probs=16.1

Q ss_pred             CCCCCeeecCCCCceeeCCCCCCCCCC
Q psy592           45 YPEHGVCVGSMHTAKCQCLPGWSGTGC   71 (339)
Q Consensus        45 C~~~g~Cv~~~~~~~C~C~~g~~G~~C   71 (339)
                      +..+.+|-.   .-.|.|.+||+|+.|
T Consensus        40 ~~~~~~Cd~---~G~~~C~~Gw~G~~C   63 (63)
T smart00051       40 FFGHYTCDE---NGNKGCLEGWMGPYC   63 (63)
T ss_pred             ccCCccCCc---CCCEecCCCCcCCCC
Confidence            334455632   235779999999988


No 54 
>PF02009 Rifin_STEVOR:  Rifin/stevor family;  InterPro: IPR002858 Malaria is still a major cause of mortality in many areas of the world. Plasmodium falciparum causes the most severe human form of the disease and is responsible for most fatalities. Severe cases of malaria can occur when the parasite invades and then proliferates within red blood cell erythrocytes. The parasite produces many variant antigenic proteins, encoded by multigene families, which are present on the surface of the infected erythrocyte and play important roles in virulence. A crucial survival mechanism for the malaria parasite is its ability to evade the immune response by switching these variant surface antigens. The high virulence of P. falciparum relative to other malarial parasites is in large part due to the fact that in this organism many of these surface antigens mediate the binding of infected erythrocytes to the vascular endothelium (cytoadherence) and non-infected erythrocytes (rosetting). This can lead to the accumulation of infected cells in the vasculature of a variety of organs, blocking the blood flow and reducing the oxygen supply. Clinical symptoms of severe infection can include fever, progressive anaemia, multi-organ dysfunction and coma. For more information see []. Several multicopy gene families have been described in Plasmodium falciparum, including the stevor family of subtelomeric open reading frames and the rif interspersed repetitive elements. Both families contain three predicted transmembrane segments. It has been proposed that stevor and rif are members of a larger superfamily that code for variant surface antigens [].
Probab=72.11  E-value=4.1  Score=38.50  Aligned_cols=14  Identities=21%  Similarity=0.738  Sum_probs=6.8

Q ss_pred             HHHHHHHHHHHHHh
Q psy592          208 QVLVLVFVVYSRRR  221 (339)
Q Consensus       208 l~~~l~~~~~~~~~  221 (339)
                      |+||+++++||.||
T Consensus       270 LIMvIIYLILRYRR  283 (299)
T PF02009_consen  270 LIMVIIYLILRYRR  283 (299)
T ss_pred             HHHHHHHHHHHHHH
Confidence            44445555555444


No 55 
>PF15102 TMEM154:  TMEM154 protein family
Probab=72.06  E-value=1.2  Score=37.23  Aligned_cols=14  Identities=14%  Similarity=0.418  Sum_probs=6.2

Q ss_pred             cHHHHHHHHHHHhc
Q psy592          288 NVGIFIEEHKKRAD  301 (339)
Q Consensus       288 ~~~~~i~~~~~~ad  301 (339)
                      ++..+++..=+.||
T Consensus       129 eldkwm~s~n~n~~  142 (146)
T PF15102_consen  129 ELDKWMNSMNRNAD  142 (146)
T ss_pred             HHHhHHHhhccCcc
Confidence            34445544433443


No 56 
>KOG1226|consensus
Probab=70.44  E-value=4.1  Score=42.69  Aligned_cols=27  Identities=41%  Similarity=1.114  Sum_probs=20.0

Q ss_pred             CCCCCCeeecCCCCceeeCCCCCCCCCCCCC
Q psy592           44 CYPEHGVCVGSMHTAKCQCLPGWSGTGCVTP   74 (339)
Q Consensus        44 pC~~~g~Cv~~~~~~~C~C~~g~~G~~Ce~~   74 (339)
                      -|..+|+|.-.    .|.|.+||+|..|+-.
T Consensus       556 lC~g~G~C~CG----~CvC~~GwtG~~C~C~  582 (783)
T KOG1226|consen  556 LCGGHGRCECG----RCVCNPGWTGSACNCP  582 (783)
T ss_pred             ccCCCCeEeCC----cEEcCCCCccCCCCCC
Confidence            46666666543    6999999999988754


No 57 
>PF01034 Syndecan:  Syndecan domain;  InterPro: IPR001050 The syndecans are transmembrane proteoglycans which are involved in the organisation of cytoskeleton and/or actin microfilaments, and have important roles as cell surface receptors during cell-cell and/or cell-matrix interactions [, ]. Structurally, these proteins consist of four separate domains:   A signal sequence; An extracellular domain (ectodomain) of variable length whose sequence is not evolutionary conserved in the various forms of syndecans. The ectodomain contains the sites of attachment of the heparan sulphate glycosaminoglycan side chains;  A transmembrane region;  A highly conserved cytoplasmic domain of about 30 to 35 residues, which could interact with cytoskeletal proteins.    The proteins known to belong to this family are:    Syndecan 1.  Syndecan 2 or fibroglycan.  Syndecan 3 or neuroglycan or N-syndecan.  Syndecan 4 or amphiglycan or ryudocan.  Drosophila syndecan.   Caenorhabditis elegans probable syndecan (F57C7.3).    Syndecan-4, a transmembrane heparan sulphate proteoglycan, is a coreceptor with integrins in cell adhesion. It has been suggested to form a ternary signalling complex with protein kinase Calpha and phosphatidylinositol 4,5-bisphosphate (PIP2). Structural studies have demonstrated that the cytoplasmic domain undergoes a conformational transition and forms a symmetric dimer in the presence of phospholipid activator PIP2, and whose overall structure in solution exhibits a twisted clamp shape having a cavity in the centre of dimeric interface. In addition, it has been observed that the syndecan-4 variable domain interacts, strongly, not only with fatty acyl groups but also the anionic head group of PIP2. These findings indicate that PIP2 promotes oligomerisation of the syndecan-4 cytoplasmic domain for transmembrane signalling and cell-matrix adhesion [, ].; GO: 0008092 cytoskeletal protein binding, 0016020 membrane; PDB: 1EJQ_B 1EJP_B 1YBO_C 1OBY_Q.
Probab=70.01  E-value=1.5  Score=31.31  Aligned_cols=15  Identities=27%  Similarity=0.879  Sum_probs=0.6

Q ss_pred             HHHHHHHHHHHHhhh
Q psy592          209 VLVLVFVVYSRRREA  223 (339)
Q Consensus       209 ~~~l~~~~~~~~~~~  223 (339)
                      +++++|++||-|++.
T Consensus        26 ilLIlf~iyR~rkkd   40 (64)
T PF01034_consen   26 ILLILFLIYRMRKKD   40 (64)
T ss_dssp             -----------S---
T ss_pred             HHHHHHHHHHHHhcC
Confidence            344556677655543


No 58 
>KOG1225|consensus
Probab=66.31  E-value=7.4  Score=39.59  Aligned_cols=27  Identities=33%  Similarity=0.897  Sum_probs=19.5

Q ss_pred             CCCCCeeecCCCCceeeCCCCCCCCCCCCCC
Q psy592           45 YPEHGVCVGSMHTAKCQCLPGWSGTGCVTPT   75 (339)
Q Consensus        45 C~~~g~Cv~~~~~~~C~C~~g~~G~~Ce~~~   75 (339)
                      |.++|.|++-    .|+|++||+|..|++..
T Consensus       256 c~~~g~c~~G----~CIC~~Gf~G~dC~e~~  282 (525)
T KOG1225|consen  256 CTGRGQCVEG----RCICPPGFTGDDCDELV  282 (525)
T ss_pred             CcccceEeCC----eEeCCCCCcCCCCCccc
Confidence            4444555553    69999999999998854


No 59 
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=65.05  E-value=4.8  Score=26.65  Aligned_cols=18  Identities=33%  Similarity=0.827  Sum_probs=15.6

Q ss_pred             CceeeCCCCCCCCCCCCC
Q psy592           57 TAKCQCLPGWSGTGCVTP   74 (339)
Q Consensus        57 ~~~C~C~~g~~G~~Ce~~   74 (339)
                      +-.|.|.++++|++|+.-
T Consensus        17 ~G~C~C~~~~~G~~C~~C   34 (46)
T smart00180       17 TGQCECKPNVTGRRCDRC   34 (46)
T ss_pred             CCEEECCCCCCCCCCCcC
Confidence            448999999999999965


No 60 
>PF08693 SKG6:  Transmembrane alpha-helix domain;  InterPro: IPR014805 SKG6 and AXL2 are membrane proteins that show polarised intracellular localisation [, ]. This entry represents the highly conserved transmembrane alpha-helical domain found in these proteins [, ]. The full-length AXL2 protein has a negative regulatory function in cytokinesis [].
Probab=62.58  E-value=1.6  Score=28.25  Aligned_cols=9  Identities=33%  Similarity=0.766  Sum_probs=3.7

Q ss_pred             HHHHHHHHH
Q psy592          211 VLVFVVYSR  219 (339)
Q Consensus       211 ~l~~~~~~~  219 (339)
                      ++++++|+|
T Consensus        30 ~~l~~~~rR   38 (40)
T PF08693_consen   30 AFLFFWYRR   38 (40)
T ss_pred             HHhheEEec
Confidence            333444444


No 61 
>PF14575 EphA2_TM:  Ephrin type-A receptor 2 transmembrane domain; PDB: 3KUL_A 2XVD_A 2VX1_A 2VWV_A 2VX0_A 2VWY_A 2VWZ_A 2VWW_A 2VWU_A 2VWX_A ....
Probab=59.70  E-value=4.5  Score=29.98  Aligned_cols=13  Identities=46%  Similarity=0.713  Sum_probs=4.9

Q ss_pred             HHHHHHHHHHHHH
Q psy592          208 QVLVLVFVVYSRR  220 (339)
Q Consensus       208 l~~~l~~~~~~~~  220 (339)
                      |+++++++++.||
T Consensus        14 l~~v~~~~~~~rr   26 (75)
T PF14575_consen   14 LVLVIIVIVCFRR   26 (75)
T ss_dssp             HHHHHHHHCCCTT
T ss_pred             HHhheeEEEEEee
Confidence            3333333333333


No 62 
>KOG4260|consensus
Probab=57.13  E-value=6.9  Score=36.20  Aligned_cols=39  Identities=23%  Similarity=0.468  Sum_probs=30.6

Q ss_pred             CCCCCCCcccCCCCCCCCCCCCCeeecCCCCceeeCCCCCCC
Q psy592           27 PGCPATEAVCNAHSRLHCYPEHGVCVGSMHTAKCQCLPGWSG   68 (339)
Q Consensus        27 ~gC~~~~~~C~~~~~~~pC~~~g~Cv~~~~~~~C~C~~g~~G   68 (339)
                      .+|. ..+.|...  .+||..+-.|++.-++|+|.+.+||.+
T Consensus       231 ~gCv-DvnEC~~e--p~~c~~~qfCvNteGSf~C~dk~Gy~~  269 (350)
T KOG4260|consen  231 EGCV-DVNECQNE--PAPCKAHQFCVNTEGSFKCEDKEGYKK  269 (350)
T ss_pred             cccc-cHHHHhcC--CCCCChhheeecCCCceEecccccccC
Confidence            4553 35677654  458988889999999999999999865


No 63 
>KOG3509|consensus
Probab=56.90  E-value=13  Score=40.44  Aligned_cols=117  Identities=10%  Similarity=0.020  Sum_probs=59.0

Q ss_pred             CCCCCCCeeecCC-CCceeeCCCCCCCCCCCCCCCCc--cccC-----cceEEEeccCCC------------CccccEEE
Q psy592           43 HCYPEHGVCVGSM-HTAKCQCLPGWSGTGCVTPTSPV--TFKP-----QSYIKYALSFEP------------DKYTTQLQ  102 (339)
Q Consensus        43 ~pC~~~g~Cv~~~-~~~~C~C~~g~~G~~Ce~~~~~~--~F~~-----~s~v~y~~~~~~------------~~~~~~i~  102 (339)
                      ++|+-+++|.... .+|.|.|+..-.+..|+......  .|..     ...+.|-.+...            ...+..-.
T Consensus       178 ~~~q~g~tC~~~~~~~~~~~~~~~~~~~~c~~~~~r~~~~f~~~~~g~~~~~~~~vp~~~e~S~~~~~~~h~~~~~~~~~  257 (964)
T KOG3509|consen  178 PVCQGGATCEVRNGKGYSLECPDCKVRVVCEACKPRAFCPFEKSVEGCLKCFCFGVPRPSESSLHAFRAIHGATLHVDSL  257 (964)
T ss_pred             cccccceeEEecCCcceeeeccccccceehhhccCceecccccccccccceeecCCCccccchhhhHhhhccchhccchh
Confidence            3899999998875 58888888776666677653110  0111     111112111100            00111112


Q ss_pred             EEEeecCC--ceEEEEEecCCCCceEEEEEECCEEEEEEecCCCcccceeEEe-cccccCccc
Q psy592          103 LKFRTREE--FGELFRLSDQHNREYAILEIKDSKLRFRYNLNNLKTDEKDIWL-NAIKDSKLR  162 (339)
Q Consensus       103 l~frT~~~--~GlL~~~~~~~~~~~i~L~L~~G~l~~~~~lG~~~~~~~~l~~-~~v~dG~WH  162 (339)
                      ..|++...  +.++.+.......+|+.+.+..|.+.+++..+..   ...+.. .+..+|+||
T Consensus       258 ~~~~~~s~~~~~~~~~~~~~~~~~f~~lt~~~g~~g~~~~~~~~---~~~~~~~~~~~~~E~~  317 (964)
T KOG3509|consen  258 GVFFSDSEHRDILGNFLFSSFKDGFRALTLDGGTDGVRYDCGLP---QREDRLDVTSYIGEWR  317 (964)
T ss_pred             eeeccccccccccccccccccccceeeeccCCCCccccccccCc---chhhhhccccccceee
Confidence            22333332  3344444433346788888877777777766542   222222 345567777


No 64 
>PF05923 APC_crr:  APC cysteine-rich region;  InterPro: IPR009223 This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin [].; GO: 0016055 Wnt receptor signaling pathway; PDB: 1T08_C 1TH1_C 1V18_B.
Probab=56.05  E-value=4.1  Score=23.75  Aligned_cols=21  Identities=48%  Similarity=0.540  Sum_probs=1.3

Q ss_pred             CcceeeeecCCC---CCccccccc
Q psy592          310 DDLRNYAYEGGG---STAGSLSSL  330 (339)
Q Consensus       310 Dsl~~y~~EG~g---S~agSlssl  330 (339)
                      |.++.|.-|+-=   |.|.|||||
T Consensus         2 d~~~~~~vE~TP~~fS~~sSlSsL   25 (26)
T PF05923_consen    2 DSPKKYCVEDTPACFSRASSLSSL   25 (26)
T ss_dssp             --------SS--------------
T ss_pred             CccccceecCCCccccccccccCC
Confidence            456667766644   677888887


No 65 
>PF15050 SCIMP:  SCIMP protein
Probab=55.02  E-value=20  Score=28.91  Aligned_cols=11  Identities=27%  Similarity=0.670  Sum_probs=5.6

Q ss_pred             CCchhhccccc
Q psy592          230 PDDDVRENIIN  240 (339)
Q Consensus       230 ~~~d~reni~~  240 (339)
                      ++|.+=||+++
T Consensus        53 deEkmYENv~n   63 (133)
T PF15050_consen   53 DEEKMYENVLN   63 (133)
T ss_pred             cHHHHHHHhhc
Confidence            34445566653


No 66 
>KOG1214|consensus
Probab=53.90  E-value=7.7  Score=41.22  Aligned_cols=36  Identities=25%  Similarity=0.686  Sum_probs=29.3

Q ss_pred             cCCCCCCCCCCCCCeeecCCCCceeeCCCCCCC--CCCCC
Q psy592           36 CNAHSRLHCYPEHGVCVGSMHTAKCQCLPGWSG--TGCVT   73 (339)
Q Consensus        36 C~~~~~~~pC~~~g~Cv~~~~~~~C~C~~g~~G--~~Ce~   73 (339)
                      |+.+  -+.|..++.|.+...+|.|+|..||+|  ++|-.
T Consensus       417 C~~n--~hqCs~HA~CtdyatgFcC~C~~gy~GnGr~C~~  454 (1289)
T KOG1214|consen  417 CEHN--HHQCSRHAFCTDYATGFCCHCQSGYYGNGRHCLP  454 (1289)
T ss_pred             hhcc--cccccccceeccCCCceeeeecccccCCCceecc
Confidence            5544  357888899999999999999999975  57854


No 67 
>PTZ00046 rifin; Provisional
Probab=51.56  E-value=14  Score=35.75  Aligned_cols=15  Identities=20%  Similarity=0.716  Sum_probs=7.5

Q ss_pred             HHHHHHHHHHHHHhh
Q psy592          208 QVLVLVFVVYSRRRE  222 (339)
Q Consensus       208 l~~~l~~~~~~~~~~  222 (339)
                      |+||++++++|-||+
T Consensus       329 LIMvIIYLILRYRRK  343 (358)
T PTZ00046        329 LIMVIIYLILRYRRK  343 (358)
T ss_pred             HHHHHHHHHHHhhhc
Confidence            455555555554443


No 68 
>KOG1836|consensus
Probab=51.08  E-value=12  Score=43.29  Aligned_cols=32  Identities=28%  Similarity=0.690  Sum_probs=27.2

Q ss_pred             CCCCCCeeecCC--CCceee-CCCCCCCCCCCCCC
Q psy592           44 CYPEHGVCVGSM--HTAKCQ-CLPGWSGTGCVTPT   75 (339)
Q Consensus        44 pC~~~g~Cv~~~--~~~~C~-C~~g~~G~~Ce~~~   75 (339)
                      ||.+++.|....  ....|. |++||+|.+|+...
T Consensus       781 ~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~rCe~c~  815 (1705)
T KOG1836|consen  781 PCPNGGACGQTPEILEVVCKNCPPGYTGLRCEECA  815 (1705)
T ss_pred             CCCCChhhcCcCcccceecCCCCCCCcccccccCC
Confidence            788888898764  578899 99999999999764


No 69 
>PF02973 Sialidase:  Sialidase, N-terminal domain;  InterPro: IPR004124 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Sialidases (GH33 from CAZY) hydrolyse alpha-(2->3)-, alpha-(2->6)-, alpha-(2->8)-glycosidic linkages of terminal sialic residues in oligosaccharides, glycoproteins, glycolipids, colominic acid and synthetic substrates. Sialidases may act as pathogenic factors in microbial infections [].  The 1.8 A structure of trans-sialidase from leech (Macrobdella decora, Q27701 from SWISSPROT) in complex with 2-deoxy-2, 3-didehydro-NeuAc was solved. The refined model comprising residues 81-769 has a catalytic beta-propeller domain, a N-terminal lectin-like domain and an irregular beta-stranded domain inserted into the catalytic domain [].; GO: 0004308 exo-alpha-sialidase activity, 0005975 carbohydrate metabolic process; PDB: 2JKB_A 2VW2_A 2VW0_A 2VW1_A 2V73_B 2V72_A 1SLI_A 1SLL_A 2SLI_A 4SLI_A ....
Probab=50.00  E-value=1.2e+02  Score=26.68  Aligned_cols=46  Identities=24%  Similarity=0.467  Sum_probs=34.2

Q ss_pred             ccEEEEEEeecCCceE--EEEEecC-CCCceEEEEEECCEEEEEEecCC
Q psy592           98 TTQLQLKFRTREEFGE--LFRLSDQ-HNREYAILEIKDSKLRFRYNLNN  143 (339)
Q Consensus        98 ~~~i~l~frT~~~~Gl--L~~~~~~-~~~~~i~L~L~~G~l~~~~~lG~  143 (339)
                      .-.|.++||+...+++  ||.+++. ...+|+.|.+.++.+-+.++-..
T Consensus        34 ~gTI~i~Fk~~~~~~~~sLfsiSn~~~~n~YF~lyv~~~~~G~E~R~~~   82 (190)
T PF02973_consen   34 EGTIVIRFKSDSNSGIQSLFSISNSTKGNEYFSLYVSNNKLGFELRDTK   82 (190)
T ss_dssp             SEEEEEEEEESS-SSEEEEEEEE-TSTTSEEEEEEEETTEEEEEEEETT
T ss_pred             ccEEEEEEecCCCcceeEEEEecCCCCccceEEEEEECCEEEEEEecCC
Confidence            4579999999666543  7777754 34689999999999888887654


No 70 
>TIGR01477 RIFIN variant surface antigen, rifin family. This model represents the rifin branch of the rifin/stevor family (pfam02009) of predicted variant surface antigens as found in Plasmodium falciparum. This model is based on a set of rifin sequences kindly provided by Matt Berriman from the Sanger Center. This is a global model and assesses a penalty for incomplete sequence. Additional fragmentary sequences may be found with the fragment model and a cutoff of 20 bits.
Probab=49.60  E-value=15  Score=35.30  Aligned_cols=15  Identities=20%  Similarity=0.716  Sum_probs=7.1

Q ss_pred             HHHHHHHHHHHHHhh
Q psy592          208 QVLVLVFVVYSRRRE  222 (339)
Q Consensus       208 l~~~l~~~~~~~~~~  222 (339)
                      |+||++++++|-||+
T Consensus       324 LIMvIIYLILRYRRK  338 (353)
T TIGR01477       324 LIMVIIYLILRYRRK  338 (353)
T ss_pred             HHHHHHHHHHHhhhc
Confidence            444555554444443


No 71 
>PF15176 LRR19-TM:  Leucine-rich repeat family 19 TM domain
Probab=49.36  E-value=63  Score=25.33  Aligned_cols=10  Identities=10%  Similarity=0.384  Sum_probs=6.3

Q ss_pred             hHHHHHHHHH
Q psy592          196 SALFVTLLIS  205 (339)
Q Consensus       196 ~~il~~~~~l  205 (339)
                      ++.|+++++.
T Consensus        17 W~~LVGVv~~   26 (102)
T PF15176_consen   17 WPFLVGVVVT   26 (102)
T ss_pred             cHhHHHHHHH
Confidence            5666666655


No 72 
>PF08374 Protocadherin:  Protocadherin;  InterPro: IPR013585 The structure of protocadherins is similar to that of classic cadherins (IPR002126 from INTERPRO), but they also have some unique features associated with the cytoplasmic domains. They are expressed in a variety of organisms and are found in high concentrations in the brain where they seem to be localised mainly at cell-cell contact sites. Their expression seems to be developmentally regulated []. 
Probab=48.95  E-value=11  Score=33.67  Aligned_cols=10  Identities=10%  Similarity=0.218  Sum_probs=5.5

Q ss_pred             hHHHHHHHHH
Q psy592          196 SALFVTLLIS  205 (339)
Q Consensus       196 ~~il~~~~~l  205 (339)
                      +||++.++..
T Consensus        41 iaiVAG~~tV   50 (221)
T PF08374_consen   41 IAIVAGIMTV   50 (221)
T ss_pred             eeeecchhhh
Confidence            4555555554


No 73 
>PF00558 Vpu:  Vpu protein;  InterPro: IPR008187 The Human immunodeficiency virus 1 (HIV-1) Vpu protein acts in the degradation of CD4 in the endoplasmic reticulum and in the enhancement of virion release from the plasma membrane of infected cells [].; GO: 0019076 release of virus from host; PDB: 2JPX_A 1PI8_A 2GOH_A 2GOF_A 1PI7_A 1PJE_A 1VPU_A 2K7Y_A.
Probab=44.52  E-value=33  Score=25.81  Aligned_cols=9  Identities=33%  Similarity=0.582  Sum_probs=2.7

Q ss_pred             HHHHHHHHH
Q psy592          210 LVLVFVVYS  218 (339)
Q Consensus       210 ~~l~~~~~~  218 (339)
                      +++..++|.
T Consensus        20 IvvW~iv~i   28 (81)
T PF00558_consen   20 IVVWTIVYI   28 (81)
T ss_dssp             HHHHHHH--
T ss_pred             HHHHHHHHH
Confidence            333334443


No 74 
>PF14991 MLANA:  Protein melan-A; PDB: 2GTZ_F 2GT9_F 3MRO_P 2GUO_C 3MRQ_P 2GTW_C 3L6F_C 3MRP_P.
Probab=42.94  E-value=7  Score=31.24  Aligned_cols=12  Identities=42%  Similarity=0.709  Sum_probs=0.0

Q ss_pred             cccCCCCCccCC
Q psy592          240 NYDDEGGGEDDM  251 (339)
Q Consensus       240 ~Y~eEggGE~d~  251 (339)
                      .+..|+++-+|.
T Consensus        72 rc~~e~~~~qds   83 (118)
T PF14991_consen   72 RCYSEGGGHQDS   83 (118)
T ss_dssp             ------------
T ss_pred             cccccccccccc
Confidence            355577777763


No 75 
>PF12191 stn_TNFRSF12A:  Tumour necrosis factor receptor stn_TNFRSF12A_TNFR domain;  InterPro: IPR022316 The tumour necrosis factor (TNF) receptor (TNFR) superfamily comprises more than 20 type-I transmembrane proteins. Family members are defined based on similarity in their extracellular domain - a region that contains many cysteine residues arranged in a specific repetitive pattern []. The cysteines allow formation of an extended rod-like structure, responsible for ligand binding []. Upon receptor activation, different intracellular signalling complexes are assembled for different members of the TNFR superfamily, depending on their intracellular domains and sequences []. Activation of TNFRs can therefore induce a range of disparate effects, including cell proliferation, differentiation, survival, or apoptotic cell death, depending upon the receptor involved []. TNFRs are widely distributed and play important roles in many crucial biological processes, such as lymphoid and neuronal development, innate and adaptive immunity, and maintenance of cellular homeostasis []. Drugs that manipulate their signalling have potential roles in the prevention and treatment of many diseases, such as viral infections, coronary heart disease, transplant rejection, and immune disease []. TNF receptor 12 (also known as TWEAK receptor, and fibroblast growth factor-inducible-14 (Fn14)) has been implicated in endothelial cell growth and migration []. The receptor may also play a role in cell-matrix interactions [].; PDB: 2KN0_A 2RPJ_A 2KMZ_A 2EQP_A.
Probab=42.03  E-value=8.5  Score=31.37  Aligned_cols=14  Identities=36%  Similarity=0.560  Sum_probs=0.0

Q ss_pred             HHHHHHHHHHHhhh
Q psy592          210 LVLVFVVYSRRREA  223 (339)
Q Consensus       210 ~~l~~~~~~~~~~~  223 (339)
                      ++.+|++|+|+|++
T Consensus        94 llsg~lv~rrcrrr  107 (129)
T PF12191_consen   94 LLSGFLVWRRCRRR  107 (129)
T ss_dssp             --------------
T ss_pred             HHHHHHHHhhhhcc
Confidence            33456777776654


No 76 
>PF01299 Lamp:  Lysosome-associated membrane glycoprotein (Lamp);  InterPro: IPR002000 Lysosome-associated membrane glycoproteins (lamp) [] are integral membrane proteins, specific to lysosomes, and whose exact biological function is not yet clear. Structurally, the lamp proteins consist of two internally homologous lysosome-luminal domains separated by a proline-rich hinge region; at the C-terminal extremity there is a transmembrane region (TM) followed by a very short cytoplasmic tail (C). In each of the duplicated domains, there are two conserved disulphide bonds. This structure is schematically represented in the figure below.   +-----+ +-----+ +-----+ +-----+ | | | | | | | | xCxxxxxCxxxxxxxxxxxxCxxxxxCxxxxxxxxxCxxxxxCxxxxxxxxxxxxCxxxxxCxxxxxxxx +--------------------------++Hinge++--------------------------++TM++C+  In mammals, there are two closely related types of lamp: lamp-1 and lamp-2, which form major components of the lysosome membrane. In chicken lamp-1 is known as LEP100.  Also included in this entry is the macrophage protein CD68 (or macrosialin) [] is a heavily glycosylated integral membrane protein whose structure consists of a mucin-like domain followed by a proline-rich hinge; a single lamp-like domain; a transmembrane region and a short cytoplasmic tail.   Similar to CD68, mammalian lamp-3, which is expressed in lymphoid organs, dendritic cells and in lung, contains all the C-terminal regions but lacks the N-terminal lamp-like region []. In a lamp-family protein from nematodes [] only the part C-terminal to the hinge is conserved. ; GO: 0016020 membrane
Probab=41.86  E-value=13  Score=35.07  Aligned_cols=14  Identities=29%  Similarity=0.681  Sum_probs=5.9

Q ss_pred             HHHHHH-HHHHHHHh
Q psy592          208 QVLVLV-FVVYSRRR  221 (339)
Q Consensus       208 l~~~l~-~~~~~~~~  221 (339)
                      +++||+ +++.|||+
T Consensus       285 vlivLiaYli~Rrr~  299 (306)
T PF01299_consen  285 VLIVLIAYLIGRRRS  299 (306)
T ss_pred             HHHHHHhheeEeccc
Confidence            444443 34444443


No 77 
>PF15330 SIT:  SHP2-interacting transmembrane adaptor protein, SIT
Probab=41.72  E-value=56  Score=25.97  Aligned_cols=21  Identities=19%  Similarity=0.313  Sum_probs=10.2

Q ss_pred             HHHHHHHHHHHHHHHHHHhhh
Q psy592          203 LISPNQVLVLVFVVYSRRREA  223 (339)
Q Consensus       203 ~~l~~l~~~l~~~~~~~~~~~  223 (339)
                      |+|+|.+-++.+...+|+++.
T Consensus        11 Lll~l~asl~~wr~~~rq~k~   31 (107)
T PF15330_consen   11 LLLSLAASLLAWRMKQRQKKA   31 (107)
T ss_pred             HHHHHHHHHHHHHHHhhhccc
Confidence            333445555555555555553


No 78 
>PF14851 FAM176:  FAM176 family
Probab=41.33  E-value=36  Score=28.87  Aligned_cols=11  Identities=9%  Similarity=0.190  Sum_probs=5.6

Q ss_pred             cCCCCCCCCCc
Q psy592          301 DADPNAPPFDD  311 (339)
Q Consensus       301 d~d~~~~p~Ds  311 (339)
                      +..|+.++--+
T Consensus       138 n~~~d~~~~~~  148 (153)
T PF14851_consen  138 NGQPDILGTRT  148 (153)
T ss_pred             hcCCCCCCCCC
Confidence            45565554433


No 79 
>PF02439 Adeno_E3_CR2:  Adenovirus E3 region protein CR2;  InterPro: IPR003470 Early region 3 (E3) of human adenoviruses (Ads) codes for proteins that appear to control viral interactions with the host []. This region called CR1 (conserved region 1) [] is found three times in Human adenovirus 19 (a subgroup D adenovirus) 49 kDa protein in the E3 region. CR1 is also found in the 20.1 Kd protein of subgroup B adenoviruses. The function of this 80 amino acid region is unknown. This region is probably a divergent immunoglobulin domain.
Probab=40.81  E-value=45  Score=21.28  Aligned_cols=6  Identities=33%  Similarity=0.667  Sum_probs=2.4

Q ss_pred             HHHHHH
Q psy592          197 ALFVTL  202 (339)
Q Consensus       197 ~il~~~  202 (339)
                      +|.+.+
T Consensus         7 aIIv~V   12 (38)
T PF02439_consen    7 AIIVAV   12 (38)
T ss_pred             hHHHHH
Confidence            444443


No 80 
>PF05393 Hum_adeno_E3A:  Human adenovirus early E3A glycoprotein;  InterPro: IPR008652 This family consists of several early glycoproteins (E3A), from human adenovirus type 2.; GO: 0016021 integral to membrane
Probab=40.31  E-value=73  Score=24.33  Aligned_cols=14  Identities=29%  Similarity=0.738  Sum_probs=6.3

Q ss_pred             HHHHHHHHHH-HHhh
Q psy592          209 VLVLVFVVYS-RRRE  222 (339)
Q Consensus       209 ~~~l~~~~~~-~~~~  222 (339)
                      ++|++++++. +||+
T Consensus        46 l~VilwfvCC~kRkr   60 (94)
T PF05393_consen   46 LLVILWFVCCKKRKR   60 (94)
T ss_pred             HHHHHHHHHHHHhhh
Confidence            3344555444 4443


No 81 
>PF05454 DAG1:  Dystroglycan (Dystrophin-associated glycoprotein 1);  InterPro: IPR008465 Dystroglycan is one of the dystrophin-associated glycoproteins, which is encoded by a 5.5 kb transcript in Homo sapiens. The protein product is cleaved into two non-covalently associated subunits, [alpha] (N-terminal) and [beta] (C-terminal). In skeletal muscle the dystroglycan complex works as a transmembrane linkage between the extracellular matrix and the cytoskeleton [alpha]-dystroglycan is extracellular and binds to merosin ([alpha]-2 laminin) in the basement membrane, while [beta]-dystroglycan is a transmembrane protein and binds to dystrophin, which is a large rod-like cytoskeletal protein, absent in Duchenne muscular dystrophy patients. Dystrophin binds to intracellular actin cables. In this way, the dystroglycan complex, which links the extracellular matrix to the intracellular actin cables, is thought to provide structural integrity in muscle tissues. The dystroglycan complex is also known to serve as an agrin receptor in muscle, where it may regulate agrin-induced acetylcholine receptor clustering at the neuromuscular junction. There is also evidence which suggests the function of dystroglycan as a part of the signal transduction pathway because it is shown that Grb2, a mediator of the Ras-related signal pathway, can interact with the cytoplasmic domain of dystroglycan. In general, aberrant expression of dystrophin-associated protein complex underlies the pathogenesis of Duchenne muscular dystrophy, Becker muscular dystrophy and severe childhood autosomal recessive muscular dystrophy. Interestingly, no genetic disease has been described for either [alpha]- or [beta]-dystroglycan. Dystroglycan is widely distributed in non-muscle tissues as well as in muscle tissues. During epithelial morphogenesis of kidney, the dystroglycan complex is shown to act as a receptor for the basement membrane. Dystroglycan expression in Mus musculus brain and neural retina has also been reported. However, the physiological role of dystroglycan in non-muscle tissues has remained unclear [].; PDB: 1EG4_P.
Probab=39.98  E-value=9.5  Score=35.84  Aligned_cols=16  Identities=13%  Similarity=0.111  Sum_probs=0.0

Q ss_pred             EEEECCEEEEEEecCC
Q psy592          128 LEIKDSKLRFRYNLNN  143 (339)
Q Consensus       128 L~L~~G~l~~~~~lG~  143 (339)
                      ..+.+|...+.|...+
T Consensus        43 ~sIt~gstiVtwtNnT   58 (290)
T PF05454_consen   43 RSITSGSTIVTWTNNT   58 (290)
T ss_dssp             ----------------
T ss_pred             EEecCCCEEEEEEcCC
Confidence            3556666666665443


No 82 
>PF15347 PAG:  Phosphoprotein associated with glycosphingolipid-enriched
Probab=39.52  E-value=35  Score=33.07  Aligned_cols=15  Identities=20%  Similarity=0.020  Sum_probs=8.2

Q ss_pred             cHHHHHHHHHHHhcC
Q psy592          288 NVGIFIEEHKKRADA  302 (339)
Q Consensus       288 ~~~~~i~~~~~~ad~  302 (339)
                      .|.|.|.+-+++...
T Consensus       176 ~vEDcLYETVKEiKe  190 (428)
T PF15347_consen  176 MVEDCLYETVKEIKE  190 (428)
T ss_pred             hHHHHHHHHHHHHhh
Confidence            345666665555543


No 83 
>PHA03049 IMV membrane protein; Provisional
Probab=39.35  E-value=46  Score=23.95  Aligned_cols=23  Identities=13%  Similarity=0.478  Sum_probs=12.4

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHhhh
Q psy592          197 ALFVTLLISPNQVLVLVFVVYSRRREA  223 (339)
Q Consensus       197 ~il~~~~~l~~l~~~l~~~~~~~~~~~  223 (339)
                      .+++|++++    .++++.+|+|.+..
T Consensus         7 l~iICVaIi----~lIvYgiYnkk~~~   29 (68)
T PHA03049          7 LVIICVVII----GLIVYGIYNKKTTT   29 (68)
T ss_pred             HHHHHHHHH----HHHHHHHHhccccc
Confidence            344454444    34456778776653


No 84 
>PF10731 Anophelin:  Thrombin inhibitor from mosquito;  InterPro: IPR018932  Members of this family are all inhibitors of thrombin, the peptidase that is at the end of the blood coagulation cascade and which creates the clot by cleaving fibrinogen. The interaction between thrombin and fibrinogen involves two different areas of contact - via the thrombin active site and via a second substrate-binding site known as an exosite. The inhibitor acts by blocking the exosite, rather than by interacting with the active site. The inhibitors are from mosquitoes that feed on human blood and which, by inhibiting thrombin, prevent the blood from clotting and keep it flowing. 
Probab=38.63  E-value=57  Score=23.00  Aligned_cols=9  Identities=22%  Similarity=0.807  Sum_probs=4.2

Q ss_pred             cccCCCCCc
Q psy592          240 NYDDEGGGE  248 (339)
Q Consensus       240 ~Y~eEggGE  248 (339)
                      .|||+-+.+
T Consensus        32 ~YDEdd~dd   40 (65)
T PF10731_consen   32 SYDEDDDDD   40 (65)
T ss_pred             CcCcccCcc
Confidence            455544333


No 85 
>KOG3607|consensus
Probab=37.02  E-value=25  Score=37.34  Aligned_cols=28  Identities=36%  Similarity=0.879  Sum_probs=21.0

Q ss_pred             CCCCCCeeecCCCCceeeCCCCCCCCCCCCC
Q psy592           44 CYPEHGVCVGSMHTAKCQCLPGWSGTGCVTP   74 (339)
Q Consensus        44 pC~~~g~Cv~~~~~~~C~C~~g~~G~~Ce~~   74 (339)
                      .|..+|.|-.   .+.|+|.+||.++.|++.
T Consensus       631 ~C~g~GVCnn---~~~ChC~~gwapp~C~~~  658 (716)
T KOG3607|consen  631 TCNGHGVCNN---ELNCHCEPGWAPPFCFIF  658 (716)
T ss_pred             ccCCCcccCC---CcceeeCCCCCCCccccc
Confidence            3555566654   457999999999999765


No 86 
>PF04272 Phospholamban:  Phospholamban;  InterPro: IPR005984  Phospholamban (PLB) is a small protein (52 amino acids) that regulates the affinity of the cardiac sarcoplasmic reticulum Ca2+-ATPase (SERCA2a) for calcium. PLB is present in cardiac myocytes, in slow-twitch and smooth muscle and is expressed also in aorta endothelial cells in which it could play a role in tissue relaxation. The phosphorylation/dephosphorylation of phospholamban removes and restores, respectively, its inhibitory activity on SERCA2a. It has in fact been shown that phospholamban, in its non-phosphorylated form, binds to SERCA2a and inhibits this pump by lowering its affinity for Ca2+, whereas the phosphorylated form does not exert the inhibition. PLB is phosphorylated at two sites, namely at Ser-16 for a cAMP-dependent phosphokinase and at Thr-17 for a Ca2+/calmodulin-dependent phosphokinase, phosphorylation at Ser-16 being a prerequisite for the phosphorylation at Thr-17.   The structure of a 36-amino-acid-long N-terminal fragment of human phospholamban phosphorylated at Ser-16 and Thr-17 and Cys36Ser mutated was determined from nuclear magnetic resonance data. The peptide assumes a conformation characterised by two alpha-helices connected by an irregular strand, which comprises the amino acids from Arg-13 to Pro-21. The proline is in a trans conformation. The two phosphate groups on Ser-16 and Thr-17 are shown to interact preferably with the side chains of Arg-14 and Arg-13, respectively [].; GO: 0005246 calcium channel regulator activity, 0042030 ATPase inhibitor activity, 0006816 calcium ion transport, 0016020 membrane; PDB: 1N7L_A 1FJP_A 1FJK_A 2HYN_C 1ZLL_D 1PLP_A 3O7L_I.
Probab=36.88  E-value=31  Score=22.77  Aligned_cols=9  Identities=22%  Similarity=0.563  Sum_probs=4.6

Q ss_pred             HHHHHHHHH
Q psy592          197 ALFVTLLIS  205 (339)
Q Consensus       197 ~il~~~~~l  205 (339)
                      .||+|++++
T Consensus        37 lilicllli   45 (52)
T PF04272_consen   37 LILICLLLI   45 (52)
T ss_dssp             HHHHHHHHH
T ss_pred             HHHHHHHHH
Confidence            455555444


No 87 
>PF09064 Tme5_EGF_like:  Thrombomodulin like fifth domain, EGF-like;  InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=36.27  E-value=31  Score=21.43  Aligned_cols=17  Identities=24%  Similarity=0.632  Sum_probs=10.9

Q ss_pred             eeecCCCCceeeCCCCCC
Q psy592           50 VCVGSMHTAKCQCLPGWS   67 (339)
Q Consensus        50 ~Cv~~~~~~~C~C~~g~~   67 (339)
                      .|.+.. ...|.|+.||.
T Consensus        11 ~CDpn~-~~~C~CPeGyI   27 (34)
T PF09064_consen   11 DCDPNS-PGQCFCPEGYI   27 (34)
T ss_pred             ccCCCC-CCceeCCCceE
Confidence            344432 34899999983


No 88 
>PRK00523 hypothetical protein; Provisional
Probab=35.41  E-value=76  Score=23.33  Aligned_cols=23  Identities=17%  Similarity=0.075  Sum_probs=10.0

Q ss_pred             chHHHHHHHHHHHHHHHHHHHHHH
Q psy592          195 GSALFVTLLISPNQVLVLVFVVYS  218 (339)
Q Consensus       195 g~~il~~~~~l~~l~~~l~~~~~~  218 (339)
                      |++|++.++.| ++.+++.|++-+
T Consensus         5 ~l~I~l~i~~l-i~G~~~Gffiar   27 (72)
T PRK00523          5 GLALGLGIPLL-IVGGIIGYFVSK   27 (72)
T ss_pred             HHHHHHHHHHH-HHHHHHHHHHHH
Confidence            34555444443 133444454433


No 89 
>PF06024 DUF912:  Nucleopolyhedrovirus protein of unknown function (DUF912);  InterPro: IPR009261 This entry is represented by Autographa californica nuclear polyhedrosis virus (AcMNPV), Orf78; it is a family of uncharacterised viral proteins.
Probab=35.37  E-value=6.9  Score=30.73  Aligned_cols=8  Identities=38%  Similarity=0.829  Sum_probs=3.1

Q ss_pred             HHHHHHHh
Q psy592          214 FVVYSRRR  221 (339)
Q Consensus       214 ~~~~~~~~  221 (339)
                      |++.|.|+
T Consensus        84 FVILRer~   91 (101)
T PF06024_consen   84 FVILRERQ   91 (101)
T ss_pred             EEEEeccc
Confidence            34444333


No 90 
>TIGR00847 ccoS cytochrome oxidase maturation protein, cbb3-type. CcoS from Rhodobacter capsulatus has been shown essential for incorporation of redox-active prosthetic groups (heme, Cu) into cytochrome cbb(3) oxidase. FixS of Bradyrhizobium japonicum appears to have the same function. Members of this family are found so far in organisms with a cbb3-type cytochrome oxidase, including Neisseria meningitidis, Helicobacter pylori, Campylobacter jejuni, Caulobacter crescentus, Bradyrhizobium japonicum, and Rhodobacter capsulatus.
Probab=35.12  E-value=1.1e+02  Score=20.85  Aligned_cols=11  Identities=27%  Similarity=0.313  Sum_probs=5.2

Q ss_pred             HHHHHHHHHHh
Q psy592          211 VLVFVVYSRRR  221 (339)
Q Consensus       211 ~l~~~~~~~~~  221 (339)
                      .+.+++|.-|+
T Consensus        18 ~l~~f~Wavk~   28 (51)
T TIGR00847        18 GLVAFLWSLKS   28 (51)
T ss_pred             HHHHHHHHHcc
Confidence            33445555444


No 91 
>PHA02872 EFc gene family protein; Provisional
Probab=34.76  E-value=23  Score=28.10  Aligned_cols=30  Identities=27%  Similarity=0.283  Sum_probs=26.0

Q ss_pred             CCCcceeeeecCCCCCcccccccccCCCCC
Q psy592          308 PFDDLRNYAYEGGGSTAGSLSSLASGSSIK  337 (339)
Q Consensus       308 p~Dsl~~y~~EG~gS~agSlssl~s~~~~~  337 (339)
                      .-|++.+|-+||.|-..=|.|-.+|.+.++
T Consensus        58 ~gdtvkIYflEGkG~LIfSv~dv~sp~~ee   87 (124)
T PHA02872         58 AGDTVKIYFLEGKGGLIFSVSDVGSPDNEE   87 (124)
T ss_pred             CCCeEEEEEEecCCcEEEEEEecCCCCccc
Confidence            347899999999999999999999987754


No 92 
>PF02699 YajC:  Preprotein translocase subunit;  InterPro: IPR003849 Secretion across the inner membrane in some Gram-negative bacteria occurs via the preprotein translocase pathway. Proteins are produced in the cytoplasm as precursors, and require a chaperone subunit to direct them to the translocase component []. From there, the mature proteins are either targeted to the outer membrane, or remain as periplasmic proteins []. The translocase protein subunits are encoded on the bacterial chromosome.  The translocase itself comprises 7 proteins, including a chaperone (SecB), ATPase (SecA), an integral membrane complex (SecY, SecE and SecG), and two additional membrane proteins that promote the release of the mature peptide into the periplasm (SecD and SecF) []. Other cytoplasmic/periplasmic proteins play a part in preprotein translocase activity, namely YidC and YajC []. The latter is bound in a complex to SecD and SecF, and plays a part in stabilising and regulating secretion through the SecYEG integral membrane component via SecA [].  Homologues of the YajC gene have been found in a range of pathogenic and commensal microbes. Brucella abortis YajC- and SecD-like proteins were shown to stimulate a Th1 cell-mediated immune response in mice, and conferred protection when challenged with B.abortis []. Therefore, these proteins may have an antigenic role as well as a secretory one in virulent bacteria []. A number of previously uncharacterised "hypothetical" proteins also show similarity to E.coli YajC, suggesting that this family is wider than first thought [].  More recently, the precise interactions between the E.coli SecYEG complex, SecD, SecF, YajC and YidC have been studied []. Rather than acting individually, the four proteins form a heterotetrameric complex and associate with the SecYEG heterotrimeric complex []. The SecF and YajC subunits link the complex to the integral membrane translocase. ; PDB: 2RDD_B.
Probab=34.69  E-value=33  Score=25.74  Aligned_cols=10  Identities=10%  Similarity=0.471  Sum_probs=4.1

Q ss_pred             HHHHHHHHHH
Q psy592          211 VLVFVVYSRR  220 (339)
Q Consensus       211 ~l~~~~~~~~  220 (339)
                      ++.|+.++.+
T Consensus        14 i~yf~~~rpq   23 (82)
T PF02699_consen   14 IFYFLMIRPQ   23 (82)
T ss_dssp             HHHHHTHHHH
T ss_pred             HHhhheecHH
Confidence            3344444433


No 93 
>PHA03164 hypothetical protein; Provisional
Probab=34.62  E-value=38  Score=25.05  Aligned_cols=23  Identities=17%  Similarity=0.313  Sum_probs=10.8

Q ss_pred             hHHHHHHHHHHHHHHHHHHHHHH
Q psy592          196 SALFVTLLISPNQVLVLVFVVYS  218 (339)
Q Consensus       196 ~~il~~~~~l~~l~~~l~~~~~~  218 (339)
                      ..||.|+++...|.+++++.++.
T Consensus        60 FlvLtgLaIamILfiifvlyvFn   82 (88)
T PHA03164         60 FLVLTGLAIAMILFIIFVLYVFN   82 (88)
T ss_pred             hHHHHHHHHHHHHHHHHHHHhee
Confidence            56666665542233344444443


No 94 
>PF06365 CD34_antigen:  CD34/Podocalyxin family;  InterPro: IPR013836 This family consists of several mammalian CD34 antigen proteins. The CD34 antigen is a human leukocyte membrane protein expressed specifically by lymphohematopoietic progenitor cells. CD34 is a phosphoprotein. Activation of protein kinase C (PKC) has been found to enhance CD34 phosphorylation [, ]. This family contains several eukaryotic podocalyxin proteins. Podocalyxin is a major membrane protein of the glomerular epithelium and is thought to be involved in maintenance of the architecture of the foot processes and filtration slits characteristic of this unique epithelium by virtue of its high negative charge. Podocalyxin functions as an anti-adhesin that maintains an open filtration pathway between neighbouring foot processes in the glomerular epithelium by charge repulsion [].
Probab=34.22  E-value=39  Score=30.02  Aligned_cols=20  Identities=15%  Similarity=0.235  Sum_probs=10.7

Q ss_pred             ccccCCCCCccCCCcccccccC
Q psy592          239 INYDDEGGGEDDMTAFDITPLQ  260 (339)
Q Consensus       239 ~~Y~eEggGE~d~~~yd~~~L~  260 (339)
                      .+|.|+  |-+|...++....+
T Consensus       141 ~~~vEn--g~h~n~~l~v~~~~  160 (202)
T PF06365_consen  141 LYTVEN--GYHDNPTLSVAESQ  160 (202)
T ss_pred             ceeccc--CccCCcccccCCCC
Confidence            445443  34555678877333


No 95 
>PF02468 PsbN:  Photosystem II reaction centre N protein (psbN);  InterPro: IPR003398 Oxygenic photosynthesis uses two multi-subunit photosystems (I and II) located in the cell membranes of cyanobacteria and in the thylakoid membranes of chloroplasts in plants and algae. Photosystem II (PSII) has a P680 reaction centre containing chlorophyll 'a' that uses light energy to carry out the oxidation (splitting) of water molecules, and to produce ATP via a proton pump. Photosystem I (PSI) has a P700 reaction centre containing chlorophyll that takes the electron and associated hydrogen donated from PSII to reduce NADP+ to NADPH. Both ATP and NADPH are subsequently used in the light-independent reactions to convert carbon dioxide to glucose using the hydrogen atom extracted from water by PSII, releasing oxygen as a by-product. PSII is a multisubunit protein-pigment complex containing polypeptides both intrinsic and extrinsic to the photosynthetic membrane [, ]. Within the core of the complex, the chlorophyll and beta-carotene pigments are mainly bound to the antenna proteins CP43 (PsbC) and CP47 (PsbB), which pass the excitation energy on to the reaction centre proteins D1 (Qb, PsbA) and D2 (Qa, PsbD) that bind all the redox-active cofactors involved in the energy conversion process. The PSII oxygen-evolving complex (OEC) oxidises water to provide protons for use by PSI, and consists of OEE1 (PsbO), OEE2 (PsbP) and OEE3 (PsbQ). The remaining subunits in PSII are of low molecular weight (less than 10 kDa), and are involved in PSII assembly, stabilisation, dimerisation, and photo-protection [].   This family represents the low molecular weight transmembrane protein PsbN found in PSII. PsbN may have a role in PSII stability, however its actual function unknown. PsbN does not appear to be essential for photoautotrophic growth or normal PSII function.; GO: 0015979 photosynthesis, 0009523 photosystem II, 0009539 photosystem II reaction center, 0016020 membrane
Probab=33.77  E-value=42  Score=22.09  Aligned_cols=15  Identities=13%  Similarity=0.235  Sum_probs=8.7

Q ss_pred             HHHHHHHHHHHHHhh
Q psy592          208 QVLVLVFVVYSRRRE  222 (339)
Q Consensus       208 l~~~l~~~~~~~~~~  222 (339)
                      |+-+..+.+|.....
T Consensus        15 lv~~Tgy~iYtaFGp   29 (43)
T PF02468_consen   15 LVSITGYAIYTAFGP   29 (43)
T ss_pred             HHHHHhhhhhheeCC
Confidence            444555667766553


No 96 
>PF03597 CcoS:  Cytochrome oxidase maturation protein cbb3-type;  InterPro: IPR004714 Cytochrome cbb3 oxidases are found almost exclusively in Proteobacteria, and represent a distinctive class of proton-pumping respiratory haem-copper oxidases (HCO) that lack many of the key structural features that contribute to the reaction cycle of the intensely studied mitochondrial cytochrome c oxidase (CcO). Expression of cytochrome cbb3 oxidase allows human pathogens to colonise anoxic tissues and agronomically important diazotrophs to sustain nitrogen fixation []. Genes encoding a cytochrome cbb3 oxidase were initially designated fixNOQP (ccoNOQP), the ccoNOQP operon is always found close to a second gene cluster, known as fixGHIS (ccoGHIS) whose expression is necessary for the assembly of a functional cbb3 oxidase. On the basis of their derived amino acid sequences each of the four proteins encoded by the ccoGHIS operon are thought to be membrane-bound. It has been suggested that they may function in concert as a multi-subunit complex, possibly playing a role in the uptake and metabolism of copper required for the assembly of the binuclear centre of cytochrome cbb3 oxidase. 
Probab=33.60  E-value=1.3e+02  Score=19.91  Aligned_cols=12  Identities=8%  Similarity=0.343  Sum_probs=5.8

Q ss_pred             HHHHHHHHHHHh
Q psy592          210 LVLVFVVYSRRR  221 (339)
Q Consensus       210 ~~l~~~~~~~~~  221 (339)
                      +.+.+++|.-|+
T Consensus        16 ~~l~~f~Wavk~   27 (45)
T PF03597_consen   16 IALAAFLWAVKS   27 (45)
T ss_pred             HHHHHHHHHHcc
Confidence            344445555554


No 97 
>PF14316 DUF4381:  Domain of unknown function (DUF4381)
Probab=33.49  E-value=56  Score=27.17  Aligned_cols=9  Identities=11%  Similarity=0.139  Sum_probs=3.9

Q ss_pred             HHHHHHHHH
Q psy592          289 VGIFIEEHK  297 (339)
Q Consensus       289 ~~~~i~~~~  297 (339)
                      .-+||.+..
T Consensus        98 W~~fL~~~~  106 (146)
T PF14316_consen   98 WLQFLDQQC  106 (146)
T ss_pred             HHHHHHHhC
Confidence            444444443


No 98 
>TIGR01294 P_lamban phospholamban. This model represents the short (52 residue) transmembrane phosphoprotein phospholamban. Phospholamban, in its unphosphorylated form, inhibits SERCA2, the cardiac sarcoplasmic reticulum Ca-ATPase.
Probab=33.00  E-value=39  Score=22.31  Aligned_cols=9  Identities=22%  Similarity=0.563  Sum_probs=4.7

Q ss_pred             HHHHHHHHH
Q psy592          197 ALFVTLLIS  205 (339)
Q Consensus       197 ~il~~~~~l  205 (339)
                      .||+|++++
T Consensus        37 lilicllli   45 (52)
T TIGR01294        37 LILICLLLI   45 (52)
T ss_pred             HHHHHHHHH
Confidence            455565444


No 99 
>PF01414 DSL:  Delta serrate ligand;  InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=32.73  E-value=14  Score=26.30  Aligned_cols=12  Identities=58%  Similarity=1.655  Sum_probs=9.2

Q ss_pred             eeCCCCCCCCCC
Q psy592           60 CQCLPGWSGTGC   71 (339)
Q Consensus        60 C~C~~g~~G~~C   71 (339)
                      =.|.+||+|+.|
T Consensus        52 ~~C~~Gw~G~~C   63 (63)
T PF01414_consen   52 KVCLPGWTGPNC   63 (63)
T ss_dssp             EEE-TTEESTTS
T ss_pred             CCCCCCCcCCCC
Confidence            368899999988


No 100
>TIGR00739 yajC preprotein translocase, YajC subunit. While this protein is part of the preprotein translocase in Escherichia coli, it is not essential for viability or protein secretion. The N-terminus region contains a predicted membrane-spanning region followed by a region consisting almost entirely of residues with charged (acidic, basic, or zwitterionic) side chains. This small protein is about 100 residues in length, and is restricted to bacteria; however, this protein is absent from some lineages, including spirochetes and Mycoplasmas.
Probab=32.30  E-value=25  Score=26.62  Aligned_cols=8  Identities=13%  Similarity=0.625  Sum_probs=3.4

Q ss_pred             HHHHHHHH
Q psy592          211 VLVFVVYS  218 (339)
Q Consensus       211 ~l~~~~~~  218 (339)
                      ++.|+.++
T Consensus        15 i~yf~~~r   22 (84)
T TIGR00739        15 IFYFLIIR   22 (84)
T ss_pred             HHHHheec
Confidence            33444443


No 101
>PF10717 ODV-E18:  Occlusion-derived virus envelope protein ODV-E18;  InterPro: IPR019655  Baculovirus occlusion-derived virus (ODV) derives its envelope from an intranuclear membrane source. Occlusion-derived viral envelope proteins that are detected in viral-induced intranuclear microvesicles, but not detected in the plasma membrane, cytoplasmic membranes, or the nuclear envelope. This entry represents ODV-E18 protein which is encoded by baculovirus late genes with transcription initiating from a TAAG motif. ODV-E18 exists as a dimer in the ODV envelope, which contains a hydrophobic domain that putatively acts as a target or retention signal for intranuclear microvesicles []. ; GO: 0019031 viral envelope
Probab=31.39  E-value=52  Score=24.83  Aligned_cols=10  Identities=20%  Similarity=0.504  Sum_probs=4.7

Q ss_pred             hHHHHHHHHH
Q psy592          196 SALFVTLLIS  205 (339)
Q Consensus       196 ~~il~~~~~l  205 (339)
                      ..||+.++|+
T Consensus        28 MtILivLVII   37 (85)
T PF10717_consen   28 MTILIVLVII   37 (85)
T ss_pred             HHHHHHHHHH
Confidence            3455544444


No 102
>PF10577 UPF0560:  Uncharacterised protein family UPF0560;  InterPro: IPR018890  This family of proteins has no known function. 
Probab=31.37  E-value=49  Score=35.30  Aligned_cols=10  Identities=10%  Similarity=-0.024  Sum_probs=6.4

Q ss_pred             hHHHHHHHHH
Q psy592          196 SALFVTLLIS  205 (339)
Q Consensus       196 ~~il~~~~~l  205 (339)
                      ++||..+++|
T Consensus       276 l~ILG~~~li  285 (807)
T PF10577_consen  276 LAILGGTALI  285 (807)
T ss_pred             HHHHHHHHHH
Confidence            5677766555


No 103
>TIGR02595 PEP_exosort PEP-CTERM putative exosortase interaction domain. This model describes a 25-residue domain that includes a near-invariant Pro-Glu-Pro (PEP) motif, a thirteen residue strongly hydrophobic sequence likely to span the membrane, and a five-residue strongly basic motif that often contains four Arg residues. In nearly every case, this motif is found within nine residues, and usually within five residues, of the extreme C-terminus of the protein. Proteins with this motif typically have signal sequences at the N-terminus. This region appears many times per genome or not at all, and co-occurs in genomes with a proposed protein-sorting integral membrane protein we designate exosortase (see TIGR02602). PEP-CTERM proteins frequently are poorly conserved, Ser/Thr-rich proteins and may become extensively modified proteinaceous constituents of extracellular material in bacterial biofilms.
Probab=30.43  E-value=56  Score=18.85  Aligned_cols=9  Identities=44%  Similarity=0.785  Sum_probs=4.1

Q ss_pred             HHHHHHHHh
Q psy592          213 VFVVYSRRR  221 (339)
Q Consensus       213 ~~~~~~~~~  221 (339)
                      .++..+|||
T Consensus        15 ~~~~~rrrk   23 (26)
T TIGR02595        15 GFLLLRRRR   23 (26)
T ss_pred             HHHHHhhcc
Confidence            444444444


No 104
>PF11884 DUF3404:  Domain of unknown function (DUF3404);  InterPro: IPR021821  This domain is functionally uncharacterised. This domain is found in bacteria. This presumed domain is about 260 amino acids in length. This domain is found associated with PF02518 from PFAM, PF00512 from PFAM. 
Probab=28.99  E-value=83  Score=29.13  Aligned_cols=16  Identities=25%  Similarity=0.434  Sum_probs=8.5

Q ss_pred             HHHHHHHHHHHHHhhh
Q psy592          208 QVLVLVFVVYSRRREA  223 (339)
Q Consensus       208 l~~~l~~~~~~~~~~~  223 (339)
                      +++++.-.+|+++++.
T Consensus       243 i~l~~gw~~y~~~~kr  258 (262)
T PF11884_consen  243 ILLVLGWSLYRWNQKR  258 (262)
T ss_pred             HHHHHHHHHHHHHHHH
Confidence            4445555666665543


No 105
>PRK05585 yajC preprotein translocase subunit YajC; Validated
Probab=28.77  E-value=38  Score=26.86  Aligned_cols=7  Identities=14%  Similarity=0.567  Sum_probs=2.9

Q ss_pred             HHHHHHH
Q psy592          212 LVFVVYS  218 (339)
Q Consensus       212 l~~~~~~  218 (339)
                      +.|+.+|
T Consensus        31 ~yf~~~R   37 (106)
T PRK05585         31 FYFLIIR   37 (106)
T ss_pred             HHHHhcc
Confidence            3344444


No 106
>PF12606 RELT:  Tumour necrosis factor receptor superfamily member 19;  InterPro: IPR022248 The members of tumor necrosis factor receptor (TNFR) superfamily have been designated as the "guardians of the immune system" due to their roles in immune cell proliferation, differentiation, activation, and death (apoptosis).  RELT (receptor expressed in lymphoid tissues) is a member of the TNFR superfamily. The messenger RNA of RELT is especially abundant in hematologic tissues such as spleen, lymph node, and peripheral blood leukocytes as well as in leukemias and lymphomas. RELT is able to activate the NF-kappaB pathway and selectively binds tumor necrosis factor receptor-associated factor 1 []. RELT like proteins 1 and 2 (RELL1 and RELL2) are two RELT homologues that bind to RELT. The expression of RELL1 at the mRNA level is ubiquitous, whereas expression of RELL2 mRNA is more restricted to particular tissues [].
Probab=28.13  E-value=1.2e+02  Score=20.67  Aligned_cols=7  Identities=29%  Similarity=0.653  Sum_probs=2.7

Q ss_pred             HHHHHHH
Q psy592          199 FVTLLIS  205 (339)
Q Consensus       199 l~~~~~l  205 (339)
                      ++.++++
T Consensus         6 iV~i~iv   12 (50)
T PF12606_consen    6 IVSIFIV   12 (50)
T ss_pred             HHHHHHH
Confidence            3344333


No 107
>PHA03286 envelope glycoprotein E; Provisional
Probab=27.85  E-value=91  Score=31.12  Aligned_cols=9  Identities=44%  Similarity=0.678  Sum_probs=4.2

Q ss_pred             HHHHHHHhh
Q psy592          214 FVVYSRRRE  222 (339)
Q Consensus       214 ~~~~~~~~~  222 (339)
                      ..+++|||+
T Consensus       412 ~~~~~r~~~  420 (492)
T PHA03286        412 AGLYRRRRR  420 (492)
T ss_pred             HhHhhhhhh
Confidence            344555444


No 108
>PRK03427 cell division protein ZipA; Provisional
Probab=27.57  E-value=64  Score=30.92  Aligned_cols=8  Identities=75%  Similarity=0.983  Sum_probs=5.7

Q ss_pred             cCCCCCcc
Q psy592          242 DDEGGGED  249 (339)
Q Consensus       242 ~eEggGE~  249 (339)
                      |+||-||-
T Consensus        59 d~dGvGev   66 (333)
T PRK03427         59 DDEGVGEV   66 (333)
T ss_pred             cccCccce
Confidence            57777775


No 109
>PF05961 Chordopox_A13L:  Chordopoxvirus A13L protein;  InterPro: IPR009236 This family consists of A13L proteins from the Chordopoxviruses. A13L or p8 is one of the three most abundant membrane proteins of the intracellular mature Vaccinia virus [].
Probab=26.09  E-value=87  Score=22.64  Aligned_cols=14  Identities=14%  Similarity=0.624  Sum_probs=8.7

Q ss_pred             HHHHHHHHHHHhhh
Q psy592          210 LVLVFVVYSRRREA  223 (339)
Q Consensus       210 ~~l~~~~~~~~~~~  223 (339)
                      .++++.+|+|.+..
T Consensus        16 ~lIlY~iYnr~~~~   29 (68)
T PF05961_consen   16 GLILYGIYNRKKTT   29 (68)
T ss_pred             HHHHHHHHhccccc
Confidence            34557778776643


No 110
>PF06667 PspB:  Phage shock protein B;  InterPro: IPR009554 This family consists of several bacterial phage shock protein B (PspB) sequences. The phage shock protein (psp) operon is induced in response to heat, ethanol, osmotic shock and infection by filamentous bacteriophages []. Expression of the operon requires the alternative sigma factor sigma54 and the transcriptional activator PspF. In addition, PspA plays a negative regulatory role, and the integral-membrane proteins PspB and PspC play a positive one [].; GO: 0006355 regulation of transcription, DNA-dependent, 0009271 phage shock
Probab=26.04  E-value=1.1e+02  Score=22.67  Aligned_cols=8  Identities=13%  Similarity=0.389  Sum_probs=3.4

Q ss_pred             HHHHHHHh
Q psy592          214 FVVYSRRR  221 (339)
Q Consensus       214 ~~~~~~~~  221 (339)
                      ++.|+.++
T Consensus        23 ~lHY~sk~   30 (75)
T PF06667_consen   23 ILHYRSKW   30 (75)
T ss_pred             HHHHHHhc
Confidence            34444443


No 111
>KOG3054|consensus
Probab=26.02  E-value=75  Score=29.07  Aligned_cols=15  Identities=33%  Similarity=0.685  Sum_probs=7.4

Q ss_pred             HHHHHHHHHHHHHhh
Q psy592          208 QVLVLVFVVYSRRRE  222 (339)
Q Consensus       208 l~~~l~~~~~~~~~~  222 (339)
                      |+.++++++|.+++.
T Consensus        14 lV~~i~l~l~~r~ra   28 (299)
T KOG3054|consen   14 LVAVILLFLWKRRRA   28 (299)
T ss_pred             HHHHHHHHHHHhhcc
Confidence            444445555555543


No 112
>KOG0994|consensus
Probab=25.93  E-value=73  Score=35.56  Aligned_cols=26  Identities=27%  Similarity=0.669  Sum_probs=20.3

Q ss_pred             eeecCC--CCceeeCCCCCCCCCCCCCC
Q psy592           50 VCVGSM--HTAKCQCLPGWSGTGCVTPT   75 (339)
Q Consensus        50 ~Cv~~~--~~~~C~C~~g~~G~~Ce~~~   75 (339)
                      .|..+.  ....|+|.+||+|.+|+.-.
T Consensus       924 sC~~d~~t~~ivC~C~~GY~G~RCe~CA  951 (1758)
T KOG0994|consen  924 SCYLDTRTQQIVCHCQEGYSGSRCEICA  951 (1758)
T ss_pred             cccccccccceeeecccCccccchhhhc
Confidence            366553  35689999999999999874


No 113
>COG4367 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=25.66  E-value=57  Score=24.95  Aligned_cols=24  Identities=21%  Similarity=0.379  Sum_probs=20.8

Q ss_pred             CcHHHHHHHHHHHhcCCCCCCCCCcc
Q psy592          287 PNVGIFIEEHKKRADADPNAPPFDDL  312 (339)
Q Consensus       287 ~~~~~~i~~~~~~ad~d~~~~p~Dsl  312 (339)
                      -++++||.+++++..+-|  .||-.|
T Consensus        54 W~lRd~l~~~i~e~GqtP--~PfT~L   77 (97)
T COG4367          54 WRLRDFLVQAIRESGQTP--VPFTAL   77 (97)
T ss_pred             HHHHHHHHHHHHhcCCCC--CCeeee
Confidence            488999999999999999  899554


No 114
>COG3763 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=25.37  E-value=1.7e+02  Score=21.39  Aligned_cols=21  Identities=19%  Similarity=0.162  Sum_probs=8.6

Q ss_pred             hHHHHHHHHHHHHHHHHHHHHH
Q psy592          196 SALFVTLLISPNQVLVLVFVVY  217 (339)
Q Consensus       196 ~~il~~~~~l~~l~~~l~~~~~  217 (339)
                      +|||+.++.| |..+++.|++-
T Consensus         5 lail~ivl~l-l~G~~~G~fia   25 (71)
T COG3763           5 LAILLIVLAL-LAGLIGGFFIA   25 (71)
T ss_pred             HHHHHHHHHH-HHHHHHHHHHH
Confidence            4554444333 12344444443


No 115
>PF04689 S1FA:  DNA binding protein S1FA;  InterPro: IPR006779  S1FA is an unusual small plant peptide of only 70 amino acids with a basic domain which contains a nuclear localization signal and a putative DNA binding helix. S1FA is highly conserved between dicotyledonous and monocotyledonous plants and may be a DNA-binding protein that specifically recognises the negative promoter element S1F [].; GO: 0003677 DNA binding, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=25.15  E-value=63  Score=23.11  Aligned_cols=31  Identities=16%  Similarity=0.190  Sum_probs=14.9

Q ss_pred             cccchHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy592          192 VKTGSALFVTLLISPNQVLVLVFVVYSRRRE  222 (339)
Q Consensus       192 ~~~g~~il~~~~~l~~l~~~l~~~~~~~~~~  222 (339)
                      +..|+++|+-+.-++|++++-.+++|.-+++
T Consensus        10 lnPGlIVLlvV~g~ll~flvGnyvlY~Yaqk   40 (69)
T PF04689_consen   10 LNPGLIVLLVVAGLLLVFLVGNYVLYVYAQK   40 (69)
T ss_pred             CCCCeEEeehHHHHHHHHHHHHHHHHHHHhh
Confidence            3445444444433333555555666555443


No 116
>PF10078 DUF2316:  Uncharacterized protein conserved in bacteria (DUF2316);  InterPro: IPR018757  Members of this family of hypothetical bacterial proteins have no known function. 
Probab=24.97  E-value=70  Score=24.56  Aligned_cols=26  Identities=19%  Similarity=0.319  Sum_probs=22.1

Q ss_pred             CCCcHHHHHHHHHHHhcCCCCCCCCCcc
Q psy592          285 VEPNVGIFIEEHKKRADADPNAPPFDDL  312 (339)
Q Consensus       285 ~~~~~~~~i~~~~~~ad~d~~~~p~Dsl  312 (339)
                      ..-++++||.+++++....|  .||-.|
T Consensus        53 ~vW~lRdyL~~~i~~~G~eP--~PfS~L   78 (89)
T PF10078_consen   53 DVWILRDYLNDKIKEQGKEP--VPFSAL   78 (89)
T ss_pred             cchHHHHHHHHHHHHcCCCC--CCeeee
Confidence            34589999999999999999  898655


No 117
>PF00558 Vpu:  Vpu protein;  InterPro: IPR008187 The Human immunodeficiency virus 1 (HIV-1) Vpu protein acts in the degradation of CD4 in the endoplasmic reticulum and in the enhancement of virion release from the plasma membrane of infected cells [].; GO: 0019076 release of virus from host; PDB: 2JPX_A 1PI8_A 2GOH_A 2GOF_A 1PI7_A 1PJE_A 1VPU_A 2K7Y_A.
Probab=24.87  E-value=42  Score=25.27  Aligned_cols=18  Identities=17%  Similarity=0.409  Sum_probs=4.3

Q ss_pred             HHHHHHHHHHHHHHHhhh
Q psy592          206 PNQVLVLVFVVYSRRREA  223 (339)
Q Consensus       206 ~~l~~~l~~~~~~~~~~~  223 (339)
                      +..+-.+++.-|++.+++
T Consensus        19 aIvvW~iv~ieYrk~~rq   36 (81)
T PF00558_consen   19 AIVVWTIVYIEYRKIKRQ   36 (81)
T ss_dssp             HHHHHHHH----------
T ss_pred             HHHHHHHHHHHHHHHHHH
Confidence            335555666666665543


No 118
>PRK13183 psbN photosystem II reaction center protein N; Provisional
Probab=24.80  E-value=61  Score=21.61  Aligned_cols=15  Identities=13%  Similarity=0.157  Sum_probs=7.8

Q ss_pred             HHHHHHHHHHHHHhh
Q psy592          208 QVLVLVFVVYSRRRE  222 (339)
Q Consensus       208 l~~~l~~~~~~~~~~  222 (339)
                      |+-+-.+.+|.....
T Consensus        18 L~~~TgyaiYtaFGp   32 (46)
T PRK13183         18 LLALTGFGIYTAFGP   32 (46)
T ss_pred             HHHHhhheeeeccCC
Confidence            444455556655543


No 119
>KOG1094|consensus
Probab=24.74  E-value=99  Score=32.27  Aligned_cols=8  Identities=0%  Similarity=0.638  Sum_probs=3.3

Q ss_pred             HHHHHHHH
Q psy592          211 VLVFVVYS  218 (339)
Q Consensus       211 ~l~~~~~~  218 (339)
                      ++++++||
T Consensus       407 ii~~~L~R  414 (807)
T KOG1094|consen  407 IIALMLWR  414 (807)
T ss_pred             HHHHHHHH
Confidence            33334444


No 120
>COG4736 CcoQ Cbb3-type cytochrome oxidase, subunit 3 [Posttranslational modification, protein turnover, chaperones]
Probab=24.14  E-value=89  Score=22.13  Aligned_cols=15  Identities=7%  Similarity=0.450  Sum_probs=7.5

Q ss_pred             HHHHHHHHHHHHHhh
Q psy592          208 QVLVLVFVVYSRRRE  222 (339)
Q Consensus       208 l~~~l~~~~~~~~~~  222 (339)
                      +++..++.+++++++
T Consensus        21 ~fiavi~~ayr~~~K   35 (60)
T COG4736          21 FFIAVIYFAYRPGKK   35 (60)
T ss_pred             HHHHHHHHHhcccch
Confidence            334444555655554


No 121
>PRK14584 hmsS hemin storage system protein; Provisional
Probab=23.40  E-value=2e+02  Score=24.46  Aligned_cols=25  Identities=20%  Similarity=0.449  Sum_probs=12.5

Q ss_pred             hHHHHHHHHHHHHH-HHHHHHHHHHHhh
Q psy592          196 SALFVTLLISPNQV-LVLVFVVYSRRRE  222 (339)
Q Consensus       196 ~~il~~~~~l~~l~-~~l~~~~~~~~~~  222 (339)
                      +++.++|+++  .. ++++...|.++|-
T Consensus        63 l~~yl~ial~--nAvlLI~WA~YN~~RF   88 (153)
T PRK14584         63 IALYLAIAAF--NAVLLIIWAKYNQVRF   88 (153)
T ss_pred             HHHHHHHHHH--HHHHHHHHHHHHHHHh
Confidence            5555555554  33 3333455666554


No 122
>PF02009 Rifin_STEVOR:  Rifin/stevor family;  InterPro: IPR002858 Malaria is still a major cause of mortality in many areas of the world. Plasmodium falciparum causes the most severe human form of the disease and is responsible for most fatalities. Severe cases of malaria can occur when the parasite invades and then proliferates within red blood cell erythrocytes. The parasite produces many variant antigenic proteins, encoded by multigene families, which are present on the surface of the infected erythrocyte and play important roles in virulence. A crucial survival mechanism for the malaria parasite is its ability to evade the immune response by switching these variant surface antigens. The high virulence of P. falciparum relative to other malarial parasites is in large part due to the fact that in this organism many of these surface antigens mediate the binding of infected erythrocytes to the vascular endothelium (cytoadherence) and non-infected erythrocytes (rosetting). This can lead to the accumulation of infected cells in the vasculature of a variety of organs, blocking the blood flow and reducing the oxygen supply. Clinical symptoms of severe infection can include fever, progressive anaemia, multi-organ dysfunction and coma. For more information see []. Several multicopy gene families have been described in Plasmodium falciparum, including the stevor family of subtelomeric open reading frames and the rif interspersed repetitive elements. Both families contain three predicted transmembrane segments. It has been proposed that stevor and rif are members of a larger superfamily that code for variant surface antigens [].
Probab=23.36  E-value=96  Score=29.36  Aligned_cols=25  Identities=16%  Similarity=0.397  Sum_probs=16.6

Q ss_pred             hHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy592          196 SALFVTLLISPNQVLVLVFVVYSRRRE  222 (339)
Q Consensus       196 ~~il~~~~~l~~l~~~l~~~~~~~~~~  222 (339)
                      +||++.+|++  +++-|++--+|+++-
T Consensus       263 iaIliIVLIM--vIIYLILRYRRKKKm  287 (299)
T PF02009_consen  263 IAILIIVLIM--VIIYLILRYRRKKKM  287 (299)
T ss_pred             HHHHHHHHHH--HHHHHHHHHHHHhhh
Confidence            5667666666  666666767776554


No 123
>PHA02681 ORF089 virion membrane protein; Provisional
Probab=23.11  E-value=1.8e+02  Score=22.02  Aligned_cols=14  Identities=29%  Similarity=0.458  Sum_probs=8.8

Q ss_pred             HHHHHHHHHHHHHh
Q psy592          208 QVLVLVFVVYSRRR  221 (339)
Q Consensus       208 l~~~l~~~~~~~~~  221 (339)
                      ++.+|++.+|+|-+
T Consensus        15 IVclliya~YRR~~   28 (92)
T PHA02681         15 IVCYIVIMMYRRSC   28 (92)
T ss_pred             HHHHHHHHHHHhcc
Confidence            45556677777654


No 124
>PF07213 DAP10:  DAP10 membrane protein;  InterPro: IPR009861 This family consists of several mammalian DAP10 membrane proteins. In activated mouse natural killer (NK) cells, the NKG2D receptor associates with two intracellular adaptors, DAP10 and DAP12, which trigger phosphatidyl inositol 3 kinase (PI3K) and Syk family protein tyrosine kinases, respectively. It has been suggested that the DAP10-PI3K pathway is sufficient to initiate NKG2D-mediated killing of target cells [].
Probab=22.86  E-value=1.3e+02  Score=22.47  Aligned_cols=13  Identities=38%  Similarity=0.575  Sum_probs=5.4

Q ss_pred             HHHHHHHHHHHHh
Q psy592          209 VLVLVFVVYSRRR  221 (339)
Q Consensus       209 ~~~l~~~~~~~~~  221 (339)
                      ++++..+.+.|.+
T Consensus        49 LIv~~vy~car~r   61 (79)
T PF07213_consen   49 LIVLVVYYCARPR   61 (79)
T ss_pred             HHHHHHHhhcccc
Confidence            3344444444433


No 125
>PF03908 Sec20:  Sec20;  InterPro: IPR005606 Sec20 is a membrane glycoprotein associated with secretory pathway.
Probab=22.83  E-value=95  Score=23.60  Aligned_cols=12  Identities=25%  Similarity=0.800  Sum_probs=7.1

Q ss_pred             HHHHHHHHHHHH
Q psy592          208 QVLVLVFVVYSR  219 (339)
Q Consensus       208 l~~~l~~~~~~~  219 (339)
                      +++++++++|+|
T Consensus        80 f~~~v~yI~~rR   91 (92)
T PF03908_consen   80 FLLVVLYILWRR   91 (92)
T ss_pred             HHHHHHHHhhhc
Confidence            345666666654


No 126
>PF06716 DUF1201:  Protein of unknown function (DUF1201);  InterPro: IPR009591 This entry consists of several Beet yellows virus (BYV) putative membrane-binding proteins of around 54 residues in length. The function of this currently unknown.
Probab=22.81  E-value=1.6e+02  Score=19.59  Aligned_cols=23  Identities=17%  Similarity=0.496  Sum_probs=11.9

Q ss_pred             cchHHHHHHHHHHHHHHHHHHHHHH
Q psy592          194 TGSALFVTLLISPNQVLVLVFVVYS  218 (339)
Q Consensus       194 ~g~~il~~~~~l~~l~~~l~~~~~~  218 (339)
                      .|.+.++|+.+.  .+.+++.++++
T Consensus        11 ~~F~~lIC~Fl~--~~~~F~~F~~K   33 (54)
T PF06716_consen   11 LAFGFLICLFLF--CLVVFIWFVYK   33 (54)
T ss_pred             HHHHHHHHHHHH--HHHHHHHHHHH
Confidence            346677776665  33333334443


No 127
>TIGR03521 GldG gliding-associated putative ABC transporter substrate-binding component GldG. Members of this protein family are exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). GldG is a protein linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae and Cytophaga hutchinsonii. Knockouts of GldG abolish the gliding phenotype. GldG, along with GldA and GldF are believed to compose an ABC transporter and are observed as an operon. Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. Bacteroidetes with members of this protein family appear to have all of the genes associated with gliding motility.
Probab=22.72  E-value=92  Score=32.06  Aligned_cols=14  Identities=14%  Similarity=0.169  Sum_probs=6.8

Q ss_pred             HHHHHHHHHHHHHh
Q psy592          208 QVLVLVFVVYSRRR  221 (339)
Q Consensus       208 l~~~l~~~~~~~~~  221 (339)
                      +++++.+++|.+||
T Consensus       535 ~~l~~G~~~~~~Rr  548 (552)
T TIGR03521       535 LLLLFGLSFTYIRK  548 (552)
T ss_pred             HHHHHHHHHHHHHH
Confidence            34444555554444


No 128
>PRK01844 hypothetical protein; Provisional
Probab=22.71  E-value=1.8e+02  Score=21.38  Aligned_cols=23  Identities=13%  Similarity=0.199  Sum_probs=10.1

Q ss_pred             hHHHHHHHHHHHHHHHHHHHHHHH
Q psy592          196 SALFVTLLISPNQVLVLVFVVYSR  219 (339)
Q Consensus       196 ~~il~~~~~l~~l~~~l~~~~~~~  219 (339)
                      ++|++.|+.| ++.+++.|++-++
T Consensus         5 ~~I~l~I~~l-i~G~~~Gff~ark   27 (72)
T PRK01844          5 LGILVGVVAL-VAGVALGFFIARK   27 (72)
T ss_pred             HHHHHHHHHH-HHHHHHHHHHHHH
Confidence            4455444433 1334555544433


No 129
>PRK05886 yajC preprotein translocase subunit YajC; Validated
Probab=22.70  E-value=94  Score=24.79  Aligned_cols=10  Identities=10%  Similarity=0.325  Sum_probs=4.0

Q ss_pred             HHHHHHHHHH
Q psy592          211 VLVFVVYSRR  220 (339)
Q Consensus       211 ~l~~~~~~~~  220 (339)
                      ++.|++.|++
T Consensus        16 i~yF~~iRPQ   25 (109)
T PRK05886         16 GFMYFASRRQ   25 (109)
T ss_pred             HHHHHHccHH
Confidence            3334444433


No 130
>PHA02902 putative IMV membrane protein; Provisional
Probab=21.99  E-value=1.4e+02  Score=21.35  Aligned_cols=14  Identities=21%  Similarity=0.581  Sum_probs=8.1

Q ss_pred             HHHHHHHHHHHHHh
Q psy592          208 QVLVLVFVVYSRRR  221 (339)
Q Consensus       208 l~~~l~~~~~~~~~  221 (339)
                      ++.++++.+|+|-|
T Consensus        15 Ivclliya~YrR~k   28 (70)
T PHA02902         15 IFCLLIYAAYKRYK   28 (70)
T ss_pred             HHHHHHHHHHHHhc
Confidence            34455667776654


No 131
>PF12768 Rax2:  Cortical protein marker for cell polarity
Probab=21.81  E-value=1.1e+02  Score=28.69  Aligned_cols=24  Identities=29%  Similarity=0.438  Sum_probs=11.0

Q ss_pred             hHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy592          196 SALFVTLLISPNQVLVLVFVVYSRRR  221 (339)
Q Consensus       196 ~~il~~~~~l~~l~~~l~~~~~~~~~  221 (339)
                      +||.+.+++|  |+++-+++.+.+||
T Consensus       234 lAiALG~v~l--l~l~Gii~~~~~r~  257 (281)
T PF12768_consen  234 LAIALGTVFL--LVLIGIILAYIRRR  257 (281)
T ss_pred             hHHHHHHHHH--HHHHHHHHHHHHhh
Confidence            4554444444  44444444444444


No 132
>PF15431 TMEM190:  Transmembrane protein 190
Probab=21.71  E-value=2.2e+02  Score=22.79  Aligned_cols=45  Identities=9%  Similarity=0.096  Sum_probs=23.3

Q ss_pred             EEEccccCCCccccc-ccc-----c--hHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy592          177 IWLNAVAVNDGQWHF-VKT-----G--SALFVTLLISPNQVLVLVFVVYSRRRE  222 (339)
Q Consensus       177 v~v~~~~~~~~~~~~-~~~-----g--~~il~~~~~l~~l~~~l~~~~~~~~~~  222 (339)
                      +|+-.|..+|+|+.. ..+     .  ..-..|.-+|+ |+..+-++.|.+|+.
T Consensus        36 LrLrCCYrdGvCYhQRpDEnmrrKHmWaL~wtC~gll~-Li~~iclFWWAkRrd   88 (134)
T PF15431_consen   36 LRLRCCYRDGVCYHQRPDENMRRKHMWALGWTCGGLLL-LICSICLFWWAKRRD   88 (134)
T ss_pred             eeeeeecccceeeccCcchhHHHHHHHHHHHHHHhHHH-HHHHHHHHHHHHHhc
Confidence            566668889988751 111     1  33456655551 222333455555553


No 133
>KOG3653|consensus
Probab=21.59  E-value=1.4e+02  Score=30.11  Aligned_cols=16  Identities=31%  Similarity=0.764  Sum_probs=10.0

Q ss_pred             HHHHHHHHHHHHhhhc
Q psy592          209 VLVLVFVVYSRRREAH  224 (339)
Q Consensus       209 ~~~l~~~~~~~~~~~~  224 (339)
                      +++++|++|++||.++
T Consensus       170 lvi~~~~~~r~~k~~~  185 (534)
T KOG3653|consen  170 LVILAFLGYRQRKNAR  185 (534)
T ss_pred             HHHHHHHHHHHhhccc
Confidence            3455667787777653


No 134
>PF06247 Plasmod_Pvs28:  Plasmodium ookinete surface protein Pvs28;  InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=21.47  E-value=28  Score=30.47  Aligned_cols=57  Identities=18%  Similarity=0.376  Sum_probs=34.2

Q ss_pred             CCEEeecCCCccC-------CCCCCCCCCCCcccCC-CCCCCCCCCCCeeecCC-----CCceeeCCCCCC
Q psy592           10 ITYLYDLANPGLH-------KNSVPGCPATEAVCNA-HSRLHCYPEHGVCVGSM-----HTAKCQCLPGWS   67 (339)
Q Consensus        10 n~~~~dl~~~~~~-------~~s~~gC~~~~~~C~~-~~~~~pC~~~g~Cv~~~-----~~~~C~C~~g~~   67 (339)
                      ||.++.+++..+-       .-++..|.... .|.. .....+|++.++|+...     ..|+|.|.+||+
T Consensus        10 NG~LiQMSNHfEC~Cnegfvl~~EntCE~kv-~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~   79 (197)
T PF06247_consen   10 NGYLIQMSNHFECKCNEGFVLKNENTCEEKV-ECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGYI   79 (197)
T ss_dssp             TEEEEEESSEEEEEESTTEEEEETTEEEE-----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEE
T ss_pred             CCEEEEccCceEEEcCCCcEEccccccccce-ecCcccccCccccchhhhhcCCCcccceeEEEecccCce
Confidence            6666666665431       12455665433 5654 22356999999999875     479999999984


No 135
>PF05545 FixQ:  Cbb3-type cytochrome oxidase component FixQ;  InterPro: IPR008621 This family consists of several Cbb3-type cytochrome oxidase components (FixQ/CcoQ). FixQ is found in nitrogen fixing bacteria. Since nitrogen fixation is an energy-consuming process, effective symbioses depend on operation of a respiratory chain with a high affinity for O2, closely coupled to ATP production. This requirement is fulfilled by a special three-subunit terminal oxidase (cytochrome terminal oxidase cbb3), which was first identified in Bradyrhizobium japonicum as the product of the fixNOQP operon [].
Probab=21.06  E-value=1.1e+02  Score=20.29  Aligned_cols=7  Identities=29%  Similarity=0.472  Sum_probs=3.0

Q ss_pred             HHHHHHH
Q psy592          214 FVVYSRR  220 (339)
Q Consensus       214 ~~~~~~~  220 (339)
                      +.+++++
T Consensus        27 ~w~~~~~   33 (49)
T PF05545_consen   27 IWAYRPR   33 (49)
T ss_pred             HHHHccc
Confidence            3444444


No 136
>KOG1834|consensus
Probab=21.05  E-value=4.3e+02  Score=27.96  Aligned_cols=66  Identities=18%  Similarity=0.194  Sum_probs=40.5

Q ss_pred             cccEEEEEEeecC-------CceEEEEEecC--CCCceEEEEEECCEEEEEEecC-----CCcccceeEEecccccCccc
Q psy592           97 YTTQLQLKFRTRE-------EFGELFRLSDQ--HNREYAILEIKDSKLRFRYNLN-----NLKTDEKDIWLNAIKDSKLR  162 (339)
Q Consensus        97 ~~~~i~l~frT~~-------~~GlL~~~~~~--~~~~~i~L~L~~G~l~~~~~lG-----~~~~~~~~l~~~~v~dG~WH  162 (339)
                      .++.|+|++|-..       ..-.|+-.++.  -+.++..|.+.+=++.|.++-.     +++.-+....+..|.|.+||
T Consensus       366 dhFTlSfwMkHg~~p~~~~~eketIlCnsdk~emnrhHyslyvh~Crl~fllr~d~~~~~~fRpaef~Wkl~qVCD~EWH  445 (952)
T KOG1834|consen  366 DHFTLSFWMKHGPGPKDEQSEKETILCNSDKTEMNRHHYSLYVHGCRLEFLLRRDAGATSDFRPAEFHWKLPQVCDNEWH  445 (952)
T ss_pred             CceEEEEeeecCCCCccccccceeEEecccccccccceeEEEEeccEEEEEEccCccccccccchheeccchhhhhhhhh
Confidence            3455666655321       12345544432  2567889999999999998752     22222344446789999999


No 137
>PF11770 GAPT:  GRB2-binding adapter (GAPT);  InterPro: IPR021082  This entry represents a family of transmembrane proteins which bind the growth factor receptor-bound protein 2 (GRB2) in B cells []. In contrast to other transmembrane adaptor proteins, GAPT, which this entry represents, is not phosphorylated upon BCR ligation. It associates with GRB2 constitutively through its proline-rich region []. 
Probab=20.86  E-value=24  Score=29.69  Aligned_cols=10  Identities=10%  Similarity=0.162  Sum_probs=4.2

Q ss_pred             hHHHHHHHHH
Q psy592          196 SALFVTLLIS  205 (339)
Q Consensus       196 ~~il~~~~~l  205 (339)
                      ++|++.|.+|
T Consensus        10 v~i~igi~Ll   19 (158)
T PF11770_consen   10 VAISIGISLL   19 (158)
T ss_pred             HHHHHHHHHH
Confidence            3444444443


No 138
>KOG0994|consensus
Probab=20.86  E-value=66  Score=35.88  Aligned_cols=16  Identities=44%  Similarity=1.165  Sum_probs=13.4

Q ss_pred             eeeCCCCCCCCCCCCC
Q psy592           59 KCQCLPGWSGTGCVTP   74 (339)
Q Consensus        59 ~C~C~~g~~G~~Ce~~   74 (339)
                      .|+|.|||.|..|++-
T Consensus      1085 QCqCkpGfGGR~C~qC 1100 (1758)
T KOG0994|consen 1085 QCQCKPGFGGRTCSQC 1100 (1758)
T ss_pred             ceeccCCCCCcchhHH
Confidence            5889999999988765


No 139
>PF04478 Mid2:  Mid2 like cell wall stress sensor;  InterPro: IPR007567 This family represents a region near the C terminus of Mid2, which contains a transmembrane region. The remainder of the protein sequence is serine-rich and of low complexity, and is therefore impossible to align accurately. Mid2 is thought to act as a mechanosensor of cell wall stress. The C-terminal cytoplasmic region of Mid2 is known to interact with Rom2, a guanine nucleotide exchange factor (GEF) for Rho1, which is part of the cell wall integrity signalling pathway [].
Probab=20.47  E-value=15  Score=31.14  Aligned_cols=8  Identities=38%  Similarity=0.833  Sum_probs=3.3

Q ss_pred             HHHHHHHh
Q psy592          214 FVVYSRRR  221 (339)
Q Consensus       214 ~~~~~~~~  221 (339)
                      |+++.|+|
T Consensus        71 f~~c~r~k   78 (154)
T PF04478_consen   71 FIFCIRRK   78 (154)
T ss_pred             eeEEEecc
Confidence            44444444


No 140
>PF11669 WBP-1:  WW domain-binding protein 1;  InterPro: IPR021684  This family of proteins represents WBP-1, a ligand of the WW domain of Yes-associated protein. This protein has a proline-rich domain. WBP-1 does not bind to the SH3 domain []. 
Probab=20.36  E-value=2.8e+02  Score=21.64  Aligned_cols=13  Identities=23%  Similarity=0.371  Sum_probs=7.7

Q ss_pred             hcccccccCCCCC
Q psy592          235 RENIINYDDEGGG  247 (339)
Q Consensus       235 reni~~Y~eEggG  247 (339)
                      +-|++.|.+....
T Consensus        57 e~~~~~~~~a~~~   69 (102)
T PF11669_consen   57 EINLIAYQEAHNY   69 (102)
T ss_pred             cccccccccCCCC
Confidence            3466777665544


No 141
>PRK00269 zipA cell division protein ZipA; Reviewed
Probab=20.06  E-value=2.1e+02  Score=26.94  Aligned_cols=12  Identities=25%  Similarity=0.465  Sum_probs=7.2

Q ss_pred             HHHHHHHHHhhh
Q psy592          212 LVFVVYSRRREA  223 (339)
Q Consensus       212 l~~~~~~~~~~~  223 (339)
                      ++|--|+|.|..
T Consensus        20 il~~~~~r~r~~   31 (293)
T PRK00269         20 ILFDGWRRMRGG   31 (293)
T ss_pred             HHHHHHHHHhcc
Confidence            455567776543


Done!