Query         psy12432
Match_columns 229
No_of_seqs    157 out of 1495
Neff          5.7 
Searched_HMMs 46136
Date          Fri Aug 16 19:08:19 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy12432.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/12432hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG1219|consensus               99.4   5E-13 1.1E-17  141.9  10.7  170   22-208  3884-4102(4289)
  2 PHA03099 epidermal growth fact  99.3 1.4E-12 3.1E-17  104.7   5.8   90   24-119    41-130 (139)
  3 PF02158 Neuregulin:  Neureguli  98.8 1.4E-09   3E-14  101.2   0.1   83   79-166     5-106 (404)
  4 PHA02887 EGF-like protein; Pro  98.4 1.5E-07 3.3E-12   74.8   3.3   60    8-71     66-125 (126)
  5 KOG1219|consensus               98.2 1.7E-06 3.7E-11   93.9   6.0   96   16-119  3916-4025(4289)
  6 KOG4289|consensus               97.9 1.5E-05 3.2E-10   84.2   5.0   53   21-77   1219-1283(2531)
  7 PF00008 EGF:  EGF-like domain   97.7   3E-05 6.4E-10   48.1   2.9   29   35-66      4-32  (32)
  8 smart00179 EGF_CA Calcium-bind  96.8  0.0018   4E-08   40.1   3.4   30   35-68      9-39  (39)
  9 cd00054 EGF_CA Calcium-binding  96.5  0.0037 8.1E-08   38.0   3.3   30   35-68      9-38  (38)
 10 KOG4289|consensus               96.2  0.0048   1E-07   66.1   4.3   52   16-70   1252-1318(2531)
 11 cd00053 EGF Epidermal growth f  96.1   0.008 1.7E-07   35.9   3.4   30   35-68      6-36  (36)
 12 PF12955 DUF3844:  Domain of un  96.0   0.018 3.9E-07   45.2   5.7   38   35-72     13-64  (103)
 13 PF02439 Adeno_E3_CR2:  Adenovi  95.8   0.014 2.9E-07   37.9   3.4   21   86-106     4-24  (38)
 14 smart00181 EGF Epidermal growt  95.2   0.035 7.5E-07   33.8   3.7   29   35-68      6-35  (35)
 15 PF14575 EphA2_TM:  Ephrin type  95.2  0.0094   2E-07   44.0   1.3   12  152-163    54-65  (75)
 16 PF01102 Glycophorin_A:  Glycop  94.8   0.036 7.7E-07   44.7   3.7   29   87-117    66-94  (122)
 17 PF07974 EGF_2:  EGF-like domai  94.7   0.049 1.1E-06   33.8   3.4   27   35-67      6-32  (32)
 18 PF12661 hEGF:  Human growth fa  94.5   0.017 3.7E-07   29.1   0.8   13   55-67      1-13  (13)
 19 PF14991 MLANA:  Protein melan-  94.0  0.0097 2.1E-07   47.4  -1.1   31   91-121    26-56  (118)
 20 PF12273 RCR:  Chitin synthesis  94.0   0.052 1.1E-06   43.6   3.1    6  171-176    78-83  (130)
 21 PF05454 DAG1:  Dystroglycan (D  93.4   0.022 4.7E-07   52.2   0.0   10  141-150   186-195 (290)
 22 PHA03099 epidermal growth fact  93.1    0.17 3.6E-06   41.3   4.6   22   19-40     62-83  (139)
 23 KOG1214|consensus               93.0   0.096 2.1E-06   54.1   3.7   46   21-70    806-866 (1289)
 24 PF13908 Shisa:  Wnt and FGF in  92.6    0.26 5.7E-06   41.4   5.4   14   88-101    78-91  (179)
 25 PF01034 Syndecan:  Syndecan do  91.3   0.051 1.1E-06   39.1  -0.3   19   88-106    12-30  (64)
 26 PF07645 EGF_CA:  Calcium-bindi  90.0    0.37 8.1E-06   31.1   2.9   31   27-63      4-34  (42)
 27 PF05393 Hum_adeno_E3A:  Human   89.2     0.5 1.1E-05   36.2   3.5   15   96-110    41-55  (94)
 28 PF15069 FAM163:  FAM163 family  89.0     1.4   3E-05   36.5   6.2   24   88-111     6-29  (143)
 29 PF04478 Mid2:  Mid2 like cell   88.8   0.025 5.5E-07   47.2  -4.1   26   91-118    55-80  (154)
 30 PF07204 Orthoreo_P10:  Orthore  88.7    0.42 9.2E-06   36.9   2.8   32   87-119    42-73  (98)
 31 KOG2052|consensus               88.5     1.3 2.7E-05   43.3   6.5   32   16-47     27-60  (513)
 32 PF12662 cEGF:  Complement Clr-  88.4    0.35 7.5E-06   28.3   1.7   21   23-45      1-21  (24)
 33 PF08374 Protocadherin:  Protoc  87.4     1.4   3E-05   38.9   5.5   10  183-192   146-155 (221)
 34 KOG3516|consensus               87.0    0.42   9E-06   51.0   2.5   40   23-70    543-583 (1306)
 35 PF12877 DUF3827:  Domain of un  86.2    0.44 9.4E-06   48.0   2.0   29   90-118   271-299 (684)
 36 KOG3514|consensus               86.2    0.41 8.9E-06   50.7   1.9   34   34-71    628-662 (1591)
 37 PF00558 Vpu:  Vpu protein;  In  85.4    0.97 2.1E-05   34.0   3.1   26   95-120    13-38  (81)
 38 KOG3607|consensus               84.8     1.4 2.9E-05   45.3   4.8   29   37-72    632-660 (716)
 39 PF15330 SIT:  SHP2-interacting  84.7     1.1 2.5E-05   35.2   3.4    7  107-113    19-25  (107)
 40 PF05545 FixQ:  Cbb3-type cytoc  84.2     2.6 5.7E-05   28.1   4.5   22   93-114    12-33  (49)
 41 KOG1225|consensus               83.3     1.3 2.7E-05   43.9   3.7   38   25-70    297-343 (525)
 42 PF08693 SKG6:  Transmembrane a  82.9    0.11 2.4E-06   34.1  -2.5   18   88-105    11-28  (40)
 43 KOG1225|consensus               81.9     1.8   4E-05   42.8   4.3   14   55-68    297-310 (525)
 44 KOG1217|consensus               80.7     2.4 5.3E-05   38.8   4.5   42   22-67    250-306 (487)
 45 PF01299 Lamp:  Lysosome-associ  80.4    0.79 1.7E-05   41.8   1.1   28   87-116   272-299 (306)
 46 PF05568 ASFV_J13L:  African sw  79.9     3.2 6.8E-05   34.9   4.4   15  105-119    46-60  (189)
 47 KOG0994|consensus               79.7     1.5 3.2E-05   47.1   2.9   50   18-69    879-949 (1758)
 48 PF12191 stn_TNFRSF12A:  Tumour  79.1    0.61 1.3E-05   37.9   0.0   27   88-114    79-105 (129)
 49 PF07213 DAP10:  DAP10 membrane  79.0     2.5 5.4E-05   31.7   3.2   18  101-118    48-65  (79)
 50 PF14584 DUF4446:  Protein of u  78.8     3.4 7.5E-05   34.4   4.4   25   91-115     4-28  (151)
 51 cd01324 cbb3_Oxidase_CcoQ Cyto  78.5     3.2 6.9E-05   28.1   3.3   32   88-119     8-39  (48)
 52 PF12947 EGF_3:  EGF domain;  I  78.3     1.8   4E-05   27.4   2.0   27   35-65      6-32  (36)
 53 KOG1094|consensus               78.2      18 0.00039   37.0   9.7   32   88-119   390-421 (807)
 54 TIGR01478 STEVOR variant surfa  78.0     2.7 5.9E-05   38.6   3.8    7   24-30    143-149 (295)
 55 PTZ00370 STEVOR; Provisional    77.6     2.7 5.9E-05   38.6   3.7    9   24-32    143-151 (296)
 56 PF12301 CD99L2:  CD99 antigen   77.5     2.5 5.4E-05   35.9   3.2   29   88-116   114-142 (169)
 57 KOG3637|consensus               77.4       1 2.2E-05   48.0   0.9   33   85-117   976-1010(1030)
 58 PF08374 Protocadherin:  Protoc  77.2     1.1 2.4E-05   39.5   1.0   21   88-108    41-61  (221)
 59 PF06697 DUF1191:  Protein of u  76.6     0.9   2E-05   41.5   0.3   13   87-99    212-224 (278)
 60 PF03302 VSP:  Giardia variant-  76.6    0.65 1.4E-05   44.2  -0.7   12   85-97    364-375 (397)
 61 PF02009 Rifin_STEVOR:  Rifin/s  76.5     4.4 9.5E-05   37.4   4.8   12  110-121   276-287 (299)
 62 KOG1217|consensus               75.7     3.3 7.2E-05   37.9   3.8   45   22-70    150-208 (487)
 63 PF08114 PMP1_2:  ATPase proteo  74.4     4.6  0.0001   26.7   3.1   23   96-118    17-39  (43)
 64 PF02038 ATP1G1_PLM_MAT8:  ATP1  74.1     4.4 9.6E-05   27.8   3.1   24   96-119    22-45  (50)
 65 PF06305 DUF1049:  Protein of u  73.6      10 0.00022   26.4   5.0   26   90-115    24-49  (68)
 66 KOG1836|consensus               72.0     2.5 5.5E-05   47.1   2.4   45   26-72    759-816 (1705)
 67 PRK00523 hypothetical protein;  71.5      11 0.00024   27.7   4.9   17   90-106     6-22  (72)
 68 PF06084 Cytomega_TRL10:  Cytom  71.5     2.5 5.5E-05   34.1   1.7   19  152-170   118-136 (150)
 69 COG4736 CcoQ Cbb3-type cytochr  70.8     8.9 0.00019   27.3   4.1   35   95-129    14-51  (60)
 70 PF11770 GAPT:  GRB2-binding ad  70.7     1.2 2.5E-05   37.3  -0.4   16   88-103    10-25  (158)
 71 smart00051 DSL delta serrate l  70.4     3.5 7.6E-05   29.3   2.1   41   24-67     17-63  (63)
 72 PF15102 TMEM154:  TMEM154 prot  69.7     1.2 2.5E-05   37.1  -0.6    6  114-119    83-88  (146)
 73 PF12768 Rax2:  Cortical protei  68.6     4.4 9.5E-05   36.9   2.8    8   92-99    234-241 (281)
 74 PF06365 CD34_antigen:  CD34/Po  68.5     3.6 7.7E-05   36.0   2.1   20   98-117   111-130 (202)
 75 PF15065 NCU-G1:  Lysosomal tra  68.0       2 4.4E-05   40.4   0.5   28   91-118   322-349 (350)
 76 COG1862 YajC Preprotein transl  66.8     5.9 0.00013   30.7   2.8   19  100-118    15-33  (97)
 77 PF15345 TMEM51:  Transmembrane  65.6     8.7 0.00019   34.3   4.0   25  142-166   120-144 (233)
 78 KOG3516|consensus               65.1     7.3 0.00016   42.0   3.9   33   35-71    961-994 (1306)
 79 PF10577 UPF0560:  Uncharacteri  65.1     6.1 0.00013   40.9   3.3   11   36-46    220-230 (807)
 80 PF15050 SCIMP:  SCIMP protein   64.9      12 0.00025   30.4   4.2    9  107-115    26-34  (133)
 81 KOG1226|consensus               64.8     7.5 0.00016   40.1   3.8   49   18-70    571-622 (783)
 82 PF01299 Lamp:  Lysosome-associ  64.7       4 8.7E-05   37.2   1.8   32   89-121   270-301 (306)
 83 PF14610 DUF4448:  Protein of u  64.6     1.2 2.5E-05   37.9  -1.6   22   89-110   159-180 (189)
 84 PRK11901 hypothetical protein;  64.4     7.2 0.00016   36.5   3.3   19   89-107    37-55  (327)
 85 PF11980 DUF3481:  Domain of un  64.1     7.4 0.00016   29.6   2.8   29   87-115    16-45  (87)
 86 PF04971 Lysis_S:  Lysis protei  63.1      12 0.00026   27.3   3.6   15   91-105    35-49  (68)
 87 PF10873 DUF2668:  Protein of u  62.9       4 8.6E-05   34.0   1.2   29   86-114    62-90  (155)
 88 PHA03265 envelope glycoprotein  62.2     3.8 8.3E-05   38.7   1.2   15  108-122   368-382 (402)
 89 KOG0196|consensus               61.5     7.8 0.00017   40.5   3.3   17  147-163   602-618 (996)
 90 PF05568 ASFV_J13L:  African sw  60.9      18  0.0004   30.4   4.8   36   86-121    30-65  (189)
 91 PF06143 Baculo_11_kDa:  Baculo  60.6      27 0.00059   26.5   5.3   27   85-111    34-60  (84)
 92 PF02480 Herpes_gE:  Alphaherpe  59.9       3 6.4E-05   40.4   0.0   31   91-121   354-384 (439)
 93 PF14851 FAM176:  FAM176 family  59.7       7 0.00015   32.8   2.2   20   88-107    24-43  (153)
 94 PTZ00046 rifin; Provisional     59.6      16 0.00035   34.6   4.8   16   93-108   323-338 (358)
 95 PRK12705 hypothetical protein;  59.2      12 0.00026   37.0   4.0   29   91-119     5-33  (508)
 96 PF15099 PIRT:  Phosphoinositid  59.0     8.4 0.00018   31.4   2.4   11  101-111    93-103 (129)
 97 PF14914 LRRC37AB_C:  LRRC37A/B  58.9      12 0.00025   31.4   3.3   27   90-116   123-149 (154)
 98 KOG3488|consensus               58.8      16 0.00035   27.0   3.6   21   88-108    48-68  (81)
 99 TIGR02976 phageshock_pspB phag  58.2      16 0.00035   27.0   3.7   12  104-115    18-29  (75)
100 PF12597 DUF3767:  Protein of u  58.0      24 0.00052   28.2   4.9   27   95-121    72-98  (118)
101 PHA02902 putative IMV membrane  57.6      15 0.00032   26.7   3.2   22   97-118     9-30  (70)
102 TIGR01477 RIFIN variant surfac  57.5      18 0.00038   34.3   4.7   14  108-121   328-341 (353)
103 PF03302 VSP:  Giardia variant-  57.2     2.9 6.4E-05   39.7  -0.5   27   86-112   368-395 (397)
104 PF05083 LST1:  LST-1 protein;   56.9      11 0.00023   27.8   2.5    9  108-116    16-24  (74)
105 PF11884 DUF3404:  Domain of un  56.9      22 0.00049   32.3   5.1   10  105-114   248-257 (262)
106 PF10873 DUF2668:  Protein of u  56.6      25 0.00054   29.4   4.8   27   88-114    60-86  (155)
107 PF04478 Mid2:  Mid2 like cell   56.4    0.96 2.1E-05   37.9  -3.4   31   90-121    50-80  (154)
108 PF12259 DUF3609:  Protein of u  56.4     8.1 0.00018   36.5   2.3   17  100-116   311-327 (361)
109 PF15099 PIRT:  Phosphoinositid  55.2      11 0.00024   30.6   2.6   29   91-119    87-115 (129)
110 PF15330 SIT:  SHP2-interacting  54.8      19 0.00042   28.3   3.8   21  100-120     9-29  (107)
111 PRK10905 cell division protein  53.4     8.3 0.00018   36.0   1.8   13   93-105     3-15  (328)
112 PF00053 Laminin_EGF:  Laminin   53.1     9.7 0.00021   25.0   1.7   23   41-69     11-33  (49)
113 PHA03265 envelope glycoprotein  53.1     6.9 0.00015   37.1   1.2   39   83-121   346-384 (402)
114 PF05399 EVI2A:  Ectropic viral  52.8      31 0.00068   30.6   5.1   10   62-71    116-125 (227)
115 PF01102 Glycophorin_A:  Glycop  51.4      21 0.00045   28.8   3.6   29   92-121    67-95  (122)
116 KOG1214|consensus               51.1      15 0.00032   38.7   3.3   41   24-65    757-820 (1289)
117 PF06247 Plasmod_Pvs28:  Plasmo  51.1     8.5 0.00018   33.5   1.4   42   23-64     19-80  (197)
118 TIGR00847 ccoS cytochrome oxid  50.5      22 0.00048   24.4   3.1   21   94-114     8-28  (51)
119 PTZ00045 apical membrane antig  50.4      48   0.001   33.4   6.6   29   23-64    459-487 (595)
120 PF07423 DUF1510:  Protein of u  50.4      12 0.00026   33.1   2.2   12   94-105    20-31  (217)
121 PRK00523 hypothetical protein;  50.2      32  0.0007   25.3   4.1   34   91-124     3-37  (72)
122 PF15176 LRR19-TM:  Leucine-ric  49.9      26 0.00057   27.5   3.8   23   86-108    16-38  (102)
123 COG4499 Predicted membrane pro  49.6      20 0.00043   34.5   3.6   29   92-120   223-251 (434)
124 PF15048 OSTbeta:  Organic solu  48.6      16 0.00035   29.6   2.6   32   87-118    35-66  (125)
125 PRK09458 pspB phage shock prot  48.2      18  0.0004   26.8   2.6   17   98-114    12-28  (75)
126 cd00055 EGF_Lam Laminin-type e  47.3      15 0.00033   24.3   1.9   17   53-69     18-34  (50)
127 PRK14748 kdpF potassium-transp  46.6      39 0.00085   20.6   3.3   14   91-104     6-19  (29)
128 PF02439 Adeno_E3_CR2:  Adenovi  46.3      42  0.0009   21.8   3.7   16   92-107     6-21  (38)
129 PF15347 PAG:  Phosphoprotein a  46.3      26 0.00057   33.5   3.9   26   96-121    24-49  (428)
130 PF02699 YajC:  Preprotein tran  46.2      40 0.00086   24.9   4.2   10  112-121    24-33  (82)
131 COG3763 Uncharacterized protei  45.8      57  0.0012   24.0   4.8   14   92-105     8-21  (71)
132 PF05808 Podoplanin:  Podoplani  45.7     6.9 0.00015   33.1   0.0   12   88-99    132-143 (162)
133 PTZ00208 65 kDa invariant surf  45.7      12 0.00027   36.0   1.7   31   86-116   384-414 (436)
134 PF05510 Sarcoglycan_2:  Sarcog  45.5      20 0.00043   34.4   3.0   25   91-115   290-314 (386)
135 KOG4260|consensus               45.1      18 0.00038   33.5   2.5   43   27-71    143-185 (350)
136 PF06305 DUF1049:  Protein of u  44.4      70  0.0015   22.0   5.1    7   92-98     30-36  (68)
137 PHA02681 ORF089 virion membran  44.0      37  0.0008   25.9   3.7   18   98-115    10-27  (92)
138 PF05283 MGC-24:  Multi-glycosy  43.9      21 0.00045   30.9   2.6   29   84-113   157-185 (186)
139 PF14575 EphA2_TM:  Ephrin type  43.2      11 0.00024   27.6   0.8   28   91-118     3-30  (75)
140 PF05393 Hum_adeno_E3A:  Human   42.7      35 0.00076   26.2   3.4   28   93-120    34-62  (94)
141 KOG3653|consensus               42.7 1.2E+02  0.0025   30.3   7.8    6  181-186   286-291 (534)
142 PRK13855 type IV secretion sys  42.1      68  0.0015   30.7   6.0    9  100-108    39-47  (376)
143 KOG1226|consensus               42.1      25 0.00054   36.4   3.3   20   25-44    606-626 (783)
144 PF03597 CcoS:  Cytochrome oxid  41.7      39 0.00084   22.5   3.2   20   95-114     8-27  (45)
145 PF07010 Endomucin:  Endomucin;  40.8      35 0.00076   30.6   3.6    6  108-113   210-215 (259)
146 PF11694 DUF3290:  Protein of u  40.8      33 0.00072   28.5   3.3   19   95-113    23-41  (149)
147 PF04863 EGF_alliinase:  Alliin  40.8      10 0.00023   26.6   0.3   36   36-71     18-53  (56)
148 PF04689 S1FA:  DNA binding pro  40.6      23  0.0005   25.7   2.0   14  100-113    26-39  (69)
149 PF00558 Vpu:  Vpu protein;  In  40.4      37  0.0008   25.6   3.2   24   98-121    19-42  (81)
150 PF09835 DUF2062:  Uncharacteri  40.4      67  0.0015   25.9   5.1   19   91-109   122-140 (154)
151 KOG1024|consensus               40.4      20 0.00044   35.0   2.2   29   91-119   188-216 (563)
152 PRK01658 holin-like protein; V  40.4      59  0.0013   26.0   4.6   22   87-108    88-109 (122)
153 KOG4482|consensus               39.9      31 0.00068   33.1   3.4   28   91-118   302-329 (449)
154 TIGR01195 oadG_fam sodium pump  39.9      67  0.0014   23.9   4.6   10   91-100    12-21  (82)
155 PF07271 Cytadhesin_P30:  Cytad  39.3      15 0.00032   33.6   1.1    9  140-148   148-156 (279)
156 PF14991 MLANA:  Protein melan-  39.2       6 0.00013   31.7  -1.2   33   86-119    25-57  (118)
157 PF10883 DUF2681:  Protein of u  39.1      52  0.0011   25.1   3.9    8  107-114    20-27  (87)
158 PRK01844 hypothetical protein;  38.9      79  0.0017   23.3   4.7   25  100-124    12-36  (72)
159 KOG3653|consensus               38.2   1E+02  0.0022   30.7   6.7    9   53-61    115-123 (534)
160 PF11157 DUF2937:  Protein of u  38.1      51  0.0011   27.7   4.1   17   85-101   131-147 (167)
161 PF03908 Sec20:  Sec20;  InterP  38.0      38 0.00082   25.3   3.0   15  100-114    77-91  (92)
162 PF10883 DUF2681:  Protein of u  37.4      85  0.0018   23.9   4.8   26   90-115     6-31  (87)
163 PF07297 DPM2:  Dolichol phosph  37.3      59  0.0013   24.3   3.9   28   89-116    47-75  (78)
164 PF09472 MtrF:  Tetrahydrometha  37.0      41 0.00089   24.2   2.9   20   87-106    42-61  (64)
165 PF09835 DUF2062:  Uncharacteri  36.3      72  0.0016   25.7   4.7   12   88-99    123-134 (154)
166 smart00180 EGF_Lam Laminin-typ  36.3      27 0.00059   22.8   1.8   16   54-69     18-33  (46)
167 PF01414 DSL:  Delta serrate li  34.7      19  0.0004   25.6   0.8   41   24-67     17-63  (63)
168 PF03554 Herpes_UL73:  UL73 vir  34.7 1.2E+02  0.0025   22.9   5.1   29   86-114    47-75  (82)
169 PF12768 Rax2:  Cortical protei  34.5      41 0.00089   30.7   3.2   12   88-99    234-245 (281)
170 PHA02887 EGF-like protein; Pro  34.5      32 0.00069   27.9   2.2   21   19-39    103-123 (126)
171 PTZ00233 variable surface prot  34.4      28 0.00061   34.5   2.3   27   95-121   443-469 (509)
172 KOG3054|consensus               34.3      41 0.00089   30.6   3.1   18  165-184    78-96  (299)
173 PRK13275 mtrF tetrahydromethan  34.2      78  0.0017   23.0   4.0   20   87-106    42-61  (67)
174 PF02430 AMA-1:  Apical membran  32.6      15 0.00032   35.9   0.0   31   87-117   429-459 (471)
175 PF14670 FXa_inhibition:  Coagu  32.4      45 0.00097   21.0   2.2   18   42-63     11-28  (36)
176 PF11395 DUF2873:  Protein of u  32.4      61  0.0013   21.1   2.8   11   98-108    18-28  (43)
177 PRK11486 flagellar biosynthesi  32.3      59  0.0013   26.3   3.4   19   97-115    24-42  (124)
178 PF13124 DUF3963:  Protein of u  31.7      84  0.0018   20.3   3.3   21   87-107    18-38  (40)
179 COG3115 ZipA Cell division pro  31.7 1.6E+02  0.0035   27.5   6.5   10  208-217   133-142 (324)
180 PF01034 Syndecan:  Syndecan do  31.7      17 0.00036   26.3   0.1   17   89-105    17-33  (64)
181 PF14584 DUF4446:  Protein of u  31.6   1E+02  0.0023   25.5   4.9   23   93-115     9-31  (151)
182 PF12158 DUF3592:  Protein of u  31.5      70  0.0015   24.9   3.8    9  183-191   101-109 (148)
183 cd01475 vWA_Matrilin VWA_Matri  31.2      50  0.0011   28.2   3.1   36   22-65    180-219 (224)
184 PF15339 Afaf:  Acrosome format  31.0   1E+02  0.0022   26.7   4.7    6   91-96    133-138 (200)
185 PF13703 PepSY_TM_2:  PepSY-ass  31.0      80  0.0017   23.2   3.8   15   91-105    19-33  (88)
186 PF05356 Phage_Coat_B:  Phage C  30.9      91   0.002   23.3   3.9   23   91-113    59-81  (83)
187 PF13268 DUF4059:  Protein of u  30.7      74  0.0016   23.5   3.4   19  100-118    20-38  (72)
188 PF12606 RELT:  Tumour necrosis  30.7      66  0.0014   22.0   3.0   17   99-115    10-26  (50)
189 TIGR00859 ENaC sodium channel   30.0 3.5E+02  0.0076   27.2   9.2   22  178-200   564-585 (595)
190 PRK13707 conjugal transfer pil  29.7 1.2E+02  0.0025   23.5   4.6   14  106-119    53-66  (101)
191 KOG3054|consensus               29.6      52  0.0011   30.0   2.9    8  110-117    21-28  (299)
192 PF13131 DUF3951:  Protein of u  29.5      58  0.0013   22.6   2.5   19   96-114    11-29  (53)
193 TIGR02736 cbb3_Q_epsi cytochro  28.5      74  0.0016   22.4   2.9   19  100-118    11-29  (56)
194 PRK04125 murein hydrolase regu  28.4 1.2E+02  0.0027   24.9   4.8   21   87-107    91-111 (141)
195 CHL00008 petG cytochrome b6/f   28.4 1.1E+02  0.0023   19.7   3.4   24   91-114     6-31  (37)
196 PF07271 Cytadhesin_P30:  Cytad  28.3      41 0.00088   30.8   2.1   13   93-105    78-90  (279)
197 COG4218 MtrF Tetrahydromethano  28.3      60  0.0013   23.9   2.5   19   87-105    51-69  (73)
198 PF07178 TraL:  TraL protein;    28.0 1.3E+02  0.0027   22.7   4.5   10  108-117    49-58  (95)
199 PHA03283 envelope glycoprotein  27.9      36 0.00079   33.9   1.8   19   99-117   412-430 (542)
200 PF09064 Tme5_EGF_like:  Thromb  27.8      32 0.00068   21.8   0.9   10   23-32     17-26  (34)
201 PF11353 DUF3153:  Protein of u  27.7      59  0.0013   28.0   2.9   14  103-116   195-208 (209)
202 PF04415 DUF515:  Protein of un  27.7      66  0.0014   31.2   3.5   26   90-115    32-57  (416)
203 PHA03163 hypothetical protein;  27.6 2.4E+02  0.0052   21.7   5.8   29   86-114    56-84  (92)
204 TIGR01432 QOXA cytochrome aa3   27.5 1.3E+02  0.0029   26.0   5.1   10  106-115    42-51  (217)
205 PRK00665 petG cytochrome b6-f   27.2 1.1E+02  0.0025   19.6   3.4   24   91-114     6-31  (37)
206 PRK09400 secE preprotein trans  27.2 1.4E+02   0.003   21.1   4.2   26   85-110    30-55  (61)
207 KOG3514|consensus               27.0      47   0.001   36.1   2.5   25   87-111  1514-1538(1591)
208 PF02404 SCF:  Stem cell factor  27.0      21 0.00046   32.5   0.0   20   96-115   223-242 (273)
209 PF03988 DUF347:  Repeat of Unk  26.9      92   0.002   21.3   3.2   10  101-110    38-47  (55)
210 PF11431 Transport_MerF:  Membr  26.7      77  0.0017   21.4   2.7   18  100-117    28-45  (46)
211 PF10669 Phage_Gp23:  Protein g  26.4 1.8E+02  0.0038   22.9   5.0   12   95-106    23-34  (121)
212 TIGR00383 corA magnesium Mg(2+  26.3      72  0.0016   28.6   3.4   14  103-116   303-316 (318)
213 PF04156 IncA:  IncA protein;    26.3      96  0.0021   25.8   3.9   11  100-110    50-60  (191)
214 PHA03231 glycoprotein BALF4; P  26.2   1E+02  0.0023   32.4   4.8   13  106-118   718-730 (829)
215 PF11174 DUF2970:  Protein of u  26.1 1.1E+02  0.0024   21.3   3.5   19   87-105    31-49  (56)
216 PF11743 DUF3301:  Protein of u  26.0      90   0.002   23.8   3.4   17  102-118     8-24  (97)
217 PF13179 DUF4006:  Family of un  25.9 1.4E+02   0.003   21.7   4.1   18  101-118    24-41  (66)
218 COG3088 CcmH Uncharacterized p  25.7 1.1E+02  0.0024   25.7   4.0   18   91-108   108-125 (153)
219 TIGR00327 secE_euk_arch protei  25.5 1.8E+02  0.0039   20.6   4.6   26   85-110    26-51  (61)
220 PF03408 Foamy_virus_ENV:  Foam  25.5      42 0.00091   35.2   1.8    9   93-101   957-965 (981)
221 PF14163 SieB:  Superinfection   25.4 1.7E+02  0.0036   23.7   5.0   21   87-107    35-55  (151)
222 PHA03185 UL14 tegument protein  25.2      88  0.0019   27.5   3.5   31  181-211   171-201 (214)
223 PF00954 S_locus_glycop:  S-loc  25.0      61  0.0013   24.6   2.3   26   35-65     84-109 (110)
224 PF11157 DUF2937:  Protein of u  25.0      85  0.0018   26.4   3.3   18   87-104   137-154 (167)
225 PF12877 DUF3827:  Domain of un  24.6      55  0.0012   33.5   2.4   27   87-114   272-298 (684)
226 PF11359 gpUL132:  Glycoprotein  24.5      64  0.0014   28.7   2.5   30   88-118    55-84  (235)
227 PHA03164 hypothetical protein;  24.1      45 0.00097   25.1   1.3   25   89-115    61-85  (88)
228 PRK08455 fliL flagellar basal   24.0      74  0.0016   27.1   2.8    6  106-111    36-41  (182)
229 COG5353 Uncharacterized protei  23.9      45 0.00097   28.1   1.4   21   91-111     8-28  (161)
230 KOG3498|consensus               23.9 1.5E+02  0.0031   21.6   3.8   25   84-108    30-54  (67)
231 PF06809 NPDC1:  Neural prolife  23.8      18  0.0004   33.8  -1.0    9  142-150   270-278 (341)
232 PRK01026 tetrahydromethanopter  23.7 1.6E+02  0.0035   22.0   4.1   19   88-106    50-68  (77)
233 PHA03156 hypothetical protein;  23.7 3.1E+02  0.0067   21.1   5.8   29   86-114    55-83  (90)
234 PF02529 PetG:  Cytochrome B6-F  23.6 1.7E+02  0.0038   18.8   3.7   25   91-115     6-32  (37)
235 PF04202 Mfp-3:  Foot protein 3  23.3      52  0.0011   24.0   1.4   11   96-106     8-18  (71)
236 COG3630 OadG Na+-transporting   23.1 1.8E+02  0.0038   22.1   4.3   20   91-110    15-34  (84)
237 TIGR02507 MtrF tetrahydrometha  23.1 1.4E+02   0.003   21.6   3.6   19   87-105    42-60  (65)
238 cd00930 Cyt_c_Oxidase_VIII Cyt  23.1 1.5E+02  0.0032   19.7   3.5   23   91-113    16-38  (43)
239 PF05337 CSF-1:  Macrophage col  23.1      28  0.0006   32.0   0.0   14  103-116   241-254 (285)
240 PHA03286 envelope glycoprotein  23.1 1.2E+02  0.0025   30.0   4.2    9   98-106   401-409 (492)
241 KOG1025|consensus               22.9 1.1E+02  0.0024   32.8   4.2    8   58-65    571-578 (1177)
242 PF13974 YebO:  YebO-like prote  22.8      94   0.002   23.4   2.8   13   96-108     5-17  (80)
243 PF07438 DUF1514:  Protein of u  22.7      84  0.0018   22.8   2.4   10   92-101     5-14  (66)
244 PRK11901 hypothetical protein;  22.7      67  0.0015   30.2   2.4   21   87-108    39-59  (327)
245 PF01372 Melittin:  Melittin;    22.6 1.8E+02  0.0039   17.2   3.4    6  109-114    19-24  (26)
246 PF06692 MNSV_P7B:  Melon necro  22.6      93   0.002   22.0   2.5    6  100-105    18-23  (61)
247 PRK05886 yajC preprotein trans  22.5 1.3E+02  0.0028   23.8   3.7   10  112-121    26-35  (109)
248 PRK03814 oxaloacetate decarbox  22.4 1.8E+02  0.0039   21.9   4.3    8   91-98     16-23  (85)
249 PHA03291 envelope glycoprotein  22.1      46 0.00099   31.7   1.2   30   89-118   290-319 (401)
250 COG0598 CorA Mg2+ and Co2+ tra  22.0      87  0.0019   28.6   3.1   20   97-116   301-320 (322)
251 PF13120 DUF3974:  Domain of un  21.9      79  0.0017   24.8   2.3   20   98-117    14-33  (126)
252 PF11857 DUF3377:  Domain of un  21.9      36 0.00077   25.3   0.4   11  110-120    52-62  (74)
253 PF14017 DUF4233:  Protein of u  21.8 2.6E+02  0.0057   21.8   5.3   21  101-121    84-104 (107)
254 PTZ00234 variable surface prot  21.7      57  0.0012   31.7   1.8   14   97-110   370-383 (433)
255 COG4736 CcoQ Cbb3-type cytochr  21.6 1.9E+02   0.004   20.6   4.0   32   91-122     7-38  (60)
256 PF07178 TraL:  TraL protein;    21.5 1.4E+02  0.0031   22.5   3.7   23   98-120    42-64  (95)
257 PF11085 YqhR:  Conserved membr  21.5 1.7E+02  0.0036   25.1   4.4   28   87-114    67-94  (173)
258 PF05961 Chordopox_A13L:  Chord  21.4 1.2E+02  0.0025   22.2   2.9    9  108-116    19-27  (68)
259 TIGR01495 ETRAMP Plasmodium ri  21.4      81  0.0018   23.8   2.2   13   97-109    62-74  (85)
260 PF07226 DUF1422:  Protein of u  21.4      99  0.0021   24.9   2.8   20   97-116    96-115 (117)
261 PF05827 ATP-synt_S1:  Vacuolar  21.1   1E+02  0.0022   27.4   3.2   23   88-110   257-279 (282)
262 COG3105 Uncharacterized protei  20.8 1.6E+02  0.0035   24.2   4.0   13   87-99      8-20  (138)
263 PF13706 PepSY_TM_3:  PepSY-ass  20.8 1.8E+02  0.0039   18.2   3.5   12   92-103    11-22  (37)
264 PRK09546 zntB zinc transporter  20.6 1.1E+02  0.0023   27.9   3.3   10  106-115   312-321 (324)
265 PRK02919 oxaloacetate decarbox  20.6 2.7E+02  0.0058   20.8   4.9    6   91-96     15-20  (82)
266 PRK04778 septation ring format  20.6 1.4E+02   0.003   29.7   4.3   22   91-112     6-27  (569)
267 PF10251 PEN-2:  Presenilin enh  20.5 2.8E+02   0.006   21.4   5.1   28   87-114    50-77  (94)
268 PRK11056 hypothetical protein;  20.5      98  0.0021   25.0   2.6   21   97-117    96-116 (120)
269 PF10855 DUF2648:  Protein of u  20.4      73  0.0016   20.0   1.5   15  102-116     8-22  (33)
270 PF15069 FAM163:  FAM163 family  20.4   1E+02  0.0022   25.6   2.8   27   88-114    10-37  (143)
271 PF02285 COX8:  Cytochrome oxid  20.2 2.6E+02  0.0057   18.6   4.2   21   91-111    16-36  (44)
272 PF02937 COX6C:  Cytochrome c o  20.1 2.7E+02  0.0058   20.4   4.7   11   95-105    22-32  (73)
273 PF04341 DUF485:  Protein of un  20.1 1.6E+02  0.0034   22.1   3.6   20   96-115    60-79  (91)
274 PF13955 Fst_toxin:  Toxin Fst,  20.1 1.7E+02  0.0036   16.7   2.7   11   90-100     4-14  (21)
275 PRK10905 cell division protein  20.1      67  0.0014   30.2   1.8   16   92-107     5-20  (328)

No 1  
>KOG1219|consensus
Probab=99.43  E-value=5e-13  Score=141.87  Aligned_cols=170  Identities=21%  Similarity=0.381  Sum_probs=99.0

Q ss_pred             CCceeeCCCCCCC------------CCCCCCCEEEeeccCCCCceEeecCCCCCCCCCcccC----C-------------
Q psy12432         22 TFHTYACPPTYAT------------WYCLNGATCFTVKIGESLLYNCECADGYMGQRCEFKD----L-------------   72 (229)
Q Consensus        22 ~gy~C~C~~gy~g------------npClNGGtC~~~~~~~~~~~sC~C~~Gy~G~rCe~~~----~-------------   72 (229)
                      .||.|+|+.-|.|            +||++||+|....    ++|.|.|+.||+|.|||...    .             
T Consensus      3884 ggy~CkCpsqysG~~CEi~~epC~snPC~~GgtCip~~----n~f~CnC~~gyTG~~Ce~~Gi~eCs~n~C~~gg~C~n~ 3959 (4289)
T KOG1219|consen 3884 GGYKCKCPSQYSGNHCEIDLEPCASNPCLTGGTCIPFY----NGFLCNCPNGYTGKRCEARGISECSKNVCGTGGQCINI 3959 (4289)
T ss_pred             CceEEeCcccccCcccccccccccCCCCCCCCEEEecC----CCeeEeCCCCccCceeecccccccccccccCCceeecc
Confidence            3899999999997            5999999998763    37999999999999999872    1             


Q ss_pred             CCCcc----cc-c----------chhhhHHHHH--hhhHHHHHHHHHHHHHHHHHHHHHhhhhhhhccccccCCCCCCCC
Q psy12432         73 DGSYL----PS-R----------KQVMLETASI--ASGASIAVFLVVILCFSLYVHCQRRKKQAQAASVCCTDGPGSSLQ  135 (229)
Q Consensus        73 ~~~~~----p~-~----------~~~~~~~~~I--a~gI~V~lLLVlii~vvly~~~kRrk~~k~~~~~~~~d~~~~~~~  135 (229)
                      .|+|+    +. .          ......|.+.  +++|+|+|++ ++++++++|+++|++.+|+-...+-+|--     
T Consensus      3960 ~gsf~CncT~g~~gr~c~~~~pni~~~~~~~gkaEli~I~V~l~~-ifilvvlf~~crKk~~rkky~~~~~~~ll----- 4033 (4289)
T KOG1219|consen 3960 PGSFHCNCTPGILGRTCCAEKPNILSTVLWLGKAELIIIIVLLAL-IFILVVLFWKCRKKNSRKKYGAHCPVDLL----- 4033 (4289)
T ss_pred             CCceEeccChhHhcccCccccCccccccchhcccceeehhHHHHH-HHHHHHHHHhhhhhccCCcccccCChhHH-----
Confidence            11121    10 0          0112233222  2334444433 33444466666666555544333322210     


Q ss_pred             CCCCCCccCCCCcceeeeeccccCCCCCCCCCccc--ccCCCCCCcCCCccccCCCCCC-CCCCCCCCCCCCCCCC
Q psy12432        136 RPRMPFERRPSPADFVLTRITTEAPRAADTRTSIT--ITGKGDSVSASQLYHQSSPPLP-HMSHPPSCTPLPPAPS  208 (229)
Q Consensus       136 ~~~~~~~~~~~p~~~~~~~~~~e~~~~~~~~~~ld--~~~~g~~~~~~~~f~~~~~~~~-~~~~~~~~~~~~~~~~  208 (229)
                             ++++=....|++-----|++...+||||  ..+.+..++||.+|++++..+. .-|-+|--.|++|--|
T Consensus      4034 -------~rn~nVy~~v~PqvplrP~sytsrnNld~n~~s~ssv~~e~~Tf~Pes~~r~vvcSvapnlp~pspc~S 4102 (4289)
T KOG1219|consen 4034 -------RRNTNVYVPVVPQVPLRPRSYTSRNNLDSNFISTSSVPTELVTFRPESQVRPVVCSVAPNLPPPSPCGS 4102 (4289)
T ss_pred             -------hcCCCccccCCCCCCcCCcccccccccCcCCcccCCccccccccCCCccccceeeccCCCCCCCCCCCC
Confidence                   0111111111110001133444789999  8899999999999999999877 5555555444443333


No 2  
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=99.34  E-value=1.4e-12  Score=104.71  Aligned_cols=90  Identities=27%  Similarity=0.449  Sum_probs=58.7

Q ss_pred             ceeeCCCCCCCCCCCCCCEEEeeccCCCCceEeecCCCCCCCCCcccCCCCCcccccchhhhHHHHHhhhHHHHHHHHHH
Q psy12432         24 HTYACPPTYATWYCLNGATCFTVKIGESLLYNCECADGYMGQRCEFKDLDGSYLPSRKQVMLETASIASGASIAVFLVVI  103 (229)
Q Consensus        24 y~C~C~~gy~gnpClNGGtC~~~~~~~~~~~sC~C~~Gy~G~rCe~~~~~~~~~p~~~~~~~~~~~Ia~gI~V~lLLVli  103 (229)
                      .+-.|++.|+ +||+|| +|+.+...  ..+.|+|..||+|+|||+.++. .|.+..+.... +..|+..+++++|++++
T Consensus        41 ~i~~Cp~ey~-~YClHG-~C~yI~dl--~~~~CrC~~GYtGeRCEh~dLl-~~~~~~k~n~~-t~Yia~~~il~il~~i~  114 (139)
T PHA03099         41 AIRLCGPEGD-GYCLHG-DCIHARDI--DGMYCRCSHGYTGIRCQHVVLV-DYQRSEKPNTT-TSYIPSPGIVLVLVGII  114 (139)
T ss_pred             ccccCChhhC-CEeECC-EEEeeccC--CCceeECCCCcccccccceeee-eeeccccccch-hhhhhhhHHHHHHHHHH
Confidence            3789999996 899997 89988643  3799999999999999998753 34443333322 34555444455555444


Q ss_pred             HHHHHHHHHHHhhhhh
Q psy12432        104 LCFSLYVHCQRRKKQA  119 (229)
Q Consensus       104 i~vvly~~~kRrk~~k  119 (229)
                      +.+++++.++..||+|
T Consensus       115 is~~~~~~yr~~r~~~  130 (139)
T PHA03099        115 ITCCLLSVYRFTRRTK  130 (139)
T ss_pred             HHHHHHhhheeeeccc
Confidence            4444444444433333


No 3  
>PF02158 Neuregulin:  Neuregulin family;  InterPro: IPR002154 Neuregulins are a sub-family of EGF-like molecules that have been shown to play multiple essential roles in vertebrate embryogenesis including: cardiac development, Schwann cell and oligodendrocyte differentiation, some aspects of neuronal development, as well as the formation of neuromuscular synapses [, ]. Included in the family are heregulin; neu differentiation factor; acetylcholine receptor synthesis stimulator; glial growth factor; and sensory and motor-neuron derived factor []. Multiple family members are generated by alternate splicing or by use of several cell type-specific transcription initiation sites. In general, they bind to and activate the erbB family of receptor tyrosine kinases (erbB2 (HER2), erbB3 (HER3), and erbB4 (HER4)), functioning both as heterodimers and homodimers.  The transmembrane forms of neuregulin 1 (NRG1) are present within synaptic vesicles, including those containing glutamate []. After exocytosis, NRG1 is in the presynaptic membrane, where the ectodomain of NRG1 may be cleaved off. The ectodomain then migrates across the synaptic cleft and binds to and activates a member of the EGF-receptor family on the postsynaptic membrane. This has been shown to increase the expression of certain glutamate-receptor subunits. NRG1 appears to signal for glutamate-receptor subunit expression, localisation, and /or phosphorylation facilitating subsequent glutamate transmission.   The NRG1 gene has been identified as a potential gene determining susceptibility to schizophrenia by a combination of genetic linkage and association approaches []. ; GO: 0005102 receptor binding, 0009790 embryo development; PDB: 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=98.76  E-value=1.4e-09  Score=101.16  Aligned_cols=83  Identities=16%  Similarity=0.283  Sum_probs=0.4

Q ss_pred             ccchhhhHHHHHhhhHHHHHHHHHHHHHH-HHHHHHHhhhhhhh--cc-------c--cccCCCCCC--CCC-CCCCC--
Q psy12432         79 SRKQVMLETASIASGASIAVFLVVILCFS-LYVHCQRRKKQAQA--AS-------V--CCTDGPGSS--LQR-PRMPF--  141 (229)
Q Consensus        79 ~~~~~~~~~~~Ia~gI~V~lLLVlii~vv-ly~~~kRrk~~k~~--~~-------~--~~~d~~~~~--~~~-~~~~~--  141 (229)
                      ++++|++     |.||||+||||+|+|++ +||+.||++|+++.  .+       +  .+.+||++.  ..+ +++||  
T Consensus         5 YQkrVLT-----ITgIcvaLlVVGi~Cvv~aYCKTKKQRkklh~hLkqs~~~~~~n~~n~angp~~p~n~pp~e~vQlvn   79 (404)
T PF02158_consen    5 YQKRVLT-----ITGICVALLVVGIVCVVDAYCKTKKQRKKLHEHLKQSQRSKNYNLRNLANGPNHPRNPPPQEEVQLVN   79 (404)
T ss_dssp             S-------------------------------------------------------------------------------
T ss_pred             hhhhhhh-----hhhhhHHHHHHHHHHHHHHHHHhHHHHHHHHHHHhhhcccchhhhhhhhcCCCCCCCCCccccccccc
Confidence            4566665     88999999999999999 99999999888753  22       2  246777653  444 88886  


Q ss_pred             --ccCCCCcceeeeeccccCCCCCCCC
Q psy12432        142 --ERRPSPADFVLTRITTEAPRAADTR  166 (229)
Q Consensus       142 --~~~~~p~~~~~~~~~~e~~~~~~~~  166 (229)
                        .+|+++++++|++..+|..|.+...
T Consensus        80 ~y~SKn~~~t~~v~~~~tEtSFs~Sh~  106 (404)
T PF02158_consen   80 QYISKNVRHTEHVIEKETETSFSGSHS  106 (404)
T ss_dssp             ---------------------------
T ss_pred             cccccCCCCchhhhccccccccCCCCc
Confidence              6899999999999999999977553


No 4  
>PHA02887 EGF-like protein; Provisional
Probab=98.43  E-value=1.5e-07  Score=74.81  Aligned_cols=60  Identities=20%  Similarity=0.417  Sum_probs=48.0

Q ss_pred             CCCCCCCCCcCCCCCCceeeCCCCCCCCCCCCCCEEEeeccCCCCceEeecCCCCCCCCCcccC
Q psy12432          8 PKPRPPSPTPRPNITFHTYACPPTYATWYCLNGATCFTVKIGESLLYNCECADGYMGQRCEFKD   71 (229)
Q Consensus         8 ~~~~~~~~~~~~~~~gy~C~C~~gy~gnpClNGGtC~~~~~~~~~~~sC~C~~Gy~G~rCe~~~   71 (229)
                      ++-.+-.-+.+-.+.+|+-+|++.|. +||+| |+|......  ..+.|+|..||+|.||+...
T Consensus        66 y~en~~~~~~~rk~~~hf~pC~~eyk-~YCiH-G~C~yI~dL--~epsCrC~~GYtG~RCE~vs  125 (126)
T PHA02887         66 YKENANAQNFKRKNSMFFEKCKNDFN-DFCIN-GECMNIIDL--DEKFCICNKGYTGIRCDEVS  125 (126)
T ss_pred             hhccccccchhhccccCccccChHhh-CEeeC-CEEEccccC--CCceeECCCCcccCCCCccc
Confidence            44445555667777899999999995 89997 599987643  37999999999999999763


No 5  
>KOG1219|consensus
Probab=98.21  E-value=1.7e-06  Score=93.89  Aligned_cols=96  Identities=22%  Similarity=0.352  Sum_probs=65.3

Q ss_pred             CcCCCCCCceeeCCCCCCC-------------CCCCCCCEEEeeccCCCCceEeecCCCCCCCCCcccCCCCCcccccch
Q psy12432         16 TPRPNITFHTYACPPTYAT-------------WYCLNGATCFTVKIGESLLYNCECADGYMGQRCEFKDLDGSYLPSRKQ   82 (229)
Q Consensus        16 ~~~~~~~gy~C~C~~gy~g-------------npClNGGtC~~~~~~~~~~~sC~C~~Gy~G~rCe~~~~~~~~~p~~~~   82 (229)
                      |+.+...++.|+|+.||.|             |.|.|||.|.++.    +.|.|.|..||.|+.|...... .+..... 
T Consensus      3916 tCip~~n~f~CnC~~gyTG~~Ce~~Gi~eCs~n~C~~gg~C~n~~----gsf~CncT~g~~gr~c~~~~pn-i~~~~~~- 3989 (4289)
T KOG1219|consen 3916 TCIPFYNGFLCNCPNGYTGKRCEARGISECSKNVCGTGGQCINIP----GSFHCNCTPGILGRTCCAEKPN-ILSTVLW- 3989 (4289)
T ss_pred             EEEecCCCeeEeCCCCccCceeecccccccccccccCCceeeccC----CceEeccChhHhcccCccccCc-cccccch-
Confidence            6677778899999999996             6899999999975    3699999999999999865432 1111111 


Q ss_pred             hhh-HHHHHhhhHHHHHHHHHHHHHHHHHHHHHhhhhh
Q psy12432         83 VML-ETASIASGASIAVFLVVILCFSLYVHCQRRKKQA  119 (229)
Q Consensus        83 ~~~-~~~~Ia~gI~V~lLLVlii~vvly~~~kRrk~~k  119 (229)
                      +++ |..+|++ +++++++++. ++|+|.++.++|+..
T Consensus      3990 ~gkaEli~I~V-~l~~ifilvv-lf~~crKk~~rkky~ 4025 (4289)
T KOG1219|consen 3990 LGKAELIIIIV-LLALIFILVV-LFWKCRKKNSRKKYG 4025 (4289)
T ss_pred             hcccceeehhH-HHHHHHHHHH-HHHhhhhhccCCccc
Confidence            122 3444444 4444444444 667788877666554


No 6  
>KOG4289|consensus
Probab=97.87  E-value=1.5e-05  Score=84.15  Aligned_cols=53  Identities=28%  Similarity=0.654  Sum_probs=44.3

Q ss_pred             CCCceeeCCCCCCC------------CCCCCCCEEEeeccCCCCceEeecCCCCCCCCCcccCCCCCcc
Q psy12432         21 ITFHTYACPPTYAT------------WYCLNGATCFTVKIGESLLYNCECADGYMGQRCEFKDLDGSYL   77 (229)
Q Consensus        21 ~~gy~C~C~~gy~g------------npClNGGtC~~~~~~~~~~~sC~C~~Gy~G~rCe~~~~~~~~~   77 (229)
                      +.|..|+||+||.|            +||-|+|.|...+    ++|+|.|..||+|++||.....+..+
T Consensus      1219 vnglrCrCPpGFTgd~CeTeiDlCYs~pC~nng~C~srE----ggYtCeCrpg~tGehCEvs~~agrCv 1283 (2531)
T KOG4289|consen 1219 VNGLRCRCPPGFTGDYCETEIDLCYSGPCGNNGRCRSRE----GGYTCECRPGFTGEHCEVSARAGRCV 1283 (2531)
T ss_pred             cCceeEeCCCCCCcccccchhHhhhcCCCCCCCceEEec----CceeEEecCCccccceeeecccCccc
Confidence            34899999999997            4899999999875    37999999999999999876544433


No 7  
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.73  E-value=3e-05  Score=48.07  Aligned_cols=29  Identities=45%  Similarity=0.953  Sum_probs=25.3

Q ss_pred             CCCCCCCEEEeeccCCCCceEeecCCCCCCCC
Q psy12432         35 WYCLNGATCFTVKIGESLLYNCECADGYMGQR   66 (229)
Q Consensus        35 npClNGGtC~~~~~~~~~~~sC~C~~Gy~G~r   66 (229)
                      +||.|||+|....   ...|.|.|+.||.|.+
T Consensus         4 ~~C~n~g~C~~~~---~~~y~C~C~~G~~G~~   32 (32)
T PF00008_consen    4 NPCQNGGTCIDLP---GGGYTCECPPGYTGKR   32 (32)
T ss_dssp             TSSTTTEEEEEES---TSEEEEEEBTTEESTT
T ss_pred             CcCCCCeEEEeCC---CCCEEeECCCCCccCC
Confidence            6999999999875   2379999999999975


No 8  
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=96.76  E-value=0.0018  Score=40.10  Aligned_cols=30  Identities=47%  Similarity=1.022  Sum_probs=25.7

Q ss_pred             CCCCCCCEEEeeccCCCCceEeecCCCCC-CCCCc
Q psy12432         35 WYCLNGATCFTVKIGESLLYNCECADGYM-GQRCE   68 (229)
Q Consensus        35 npClNGGtC~~~~~~~~~~~sC~C~~Gy~-G~rCe   68 (229)
                      ++|.++|+|.+..    ..|.|.|+.||. |.+|+
T Consensus         9 ~~C~~~~~C~~~~----g~~~C~C~~g~~~g~~C~   39 (39)
T smart00179        9 NPCQNGGTCVNTV----GSYRCECPPGYTDGRNCE   39 (39)
T ss_pred             CCcCCCCEeECCC----CCeEeECCCCCccCCcCC
Confidence            5899999999764    369999999999 98885


No 9  
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=96.46  E-value=0.0037  Score=38.02  Aligned_cols=30  Identities=43%  Similarity=0.985  Sum_probs=25.4

Q ss_pred             CCCCCCCEEEeeccCCCCceEeecCCCCCCCCCc
Q psy12432         35 WYCLNGATCFTVKIGESLLYNCECADGYMGQRCE   68 (229)
Q Consensus        35 npClNGGtC~~~~~~~~~~~sC~C~~Gy~G~rCe   68 (229)
                      .+|.+++.|....    ..|.|.|..+|.|.+|+
T Consensus         9 ~~C~~~~~C~~~~----~~~~C~C~~g~~g~~C~   38 (38)
T cd00054           9 NPCQNGGTCVNTV----GSYRCSCPPGYTGRNCE   38 (38)
T ss_pred             CCcCCCCEeECCC----CCeEeECCCCCcCCcCC
Confidence            5799999998763    36999999999998885


No 10 
>KOG4289|consensus
Probab=96.22  E-value=0.0048  Score=66.13  Aligned_cols=52  Identities=29%  Similarity=0.577  Sum_probs=44.1

Q ss_pred             CcCCCCCCceeeCCCCCCC--------------CCCCCCCEEEeeccCCCCceEeecCCC-CCCCCCccc
Q psy12432         16 TPRPNITFHTYACPPTYAT--------------WYCLNGATCFTVKIGESLLYNCECADG-YMGQRCEFK   70 (229)
Q Consensus        16 ~~~~~~~gy~C~C~~gy~g--------------npClNGGtC~~~~~~~~~~~sC~C~~G-y~G~rCe~~   70 (229)
                      +++..-.||.|.|.+||.|              .+|.|||+|.+..   ..++-|.|+.| |.+.+|+..
T Consensus      1252 ~C~srEggYtCeCrpg~tGehCEvs~~agrCvpGvC~nggtC~~~~---nggf~c~Cp~ge~e~prC~v~ 1318 (2531)
T KOG4289|consen 1252 RCRSREGGYTCECRPGFTGEHCEVSARAGRCVPGVCKNGGTCVNLL---NGGFCCHCPYGEFEDPRCEVT 1318 (2531)
T ss_pred             ceEEecCceeEEecCCccccceeeecccCccccceecCCCEEeecC---CCceeccCCCcccCCCceEEE
Confidence            5666667899999999997              4899999999875   34799999997 999999975


No 11 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=96.14  E-value=0.008  Score=35.86  Aligned_cols=30  Identities=43%  Similarity=0.956  Sum_probs=25.4

Q ss_pred             CCCCCCCEEEeeccCCCCceEeecCCCCCCC-CCc
Q psy12432         35 WYCLNGATCFTVKIGESLLYNCECADGYMGQ-RCE   68 (229)
Q Consensus        35 npClNGGtC~~~~~~~~~~~sC~C~~Gy~G~-rCe   68 (229)
                      ++|.+++.|....    ..|.|.|+.||.|. .|+
T Consensus         6 ~~C~~~~~C~~~~----~~~~C~C~~g~~g~~~C~   36 (36)
T cd00053           6 NPCSNGGTCVNTP----GSYRCVCPPGYTGDRSCE   36 (36)
T ss_pred             CCCCCCCEEecCC----CCeEeECCCCCcccCCcC
Confidence            6899999999764    36999999999998 774


No 12 
>PF12955 DUF3844:  Domain of unknown function (DUF3844);  InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=96.01  E-value=0.018  Score=45.17  Aligned_cols=38  Identities=21%  Similarity=0.490  Sum_probs=28.0

Q ss_pred             CCCCCCCEEEeeccCC-CCceEeecCC-------------CCCCCCCcccCC
Q psy12432         35 WYCLNGATCFTVKIGE-SLLYNCECAD-------------GYMGQRCEFKDL   72 (229)
Q Consensus        35 npClNGGtC~~~~~~~-~~~~sC~C~~-------------Gy~G~rCe~~~~   72 (229)
                      |-|.+.|.|....... ..=|.|.|..             .|.|.-|+.++.
T Consensus        13 n~CsgHG~C~~~~~~~~~~C~~C~C~~T~~~~~~~~~ktt~W~G~aCqKkDv   64 (103)
T PF12955_consen   13 NNCSGHGSCVKKYGSGGGDCFACKCKPTVVKTGSGKGKTTHWGGPACQKKDV   64 (103)
T ss_pred             cCCCCCceEeeccCCCccceEEEEeeccccccccccCceeeecccccccccc
Confidence            6899999998763211 1249999988             488888998875


No 13 
>PF02439 Adeno_E3_CR2:  Adenovirus E3 region protein CR2;  InterPro: IPR003470 Early region 3 (E3) of human adenoviruses (Ads) codes for proteins that appear to control viral interactions with the host []. This region called CR1 (conserved region 1) [] is found three times in Human adenovirus 19 (a subgroup D adenovirus) 49 kDa protein in the E3 region. CR1 is also found in the 20.1 Kd protein of subgroup B adenoviruses. The function of this 80 amino acid region is unknown. This region is probably a divergent immunoglobulin domain.
Probab=95.78  E-value=0.014  Score=37.87  Aligned_cols=21  Identities=19%  Similarity=0.403  Sum_probs=11.8

Q ss_pred             HHHHHhhhHHHHHHHHHHHHH
Q psy12432         86 ETASIASGASIAVFLVVILCF  106 (229)
Q Consensus        86 ~~~~Ia~gI~V~lLLVlii~v  106 (229)
                      ++.+|++|+.+++.+++|..+
T Consensus         4 s~IaIIv~V~vg~~iiii~~~   24 (38)
T PF02439_consen    4 STIAIIVAVVVGMAIIIICMF   24 (38)
T ss_pred             chhhHHHHHHHHHHHHHHHHH
Confidence            356667666666554444443


No 14 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=95.19  E-value=0.035  Score=33.77  Aligned_cols=29  Identities=48%  Similarity=1.078  Sum_probs=23.9

Q ss_pred             CCCCCCCEEEeeccCCCCceEeecCCCCCC-CCCc
Q psy12432         35 WYCLNGATCFTVKIGESLLYNCECADGYMG-QRCE   68 (229)
Q Consensus        35 npClNGGtC~~~~~~~~~~~sC~C~~Gy~G-~rCe   68 (229)
                      ++|.++ .|....    ..|.|.|..||.| ..|+
T Consensus         6 ~~C~~~-~C~~~~----~~~~C~C~~g~~g~~~C~   35 (35)
T smart00181        6 GPCSNG-TCINTP----GSYTCSCPPGYTGDKRCE   35 (35)
T ss_pred             CCCCCC-EEECCC----CCeEeECCCCCccCCccC
Confidence            589998 998763    3799999999999 7774


No 15 
>PF14575 EphA2_TM:  Ephrin type-A receptor 2 transmembrane domain; PDB: 3KUL_A 2XVD_A 2VX1_A 2VWV_A 2VX0_A 2VWY_A 2VWZ_A 2VWW_A 2VWU_A 2VWX_A ....
Probab=95.17  E-value=0.0094  Score=43.98  Aligned_cols=12  Identities=33%  Similarity=0.382  Sum_probs=7.6

Q ss_pred             eeeccccCCCCC
Q psy12432        152 LTRITTEAPRAA  163 (229)
Q Consensus       152 ~~~~~~e~~~~~  163 (229)
                      +=+++||||-++
T Consensus        54 IDP~TYEDP~qA   65 (75)
T PF14575_consen   54 IDPHTYEDPNQA   65 (75)
T ss_dssp             --GGGSSSHHHH
T ss_pred             cCcccccCHHHH
Confidence            557889998543


No 16 
>PF01102 Glycophorin_A:  Glycophorin A;  InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Glycophorin A (PAS-2) and glycophorin B (PAS-3) belong to the MNS blood group system and are associated with antigens that include M/N, S/s, U, He, Mi(a), M(c), Vw, Mur, M(g), Vr, M(e), Mt(a), St(a), Ri(a), Cl(a), Ny(a), Hut, Hil, M(v), Far, Mit, Dantu, Hop, Nob, En(a), ENKT, amongst others. Glycophorin A is the major sialoglycoprotein of the erythrocyte membrane []. Structurally, glycophorin A consists of an N-terminal extracellular domain, heavily glycosylated on serine and threonine residues, followed by a transmembrane region and a C-terminal cytoplasmic domain. Other glycophorins in this entry such as Glycophorin B and Glycophorin E represent minor sialoglycoproteins in the erythrocyte membrane.; GO: 0016021 integral to membrane; PDB: 2KPF_B 1AFO_B 2KPE_A.
Probab=94.78  E-value=0.036  Score=44.72  Aligned_cols=29  Identities=28%  Similarity=0.250  Sum_probs=13.5

Q ss_pred             HHHHhhhHHHHHHHHHHHHHHHHHHHHHhhh
Q psy12432         87 TASIASGASIAVFLVVILCFSLYVHCQRRKK  117 (229)
Q Consensus        87 ~~~Ia~gI~V~lLLVlii~vvly~~~kRrk~  117 (229)
                      +++|++|+..++++  ++++++|+.+++|||
T Consensus        66 i~~Ii~gv~aGvIg--~Illi~y~irR~~Kk   94 (122)
T PF01102_consen   66 IIGIIFGVMAGVIG--IILLISYCIRRLRKK   94 (122)
T ss_dssp             HHHHHHHHHHHHHH--HHHHHHHHHHHHS--
T ss_pred             eeehhHHHHHHHHH--HHHHHHHHHHHHhcc
Confidence            45566666555533  333445554444444


No 17 
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=94.69  E-value=0.049  Score=33.83  Aligned_cols=27  Identities=33%  Similarity=0.813  Sum_probs=23.2

Q ss_pred             CCCCCCCEEEeeccCCCCceEeecCCCCCCCCC
Q psy12432         35 WYCLNGATCFTVKIGESLLYNCECADGYMGQRC   67 (229)
Q Consensus        35 npClNGGtC~~~~~~~~~~~sC~C~~Gy~G~rC   67 (229)
                      ..|.+.|+|...      ...|.|..||.|..|
T Consensus         6 ~~C~~~G~C~~~------~g~C~C~~g~~G~~C   32 (32)
T PF07974_consen    6 NICSGHGTCVSP------CGRCVCDSGYTGPDC   32 (32)
T ss_pred             CccCCCCEEeCC------CCEEECCCCCcCCCC
Confidence            579999999853      379999999999887


No 18 
>PF12661 hEGF:  Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=94.49  E-value=0.017  Score=29.06  Aligned_cols=13  Identities=38%  Similarity=1.359  Sum_probs=10.0

Q ss_pred             EeecCCCCCCCCC
Q psy12432         55 NCECADGYMGQRC   67 (229)
Q Consensus        55 sC~C~~Gy~G~rC   67 (229)
                      .|.|+.||+|.+|
T Consensus         1 ~C~C~~G~~G~~C   13 (13)
T PF12661_consen    1 TCQCPPGWTGPNC   13 (13)
T ss_dssp             EEEE-TTEETTTT
T ss_pred             CccCcCCCcCCCC
Confidence            4888999998887


No 19 
>PF14991 MLANA:  Protein melan-A; PDB: 2GTZ_F 2GT9_F 3MRO_P 2GUO_C 3MRQ_P 2GTW_C 3L6F_C 3MRP_P.
Probab=94.01  E-value=0.0097  Score=47.35  Aligned_cols=31  Identities=26%  Similarity=0.346  Sum_probs=2.1

Q ss_pred             hhhHHHHHHHHHHHHHHHHHHHHHhhhhhhh
Q psy12432         91 ASGASIAVFLVVILCFSLYVHCQRRKKQAQA  121 (229)
Q Consensus        91 a~gI~V~lLLVlii~vvly~~~kRrk~~k~~  121 (229)
                      |+||+++++|+.|++++-||++|||..=|.+
T Consensus        26 AaGIGiL~VILgiLLliGCWYckRRSGYk~L   56 (118)
T PF14991_consen   26 AAGIGILIVILGILLLIGCWYCKRRSGYKTL   56 (118)
T ss_dssp             --SSS--------------------------
T ss_pred             hccceeHHHHHHHHHHHhheeeeecchhhhh
Confidence            6677777777777777777777777555544


No 20 
>PF12273 RCR:  Chitin synthesis regulation, resistance to Congo red;  InterPro: IPR020999  RCR proteins are ER membrane proteins that regulate chitin deposition in fungal cell walls. Although chitin, a linear polymer of beta-1,4-linked N-acetylglucosamine, constitutes only 2% of the cell wall it plays a vital role in the overall protection of the cell wall against stress, noxious chemicals and osmotic pressure changes. Congo red is a cell wall-disrupting benzidine-type dye extensively used in many cell wall mutant studies that specifically targets chitin in yeast cells and inhibits growth. RCR proteins render the yeasts resistant to Congo red by diminishing the content of chitin in the cell wall []. RCR proteins are probably regulating chitin synthase III interact directly with ubiquitin ligase Rsp5, and the VPEY motif is necessary for this, via interaction with the WW domains of Rsp5 []. 
Probab=93.98  E-value=0.052  Score=43.55  Aligned_cols=6  Identities=17%  Similarity=0.479  Sum_probs=3.1

Q ss_pred             ccCCCC
Q psy12432        171 ITGKGD  176 (229)
Q Consensus       171 ~~~~g~  176 (229)
                      ||..|.
T Consensus        78 Yd~~g~   83 (130)
T PF12273_consen   78 YDQQGN   83 (130)
T ss_pred             CCCCCC
Confidence            555543


No 21 
>PF05454 DAG1:  Dystroglycan (Dystrophin-associated glycoprotein 1);  InterPro: IPR008465 Dystroglycan is one of the dystrophin-associated glycoproteins, which is encoded by a 5.5 kb transcript in Homo sapiens. The protein product is cleaved into two non-covalently associated subunits, [alpha] (N-terminal) and [beta] (C-terminal). In skeletal muscle the dystroglycan complex works as a transmembrane linkage between the extracellular matrix and the cytoskeleton [alpha]-dystroglycan is extracellular and binds to merosin ([alpha]-2 laminin) in the basement membrane, while [beta]-dystroglycan is a transmembrane protein and binds to dystrophin, which is a large rod-like cytoskeletal protein, absent in Duchenne muscular dystrophy patients. Dystrophin binds to intracellular actin cables. In this way, the dystroglycan complex, which links the extracellular matrix to the intracellular actin cables, is thought to provide structural integrity in muscle tissues. The dystroglycan complex is also known to serve as an agrin receptor in muscle, where it may regulate agrin-induced acetylcholine receptor clustering at the neuromuscular junction. There is also evidence which suggests the function of dystroglycan as a part of the signal transduction pathway because it is shown that Grb2, a mediator of the Ras-related signal pathway, can interact with the cytoplasmic domain of dystroglycan. In general, aberrant expression of dystrophin-associated protein complex underlies the pathogenesis of Duchenne muscular dystrophy, Becker muscular dystrophy and severe childhood autosomal recessive muscular dystrophy. Interestingly, no genetic disease has been described for either [alpha]- or [beta]-dystroglycan. Dystroglycan is widely distributed in non-muscle tissues as well as in muscle tissues. During epithelial morphogenesis of kidney, the dystroglycan complex is shown to act as a receptor for the basement membrane. Dystroglycan expression in Mus musculus brain and neural retina has also been reported. However, the physiological role of dystroglycan in non-muscle tissues has remained unclear [].; PDB: 1EG4_P.
Probab=93.43  E-value=0.022  Score=52.17  Aligned_cols=10  Identities=30%  Similarity=0.477  Sum_probs=0.0

Q ss_pred             CccCCCCcce
Q psy12432        141 FERRPSPADF  150 (229)
Q Consensus       141 ~~~~~~p~~~  150 (229)
                      |.+|.+|.-+
T Consensus       186 f~~KGiPvIF  195 (290)
T PF05454_consen  186 FISKGIPVIF  195 (290)
T ss_dssp             ----------
T ss_pred             HHhcCCceec
Confidence            6666666543


No 22 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=93.14  E-value=0.17  Score=41.31  Aligned_cols=22  Identities=18%  Similarity=0.259  Sum_probs=17.7

Q ss_pred             CCCCCceeeCCCCCCCCCCCCC
Q psy12432         19 PNITFHTYACPPTYATWYCLNG   40 (229)
Q Consensus        19 ~~~~gy~C~C~~gy~gnpClNG   40 (229)
                      ++...+.|.|..||.|..|++.
T Consensus        62 ~dl~~~~CrC~~GYtGeRCEh~   83 (139)
T PHA03099         62 RDIDGMYCRCSHGYTGIRCQHV   83 (139)
T ss_pred             ccCCCceeECCCCcccccccce
Confidence            3455789999999999888764


No 23 
>KOG1214|consensus
Probab=92.96  E-value=0.096  Score=54.07  Aligned_cols=46  Identities=28%  Similarity=0.680  Sum_probs=36.8

Q ss_pred             CCCceeeCCCCCCC-------------CCCCCCCEEEeeccCCCCceEeecCCCCCCC--CCccc
Q psy12432         21 ITFHTYACPPTYAT-------------WYCLNGATCFTVKIGESLLYNCECADGYMGQ--RCEFK   70 (229)
Q Consensus        21 ~~gy~C~C~~gy~g-------------npClNGGtC~~~~~~~~~~~sC~C~~Gy~G~--rCe~~   70 (229)
                      ...|+|.|.+||+|             +.|.....|++..    +++.|+|..||.|.  +|...
T Consensus       806 gs~y~C~CLPGfsGDG~~c~dvDeC~psrChp~A~Cyntp----gsfsC~C~pGy~GDGf~CVP~  866 (1289)
T KOG1214|consen  806 GSTYSCACLPGFSGDGHQCTDVDECSPSRCHPAATCYNTP----GSFSCRCQPGYYGDGFQCVPD  866 (1289)
T ss_pred             CceEEEeecCCccCCccccccccccCccccCCCceEecCC----CcceeecccCccCCCceecCC
Confidence            45589999999997             3677788999864    47999999999987  45544


No 24 
>PF13908 Shisa:  Wnt and FGF inhibitory regulator
Probab=92.61  E-value=0.26  Score=41.39  Aligned_cols=14  Identities=21%  Similarity=0.330  Sum_probs=7.3

Q ss_pred             HHHhhhHHHHHHHH
Q psy12432         88 ASIASGASIAVFLV  101 (229)
Q Consensus        88 ~~Ia~gI~V~lLLV  101 (229)
                      +.|++||+++|++|
T Consensus        78 ~~iivgvi~~Vi~I   91 (179)
T PF13908_consen   78 TGIIVGVICGVIAI   91 (179)
T ss_pred             eeeeeehhhHHHHH
Confidence            44566665544433


No 25 
>PF01034 Syndecan:  Syndecan domain;  InterPro: IPR001050 The syndecans are transmembrane proteoglycans which are involved in the organisation of cytoskeleton and/or actin microfilaments, and have important roles as cell surface receptors during cell-cell and/or cell-matrix interactions [, ]. Structurally, these proteins consist of four separate domains:   A signal sequence; An extracellular domain (ectodomain) of variable length whose sequence is not evolutionary conserved in the various forms of syndecans. The ectodomain contains the sites of attachment of the heparan sulphate glycosaminoglycan side chains;  A transmembrane region;  A highly conserved cytoplasmic domain of about 30 to 35 residues, which could interact with cytoskeletal proteins.    The proteins known to belong to this family are:    Syndecan 1.  Syndecan 2 or fibroglycan.  Syndecan 3 or neuroglycan or N-syndecan.  Syndecan 4 or amphiglycan or ryudocan.  Drosophila syndecan.   Caenorhabditis elegans probable syndecan (F57C7.3).    Syndecan-4, a transmembrane heparan sulphate proteoglycan, is a coreceptor with integrins in cell adhesion. It has been suggested to form a ternary signalling complex with protein kinase Calpha and phosphatidylinositol 4,5-bisphosphate (PIP2). Structural studies have demonstrated that the cytoplasmic domain undergoes a conformational transition and forms a symmetric dimer in the presence of phospholipid activator PIP2, and whose overall structure in solution exhibits a twisted clamp shape having a cavity in the centre of dimeric interface. In addition, it has been observed that the syndecan-4 variable domain interacts, strongly, not only with fatty acyl groups but also the anionic head group of PIP2. These findings indicate that PIP2 promotes oligomerisation of the syndecan-4 cytoplasmic domain for transmembrane signalling and cell-matrix adhesion [, ].; GO: 0008092 cytoskeletal protein binding, 0016020 membrane; PDB: 1EJQ_B 1EJP_B 1YBO_C 1OBY_Q.
Probab=91.33  E-value=0.051  Score=39.11  Aligned_cols=19  Identities=16%  Similarity=0.585  Sum_probs=0.0

Q ss_pred             HHHhhhHHHHHHHHHHHHH
Q psy12432         88 ASIASGASIAVFLVVILCF  106 (229)
Q Consensus        88 ~~Ia~gI~V~lLLVlii~v  106 (229)
                      +++++|+++++++++++++
T Consensus        12 aavIaG~Vvgll~ailLIl   30 (64)
T PF01034_consen   12 AAVIAGGVVGLLFAILLIL   30 (64)
T ss_dssp             -------------------
T ss_pred             HHHHHHHHHHHHHHHHHHH
Confidence            4555555566554444443


No 26 
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=89.96  E-value=0.37  Score=31.10  Aligned_cols=31  Identities=29%  Similarity=0.743  Sum_probs=24.4

Q ss_pred             eCCCCCCCCCCCCCCEEEeeccCCCCceEeecCCCCC
Q psy12432         27 ACPPTYATWYCLNGATCFTVKIGESLLYNCECADGYM   63 (229)
Q Consensus        27 ~C~~gy~gnpClNGGtC~~~~~~~~~~~sC~C~~Gy~   63 (229)
                      .|..+-  +.|.+++.|.+..    ++|.|.|+.||.
T Consensus         4 EC~~~~--~~C~~~~~C~N~~----Gsy~C~C~~Gy~   34 (42)
T PF07645_consen    4 ECAEGP--HNCPENGTCVNTE----GSYSCSCPPGYE   34 (42)
T ss_dssp             TTTTTS--SSSSTTSEEEEET----TEEEEEESTTEE
T ss_pred             ccCCCC--CcCCCCCEEEcCC----CCEEeeCCCCcE
Confidence            355532  6898899999985    379999999997


No 27 
>PF05393 Hum_adeno_E3A:  Human adenovirus early E3A glycoprotein;  InterPro: IPR008652 This family consists of several early glycoproteins (E3A), from human adenovirus type 2.; GO: 0016021 integral to membrane
Probab=89.21  E-value=0.5  Score=36.16  Aligned_cols=15  Identities=7%  Similarity=0.498  Sum_probs=7.3

Q ss_pred             HHHHHHHHHHHHHHH
Q psy12432         96 IAVFLVVILCFSLYV  110 (229)
Q Consensus        96 V~lLLVlii~vvly~  110 (229)
                      .++|+++++++++||
T Consensus        41 ~~iFil~VilwfvCC   55 (94)
T PF05393_consen   41 CGIFILLVILWFVCC   55 (94)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            344455555554444


No 28 
>PF15069 FAM163:  FAM163 family
Probab=89.03  E-value=1.4  Score=36.52  Aligned_cols=24  Identities=25%  Similarity=0.418  Sum_probs=14.1

Q ss_pred             HHHhhhHHHHHHHHHHHHHHHHHH
Q psy12432         88 ASIASGASIAVFLVVILCFSLYVH  111 (229)
Q Consensus        88 ~~Ia~gI~V~lLLVlii~vvly~~  111 (229)
                      ++|.+||...|+||.||+++.||+
T Consensus         6 vVItGgILAtVILLcIIaVLCYCR   29 (143)
T PF15069_consen    6 VVITGGILATVILLCIIAVLCYCR   29 (143)
T ss_pred             EEEechHHHHHHHHHHHHHHHHHh
Confidence            445566655565666666666655


No 29 
>PF04478 Mid2:  Mid2 like cell wall stress sensor;  InterPro: IPR007567 This family represents a region near the C terminus of Mid2, which contains a transmembrane region. The remainder of the protein sequence is serine-rich and of low complexity, and is therefore impossible to align accurately. Mid2 is thought to act as a mechanosensor of cell wall stress. The C-terminal cytoplasmic region of Mid2 is known to interact with Rom2, a guanine nucleotide exchange factor (GEF) for Rho1, which is part of the cell wall integrity signalling pathway [].
Probab=88.76  E-value=0.025  Score=47.20  Aligned_cols=26  Identities=27%  Similarity=0.498  Sum_probs=11.3

Q ss_pred             hhhHHHHHHHHHHHHHHHHHHHHHhhhh
Q psy12432         91 ASGASIAVFLVVILCFSLYVHCQRRKKQ  118 (229)
Q Consensus        91 a~gI~V~lLLVlii~vvly~~~kRrk~~  118 (229)
                      ++||++++||  ++++++|+++.|+|+.
T Consensus        55 VVGVGg~ill--~il~lvf~~c~r~kkt   80 (154)
T PF04478_consen   55 VVGVGGPILL--GILALVFIFCIRRKKT   80 (154)
T ss_pred             EecccHHHHH--HHHHhheeEEEecccC
Confidence            4444444443  3333444444444443


No 30 
>PF07204 Orthoreo_P10:  Orthoreovirus membrane fusion protein p10;  InterPro: IPR009854 This family consists of several Orthoreovirus membrane fusion protein p10 sequences. p10 is thought to be a multifunctional protein that plays a key role in virus-host interaction [].
Probab=88.70  E-value=0.42  Score=36.93  Aligned_cols=32  Identities=13%  Similarity=0.364  Sum_probs=13.8

Q ss_pred             HHHHhhhHHHHHHHHHHHHHHHHHHHHHhhhhh
Q psy12432         87 TASIASGASIAVFLVVILCFSLYVHCQRRKKQA  119 (229)
Q Consensus        87 ~~~Ia~gI~V~lLLVlii~vvly~~~kRrk~~k  119 (229)
                      |-+++.|..++| +|+|+++++||+.|+|..++
T Consensus        42 WpyLA~GGG~iL-ilIii~Lv~CC~~K~K~~~~   73 (98)
T PF07204_consen   42 WPYLAAGGGLIL-ILIIIALVCCCRAKHKTSAA   73 (98)
T ss_pred             hHHhhccchhhh-HHHHHHHHHHhhhhhhhHhh
Confidence            445555543333 33334444444444443443


No 31 
>KOG2052|consensus
Probab=88.46  E-value=1.3  Score=43.30  Aligned_cols=32  Identities=16%  Similarity=0.438  Sum_probs=20.9

Q ss_pred             CcCCCCCCceeeCCCCCCC--CCCCCCCEEEeec
Q psy12432         16 TPRPNITFHTYACPPTYAT--WYCLNGATCFTVK   47 (229)
Q Consensus        16 ~~~~~~~gy~C~C~~gy~g--npClNGGtC~~~~   47 (229)
                      ...++..-+.|.|......  +-|.+||.|+...
T Consensus        27 ~~~~~~~~~~C~C~~~~~~n~~tC~t~g~C~~s~   60 (513)
T KOG2052|consen   27 SVPPGVQLLKCYCSSCLCSNANTCETDGACFVSV   60 (513)
T ss_pred             CCCCCceeEEEEecCCCcCCCCeeeecCeEEEEE
Confidence            3344444688888765443  4688899998653


No 32 
>PF12662 cEGF:  Complement Clr-like EGF-like
Probab=88.40  E-value=0.35  Score=28.28  Aligned_cols=21  Identities=33%  Similarity=0.649  Sum_probs=12.8

Q ss_pred             CceeeCCCCCCCCCCCCCCEEEe
Q psy12432         23 FHTYACPPTYATWYCLNGATCFT   45 (229)
Q Consensus        23 gy~C~C~~gy~gnpClNGGtC~~   45 (229)
                      +|.|.|++||..++  +|-.|..
T Consensus         1 sy~C~C~~Gy~l~~--d~~~C~D   21 (24)
T PF12662_consen    1 SYTCSCPPGYQLSP--DGRSCED   21 (24)
T ss_pred             CEEeeCCCCCcCCC--CCCcccc
Confidence            47788888875332  4555554


No 33 
>PF08374 Protocadherin:  Protocadherin;  InterPro: IPR013585 The structure of protocadherins is similar to that of classic cadherins (IPR002126 from INTERPRO), but they also have some unique features associated with the cytoplasmic domains. They are expressed in a variety of organisms and are found in high concentrations in the brain where they seem to be localised mainly at cell-cell contact sites. Their expression seems to be developmentally regulated []. 
Probab=87.40  E-value=1.4  Score=38.93  Aligned_cols=10  Identities=30%  Similarity=0.604  Sum_probs=7.1

Q ss_pred             ccccCCCCCC
Q psy12432        183 LYHQSSPPLP  192 (229)
Q Consensus       183 ~f~~~~~~~~  192 (229)
                      +|.+.+|...
T Consensus       146 ~~~P~sPDLa  155 (221)
T PF08374_consen  146 TFKPDSPDLA  155 (221)
T ss_pred             CCCCCCcchh
Confidence            7777777654


No 34 
>KOG3516|consensus
Probab=87.02  E-value=0.42  Score=50.96  Aligned_cols=40  Identities=28%  Similarity=0.660  Sum_probs=33.0

Q ss_pred             CceeeCCCCCCCCCCCCCCEEEeeccCCCCceEeecC-CCCCCCCCccc
Q psy12432         23 FHTYACPPTYATWYCLNGATCFTVKIGESLLYNCECA-DGYMGQRCEFK   70 (229)
Q Consensus        23 gy~C~C~~gy~gnpClNGGtC~~~~~~~~~~~sC~C~-~Gy~G~rCe~~   70 (229)
                      +..|+|.+    |+|.|||.|.-.  ++  .+.|.|. .||.|..|...
T Consensus       543 ~i~drClP----N~CehgG~C~Qs--~~--~f~C~C~~TGY~GatCHts  583 (1306)
T KOG3516|consen  543 GISDRCLP----NPCEHGGKCSQS--WD--DFECNCELTGYKGATCHTS  583 (1306)
T ss_pred             ccccccCC----ccccCCCccccc--cc--ceeEeccccccccccccCC
Confidence            35789988    899999999873  22  6999999 79999988754


No 35 
>PF12877 DUF3827:  Domain of unknown function (DUF3827);  InterPro: IPR024606 The function of the proteins in this entry is not currently known, but one of the human proteins (Q9HCM3 from SWISSPROT) has been implicated in pilocytic astrocytomas [, , ]. In the majority of cases of pilocytic astrocytomas a tandem duplication produces an in-frame fusion of the gene encoding this protein and the BRAF oncogene. The resulting fusion protein has constitutive BRAF kinase activity and is capable of transforming cells. 
Probab=86.22  E-value=0.44  Score=47.98  Aligned_cols=29  Identities=28%  Similarity=0.508  Sum_probs=18.0

Q ss_pred             HhhhHHHHHHHHHHHHHHHHHHHHHhhhh
Q psy12432         90 IASGASIAVFLVVILCFSLYVHCQRRKKQ  118 (229)
Q Consensus        90 Ia~gI~V~lLLVlii~vvly~~~kRrk~~  118 (229)
                      |++|+++.|+||++|++++||+.-|++|.
T Consensus       271 II~gVlvPv~vV~~Iiiil~~~LCRk~K~  299 (684)
T PF12877_consen  271 IIAGVLVPVLVVLLIIIILYWKLCRKNKL  299 (684)
T ss_pred             EEehHhHHHHHHHHHHHHHHHHHhccccc
Confidence            34456677777777777777665555443


No 36 
>KOG3514|consensus
Probab=86.20  E-value=0.41  Score=50.73  Aligned_cols=34  Identities=29%  Similarity=0.700  Sum_probs=28.7

Q ss_pred             CCCCCCCCEEEeeccCCCCceEeecCC-CCCCCCCcccC
Q psy12432         34 TWYCLNGATCFTVKIGESLLYNCECAD-GYMGQRCEFKD   71 (229)
Q Consensus        34 gnpClNGGtC~~~~~~~~~~~sC~C~~-Gy~G~rCe~~~   71 (229)
                      +|||+|||.|...  .  +.+.|.|.. +|.|..||...
T Consensus       628 ~nPC~N~g~C~eg--w--NrfiCDCs~T~~~G~~CerE~  662 (1591)
T KOG3514|consen  628 SNPCQNGGKCSEG--W--NRFICDCSGTGFEGRTCEREA  662 (1591)
T ss_pred             CCcccCCCCcccc--c--cccccccccCcccCcccccee
Confidence            4899999999874  3  389999987 79999999753


No 37 
>PF00558 Vpu:  Vpu protein;  InterPro: IPR008187 The Human immunodeficiency virus 1 (HIV-1) Vpu protein acts in the degradation of CD4 in the endoplasmic reticulum and in the enhancement of virion release from the plasma membrane of infected cells [].; GO: 0019076 release of virus from host; PDB: 2JPX_A 1PI8_A 2GOH_A 2GOF_A 1PI7_A 1PJE_A 1VPU_A 2K7Y_A.
Probab=85.42  E-value=0.97  Score=34.03  Aligned_cols=26  Identities=15%  Similarity=0.466  Sum_probs=5.5

Q ss_pred             HHHHHHHHHHHHHHHHHHHHhhhhhh
Q psy12432         95 SIAVFLVVILCFSLYVHCQRRKKQAQ  120 (229)
Q Consensus        95 ~V~lLLVlii~vvly~~~kRrk~~k~  120 (229)
                      +|++++.+++..++|..++|.|++++
T Consensus        13 iv~~iiaIvvW~iv~ieYrk~~rqrk   38 (81)
T PF00558_consen   13 IVALIIAIVVWTIVYIEYRKIKRQRK   38 (81)
T ss_dssp             HHHHHHHHHHHHHH------------
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence            33333333333344555665555554


No 38 
>KOG3607|consensus
Probab=84.81  E-value=1.4  Score=45.26  Aligned_cols=29  Identities=24%  Similarity=0.587  Sum_probs=23.3

Q ss_pred             CCCCCEEEeeccCCCCceEeecCCCCCCCCCcccCC
Q psy12432         37 CLNGATCFTVKIGESLLYNCECADGYMGQRCEFKDL   72 (229)
Q Consensus        37 ClNGGtC~~~~~~~~~~~sC~C~~Gy~G~rCe~~~~   72 (229)
                      |...|+|.+       ...|.|..||.+..|+....
T Consensus       632 C~g~GVCnn-------~~~ChC~~gwapp~C~~~~~  660 (716)
T KOG3607|consen  632 CNGHGVCNN-------ELNCHCEPGWAPPFCFIFGY  660 (716)
T ss_pred             cCCCcccCC-------CcceeeCCCCCCCccccccC
Confidence            666677754       58999999999999998754


No 39 
>PF15330 SIT:  SHP2-interacting transmembrane adaptor protein, SIT
Probab=84.74  E-value=1.1  Score=35.21  Aligned_cols=7  Identities=14%  Similarity=-0.078  Sum_probs=2.7

Q ss_pred             HHHHHHH
Q psy12432        107 SLYVHCQ  113 (229)
Q Consensus       107 vly~~~k  113 (229)
                      ++.|+.+
T Consensus        19 l~~wr~~   25 (107)
T PF15330_consen   19 LLAWRMK   25 (107)
T ss_pred             HHHHHHH
Confidence            3334333


No 40 
>PF05545 FixQ:  Cbb3-type cytochrome oxidase component FixQ;  InterPro: IPR008621 This family consists of several Cbb3-type cytochrome oxidase components (FixQ/CcoQ). FixQ is found in nitrogen fixing bacteria. Since nitrogen fixation is an energy-consuming process, effective symbioses depend on operation of a respiratory chain with a high affinity for O2, closely coupled to ATP production. This requirement is fulfilled by a special three-subunit terminal oxidase (cytochrome terminal oxidase cbb3), which was first identified in Bradyrhizobium japonicum as the product of the fixNOQP operon [].
Probab=84.17  E-value=2.6  Score=28.14  Aligned_cols=22  Identities=14%  Similarity=0.345  Sum_probs=10.1

Q ss_pred             hHHHHHHHHHHHHHHHHHHHHH
Q psy12432         93 GASIAVFLVVILCFSLYVHCQR  114 (229)
Q Consensus        93 gI~V~lLLVlii~vvly~~~kR  114 (229)
                      ++..++++++++.+++|.+++|
T Consensus        12 ~~~~v~~~~~F~gi~~w~~~~~   33 (49)
T PF05545_consen   12 SIGTVLFFVFFIGIVIWAYRPR   33 (49)
T ss_pred             HHHHHHHHHHHHHHHHHHHccc
Confidence            3344444444444445444443


No 41 
>KOG1225|consensus
Probab=83.29  E-value=1.3  Score=43.91  Aligned_cols=38  Identities=29%  Similarity=0.763  Sum_probs=22.1

Q ss_pred             eeeCCCCCCCC---------CCCCCCEEEeeccCCCCceEeecCCCCCCCCCccc
Q psy12432         25 TYACPPTYATW---------YCLNGATCFTVKIGESLLYNCECADGYMGQRCEFK   70 (229)
Q Consensus        25 ~C~C~~gy~gn---------pClNGGtC~~~~~~~~~~~sC~C~~Gy~G~rCe~~   70 (229)
                      .|.|++||.|.         .|.+.|-|..        -.|.|..||.|.-|+..
T Consensus       297 ~CiC~~g~~G~dCs~~~cpadC~g~G~Ci~--------G~C~C~~Gy~G~~C~~~  343 (525)
T KOG1225|consen  297 ECICNPGYSGKDCSIRRCPADCSGHGKCID--------GECLCDEGYTGELCIQR  343 (525)
T ss_pred             EeecCCCccccccccccCCccCCCCCcccC--------CceEeCCCCcCCccccc
Confidence            56666666552         3455555541        35666666666666664


No 42 
>PF08693 SKG6:  Transmembrane alpha-helix domain;  InterPro: IPR014805 SKG6 and AXL2 are membrane proteins that show polarised intracellular localisation [, ]. This entry represents the highly conserved transmembrane alpha-helical domain found in these proteins [, ]. The full-length AXL2 protein has a negative regulatory function in cytokinesis [].
Probab=82.90  E-value=0.11  Score=34.05  Aligned_cols=18  Identities=28%  Similarity=0.486  Sum_probs=7.7

Q ss_pred             HHHhhhHHHHHHHHHHHH
Q psy12432         88 ASIASGASIAVFLVVILC  105 (229)
Q Consensus        88 ~~Ia~gI~V~lLLVlii~  105 (229)
                      +.|+.|+.+-+.++++++
T Consensus        11 vaIa~~VvVPV~vI~~vl   28 (40)
T PF08693_consen   11 VAIAVGVVVPVGVIIIVL   28 (40)
T ss_pred             EEEEEEEEechHHHHHHH
Confidence            344444444444333333


No 43 
>KOG1225|consensus
Probab=81.90  E-value=1.8  Score=42.77  Aligned_cols=14  Identities=43%  Similarity=1.253  Sum_probs=9.7

Q ss_pred             EeecCCCCCCCCCc
Q psy12432         55 NCECADGYMGQRCE   68 (229)
Q Consensus        55 sC~C~~Gy~G~rCe   68 (229)
                      .|.|..+|.|..|+
T Consensus       297 ~CiC~~g~~G~dCs  310 (525)
T KOG1225|consen  297 ECICNPGYSGKDCS  310 (525)
T ss_pred             EeecCCCccccccc
Confidence            67777777777664


No 44 
>KOG1217|consensus
Probab=80.73  E-value=2.4  Score=38.80  Aligned_cols=42  Identities=38%  Similarity=0.818  Sum_probs=33.2

Q ss_pred             CCceeeCCCCCCCC---------------CCCCCCEEEeeccCCCCceEeecCCCCCCCCC
Q psy12432         22 TFHTYACPPTYATW---------------YCLNGATCFTVKIGESLLYNCECADGYMGQRC   67 (229)
Q Consensus        22 ~gy~C~C~~gy~gn---------------pClNGGtC~~~~~~~~~~~sC~C~~Gy~G~rC   67 (229)
                      .+|.|.|++||.+.               +|.|+++|....    ..|.|.|+.+|.|..|
T Consensus       250 ~~~~C~~~~g~~~~~~~~~~~~~~C~~~~~c~~~~~C~~~~----~~~~C~C~~g~~g~~~  306 (487)
T KOG1217|consen  250 GSYTCRCPEGYTGDACVTCVDVDSCALIASCPNGGTCVNVP----GSYRCTCPPGFTGRLC  306 (487)
T ss_pred             CceeeeCCCCccccccceeeeccccCCCCccCCCCeeecCC----CcceeeCCCCCCCCCC
Confidence            34778888777643               388899999864    2499999999999998


No 45 
>PF01299 Lamp:  Lysosome-associated membrane glycoprotein (Lamp);  InterPro: IPR002000 Lysosome-associated membrane glycoproteins (lamp) [] are integral membrane proteins, specific to lysosomes, and whose exact biological function is not yet clear. Structurally, the lamp proteins consist of two internally homologous lysosome-luminal domains separated by a proline-rich hinge region; at the C-terminal extremity there is a transmembrane region (TM) followed by a very short cytoplasmic tail (C). In each of the duplicated domains, there are two conserved disulphide bonds. This structure is schematically represented in the figure below.   +-----+ +-----+ +-----+ +-----+ | | | | | | | | xCxxxxxCxxxxxxxxxxxxCxxxxxCxxxxxxxxxCxxxxxCxxxxxxxxxxxxCxxxxxCxxxxxxxx +--------------------------++Hinge++--------------------------++TM++C+  In mammals, there are two closely related types of lamp: lamp-1 and lamp-2, which form major components of the lysosome membrane. In chicken lamp-1 is known as LEP100.  Also included in this entry is the macrophage protein CD68 (or macrosialin) [] is a heavily glycosylated integral membrane protein whose structure consists of a mucin-like domain followed by a proline-rich hinge; a single lamp-like domain; a transmembrane region and a short cytoplasmic tail.   Similar to CD68, mammalian lamp-3, which is expressed in lymphoid organs, dendritic cells and in lung, contains all the C-terminal regions but lacks the N-terminal lamp-like region []. In a lamp-family protein from nematodes [] only the part C-terminal to the hinge is conserved. ; GO: 0016020 membrane
Probab=80.43  E-value=0.79  Score=41.77  Aligned_cols=28  Identities=29%  Similarity=0.522  Sum_probs=13.1

Q ss_pred             HHHHhhhHHHHHHHHHHHHHHHHHHHHHhh
Q psy12432         87 TASIASGASIAVFLVVILCFSLYVHCQRRK  116 (229)
Q Consensus        87 ~~~Ia~gI~V~lLLVlii~vvly~~~kRrk  116 (229)
                      ++-||+|++|++|||++  ++.|+..+||.
T Consensus       272 ~vPIaVG~~La~lvliv--LiaYli~Rrr~  299 (306)
T PF01299_consen  272 LVPIAVGAALAGLVLIV--LIAYLIGRRRS  299 (306)
T ss_pred             hHHHHHHHHHHHHHHHH--HHhheeEeccc
Confidence            45556665554443333  33444444433


No 46 
>PF05568 ASFV_J13L:  African swine fever virus J13L protein;  InterPro: IPR008385 This family consists of several African swine fever virus (ASFV) j13L proteins [, , ].
Probab=79.85  E-value=3.2  Score=34.87  Aligned_cols=15  Identities=33%  Similarity=0.541  Sum_probs=7.4

Q ss_pred             HHHHHHHHHHhhhhh
Q psy12432        105 CFSLYVHCQRRKKQA  119 (229)
Q Consensus       105 ~vvly~~~kRrk~~k  119 (229)
                      ++++|++-.||||+.
T Consensus        46 ivli~lcssRKkKaa   60 (189)
T PF05568_consen   46 IVLIYLCSSRKKKAA   60 (189)
T ss_pred             HHHHHHHhhhhHHHH
Confidence            334555555555544


No 47 
>KOG0994|consensus
Probab=79.67  E-value=1.5  Score=47.15  Aligned_cols=50  Identities=30%  Similarity=0.664  Sum_probs=33.2

Q ss_pred             CCCCCCcee-eCCCCCCCC------------CCCCC--------CEEEeeccCCCCceEeecCCCCCCCCCcc
Q psy12432         18 RPNITFHTY-ACPPTYATW------------YCLNG--------ATCFTVKIGESLLYNCECADGYMGQRCEF   69 (229)
Q Consensus        18 ~~~~~gy~C-~C~~gy~gn------------pClNG--------GtC~~~~~~~~~~~sC~C~~Gy~G~rCe~   69 (229)
                      +...+||.| +|..||.|+            ||..|        ..|+....  +....|.|..||+|.||+.
T Consensus       879 qD~T~G~~CdrCl~GyyGdP~lg~g~~CrPCpCP~gp~Sg~~~A~sC~~d~~--t~~ivC~C~~GY~G~RCe~  949 (1758)
T KOG0994|consen  879 QDSTTGHSCDRCLDGYYGDPRLGSGIGCRPCPCPDGPASGRQHADSCYLDTR--TQQIVCHCQEGYSGSRCEI  949 (1758)
T ss_pred             cccccccchhhhhccccCCcccCCCCCCCCCCCCCCCccchhcccccccccc--ccceeeecccCccccchhh
Confidence            445567665 688888875            33333        34554321  2358999999999999986


No 48 
>PF12191 stn_TNFRSF12A:  Tumour necrosis factor receptor stn_TNFRSF12A_TNFR domain;  InterPro: IPR022316 The tumour necrosis factor (TNF) receptor (TNFR) superfamily comprises more than 20 type-I transmembrane proteins. Family members are defined based on similarity in their extracellular domain - a region that contains many cysteine residues arranged in a specific repetitive pattern []. The cysteines allow formation of an extended rod-like structure, responsible for ligand binding []. Upon receptor activation, different intracellular signalling complexes are assembled for different members of the TNFR superfamily, depending on their intracellular domains and sequences []. Activation of TNFRs can therefore induce a range of disparate effects, including cell proliferation, differentiation, survival, or apoptotic cell death, depending upon the receptor involved []. TNFRs are widely distributed and play important roles in many crucial biological processes, such as lymphoid and neuronal development, innate and adaptive immunity, and maintenance of cellular homeostasis []. Drugs that manipulate their signalling have potential roles in the prevention and treatment of many diseases, such as viral infections, coronary heart disease, transplant rejection, and immune disease []. TNF receptor 12 (also known as TWEAK receptor, and fibroblast growth factor-inducible-14 (Fn14)) has been implicated in endothelial cell growth and migration []. The receptor may also play a role in cell-matrix interactions [].; PDB: 2KN0_A 2RPJ_A 2KMZ_A 2EQP_A.
Probab=79.15  E-value=0.61  Score=37.87  Aligned_cols=27  Identities=22%  Similarity=0.101  Sum_probs=0.0

Q ss_pred             HHHhhhHHHHHHHHHHHHHHHHHHHHH
Q psy12432         88 ASIASGASIAVFLVVILCFSLYVHCQR  114 (229)
Q Consensus        88 ~~Ia~gI~V~lLLVlii~vvly~~~kR  114 (229)
                      +.|.+++..++||+.++.+++|+++-|
T Consensus        79 ~pi~~sal~v~lVl~llsg~lv~rrcr  105 (129)
T PF12191_consen   79 WPILGSALSVVLVLALLSGFLVWRRCR  105 (129)
T ss_dssp             ---------------------------
T ss_pred             hhhhhhHHHHHHHHHHHHHHHHHhhhh
Confidence            334444444444444444444444333


No 49 
>PF07213 DAP10:  DAP10 membrane protein;  InterPro: IPR009861 This family consists of several mammalian DAP10 membrane proteins. In activated mouse natural killer (NK) cells, the NKG2D receptor associates with two intracellular adaptors, DAP10 and DAP12, which trigger phosphatidyl inositol 3 kinase (PI3K) and Syk family protein tyrosine kinases, respectively. It has been suggested that the DAP10-PI3K pathway is sufficient to initiate NKG2D-mediated killing of target cells [].
Probab=79.00  E-value=2.5  Score=31.67  Aligned_cols=18  Identities=22%  Similarity=0.429  Sum_probs=7.9

Q ss_pred             HHHHHHHHHHHHHHhhhh
Q psy12432        101 VVILCFSLYVHCQRRKKQ  118 (229)
Q Consensus       101 Vlii~vvly~~~kRrk~~  118 (229)
                      ++|+.+++||-+.|+|++
T Consensus        48 LLIv~~vy~car~r~r~~   65 (79)
T PF07213_consen   48 LLIVLVVYYCARPRRRPT   65 (79)
T ss_pred             HHHHHHHHhhcccccCCc
Confidence            444444454444443333


No 50 
>PF14584 DUF4446:  Protein of unknown function (DUF4446)
Probab=78.78  E-value=3.4  Score=34.35  Aligned_cols=25  Identities=20%  Similarity=0.352  Sum_probs=11.4

Q ss_pred             hhhHHHHHHHHHHHHHHHHHHHHHh
Q psy12432         91 ASGASIAVFLVVILCFSLYVHCQRR  115 (229)
Q Consensus        91 a~gI~V~lLLVlii~vvly~~~kRr  115 (229)
                      +++++++++|+++++++++.+.+|.
T Consensus         4 ~i~l~~l~iilli~~~~~~~kl~kl   28 (151)
T PF14584_consen    4 IIGLLVLVIILLILIIILNIKLRKL   28 (151)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            4444444444444444454544433


No 51 
>cd01324 cbb3_Oxidase_CcoQ Cytochrome cbb oxidase CcoQ.  Cytochrome cbb3 oxidase, the terminal oxidase in the respiratory chains of proteobacteria, is a multi-chain transmembrane protein located in the cell membrane. Like other cytochrome oxidases, it catalyzes the reduction of O2 and simultaneously pumps protons across the membrane.  Found exclusively in proteobacteria, cbb3 is believed to be a modern enzyme that has evolved independently to perform a specialized function in microaerobic energy metabolism. The cbb3 operon contains four genes (ccoNOQP or fixNOQP), with ccoN coding for subunit I.  Instead of a CuA-containing subunit II analogous to other cytochrome oxidases, cbb3 utilizes subunits ccoO and ccoP, which contain one and two hemes, respectively, to transfer electrons to the binuclear center.  ccoQ, the fourth subunit, is a single transmembrane helix protein.  It has been shown to protect the core complex from proteolytic degradation by serine proteases.  See cd00919, cd01322
Probab=78.47  E-value=3.2  Score=28.06  Aligned_cols=32  Identities=16%  Similarity=0.201  Sum_probs=19.2

Q ss_pred             HHHhhhHHHHHHHHHHHHHHHHHHHHHhhhhh
Q psy12432         88 ASIASGASIAVFLVVILCFSLYVHCQRRKKQA  119 (229)
Q Consensus        88 ~~Ia~gI~V~lLLVlii~vvly~~~kRrk~~k  119 (229)
                      ..++...++++++++++++++|.+..++|++-
T Consensus         8 r~~a~~~~l~~~~~~Figiv~wa~~p~~k~~f   39 (48)
T cd01324           8 RGLADSWGLLYLALFFLGVVVWAFRPGRKKAF   39 (48)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHhCCCcchhH
Confidence            34454455556667777777777666655543


No 52 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=78.34  E-value=1.8  Score=27.38  Aligned_cols=27  Identities=37%  Similarity=0.719  Sum_probs=19.8

Q ss_pred             CCCCCCCEEEeeccCCCCceEeecCCCCCCC
Q psy12432         35 WYCLNGATCFTVKIGESLLYNCECADGYMGQ   65 (229)
Q Consensus        35 npClNGGtC~~~~~~~~~~~sC~C~~Gy~G~   65 (229)
                      ..|....+|.+..    ..+.|.|..||.|.
T Consensus         6 ~~C~~nA~C~~~~----~~~~C~C~~Gy~Gd   32 (36)
T PF12947_consen    6 GGCHPNATCTNTG----GSYTCTCKPGYEGD   32 (36)
T ss_dssp             GGS-TTCEEEE-T----TSEEEEE-CEEECC
T ss_pred             CCCCCCcEeecCC----CCEEeECCCCCccC
Confidence            3677889999874    27999999999886


No 53 
>KOG1094|consensus
Probab=78.22  E-value=18  Score=36.96  Aligned_cols=32  Identities=28%  Similarity=0.312  Sum_probs=16.8

Q ss_pred             HHHhhhHHHHHHHHHHHHHHHHHHHHHhhhhh
Q psy12432         88 ASIASGASIAVFLVVILCFSLYVHCQRRKKQA  119 (229)
Q Consensus        88 ~~Ia~gI~V~lLLVlii~vvly~~~kRrk~~k  119 (229)
                      ..|+++|.|+++++++++++++.+++|+.+.|
T Consensus       390 t~~~~~~f~~if~iva~ii~~~L~R~rr~~~k  421 (807)
T KOG1094|consen  390 TAILIIIFVAIFLIVALIIALMLWRWRRLLSK  421 (807)
T ss_pred             ceehHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence            34556666666655555554444444444333


No 54 
>TIGR01478 STEVOR variant surface antigen, stevor family. This model represents the stevor branch of the rifin/stevor family (pfam02009) of predicted variant surface antigens as found in Plasmodium falciparum. This model is based on a set of stevor sequences kindly provided by Matt Berriman from the Sanger Center. This is a global model and assesses a penalty for incomplete sequence. Additional fragmentary sequences may be found with the fragment model and a cutoff of 8 bits.
Probab=77.99  E-value=2.7  Score=38.59  Aligned_cols=7  Identities=14%  Similarity=0.586  Sum_probs=5.1

Q ss_pred             ceeeCCC
Q psy12432         24 HTYACPP   30 (229)
Q Consensus        24 y~C~C~~   30 (229)
                      ..|.|.+
T Consensus       143 s~cectd  149 (295)
T TIGR01478       143 KSCECTN  149 (295)
T ss_pred             Cceeeec
Confidence            5788876


No 55 
>PTZ00370 STEVOR; Provisional
Probab=77.63  E-value=2.7  Score=38.61  Aligned_cols=9  Identities=22%  Similarity=0.397  Sum_probs=5.9

Q ss_pred             ceeeCCCCC
Q psy12432         24 HTYACPPTY   32 (229)
Q Consensus        24 y~C~C~~gy   32 (229)
                      ..|.|.+-.
T Consensus       143 s~cectd~n  151 (296)
T PTZ00370        143 STCECTDIN  151 (296)
T ss_pred             Cceeeeecc
Confidence            378887643


No 56 
>PF12301 CD99L2:  CD99 antigen like protein 2;  InterPro: IPR022078  This family of proteins is found in eukaryotes. Proteins in this family are typically between 165 and 237 amino acids in length. CD99L2 and CD99 are involved in trans-endothelial migration of neutrophils in vitro and in the recruitment of neutrophils into inflamed peritoneum. 
Probab=77.50  E-value=2.5  Score=35.95  Aligned_cols=29  Identities=28%  Similarity=0.408  Sum_probs=19.8

Q ss_pred             HHHhhhHHHHHHHHHHHHHHHHHHHHHhh
Q psy12432         88 ASIASGASIAVFLVVILCFSLYVHCQRRK  116 (229)
Q Consensus        88 ~~Ia~gI~V~lLLVlii~vvly~~~kRrk  116 (229)
                      ..+|.||+-+|++.|+..+.-|+.++|||
T Consensus       114 ~g~IaGIvsav~valvGAvsSyiaYqkKK  142 (169)
T PF12301_consen  114 AGTIAGIVSAVVVALVGAVSSYIAYQKKK  142 (169)
T ss_pred             cchhhhHHHHHHHHHHHHHHHHHHHHhhc
Confidence            44566777677777777777777776655


No 57 
>KOG3637|consensus
Probab=77.43  E-value=1  Score=47.98  Aligned_cols=33  Identities=18%  Similarity=0.236  Sum_probs=24.9

Q ss_pred             hHHHHHhhhHHHHHHHHHHHHHHHHHH--HHHhhh
Q psy12432         85 LETASIASGASIAVFLVVILCFSLYVH--CQRRKK  117 (229)
Q Consensus        85 ~~~~~Ia~gI~V~lLLVlii~vvly~~--~kRrk~  117 (229)
                      ..+|+|+++++++|||++++.+++|..  +||+|+
T Consensus       976 vp~wiIi~svl~GLLlL~llv~~LwK~GFFKR~r~ 1010 (1030)
T KOG3637|consen  976 VPLWIIILSVLGGLLLLALLVLLLWKCGFFKRNRK 1010 (1030)
T ss_pred             cceeeehHHHHHHHHHHHHHHHHHHhcCccccCCC
Confidence            457888888988888888877777664  666665


No 58 
>PF08374 Protocadherin:  Protocadherin;  InterPro: IPR013585 The structure of protocadherins is similar to that of classic cadherins (IPR002126 from INTERPRO), but they also have some unique features associated with the cytoplasmic domains. They are expressed in a variety of organisms and are found in high concentrations in the brain where they seem to be localised mainly at cell-cell contact sites. Their expression seems to be developmentally regulated []. 
Probab=77.19  E-value=1.1  Score=39.50  Aligned_cols=21  Identities=14%  Similarity=0.384  Sum_probs=8.9

Q ss_pred             HHHhhhHHHHHHHHHHHHHHH
Q psy12432         88 ASIASGASIAVFLVVILCFSL  108 (229)
Q Consensus        88 ~~Ia~gI~V~lLLVlii~vvl  108 (229)
                      ++|+.|++.++||+++++++-
T Consensus        41 iaiVAG~~tVILVI~i~v~vR   61 (221)
T PF08374_consen   41 IAIVAGIMTVILVIFIVVLVR   61 (221)
T ss_pred             eeeecchhhhHHHHHHHHHHH
Confidence            344444444444444444333


No 59 
>PF06697 DUF1191:  Protein of unknown function (DUF1191);  InterPro: IPR010605 This family contains hypothetical plant proteins of unknown function.
Probab=76.58  E-value=0.9  Score=41.48  Aligned_cols=13  Identities=23%  Similarity=0.036  Sum_probs=6.2

Q ss_pred             HHHHhhhHHHHHH
Q psy12432         87 TASIASGASIAVF   99 (229)
Q Consensus        87 ~~~Ia~gI~V~lL   99 (229)
                      +|.|++|+.++++
T Consensus       212 ~W~iv~g~~~G~~  224 (278)
T PF06697_consen  212 WWKIVVGVVGGVV  224 (278)
T ss_pred             eEEEEEEehHHHH
Confidence            4555555444443


No 60 
>PF03302 VSP:  Giardia variant-specific surface protein;  InterPro: IPR005127 During infection, the intestinal protozoan parasite Giardia lamblia virus undergoes continuous antigenic variation which is determined by diversification of the parasite's major surface antigen, named VSP (variant surface protein).
Probab=76.56  E-value=0.65  Score=44.15  Aligned_cols=12  Identities=58%  Similarity=0.783  Sum_probs=5.1

Q ss_pred             hHHHHHhhhHHHH
Q psy12432         85 LETASIASGASIA   97 (229)
Q Consensus        85 ~~~~~Ia~gI~V~   97 (229)
                      +++.+|+ ||.|+
T Consensus       364 LstgaIa-GIsva  375 (397)
T PF03302_consen  364 LSTGAIA-GISVA  375 (397)
T ss_pred             cccccee-eeeeh
Confidence            3444443 44444


No 61 
>PF02009 Rifin_STEVOR:  Rifin/stevor family;  InterPro: IPR002858 Malaria is still a major cause of mortality in many areas of the world. Plasmodium falciparum causes the most severe human form of the disease and is responsible for most fatalities. Severe cases of malaria can occur when the parasite invades and then proliferates within red blood cell erythrocytes. The parasite produces many variant antigenic proteins, encoded by multigene families, which are present on the surface of the infected erythrocyte and play important roles in virulence. A crucial survival mechanism for the malaria parasite is its ability to evade the immune response by switching these variant surface antigens. The high virulence of P. falciparum relative to other malarial parasites is in large part due to the fact that in this organism many of these surface antigens mediate the binding of infected erythrocytes to the vascular endothelium (cytoadherence) and non-infected erythrocytes (rosetting). This can lead to the accumulation of infected cells in the vasculature of a variety of organs, blocking the blood flow and reducing the oxygen supply. Clinical symptoms of severe infection can include fever, progressive anaemia, multi-organ dysfunction and coma. For more information see []. Several multicopy gene families have been described in Plasmodium falciparum, including the stevor family of subtelomeric open reading frames and the rif interspersed repetitive elements. Both families contain three predicted transmembrane segments. It has been proposed that stevor and rif are members of a larger superfamily that code for variant surface antigens [].
Probab=76.50  E-value=4.4  Score=37.38  Aligned_cols=12  Identities=8%  Similarity=0.127  Sum_probs=6.8

Q ss_pred             HHHHHhhhhhhh
Q psy12432        110 VHCQRRKKQAQA  121 (229)
Q Consensus       110 ~~~kRrk~~k~~  121 (229)
                      +...|.|++|++
T Consensus       276 YLILRYRRKKKm  287 (299)
T PF02009_consen  276 YLILRYRRKKKM  287 (299)
T ss_pred             HHHHHHHHHhhh
Confidence            346666665543


No 62 
>KOG1217|consensus
Probab=75.67  E-value=3.3  Score=37.89  Aligned_cols=45  Identities=33%  Similarity=0.693  Sum_probs=35.1

Q ss_pred             CCceeeCCCCCCCC--------------CCCCCCEEEeeccCCCCceEeecCCCCCCCCCccc
Q psy12432         22 TFHTYACPPTYATW--------------YCLNGATCFTVKIGESLLYNCECADGYMGQRCEFK   70 (229)
Q Consensus        22 ~gy~C~C~~gy~gn--------------pClNGGtC~~~~~~~~~~~sC~C~~Gy~G~rCe~~   70 (229)
                      ..+.|.|..||.+.              +|.|++.|....    ..|.|.|+.+|.|..|+..
T Consensus       150 ~~~~c~C~~g~~~~~~~~~~~~C~~~~~~c~~~~~C~~~~----~~~~C~c~~~~~~~~~~~~  208 (487)
T KOG1217|consen  150 GPFRCSCTEGYEGEPCETDLDECIQYSSPCQNGGTCVNTG----GSYLCSCPPGYTGSTCETT  208 (487)
T ss_pred             CceeeeeCCCcccccccccccccccCCCCcCCCcccccCC----CCeeEeCCCCccCCcCcCC
Confidence            35778888888752              477888888764    2599999999999998865


No 63 
>PF08114 PMP1_2:  ATPase proteolipid family;  InterPro: IPR012589 This family consists of small proteolipids associated with the plasma membrane H+ ATPase. Two proteolipids (PMP1 and PMP2) are associated with the ATPase and both genes are similarly expressed in the wild-type strain of yeast. No modification of the level of transcription of one PMP gene is detected in a strain deleted of the other. Though both proteolipids show similarity with other small proteolipids associated with other cation -transporting ATPases, their functions remain unclear [].
Probab=74.40  E-value=4.6  Score=26.68  Aligned_cols=23  Identities=22%  Similarity=0.394  Sum_probs=11.0

Q ss_pred             HHHHHHHHHHHHHHHHHHHhhhh
Q psy12432         96 IAVFLVVILCFSLYVHCQRRKKQ  118 (229)
Q Consensus        96 V~lLLVlii~vvly~~~kRrk~~  118 (229)
                      |++..++|++.++|-++..||+.
T Consensus        17 Vglv~i~iva~~iYRKw~aRkr~   39 (43)
T PF08114_consen   17 VGLVGIGIVALFIYRKWQARKRA   39 (43)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHH
Confidence            33444455555555555444443


No 64 
>PF02038 ATP1G1_PLM_MAT8:  ATP1G1/PLM/MAT8 family;  InterPro: IPR000272  The FXYD protein family contains at least seven members in mammals []. Two other family members that are not obvious orthologs of any identified mammalian FXYD protein exist in zebrafish. All these proteins share a signature sequence of six conserved amino acids comprising the FXYD motif in the NH2-terminus, and two glycines and one serine residue in the transmembrane domain. FXYD proteins are widely distributed in mammalian tissues with prominent expression in tissues that perform fluid and solute transport or that are electrically excitable.   Initial functional characterisation suggested that FXYD proteins act as channels or as modulators of ion channels however studies have revealed that most FXYD proteins have another specific function and act as tissue-specific regulatory subunits of the Na,K-ATPase. Each of these auxiliary subunits produces a distinct functional effect on the transport characteristics of the Na,K-ATPase that is adjusted to the specific functional demands of the tissue in which the FXYD protein is expressed. FXYD proteins appear to preferentially associate with Na,K-ATPase alpha1-beta isozymes, and affect their function in a way that render them operationally complementary or supplementary to coexisting isozymes.; GO: 0005216 ion channel activity, 0006811 ion transport, 0016020 membrane; PDB: 2JO1_A 2JP3_A 2ZXE_G 3A3Y_G 3N23_E 3B8E_H 3KDP_G 3N2F_E.
Probab=74.08  E-value=4.4  Score=27.82  Aligned_cols=24  Identities=17%  Similarity=0.289  Sum_probs=10.5

Q ss_pred             HHHHHHHHHHHHHHHHHHHhhhhh
Q psy12432         96 IAVFLVVILCFSLYVHCQRRKKQA  119 (229)
Q Consensus        96 V~lLLVlii~vvly~~~kRrk~~k  119 (229)
                      .+||.++.+++++.-+++.+.++|
T Consensus        22 A~vlfi~Gi~iils~kckCk~~qk   45 (50)
T PF02038_consen   22 AGVLFILGILIILSGKCKCKFNQK   45 (50)
T ss_dssp             HHHHHHHHHHHHCTTHHHHHHSTT
T ss_pred             HHHHHHHHHHHHHcCccccCCCCC
Confidence            344444444444444444443333


No 65 
>PF06305 DUF1049:  Protein of unknown function (DUF1049);  InterPro: IPR010445 This entry consists of several hypothetical bacterial proteins of unknown function.
Probab=73.63  E-value=10  Score=26.42  Aligned_cols=26  Identities=19%  Similarity=0.230  Sum_probs=12.8

Q ss_pred             HhhhHHHHHHHHHHHHHHHHHHHHHh
Q psy12432         90 IASGASIAVFLVVILCFSLYVHCQRR  115 (229)
Q Consensus        90 Ia~gI~V~lLLVlii~vvly~~~kRr  115 (229)
                      |++...+++++..++.+..+++.+++
T Consensus        24 il~~f~~G~llg~l~~~~~~~~~r~~   49 (68)
T PF06305_consen   24 ILIAFLLGALLGWLLSLPSRLRLRRR   49 (68)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            33444455555555555555544433


No 66 
>KOG1836|consensus
Probab=72.01  E-value=2.5  Score=47.15  Aligned_cols=45  Identities=31%  Similarity=0.554  Sum_probs=34.0

Q ss_pred             eeCCCCCCCC------------CCCCCCEEEeeccCCCCceEee-cCCCCCCCCCcccCC
Q psy12432         26 YACPPTYATW------------YCLNGATCFTVKIGESLLYNCE-CADGYMGQRCEFKDL   72 (229)
Q Consensus        26 C~C~~gy~gn------------pClNGGtC~~~~~~~~~~~sC~-C~~Gy~G~rCe~~~~   72 (229)
                      -+|..||.|+            +|.|++.|.....  .....|+ |+.||+|.||+.-..
T Consensus       759 ~~C~~GfYg~~~~~~~~dC~~C~Cp~~~~~~~~~~--~~~~iCk~Cp~gytG~rCe~c~d  816 (1705)
T KOG1836|consen  759 AQCVDGFYGLPDLGTSGDCQPCPCPNGGACGQTPE--ILEVVCKNCPPGYTGLRCEECAD  816 (1705)
T ss_pred             hhhcCCCCCccccCCCCCCccCCCCCChhhcCcCc--ccceecCCCCCCCcccccccCCC
Confidence            3577777653            6778888887642  3479999 999999999997543


No 67 
>PRK00523 hypothetical protein; Provisional
Probab=71.54  E-value=11  Score=27.74  Aligned_cols=17  Identities=24%  Similarity=0.235  Sum_probs=7.2

Q ss_pred             HhhhHHHHHHHHHHHHH
Q psy12432         90 IASGASIAVFLVVILCF  106 (229)
Q Consensus        90 Ia~gI~V~lLLVlii~v  106 (229)
                      ++++++++.|+++++..
T Consensus         6 l~I~l~i~~li~G~~~G   22 (72)
T PRK00523          6 LALGLGIPLLIVGGIIG   22 (72)
T ss_pred             HHHHHHHHHHHHHHHHH
Confidence            34444444444444443


No 68 
>PF06084 Cytomega_TRL10:  Cytomegalovirus TRL10 protein;  InterPro: IPR009284 This family consists of several Cytomegalovirus TRL10 proteins. TRL10 represents a structural component of the virus particle and like the other HCMV envelope glycoproteins, is present in a disulphide-linked complex [].
Probab=71.47  E-value=2.5  Score=34.06  Aligned_cols=19  Identities=26%  Similarity=0.317  Sum_probs=11.7

Q ss_pred             eeeccccCCCCCCCCCccc
Q psy12432        152 LTRITTEAPRAADTRTSIT  170 (229)
Q Consensus       152 ~~~~~~e~~~~~~~~~~ld  170 (229)
                      +.||-|--+|+.|....|.
T Consensus       118 yeripyrp~rq~d~~p~~~  136 (150)
T PF06084_consen  118 YERIPYRPCRQNDNSPPIE  136 (150)
T ss_pred             hhhcCCCcccccCCCCccc
Confidence            3456666677766665555


No 69 
>COG4736 CcoQ Cbb3-type cytochrome oxidase, subunit 3 [Posttranslational modification, protein turnover, chaperones]
Probab=70.81  E-value=8.9  Score=27.28  Aligned_cols=35  Identities=14%  Similarity=0.210  Sum_probs=17.4

Q ss_pred             HHHHHHHHHHHHHHHHHHHHhhhhh-hhcc--ccccCC
Q psy12432         95 SIAVFLVVILCFSLYVHCQRRKKQA-QAAS--VCCTDG  129 (229)
Q Consensus        95 ~V~lLLVlii~vvly~~~kRrk~~k-~~~~--~~~~d~  129 (229)
                      .+++|.+++++++.+.++.++|++. +.++  ..+.|+
T Consensus        14 ~t~~~~l~fiavi~~ayr~~~K~~~d~aa~~~l~l~Dd   51 (60)
T COG4736          14 GTIAFTLFFIAVIYFAYRPGKKGEFDEAARGILPLNDD   51 (60)
T ss_pred             HHHHHHHHHHHHHHHHhcccchhhHHHHhccCCCCCcc
Confidence            3444555555555555555555554 3333  334444


No 70 
>PF11770 GAPT:  GRB2-binding adapter (GAPT);  InterPro: IPR021082  This entry represents a family of transmembrane proteins which bind the growth factor receptor-bound protein 2 (GRB2) in B cells []. In contrast to other transmembrane adaptor proteins, GAPT, which this entry represents, is not phosphorylated upon BCR ligation. It associates with GRB2 constitutively through its proline-rich region []. 
Probab=70.72  E-value=1.2  Score=37.33  Aligned_cols=16  Identities=25%  Similarity=0.509  Sum_probs=7.6

Q ss_pred             HHHhhhHHHHHHHHHH
Q psy12432         88 ASIASGASIAVFLVVI  103 (229)
Q Consensus        88 ~~Ia~gI~V~lLLVli  103 (229)
                      +.|++||.++||||+.
T Consensus        10 v~i~igi~Ll~lLl~c   25 (158)
T PF11770_consen   10 VAISIGISLLLLLLLC   25 (158)
T ss_pred             HHHHHHHHHHHHHHHH
Confidence            3445555555443333


No 71 
>smart00051 DSL delta serrate ligand.
Probab=70.45  E-value=3.5  Score=29.28  Aligned_cols=41  Identities=24%  Similarity=0.330  Sum_probs=25.2

Q ss_pred             ceeeCCCCCCCCCCCCCCEEEeeccCCCCc------eEeecCCCCCCCCC
Q psy12432         24 HTYACPPTYATWYCLNGATCFTVKIGESLL------YNCECADGYMGQRC   67 (229)
Q Consensus        24 y~C~C~~gy~gnpClNGGtC~~~~~~~~~~------~sC~C~~Gy~G~rC   67 (229)
                      +.-.|.++|.|..|.  -.|....+ ....      -.|.|.+||.|..|
T Consensus        17 ~rv~C~~~~yG~~C~--~~C~~~~d-~~~~~~Cd~~G~~~C~~Gw~G~~C   63 (63)
T smart00051       17 IRVTCDENYYGEGCN--KFCRPRDD-FFGHYTCDENGNKGCLEGWMGPYC   63 (63)
T ss_pred             EEeeCCCCCcCCccC--CEeCcCcc-ccCCccCCcCCCEecCCCCcCCCC
Confidence            467788988887664  23432110 0112      34669999999887


No 72 
>PF15102 TMEM154:  TMEM154 protein family
Probab=69.73  E-value=1.2  Score=37.12  Aligned_cols=6  Identities=33%  Similarity=0.717  Sum_probs=2.3

Q ss_pred             Hhhhhh
Q psy12432        114 RRKKQA  119 (229)
Q Consensus       114 Rrk~~k  119 (229)
                      |||+.|
T Consensus        83 kRkr~K   88 (146)
T PF15102_consen   83 KRKRTK   88 (146)
T ss_pred             eecccC
Confidence            333334


No 73 
>PF12768 Rax2:  Cortical protein marker for cell polarity
Probab=68.62  E-value=4.4  Score=36.93  Aligned_cols=8  Identities=0%  Similarity=0.090  Sum_probs=3.1

Q ss_pred             hhHHHHHH
Q psy12432         92 SGASIAVF   99 (229)
Q Consensus        92 ~gI~V~lL   99 (229)
                      ++|.++++
T Consensus       234 lAiALG~v  241 (281)
T PF12768_consen  234 LAIALGTV  241 (281)
T ss_pred             hHHHHHHH
Confidence            33334433


No 74 
>PF06365 CD34_antigen:  CD34/Podocalyxin family;  InterPro: IPR013836 This family consists of several mammalian CD34 antigen proteins. The CD34 antigen is a human leukocyte membrane protein expressed specifically by lymphohematopoietic progenitor cells. CD34 is a phosphoprotein. Activation of protein kinase C (PKC) has been found to enhance CD34 phosphorylation [, ]. This family contains several eukaryotic podocalyxin proteins. Podocalyxin is a major membrane protein of the glomerular epithelium and is thought to be involved in maintenance of the architecture of the foot processes and filtration slits characteristic of this unique epithelium by virtue of its high negative charge. Podocalyxin functions as an anti-adhesin that maintains an open filtration pathway between neighbouring foot processes in the glomerular epithelium by charge repulsion [].
Probab=68.53  E-value=3.6  Score=35.98  Aligned_cols=20  Identities=35%  Similarity=0.333  Sum_probs=11.0

Q ss_pred             HHHHHHHHHHHHHHHHHhhh
Q psy12432         98 VFLVVILCFSLYVHCQRRKK  117 (229)
Q Consensus        98 lLLVlii~vvly~~~kRrk~  117 (229)
                      +||++++++.+|+.++||..
T Consensus       111 ~lLla~~~~~~Y~~~~Rrs~  130 (202)
T PF06365_consen  111 FLLLAILLGAGYCCHQRRSW  130 (202)
T ss_pred             HHHHHHHHHHHHHhhhhccC
Confidence            34445555556666666543


No 75 
>PF15065 NCU-G1:  Lysosomal transcription factor, NCU-G1
Probab=68.04  E-value=2  Score=40.41  Aligned_cols=28  Identities=29%  Similarity=0.469  Sum_probs=13.2

Q ss_pred             hhhHHHHHHHHHHHHHHHHHHHHHhhhh
Q psy12432         91 ASGASIAVFLVVILCFSLYVHCQRRKKQ  118 (229)
Q Consensus        91 a~gI~V~lLLVlii~vvly~~~kRrk~~  118 (229)
                      +++|++++=++++++..+|+..+|+|++
T Consensus       322 i~~vgLG~P~l~li~Ggl~v~~~r~r~~  349 (350)
T PF15065_consen  322 IMAVGLGVPLLLLILGGLYVCLRRRRKR  349 (350)
T ss_pred             HHHHHhhHHHHHHHHhhheEEEeccccC
Confidence            3444444444444444455555554444


No 76 
>COG1862 YajC Preprotein translocase subunit YajC [Intracellular trafficking and secretion]
Probab=66.81  E-value=5.9  Score=30.71  Aligned_cols=19  Identities=21%  Similarity=0.475  Sum_probs=7.2

Q ss_pred             HHHHHHHHHHHHHHHhhhh
Q psy12432        100 LVVILCFSLYVHCQRRKKQ  118 (229)
Q Consensus       100 LVlii~vvly~~~kRrk~~  118 (229)
                      +|++++++++...+.+|++
T Consensus        15 ~vl~~~ifyFli~RPQrKr   33 (97)
T COG1862          15 LVLIFAIFYFLIIRPQRKR   33 (97)
T ss_pred             HHHHHHHHHHhhcCHHHHH
Confidence            3344443333333333333


No 77 
>PF15345 TMEM51:  Transmembrane protein 51
Probab=65.64  E-value=8.7  Score=34.28  Aligned_cols=25  Identities=16%  Similarity=0.104  Sum_probs=19.3

Q ss_pred             ccCCCCcceeeeeccccCCCCCCCC
Q psy12432        142 ERRPSPADFVLTRITTEAPRAADTR  166 (229)
Q Consensus       142 ~~~~~p~~~~~~~~~~e~~~~~~~~  166 (229)
                      .+-.+|.-+.|+...+...++.+.+
T Consensus       120 s~y~vPSYEEvv~s~~~~~r~~~~~  144 (233)
T PF15345_consen  120 SRYYVPSYEEVVNSGYSPVRQSNQR  144 (233)
T ss_pred             ccccCCChHHHHhcCCCCccccccC
Confidence            4566888888998888888877664


No 78 
>KOG3516|consensus
Probab=65.11  E-value=7.3  Score=42.00  Aligned_cols=33  Identities=33%  Similarity=0.761  Sum_probs=28.0

Q ss_pred             CCCCCCCEEEeeccCCCCceEeecCC-CCCCCCCcccC
Q psy12432         35 WYCLNGATCFTVKIGESLLYNCECAD-GYMGQRCEFKD   71 (229)
Q Consensus        35 npClNGGtC~~~~~~~~~~~sC~C~~-Gy~G~rCe~~~   71 (229)
                      .+|+|||.|...-.    +|+|.|.. .|.|..|....
T Consensus       961 ~~C~NGG~Cvery~----gytCDCs~Tay~Gp~Cs~ei  994 (1306)
T KOG3516|consen  961 YPCLNGGHCVERYD----GYTCDCSRTAYDGPFCSKEI  994 (1306)
T ss_pred             ccccCCCEEEEecC----ceeeccccCcCCCCcccccc
Confidence            59999999998642    69999987 59999999764


No 79 
>PF10577 UPF0560:  Uncharacterised protein family UPF0560;  InterPro: IPR018890  This family of proteins has no known function. 
Probab=65.05  E-value=6.1  Score=40.94  Aligned_cols=11  Identities=0%  Similarity=-0.091  Sum_probs=5.1

Q ss_pred             CCCCCCEEEee
Q psy12432         36 YCLNGATCFTV   46 (229)
Q Consensus        36 pClNGGtC~~~   46 (229)
                      .+.+.|.-.+.
T Consensus       220 ~Wv~~G~G~Vk  230 (807)
T PF10577_consen  220 AWVKSGLGMVK  230 (807)
T ss_pred             eeEecceEEEE
Confidence            34444554443


No 80 
>PF15050 SCIMP:  SCIMP protein
Probab=64.91  E-value=12  Score=30.44  Aligned_cols=9  Identities=33%  Similarity=0.774  Sum_probs=3.9

Q ss_pred             HHHHHHHHh
Q psy12432        107 SLYVHCQRR  115 (229)
Q Consensus       107 vly~~~kRr  115 (229)
                      ++||.++++
T Consensus        26 IlyCvcR~~   34 (133)
T PF15050_consen   26 ILYCVCRWQ   34 (133)
T ss_pred             HHHHHHHHH
Confidence            444444433


No 81 
>KOG1226|consensus
Probab=64.82  E-value=7.5  Score=40.08  Aligned_cols=49  Identities=29%  Similarity=0.506  Sum_probs=28.4

Q ss_pred             CCCCCCceeeCCCCCCCCCCC--CCCEEEeeccCCCCceEeecCCC-CCCCCCccc
Q psy12432         18 RPNITFHTYACPPTYATWYCL--NGATCFTVKIGESLLYNCECADG-YMGQRCEFK   70 (229)
Q Consensus        18 ~~~~~gy~C~C~~gy~gnpCl--NGGtC~~~~~~~~~~~sC~C~~G-y~G~rCe~~   70 (229)
                      .++|+|..|.|+..-  +-|.  +|++|.-..  .=.=..|.|... |.|..||.-
T Consensus       571 ~~GwtG~~C~C~~st--d~C~~~~G~iCSGrG--~C~Cg~C~C~~~~~sG~~CE~c  622 (783)
T KOG1226|consen  571 NPGWTGSACNCPLST--DTCESSDGQICSGRG--TCECGRCKCTDPPYSGEFCEKC  622 (783)
T ss_pred             CCCCccCCCCCCCCC--ccccCCCCceeCCCc--eeeCCceEcCCCCcCcchhhcC
Confidence            456666666666532  3443  455553221  000146678877 899999995


No 82 
>PF01299 Lamp:  Lysosome-associated membrane glycoprotein (Lamp);  InterPro: IPR002000 Lysosome-associated membrane glycoproteins (lamp) [] are integral membrane proteins, specific to lysosomes, and whose exact biological function is not yet clear. Structurally, the lamp proteins consist of two internally homologous lysosome-luminal domains separated by a proline-rich hinge region; at the C-terminal extremity there is a transmembrane region (TM) followed by a very short cytoplasmic tail (C). In each of the duplicated domains, there are two conserved disulphide bonds. This structure is schematically represented in the figure below.   +-----+ +-----+ +-----+ +-----+ | | | | | | | | xCxxxxxCxxxxxxxxxxxxCxxxxxCxxxxxxxxxCxxxxxCxxxxxxxxxxxxCxxxxxCxxxxxxxx +--------------------------++Hinge++--------------------------++TM++C+  In mammals, there are two closely related types of lamp: lamp-1 and lamp-2, which form major components of the lysosome membrane. In chicken lamp-1 is known as LEP100.  Also included in this entry is the macrophage protein CD68 (or macrosialin) [] is a heavily glycosylated integral membrane protein whose structure consists of a mucin-like domain followed by a proline-rich hinge; a single lamp-like domain; a transmembrane region and a short cytoplasmic tail.   Similar to CD68, mammalian lamp-3, which is expressed in lymphoid organs, dendritic cells and in lung, contains all the C-terminal regions but lacks the N-terminal lamp-like region []. In a lamp-family protein from nematodes [] only the part C-terminal to the hinge is conserved. ; GO: 0016020 membrane
Probab=64.70  E-value=4  Score=37.15  Aligned_cols=32  Identities=13%  Similarity=0.113  Sum_probs=17.1

Q ss_pred             HHhhhHHHHHHHHHHHHHHHHHHHHHhhhhhhh
Q psy12432         89 SIASGASIAVFLVVILCFSLYVHCQRRKKQAQA  121 (229)
Q Consensus        89 ~Ia~gI~V~lLLVlii~vvly~~~kRrk~~k~~  121 (229)
                      .+++.|+|++.| +++++++.+-+.=.|||.+.
T Consensus       270 ~~~vPIaVG~~L-a~lvlivLiaYli~Rrr~~~  301 (306)
T PF01299_consen  270 SDLVPIAVGAAL-AGLVLIVLIAYLIGRRRSRA  301 (306)
T ss_pred             cchHHHHHHHHH-HHHHHHHHHhheeEeccccc
Confidence            455556555543 44455555555555555543


No 83 
>PF14610 DUF4448:  Protein of unknown function (DUF4448)
Probab=64.61  E-value=1.2  Score=37.86  Aligned_cols=22  Identities=9%  Similarity=0.486  Sum_probs=9.7

Q ss_pred             HHhhhHHHHHHHHHHHHHHHHH
Q psy12432         89 SIASGASIAVFLVVILCFSLYV  110 (229)
Q Consensus        89 ~Ia~gI~V~lLLVlii~vvly~  110 (229)
                      ++++++-++|++++++++++++
T Consensus       159 ~laI~lPvvv~~~~~~~~~~~~  180 (189)
T PF14610_consen  159 ALAIALPVVVVVLALIMYGFFF  180 (189)
T ss_pred             eEEEEccHHHHHHHHHHHhhhe
Confidence            4445554444444444433333


No 84 
>PRK11901 hypothetical protein; Reviewed
Probab=64.39  E-value=7.2  Score=36.46  Aligned_cols=19  Identities=26%  Similarity=0.536  Sum_probs=9.4

Q ss_pred             HHhhhHHHHHHHHHHHHHH
Q psy12432         89 SIASGASIAVFLVVILCFS  107 (229)
Q Consensus        89 ~Ia~gI~V~lLLVlii~vv  107 (229)
                      .+.+||+|+|||||||.+.
T Consensus        37 h~MiGiGilVLlLLIi~Ig   55 (327)
T PRK11901         37 HMMIGIGILVLLLLIIAIG   55 (327)
T ss_pred             HHHHHHHHHHHHHHHHHHh
Confidence            3444555555555555443


No 85 
>PF11980 DUF3481:  Domain of unknown function (DUF3481);  InterPro: IPR022579  This domain of unknown function is located in the C terminus of the eukaryotic neuropilin receptor family of proteins. It is found in association with PF00754 from PFAM, PF00431 from PFAM and PF00629 from PFAM. There are two completely conserved residues (Y and E) that may be functionally important.
Probab=64.07  E-value=7.4  Score=29.58  Aligned_cols=29  Identities=21%  Similarity=0.154  Sum_probs=13.1

Q ss_pred             HHHHhhhHHHHH-HHHHHHHHHHHHHHHHh
Q psy12432         87 TASIASGASIAV-FLVVILCFSLYVHCQRR  115 (229)
Q Consensus        87 ~~~Ia~gI~V~l-LLVlii~vvly~~~kRr  115 (229)
                      |..|+.+..++| |+.+.+.+++||.+.+-
T Consensus        16 ~yyiiA~gga~llL~~v~l~vvL~C~r~~~   45 (87)
T PF11980_consen   16 WYYIIAMGGALLLLVAVCLGVVLYCHRFHW   45 (87)
T ss_pred             eeHHHhhccHHHHHHHHHHHHHHhhhhhcc
Confidence            344443333333 33333355566655544


No 86 
>PF04971 Lysis_S:  Lysis protein S ;  InterPro: IPR007054 The lysis S protein is a cytotoxic protein forming holes in membranes causing cell lysis. The action of Lysis S is independent of the proportion of acidic phospholipids in the membrane [].
Probab=63.06  E-value=12  Score=27.33  Aligned_cols=15  Identities=20%  Similarity=0.293  Sum_probs=6.5

Q ss_pred             hhhHHHHHHHHHHHH
Q psy12432         91 ASGASIAVFLVVILC  105 (229)
Q Consensus        91 a~gI~V~lLLVlii~  105 (229)
                      ++|++.+|++.++..
T Consensus        35 aIGvi~gi~~~~lt~   49 (68)
T PF04971_consen   35 AIGVIGGIFFGLLTY   49 (68)
T ss_pred             hHHHHHHHHHHHHHH
Confidence            334444444444443


No 87 
>PF10873 DUF2668:  Protein of unknown function (DUF2668);  InterPro: IPR022640  Members in this family of proteins are annotated as cysteine and tyrosine-rich protein 1, however currently no function is known []. 
Probab=62.90  E-value=4  Score=34.05  Aligned_cols=29  Identities=14%  Similarity=0.147  Sum_probs=15.1

Q ss_pred             HHHHHhhhHHHHHHHHHHHHHHHHHHHHH
Q psy12432         86 ETASIASGASIAVFLVVILCFSLYVHCQR  114 (229)
Q Consensus        86 ~~~~Ia~gI~V~lLLVlii~vvly~~~kR  114 (229)
                      .+++|++||++++.+|..+++.++...++
T Consensus        62 AIaGIVfgiVfimgvva~i~icvCmc~kn   90 (155)
T PF10873_consen   62 AIAGIVFGIVFIMGVVAGIAICVCMCMKN   90 (155)
T ss_pred             eeeeeehhhHHHHHHHHHHHHHHhhhhhc
Confidence            34566666655555555544444444443


No 88 
>PHA03265 envelope glycoprotein D; Provisional
Probab=62.24  E-value=3.8  Score=38.74  Aligned_cols=15  Identities=40%  Similarity=0.638  Sum_probs=7.5

Q ss_pred             HHHHHHHhhhhhhhc
Q psy12432        108 LYVHCQRRKKQAQAA  122 (229)
Q Consensus       108 ly~~~kRrk~~k~~~  122 (229)
                      +|+.++|||..++..
T Consensus       368 l~~~~rr~k~~~k~~  382 (402)
T PHA03265        368 LYVCLRRKKELKKSA  382 (402)
T ss_pred             HHHHhhhhhhhhhhh
Confidence            444455555555443


No 89 
>KOG0196|consensus
Probab=61.53  E-value=7.8  Score=40.53  Aligned_cols=17  Identities=24%  Similarity=0.196  Sum_probs=12.7

Q ss_pred             CcceeeeeccccCCCCC
Q psy12432        147 PADFVLTRITTEAPRAA  163 (229)
Q Consensus       147 p~~~~~~~~~~e~~~~~  163 (229)
                      .....+-+++||+|-++
T Consensus       602 G~k~YiDP~TYEDPnqA  618 (996)
T KOG0196|consen  602 GTKTYIDPHTYEDPNQA  618 (996)
T ss_pred             CceeecCCccccCccHH
Confidence            45667788999999654


No 90 
>PF05568 ASFV_J13L:  African swine fever virus J13L protein;  InterPro: IPR008385 This family consists of several African swine fever virus (ASFV) j13L proteins [, , ].
Probab=60.87  E-value=18  Score=30.39  Aligned_cols=36  Identities=6%  Similarity=0.184  Sum_probs=26.4

Q ss_pred             HHHHHhhhHHHHHHHHHHHHHHHHHHHHHhhhhhhh
Q psy12432         86 ETASIASGASIAVFLVVILCFSLYVHCQRRKKQAQA  121 (229)
Q Consensus        86 ~~~~Ia~gI~V~lLLVlii~vvly~~~kRrk~~k~~  121 (229)
                      -+..|+++|+|.++++++++.|.--++||+...-..
T Consensus        30 hm~tILiaIvVliiiiivli~lcssRKkKaaAAi~e   65 (189)
T PF05568_consen   30 HMYTILIAIVVLIIIIIVLIYLCSSRKKKAAAAIEE   65 (189)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHhhhhh
Confidence            356677788888777777777788888887766543


No 91 
>PF06143 Baculo_11_kDa:  Baculovirus 11 kDa family;  InterPro: IPR009313 This is a family of uncharacterised Baculovirus proteins that are all about 11 kDa in size.
Probab=60.58  E-value=27  Score=26.46  Aligned_cols=27  Identities=7%  Similarity=0.133  Sum_probs=13.1

Q ss_pred             hHHHHHhhhHHHHHHHHHHHHHHHHHH
Q psy12432         85 LETASIASGASIAVFLVVILCFSLYVH  111 (229)
Q Consensus        85 ~~~~~Ia~gI~V~lLLVlii~vvly~~  111 (229)
                      .++..++.|+.|.+++++++.+++.+.
T Consensus        34 rdFvLVic~~lVfVii~lFi~ll~~i~   60 (84)
T PF06143_consen   34 RDFVLVICCFLVFVIIVLFILLLYNIN   60 (84)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            345555555555555555554444433


No 92 
>PF02480 Herpes_gE:  Alphaherpesvirus glycoprotein E;  InterPro: IPR003404 Glycoprotein E (gE) of Alphaherpesvirus forms a complex with glycoprotein I (gI), functioning as an immunoglobulin G (IgG) Fc binding protein. gE is involved in virus spread but is not essential for propagation [].; GO: 0016020 membrane; PDB: 2GJ7_F 2GIY_B.
Probab=59.85  E-value=3  Score=40.39  Aligned_cols=31  Identities=13%  Similarity=0.316  Sum_probs=0.0

Q ss_pred             hhhHHHHHHHHHHHHHHHHHHHHHhhhhhhh
Q psy12432         91 ASGASIAVFLVVILCFSLYVHCQRRKKQAQA  121 (229)
Q Consensus        91 a~gI~V~lLLVlii~vvly~~~kRrk~~k~~  121 (229)
                      ++++++++++++++++++++.+.++|++++.
T Consensus       354 ~l~vVlgvavlivVv~viv~vc~~~rrrR~~  384 (439)
T PF02480_consen  354 LLGVVLGVAVLIVVVGVIVWVCLRCRRRRRQ  384 (439)
T ss_dssp             -------------------------------
T ss_pred             hHHHHHHHHHHHHHHHHHhheeeeehhcccc
Confidence            3444444445555555566667787777755


No 93 
>PF14851 FAM176:  FAM176 family
Probab=59.69  E-value=7  Score=32.76  Aligned_cols=20  Identities=20%  Similarity=0.268  Sum_probs=13.6

Q ss_pred             HHHhhhHHHHHHHHHHHHHH
Q psy12432         88 ASIASGASIAVFLVVILCFS  107 (229)
Q Consensus        88 ~~Ia~gI~V~lLLVlii~vv  107 (229)
                      .+.+.|||++|||.|.++++
T Consensus        24 LYFv~gVC~GLlLtLcllV~   43 (153)
T PF14851_consen   24 LYFVSGVCAGLLLTLCLLVI   43 (153)
T ss_pred             HHHHHHHHHHHHHHHHHHHh
Confidence            35578888888766655544


No 94 
>PTZ00046 rifin; Provisional
Probab=59.56  E-value=16  Score=34.59  Aligned_cols=16  Identities=25%  Similarity=0.515  Sum_probs=8.2

Q ss_pred             hHHHHHHHHHHHHHHH
Q psy12432         93 GASIAVFLVVILCFSL  108 (229)
Q Consensus        93 gI~V~lLLVlii~vvl  108 (229)
                      +|+|.||+++|+=+++
T Consensus       323 AIvVIVLIMvIIYLIL  338 (358)
T PTZ00046        323 AIVVIVLIMVIIYLIL  338 (358)
T ss_pred             HHHHHHHHHHHHHHHH
Confidence            4445555555555443


No 95 
>PRK12705 hypothetical protein; Provisional
Probab=59.24  E-value=12  Score=37.05  Aligned_cols=29  Identities=7%  Similarity=0.071  Sum_probs=14.9

Q ss_pred             hhhHHHHHHHHHHHHHHHHHHHHHhhhhh
Q psy12432         91 ASGASIAVFLVVILCFSLYVHCQRRKKQA  119 (229)
Q Consensus        91 a~gI~V~lLLVlii~vvly~~~kRrk~~k  119 (229)
                      ++++++++|+.++.+|+++|.++|+++.+
T Consensus         5 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~   33 (508)
T PRK12705          5 ILLVILLLLIGLLLGVLVVLLKKRQRLAK   33 (508)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            33344455555555555555555554443


No 96 
>PF15099 PIRT:  Phosphoinositide-interacting protein family
Probab=58.95  E-value=8.4  Score=31.39  Aligned_cols=11  Identities=18%  Similarity=0.495  Sum_probs=4.2

Q ss_pred             HHHHHHHHHHH
Q psy12432        101 VVILCFSLYVH  111 (229)
Q Consensus       101 Vlii~vvly~~  111 (229)
                      +++++..++|+
T Consensus        93 mlL~~~alcW~  103 (129)
T PF15099_consen   93 MLLACSALCWK  103 (129)
T ss_pred             HHHHhhhheeh
Confidence            33333334443


No 97 
>PF14914 LRRC37AB_C:  LRRC37A/B like protein 1 C-terminal domain
Probab=58.92  E-value=12  Score=31.40  Aligned_cols=27  Identities=19%  Similarity=0.453  Sum_probs=11.6

Q ss_pred             HhhhHHHHHHHHHHHHHHHHHHHHHhh
Q psy12432         90 IASGASIAVFLVVILCFSLYVHCQRRK  116 (229)
Q Consensus        90 Ia~gI~V~lLLVlii~vvly~~~kRrk  116 (229)
                      +++.+.+++++++|+..++.++.+|+.
T Consensus       123 laisvtvv~~iliii~CLiei~shr~a  149 (154)
T PF14914_consen  123 LAISVTVVVMILIIIFCLIEICSHRRA  149 (154)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHhcccc
Confidence            344444444444443334444444443


No 98 
>KOG3488|consensus
Probab=58.83  E-value=16  Score=27.02  Aligned_cols=21  Identities=14%  Similarity=0.271  Sum_probs=11.0

Q ss_pred             HHHhhhHHHHHHHHHHHHHHH
Q psy12432         88 ASIASGASIAVFLVVILCFSL  108 (229)
Q Consensus        88 ~~Ia~gI~V~lLLVlii~vvl  108 (229)
                      +.|.+.+.++++++.++..++
T Consensus        48 yAi~iPvaagl~ll~lig~Fi   68 (81)
T KOG3488|consen   48 YAITIPVAAGLFLLCLIGTFI   68 (81)
T ss_pred             HHhhhHHHHHHHHHHHHHHHH
Confidence            445555666665555444433


No 99 
>TIGR02976 phageshock_pspB phage shock protein B. This model describes the PspB protein of the psp (phage shock protein) operon, as found in Escherichia coli and many related species. Expression of a phage protein called secretin protein IV, and a number of other stresses including ethanol, heat shock, and defects in protein secretion trigger sigma-54-dependent expression of the phage shock regulon. PspB is both a regulator and an effector protein of the phage shock response.
Probab=58.21  E-value=16  Score=26.97  Aligned_cols=12  Identities=17%  Similarity=0.251  Sum_probs=4.8

Q ss_pred             HHHHHHHHHHHh
Q psy12432        104 LCFSLYVHCQRR  115 (229)
Q Consensus       104 i~vvly~~~kRr  115 (229)
                      ..++++.++.+|
T Consensus        18 ap~wl~lHY~~k   29 (75)
T TIGR02976        18 APLWLILHYRSK   29 (75)
T ss_pred             HHHHHHHHHHhh
Confidence            333444444433


No 100
>PF12597 DUF3767:  Protein of unknown function (DUF3767);  InterPro: IPR022533  This group of proteins includes mitochodrial cytochrome c oxidase proteins [], and some transmembrane domain-containing proteins of unknown function known as FAM36A. Proteins in this family are typically between 112 and 199 amino acids in length. 
Probab=58.00  E-value=24  Score=28.16  Aligned_cols=27  Identities=26%  Similarity=0.349  Sum_probs=18.0

Q ss_pred             HHHHHHHHHHHHHHHHHHHHhhhhhhh
Q psy12432         95 SIAVFLVVILCFSLYVHCQRRKKQAQA  121 (229)
Q Consensus        95 ~V~lLLVlii~vvly~~~kRrk~~k~~  121 (229)
                      .|+-|+++-++.|.||+++|++.+.++
T Consensus        72 avgsF~l~s~~~we~Cr~~r~~~~~~~   98 (118)
T PF12597_consen   72 AVGSFFLGSLGSWEYCRYNRRKERQQM   98 (118)
T ss_pred             hhHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            344456666667788888877776654


No 101
>PHA02902 putative IMV membrane protein; Provisional
Probab=57.63  E-value=15  Score=26.69  Aligned_cols=22  Identities=27%  Similarity=0.576  Sum_probs=13.9

Q ss_pred             HHHHHHHHHHHHHHHHHHhhhh
Q psy12432         97 AVFLVVILCFSLYVHCQRRKKQ  118 (229)
Q Consensus        97 ~lLLVlii~vvly~~~kRrk~~  118 (229)
                      .++.|+|+|.++|..++|.|..
T Consensus         9 ~~v~v~Ivclliya~YrR~kci   30 (70)
T PHA02902          9 LAVIVIIFCLLIYAAYKRYKCI   30 (70)
T ss_pred             HHHHHHHHHHHHHHHHHHhcCC
Confidence            3445666777777777775443


No 102
>TIGR01477 RIFIN variant surface antigen, rifin family. This model represents the rifin branch of the rifin/stevor family (pfam02009) of predicted variant surface antigens as found in Plasmodium falciparum. This model is based on a set of rifin sequences kindly provided by Matt Berriman from the Sanger Center. This is a global model and assesses a penalty for incomplete sequence. Additional fragmentary sequences may be found with the fragment model and a cutoff of 20 bits.
Probab=57.48  E-value=18  Score=34.26  Aligned_cols=14  Identities=7%  Similarity=0.159  Sum_probs=9.3

Q ss_pred             HHHHHHHhhhhhhh
Q psy12432        108 LYVHCQRRKKQAQA  121 (229)
Q Consensus       108 ly~~~kRrk~~k~~  121 (229)
                      +.+..-|.|++|++
T Consensus       328 IIYLILRYRRKKKM  341 (353)
T TIGR01477       328 IIYLILRYRRKKKM  341 (353)
T ss_pred             HHHHHHHhhhcchh
Confidence            33447888888765


No 103
>PF03302 VSP:  Giardia variant-specific surface protein;  InterPro: IPR005127 During infection, the intestinal protozoan parasite Giardia lamblia virus undergoes continuous antigenic variation which is determined by diversification of the parasite's major surface antigen, named VSP (variant surface protein).
Probab=57.18  E-value=2.9  Score=39.73  Aligned_cols=27  Identities=26%  Similarity=0.348  Sum_probs=19.0

Q ss_pred             HHHHHhhhHHHHHH-HHHHHHHHHHHHH
Q psy12432         86 ETASIASGASIAVF-LVVILCFSLYVHC  112 (229)
Q Consensus        86 ~~~~Ia~gI~V~lL-LVlii~vvly~~~  112 (229)
                      .+++|.++++|+|- ||.|++||+.+|-
T Consensus       368 aIaGIsvavvvvVgglvGfLcWwf~crg  395 (397)
T PF03302_consen  368 AIAGISVAVVVVVGGLVGFLCWWFICRG  395 (397)
T ss_pred             ceeeeeehhHHHHHHHHHHHhhheeecc
Confidence            56778777666655 7788888876643


No 104
>PF05083 LST1:  LST-1 protein;  InterPro: IPR007775 B144/LST1 is a gene encoded in the human major histocompatibility complex that produces multiple forms of alternatively spliced mRNA and encodes peptides fewer than 100 amino acids in length. B144/LST1 is strongly expressed in dendritic cells. Transfection of B144/LST1 into a variety of cells induces morphologic changes including the production of long, thin filopodia []. A possible role in modulating immune responses. Induces morphological changes including production of filopodia and microspikes when overexpressed in a variety of cell types and may be involved in dendritic cell maturation. Isoform 1 and isoform 2 have an inhibitory effect on lymphocyte proliferation [, ]. ; GO: 0000902 cell morphogenesis, 0006955 immune response, 0016020 membrane
Probab=56.90  E-value=11  Score=27.79  Aligned_cols=9  Identities=33%  Similarity=0.265  Sum_probs=3.7

Q ss_pred             HHHHHHHhh
Q psy12432        108 LYVHCQRRK  116 (229)
Q Consensus       108 ly~~~kRrk  116 (229)
                      +|+..+|.+
T Consensus        16 lC~lsrRvk   24 (74)
T PF05083_consen   16 LCRLSRRVK   24 (74)
T ss_pred             HHHHHhhhh
Confidence            444444433


No 105
>PF11884 DUF3404:  Domain of unknown function (DUF3404);  InterPro: IPR021821  This domain is functionally uncharacterised. This domain is found in bacteria. This presumed domain is about 260 amino acids in length. This domain is found associated with PF02518 from PFAM, PF00512 from PFAM. 
Probab=56.90  E-value=22  Score=32.28  Aligned_cols=10  Identities=40%  Similarity=0.548  Sum_probs=3.8

Q ss_pred             HHHHHHHHHH
Q psy12432        105 CFSLYVHCQR  114 (229)
Q Consensus       105 ~vvly~~~kR  114 (229)
                      .+.+|-+.+|
T Consensus       248 gw~~y~~~~k  257 (262)
T PF11884_consen  248 GWSLYRWNQK  257 (262)
T ss_pred             HHHHHHHHHH
Confidence            3344433333


No 106
>PF10873 DUF2668:  Protein of unknown function (DUF2668);  InterPro: IPR022640  Members in this family of proteins are annotated as cysteine and tyrosine-rich protein 1, however currently no function is known []. 
Probab=56.56  E-value=25  Score=29.42  Aligned_cols=27  Identities=4%  Similarity=0.143  Sum_probs=16.5

Q ss_pred             HHHhhhHHHHHHHHHHHHHHHHHHHHH
Q psy12432         88 ASIASGASIAVFLVVILCFSLYVHCQR  114 (229)
Q Consensus        88 ~~Ia~gI~V~lLLVlii~vvly~~~kR  114 (229)
                      ..++.||+++++++++++..+.+....
T Consensus        60 gtAIaGIVfgiVfimgvva~i~icvCm   86 (155)
T PF10873_consen   60 GTAIAGIVFGIVFIMGVVAGIAICVCM   86 (155)
T ss_pred             cceeeeeehhhHHHHHHHHHHHHHHhh
Confidence            556677877776666666555444443


No 107
>PF04478 Mid2:  Mid2 like cell wall stress sensor;  InterPro: IPR007567 This family represents a region near the C terminus of Mid2, which contains a transmembrane region. The remainder of the protein sequence is serine-rich and of low complexity, and is therefore impossible to align accurately. Mid2 is thought to act as a mechanosensor of cell wall stress. The C-terminal cytoplasmic region of Mid2 is known to interact with Rom2, a guanine nucleotide exchange factor (GEF) for Rho1, which is part of the cell wall integrity signalling pathway [].
Probab=56.44  E-value=0.96  Score=37.90  Aligned_cols=31  Identities=19%  Similarity=0.027  Sum_probs=16.7

Q ss_pred             HhhhHHHHHHHHHHHHHHHHHHHHHhhhhhhh
Q psy12432         90 IASGASIAVFLVVILCFSLYVHCQRRKKQAQA  121 (229)
Q Consensus        90 Ia~gI~V~lLLVlii~vvly~~~kRrk~~k~~  121 (229)
                      |++|++|+| -+.|+++++.+.+.-.+++|+.
T Consensus        50 IVIGvVVGV-Gg~ill~il~lvf~~c~r~kkt   80 (154)
T PF04478_consen   50 IVIGVVVGV-GGPILLGILALVFIFCIRRKKT   80 (154)
T ss_pred             EEEEEEecc-cHHHHHHHHHhheeEEEecccC
Confidence            355555553 3355555555555555555554


No 108
>PF12259 DUF3609:  Protein of unknown function (DUF3609);  InterPro: IPR022048  This domain family is found in eukaryotes and viruses, and is typically between 348 and 360 amino acids in length. 
Probab=56.41  E-value=8.1  Score=36.50  Aligned_cols=17  Identities=35%  Similarity=0.469  Sum_probs=7.3

Q ss_pred             HHHHHHHHHHHHHHHhh
Q psy12432        100 LVVILCFSLYVHCQRRK  116 (229)
Q Consensus       100 LVlii~vvly~~~kRrk  116 (229)
                      +|++.++|+|.++.||+
T Consensus       311 ~vl~~~~~~~~~~~~~~  327 (361)
T PF12259_consen  311 IVLISLAWLYRTFRRRQ  327 (361)
T ss_pred             HHHHHHHhheeehHHHH
Confidence            33444444544444433


No 109
>PF15099 PIRT:  Phosphoinositide-interacting protein family
Probab=55.24  E-value=11  Score=30.65  Aligned_cols=29  Identities=10%  Similarity=0.353  Sum_probs=17.9

Q ss_pred             hhhHHHHHHHHHHHHHHHHHHHHHhhhhh
Q psy12432         91 ASGASIAVFLVVILCFSLYVHCQRRKKQA  119 (229)
Q Consensus        91 a~gI~V~lLLVlii~vvly~~~kRrk~~k  119 (229)
                      ...+++.+|+...+||..-.++||+|+++
T Consensus        87 lLs~GLmlL~~~alcW~~~~rkK~~kr~e  115 (129)
T PF15099_consen   87 LLSLGLMLLACSALCWKPIIRKKKKKRRE  115 (129)
T ss_pred             HHHHHHHHHHhhhheehhhhHhHHHHhhh
Confidence            34455666666667777766666555555


No 110
>PF15330 SIT:  SHP2-interacting transmembrane adaptor protein, SIT
Probab=54.76  E-value=19  Score=28.26  Aligned_cols=21  Identities=19%  Similarity=0.362  Sum_probs=11.5

Q ss_pred             HHHHHHHHHHHHHHHhhhhhh
Q psy12432        100 LVVILCFSLYVHCQRRKKQAQ  120 (229)
Q Consensus       100 LVlii~vvly~~~kRrk~~k~  120 (229)
                      |+|++.+++-+...|.++++.
T Consensus         9 llLll~l~asl~~wr~~~rq~   29 (107)
T PF15330_consen    9 LLLLLSLAASLLAWRMKQRQK   29 (107)
T ss_pred             HHHHHHHHHHHHHHHHHhhhc
Confidence            334444556666666666553


No 111
>PRK10905 cell division protein DamX; Validated
Probab=53.39  E-value=8.3  Score=36.01  Aligned_cols=13  Identities=38%  Similarity=0.710  Sum_probs=5.2

Q ss_pred             hHHHHHHHHHHHH
Q psy12432         93 GASIAVFLVVILC  105 (229)
Q Consensus        93 gI~V~lLLVlii~  105 (229)
                      ||+|+|||||||.
T Consensus         3 GiGilVLlLLIig   15 (328)
T PRK10905          3 GVGILVLLLLIIG   15 (328)
T ss_pred             chhHHHHHHHHHH
Confidence            3334444434433


No 112
>PF00053 Laminin_EGF:  Laminin EGF-like (Domains III and V);  InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below.  +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain  In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=53.15  E-value=9.7  Score=24.98  Aligned_cols=23  Identities=35%  Similarity=0.733  Sum_probs=18.0

Q ss_pred             CEEEeeccCCCCceEeecCCCCCCCCCcc
Q psy12432         41 ATCFTVKIGESLLYNCECADGYMGQRCEF   69 (229)
Q Consensus        41 GtC~~~~~~~~~~~sC~C~~Gy~G~rCe~   69 (229)
                      ..|...      ...|.|..+|.|.+|+.
T Consensus        11 ~~C~~~------~G~C~C~~~~~G~~C~~   33 (49)
T PF00053_consen   11 QTCDPS------TGQCVCKPGTTGPRCDQ   33 (49)
T ss_dssp             SSEEET------CEEESBSTTEESTTS-E
T ss_pred             CcccCC------CCEEeccccccCCcCcC
Confidence            467653      47999999999999996


No 113
>PHA03265 envelope glycoprotein D; Provisional
Probab=53.11  E-value=6.9  Score=37.10  Aligned_cols=39  Identities=15%  Similarity=0.024  Sum_probs=27.9

Q ss_pred             hhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhhhhhhh
Q psy12432         83 VMLETASIASGASIAVFLVVILCFSLYVHCQRRKKQAQA  121 (229)
Q Consensus        83 ~~~~~~~Ia~gI~V~lLLVlii~vvly~~~kRrk~~k~~  121 (229)
                      .+....+|+++.+++|++|+.|+.+++-++|..||+.+.
T Consensus       346 ~~~~~g~~ig~~i~glv~vg~il~~~~rr~k~~~k~~~~  384 (402)
T PHA03265        346 NSTFVGISVGLGIAGLVLVGVILYVCLRRKKELKKSAQN  384 (402)
T ss_pred             CCcccceEEccchhhhhhhhHHHHHHhhhhhhhhhhhhc
Confidence            343344455566677889999999999999877766543


No 114
>PF05399 EVI2A:  Ectropic viral integration site 2A protein (EVI2A);  InterPro: IPR008608 This family contains several mammalian ectropic viral integration site 2A (EVI2A) proteins. The function of this protein is unknown although it is thought to be a membrane protein and may function as an oncogene in retrovirus induced myeloid tumours [, ].; GO: 0016021 integral to membrane
Probab=52.80  E-value=31  Score=30.55  Aligned_cols=10  Identities=20%  Similarity=0.537  Sum_probs=5.5

Q ss_pred             CCCCCCcccC
Q psy12432         62 YMGQRCEFKD   71 (229)
Q Consensus        62 y~G~rCe~~~   71 (229)
                      |.-+.||...
T Consensus       116 ~kk~~CEen~  125 (227)
T PF05399_consen  116 FKKEICEENN  125 (227)
T ss_pred             cchhhhhcCc
Confidence            3345677664


No 115
>PF01102 Glycophorin_A:  Glycophorin A;  InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Glycophorin A (PAS-2) and glycophorin B (PAS-3) belong to the MNS blood group system and are associated with antigens that include M/N, S/s, U, He, Mi(a), M(c), Vw, Mur, M(g), Vr, M(e), Mt(a), St(a), Ri(a), Cl(a), Ny(a), Hut, Hil, M(v), Far, Mit, Dantu, Hop, Nob, En(a), ENKT, amongst others. Glycophorin A is the major sialoglycoprotein of the erythrocyte membrane []. Structurally, glycophorin A consists of an N-terminal extracellular domain, heavily glycosylated on serine and threonine residues, followed by a transmembrane region and a C-terminal cytoplasmic domain. Other glycophorins in this entry such as Glycophorin B and Glycophorin E represent minor sialoglycoproteins in the erythrocyte membrane.; GO: 0016021 integral to membrane; PDB: 2KPF_B 1AFO_B 2KPE_A.
Probab=51.37  E-value=21  Score=28.80  Aligned_cols=29  Identities=14%  Similarity=0.236  Sum_probs=18.2

Q ss_pred             hhHHHHHHHHHHHHHHHHHHHHHhhhhhhh
Q psy12432         92 SGASIAVFLVVILCFSLYVHCQRRKKQAQA  121 (229)
Q Consensus        92 ~gI~V~lLLVlii~vvly~~~kRrk~~k~~  121 (229)
                      ++|++++ +.+++++++++.+.-+|++|+.
T Consensus        67 ~~Ii~gv-~aGvIg~Illi~y~irR~~Kk~   95 (122)
T PF01102_consen   67 IGIIFGV-MAGVIGIILLISYCIRRLRKKS   95 (122)
T ss_dssp             HHHHHHH-HHHHHHHHHHHHHHHHHHS---
T ss_pred             eehhHHH-HHHHHHHHHHHHHHHHHHhccC
Confidence            4444444 5677777788888888888875


No 116
>KOG1214|consensus
Probab=51.12  E-value=15  Score=38.74  Aligned_cols=41  Identities=27%  Similarity=0.622  Sum_probs=28.4

Q ss_pred             ceeeCCCCCC----------------CCCCCCC-------CEEEeeccCCCCceEeecCCCCCCC
Q psy12432         24 HTYACPPTYA----------------TWYCLNG-------ATCFTVKIGESLLYNCECADGYMGQ   65 (229)
Q Consensus        24 y~C~C~~gy~----------------gnpClNG-------GtC~~~~~~~~~~~sC~C~~Gy~G~   65 (229)
                      |.|.|..+|.                -|+|..|       |.|..+..+ -+.|+|.|.+||.|.
T Consensus       757 ~rceC~~gy~F~dd~~tCV~i~~pap~n~Ce~g~h~C~i~g~a~c~~hG-gs~y~C~CLPGfsGD  820 (1289)
T KOG1214|consen  757 YRCECRSGYEFADDRHTCVLITPPAPANPCEDGSHTCAIAGQARCVHHG-GSTYSCACLPGFSGD  820 (1289)
T ss_pred             eeEEEeecceeccCCcceEEecCCCCCCccccCccccCcCCceEEEecC-CceEEEeecCCccCC
Confidence            6888888886                1578766       444333212 236999999999997


No 117
>PF06247 Plasmod_Pvs28:  Plasmodium ookinete surface protein Pvs28;  InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=51.09  E-value=8.5  Score=33.45  Aligned_cols=42  Identities=26%  Similarity=0.607  Sum_probs=28.4

Q ss_pred             CceeeCCCCCCC-------------------CCCCCCCEEEeecc-CCCCceEeecCCCCCC
Q psy12432         23 FHTYACPPTYAT-------------------WYCLNGATCFTVKI-GESLLYNCECADGYMG   64 (229)
Q Consensus        23 gy~C~C~~gy~g-------------------npClNGGtC~~~~~-~~~~~~sC~C~~Gy~G   64 (229)
                      .+.|.|++||..                   .+|-+-+.|..... +.+..|.|.|..||.-
T Consensus        19 HfEC~Cnegfvl~~EntCE~kv~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~   80 (197)
T PF06247_consen   19 HFECKCNEGFVLKNENTCEEKVECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGYIL   80 (197)
T ss_dssp             EEEEEESTTEEEEETTEEEE----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEEE
T ss_pred             ceEEEcCCCcEEccccccccceecCcccccCccccchhhhhcCCCcccceeEEEecccCcee
Confidence            357999998862                   36777788887652 2245799999999754


No 118
>TIGR00847 ccoS cytochrome oxidase maturation protein, cbb3-type. CcoS from Rhodobacter capsulatus has been shown essential for incorporation of redox-active prosthetic groups (heme, Cu) into cytochrome cbb(3) oxidase. FixS of Bradyrhizobium japonicum appears to have the same function. Members of this family are found so far in organisms with a cbb3-type cytochrome oxidase, including Neisseria meningitidis, Helicobacter pylori, Campylobacter jejuni, Caulobacter crescentus, Bradyrhizobium japonicum, and Rhodobacter capsulatus.
Probab=50.54  E-value=22  Score=24.43  Aligned_cols=21  Identities=10%  Similarity=0.055  Sum_probs=10.6

Q ss_pred             HHHHHHHHHHHHHHHHHHHHH
Q psy12432         94 ASIAVFLVVILCFSLYVHCQR  114 (229)
Q Consensus        94 I~V~lLLVlii~vvly~~~kR  114 (229)
                      |.+.++++++.+++++|-.|+
T Consensus         8 IpiSl~l~~~~l~~f~Wavk~   28 (51)
T TIGR00847         8 IPISLLLGGVGLVAFLWSLKS   28 (51)
T ss_pred             HHHHHHHHHHHHHHHHHHHcc
Confidence            344555555555555554444


No 119
>PTZ00045 apical membrane antigen 1; Provisional
Probab=50.41  E-value=48  Score=33.40  Aligned_cols=29  Identities=24%  Similarity=0.440  Sum_probs=16.0

Q ss_pred             CceeeCCCCCCCCCCCCCCEEEeeccCCCCceEeecCCCCCC
Q psy12432         23 FHTYACPPTYATWYCLNGATCFTVKIGESLLYNCECADGYMG   64 (229)
Q Consensus        23 gy~C~C~~gy~gnpClNGGtC~~~~~~~~~~~sC~C~~Gy~G   64 (229)
                      ..+|.|.+...     -..+|.        .|.|.|...--|
T Consensus       459 ~~~c~~t~~~~-----~~~~C~--------~yvc~~~~~sc~  487 (595)
T PTZ00045        459 SPFCPATPLKV-----SNSTCS--------FYVCERVEKSCG  487 (595)
T ss_pred             cccCccccccc-----cCcccc--------eeEeeeeecccc
Confidence            45777766432     234453        477777765433


No 120
>PF07423 DUF1510:  Protein of unknown function (DUF1510);  InterPro: IPR009988 This family consists of several hypothetical bacterial proteins of around 200 residues in length. The function of this family is unknown.
Probab=50.37  E-value=12  Score=33.06  Aligned_cols=12  Identities=25%  Similarity=0.589  Sum_probs=4.6

Q ss_pred             HHHHHHHHHHHH
Q psy12432         94 ASIAVFLVVILC  105 (229)
Q Consensus        94 I~V~lLLVlii~  105 (229)
                      |+|++|||+|++
T Consensus        20 I~IV~lLIiiva   31 (217)
T PF07423_consen   20 IGIVSLLIIIVA   31 (217)
T ss_pred             HHHHHHHHHHHh
Confidence            333333333333


No 121
>PRK00523 hypothetical protein; Provisional
Probab=50.19  E-value=32  Score=25.34  Aligned_cols=34  Identities=12%  Similarity=0.041  Sum_probs=19.4

Q ss_pred             hhhHHHHHH-HHHHHHHHHHHHHHHhhhhhhhccc
Q psy12432         91 ASGASIAVF-LVVILCFSLYVHCQRRKKQAQAASV  124 (229)
Q Consensus        91 a~gI~V~lL-LVlii~vvly~~~kRrk~~k~~~~~  124 (229)
                      ++++.+++. +.+++.+++=++..|+.-+|.+..|
T Consensus         3 ~~~l~I~l~i~~li~G~~~Gffiark~~~k~l~~N   37 (72)
T PRK00523          3 AIGLALGLGIPLLIVGGIIGYFVSKKMFKKQIREN   37 (72)
T ss_pred             hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC
Confidence            455555544 3444445666667777777766433


No 122
>PF15176 LRR19-TM:  Leucine-rich repeat family 19 TM domain
Probab=49.93  E-value=26  Score=27.47  Aligned_cols=23  Identities=4%  Similarity=0.068  Sum_probs=9.1

Q ss_pred             HHHHHhhhHHHHHHHHHHHHHHH
Q psy12432         86 ETASIASGASIAVFLVVILCFSL  108 (229)
Q Consensus        86 ~~~~Ia~gI~V~lLLVlii~vvl  108 (229)
                      .|..+++.++++|++-++|.+++
T Consensus        16 sW~~LVGVv~~al~~SlLIalaa   38 (102)
T PF15176_consen   16 SWPFLVGVVVTALVTSLLIALAA   38 (102)
T ss_pred             ccHhHHHHHHHHHHHHHHHHHHH
Confidence            34444433333444433333333


No 123
>COG4499 Predicted membrane protein [Function unknown]
Probab=49.58  E-value=20  Score=34.53  Aligned_cols=29  Identities=24%  Similarity=0.429  Sum_probs=17.3

Q ss_pred             hhHHHHHHHHHHHHHHHHHHHHHhhhhhh
Q psy12432         92 SGASIAVFLVVILCFSLYVHCQRRKKQAQ  120 (229)
Q Consensus        92 ~gI~V~lLLVlii~vvly~~~kRrk~~k~  120 (229)
                      +||++.+++|++++++.|+++..+.++..
T Consensus       223 ~giGliillvl~li~~~Y~~f~~~p~qea  251 (434)
T COG4499         223 FGIGLIILLVLLLIYFTYYYFSNQPKQEA  251 (434)
T ss_pred             HHHhHHHHHHHHHHHHHHHHHHcChhHHH
Confidence            45555566666666666666666555543


No 124
>PF15048 OSTbeta:  Organic solute transporter subunit beta protein
Probab=48.63  E-value=16  Score=29.61  Aligned_cols=32  Identities=13%  Similarity=0.176  Sum_probs=13.5

Q ss_pred             HHHHhhhHHHHHHHHHHHHHHHHHHHHHhhhh
Q psy12432         87 TASIASGASIAVFLVVILCFSLYVHCQRRKKQ  118 (229)
Q Consensus        87 ~~~Ia~gI~V~lLLVlii~vvly~~~kRrk~~  118 (229)
                      |-+.+.++.++++++.++++..-+...|+||.
T Consensus        35 WNysiL~Ls~vvlvi~~~LLgrsi~ANRnrK~   66 (125)
T PF15048_consen   35 WNYSILALSFVVLVISFFLLGRSIQANRNRKM   66 (125)
T ss_pred             cchHHHHHHHHHHHHHHHHHHHHhHhcccccc
Confidence            43334444444444444444444444444333


No 125
>PRK09458 pspB phage shock protein B; Provisional
Probab=48.17  E-value=18  Score=26.83  Aligned_cols=17  Identities=18%  Similarity=0.530  Sum_probs=6.7

Q ss_pred             HHHHHHHHHHHHHHHHH
Q psy12432         98 VFLVVILCFSLYVHCQR  114 (229)
Q Consensus        98 lLLVlii~vvly~~~kR  114 (229)
                      +|++++.-+|++.+++.
T Consensus        12 iF~ifVaPiWL~LHY~s   28 (75)
T PRK09458         12 IFVLFVAPIWLWLHYRS   28 (75)
T ss_pred             HHHHHHHHHHHHHhhcc
Confidence            33333333444444443


No 126
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=47.26  E-value=15  Score=24.32  Aligned_cols=17  Identities=35%  Similarity=0.835  Sum_probs=14.8

Q ss_pred             ceEeecCCCCCCCCCcc
Q psy12432         53 LYNCECADGYMGQRCEF   69 (229)
Q Consensus        53 ~~sC~C~~Gy~G~rCe~   69 (229)
                      ...|.|..+|.|.+|+.
T Consensus        18 ~G~C~C~~~~~G~~C~~   34 (50)
T cd00055          18 TGQCECKPNTTGRRCDR   34 (50)
T ss_pred             CCEEeCCCcCCCCCCCC
Confidence            36899999999999985


No 127
>PRK14748 kdpF potassium-transporting ATPase subunit F; Provisional
Probab=46.56  E-value=39  Score=20.60  Aligned_cols=14  Identities=21%  Similarity=0.384  Sum_probs=6.4

Q ss_pred             hhhHHHHHHHHHHH
Q psy12432         91 ASGASIAVFLVVIL  104 (229)
Q Consensus        91 a~gI~V~lLLVlii  104 (229)
                      ++|++++++|++-+
T Consensus         6 i~G~ilv~lLlgYL   19 (29)
T PRK14748          6 ITGVLLVFLLLGYL   19 (29)
T ss_pred             HHHHHHHHHHHHHH
Confidence            44555444444433


No 128
>PF02439 Adeno_E3_CR2:  Adenovirus E3 region protein CR2;  InterPro: IPR003470 Early region 3 (E3) of human adenoviruses (Ads) codes for proteins that appear to control viral interactions with the host []. This region called CR1 (conserved region 1) [] is found three times in Human adenovirus 19 (a subgroup D adenovirus) 49 kDa protein in the E3 region. CR1 is also found in the 20.1 Kd protein of subgroup B adenoviruses. The function of this 80 amino acid region is unknown. This region is probably a divergent immunoglobulin domain.
Probab=46.29  E-value=42  Score=21.81  Aligned_cols=16  Identities=13%  Similarity=0.113  Sum_probs=8.0

Q ss_pred             hhHHHHHHHHHHHHHH
Q psy12432         92 SGASIAVFLVVILCFS  107 (229)
Q Consensus        92 ~gI~V~lLLVlii~vv  107 (229)
                      ++|++++++.++++++
T Consensus         6 IaIIv~V~vg~~iiii   21 (38)
T PF02439_consen    6 IAIIVAVVVGMAIIII   21 (38)
T ss_pred             hhHHHHHHHHHHHHHH
Confidence            4555665554444433


No 129
>PF15347 PAG:  Phosphoprotein associated with glycosphingolipid-enriched
Probab=46.29  E-value=26  Score=33.53  Aligned_cols=26  Identities=31%  Similarity=0.572  Sum_probs=14.0

Q ss_pred             HHHHHHHHHHHHHHHHHHHhhhhhhh
Q psy12432         96 IAVFLVVILCFSLYVHCQRRKKQAQA  121 (229)
Q Consensus        96 V~lLLVlii~vvly~~~kRrk~~k~~  121 (229)
                      |+.||||.+++++|--+.|.||.++.
T Consensus        24 v~~f~lis~LifLCsSC~reKK~~~~   49 (428)
T PF15347_consen   24 VTTFLLISFLIFLCSSCDREKKPKQQ   49 (428)
T ss_pred             HHHHHHHHHHHHHhhcccccccCCCC
Confidence            33444444445555556677766643


No 130
>PF02699 YajC:  Preprotein translocase subunit;  InterPro: IPR003849 Secretion across the inner membrane in some Gram-negative bacteria occurs via the preprotein translocase pathway. Proteins are produced in the cytoplasm as precursors, and require a chaperone subunit to direct them to the translocase component []. From there, the mature proteins are either targeted to the outer membrane, or remain as periplasmic proteins []. The translocase protein subunits are encoded on the bacterial chromosome.  The translocase itself comprises 7 proteins, including a chaperone (SecB), ATPase (SecA), an integral membrane complex (SecY, SecE and SecG), and two additional membrane proteins that promote the release of the mature peptide into the periplasm (SecD and SecF) []. Other cytoplasmic/periplasmic proteins play a part in preprotein translocase activity, namely YidC and YajC []. The latter is bound in a complex to SecD and SecF, and plays a part in stabilising and regulating secretion through the SecYEG integral membrane component via SecA [].  Homologues of the YajC gene have been found in a range of pathogenic and commensal microbes. Brucella abortis YajC- and SecD-like proteins were shown to stimulate a Th1 cell-mediated immune response in mice, and conferred protection when challenged with B.abortis []. Therefore, these proteins may have an antigenic role as well as a secretory one in virulent bacteria []. A number of previously uncharacterised "hypothetical" proteins also show similarity to E.coli YajC, suggesting that this family is wider than first thought [].  More recently, the precise interactions between the E.coli SecYEG complex, SecD, SecF, YajC and YidC have been studied []. Rather than acting individually, the four proteins form a heterotetrameric complex and associate with the SecYEG heterotrimeric complex []. The SecF and YajC subunits link the complex to the integral membrane translocase. ; PDB: 2RDD_B.
Probab=46.21  E-value=40  Score=24.94  Aligned_cols=10  Identities=10%  Similarity=0.272  Sum_probs=6.2

Q ss_pred             HHHhhhhhhh
Q psy12432        112 CQRRKKQAQA  121 (229)
Q Consensus       112 ~kRrk~~k~~  121 (229)
                      +||+|+++.+
T Consensus        24 kk~~k~~~~m   33 (82)
T PF02699_consen   24 KKQQKEHQEM   33 (82)
T ss_dssp             HHHHHHHTTG
T ss_pred             HHHHHHHHHH
Confidence            5566677654


No 131
>COG3763 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=45.84  E-value=57  Score=23.97  Aligned_cols=14  Identities=14%  Similarity=-0.102  Sum_probs=5.7

Q ss_pred             hhHHHHHHHHHHHH
Q psy12432         92 SGASIAVFLVVILC  105 (229)
Q Consensus        92 ~gI~V~lLLVlii~  105 (229)
                      +++++++|+-+++.
T Consensus         8 l~ivl~ll~G~~~G   21 (71)
T COG3763           8 LLIVLALLAGLIGG   21 (71)
T ss_pred             HHHHHHHHHHHHHH
Confidence            33444444433333


No 132
>PF05808 Podoplanin:  Podoplanin;  InterPro: IPR008783 This family consists of several mammalian podoplanin-like proteins which are thought to control specifically the unique shape of podocytes [].; GO: 0016021 integral to membrane; PDB: 3IET_X.
Probab=45.67  E-value=6.9  Score=33.11  Aligned_cols=12  Identities=25%  Similarity=0.290  Sum_probs=0.0

Q ss_pred             HHHhhhHHHHHH
Q psy12432         88 ASIASGASIAVF   99 (229)
Q Consensus        88 ~~Ia~gI~V~lL   99 (229)
                      ++|++|+++++-
T Consensus       132 VGIIVGVLlaIG  143 (162)
T PF05808_consen  132 VGIIVGVLLAIG  143 (162)
T ss_dssp             ------------
T ss_pred             eeehhhHHHHHH
Confidence            344444444433


No 133
>PTZ00208 65 kDa invariant surface glycoprotein; Provisional
Probab=45.65  E-value=12  Score=35.97  Aligned_cols=31  Identities=19%  Similarity=0.471  Sum_probs=16.1

Q ss_pred             HHHHHhhhHHHHHHHHHHHHHHHHHHHHHhh
Q psy12432         86 ETASIASGASIAVFLVVILCFSLYVHCQRRK  116 (229)
Q Consensus        86 ~~~~Ia~gI~V~lLLVlii~vvly~~~kRrk  116 (229)
                      ++.+|++.+.|-+++|.|+.++++++.+|||
T Consensus       384 ~~~~i~~avl~p~~il~~~~~~~~~~v~rrr  414 (436)
T PTZ00208        384 RTAMIILAVLVPAIILAIIAVAFFIMVKRRR  414 (436)
T ss_pred             hhHHHHHHHHHHHHHHHHHHHHhheeeeecc
Confidence            3455666555555555555544545444444


No 134
>PF05510 Sarcoglycan_2:  Sarcoglycan alpha/epsilon;  InterPro: IPR008908 Sarcoglycans are a subcomplex of transmembrane proteins which are part of the dystrophin-glycoprotein complex. They are expressed in the skeletal, cardiac and smooth muscle. Although numerous studies have been conducted on the sarcoglycan subcomplex in skeletal and cardiac muscle, the manner of the distribution and localisation of these proteins along the nonjunctional sarcolemma is not clear []. This family contains alpha and epsilon members.; GO: 0016012 sarcoglycan complex
Probab=45.45  E-value=20  Score=34.38  Aligned_cols=25  Identities=20%  Similarity=0.313  Sum_probs=11.0

Q ss_pred             hhhHHHHHHHHHHHHHHHHHHHHHh
Q psy12432         91 ASGASIAVFLVVILCFSLYVHCQRR  115 (229)
Q Consensus        91 a~gI~V~lLLVlii~vvly~~~kRr  115 (229)
                      ++..+|++||++++..++|+++...
T Consensus       290 ~iPl~i~llL~llLs~Imc~rREG~  314 (386)
T PF05510_consen  290 AIPLIIALLLLLLLSYIMCCRREGV  314 (386)
T ss_pred             HHHHHHHHHHHHHHHHHheechHHh
Confidence            3333444444444444454444443


No 135
>KOG4260|consensus
Probab=45.13  E-value=18  Score=33.48  Aligned_cols=43  Identities=23%  Similarity=0.620  Sum_probs=29.6

Q ss_pred             eCCCCCCCCCCCCCCEEEeeccCCCCceEeecCCCCCCCCCcccC
Q psy12432         27 ACPPTYATWYCLNGATCFTVKIGESLLYNCECADGYMGQRCEFKD   71 (229)
Q Consensus        27 ~C~~gy~gnpClNGGtC~~~~~~~~~~~sC~C~~Gy~G~rCe~~~   71 (229)
                      .|+.|- ..+|...|.|.-.. .-.+.-.|+|..||.|..|..=.
T Consensus       143 ~Cpggs-er~C~GnG~C~GdG-sR~GsGkCkC~~GY~Gp~C~~Cg  185 (350)
T KOG4260|consen  143 QCPGGS-ERPCFGNGSCHGDG-SREGSGKCKCETGYTGPLCRYCG  185 (350)
T ss_pred             cCCCCC-cCCcCCCCcccCCC-CCCCCCcccccCCCCCccccccc
Confidence            344443 25888888887532 11346899999999999988653


No 136
>PF06305 DUF1049:  Protein of unknown function (DUF1049);  InterPro: IPR010445 This entry consists of several hypothetical bacterial proteins of unknown function.
Probab=44.40  E-value=70  Score=22.01  Aligned_cols=7  Identities=29%  Similarity=0.264  Sum_probs=2.5

Q ss_pred             hhHHHHH
Q psy12432         92 SGASIAV   98 (229)
Q Consensus        92 ~gI~V~l   98 (229)
                      +|++++.
T Consensus        30 ~G~llg~   36 (68)
T PF06305_consen   30 LGALLGW   36 (68)
T ss_pred             HHHHHHH
Confidence            3333333


No 137
>PHA02681 ORF089 virion membrane protein; Provisional
Probab=43.98  E-value=37  Score=25.85  Aligned_cols=18  Identities=22%  Similarity=0.663  Sum_probs=11.9

Q ss_pred             HHHHHHHHHHHHHHHHHh
Q psy12432         98 VFLVVILCFSLYVHCQRR  115 (229)
Q Consensus        98 lLLVlii~vvly~~~kRr  115 (229)
                      +++|.|+|.++|..++|.
T Consensus        10 ~V~V~IVclliya~YRR~   27 (92)
T PHA02681         10 VIVISIVCYIVIMMYRRS   27 (92)
T ss_pred             HHHHHHHHHHHHHHHHhc
Confidence            445666677777777765


No 138
>PF05283 MGC-24:  Multi-glycosylated core protein 24 (MGC-24);  InterPro: IPR007947 CD164 is a mucin-like receptor, or sialomucin, with specificity in receptor/ ligand interactions that depends on the structural characteristics of the mucin-like receptor. Its functions include mediating, or regulating, haematopoietic progenitor cell adhesion and the negative regulation of their growth and/or-differentiation. It exists in the native state as a disulphide- linked homodimer of two 80-85kDa subunits. It is usually expressed by CD34+ and CD341o/- haematopoietic stem cells and associated microenvironmental cells. It contains, in its extracellular region, two mucin domains (I and II) linked by a non-mucin domain, which has been predicted to contain intra- disulphide bridges. This receptor may play a key role in haematopoiesis by facilitating the adhesion of human CD34+ cells to bone marrow stroma and by negatively regulating CD34+ CD341o/- haematopoietic progenitor cell proliferation. These effects involve the CD164 class I and/or II epitopes recognised by the monoclonal antibodies (mAbs) 105A5 and 103B2/9E10. These epitopes are carbohydrate-dependent and are located on the N-terminal mucin domain I [, ]. It has been found that murine MGC-24v and rat endolyn share significant sequence similarities with human CD164. However, CD164 lacks the consensus glycosaminoglycan (GAG)-attachment site found in MGC-24; it is possible that GAG-association is responsible for the high molecular weight of the epithelial-derived MGC-24 glycoprotein [].  Genomic structure studies have placed CD164 within the mucin-subgroup that comprises multiple exons, and demonstrate the diverse chromosomal distribution of this family of molecules. Molecules with such multiple exons may have sophisticated regulatory mechanisms that involve not only post-translational modifications of the oligosaccharide side chains, but also differential exon usage. Although differences in the intron and exon sizes are seen between the mouse and human genes, the predicted proteins are similar in size and structure, maintaining functionally important motifs that regulate cell proliferation or subcellular distribution [].  CD164 is a gene whose expression depends on differential usage of poly- adenylation sites within the 3'-UTR. The conserved distribution of the 3.2- and 1.2-kb CD164 transcripts between mouse and human suggests that (i) a mechanism may exist to regulate tissue-specific polyadenylation, and (ii) differences in polyadenylation are important for the expression and function of CD164 in different tissues. Two other aspects of the structure of CD164 are of particular interest. First, it shares one of several conserved features of a cytokine-binding pocket - in this respect, it is notable that evidence exists for a class of cell-surface sialomucin modulators that directly interact with growth factor receptors to regulate their response to physiological ligands. Second, its cytoplasmic tail contains a C-terminal YHTL motif found in many endocytic membrane proteins or receptors. These Tyr-based motifs bind to adaptor proteins, which mediate the sorting of membrane proteins into transport vesicles from the plasma membrane to the endosomes, and between intracellular compartments. 
Probab=43.90  E-value=21  Score=30.88  Aligned_cols=29  Identities=24%  Similarity=0.534  Sum_probs=15.1

Q ss_pred             hhHHHHHhhhHHHHHHHHHHHHHHHHHHHH
Q psy12432         84 MLETASIASGASIAVFLVVILCFSLYVHCQ  113 (229)
Q Consensus        84 ~~~~~~Ia~gI~V~lLLVlii~vvly~~~k  113 (229)
                      .++.+..|+||++.|-| +.|++++|.++|
T Consensus       157 ~FD~~SFiGGIVL~LGv-~aI~ff~~KF~k  185 (186)
T PF05283_consen  157 TFDAASFIGGIVLTLGV-LAIIFFLYKFCK  185 (186)
T ss_pred             CCchhhhhhHHHHHHHH-HHHHHHHhhhcc
Confidence            34556667777555443 333444544443


No 139
>PF14575 EphA2_TM:  Ephrin type-A receptor 2 transmembrane domain; PDB: 3KUL_A 2XVD_A 2VX1_A 2VWV_A 2VX0_A 2VWY_A 2VWZ_A 2VWW_A 2VWU_A 2VWX_A ....
Probab=43.19  E-value=11  Score=27.62  Aligned_cols=28  Identities=21%  Similarity=0.343  Sum_probs=8.9

Q ss_pred             hhhHHHHHHHHHHHHHHHHHHHHHhhhh
Q psy12432         91 ASGASIAVFLVVILCFSLYVHCQRRKKQ  118 (229)
Q Consensus        91 a~gI~V~lLLVlii~vvly~~~kRrk~~  118 (229)
                      ++++++++++++++++++.+.++|+++.
T Consensus         3 i~~~~~g~~~ll~~v~~~~~~~rr~~~~   30 (75)
T PF14575_consen    3 IASIIVGVLLLLVLVIIVIVCFRRCKYS   30 (75)
T ss_dssp             HHHHHHHHHHHHHHHHHHHCCCTT----
T ss_pred             EehHHHHHHHHHHhheeEEEEEeeEcCC
Confidence            3334444443333333333333333333


No 140
>PF05393 Hum_adeno_E3A:  Human adenovirus early E3A glycoprotein;  InterPro: IPR008652 This family consists of several early glycoproteins (E3A), from human adenovirus type 2.; GO: 0016021 integral to membrane
Probab=42.72  E-value=35  Score=26.24  Aligned_cols=28  Identities=25%  Similarity=0.524  Sum_probs=17.0

Q ss_pred             hHHHHHHH-HHHHHHHHHHHHHHhhhhhh
Q psy12432         93 GASIAVFL-VVILCFSLYVHCQRRKKQAQ  120 (229)
Q Consensus        93 gI~V~lLL-Vlii~vvly~~~kRrk~~k~  120 (229)
                      |+-.+++. +.++++++|+.+-++||+.+
T Consensus        34 gm~~lvI~~iFil~VilwfvCC~kRkrsR   62 (94)
T PF05393_consen   34 GMWFLVICGIFILLVILWFVCCKKRKRSR   62 (94)
T ss_pred             chhHHHHHHHHHHHHHHHHHHHHHhhhcc
Confidence            34445544 44555677777887777654


No 141
>KOG3653|consensus
Probab=42.67  E-value=1.2e+02  Score=30.31  Aligned_cols=6  Identities=17%  Similarity=0.396  Sum_probs=3.1

Q ss_pred             CCcccc
Q psy12432        181 SQLYHQ  186 (229)
Q Consensus       181 ~~~f~~  186 (229)
                      +.+||.
T Consensus       286 Vt~fh~  291 (534)
T KOG3653|consen  286 VTEFHP  291 (534)
T ss_pred             Eeeecc
Confidence            445554


No 142
>PRK13855 type IV secretion system protein VirB10; Provisional
Probab=42.15  E-value=68  Score=30.67  Aligned_cols=9  Identities=0%  Similarity=0.032  Sum_probs=3.4

Q ss_pred             HHHHHHHHH
Q psy12432        100 LVVILCFSL  108 (229)
Q Consensus       100 LVlii~vvl  108 (229)
                      +|.++++++
T Consensus        39 ~~~~~~~~~   47 (376)
T PRK13855         39 VLALSLSLI   47 (376)
T ss_pred             HHHHHHHHH
Confidence            333333333


No 143
>KOG1226|consensus
Probab=42.14  E-value=25  Score=36.43  Aligned_cols=20  Identities=25%  Similarity=0.609  Sum_probs=11.6

Q ss_pred             eeeCCCC-CCCCCCCCCCEEE
Q psy12432         25 TYACPPT-YATWYCLNGATCF   44 (229)
Q Consensus        25 ~C~C~~g-y~gnpClNGGtC~   44 (229)
                      .|+|.+. |.|..|..-.+|.
T Consensus       606 ~C~C~~~~~sG~~CE~cptc~  626 (783)
T KOG1226|consen  606 RCKCTDPPYSGEFCEKCPTCP  626 (783)
T ss_pred             ceEcCCCCcCcchhhcCCCCC
Confidence            5666666 6666665444443


No 144
>PF03597 CcoS:  Cytochrome oxidase maturation protein cbb3-type;  InterPro: IPR004714 Cytochrome cbb3 oxidases are found almost exclusively in Proteobacteria, and represent a distinctive class of proton-pumping respiratory haem-copper oxidases (HCO) that lack many of the key structural features that contribute to the reaction cycle of the intensely studied mitochondrial cytochrome c oxidase (CcO). Expression of cytochrome cbb3 oxidase allows human pathogens to colonise anoxic tissues and agronomically important diazotrophs to sustain nitrogen fixation []. Genes encoding a cytochrome cbb3 oxidase were initially designated fixNOQP (ccoNOQP), the ccoNOQP operon is always found close to a second gene cluster, known as fixGHIS (ccoGHIS) whose expression is necessary for the assembly of a functional cbb3 oxidase. On the basis of their derived amino acid sequences each of the four proteins encoded by the ccoGHIS operon are thought to be membrane-bound. It has been suggested that they may function in concert as a multi-subunit complex, possibly playing a role in the uptake and metabolism of copper required for the assembly of the binuclear centre of cytochrome cbb3 oxidase. 
Probab=41.72  E-value=39  Score=22.54  Aligned_cols=20  Identities=10%  Similarity=0.167  Sum_probs=9.1

Q ss_pred             HHHHHHHHHHHHHHHHHHHH
Q psy12432         95 SIAVFLVVILCFSLYVHCQR  114 (229)
Q Consensus        95 ~V~lLLVlii~vvly~~~kR  114 (229)
                      .+.++++++.+++++|-.|+
T Consensus         8 p~sl~l~~~~l~~f~Wavk~   27 (45)
T PF03597_consen    8 PVSLILGLIALAAFLWAVKS   27 (45)
T ss_pred             HHHHHHHHHHHHHHHHHHcc
Confidence            34444444444444444443


No 145
>PF07010 Endomucin:  Endomucin;  InterPro: IPR010740 This family consists of several mammalian endomucin proteins. Endomucin is an early endothelial-specific antigen that is also expressed on putative hematopoietic progenitor cells.
Probab=40.83  E-value=35  Score=30.64  Aligned_cols=6  Identities=50%  Similarity=0.960  Sum_probs=2.4

Q ss_pred             HHHHHH
Q psy12432        108 LYVHCQ  113 (229)
Q Consensus       108 ly~~~k  113 (229)
                      +|..++
T Consensus       210 Lyr~C~  215 (259)
T PF07010_consen  210 LYRMCW  215 (259)
T ss_pred             HHHHhh
Confidence            444343


No 146
>PF11694 DUF3290:  Protein of unknown function (DUF3290);  InterPro: IPR021707  This family of proteins with unknown function appears to be restricted to Firmicutes. 
Probab=40.79  E-value=33  Score=28.46  Aligned_cols=19  Identities=32%  Similarity=0.572  Sum_probs=8.5

Q ss_pred             HHHHHHHHHHHHHHHHHHH
Q psy12432         95 SIAVFLVVILCFSLYVHCQ  113 (229)
Q Consensus        95 ~V~lLLVlii~vvly~~~k  113 (229)
                      .+++++++++.++.|++.+
T Consensus        23 i~~ll~~l~~~~~~Y~r~r   41 (149)
T PF11694_consen   23 IIILLLVLIFFFIKYLRNR   41 (149)
T ss_pred             HHHHHHHHHHHHHHHHHhc
Confidence            3344444444444555443


No 147
>PF04863 EGF_alliinase:  Alliinase EGF-like domain;  InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system [].; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=40.76  E-value=10  Score=26.55  Aligned_cols=36  Identities=22%  Similarity=0.324  Sum_probs=17.9

Q ss_pred             CCCCCCEEEeeccCCCCceEeecCCCCCCCCCcccC
Q psy12432         36 YCLNGATCFTVKIGESLLYNCECADGYMGQRCEFKD   71 (229)
Q Consensus        36 pClNGGtC~~~~~~~~~~~sC~C~~Gy~G~rCe~~~   71 (229)
                      +|...|..+.......+...|.|..-|.|..|+...
T Consensus        18 ~CSGHGr~flDg~~~dG~p~CECn~Cy~GpdCS~~~   53 (56)
T PF04863_consen   18 SCSGHGRAFLDGLIADGSPVCECNSCYGGPDCSTLI   53 (56)
T ss_dssp             --TTSEE--TTS-EETTEE--EE-TTEESTTS-EE-
T ss_pred             CcCCCCeeeeccccccCCccccccCCcCCCCcccCC
Confidence            566666666322111235999999999999999864


No 148
>PF04689 S1FA:  DNA binding protein S1FA;  InterPro: IPR006779  S1FA is an unusual small plant peptide of only 70 amino acids with a basic domain which contains a nuclear localization signal and a putative DNA binding helix. S1FA is highly conserved between dicotyledonous and monocotyledonous plants and may be a DNA-binding protein that specifically recognises the negative promoter element S1F [].; GO: 0003677 DNA binding, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=40.57  E-value=23  Score=25.65  Aligned_cols=14  Identities=29%  Similarity=0.498  Sum_probs=6.6

Q ss_pred             HHHHHHHHHHHHHH
Q psy12432        100 LVVILCFSLYVHCQ  113 (229)
Q Consensus       100 LVlii~vvly~~~k  113 (229)
                      +.++.-+++|.+.+
T Consensus        26 ~flvGnyvlY~Yaq   39 (69)
T PF04689_consen   26 VFLVGNYVLYVYAQ   39 (69)
T ss_pred             HHHHHHHHHHHHHh
Confidence            33344455555544


No 149
>PF00558 Vpu:  Vpu protein;  InterPro: IPR008187 The Human immunodeficiency virus 1 (HIV-1) Vpu protein acts in the degradation of CD4 in the endoplasmic reticulum and in the enhancement of virion release from the plasma membrane of infected cells [].; GO: 0019076 release of virus from host; PDB: 2JPX_A 1PI8_A 2GOH_A 2GOF_A 1PI7_A 1PJE_A 1VPU_A 2K7Y_A.
Probab=40.42  E-value=37  Score=25.58  Aligned_cols=24  Identities=17%  Similarity=0.326  Sum_probs=5.9

Q ss_pred             HHHHHHHHHHHHHHHHHhhhhhhh
Q psy12432         98 VFLVVILCFSLYVHCQRRKKQAQA  121 (229)
Q Consensus        98 lLLVlii~vvly~~~kRrk~~k~~  121 (229)
                      .++|-.+++..|.+.+|+|+..+.
T Consensus        19 aIvvW~iv~ieYrk~~rqrkId~l   42 (81)
T PF00558_consen   19 AIVVWTIVYIEYRKIKRQRKIDRL   42 (81)
T ss_dssp             HHHHHHHH------------CHHH
T ss_pred             HHHHHHHHHHHHHHHHHHHhHHHH
Confidence            444445555677777766666543


No 150
>PF09835 DUF2062:  Uncharacterized protein conserved in bacteria (DUF2062);  InterPro: IPR018639  This domain, found in various prokaryotic proteins, has no known function. It is found at the C-terminal of family 2 glycosyltransferase proteins, in addition to proteins of unknown function.
Probab=40.41  E-value=67  Score=25.90  Aligned_cols=19  Identities=21%  Similarity=0.401  Sum_probs=7.4

Q ss_pred             hhhHHHHHHHHHHHHHHHH
Q psy12432         91 ASGASIAVFLVVILCFSLY  109 (229)
Q Consensus        91 a~gI~V~lLLVlii~vvly  109 (229)
                      ++++++++++.+++-++.+
T Consensus       122 ~G~~i~~~v~~~i~Y~l~~  140 (154)
T PF09835_consen  122 LGSLILGIVLGIISYFLVY  140 (154)
T ss_pred             HHHHHHHHHHHHHHHHHHH
Confidence            3344444443333333333


No 151
>KOG1024|consensus
Probab=40.41  E-value=20  Score=35.03  Aligned_cols=29  Identities=28%  Similarity=0.402  Sum_probs=20.3

Q ss_pred             hhhHHHHHHHHHHHHHHHHHHHHHhhhhh
Q psy12432         91 ASGASIAVFLVVILCFSLYVHCQRRKKQA  119 (229)
Q Consensus        91 a~gI~V~lLLVlii~vvly~~~kRrk~~k  119 (229)
                      ++|++|++.+|.+++.++||.+++.+|+.
T Consensus       188 ~V~v~va~a~v~lva~il~c~~~~s~Kr~  216 (563)
T KOG1024|consen  188 IVGVIVALALVSLVALILYCAKGPSKKRS  216 (563)
T ss_pred             eeeHHHHHHHHHHHHHHHHHHhcccchhh
Confidence            45666777677777777888877776664


No 152
>PRK01658 holin-like protein; Validated
Probab=40.40  E-value=59  Score=26.03  Aligned_cols=22  Identities=27%  Similarity=0.238  Sum_probs=10.1

Q ss_pred             HHHHhhhHHHHHHHHHHHHHHH
Q psy12432         87 TASIASGASIAVFLVVILCFSL  108 (229)
Q Consensus        87 ~~~Ia~gI~V~lLLVlii~vvl  108 (229)
                      .+.|++.+.+..++++++..+.
T Consensus        88 ~~~il~~ivvsT~l~l~vtg~~  109 (122)
T PRK01658         88 GISLFLVVVISTFVVMIVTGYL  109 (122)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHH
Confidence            3444444444444444444443


No 153
>KOG4482|consensus
Probab=39.92  E-value=31  Score=33.14  Aligned_cols=28  Identities=25%  Similarity=0.412  Sum_probs=14.8

Q ss_pred             hhhHHHHHHHHHHHHHHHHHHHHHhhhh
Q psy12432         91 ASGASIAVFLVVILCFSLYVHCQRRKKQ  118 (229)
Q Consensus        91 a~gI~V~lLLVlii~vvly~~~kRrk~~  118 (229)
                      ++...|++||+++++.++|+++.+++++
T Consensus       302 aIpl~Valll~~~La~imc~rrEg~~~r  329 (449)
T KOG4482|consen  302 AIPLGVALLLVLALAYIMCCRREGQKKR  329 (449)
T ss_pred             HHHHHHHHHHHHHHHHHHhhhhhccccc
Confidence            3444455555555555565555544444


No 154
>TIGR01195 oadG_fam sodium pump decarboxylases, gamma subunit. Most sequences scoring between the noise and trusted cutoffs are eukaryotic sodium channel proteins.
Probab=39.87  E-value=67  Score=23.89  Aligned_cols=10  Identities=20%  Similarity=0.418  Sum_probs=4.4

Q ss_pred             hhhHHHHHHH
Q psy12432         91 ASGASIAVFL  100 (229)
Q Consensus        91 a~gI~V~lLL  100 (229)
                      ++|+++++++
T Consensus        12 v~GM~~VF~f   21 (82)
T TIGR01195        12 VLGMGIVFLF   21 (82)
T ss_pred             HHHHHHHHHH
Confidence            4444444333


No 155
>PF07271 Cytadhesin_P30:  Cytadhesin P30/P32;  InterPro: IPR009896 This family consists of several Mycoplasma species specific Cytadhesin P32 and P30 proteins. P30 has been found to be membrane associated and localised on the tip organelle. It is thought that it is important in cytadherence and virulence [].; GO: 0007157 heterophilic cell-cell adhesion, 0009405 pathogenesis, 0016021 integral to membrane
Probab=39.31  E-value=15  Score=33.57  Aligned_cols=9  Identities=22%  Similarity=0.479  Sum_probs=3.6

Q ss_pred             CCccCCCCc
Q psy12432        140 PFERRPSPA  148 (229)
Q Consensus       140 ~~~~~~~p~  148 (229)
                      ++...++|.
T Consensus       148 ~v~~~p~p~  156 (279)
T PF07271_consen  148 QVPVNPQPQ  156 (279)
T ss_pred             CCCCCCCCc
Confidence            344444443


No 156
>PF14991 MLANA:  Protein melan-A; PDB: 2GTZ_F 2GT9_F 3MRO_P 2GUO_C 3MRQ_P 2GTW_C 3L6F_C 3MRP_P.
Probab=39.16  E-value=6  Score=31.68  Aligned_cols=33  Identities=24%  Similarity=0.144  Sum_probs=1.8

Q ss_pred             HHHHHhhhHHHHHHHHHHHHHHHHHHHHHhhhhh
Q psy12432         86 ETASIASGASIAVFLVVILCFSLYVHCQRRKKQA  119 (229)
Q Consensus        86 ~~~~Ia~gI~V~lLLVlii~vvly~~~kRrk~~k  119 (229)
                      |.++|.+-++|+.+|++|.| |+|-++.-.|..+
T Consensus        25 EAaGIGiL~VILgiLLliGC-WYckRRSGYk~L~   57 (118)
T PF14991_consen   25 EAAGIGILIVILGILLLIGC-WYCKRRSGYKTLR   57 (118)
T ss_dssp             ---SSS----------------------------
T ss_pred             HhccceeHHHHHHHHHHHhh-eeeeecchhhhhh
Confidence            45566554544444445554 7889887666554


No 157
>PF10883 DUF2681:  Protein of unknown function (DUF2681);  InterPro: IPR020274 This entry contains membrane proteins with no known function.
Probab=39.06  E-value=52  Score=25.07  Aligned_cols=8  Identities=13%  Similarity=0.351  Sum_probs=3.4

Q ss_pred             HHHHHHHH
Q psy12432        107 SLYVHCQR  114 (229)
Q Consensus       107 vly~~~kR  114 (229)
                      .++|+.+|
T Consensus        20 y~~~k~~k   27 (87)
T PF10883_consen   20 YLWWKVKK   27 (87)
T ss_pred             HHHHHHHH
Confidence            44444443


No 158
>PRK01844 hypothetical protein; Provisional
Probab=38.87  E-value=79  Score=23.31  Aligned_cols=25  Identities=8%  Similarity=0.014  Sum_probs=15.4

Q ss_pred             HHHHHHHHHHHHHHHhhhhhhhccc
Q psy12432        100 LVVILCFSLYVHCQRRKKQAQAASV  124 (229)
Q Consensus       100 LVlii~vvly~~~kRrk~~k~~~~~  124 (229)
                      |.+++.+++=++..|+.-+|.+..|
T Consensus        12 ~~li~G~~~Gff~ark~~~k~lk~N   36 (72)
T PRK01844         12 VALVAGVALGFFIARKYMMNYLQKN   36 (72)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHC
Confidence            5556666666666677666665443


No 159
>KOG3653|consensus
Probab=38.18  E-value=1e+02  Score=30.67  Aligned_cols=9  Identities=44%  Similarity=1.106  Sum_probs=5.1

Q ss_pred             ceEeecCCC
Q psy12432         53 LYNCECADG   61 (229)
Q Consensus        53 ~~sC~C~~G   61 (229)
                      .|.|-|..+
T Consensus       115 ~~~CcCs~~  123 (534)
T KOG3653|consen  115 LYFCCCSTD  123 (534)
T ss_pred             EEEEecCCC
Confidence            466666443


No 160
>PF11157 DUF2937:  Protein of unknown function (DUF2937);  InterPro: IPR022584  This family of proteins with unknown function appears to be found mainly in Proteobacteria. 
Probab=38.07  E-value=51  Score=27.74  Aligned_cols=17  Identities=24%  Similarity=0.149  Sum_probs=9.1

Q ss_pred             hHHHHHhhhHHHHHHHH
Q psy12432         85 LETASIASGASIAVFLV  101 (229)
Q Consensus        85 ~~~~~Ia~gI~V~lLLV  101 (229)
                      +...+|+.|++.++++.
T Consensus       131 lt~~gi~~g~vg~l~~~  147 (167)
T PF11157_consen  131 LTPEGIVFGLVGALLGA  147 (167)
T ss_pred             CCHHHHHHHHHHHHHHH
Confidence            34556666665555433


No 161
>PF03908 Sec20:  Sec20;  InterPro: IPR005606 Sec20 is a membrane glycoprotein associated with secretory pathway.
Probab=37.96  E-value=38  Score=25.32  Aligned_cols=15  Identities=27%  Similarity=0.549  Sum_probs=7.1

Q ss_pred             HHHHHHHHHHHHHHH
Q psy12432        100 LVVILCFSLYVHCQR  114 (229)
Q Consensus       100 LVlii~vvly~~~kR  114 (229)
                      +++++++++|+.++|
T Consensus        77 ~~~f~~~v~yI~~rR   91 (92)
T PF03908_consen   77 FLFFLLVVLYILWRR   91 (92)
T ss_pred             HHHHHHHHHHHhhhc
Confidence            334444445555554


No 162
>PF10883 DUF2681:  Protein of unknown function (DUF2681);  InterPro: IPR020274 This entry contains membrane proteins with no known function.
Probab=37.42  E-value=85  Score=23.89  Aligned_cols=26  Identities=15%  Similarity=0.268  Sum_probs=15.3

Q ss_pred             HhhhHHHHHHHHHHHHHHHHHHHHHh
Q psy12432         90 IASGASIAVFLVVILCFSLYVHCQRR  115 (229)
Q Consensus        90 Ia~gI~V~lLLVlii~vvly~~~kRr  115 (229)
                      |+++++.++++++.-++|-..+-+++
T Consensus         6 iv~~~~~v~~~i~~y~~~k~~ka~~~   31 (87)
T PF10883_consen    6 IVGGVGAVVALILAYLWWKVKKAKKQ   31 (87)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            34445555555666666666666655


No 163
>PF07297 DPM2:  Dolichol phosphate-mannose biosynthesis regulatory protein (DPM2);  InterPro: IPR009914 This family consists of several eukaryotic dolichol phosphate-mannose biosynthesis regulatory (DPM2) proteins. Biosynthesis of glycosylphosphatidylinositol and N-glycan precursor is dependent upon a mannosyl donor, dolichol phosphate-mannose (DPM). DPM2, an 84 amino acid membrane protein expressed in the endoplasmic reticulum (ER), makes a complex with DPM1 that is essential for the ER localisation and stable expression of DPM1. Moreover, DPM2 enhances binding of dolichol phosphate, a substrate of DPM synthase. Biosynthesis of DPM in mammalian cells is regulated by DPM2 [].; GO: 0009059 macromolecule biosynthetic process, 0030176 integral to endoplasmic reticulum membrane
Probab=37.27  E-value=59  Score=24.29  Aligned_cols=28  Identities=21%  Similarity=0.202  Sum_probs=11.5

Q ss_pred             HHhhhHHHHHH-HHHHHHHHHHHHHHHhh
Q psy12432         89 SIASGASIAVF-LVVILCFSLYVHCQRRK  116 (229)
Q Consensus        89 ~Ia~gI~V~lL-LVlii~vvly~~~kRrk  116 (229)
                      +|.+.+.++++ +.++..++.+...|.+|
T Consensus        47 Ai~lP~~lll~~~~~vg~f~g~vmik~~~   75 (78)
T PF07297_consen   47 AIILPIFLLLLGLSGVGTFLGYVMIKSKK   75 (78)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHhhccc
Confidence            34444444443 33333344444444333


No 164
>PF09472 MtrF:  Tetrahydromethanopterin S-methyltransferase, F subunit (MtrF);  InterPro: IPR013347  Many archaea have evolved energy-yielding pathways marked by one-carbon biochemistry featuring novel cofactors and enzymes. This domain is mostly found in MtrF, where it covers the entire length of the protein. This polypeptide is one of eight subunits of the N5-methyltetrahydromethanopterin: coenzyme M methyltransferase complex found in methanogenic archaea. This is a membrane-associated enzyme complex that uses methyl-transfer reactions to drive a sodium-ion pump []. MtrF itself is involved in the transfer of the methyl group from N5-methyltetrahydromethanopterin to coenzyme M. Subsequently, methane is produced by two-electron reduction of the methyl moiety in methyl-coenzyme M by another enzyme, methyl-coenzyme M reductase. In some organisms this domain is found at the C-terminal region of what appears to be a fusion of the MtrA and MtrF proteins [, ]. The function of these proteins is unknown, though it is likely that they are involved in C1 metabolism.; GO: 0030269 tetrahydromethanopterin S-methyltransferase activity, 0015948 methanogenesis, 0016020 membrane
Probab=36.99  E-value=41  Score=24.20  Aligned_cols=20  Identities=30%  Similarity=0.255  Sum_probs=11.2

Q ss_pred             HHHHhhhHHHHHHHHHHHHH
Q psy12432         87 TASIASGASIAVFLVVILCF  106 (229)
Q Consensus        87 ~~~Ia~gI~V~lLLVlii~v  106 (229)
                      ..+.++|+.++++++++..+
T Consensus        42 ~~GfaiG~~~AlvLv~ip~~   61 (64)
T PF09472_consen   42 IKGFAIGFLFALVLVGIPIL   61 (64)
T ss_pred             hHHHHHHHHHHHHHHHHHHH
Confidence            34556666666655555443


No 165
>PF09835 DUF2062:  Uncharacterized protein conserved in bacteria (DUF2062);  InterPro: IPR018639  This domain, found in various prokaryotic proteins, has no known function. It is found at the C-terminal of family 2 glycosyltransferase proteins, in addition to proteins of unknown function.
Probab=36.34  E-value=72  Score=25.74  Aligned_cols=12  Identities=17%  Similarity=0.412  Sum_probs=5.5

Q ss_pred             HHHhhhHHHHHH
Q psy12432         88 ASIASGASIAVF   99 (229)
Q Consensus        88 ~~Ia~gI~V~lL   99 (229)
                      ..++.|++++++
T Consensus       123 G~~i~~~v~~~i  134 (154)
T PF09835_consen  123 GSLILGIVLGII  134 (154)
T ss_pred             HHHHHHHHHHHH
Confidence            344444444444


No 166
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=36.31  E-value=27  Score=22.81  Aligned_cols=16  Identities=38%  Similarity=0.931  Sum_probs=14.2

Q ss_pred             eEeecCCCCCCCCCcc
Q psy12432         54 YNCECADGYMGQRCEF   69 (229)
Q Consensus        54 ~sC~C~~Gy~G~rCe~   69 (229)
                      ..|.|..++.|.+|+.
T Consensus        18 G~C~C~~~~~G~~C~~   33 (46)
T smart00180       18 GQCECKPNVTGRRCDR   33 (46)
T ss_pred             CEEECCCCCCCCCCCc
Confidence            5899999999999984


No 167
>PF01414 DSL:  Delta serrate ligand;  InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=34.74  E-value=19  Score=25.57  Aligned_cols=41  Identities=24%  Similarity=0.555  Sum_probs=19.1

Q ss_pred             ceeeCCCCCCCCCCCCCCEEEeeccCCCCceEee------cCCCCCCCCC
Q psy12432         24 HTYACPPTYATWYCLNGATCFTVKIGESLLYNCE------CADGYMGQRC   67 (229)
Q Consensus        24 y~C~C~~gy~gnpClNGGtC~~~~~~~~~~~sC~------C~~Gy~G~rC   67 (229)
                      +.-.|.+.|.|..|..  .|....+. ...|.|.      |..||.|..|
T Consensus        17 ~rv~C~~nyyG~~C~~--~C~~~~d~-~ghy~Cd~~G~~~C~~Gw~G~~C   63 (63)
T PF01414_consen   17 IRVVCDENYYGPNCSK--FCKPRDDS-FGHYTCDSNGNKVCLPGWTGPNC   63 (63)
T ss_dssp             ------TTEETTTT-E--E---EEET-TEEEEE-SS--EEE-TTEESTTS
T ss_pred             EEEECCCCCCCccccC--CcCCCcCC-cCCcccCCCCCCCCCCCCcCCCC
Confidence            4678888888877753  45443211 2357774      8888998877


No 168
>PF03554 Herpes_UL73:  UL73 viral envelope glycoprotein  ;  InterPro: IPR005211 This entry represents a conserved region found in a number of viral proteins: BLRF1, U46, 53, and UL73, collectively known as glycoprotein N. These UL73-like envelope glycoproteins, which associate in a high molecular mass complex with their counterpart protein gM, induce neutralizing antibody responses in the host. These glycoproteins are highly polymorphic, particularly in the N-terminal region [].; GO: 0019031 viral envelope
Probab=34.67  E-value=1.2e+02  Score=22.89  Aligned_cols=29  Identities=17%  Similarity=0.277  Sum_probs=14.6

Q ss_pred             HHHHHhhhHHHHHHHHHHHHHHHHHHHHH
Q psy12432         86 ETASIASGASIAVFLVVILCFSLYVHCQR  114 (229)
Q Consensus        86 ~~~~Ia~gI~V~lLLVlii~vvly~~~kR  114 (229)
                      .+..|.+-+-+.+|++..++.+.|+.++|
T Consensus        47 SFsSIW~iiN~~il~~A~~vyLry~Cf~~   75 (82)
T PF03554_consen   47 SFSSIWAIINVVILLCAFCVYLRYLCFQK   75 (82)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            45555543444444445555555555554


No 169
>PF12768 Rax2:  Cortical protein marker for cell polarity
Probab=34.54  E-value=41  Score=30.66  Aligned_cols=12  Identities=25%  Similarity=0.338  Sum_probs=5.6

Q ss_pred             HHHhhhHHHHHH
Q psy12432         88 ASIASGASIAVF   99 (229)
Q Consensus        88 ~~Ia~gI~V~lL   99 (229)
                      ..|++|+++++.
T Consensus       234 lAiALG~v~ll~  245 (281)
T PF12768_consen  234 LAIALGTVFLLV  245 (281)
T ss_pred             hHHHHHHHHHHH
Confidence            345555544443


No 170
>PHA02887 EGF-like protein; Provisional
Probab=34.47  E-value=32  Score=27.85  Aligned_cols=21  Identities=14%  Similarity=0.265  Sum_probs=16.6

Q ss_pred             CCCCCceeeCCCCCCCCCCCC
Q psy12432         19 PNITFHTYACPPTYATWYCLN   39 (229)
Q Consensus        19 ~~~~gy~C~C~~gy~gnpClN   39 (229)
                      .+.....|.|..||.|..|..
T Consensus       103 ~dL~epsCrC~~GYtG~RCE~  123 (126)
T PHA02887        103 IDLDEKFCICNKGYTGIRCDE  123 (126)
T ss_pred             ccCCCceeECCCCcccCCCCc
Confidence            345568999999999888853


No 171
>PTZ00233 variable surface protein Vir18; Provisional
Probab=34.36  E-value=28  Score=34.50  Aligned_cols=27  Identities=15%  Similarity=0.315  Sum_probs=12.4

Q ss_pred             HHHHHHHHHHHHHHHHHHHHhhhhhhh
Q psy12432         95 SIAVFLVVILCFSLYVHCQRRKKQAQA  121 (229)
Q Consensus        95 ~V~lLLVlii~vvly~~~kRrk~~k~~  121 (229)
                      +++|||-||+=..-.|+..-||+||+-
T Consensus       443 GIvLLLGLLFKyTPLWRvLTKknRKk~  469 (509)
T PTZ00233        443 GIALLLGLLFKYTPLWRVLTKKNRKKG  469 (509)
T ss_pred             hHHHHHHHhhccchhHHhhhhcccccc
Confidence            344444333334445555555555543


No 172
>KOG3054|consensus
Probab=34.26  E-value=41  Score=30.62  Aligned_cols=18  Identities=22%  Similarity=0.171  Sum_probs=9.4

Q ss_pred             CCCccc-ccCCCCCCcCCCcc
Q psy12432        165 TRTSIT-ITGKGDSVSASQLY  184 (229)
Q Consensus       165 ~~~~ld-~~~~g~~~~~~~~f  184 (229)
                      ++++-| -++.|++  +-|++
T Consensus        78 ~sd~ee~~~~dg~e--e~~e~   96 (299)
T KOG3054|consen   78 TSDVEEEGSGDGDE--EEPEA   96 (299)
T ss_pred             cccccccccccccc--ccccc
Confidence            345555 5555655  55544


No 173
>PRK13275 mtrF tetrahydromethanopterin S-methyltransferase subunit F; Provisional
Probab=34.22  E-value=78  Score=23.02  Aligned_cols=20  Identities=30%  Similarity=0.373  Sum_probs=11.7

Q ss_pred             HHHHhhhHHHHHHHHHHHHH
Q psy12432         87 TASIASGASIAVFLVVILCF  106 (229)
Q Consensus        87 ~~~Ia~gI~V~lLLVlii~v  106 (229)
                      +.++++|..++++++++...
T Consensus        42 ~~G~aiG~~~AlvLv~ip~~   61 (67)
T PRK13275         42 IIGFAIGFLLALLLVVVPPL   61 (67)
T ss_pred             HHHHHHHHHHHHHHHHHHHH
Confidence            45566666666665554443


No 174
>PF02430 AMA-1:  Apical membrane antigen 1;  InterPro: IPR003298 A novel antigen of Plasmodium falciparum has been cloned that contains a hydrophobic domain typical of an integral membrane protein. The antigen is designated apical membrane antigen 1 (AMA-1) by virtue of appearing to be located in the apical complex []. AMA-1 appears to be transported to the merozoite surface close to the time of schizont rupture. The 66kDa merozoite surface antigen (PK66) of Plasmodium knowlesi, a simian malaria, possesses vaccine-related properties believed to originate from a receptor-like role in parasite invasion of erythrocytes []. The sequence of PK66 is conserved throughout plasmodium, and shows high similarity to P. falciparum AMA-1. Following schizont rupture, the distribution of PK66 changes in a coordinate manner associated with merozoite invasion. Prior to rupture, the protein is concentrated at the apical end, following which it distributes itself entirely across the surface of the free merozoite. Immunofluorescence studies suggest that, during invasion, PK66 is excluded from the erythrocyte at, and behind, the invasion interface []. ; GO: 0009405 pathogenesis, 0016020 membrane; PDB: 2Y8R_D 2X2Z_A 2Y8T_A 2Y8S_D 1HN6_A 2Z8V_B 2Z8W_A 1Z40_E 2Q8B_A 2Q8A_A ....
Probab=32.57  E-value=15  Score=35.92  Aligned_cols=31  Identities=16%  Similarity=0.215  Sum_probs=0.0

Q ss_pred             HHHHhhhHHHHHHHHHHHHHHHHHHHHHhhh
Q psy12432         87 TASIASGASIAVFLVVILCFSLYVHCQRRKK  117 (229)
Q Consensus        87 ~~~Ia~gI~V~lLLVlii~vvly~~~kRrk~  117 (229)
                      +.+|++++.++++||.|++.++|-+.++.|+
T Consensus       429 ~~~~~~~~~~~~~~~~l~~~~~~k~~~~~kk  459 (471)
T PF02430_consen  429 RIVIIISIATGAVLLALAIYWYYKRLWKTKK  459 (471)
T ss_dssp             -------------------------------
T ss_pred             eeehhhHHHHHHHHHHHHHHHHHhhhhcccc
Confidence            3445555666665555555555554444433


No 175
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=32.44  E-value=45  Score=21.03  Aligned_cols=18  Identities=33%  Similarity=0.894  Sum_probs=13.0

Q ss_pred             EEEeeccCCCCceEeecCCCCC
Q psy12432         42 TCFTVKIGESLLYNCECADGYM   63 (229)
Q Consensus        42 tC~~~~~~~~~~~sC~C~~Gy~   63 (229)
                      .|.+..    ..|.|.|+.||.
T Consensus        11 ~C~~~~----g~~~C~C~~Gy~   28 (36)
T PF14670_consen   11 ICVNTP----GSYRCSCPPGYK   28 (36)
T ss_dssp             EEEEET----TSEEEE-STTEE
T ss_pred             CCccCC----CceEeECCCCCE
Confidence            677764    369999999984


No 176
>PF11395 DUF2873:  Protein of unknown function (DUF2873);  InterPro: IPR021532 This entry is represented by the human SARS coronavirus, Orf7b; it is a family of uncharacterised viral proteins.
Probab=32.39  E-value=61  Score=21.08  Aligned_cols=11  Identities=36%  Similarity=0.781  Sum_probs=4.5

Q ss_pred             HHHHHHHHHHH
Q psy12432         98 VFLVVILCFSL  108 (229)
Q Consensus        98 lLLVlii~vvl  108 (229)
                      +++|+|+.+++
T Consensus        18 lflv~imliif   28 (43)
T PF11395_consen   18 LFLVIIMLIIF   28 (43)
T ss_pred             HHHHHHHHHHH
Confidence            33444444333


No 177
>PRK11486 flagellar biosynthesis protein FliO; Provisional
Probab=32.26  E-value=59  Score=26.31  Aligned_cols=19  Identities=16%  Similarity=0.148  Sum_probs=8.3

Q ss_pred             HHHHHHHHHHHHHHHHHHh
Q psy12432         97 AVFLVVILCFSLYVHCQRR  115 (229)
Q Consensus        97 ~lLLVlii~vvly~~~kRr  115 (229)
                      +|++|+.+++++.|..||-
T Consensus        24 ~L~lVl~lI~~~aWLlkR~   42 (124)
T PRK11486         24 ALIGIIALILAAAWLVKRL   42 (124)
T ss_pred             HHHHHHHHHHHHHHHHHHc
Confidence            3333444444444545553


No 178
>PF13124 DUF3963:  Protein of unknown function (DUF3963)
Probab=31.72  E-value=84  Score=20.29  Aligned_cols=21  Identities=14%  Similarity=0.099  Sum_probs=13.6

Q ss_pred             HHHHhhhHHHHHHHHHHHHHH
Q psy12432         87 TASIASGASIAVFLVVILCFS  107 (229)
Q Consensus        87 ~~~Ia~gI~V~lLLVlii~vv  107 (229)
                      .+.-.+..|+++|+|.++.+|
T Consensus        18 kwirnit~cfal~vv~lvslw   38 (40)
T PF13124_consen   18 KWIRNITFCFALLVVVLVSLW   38 (40)
T ss_pred             HHHHHHHHHHHHHHHHHHHHH
Confidence            344455567888877777655


No 179
>COG3115 ZipA Cell division protein [Cell division and chromosome partitioning]
Probab=31.68  E-value=1.6e+02  Score=27.47  Aligned_cols=10  Identities=30%  Similarity=0.155  Sum_probs=5.3

Q ss_pred             CCCCchhhhh
Q psy12432        208 SQPPDDIKAD  217 (229)
Q Consensus       208 ~~~~~~~~~~  217 (229)
                      ++++..++|+
T Consensus       133 ~~~~~~~~a~  142 (324)
T COG3115         133 RQPVLPQEAP  142 (324)
T ss_pred             ccCccccccc
Confidence            4455555555


No 180
>PF01034 Syndecan:  Syndecan domain;  InterPro: IPR001050 The syndecans are transmembrane proteoglycans which are involved in the organisation of cytoskeleton and/or actin microfilaments, and have important roles as cell surface receptors during cell-cell and/or cell-matrix interactions [, ]. Structurally, these proteins consist of four separate domains:   A signal sequence; An extracellular domain (ectodomain) of variable length whose sequence is not evolutionary conserved in the various forms of syndecans. The ectodomain contains the sites of attachment of the heparan sulphate glycosaminoglycan side chains;  A transmembrane region;  A highly conserved cytoplasmic domain of about 30 to 35 residues, which could interact with cytoskeletal proteins.    The proteins known to belong to this family are:    Syndecan 1.  Syndecan 2 or fibroglycan.  Syndecan 3 or neuroglycan or N-syndecan.  Syndecan 4 or amphiglycan or ryudocan.  Drosophila syndecan.   Caenorhabditis elegans probable syndecan (F57C7.3).    Syndecan-4, a transmembrane heparan sulphate proteoglycan, is a coreceptor with integrins in cell adhesion. It has been suggested to form a ternary signalling complex with protein kinase Calpha and phosphatidylinositol 4,5-bisphosphate (PIP2). Structural studies have demonstrated that the cytoplasmic domain undergoes a conformational transition and forms a symmetric dimer in the presence of phospholipid activator PIP2, and whose overall structure in solution exhibits a twisted clamp shape having a cavity in the centre of dimeric interface. In addition, it has been observed that the syndecan-4 variable domain interacts, strongly, not only with fatty acyl groups but also the anionic head group of PIP2. These findings indicate that PIP2 promotes oligomerisation of the syndecan-4 cytoplasmic domain for transmembrane signalling and cell-matrix adhesion [, ].; GO: 0008092 cytoskeletal protein binding, 0016020 membrane; PDB: 1EJQ_B 1EJP_B 1YBO_C 1OBY_Q.
Probab=31.66  E-value=17  Score=26.27  Aligned_cols=17  Identities=24%  Similarity=0.386  Sum_probs=0.0

Q ss_pred             HHhhhHHHHHHHHHHHH
Q psy12432         89 SIASGASIAVFLVVILC  105 (229)
Q Consensus        89 ~Ia~gI~V~lLLVlii~  105 (229)
                      ++++|++++++|+++++
T Consensus        17 G~Vvgll~ailLIlf~i   33 (64)
T PF01034_consen   17 GGVVGLLFAILLILFLI   33 (64)
T ss_dssp             -----------------
T ss_pred             HHHHHHHHHHHHHHHHH
Confidence            33444444444444444


No 181
>PF14584 DUF4446:  Protein of unknown function (DUF4446)
Probab=31.63  E-value=1e+02  Score=25.53  Aligned_cols=23  Identities=17%  Similarity=0.226  Sum_probs=11.5

Q ss_pred             hHHHHHHHHHHHHHHHHHHHHHh
Q psy12432         93 GASIAVFLVVILCFSLYVHCQRR  115 (229)
Q Consensus        93 gI~V~lLLVlii~vvly~~~kRr  115 (229)
                      ++++++|+++++...-+-+.+|+
T Consensus         9 ~l~iilli~~~~~~~kl~kl~r~   31 (151)
T PF14584_consen    9 VLVIILLILIIILNIKLRKLKRR   31 (151)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHH
Confidence            44445555555555555554444


No 182
>PF12158 DUF3592:  Protein of unknown function (DUF3592);  InterPro: IPR021994  This family of proteins is functionally uncharacterised.This family of proteins is found in bacteria, archaea, eukaryotes and viruses. Proteins in this family are typically between 150 and 242 amino acids in length. 
Probab=31.46  E-value=70  Score=24.85  Aligned_cols=9  Identities=22%  Similarity=0.552  Sum_probs=4.6

Q ss_pred             ccccCCCCC
Q psy12432        183 LYHQSSPPL  191 (229)
Q Consensus       183 ~f~~~~~~~  191 (229)
                      -|++.||..
T Consensus       101 ~Y~P~~P~~  109 (148)
T PF12158_consen  101 YYNPNNPEE  109 (148)
T ss_pred             EECCcCCCe
Confidence            455555544


No 183
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=31.21  E-value=50  Score=28.18  Aligned_cols=36  Identities=22%  Similarity=0.544  Sum_probs=23.7

Q ss_pred             CCcee----eCCCCCCCCCCCCCCEEEeeccCCCCceEeecCCCCCCC
Q psy12432         22 TFHTY----ACPPTYATWYCLNGATCFTVKIGESLLYNCECADGYMGQ   65 (229)
Q Consensus        22 ~gy~C----~C~~gy~gnpClNGGtC~~~~~~~~~~~sC~C~~Gy~G~   65 (229)
                      .+..|    .|..+  ...|.  ..|.+..    +.|.|.|..||+..
T Consensus       180 ~~~~C~~~~~C~~~--~~~c~--~~C~~~~----g~~~c~c~~g~~~~  219 (224)
T cd01475         180 QGKICVVPDLCATL--SHVCQ--QVCISTP----GSYLCACTEGYALL  219 (224)
T ss_pred             ccccCcCchhhcCC--CCCcc--ceEEcCC----CCEEeECCCCccCC
Confidence            34556    47653  24565  3687653    36999999999764


No 184
>PF15339 Afaf:  Acrosome formation-associated factor
Probab=31.04  E-value=1e+02  Score=26.69  Aligned_cols=6  Identities=33%  Similarity=0.445  Sum_probs=2.5

Q ss_pred             hhhHHH
Q psy12432         91 ASGASI   96 (229)
Q Consensus        91 a~gI~V   96 (229)
                      ..||.+
T Consensus       133 mLGIsL  138 (200)
T PF15339_consen  133 MLGISL  138 (200)
T ss_pred             HHHHHH
Confidence            344443


No 185
>PF13703 PepSY_TM_2:  PepSY-associated TM helix
Probab=30.98  E-value=80  Score=23.18  Aligned_cols=15  Identities=13%  Similarity=0.379  Sum_probs=6.0

Q ss_pred             hhhHHHHHHHHHHHH
Q psy12432         91 ASGASIAVFLVVILC  105 (229)
Q Consensus        91 a~gI~V~lLLVlii~  105 (229)
                      ++++..++++++++.
T Consensus        19 iv~~~al~~l~~~is   33 (88)
T PF13703_consen   19 IVGILALLLLLLLIS   33 (88)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            444443333333333


No 186
>PF05356 Phage_Coat_B:  Phage Coat protein B ;  InterPro: IPR008020 The major coat protein in the capsid of filamentous bacteriophage forms a helical assembly of about 7000 identical protomers, with each protomer comprised of 46 amino acids, after the cleavage of the signal peptide. Each protomer forms a slightly curved helix that combines to form a tubular structure that encapsulates the viral DNA [].; PDB: 1QL1_A 2XKM_A 4IFM_A 1QL2_A 1IFM_A 2KLV_A 1IFN_A 2IFN_A 3IFM_A 2KSJ_A ....
Probab=30.89  E-value=91  Score=23.34  Aligned_cols=23  Identities=13%  Similarity=0.301  Sum_probs=13.0

Q ss_pred             hhhHHHHHHHHHHHHHHHHHHHH
Q psy12432         91 ASGASIAVFLVVILCFSLYVHCQ  113 (229)
Q Consensus        91 a~gI~V~lLLVlii~vvly~~~k  113 (229)
                      ++|.+|++|+++.+..++|-..+
T Consensus        59 vgg~IVgvl~~laVaGlI~~l~R   81 (83)
T PF05356_consen   59 VGGYIVGVLVILAVAGLIYSLLR   81 (83)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             hhHHHHHHHHHHHHHHHHHHHHh
Confidence            34556666666666665554433


No 187
>PF13268 DUF4059:  Protein of unknown function (DUF4059)
Probab=30.74  E-value=74  Score=23.47  Aligned_cols=19  Identities=11%  Similarity=0.254  Sum_probs=9.0

Q ss_pred             HHHHHHHHHHHHHHHhhhh
Q psy12432        100 LVVILCFSLYVHCQRRKKQ  118 (229)
Q Consensus       100 LVlii~vvly~~~kRrk~~  118 (229)
                      ++++.++|+.++..|++.+
T Consensus        20 V~~~~~~wi~~Ra~~~~DK   38 (72)
T PF13268_consen   20 VLLVSGIWILWRALRKKDK   38 (72)
T ss_pred             HHHHHHHHHHHHHHHcCCC
Confidence            3444445555555444433


No 188
>PF12606 RELT:  Tumour necrosis factor receptor superfamily member 19;  InterPro: IPR022248 The members of tumor necrosis factor receptor (TNFR) superfamily have been designated as the "guardians of the immune system" due to their roles in immune cell proliferation, differentiation, activation, and death (apoptosis).  RELT (receptor expressed in lymphoid tissues) is a member of the TNFR superfamily. The messenger RNA of RELT is especially abundant in hematologic tissues such as spleen, lymph node, and peripheral blood leukocytes as well as in leukemias and lymphomas. RELT is able to activate the NF-kappaB pathway and selectively binds tumor necrosis factor receptor-associated factor 1 []. RELT like proteins 1 and 2 (RELL1 and RELL2) are two RELT homologues that bind to RELT. The expression of RELL1 at the mRNA level is ubiquitous, whereas expression of RELL2 mRNA is more restricted to particular tissues [].
Probab=30.72  E-value=66  Score=22.02  Aligned_cols=17  Identities=18%  Similarity=0.049  Sum_probs=6.1

Q ss_pred             HHHHHHHHHHHHHHHHh
Q psy12432         99 FLVVILCFSLYVHCQRR  115 (229)
Q Consensus        99 LLVlii~vvly~~~kRr  115 (229)
                      |+|++++++..|...|+
T Consensus        10 ~iv~~lLg~~I~~~~K~   26 (50)
T PF12606_consen   10 FIVMGLLGLSICTTLKA   26 (50)
T ss_pred             HHHHHHHHHHHHHHhhc
Confidence            33333333333333333


No 189
>TIGR00859 ENaC sodium channel transporter. This model is designed from the vertebrate members of the ENaC family.
Probab=30.05  E-value=3.5e+02  Score=27.20  Aligned_cols=22  Identities=23%  Similarity=0.336  Sum_probs=8.7

Q ss_pred             CcCCCccccCCCCCCCCCCCCCC
Q psy12432        178 VSASQLYHQSSPPLPHMSHPPSC  200 (229)
Q Consensus       178 ~~~~~~f~~~~~~~~~~~~~~~~  200 (229)
                      +...+.+++. ..-+++.+++-+
T Consensus       564 ~~~~~~~~~~-~~~~~~~~~~~~  585 (595)
T TIGR00859       564 LDDPPTFPSA-LPLPHASGLSLP  585 (595)
T ss_pred             ccCCCCCCcc-cCCCCccCCCCC
Confidence            3334455443 333344444333


No 190
>PRK13707 conjugal transfer pilus assembly protein TraL; Provisional
Probab=29.71  E-value=1.2e+02  Score=23.55  Aligned_cols=14  Identities=14%  Similarity=0.199  Sum_probs=6.0

Q ss_pred             HHHHHHHHHhhhhh
Q psy12432        106 FSLYVHCQRRKKQA  119 (229)
Q Consensus       106 vvly~~~kRrk~~k  119 (229)
                      ++++...+|-|+.+
T Consensus        53 ~~~~~~~r~lK~g~   66 (101)
T PRK13707         53 VLVWFGIRKLKKGR   66 (101)
T ss_pred             HHHHHHHHHHHcCC
Confidence            34444444444444


No 191
>KOG3054|consensus
Probab=29.60  E-value=52  Score=29.99  Aligned_cols=8  Identities=25%  Similarity=0.434  Sum_probs=3.2

Q ss_pred             HHHHHhhh
Q psy12432        110 VHCQRRKK  117 (229)
Q Consensus       110 ~~~kRrk~  117 (229)
                      +.++|++.
T Consensus        21 ~l~~r~ra   28 (299)
T KOG3054|consen   21 FLWKRRRA   28 (299)
T ss_pred             HHHHhhcc
Confidence            33444433


No 192
>PF13131 DUF3951:  Protein of unknown function (DUF3951)
Probab=29.49  E-value=58  Score=22.55  Aligned_cols=19  Identities=16%  Similarity=0.481  Sum_probs=8.3

Q ss_pred             HHHHHHHHHHHHHHHHHHH
Q psy12432         96 IAVFLVVILCFSLYVHCQR  114 (229)
Q Consensus        96 V~lLLVlii~vvly~~~kR  114 (229)
                      +.+++.+++.++.|..+-+
T Consensus        11 ~~~~I~~lIgfity~mfV~   29 (53)
T PF13131_consen   11 FTIFIFFLIGFITYKMFVK   29 (53)
T ss_pred             HHHHHHHHHHHHHHHhhee
Confidence            3444444444444444443


No 193
>TIGR02736 cbb3_Q_epsi cytochrome c oxidase, cbb3-type, CcoQ subunit, epsilon-Proteobacterial. Members of this protein family are restricted to the epsilon branch of the Proteobacteria. All members are found in operons containing the other three structural subunits of the cbb3 type of cytochrome c oxidase. These small proteins show remote sequence similarity to the CcoQ subunit in other cytochrome c oxidase systems, so this family is assumed to represent the epsilonproteobacterial variant of CcoQ.
Probab=28.46  E-value=74  Score=22.36  Aligned_cols=19  Identities=32%  Similarity=0.443  Sum_probs=7.6

Q ss_pred             HHHHHHHHHHHHHHHhhhh
Q psy12432        100 LVVILCFSLYVHCQRRKKQ  118 (229)
Q Consensus       100 LVlii~vvly~~~kRrk~~  118 (229)
                      |++++-.-+|..++++|+-
T Consensus        11 lvv~LYgY~yhLYrsek~G   29 (56)
T TIGR02736        11 LVIFLYAYIYHLYRSQKKG   29 (56)
T ss_pred             HHHHHHHHHHHhhhhhccc
Confidence            3333333344444444433


No 194
>PRK04125 murein hydrolase regulator LrgA; Provisional
Probab=28.44  E-value=1.2e+02  Score=24.95  Aligned_cols=21  Identities=14%  Similarity=0.036  Sum_probs=9.8

Q ss_pred             HHHHhhhHHHHHHHHHHHHHH
Q psy12432         87 TASIASGASIAVFLVVILCFS  107 (229)
Q Consensus        87 ~~~Ia~gI~V~lLLVlii~vv  107 (229)
                      ++.|++.+.+.-++++++..+
T Consensus        91 ~~~Il~~ivvSTllvl~vtg~  111 (141)
T PRK04125         91 PVQIIGVIIVATILLLACTGL  111 (141)
T ss_pred             HHHHHHHHHHHHHHHHHHHHH
Confidence            444444444444444444433


No 195
>CHL00008 petG cytochrome b6/f complex subunit V
Probab=28.42  E-value=1.1e+02  Score=19.71  Aligned_cols=24  Identities=17%  Similarity=0.426  Sum_probs=10.7

Q ss_pred             hhhHHHHHHHHHH--HHHHHHHHHHH
Q psy12432         91 ASGASIAVFLVVI--LCFSLYVHCQR  114 (229)
Q Consensus        91 a~gI~V~lLLVli--i~vvly~~~kR  114 (229)
                      ..||+++++-+-+  +.+..|..++|
T Consensus         6 L~GiVLGlipvTl~GlfvaAylQYrR   31 (37)
T CHL00008          6 LFGIVLGLIPITLAGLFVTAYLQYRR   31 (37)
T ss_pred             hhhHHHHhHHHHHHHHHHHHHHHHhh
Confidence            3455555542222  22345555554


No 196
>PF07271 Cytadhesin_P30:  Cytadhesin P30/P32;  InterPro: IPR009896 This family consists of several Mycoplasma species specific Cytadhesin P32 and P30 proteins. P30 has been found to be membrane associated and localised on the tip organelle. It is thought that it is important in cytadherence and virulence [].; GO: 0007157 heterophilic cell-cell adhesion, 0009405 pathogenesis, 0016021 integral to membrane
Probab=28.34  E-value=41  Score=30.80  Aligned_cols=13  Identities=8%  Similarity=0.169  Sum_probs=4.8

Q ss_pred             hHHHHHHHHHHHH
Q psy12432         93 GASIAVFLVVILC  105 (229)
Q Consensus        93 gI~V~lLLVlii~  105 (229)
                      |+++++|||++.+
T Consensus        78 G~~~v~liLgl~i   90 (279)
T PF07271_consen   78 GLLAVALILGLAI   90 (279)
T ss_pred             hHHHHHHHHHHhh
Confidence            3333333333333


No 197
>COG4218 MtrF Tetrahydromethanopterin S-methyltransferase, subunit F [Coenzyme metabolism]
Probab=28.26  E-value=60  Score=23.90  Aligned_cols=19  Identities=42%  Similarity=0.347  Sum_probs=11.1

Q ss_pred             HHHHhhhHHHHHHHHHHHH
Q psy12432         87 TASIASGASIAVFLVVILC  105 (229)
Q Consensus        87 ~~~Ia~gI~V~lLLVlii~  105 (229)
                      +.++++|.+++++++++..
T Consensus        51 i~GlaiGfvfA~vLv~ill   69 (73)
T COG4218          51 IAGLAIGFVFAGVLVGILL   69 (73)
T ss_pred             hHHHHHHHHHHHHHHHHHH
Confidence            4566666666665555544


No 198
>PF07178 TraL:  TraL protein;  InterPro: IPR009838 This entry represents bacterial TraL proteins. TraL is a predicted peripheral membrane protein involved in bacterial sex pilus assembly []. TraL is part of the type IV secretion system for conjugative plasmid transfer []. The exact function of TraL is unknown.; GO: 0000746 conjugation, 0019867 outer membrane
Probab=28.00  E-value=1.3e+02  Score=22.73  Aligned_cols=10  Identities=30%  Similarity=0.368  Sum_probs=3.6

Q ss_pred             HHHHHHHhhh
Q psy12432        108 LYVHCQRRKK  117 (229)
Q Consensus       108 ly~~~kRrk~  117 (229)
                      +++.++|-|+
T Consensus        49 ~~~~~~k~K~   58 (95)
T PF07178_consen   49 LWWGYRKFKK   58 (95)
T ss_pred             HHHHHHHHHc
Confidence            3333333333


No 199
>PHA03283 envelope glycoprotein E; Provisional
Probab=27.91  E-value=36  Score=33.86  Aligned_cols=19  Identities=21%  Similarity=0.216  Sum_probs=9.4

Q ss_pred             HHHHHHHHHHHHHHHHhhh
Q psy12432         99 FLVVILCFSLYVHCQRRKK  117 (229)
Q Consensus        99 LLVlii~vvly~~~kRrk~  117 (229)
                      ++|+.+++|.|++++|+++
T Consensus       412 ~~~~~l~vw~c~~~r~~~~  430 (542)
T PHA03283        412 ALLVALVVWGCILYRRSNR  430 (542)
T ss_pred             HHHHHHhhhheeeehhhcC
Confidence            3444445556665544333


No 200
>PF09064 Tme5_EGF_like:  Thrombomodulin like fifth domain, EGF-like;  InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=27.75  E-value=32  Score=21.83  Aligned_cols=10  Identities=30%  Similarity=0.577  Sum_probs=8.1

Q ss_pred             CceeeCCCCC
Q psy12432         23 FHTYACPPTY   32 (229)
Q Consensus        23 gy~C~C~~gy   32 (229)
                      ...|.||+||
T Consensus        17 ~~~C~CPeGy   26 (34)
T PF09064_consen   17 PGQCFCPEGY   26 (34)
T ss_pred             CCceeCCCce
Confidence            4589999987


No 201
>PF11353 DUF3153:  Protein of unknown function (DUF3153);  InterPro: IPR021499  This family of proteins with unknown function appear to be restricted to Cyanobacteria. Some members are annotated as membrane proteins however this cannot be confirmed. 
Probab=27.72  E-value=59  Score=27.97  Aligned_cols=14  Identities=7%  Similarity=0.289  Sum_probs=6.4

Q ss_pred             HHHHHHHHHHHHhh
Q psy12432        103 ILCFSLYVHCQRRK  116 (229)
Q Consensus       103 ii~vvly~~~kRrk  116 (229)
                      ++++.+|++++|.+
T Consensus       195 l~~~~~~l~~~r~~  208 (209)
T PF11353_consen  195 LILLGFLLRRRRLP  208 (209)
T ss_pred             HHHHHHHHHHhhcC
Confidence            33334555555543


No 202
>PF04415 DUF515:  Protein of unknown function (DUF515)    ;  InterPro: IPR007509 This is a family of hypothetical archaeal proteins.
Probab=27.68  E-value=66  Score=31.17  Aligned_cols=26  Identities=19%  Similarity=0.625  Sum_probs=13.4

Q ss_pred             HhhhHHHHHHHHHHHHHHHHHHHHHh
Q psy12432         90 IASGASIAVFLVVILCFSLYVHCQRR  115 (229)
Q Consensus        90 Ia~gI~V~lLLVlii~vvly~~~kRr  115 (229)
                      +++|..|+.++++++.+.+|+..+-+
T Consensus        32 iiiGa~Vl~iIii~~~~~~Y~~~~~~   57 (416)
T PF04415_consen   32 IIIGAAVLIIIIIFIVYNIYYFLQNQ   57 (416)
T ss_pred             hhhhhhhHhHHHHHHHHHHHHHhhhh
Confidence            34555555555555555555554433


No 203
>PHA03163 hypothetical protein; Provisional
Probab=27.61  E-value=2.4e+02  Score=21.69  Aligned_cols=29  Identities=14%  Similarity=0.359  Sum_probs=14.0

Q ss_pred             HHHHHhhhHHHHHHHHHHHHHHHHHHHHH
Q psy12432         86 ETASIASGASIAVFLVVILCFSLYVHCQR  114 (229)
Q Consensus        86 ~~~~Ia~gI~V~lLLVlii~vvly~~~kR  114 (229)
                      .+..|.+-+=+.++++..++.+.|+.++|
T Consensus        56 SFSSIWaliNv~Ivl~A~~iyL~y~CF~k   84 (92)
T PHA03163         56 SFSSIWAILNVLIMLIACIIYCIYMCFNK   84 (92)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            34444443334444444445555555544


No 204
>TIGR01432 QOXA cytochrome aa3 quinol oxidase, subunit II. This enzyme catalyzes the oxidation of quinol with the concomitant reduction of molecular oxygen to water. This acts as the terminal electron acceptor in the respiratory chain. This subunit contains two transmembrane helices and a large external domain responsible for the binding and oxidation of quinol. QuoX is (presently) only found in gram positive bacteria of the Bacillus/Staphylococcus group. Like CyoA, the ubiquinol oxidase found in proteobacteria, the residues responsible for the ligation of Cu(a) and cytochrome c (found in the related cyt. c oxidases) are absent. Unlike CyoA, QoxA is in complex with a subunit I which contains cytochromes a similar to the cyt. c oxidases (as opposed to cytochromes b).
Probab=27.45  E-value=1.3e+02  Score=25.98  Aligned_cols=10  Identities=20%  Similarity=0.198  Sum_probs=3.9

Q ss_pred             HHHHHHHHHh
Q psy12432        106 FSLYVHCQRR  115 (229)
Q Consensus       106 vvly~~~kRr  115 (229)
                      +++.++++++
T Consensus        42 i~~~~kyr~r   51 (217)
T TIGR01432        42 TIFLVKYRYR   51 (217)
T ss_pred             HHHHHHHHhc
Confidence            3333444333


No 205
>PRK00665 petG cytochrome b6-f complex subunit PetG; Reviewed
Probab=27.22  E-value=1.1e+02  Score=19.61  Aligned_cols=24  Identities=17%  Similarity=0.358  Sum_probs=10.7

Q ss_pred             hhhHHHHHHHHHH--HHHHHHHHHHH
Q psy12432         91 ASGASIAVFLVVI--LCFSLYVHCQR  114 (229)
Q Consensus        91 a~gI~V~lLLVli--i~vvly~~~kR  114 (229)
                      ..||+++++-+-+  +.+..|..++|
T Consensus         6 L~GiVLGlipiTl~GlfvaAylQYrR   31 (37)
T PRK00665          6 LCGIVLGLIPVTLAGLFVAAWNQYKR   31 (37)
T ss_pred             hhhHHHHhHHHHHHHHHHHHHHHHhc
Confidence            3455555542222  22345555554


No 206
>PRK09400 secE preprotein translocase subunit SecE; Reviewed
Probab=27.21  E-value=1.4e+02  Score=21.13  Aligned_cols=26  Identities=19%  Similarity=0.411  Sum_probs=15.3

Q ss_pred             hHHHHHhhhHHHHHHHHHHHHHHHHH
Q psy12432         85 LETASIASGASIAVFLVVILCFSLYV  110 (229)
Q Consensus        85 ~~~~~Ia~gI~V~lLLVlii~vvly~  110 (229)
                      .|...++-...+++++++++..+++.
T Consensus        30 ~Ef~~ia~~~~iG~~i~G~iGf~Ikl   55 (61)
T PRK09400         30 EEFLLVAKVTGLGILLIGLIGFIIYL   55 (61)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            35555555556666666666655543


No 207
>KOG3514|consensus
Probab=26.99  E-value=47  Score=36.14  Aligned_cols=25  Identities=16%  Similarity=0.251  Sum_probs=14.6

Q ss_pred             HHHHhhhHHHHHHHHHHHHHHHHHH
Q psy12432         87 TASIASGASIAVFLVVILCFSLYVH  111 (229)
Q Consensus        87 ~~~Ia~gI~V~lLLVlii~vvly~~  111 (229)
                      +.++++||.+++.|.+|++.++..+
T Consensus      1514 ttGmVvGIvaAaaLcILilL~am~k 1538 (1591)
T KOG3514|consen 1514 TTGMVVGIVAAAALCILILLYAMYK 1538 (1591)
T ss_pred             ccchhhHHHHHHHHHHHHHHhhccc
Confidence            4566777776666555555444443


No 208
>PF02404 SCF:  Stem cell factor;  InterPro: IPR003452 Stem cell factor (SCF) is a homodimer involved in hematopoiesis. SCF binds to and activates the SCF receptor (SCFR), a receptor tyrosine kinase. SCF stimulates the proliferation of mast cells and is able to augment the proliferation of both myeloid and lymphoid hematopoietic progenitors in bone marrow culture. It also mediates cell-cell adhesion and acts synergistically with other cytokines. SCF is a type I membrane protein, but is also found in a secretable, soluble form. The crystal structure of human SCF has been resolved and a potential receptor-binding site identified [].; GO: 0005173 stem cell factor receptor binding, 0007155 cell adhesion, 0016020 membrane; PDB: 1EXZ_A 1SCF_D 2E9W_C 2O26_A 2O27_A.
Probab=26.98  E-value=21  Score=32.50  Aligned_cols=20  Identities=15%  Similarity=0.252  Sum_probs=0.0

Q ss_pred             HHHHHHHHHHHHHHHHHHHh
Q psy12432         96 IAVFLVVILCFSLYVHCQRR  115 (229)
Q Consensus        96 V~lLLVlii~vvly~~~kRr  115 (229)
                      ++-||++++++++||+++.+
T Consensus       223 l~SLVIGFvlG~l~WKkkq~  242 (273)
T PF02404_consen  223 LFSLVIGFVLGALYWKKKQR  242 (273)
T ss_dssp             --------------------
T ss_pred             HHHHHHHHHHHHHHHHhhcc
Confidence            44456777777777755544


No 209
>PF03988 DUF347:  Repeat of Unknown Function (DUF347) ;  InterPro: IPR007136 This repeat is found as four tandem repeats in a family of bacterial membrane proteins. Each repeat contains two transmembrane regions and a conserved tryptophan.
Probab=26.87  E-value=92  Score=21.27  Aligned_cols=10  Identities=0%  Similarity=-0.074  Sum_probs=3.6

Q ss_pred             HHHHHHHHHH
Q psy12432        101 VVILCFSLYV  110 (229)
Q Consensus       101 Vlii~vvly~  110 (229)
                      ++++++++++
T Consensus        38 ~~l~~~~~~~   47 (55)
T PF03988_consen   38 ALLAVVLALW   47 (55)
T ss_pred             HHHHHHHHHH
Confidence            3333333333


No 210
>PF11431 Transport_MerF:  Membrane transport protein MerF;  InterPro: IPR021091  This entry represents Mercury ion transport protein MerF, which has a core helix-loop-helix domain. It also has two vicinal pairs of cysteine residues which are involved in the transport of Hg(II) across the membrane and are exposed to the cytoplasm []. ; PDB: 1WAZ_A 2LJ2_A 2H3O_A.
Probab=26.74  E-value=77  Score=21.41  Aligned_cols=18  Identities=33%  Similarity=0.693  Sum_probs=10.5

Q ss_pred             HHHHHHHHHHHHHHHhhh
Q psy12432        100 LVVILCFSLYVHCQRRKK  117 (229)
Q Consensus       100 LVlii~vvly~~~kRrk~  117 (229)
                      +.+++.+.+|-.++|+|+
T Consensus        28 La~fi~lt~yal~r~~~~   45 (46)
T PF11431_consen   28 LAVFIGLTIYALWRRRRK   45 (46)
T ss_dssp             HHHHHHHHHHHHHHHHHS
T ss_pred             HHHHHHHHHHHHHHHhcc
Confidence            445555566666666554


No 211
>PF10669 Phage_Gp23:  Protein gp23 (Bacteriophage A118);  InterPro: IPR018926  This entry is represented by the major tail subunit protein, Gp23 of Listeria phage A118 and prophage found in Bacilli. The function is currently unknown. 
Probab=26.36  E-value=1.8e+02  Score=22.94  Aligned_cols=12  Identities=25%  Similarity=0.603  Sum_probs=5.2

Q ss_pred             HHHHHHHHHHHH
Q psy12432         95 SIAVFLVVILCF  106 (229)
Q Consensus        95 ~V~lLLVlii~v  106 (229)
                      .|+||++++++.
T Consensus        23 ~i~~FiILLIi~   34 (121)
T PF10669_consen   23 FIVVFIILLIIT   34 (121)
T ss_pred             HHHHHHHHHHHH
Confidence            344444444443


No 212
>TIGR00383 corA magnesium Mg(2+) and cobalt Co(2+) transport protein (corA). The article in Microb Comp Genomics 1998;3(3):151-69 (Medline:98448512) discusses this family and suggests that some members may have functions other than Mg2+ transport.
Probab=26.33  E-value=72  Score=28.57  Aligned_cols=14  Identities=7%  Similarity=0.344  Sum_probs=6.4

Q ss_pred             HHHHHHHHHHHHhh
Q psy12432        103 ILCFSLYVHCQRRK  116 (229)
Q Consensus       103 ii~vvly~~~kRrk  116 (229)
                      +++++.|+++|||+
T Consensus       303 ~i~~~~~~~fkrk~  316 (318)
T TIGR00383       303 VIALGPLIYFRRKG  316 (318)
T ss_pred             HHHHHHHHHHHHcC
Confidence            33334455555543


No 213
>PF04156 IncA:  IncA protein;  InterPro: IPR007285 Chlamydia trachomatis is an obligate intracellular bacterium that develops within a parasitophorous vacuole termed an inclusion. The inclusion is nonfusogenic with lysosomes but intercepts lipids from a host cell exocytic pathway. Initiation of chlamydial development is concurrent with modification of the inclusion membrane by a set of C. trachomatis-encoded proteins collectively designated Incs. One of these Incs, IncA (Inclusion membrane protein A), is functionally associated with the homotypic fusion of inclusions [].
Probab=26.26  E-value=96  Score=25.75  Aligned_cols=11  Identities=18%  Similarity=0.129  Sum_probs=4.3

Q ss_pred             HHHHHHHHHHH
Q psy12432        100 LVVILCFSLYV  110 (229)
Q Consensus       100 LVlii~vvly~  110 (229)
                      +++++.++.++
T Consensus        50 ~vL~~~g~~~~   60 (191)
T PF04156_consen   50 VVLLSLGLLCL   60 (191)
T ss_pred             HHHHHHHHHHH
Confidence            33333333333


No 214
>PHA03231 glycoprotein BALF4; Provisional
Probab=26.21  E-value=1e+02  Score=32.41  Aligned_cols=13  Identities=8%  Similarity=0.427  Sum_probs=5.5

Q ss_pred             HHHHHHHHHhhhh
Q psy12432        106 FSLYVHCQRRKKQ  118 (229)
Q Consensus       106 vvly~~~kRrk~~  118 (229)
                      ++++..++|.++.
T Consensus       718 v~v~l~~rr~~~~  730 (829)
T PHA03231        718 VAVFLAYRRVRRL  730 (829)
T ss_pred             hhhhHHHHHHHHH
Confidence            3344444444433


No 215
>PF11174 DUF2970:  Protein of unknown function (DUF2970);  InterPro: IPR021344  This short family is conserved in Proteobacteria. The function is not known. 
Probab=26.15  E-value=1.1e+02  Score=21.32  Aligned_cols=19  Identities=26%  Similarity=0.263  Sum_probs=10.3

Q ss_pred             HHHHhhhHHHHHHHHHHHH
Q psy12432         87 TASIASGASIAVFLVVILC  105 (229)
Q Consensus        87 ~~~Ia~gI~V~lLLVlii~  105 (229)
                      +..|++|++.++++++.+.
T Consensus        31 ~~~Ii~gii~~~~fV~~Lv   49 (56)
T PF11174_consen   31 VHFIIVGIILAALFVAGLV   49 (56)
T ss_pred             chHHHHHHHHHHHHHHHHH
Confidence            4556666665555444443


No 216
>PF11743 DUF3301:  Protein of unknown function (DUF3301);  InterPro: IPR021732  This family is conserved in Proteobacteria, but the function is not known. 
Probab=26.03  E-value=90  Score=23.81  Aligned_cols=17  Identities=12%  Similarity=0.284  Sum_probs=7.5

Q ss_pred             HHHHHHHHHHHHHhhhh
Q psy12432        102 VILCFSLYVHCQRRKKQ  118 (229)
Q Consensus       102 lii~vvly~~~kRrk~~  118 (229)
                      +++++++++..+|.++.
T Consensus         8 l~~~~~~~w~~~~~~E~   24 (97)
T PF11743_consen    8 LALVGWFWWQSRRQRER   24 (97)
T ss_pred             HHHHHHHHHHHhhHHHH
Confidence            33334444544444444


No 217
>PF13179 DUF4006:  Family of unknown function (DUF4006)
Probab=25.86  E-value=1.4e+02  Score=21.65  Aligned_cols=18  Identities=28%  Similarity=0.420  Sum_probs=7.3

Q ss_pred             HHHHHHHHHHHHHHhhhh
Q psy12432        101 VVILCFSLYVHCQRRKKQ  118 (229)
Q Consensus       101 Vlii~vvly~~~kRrk~~  118 (229)
                      +.|+.++-|+-.+-++..
T Consensus        24 LsIl~~lt~~ai~~Qq~~   41 (66)
T PF13179_consen   24 LSILAFLTYWAIKVQQEQ   41 (66)
T ss_pred             HHHHHHHHHHHHHHHHHH
Confidence            333334444444444433


No 218
>COG3088 CcmH Uncharacterized protein involved in biosynthesis of c-type cytochromes [Posttranslational modification, protein turnover, chaperones]
Probab=25.73  E-value=1.1e+02  Score=25.69  Aligned_cols=18  Identities=11%  Similarity=0.102  Sum_probs=8.2

Q ss_pred             hhhHHHHHHHHHHHHHHH
Q psy12432         91 ASGASIAVFLVVILCFSL  108 (229)
Q Consensus        91 a~gI~V~lLLVlii~vvl  108 (229)
                      ..+.-+++||+++++++.
T Consensus       108 LW~~Pv~llllG~~~~~~  125 (153)
T COG3088         108 LWGLPVVLLLLGGVLLVR  125 (153)
T ss_pred             HHHhHHHHHHHHHHHHHH
Confidence            344445555444444333


No 219
>TIGR00327 secE_euk_arch protein translocase SEC61 complex gamma subunit, archaeal and eukaryotic. This model describes archaeal SEC61-like and eukaryotic SEC61 but not bacterial secE proteins, for which a Pfam pfam00584 (SecE) has been created.
Probab=25.50  E-value=1.8e+02  Score=20.62  Aligned_cols=26  Identities=19%  Similarity=0.316  Sum_probs=15.3

Q ss_pred             hHHHHHhhhHHHHHHHHHHHHHHHHH
Q psy12432         85 LETASIASGASIAVFLVVILCFSLYV  110 (229)
Q Consensus        85 ~~~~~Ia~gI~V~lLLVlii~vvly~  110 (229)
                      .|...++-...+++++++++..+++.
T Consensus        26 ~Ef~~iak~t~iG~~i~G~IGf~Ikl   51 (61)
T TIGR00327        26 EEYLKVAKVTGIGIIIVGIIGYIIKI   51 (61)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            35555555556666666666665544


No 220
>PF03408 Foamy_virus_ENV:  Foamy virus envelope protein  ;  InterPro: IPR005070  Expression of the envelope (Env) glycoprotein is essential for viral particle egress. This feature is unique to the Spumavirinae, a subclass of the Retroviridae. ; GO: 0019031 viral envelope
Probab=25.47  E-value=42  Score=35.24  Aligned_cols=9  Identities=22%  Similarity=0.656  Sum_probs=3.3

Q ss_pred             hHHHHHHHH
Q psy12432         93 GASIAVFLV  101 (229)
Q Consensus        93 gI~V~lLLV  101 (229)
                      ||+|.+|++
T Consensus       957 Gi~ii~lii  965 (981)
T PF03408_consen  957 GIVIILLII  965 (981)
T ss_pred             HHHHHHHHH
Confidence            333333333


No 221
>PF14163 SieB:  Superinfection exclusion protein B
Probab=25.40  E-value=1.7e+02  Score=23.67  Aligned_cols=21  Identities=14%  Similarity=0.151  Sum_probs=11.2

Q ss_pred             HHHHhhhHHHHHHHHHHHHHH
Q psy12432         87 TASIASGASIAVFLVVILCFS  107 (229)
Q Consensus        87 ~~~Ia~gI~V~lLLVlii~vv  107 (229)
                      +.+++.-++++.++..++..+
T Consensus        35 ~i~~~fl~s~s~li~~~~~~~   55 (151)
T PF14163_consen   35 WIGLIFLFSVSYLIAQLLSFI   55 (151)
T ss_pred             HHHHHHHHHHHHHHHHHHHHH
Confidence            455555555555555555443


No 222
>PHA03185 UL14 tegument protein; Provisional
Probab=25.22  E-value=88  Score=27.49  Aligned_cols=31  Identities=26%  Similarity=0.389  Sum_probs=25.6

Q ss_pred             CCccccCCCCCCCCCCCCCCCCCCCCCCCCC
Q psy12432        181 SQLYHQSSPPLPHMSHPPSCTPLPPAPSQPP  211 (229)
Q Consensus       181 ~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~  211 (229)
                      .|+|.+.-.-......|++.||+.|+|..|.
T Consensus       171 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  201 (214)
T PHA03185        171 RPTSGPVAGGLPGGGAPAPPTPPRPGPTDPA  201 (214)
T ss_pred             CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCc
Confidence            5788888888888899999999999886553


No 223
>PF00954 S_locus_glycop:  S-locus glycoprotein family;  InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=25.01  E-value=61  Score=24.63  Aligned_cols=26  Identities=19%  Similarity=0.498  Sum_probs=19.3

Q ss_pred             CCCCCCCEEEeeccCCCCceEeecCCCCCCC
Q psy12432         35 WYCLNGATCFTVKIGESLLYNCECADGYMGQ   65 (229)
Q Consensus        35 npClNGGtC~~~~~~~~~~~sC~C~~Gy~G~   65 (229)
                      +.|-..|.|...     ....|.|..||...
T Consensus        84 ~~CG~~g~C~~~-----~~~~C~Cl~GF~P~  109 (110)
T PF00954_consen   84 GFCGPNGICNSN-----NSPKCSCLPGFEPK  109 (110)
T ss_pred             cccCCccEeCCC-----CCCceECCCCcCCC
Confidence            578888999532     25779999999643


No 224
>PF11157 DUF2937:  Protein of unknown function (DUF2937);  InterPro: IPR022584  This family of proteins with unknown function appears to be found mainly in Proteobacteria. 
Probab=25.00  E-value=85  Score=26.38  Aligned_cols=18  Identities=11%  Similarity=0.165  Sum_probs=8.1

Q ss_pred             HHHHhhhHHHHHHHHHHH
Q psy12432         87 TASIASGASIAVFLVVIL  104 (229)
Q Consensus        87 ~~~Ia~gI~V~lLLVlii  104 (229)
                      .+++++++.+.+++-+++
T Consensus       137 ~~g~vg~l~~~~l~~~l~  154 (167)
T PF11157_consen  137 VFGLVGALLGALLVELLL  154 (167)
T ss_pred             HHHHHHHHHHHHHHHHHH
Confidence            344455555444443333


No 225
>PF12877 DUF3827:  Domain of unknown function (DUF3827);  InterPro: IPR024606 The function of the proteins in this entry is not currently known, but one of the human proteins (Q9HCM3 from SWISSPROT) has been implicated in pilocytic astrocytomas [, , ]. In the majority of cases of pilocytic astrocytomas a tandem duplication produces an in-frame fusion of the gene encoding this protein and the BRAF oncogene. The resulting fusion protein has constitutive BRAF kinase activity and is capable of transforming cells. 
Probab=24.62  E-value=55  Score=33.48  Aligned_cols=27  Identities=11%  Similarity=0.319  Sum_probs=16.5

Q ss_pred             HHHHhhhHHHHHHHHHHHHHHHHHHHHH
Q psy12432         87 TASIASGASIAVFLVVILCFSLYVHCQR  114 (229)
Q Consensus        87 ~~~Ia~gI~V~lLLVlii~vvly~~~kR  114 (229)
                      ++++++.+.|++|+|+|+.| .+|++.|
T Consensus       272 I~gVlvPv~vV~~Iiiil~~-~LCRk~K  298 (684)
T PF12877_consen  272 IAGVLVPVLVVLLIIIILYW-KLCRKNK  298 (684)
T ss_pred             EehHhHHHHHHHHHHHHHHH-HHhcccc
Confidence            36667777776666665555 5566543


No 226
>PF11359 gpUL132:  Glycoprotein UL132;  InterPro: IPR021023  Glycoprotein UL132 is a low-abundance structural component of Human herpesvirus 5 []. The function of this protein is not fully understood. 
Probab=24.54  E-value=64  Score=28.68  Aligned_cols=30  Identities=30%  Similarity=0.499  Sum_probs=14.4

Q ss_pred             HHHhhhHHHHHHHHHHHHHHHHHHHHHhhhh
Q psy12432         88 ASIASGASIAVFLVVILCFSLYVHCQRRKKQ  118 (229)
Q Consensus        88 ~~Ia~gI~V~lLLVlii~vvly~~~kRrk~~  118 (229)
                      +++++|..++-||++++++ +|.-++|+..+
T Consensus        55 iYcVTg~sllsli~Vtvaa-lYsSC~~~pg~   84 (235)
T PF11359_consen   55 IYCVTGFSLLSLIVVTVAA-LYSSCCRRPGR   84 (235)
T ss_pred             eeeehhHHHHHHHHHHHHH-HHHHHHhCCCc
Confidence            4445555555555555554 44444443333


No 227
>PHA03164 hypothetical protein; Provisional
Probab=24.09  E-value=45  Score=25.06  Aligned_cols=25  Identities=32%  Similarity=0.632  Sum_probs=11.4

Q ss_pred             HHhhhHHHHHHHHHHHHHHHHHHHHHh
Q psy12432         89 SIASGASIAVFLVVILCFSLYVHCQRR  115 (229)
Q Consensus        89 ~Ia~gI~V~lLLVlii~vvly~~~kRr  115 (229)
                      .|..|+.++  +++++++++|++.-++
T Consensus        61 lvLtgLaIa--mILfiifvlyvFnVnr   85 (88)
T PHA03164         61 LVLTGLAIA--MILFIIFVLYVFNVNR   85 (88)
T ss_pred             HHHHHHHHH--HHHHHHHHHHheeecc
Confidence            334444333  3444445566554443


No 228
>PRK08455 fliL flagellar basal body-associated protein FliL; Reviewed
Probab=23.95  E-value=74  Score=27.10  Aligned_cols=6  Identities=0%  Similarity=-0.175  Sum_probs=2.4

Q ss_pred             HHHHHH
Q psy12432        106 FSLYVH  111 (229)
Q Consensus       106 vvly~~  111 (229)
                      +++|+.
T Consensus        36 ~~~~~~   41 (182)
T PRK08455         36 VIAMLL   41 (182)
T ss_pred             HHHHHH
Confidence            334433


No 229
>COG5353 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=23.88  E-value=45  Score=28.09  Aligned_cols=21  Identities=24%  Similarity=0.463  Sum_probs=10.2

Q ss_pred             hhhHHHHHHHHHHHHHHHHHH
Q psy12432         91 ASGASIAVFLVVILCFSLYVH  111 (229)
Q Consensus        91 a~gI~V~lLLVlii~vvly~~  111 (229)
                      +++|.+++|+++++.+++++.
T Consensus         8 ~i~ii~viflai~~s~~~~~~   28 (161)
T COG5353           8 IIIIILVIFLAIILSIALFFW   28 (161)
T ss_pred             eehhHHHHHHHHHHHHHHHHh
Confidence            444444555555555444443


No 230
>KOG3498|consensus
Probab=23.87  E-value=1.5e+02  Score=21.59  Aligned_cols=25  Identities=20%  Similarity=0.276  Sum_probs=16.9

Q ss_pred             hhHHHHHhhhHHHHHHHHHHHHHHH
Q psy12432         84 MLETASIASGASIAVFLVVILCFSL  108 (229)
Q Consensus        84 ~~~~~~Ia~gI~V~lLLVlii~vvl  108 (229)
                      ..|+..|+.++.+++++++++..++
T Consensus        30 rKEf~ki~~~~aiGf~~mG~iGf~v   54 (67)
T KOG3498|consen   30 RKEFTKIAKATAIGFVIMGFIGFFV   54 (67)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            4566677777777777777776544


No 231
>PF06809 NPDC1:  Neural proliferation differentiation control-1 protein (NPDC1);  InterPro: IPR009635 This family consists of several neural proliferation differentiation control-1 (NPDC1) proteins. NPDC1 plays a role in the control of neural cell proliferation and differentiation. It has been suggested that NPDC1 may be involved in the development of several secretion glands. This family also contains the C-terminal region of the Caenorhabditis elegans protein CAB-1 (Q93249 from SWISSPROT) which is known to interact with AEX-3 [].; GO: 0016021 integral to membrane
Probab=23.76  E-value=18  Score=33.80  Aligned_cols=9  Identities=11%  Similarity=-0.101  Sum_probs=4.7

Q ss_pred             ccCCCCcce
Q psy12432        142 ERRPSPADF  150 (229)
Q Consensus       142 ~~~~~p~~~  150 (229)
                      .+.-+-+|+
T Consensus       270 QKQQMiamE  278 (341)
T PF06809_consen  270 QKQQMIAME  278 (341)
T ss_pred             HHHHHHhhc
Confidence            444555555


No 232
>PRK01026 tetrahydromethanopterin S-methyltransferase subunit G; Provisional
Probab=23.72  E-value=1.6e+02  Score=22.00  Aligned_cols=19  Identities=21%  Similarity=0.426  Sum_probs=10.5

Q ss_pred             HHHhhhHHHHHHHHHHHHH
Q psy12432         88 ASIASGASIAVFLVVILCF  106 (229)
Q Consensus        88 ~~Ia~gI~V~lLLVlii~v  106 (229)
                      .+|+-|+++++|++++...
T Consensus        50 iGIlYG~viGlli~~i~~~   68 (77)
T PRK01026         50 IGILYGLVIGLLIVLVYII   68 (77)
T ss_pred             HHHHHHHHHHHHHHHHHHH
Confidence            4566666666665444443


No 233
>PHA03156 hypothetical protein; Provisional
Probab=23.66  E-value=3.1e+02  Score=21.06  Aligned_cols=29  Identities=17%  Similarity=0.087  Sum_probs=14.2

Q ss_pred             HHHHHhhhHHHHHHHHHHHHHHHHHHHHH
Q psy12432         86 ETASIASGASIAVFLVVILCFSLYVHCQR  114 (229)
Q Consensus        86 ~~~~Ia~gI~V~lLLVlii~vvly~~~kR  114 (229)
                      .+..|.+-+=+.++++..++.+.|+.++|
T Consensus        55 SFSSIWallN~~i~~~A~~ifL~y~CF~k   83 (90)
T PHA03156         55 TFSSIWAILNGIIFFCASLFFLRHLCFVK   83 (90)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            34445443344444455555555555544


No 234
>PF02529 PetG:  Cytochrome B6-F complex subunit 5;  InterPro: IPR003683 This family consists of cytochrome b6/f complex subunit 5 (PetG). The cytochrome bf complex, found in green plants, eukaryotic algae and cyanobacteria, connects photosystem I to photosystem II in the electron transport chain, functioning as a plastoquinol:plastocyanin/cytochrome c6 oxidoreductase []. The purified complex from the unicellular alga Chlamydomonas reinhardtii contains seven subunits; namely four high molecular weight subunits (cytochrome f, Rieske iron-sulphur protein, cytochrome b6, and subunit IV) and three approximately miniproteins (PetG, PetL, and PetX) []. Stoichiometry measurements are consistent with every subunit being present as two copies per b6/f dimer. The absence of PetG affects either the assembly or stability of the cytochrome bf complex in C. reinhardtii [].; GO: 0009512 cytochrome b6f complex; PDB: 1Q90_G 2ZT9_G 1VF5_G 2D2C_G 2E74_G 2E75_G 2E76_G.
Probab=23.64  E-value=1.7e+02  Score=18.83  Aligned_cols=25  Identities=20%  Similarity=0.376  Sum_probs=11.3

Q ss_pred             hhhHHHHHHHHHHHH--HHHHHHHHHh
Q psy12432         91 ASGASIAVFLVVILC--FSLYVHCQRR  115 (229)
Q Consensus        91 a~gI~V~lLLVlii~--vvly~~~kRr  115 (229)
                      ..||+++++-+.+..  +..|..++|-
T Consensus         6 L~GiVlGli~vtl~Glfv~Ay~QY~Rg   32 (37)
T PF02529_consen    6 LSGIVLGLIPVTLAGLFVAAYLQYRRG   32 (37)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHHHHCS-
T ss_pred             hhhHHHHhHHHHHHHHHHHHHHHHhcc
Confidence            455666655332222  3455555543


No 235
>PF04202 Mfp-3:  Foot protein 3;  InterPro: IPR007328 Mytilus foot protein-3 (Mfp-3) is a highly polymorphic protein family located in the byssal adhesive plaques of blue mussels.
Probab=23.27  E-value=52  Score=23.98  Aligned_cols=11  Identities=36%  Similarity=0.525  Sum_probs=4.8

Q ss_pred             HHHHHHHHHHH
Q psy12432         96 IAVFLVVILCF  106 (229)
Q Consensus        96 V~lLLVlii~v  106 (229)
                      |+|.||||.++
T Consensus         8 VLlaLvLIg~f   18 (71)
T PF04202_consen    8 VLLALVLIGSF   18 (71)
T ss_pred             HHHHHHHHhhh
Confidence            34444444443


No 236
>COG3630 OadG Na+-transporting methylmalonyl-CoA/oxaloacetate decarboxylase, gamma subunit [Energy production and conversion]
Probab=23.10  E-value=1.8e+02  Score=22.12  Aligned_cols=20  Identities=15%  Similarity=0.353  Sum_probs=9.8

Q ss_pred             hhhHHHHHHHHHHHHHHHHH
Q psy12432         91 ASGASIAVFLVVILCFSLYV  110 (229)
Q Consensus        91 a~gI~V~lLLVlii~vvly~  110 (229)
                      +.|+.++++++.|++.+++.
T Consensus        15 vlGmg~VflfL~iLi~~~~~   34 (84)
T COG3630          15 VLGMGFVFLFLSILIYAMRG   34 (84)
T ss_pred             HHHHHHHHHHHHHHHHHHHH
Confidence            45555555544444444444


No 237
>TIGR02507 MtrF tetrahydromethanopterin S-methyltransferase, F subunit. coenzyme M methyltransferase in methanogenic archaea. This methyltranferase is membrane-associated enzyme complex that uses methy-transfer reaction to drive sodium-ion pump. Archaea domain, have evolved energy-yielding pathways marked by one-carbon biochemistry featuring novel cofactors and enzymes. This transferase is involved in the transfer of 'methyl' group from N5-methyltetrahydromethanopterin to coenzyme M. In an accompanying reaction, methane is produced by two-electron reduction of the methyl moiety in methyl-coenzyme M by another enzyme methyl-coenzyme M reductase.
Probab=23.09  E-value=1.4e+02  Score=21.62  Aligned_cols=19  Identities=32%  Similarity=0.385  Sum_probs=10.8

Q ss_pred             HHHHhhhHHHHHHHHHHHH
Q psy12432         87 TASIASGASIAVFLVVILC  105 (229)
Q Consensus        87 ~~~Ia~gI~V~lLLVlii~  105 (229)
                      +.++++|..++|++|++-.
T Consensus        42 ~~G~~iG~~~Al~lV~IP~   60 (65)
T TIGR02507        42 ITGLAYGFLFAVLLVAVPI   60 (65)
T ss_pred             hHHHHHHHHHHHHHHHHHH
Confidence            4455666666666544433


No 238
>cd00930 Cyt_c_Oxidase_VIII Cytochrome oxidase c subunit VIII.  Cytochrome c oxidase (CcO), the terminal oxidase in the respiratory chains of eukaryotes and most bacteria, is a multi-chain transmembrane protein located in the inner membrane of mitochondria and the cell membrane of prokaryotes. It catalyzes the reduction of O2 and simultaneously pumps protons across the membrane. The number of subunits varies from three to five in bacteria and up to 13 in mammalian mitochondria. Subunits I, II, and III of mammalian CcO are encoded within the mitochondrial genome and the remaining 10 subunits are encoded within the nuclear genome. Found only in eukaryotes, subunit VIII is the smallest of the nuclear-encoded subunits. It exists in muscle-specific and non-muscle-specific isoforms that are differently expressed in different species, suggesting species-specific regulation of energy metabolism.
Probab=23.08  E-value=1.5e+02  Score=19.71  Aligned_cols=23  Identities=17%  Similarity=0.103  Sum_probs=10.9

Q ss_pred             hhhHHHHHHHHHHHHHHHHHHHH
Q psy12432         91 ASGASIAVFLVVILCFSLYVHCQ  113 (229)
Q Consensus        91 a~gI~V~lLLVlii~vvly~~~k  113 (229)
                      ++|+.+.++-+++...|+....+
T Consensus        16 aigl~~~f~~~L~p~gWVLshL~   38 (43)
T cd00930          16 AIGLSVFFTTFLLPAGWVLSHLE   38 (43)
T ss_pred             HHHHHHHHHHHHhhHHHHHHHHH
Confidence            45555555444444444444433


No 239
>PF05337 CSF-1:  Macrophage colony stimulating factor-1 (CSF-1);  InterPro: IPR008001 Colony stimulating factor 1 (CSF-1) is a homodimeric polypeptide growth factor whose primary function is to regulate the survival, proliferation, differentiation, and function of cells of the mononuclear phagocytic lineage. This lineage includes mononuclear phagocytic precursors, blood monocytes, tissue macrophages, osteoclasts, and microglia of the brain, all of which possess cell surface receptors for CSF-1. The protein has also been linked with male fertility [] and mutations in the Csf-1 gene have been found to cause osteopetrosis and failure of tooth eruption [].; GO: 0005125 cytokine activity, 0008083 growth factor activity, 0016021 integral to membrane; PDB: 3EJJ_A.
Probab=23.07  E-value=28  Score=32.00  Aligned_cols=14  Identities=21%  Similarity=0.218  Sum_probs=0.0

Q ss_pred             HHHHHHHHHHHHhh
Q psy12432        103 ILCFSLYVHCQRRK  116 (229)
Q Consensus       103 ii~vvly~~~kRrk  116 (229)
                      .+..++||++|||.
T Consensus       241 aVGGLLfYr~rrRs  254 (285)
T PF05337_consen  241 AVGGLLFYRRRRRS  254 (285)
T ss_dssp             --------------
T ss_pred             hccceeeecccccc
Confidence            33344545555444


No 240
>PHA03286 envelope glycoprotein E; Provisional
Probab=23.06  E-value=1.2e+02  Score=29.97  Aligned_cols=9  Identities=11%  Similarity=0.718  Sum_probs=3.4

Q ss_pred             HHHHHHHHH
Q psy12432         98 VFLVVILCF  106 (229)
Q Consensus        98 lLLVlii~v  106 (229)
                      ++||+++.+
T Consensus       401 ~~~~~~~~~  409 (492)
T PHA03286        401 ILVVLLFAL  409 (492)
T ss_pred             HHHHHHHHH
Confidence            333333333


No 241
>KOG1025|consensus
Probab=22.86  E-value=1.1e+02  Score=32.84  Aligned_cols=8  Identities=50%  Similarity=1.157  Sum_probs=4.8

Q ss_pred             cCCCCCCC
Q psy12432         58 CADGYMGQ   65 (229)
Q Consensus        58 C~~Gy~G~   65 (229)
                      |+.|..|.
T Consensus       571 CP~G~~G~  578 (1177)
T KOG1025|consen  571 CPDGVTGP  578 (1177)
T ss_pred             CCCcccCC
Confidence            66665554


No 242
>PF13974 YebO:  YebO-like protein
Probab=22.81  E-value=94  Score=23.36  Aligned_cols=13  Identities=23%  Similarity=0.508  Sum_probs=5.5

Q ss_pred             HHHHHHHHHHHHH
Q psy12432         96 IAVFLVVILCFSL  108 (229)
Q Consensus        96 V~lLLVlii~vvl  108 (229)
                      +.++|+++++|++
T Consensus         5 ~~~~lv~livWFF   17 (80)
T PF13974_consen    5 VLVLLVGLIVWFF   17 (80)
T ss_pred             HHHHHHHHHHHHH
Confidence            3444444444433


No 243
>PF07438 DUF1514:  Protein of unknown function (DUF1514);  InterPro: IPR009999 This entry is represented by Bacteriophage phi PVL, Orf60. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several Staphylococcus aureus and related bacteriophage proteins of around 65 residues in length. The function of this family is unknown.
Probab=22.68  E-value=84  Score=22.76  Aligned_cols=10  Identities=20%  Similarity=0.288  Sum_probs=4.0

Q ss_pred             hhHHHHHHHH
Q psy12432         92 SGASIAVFLV  101 (229)
Q Consensus        92 ~gI~V~lLLV  101 (229)
                      +.|+++++++
T Consensus         5 iSIvLai~lL   14 (66)
T PF07438_consen    5 ISIVLAIALL   14 (66)
T ss_pred             HHHHHHHHHH
Confidence            3344444433


No 244
>PRK11901 hypothetical protein; Reviewed
Probab=22.66  E-value=67  Score=30.15  Aligned_cols=21  Identities=10%  Similarity=0.025  Sum_probs=13.8

Q ss_pred             HHHHhhhHHHHHHHHHHHHHHH
Q psy12432         87 TASIASGASIAVFLVVILCFSL  108 (229)
Q Consensus        87 ~~~Ia~gI~V~lLLVlii~vvl  108 (229)
                      +++|. .++++||||.|...+-
T Consensus        39 MiGiG-ilVLlLLIi~IgSALk   59 (327)
T PRK11901         39 MIGIG-ILVLLLLIIAIGSALK   59 (327)
T ss_pred             HHHHH-HHHHHHHHHHHhhhcc
Confidence            34443 4667788888888774


No 245
>PF01372 Melittin:  Melittin;  InterPro: IPR002116 Allergies are hypersensitivity reactions of the immune system to specific substances called allergens (such as pollen, stings, drugs, or food) that, in most people, result in no symptoms. A nomenclature system has been established for antigens (allergens) that cause IgE-mediated atopic allergies in humans [WHO/IUIS Allergen Nomenclature Subcommittee King T.P., Hoffmann D., Loewenstein H., Marsh D.G., Platts-Mills T.A.E., Thomas W. Bull. World Health Organ. 72:797-806(1994)]. This nomenclature system is defined by a designation that is composed of the first three letters of the genus; a space; the first letter of the species name; a space and an Arabic number. In the event that two species names have identical designations, they are discriminated from one another by adding one or more letters (as necessary) to each species designation.  The allergens in this family include allergens with the following designations: Api m 3. Melittin is the principal protein component of the venom of the honeybee, Apis mellifera. It inhibits protein kinase C, Ca2+/calmodulin-dependent protein kinase II, myosin light chain kinase and Na+/K+-ATPase (synaptosomal membrane) and is a cell membrane lytic factor. Melittin is a small peptide with no disulphide bridge; the N-terminal part of the molecule is predominantly hydrophobic and the C-terminal part is hydrophilic and strongly basic. The molecular mechanisms underlying the various effects of melittin on membranes have not been completely defined and much of the evidence indicates that different molecular mechanisms may underlie different actions of the peptide []. Extensive work with melittin has shown that the venom has multiple effects, probably, as a result of its interaction with negatively changed phospholipids. It inhibits well known transport pumps such as the Na+-K+-ATPase and the H+-K+-ATPase. Melittin increases the permeability of cell membranes to ions, particularly Na+ and indirectly Ca2+, because of the Na+-Ca2+-exchange. This effect results in marked morphological and functional changes, particularly in excitable tissues such as cardiac myocytes. In some other tissues, e.g., cornea, not only Na+ but Cl- permeability is also increased by melittin. Similar effects to melittin on H+-K+-ATPase have been found with the synthetic amphipathic polypeptide Trp-3 [].  The study of melittin in model membranes has been useful for the development of methodology for determination of membrane protein structures. A molecular dynamics simulation of melittin in a hydrated dipalmitoylphosphatidylcholine (DPPC) bilayer was carried out. The effect of melittin on the surrounding membrane was localised to its immediate vicinity, and its asymmetry with respect to the two layers may be a result of the fact that it is not fully transmembranal. Melittin's hydrophilic C terminus anchors it at the extracellular interface, leaving the N terminus "loose" in the lower layer of the membrane [].; GO: 0004860 protein kinase inhibitor activity, 0005576 extracellular region; PDB: 3QRX_B 2MLT_A 1BH1_A.
Probab=22.62  E-value=1.8e+02  Score=17.21  Aligned_cols=6  Identities=17%  Similarity=0.678  Sum_probs=2.2

Q ss_pred             HHHHHH
Q psy12432        109 YVHCQR  114 (229)
Q Consensus       109 y~~~kR  114 (229)
                      |++.+|
T Consensus        19 WIK~kr   24 (26)
T PF01372_consen   19 WIKNKR   24 (26)
T ss_dssp             HHHHHH
T ss_pred             HHHHHh
Confidence            333333


No 246
>PF06692 MNSV_P7B:  Melon necrotic spot virus P7B protein;  InterPro: IPR009575 This family consists of several Melon necrotic spot virus (MNSV) P7B proteins. The function of this family is unknown.
Probab=22.55  E-value=93  Score=21.96  Aligned_cols=6  Identities=33%  Similarity=0.877  Sum_probs=2.3

Q ss_pred             HHHHHH
Q psy12432        100 LVVILC  105 (229)
Q Consensus       100 LVlii~  105 (229)
                      |++|++
T Consensus        18 Liliis   23 (61)
T PF06692_consen   18 LILIIS   23 (61)
T ss_pred             HHHHHH
Confidence            333333


No 247
>PRK05886 yajC preprotein translocase subunit YajC; Validated
Probab=22.53  E-value=1.3e+02  Score=23.76  Aligned_cols=10  Identities=0%  Similarity=-0.097  Sum_probs=5.9

Q ss_pred             HHHhhhhhhh
Q psy12432        112 CQRRKKQAQA  121 (229)
Q Consensus       112 ~kRrk~~k~~  121 (229)
                      +||+|+.+.+
T Consensus        26 kKr~K~~~~m   35 (109)
T PRK05886         26 RKAMQATIDL   35 (109)
T ss_pred             HHHHHHHHHH
Confidence            4566666654


No 248
>PRK03814 oxaloacetate decarboxylase subunit gamma; Provisional
Probab=22.38  E-value=1.8e+02  Score=21.87  Aligned_cols=8  Identities=13%  Similarity=0.414  Sum_probs=3.5

Q ss_pred             hhhHHHHH
Q psy12432         91 ASGASIAV   98 (229)
Q Consensus        91 a~gI~V~l   98 (229)
                      ++|.++++
T Consensus        16 ~~GM~~VF   23 (85)
T PRK03814         16 LTGMGVVF   23 (85)
T ss_pred             HHHHHHHH
Confidence            44444433


No 249
>PHA03291 envelope glycoprotein I; Provisional
Probab=22.12  E-value=46  Score=31.74  Aligned_cols=30  Identities=27%  Similarity=0.285  Sum_probs=13.0

Q ss_pred             HHhhhHHHHHHHHHHHHHHHHHHHHHhhhh
Q psy12432         89 SIASGASIAVFLVVILCFSLYVHCQRRKKQ  118 (229)
Q Consensus        89 ~Ia~gI~V~lLLVlii~vvly~~~kRrk~~  118 (229)
                      -|++-..++++|+|.-|+....++.||+++
T Consensus       290 QiAIPasii~cV~lGSC~Ccl~R~~rRr~r  319 (401)
T PHA03291        290 QIAIPASIIACVFLGSCACCLHRRCRRRRR  319 (401)
T ss_pred             eeccchHHHHHhhhhhhhhhhhhhhhcccC
Confidence            344444444444444444333344443333


No 250
>COG0598 CorA Mg2+ and Co2+ transporters [Inorganic ion transport and metabolism]
Probab=22.02  E-value=87  Score=28.59  Aligned_cols=20  Identities=30%  Similarity=0.637  Sum_probs=9.8

Q ss_pred             HHHHHHHHHHHHHHHHHHhh
Q psy12432         97 AVFLVVILCFSLYVHCQRRK  116 (229)
Q Consensus        97 ~lLLVlii~vvly~~~kRrk  116 (229)
                      ++++++++++++|++++||+
T Consensus       301 ~l~~m~~~~~~~~~~frrk~  320 (322)
T COG0598         301 ALILMLLLALLLYLYFRRKG  320 (322)
T ss_pred             HHHHHHHHHHHHHHHHHhcC
Confidence            33344444455555555554


No 251
>PF13120 DUF3974:  Domain of unknown function (DUF3974)
Probab=21.86  E-value=79  Score=24.84  Aligned_cols=20  Identities=20%  Similarity=0.180  Sum_probs=10.2

Q ss_pred             HHHHHHHHHHHHHHHHHhhh
Q psy12432         98 VFLVVILCFSLYVHCQRRKK  117 (229)
Q Consensus        98 lLLVlii~vvly~~~kRrk~  117 (229)
                      +||+++.++++.+++-|+.+
T Consensus        14 llligftivvl~vyfgrk~y   33 (126)
T PF13120_consen   14 LLLIGFTIVVLLVYFGRKFY   33 (126)
T ss_pred             HHHHHHHHHhhhheecceee
Confidence            34455555555555555443


No 252
>PF11857 DUF3377:  Domain of unknown function (DUF3377);  InterPro: IPR021805  This domain is functionally uncharacterised and found at the C terminus of peptidases belonging to MEROPS peptidase family M10A, membrane-type matrix metallopeptidases (clan MA). ; GO: 0004222 metalloendopeptidase activity
Probab=21.86  E-value=36  Score=25.26  Aligned_cols=11  Identities=9%  Similarity=0.277  Sum_probs=4.5

Q ss_pred             HHHHHhhhhhh
Q psy12432        110 VHCQRRKKQAQ  120 (229)
Q Consensus       110 ~~~kRrk~~k~  120 (229)
                      ++++|+--.++
T Consensus        52 ~~fkrkGtPr~   62 (74)
T PF11857_consen   52 FQFKRKGTPRR   62 (74)
T ss_pred             heeeecCCCcE
Confidence            33444433333


No 253
>PF14017 DUF4233:  Protein of unknown function (DUF4233)
Probab=21.78  E-value=2.6e+02  Score=21.85  Aligned_cols=21  Identities=14%  Similarity=0.195  Sum_probs=12.1

Q ss_pred             HHHHHHHHHHHHHHhhhhhhh
Q psy12432        101 VVILCFSLYVHCQRRKKQAQA  121 (229)
Q Consensus       101 Vlii~vvly~~~kRrk~~k~~  121 (229)
                      +++..++.|..+.|++-.+++
T Consensus        84 ~iF~~~W~~~l~lg~~i~~~~  104 (107)
T PF14017_consen   84 VIFAAVWWYALYLGRRIDRRM  104 (107)
T ss_pred             HHHHHHHHHHHHHHHHHHHHH
Confidence            444555666666666655543


No 254
>PTZ00234 variable surface protein Vir12; Provisional
Probab=21.75  E-value=57  Score=31.70  Aligned_cols=14  Identities=14%  Similarity=0.467  Sum_probs=6.7

Q ss_pred             HHHHHHHHHHHHHH
Q psy12432         97 AVFLVVILCFSLYV  110 (229)
Q Consensus        97 ~lLLVlii~vvly~  110 (229)
                      ++-||++|.+++|+
T Consensus       370 ~~ailGtifFlfyy  383 (433)
T PTZ00234        370 GASIIGVLVFLFFF  383 (433)
T ss_pred             HHHHHHHHHHhhhh
Confidence            33344555555554


No 255
>COG4736 CcoQ Cbb3-type cytochrome oxidase, subunit 3 [Posttranslational modification, protein turnover, chaperones]
Probab=21.56  E-value=1.9e+02  Score=20.57  Aligned_cols=32  Identities=13%  Similarity=-0.050  Sum_probs=22.5

Q ss_pred             hhhHHHHHHHHHHHHHHHHHHHHHhhhhhhhc
Q psy12432         91 ASGASIAVFLVVILCFSLYVHCQRRKKQAQAA  122 (229)
Q Consensus        91 a~gI~V~lLLVlii~vvly~~~kRrk~~k~~~  122 (229)
                      ...+-...+++++++++.++.+.=++.+|+..
T Consensus         7 ~~~a~a~~t~~~~l~fiavi~~ayr~~~K~~~   38 (60)
T COG4736           7 RGFADAWGTIAFTLFFIAVIYFAYRPGKKGEF   38 (60)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHhcccchhhH
Confidence            34445566677777788888888877777664


No 256
>PF07178 TraL:  TraL protein;  InterPro: IPR009838 This entry represents bacterial TraL proteins. TraL is a predicted peripheral membrane protein involved in bacterial sex pilus assembly []. TraL is part of the type IV secretion system for conjugative plasmid transfer []. The exact function of TraL is unknown.; GO: 0000746 conjugation, 0019867 outer membrane
Probab=21.53  E-value=1.4e+02  Score=22.46  Aligned_cols=23  Identities=9%  Similarity=0.147  Sum_probs=16.3

Q ss_pred             HHHHHHHHHHHHHHHHHhhhhhh
Q psy12432         98 VFLVVILCFSLYVHCQRRKKQAQ  120 (229)
Q Consensus        98 lLLVlii~vvly~~~kRrk~~k~  120 (229)
                      .+++.++++..|-+.|+++.+.-
T Consensus        42 g~i~g~~~~~~~~k~K~~~~~g~   64 (95)
T PF07178_consen   42 GLILGIVLWWGYRKFKKGRGRGF   64 (95)
T ss_pred             HHHHHHHHHHHHHHHHccCCccH
Confidence            34566667888888888877763


No 257
>PF11085 YqhR:  Conserved membrane protein YqhR;  InterPro: IPR024563 This family of proteins is conserved in the Bacillaceae family of the Firmicutes. Their function is not known.
Probab=21.46  E-value=1.7e+02  Score=25.12  Aligned_cols=28  Identities=7%  Similarity=0.083  Sum_probs=13.2

Q ss_pred             HHHHhhhHHHHHHHHHHHHHHHHHHHHH
Q psy12432         87 TASIASGASIAVFLVVILCFSLYVHCQR  114 (229)
Q Consensus        87 ~~~Ia~gI~V~lLLVlii~vvly~~~kR  114 (229)
                      +++.++|+++..++-++.+++.|...+|
T Consensus        67 ~~G~~igi~~~gv~Si~aAllY~~~l~k   94 (173)
T PF11085_consen   67 WLGNLIGIVFIGVFSIVAALLYYALLKK   94 (173)
T ss_pred             hHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            4455555544444444444444444444


No 258
>PF05961 Chordopox_A13L:  Chordopoxvirus A13L protein;  InterPro: IPR009236 This family consists of A13L proteins from the Chordopoxviruses. A13L or p8 is one of the three most abundant membrane proteins of the intracellular mature Vaccinia virus [].
Probab=21.42  E-value=1.2e+02  Score=22.22  Aligned_cols=9  Identities=44%  Similarity=0.634  Sum_probs=3.8

Q ss_pred             HHHHHHHhh
Q psy12432        108 LYVHCQRRK  116 (229)
Q Consensus       108 ly~~~kRrk  116 (229)
                      +|-.+.|++
T Consensus        19 lY~iYnr~~   27 (68)
T PF05961_consen   19 LYGIYNRKK   27 (68)
T ss_pred             HHHHHhccc
Confidence            344444433


No 259
>TIGR01495 ETRAMP Plasmodium ring stage membrane protein ETRAMP. These genes have been shown to be found in the sub-telomeric regions of both P. falciparum and P. yoelii chromosomes.
Probab=21.42  E-value=81  Score=23.81  Aligned_cols=13  Identities=15%  Similarity=0.266  Sum_probs=4.7

Q ss_pred             HHHHHHHHHHHHH
Q psy12432         97 AVFLVVILCFSLY  109 (229)
Q Consensus        97 ~lLLVlii~vvly  109 (229)
                      ++|+..++.+.+|
T Consensus        62 a~lv~t~~G~g~y   74 (85)
T TIGR01495        62 ALLVGAGVGLGYY   74 (85)
T ss_pred             HHHHHHHHHHhhh
Confidence            3333333333333


No 260
>PF07226 DUF1422:  Protein of unknown function (DUF1422);  InterPro: IPR009867 This family consists of several hypothetical bacterial proteins of around 120 residues in length. The function of this family is unknown.
Probab=21.36  E-value=99  Score=24.87  Aligned_cols=20  Identities=30%  Similarity=0.306  Sum_probs=10.1

Q ss_pred             HHHHHHHHHHHHHHHHHHhh
Q psy12432         97 AVFLVVILCFSLYVHCQRRK  116 (229)
Q Consensus        97 ~lLLVlii~vvly~~~kRrk  116 (229)
                      .+++.+++++|++.+.+.+|
T Consensus        96 p~il~l~L~~Wi~~kl~~~k  115 (117)
T PF07226_consen   96 PSILCLILVFWIGYKLGFRK  115 (117)
T ss_pred             HHHHHHHHHHHHHHHHhhhc
Confidence            34455555555555554443


No 261
>PF05827 ATP-synt_S1:  Vacuolar ATP synthase subunit S1 (ATP6S1);  InterPro: IPR024722 This family consists of metazoan vacuolar ATP synthase subunit S1 proteins [] and fungal proteins belonging to the BIG family. In Candida albicans BIG is required for normal beta-1,6-glucan synthesis, hyphal morphogenesis, adhesion and virulence [].
Probab=21.06  E-value=1e+02  Score=27.44  Aligned_cols=23  Identities=30%  Similarity=0.390  Sum_probs=14.1

Q ss_pred             HHHhhhHHHHHHHHHHHHHHHHH
Q psy12432         88 ASIASGASIAVFLVVILCFSLYV  110 (229)
Q Consensus        88 ~~Ia~gI~V~lLLVlii~vvly~  110 (229)
                      -+|.+|++++++|+.|+.+.+.+
T Consensus       257 pgi~mglii~~~ll~IL~~gl~~  279 (282)
T PF05827_consen  257 PGIWMGLIISLVLLSILYVGLSM  279 (282)
T ss_pred             ccHHHHHHHHHHHHHHHHHHHHH
Confidence            45666776666666666655443


No 262
>COG3105 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=20.85  E-value=1.6e+02  Score=24.23  Aligned_cols=13  Identities=8%  Similarity=0.077  Sum_probs=6.1

Q ss_pred             HHHHhhhHHHHHH
Q psy12432         87 TASIASGASIAVF   99 (229)
Q Consensus        87 ~~~Ia~gI~V~lL   99 (229)
                      |.++++|++|+|+
T Consensus         8 W~~a~igLvvGi~   20 (138)
T COG3105           8 WEYALIGLVVGII   20 (138)
T ss_pred             HHHHHHHHHHHHH
Confidence            4444554444443


No 263
>PF13706 PepSY_TM_3:  PepSY-associated TM helix
Probab=20.79  E-value=1.8e+02  Score=18.15  Aligned_cols=12  Identities=33%  Similarity=0.559  Sum_probs=5.1

Q ss_pred             hhHHHHHHHHHH
Q psy12432         92 SGASIAVFLVVI  103 (229)
Q Consensus        92 ~gI~V~lLLVli  103 (229)
                      +|+.++++++++
T Consensus        11 ~Gl~~g~~l~~~   22 (37)
T PF13706_consen   11 LGLILGLLLFVI   22 (37)
T ss_pred             HHHHHHHHHHHH
Confidence            344444443333


No 264
>PRK09546 zntB zinc transporter; Reviewed
Probab=20.63  E-value=1.1e+02  Score=27.92  Aligned_cols=10  Identities=10%  Similarity=-0.134  Sum_probs=4.4

Q ss_pred             HHHHHHHHHh
Q psy12432        106 FSLYVHCQRR  115 (229)
Q Consensus       106 vvly~~~kRr  115 (229)
                      +++++++|||
T Consensus       312 ~~~~~~fkrk  321 (324)
T PRK09546        312 GGVAWWLKRS  321 (324)
T ss_pred             HHHHHHHHhc
Confidence            3344444444


No 265
>PRK02919 oxaloacetate decarboxylase subunit gamma; Provisional
Probab=20.60  E-value=2.7e+02  Score=20.84  Aligned_cols=6  Identities=17%  Similarity=0.163  Sum_probs=2.7

Q ss_pred             hhhHHH
Q psy12432         91 ASGASI   96 (229)
Q Consensus        91 a~gI~V   96 (229)
                      ++|.++
T Consensus        15 vlGMg~   20 (82)
T PRK02919         15 FLGMGF   20 (82)
T ss_pred             HHHHHH
Confidence            444444


No 266
>PRK04778 septation ring formation regulator EzrA; Provisional
Probab=20.56  E-value=1.4e+02  Score=29.73  Aligned_cols=22  Identities=5%  Similarity=0.054  Sum_probs=9.1

Q ss_pred             hhhHHHHHHHHHHHHHHHHHHH
Q psy12432         91 ASGASIAVFLVVILCFSLYVHC  112 (229)
Q Consensus        91 a~gI~V~lLLVlii~vvly~~~  112 (229)
                      +++|+|++++++++++++=-+.
T Consensus         6 ii~i~ii~i~~~~~~~~~rr~~   27 (569)
T PRK04778          6 IIAIVVIIIIAYLAGLILRKRN   27 (569)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHH
Confidence            3344444444443444444433


No 267
>PF10251 PEN-2:  Presenilin enhancer-2 subunit of gamma secretase;  InterPro: IPR019379  This entry is a short, 101 peptide protein, which is the smallest subunit of the gamma-secretase aspartyl protease complex. It catalyses the intra-membrane cleavage of a subset of type I transmembrane proteins. The other active constituents of the complex are presenilin (PS) nicastrin and anterior pharynx defective-1 (APH-1) protein. Presenilin enhancer-2 (PEN-2) adopts a hairpin orientation in the membrane with its N- and C-terminal domains facing the luminal/extracellular space. The C-terminal domain maintains PS stability within the complex []. 
Probab=20.54  E-value=2.8e+02  Score=21.43  Aligned_cols=28  Identities=25%  Similarity=0.374  Sum_probs=18.6

Q ss_pred             HHHHhhhHHHHHHHHHHHHHHHHHHHHH
Q psy12432         87 TASIASGASIAVFLVVILCFSLYVHCQR  114 (229)
Q Consensus        87 ~~~Ia~gI~V~lLLVlii~vvly~~~kR  114 (229)
                      ...+..+|+.++.+++++.+++.+..+|
T Consensus        50 ~YVi~SaiG~~vw~v~l~~W~~~Fq~~R   77 (94)
T PF10251_consen   50 KYVIRSAIGFLVWTVVLISWILIFQINR   77 (94)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            4555566666677777777777776644


No 268
>PRK11056 hypothetical protein; Provisional
Probab=20.48  E-value=98  Score=24.99  Aligned_cols=21  Identities=38%  Similarity=0.436  Sum_probs=10.6

Q ss_pred             HHHHHHHHHHHHHHHHHHhhh
Q psy12432         97 AVFLVVILCFSLYVHCQRRKK  117 (229)
Q Consensus        97 ~lLLVlii~vvly~~~kRrk~  117 (229)
                      .+++.+++++|++.+.+.+|+
T Consensus        96 p~il~~~L~~Wi~~kl~~~~~  116 (120)
T PRK11056         96 PAVLSVILVFWIGRKLRNRKK  116 (120)
T ss_pred             HHHHHHHHHHHHHHHHhcccc
Confidence            344555555566555554433


No 269
>PF10855 DUF2648:  Protein of unknown function (DUF2648);  InterPro: IPR022561  This family of proteins with unknown function appears to be restricted to eubacteia. 
Probab=20.40  E-value=73  Score=19.96  Aligned_cols=15  Identities=13%  Similarity=0.184  Sum_probs=6.9

Q ss_pred             HHHHHHHHHHHHHhh
Q psy12432        102 VILCFSLYVHCQRRK  116 (229)
Q Consensus       102 lii~vvly~~~kRrk  116 (229)
                      +++....|+-+||.+
T Consensus         8 L~l~ga~f~~fKKyQ   22 (33)
T PF10855_consen    8 LILGGAAFYGFKKYQ   22 (33)
T ss_pred             hhhhhHHHHHHHHHH
Confidence            333444555555544


No 270
>PF15069 FAM163:  FAM163 family
Probab=20.36  E-value=1e+02  Score=25.64  Aligned_cols=27  Identities=19%  Similarity=0.469  Sum_probs=11.5

Q ss_pred             HHHhhhHHHHHHHHHHH-HHHHHHHHHH
Q psy12432         88 ASIASGASIAVFLVVIL-CFSLYVHCQR  114 (229)
Q Consensus        88 ~~Ia~gI~V~lLLVlii-~vvly~~~kR  114 (229)
                      .+|+..++++.+|+++. |=+-|+.+||
T Consensus        10 GgILAtVILLcIIaVLCYCRLQYYCCKK   37 (143)
T PF15069_consen   10 GGILATVILLCIIAVLCYCRLQYYCCKK   37 (143)
T ss_pred             chHHHHHHHHHHHHHHHHHhhHHHHhhc
Confidence            44444444444333333 3344444554


No 271
>PF02285 COX8:  Cytochrome oxidase c subunit VIII;  InterPro: IPR003205 Cytochrome c oxidase (1.9.3.1 from EC) is an oligomeric enzymatic complex which is a component of the respiratory chain complex and is involved in the transfer of electrons from cytochrome c to oxygen []. In eukaryotes this enzyme complex is located in the mitochondrial inner membrane; in aerobic prokaryotes it is found in the plasma membrane.  In eukaryotes, in addition to the three large subunits, I, II and III, that form the catalytic centre of the enzyme complex, there are a variable number of small polypeptidic subunits.This family is composed of cytochrome c oxidase subunit VIII. ; GO: 0004129 cytochrome-c oxidase activity; PDB: 3AG3_Z 3ABM_M 1OCC_Z 3ASO_Z 3AG2_Z 3ABL_M 3AG4_M 3AG1_M 3ASN_M 1OCZ_M ....
Probab=20.20  E-value=2.6e+02  Score=18.63  Aligned_cols=21  Identities=14%  Similarity=0.064  Sum_probs=9.5

Q ss_pred             hhhHHHHHHHHHHHHHHHHHH
Q psy12432         91 ASGASIAVFLVVILCFSLYVH  111 (229)
Q Consensus        91 a~gI~V~lLLVlii~vvly~~  111 (229)
                      ++|+.+.++-+++-..|+...
T Consensus        16 aigltv~f~~~L~PagWVLsh   36 (44)
T PF02285_consen   16 AIGLTVCFVTFLGPAGWVLSH   36 (44)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHT
T ss_pred             HHHHHHHHHHHHhhHHHHHHH
Confidence            445555544444444444333


No 272
>PF02937 COX6C:  Cytochrome c oxidase subunit VIc;  InterPro: IPR004204 Cytochrome c oxidase, a 13 subunit complex, 1.9.3.1 from EC is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIc.; GO: 0004129 cytochrome-c oxidase activity; PDB: 3AG4_I 2DYS_V 3ASO_I 2EIK_V 2EIM_I 1OCC_V 1V54_V 1OCO_V 3ASN_V 2EIL_I ....
Probab=20.12  E-value=2.7e+02  Score=20.40  Aligned_cols=11  Identities=18%  Similarity=0.223  Sum_probs=4.1

Q ss_pred             HHHHHHHHHHH
Q psy12432         95 SIAVFLVVILC  105 (229)
Q Consensus        95 ~V~lLLVlii~  105 (229)
                      +++++|.+++.
T Consensus        22 ~~a~~ls~~~~   32 (73)
T PF02937_consen   22 VVAFVLSLGVA   32 (73)
T ss_dssp             HHHHHHHHHHH
T ss_pred             HHHHHHHHHHH
Confidence            33333333333


No 273
>PF04341 DUF485:  Protein of unknown function, DUF485;  InterPro: IPR007436 This family includes several putative integral membrane proteins.
Probab=20.08  E-value=1.6e+02  Score=22.05  Aligned_cols=20  Identities=20%  Similarity=0.271  Sum_probs=11.4

Q ss_pred             HHHHHHHHHHHHHHHHHHHh
Q psy12432         96 IAVFLVVILCFSLYVHCQRR  115 (229)
Q Consensus        96 V~lLLVlii~vvly~~~kRr  115 (229)
                      ++++++..++.++|.++.++
T Consensus        60 ~~~~~~~~~l~~~Yv~~An~   79 (91)
T PF04341_consen   60 LGQIVFAWVLTWLYVRRANR   79 (91)
T ss_pred             HHHHHHHHHHHHHHHHHHcc
Confidence            34445555556677766655


No 274
>PF13955 Fst_toxin:  Toxin Fst, type I toxin-antitoxin system; PDB: 2KV5_A.
Probab=20.07  E-value=1.7e+02  Score=16.65  Aligned_cols=11  Identities=27%  Similarity=0.365  Sum_probs=4.8

Q ss_pred             HhhhHHHHHHH
Q psy12432         90 IASGASIAVFL  100 (229)
Q Consensus        90 Ia~gI~V~lLL  100 (229)
                      |+..+.|++++
T Consensus         4 iIaPi~VGvvl   14 (21)
T PF13955_consen    4 IIAPIVVGVVL   14 (21)
T ss_dssp             HHHHHHHHHHH
T ss_pred             ehhhHHHHHHH
Confidence            34444444443


No 275
>PRK10905 cell division protein DamX; Validated
Probab=20.05  E-value=67  Score=30.16  Aligned_cols=16  Identities=6%  Similarity=-0.060  Sum_probs=11.9

Q ss_pred             hhHHHHHHHHHHHHHH
Q psy12432         92 SGASIAVFLVVILCFS  107 (229)
Q Consensus        92 ~gI~V~lLLVlii~vv  107 (229)
                      +.++++||||.|...+
T Consensus         5 GilVLlLLIigIgSAL   20 (328)
T PRK10905          5 GILVLLLLIIGIGSAL   20 (328)
T ss_pred             hHHHHHHHHHHHhHhh
Confidence            3467788888888864


Done!