Query         psy12980
Match_columns 609
No_of_seqs    208 out of 253
Neff          5.3 
Searched_HMMs 46136
Date          Fri Aug 16 16:04:23 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy12980.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/12980hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PF05454 DAG1:  Dystroglycan (D 100.0 4.4E-82 9.6E-87  649.3   0.1  272  322-609     1-290 (290)
  2 KOG3781|consensus              100.0 2.2E-47 4.7E-52  408.1  17.6  327  214-594   227-558 (559)
  3 smart00736 CADG Dystroglycan-t  99.8 4.5E-19 9.6E-24  155.9  12.6   94  218-316     2-95  (97)
  4 PF05454 DAG1:  Dystroglycan (D  99.7 1.4E-17   3E-22  172.6   0.0   95    2-100    18-112 (290)
  5 smart00736 CADG Dystroglycan-t  99.0 7.2E-10 1.6E-14   97.5   6.8   90  118-212     2-96  (97)
  6 PF05510 Sarcoglycan_2:  Sarcog  98.8 3.1E-07 6.7E-12   99.1  17.8  259  222-491    22-310 (386)
  7 KOG3781|consensus               98.8 1.3E-10 2.9E-15  126.4  -7.8  288  113-430   226-526 (559)
  8 PF05345 He_PIG:  Putative Ig d  98.3 1.4E-06 3.1E-11   68.2   6.1   47  253-301     2-49  (49)
  9 KOG4482|consensus               97.9   8E-05 1.7E-09   79.5  11.3  205  222-430    28-248 (449)
 10 cd00031 CA Cadherin repeat dom  96.9   0.018   4E-07   55.1  13.4  162  140-313    21-198 (199)
 11 PF12877 DUF3827:  Domain of un  95.8   0.064 1.4E-06   61.5  11.3   78  342-430   161-252 (684)
 12 PF15102 TMEM154:  TMEM154 prot  95.5   0.026 5.7E-07   53.9   5.7   37  463-499    54-90  (146)
 13 PF10873 DUF2668:  Protein of u  95.4    0.03 6.5E-07   53.2   5.8   22  474-495    68-89  (155)
 14 PF13908 Shisa:  Wnt and FGF in  95.3   0.069 1.5E-06   52.2   8.2    9  468-476    81-89  (179)
 15 PF12273 RCR:  Chitin synthesis  93.8   0.031 6.8E-07   52.0   1.7   27  474-500     6-32  (130)
 16 smart00112 CA Cadherin repeats  93.0    0.91   2E-05   37.5   9.1   68  242-314     2-73  (79)
 17 PF00028 Cadherin:  Cadherin do  92.5     1.6 3.4E-05   37.2  10.3   68  240-312    20-90  (93)
 18 PF02439 Adeno_E3_CR2:  Adenovi  91.6    0.13 2.8E-06   38.5   2.1   11  490-500    28-38  (38)
 19 PF04478 Mid2:  Mid2 like cell   90.2    0.19 4.1E-06   48.4   2.4   31  467-497    50-80  (154)
 20 cd00031 CA Cadherin repeat dom  89.9     3.2   7E-05   39.6  10.6   70  240-314    21-94  (199)
 21 PF07495 Y_Y_Y:  Y_Y_Y domain;   89.2     2.5 5.5E-05   33.8   7.9   44  267-312    20-65  (66)
 22 cd00146 PKD polycystic kidney   88.6    0.96 2.1E-05   37.7   5.2   27  286-312    55-81  (81)
 23 PF15050 SCIMP:  SCIMP protein   88.5    0.46   1E-05   44.1   3.4   28  473-500    12-40  (133)
 24 PF15418 DUF4625:  Domain of un  87.2     4.3 9.2E-05   38.4   9.1   86  220-312    27-131 (132)
 25 PF01102 Glycophorin_A:  Glycop  86.4    0.38 8.3E-06   44.9   1.6   28  471-498    65-95  (122)
 26 PF14991 MLANA:  Protein melan-  84.2    0.32 6.9E-06   44.7   0.1    9  548-556    99-107 (118)
 27 PF12245 Big_3_2:  Bacterial Ig  81.9     3.5 7.5E-05   33.5   5.2   40  274-313     9-48  (60)
 28 PF01299 Lamp:  Lysosome-associ  77.8     2.2 4.7E-05   45.3   3.6   61  422-500   241-303 (306)
 29 PF11669 WBP-1:  WW domain-bind  75.3     2.6 5.5E-05   38.2   2.8   19  537-556    72-90  (102)
 30 PF01835 A2M_N:  MG2 domain;  I  75.1     7.4 0.00016   33.8   5.7   59  250-310    37-99  (99)
 31 PF13750 Big_3_3:  Bacterial Ig  74.6      25 0.00055   34.0   9.7   34  282-315   117-150 (158)
 32 PF02494 HYR:  HYR domain;  Int  73.5     4.9 0.00011   33.9   4.0   26  287-312    56-81  (81)
 33 PF05478 Prominin:  Prominin;    72.8     1.4 3.1E-05   52.8   0.8   33  470-502    96-129 (806)
 34 PF06667 PspB:  Phage shock pro  72.6     1.3 2.9E-05   38.0   0.3   31  472-502     6-36  (75)
 35 KOG4289|consensus               72.1 1.4E+02   0.003   38.6  16.7   69  240-312   807-877 (2531)
 36 PF10577 UPF0560:  Uncharacteri  71.7     1.7 3.6E-05   51.6   0.9   30  466-495   268-299 (807)
 37 TIGR01965 VCBS_repeat VCBS rep  69.4      17 0.00037   32.9   6.7   49  278-330    25-79  (99)
 38 PHA03283 envelope glycoprotein  69.1      13 0.00028   42.5   7.1   25  480-504   411-437 (542)
 39 PF02480 Herpes_gE:  Alphaherpe  68.2     1.7 3.6E-05   48.8   0.0   49  225-275   143-194 (439)
 40 PRK09458 pspB phage shock prot  66.8     2.6 5.7E-05   36.2   0.9   30  474-503     8-37  (75)
 41 KOG1094|consensus               64.0      15 0.00033   42.8   6.5   30  471-500   392-422 (807)
 42 smart00089 PKD Repeats in poly  63.3      12 0.00026   30.9   4.2   25  287-312    54-78  (79)
 43 PF02439 Adeno_E3_CR2:  Adenovi  63.2     4.3 9.3E-05   30.6   1.3   27  474-500     7-35  (38)
 44 PF05345 He_PIG:  Putative Ig d  62.5     9.6 0.00021   29.9   3.3   27  160-186    14-43  (49)
 45 PF05568 ASFV_J13L:  African sw  62.4     5.3 0.00011   38.5   2.1   55  482-544    43-97  (189)
 46 PRK13211 N-acetylglucosamine-b  61.1      30 0.00065   39.5   8.1   64  250-314   342-406 (478)
 47 PF14610 DUF4448:  Protein of u  60.3     8.4 0.00018   38.0   3.3   29  466-496   158-186 (189)
 48 PF00974 Rhabdo_glycop:  Rhabdo  59.8       3 6.4E-05   47.6   0.0   17  230-246   146-162 (501)
 49 PF13754 Big_3_4:  Bacterial Ig  58.8      28 0.00061   27.6   5.4   27  286-312    22-49  (54)
 50 TIGR02976 phageshock_pspB phag  58.5     3.5 7.6E-05   35.4   0.2   27  471-499     7-33  (75)
 51 PF12273 RCR:  Chitin synthesis  58.4     3.1 6.8E-05   38.7  -0.1   26  475-500     4-29  (130)
 52 TIGR00864 PCC polycystin catio  57.0 1.7E+02  0.0037   40.1  14.7   61  245-311  1799-1864(2740)
 53 KOG1219|consensus               56.0 1.2E+02  0.0026   41.2  12.4  186  112-311   949-1150(4289)
 54 PF15069 FAM163:  FAM163 family  55.6       9 0.00019   36.8   2.4   18  469-486     5-22  (143)
 55 PF08693 SKG6:  Transmembrane a  55.1      10 0.00022   29.0   2.1   23  470-495    17-39  (40)
 56 PF14979 TMEM52:  Transmembrane  54.7     6.1 0.00013   38.1   1.2   26  463-489    15-40  (154)
 57 PF08374 Protocadherin:  Protoc  54.7      11 0.00024   38.5   3.0   30  464-493    36-65  (221)
 58 PF14575 EphA2_TM:  Ephrin type  53.6     9.7 0.00021   32.6   2.1   13  488-500    20-32  (75)
 59 KOG4331|consensus               52.4     3.9 8.5E-05   48.7  -0.6   37  463-500   103-141 (865)
 60 PF01034 Syndecan:  Syndecan do  51.9       5 0.00011   33.5   0.1   35  468-503    11-47  (64)
 61 TIGR01965 VCBS_repeat VCBS rep  51.7      25 0.00054   31.8   4.5   17  202-218    72-88  (99)
 62 PF06365 CD34_antigen:  CD34/Po  50.5      13 0.00027   37.8   2.7   23  358-384    38-60  (202)
 63 KOG4289|consensus               49.6 1.9E+02  0.0042   37.5  12.5   84  241-332   503-593 (2531)
 64 PHA03286 envelope glycoprotein  49.0     7.1 0.00015   43.8   0.7   26  471-496   395-420 (492)
 65 PHA03291 envelope glycoprotein  49.0     8.6 0.00019   41.9   1.3   29  282-313   136-164 (401)
 66 PF00558 Vpu:  Vpu protein;  In  48.0      13 0.00028   32.5   2.1   30  470-499     8-37  (81)
 67 PF05393 Hum_adeno_E3A:  Human   44.7      13 0.00028   33.0   1.5   12  489-500    54-66  (94)
 68 PHA03099 epidermal growth fact  44.3      17 0.00037   34.4   2.4   10  384-393    38-47  (139)
 69 PF14914 LRRC37AB_C:  LRRC37A/B  44.3      10 0.00022   36.7   0.9   21  475-495   129-149 (154)
 70 PF13908 Shisa:  Wnt and FGF in  43.1      15 0.00032   35.9   1.9   22  466-487    75-96  (179)
 71 PF15065 NCU-G1:  Lysosomal tra  42.2     7.2 0.00016   42.6  -0.5   29  276-304   117-145 (350)
 72 PRK00269 zipA cell division pr  42.2      14  0.0003   39.4   1.6   27  475-501     9-36  (293)
 73 PHA03099 epidermal growth fact  41.5      24 0.00051   33.5   2.8   16  487-502   118-133 (139)
 74 PF02038 ATP1G1_PLM_MAT8:  ATP1  41.3      11 0.00023   30.1   0.5   25  471-495    15-39  (50)
 75 PF07204 Orthoreo_P10:  Orthore  40.9      12 0.00026   33.6   0.8    7  432-438     6-12  (98)
 76 PF09394 Inhibitor_I42:  Chagas  40.5      59  0.0013   27.7   5.1   74  223-313     1-74  (92)
 77 TIGR00864 PCC polycystin catio  39.9 2.7E+02  0.0059   38.4  12.6   61  239-314  1449-1509(2740)
 78 PF07523 Big_3:  Bacterial Ig-l  39.4      81  0.0018   25.7   5.5   50  250-312    18-67  (67)
 79 PF08329 ChitinaseA_N:  Chitina  37.3      55  0.0012   31.2   4.6   49  287-336    82-130 (133)
 80 PF15471 TMEM171:  Transmembran  36.0 1.2E+02  0.0026   32.3   7.2   27  463-490   156-182 (319)
 81 PF15298 AJAP1_PANP_C:  AJAP1/P  33.8      63  0.0014   32.8   4.6   54  465-518    98-159 (205)
 82 PF05337 CSF-1:  Macrophage col  33.7      14  0.0003   39.2   0.0   27  470-499   229-255 (285)
 83 PF13753 SWM_repeat:  Putative   33.6 5.9E+02   0.013   26.8  15.0  207   80-314    11-242 (317)
 84 PF00801 PKD:  PKD domain;  Int  33.4 2.5E+02  0.0055   22.5   8.0   18  287-304    51-68  (69)
 85 PF11770 GAPT:  GRB2-binding ad  33.1      38 0.00083   32.9   2.8   17  480-496    20-36  (158)
 86 PF02009 Rifin_STEVOR:  Rifin/s  32.2      11 0.00025   40.3  -0.9   20  478-497   266-286 (299)
 87 KOG1219|consensus               32.0 3.6E+02  0.0078   37.2  11.3  185  107-313  1048-1251(4289)
 88 PF15347 PAG:  Phosphoprotein a  31.9      70  0.0015   35.4   4.9   36  468-504    15-52  (428)
 89 PF10164 DUF2367:  Uncharacteri  31.6      33 0.00072   31.0   2.0   23  474-496    68-90  (98)
 90 TIGR03660 T1SS_rpt_143 T1SS-14  31.1 3.4E+02  0.0074   25.9   8.9   31  286-316    83-113 (137)
 91 PTZ00382 Variant-specific surf  30.4      40 0.00086   30.2   2.4   11  485-495    85-95  (96)
 92 PF04234 CopC:  CopC domain;  I  30.0 3.2E+02  0.0069   23.9   8.0   62  251-313    35-97  (97)
 93 cd07912 Tweety_N N-terminal do  29.6      41 0.00088   37.8   2.8   11  468-478    41-51  (418)
 94 PF05915 DUF872:  Eukaryotic pr  29.1      25 0.00054   32.6   0.9   25  472-496    77-105 (115)
 95 PF11857 DUF3377:  Domain of un  28.9      88  0.0019   27.0   4.0   31  467-499    31-61  (74)
 96 PF05545 FixQ:  Cbb3-type cytoc  28.7      36 0.00078   26.4   1.6   18  482-500    21-38  (49)
 97 PF13860 FlgD_ig:  FlgD Ig-like  28.4 1.2E+02  0.0027   25.6   5.0   12  288-299    69-80  (81)
 98 PRK12633 flgD flagellar basal   28.2 2.1E+02  0.0045   29.6   7.4   25  279-305   166-190 (230)
 99 PTZ00364 dipeptidyl-peptidase   27.9      48   0.001   38.5   3.1   38  463-500   468-514 (548)
100 COG4736 CcoQ Cbb3-type cytochr  27.6      30 0.00066   28.6   1.0   28  490-528    28-55  (60)
101 PF06679 DUF1180:  Protein of u  26.6      47   0.001   32.7   2.3   16  498-513   127-142 (163)
102 PF15415 DUF4622:  Protein of u  26.3 1.5E+02  0.0032   31.3   5.8  147  207-385   148-305 (310)
103 PF09680 Tiny_TM_bacill:  Prote  25.3      66  0.0014   22.1   2.1   16  474-491     8-23  (24)
104 PF09315 DUF1973:  Domain of un  25.1 4.7E+02    0.01   26.0   9.0   76  250-332    42-120 (179)
105 PF12768 Rax2:  Cortical protei  24.9      88  0.0019   33.3   4.1   22  474-495   237-258 (281)
106 PRK06439 NADH dehydrogenase su  24.6      66  0.0014   27.7   2.5   29  467-495    43-71  (72)
107 TIGR03370 PEPCTERM_Roseo varia  24.6      33 0.00072   23.9   0.6   18  480-497     8-25  (26)
108 COG5513 Predicted secreted pro  24.5 5.9E+02   0.013   23.8   9.5   90  224-329    22-111 (113)
109 PLN03150 hypothetical protein;  24.5      29 0.00062   40.6   0.5    7  308-314   340-346 (623)
110 PF08496 Peptidase_S49_N:  Pept  24.3      59  0.0013   31.6   2.5   32  472-503    14-45  (155)
111 TIGR00868 hCaCC calcium-activa  24.0 6.2E+02   0.013   31.4  11.3  102  223-332   495-604 (863)
112 PF06365 CD34_antigen:  CD34/Po  23.9      73  0.0016   32.5   3.2   26  469-495   101-128 (202)
113 PRK12634 flgD flagellar basal   23.8 1.5E+02  0.0033   30.4   5.5   26  280-307   160-185 (221)
114 PF05140 ResB:  ResB-like famil  23.7      42  0.0009   37.7   1.6   33  464-496   396-431 (464)
115 PF00974 Rhabdo_glycop:  Rhabdo  23.7      26 0.00057   40.1   0.0    9  462-470   449-457 (501)
116 PF08372 PRT_C:  Plant phosphor  23.4      51  0.0011   32.2   1.9   24  520-543    36-59  (156)
117 PHA03289 envelope glycoprotein  23.1 2.6E+02  0.0056   30.6   7.1   29  282-313   129-157 (352)
118 KOG3521|consensus               22.8 1.5E+02  0.0032   35.4   5.6   38  504-551   768-807 (846)
119 PF01601 Corona_S2:  Coronaviru  22.3      29 0.00063   40.4   0.0   24  121-144   146-170 (610)
120 KOG4221|consensus               22.1      64  0.0014   40.7   2.7   34  460-494  1052-1085(1381)
121 PF05808 Podoplanin:  Podoplani  22.0      30 0.00065   34.0   0.0   35  465-501   124-161 (162)
122 PF12669 P12:  Virus attachment  21.7      64  0.0014   26.3   1.9    8  491-498    20-27  (58)
123 PF15102 TMEM154:  TMEM154 prot  21.6      60  0.0013   31.4   1.9   40  461-500    49-88  (146)
124 PF14967 FAM70:  FAM70 protein   21.5 2.2E+02  0.0048   30.8   6.2   41  449-489   175-216 (327)
125 PF10577 UPF0560:  Uncharacteri  21.4      32 0.00068   41.4   0.0   30  460-489   267-297 (807)
126 PF13750 Big_3_3:  Bacterial Ig  20.5 2.1E+02  0.0045   27.7   5.5   36  288-324    15-51  (158)

No 1  
>PF05454 DAG1:  Dystroglycan (Dystrophin-associated glycoprotein 1);  InterPro: IPR008465 Dystroglycan is one of the dystrophin-associated glycoproteins, which is encoded by a 5.5 kb transcript in Homo sapiens. The protein product is cleaved into two non-covalently associated subunits, [alpha] (N-terminal) and [beta] (C-terminal). In skeletal muscle the dystroglycan complex works as a transmembrane linkage between the extracellular matrix and the cytoskeleton [alpha]-dystroglycan is extracellular and binds to merosin ([alpha]-2 laminin) in the basement membrane, while [beta]-dystroglycan is a transmembrane protein and binds to dystrophin, which is a large rod-like cytoskeletal protein, absent in Duchenne muscular dystrophy patients. Dystrophin binds to intracellular actin cables. In this way, the dystroglycan complex, which links the extracellular matrix to the intracellular actin cables, is thought to provide structural integrity in muscle tissues. The dystroglycan complex is also known to serve as an agrin receptor in muscle, where it may regulate agrin-induced acetylcholine receptor clustering at the neuromuscular junction. There is also evidence which suggests the function of dystroglycan as a part of the signal transduction pathway because it is shown that Grb2, a mediator of the Ras-related signal pathway, can interact with the cytoplasmic domain of dystroglycan. In general, aberrant expression of dystrophin-associated protein complex underlies the pathogenesis of Duchenne muscular dystrophy, Becker muscular dystrophy and severe childhood autosomal recessive muscular dystrophy. Interestingly, no genetic disease has been described for either [alpha]- or [beta]-dystroglycan. Dystroglycan is widely distributed in non-muscle tissues as well as in muscle tissues. During epithelial morphogenesis of kidney, the dystroglycan complex is shown to act as a receptor for the basement membrane. Dystroglycan expression in Mus musculus brain and neural retina has also been reported. However, the physiological role of dystroglycan in non-muscle tissues has remained unclear [].; PDB: 1EG4_P.
Probab=100.00  E-value=4.4e-82  Score=649.26  Aligned_cols=272  Identities=42%  Similarity=0.692  Sum_probs=1.0

Q ss_pred             eEEEEEecCCCCCcCchHHH-HHHHHHHHHHhCCCCCCcEEEEeeecCCCCCCCcEEEEEEecCCCCCCCCCHHHHHHHH
Q psy12980        322 ASFSLTLARPSSTFAQPAAQ-RAFLEKLAGLFGDRNASKIVNIEFEDNPQEPGKSTVVRWANRTLASNVECPETTIGLLR  400 (609)
Q Consensus       322 vef~L~l~~~~~~ft~~~~q-~~LVekLA~llGD~d~s~ItVrsI~~~~~~~~gSTvV~w~N~TLp~~~~CP~~eI~~L~  400 (609)
                      |+|+++|++++++|+++.++ ++||++||.+|||+|+++|+||+|+      +|||+|+|||+||+. +.||+++|++|+
T Consensus         1 ~~F~~~l~~d~~~f~~dv~~ki~lVekLA~~~GD~nts~ItV~sIt------~gstiVtwtNnTLp~-~~CP~eeI~~L~   73 (290)
T PF05454_consen    1 VEFSATLDIDYESFNNDVQRKILLVEKLARLFGDRNTSSITVRSIT------SGSTIVTWTNNTLPT-SPCPKEEIEKLR   73 (290)
T ss_dssp             --------------------------------------------------------------------------------
T ss_pred             CeeEEEEcCCHHHhhhhHHHHHHHHHHHHHHhCCCCCCeEEEEEec------CCCEEEEEEcCCCCC-CCCCHHHHHHHH
Confidence            68999999999999986544 7899999999999999999999999      899999999999998 899999999999


Q ss_pred             HHHhhccchhhHHHHHhhcccceeeEEeeeeccccCCCCCCCCCCCCCCCCC------CCCCCCCCCCCccEEEEeehHH
Q psy12980        401 EVLLTEDDLLQESLVTSLEKDYGVKHASVIPTGICEGLKTPLHTPGVERKPA------YSGKNTPANASVEYLITFVAPL  474 (609)
Q Consensus       401 ~kL~~~dg~~s~~f~~am~peF~V~~Vs~~g~G~C~~~t~~~~~pe~~~p~~------~~~~~~~~na~~~~Llv~ViPa  474 (609)
                      ++|.+++|+++++|+++|+|+|+|++|++.++|+|++.......+.+..|..      +......+.++++||+ ++||+
T Consensus        74 ~~L~~~~g~~~~~f~~am~pef~V~svsv~~~G~C~~~~~ip~~~~~~iP~~~~~t~~p~~~~~~~s~~d~yL~-T~Ipa  152 (290)
T PF05454_consen   74 KRLVDDDGKPSQEFVRAMGPEFKVKSVSVIPIGSCQDTSFIPTPPDSEIPTPAPPTNTPDEPSPKSSFSDDYLH-TFIPA  152 (290)
T ss_dssp             --------------------------------------------------------------------------------
T ss_pred             HHHhcCCCCcCHHHHHHhCCCCceeEEEEEEeeccCCCccCCCCCcCcCCCCCCCCCCCCCCCCCcccccchHH-HHHHH
Confidence            9999999999999999999999999999999999999766543222223321      1111123444565655 55799


Q ss_pred             HHHHHHHHHHHHhheeeecccCCCCcccCCCcccccccccCCCCEEcccccccccCCCCCCCCCeeccCCCCCCCCCCCC
Q psy12980        475 VVIVIMLLCAALIACLLYRRRHTGKMSVGNGVDSGILIRSKGIPVIFQDELEERLEVSSGTKSPIILKEEKPPLPPPEYN  554 (609)
Q Consensus       475 VIIa~mllia~iIaC~cyRrKRkgK~~i~d~deE~~tF~~KGiPvIF~dElde~~~~~~~~~~P~IlkeEkpplpppey~  554 (609)
                      |||++|||||++||||||||||||||++    |||++||+|||||||||||||. ||++++|+||||||||||||||||+
T Consensus       153 VVI~~iLLIA~iIa~icyrrkR~GK~~~----ee~~~f~~KGiPvIF~dElee~-kp~~~~~~P~IlkeEkPPl~pp~y~  227 (290)
T PF05454_consen  153 VVIAAILLIAGIIACICYRRKRKGKMSL----EEQKTFISKGIPVIFQDELEES-KPEPGSKSPVILKEEKPPLPPPEYP  227 (290)
T ss_dssp             --------------------------------------------------------------------------------
T ss_pred             HHHHHHHHHHHHHHHHhhhhhhcccccc----chhHHHHhcCCceecccccccc-CCCCCCCCCeeecccCCCCCCCCCC
Confidence            9999999999999999999999999998    5699999999999999999995 4999999999999999999999999


Q ss_pred             CCCCCCCCCcCccCCCCCCccccccCCCCCCCCCCCCCCC---------C--CCCCCCCCCCCCCC
Q psy12980        555 RDKNDFSRDKNDYSRSQPNATTALLSDIDDTSPYHPPPPI---------R--PAPTYRLPPPYSPP  609 (609)
Q Consensus       555 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~pppp~---------r--~~~~yr~pppyvpp  609 (609)
                      +++++++++|+.++.+   +..++.++|+|++||||||||         |  +||+||+|||||||
T Consensus       228 ~~~~~~~~p~~~~~~~---~~~~~~~~~~~~~~y~ppp~~~~~~~~~~~~p~~~~~yr~PPpyvpp  290 (290)
T PF05454_consen  228 NSNMPESTPLNQDTLG---EYTPLRDEDPDAPPYQPPPPFTSPGEGRRSRPKNTPPYRSPPPYVPP  290 (290)
T ss_dssp             -------------------------------------------------------S----------
T ss_pred             CCCCCCCCCccccccc---cccccccCCCCCCccCCCCCcccCCCCCCCCCCCCCCCCCCCCCCCC
Confidence            9999999988433311   112767777899999999999         3  69999999999998


No 2  
>KOG3781|consensus
Probab=100.00  E-value=2.2e-47  Score=408.06  Aligned_cols=327  Identities=25%  Similarity=0.367  Sum_probs=289.7

Q ss_pred             ceecCccceeeeeeCCeEEEEcCCCccccCCCCCeeeEEEEEeccCCCCCCCCccceeccCCceEEeecCCCCceeEEEE
Q psy12980        214 PLIRNPIDHLEAISGELLVYHVPEDTFFDHEDGGTRKLKLHLMTMDRTTIPPTHWLQFDAKNQEFYGIPQLTDLGRREYQ  293 (609)
Q Consensus       214 Pvv~n~i~~~~at~G~~f~f~IP~dTF~D~eDGdT~~L~lsl~~~Dg~~l~~~sWL~Fd~~t~tf~G~P~~~dvG~~~f~  293 (609)
                      |++.|++..+++++|+.|...||+|||.|.||+++..+++...+.||..    .|++|+..++-++|+|++  +|+++|.
T Consensus       227 P~l~n~~~~~~~~~g~~~~m~IpSdTf~dsed~~~f~~kl~ilsld~ae----vwv~f~sNskd~~glPms--~gkf~y~  300 (559)
T KOG3781|consen  227 PLLKNQVLRVNCLSGQHCHMHIPSDTFEDSEDITSFIIKLEILSLDGAE----VWVKFVSNSKDSIGLPMS--CGKFHYV  300 (559)
T ss_pred             ccccCcceeeeeecchhhheeCCccchhhhhhcccceeeeeeecCCCce----eEEEeccCCccccccccc--ccceeee
Confidence            9999999999999999999999999999999999999999999999976    899999988999999999  6999999


Q ss_pred             EEEEcCCCCeeEeeEEEEEECCCCCCcceEEEEEecCCCCCcCchHH-HHHHHHHHHHHhCCCCCCcEEEEeeecCCCCC
Q psy12980        294 LVCKDSNGLMANDGLEVIVKSPGPAYYSASFSLTLARPSSTFAQPAA-QRAFLEKLAGLFGDRNASKIVNIEFEDNPQEP  372 (609)
Q Consensus       294 LtATDs~G~sasd~F~V~V~~~p~~~~~vef~L~l~~~~~~ft~~~~-q~~LVekLA~llGD~d~s~ItVrsI~~~~~~~  372 (609)
                      +.|+|.+|++++|+|+|+|.+.+..++.|.|.+.+..+ +++.+++. +-.|++.||..+|++. ..|+++.|...    
T Consensus       301 lea~dk~Gq~a~~pf~vnv~~~~s~n~~~~f~~~~p~d-~~~~~np~~rt~Lv~~La~a~~~~~-~~itLq~I~~~----  374 (559)
T KOG3781|consen  301 LEASDKDGQSATDPFVVNVIPLPSANHLHTFHFPFPPD-TFYDQNPGNRTHLVLELASALGYPS-GEITLQFIMTI----  374 (559)
T ss_pred             eeeecCCCCcccCCeEEeeeecCCCccceEEeccCCCc-hhhhcCCcchhHHHHHHHHhhcCCc-cchhhhhhhhh----
Confidence            99999999999999999999998888988888887755 45555555 4689999999999986 69999999842    


Q ss_pred             CCcEEEEEEecCCCCCCCCCHHHHHHHHH-HHhhccchhhHHHHHhhcccceeeEEeeeeccccCCCCCCCCCCCCCCCC
Q psy12980        373 GKSTVVRWANRTLASNVECPETTIGLLRE-VLLTEDDLLQESLVTSLEKDYGVKHASVIPTGICEGLKTPLHTPGVERKP  451 (609)
Q Consensus       373 ~gSTvV~w~N~TLp~~~~CP~~eI~~L~~-kL~~~dg~~s~~f~~am~peF~V~~Vs~~g~G~C~~~t~~~~~pe~~~p~  451 (609)
                      .++++++|.|+++|+ +.|+.++|+.+.. ++..++|+++.+|.+.|+++|+++.+++..+|+|.+..++...       
T Consensus       375 r~~~~vsf~nnslp~-e~C~s~qI~~va~kmi~~~~~rpr~~f~~tm~~dfk~kt~s~~k~gnc~hl~fI~lt-------  446 (559)
T KOG3781|consen  375 RYGTPVSFDNNSLPY-EFCDSDQIGLVAQKMITIELGRPRSSFCHTMGLDFKVKTASFIKRGNCIHLFFIKLT-------  446 (559)
T ss_pred             ccceeEEeecccCcc-ccCcHHHHHHHhhhhhHHhcCCccchhhhhccccceeeEEEeeccccccceeeeecc-------
Confidence            789999999999999 8899999999995 6677788999999999999999999999999999876433210       


Q ss_pred             CCCCCCCCCCCCccEEEEeehHHHHHHHHHHHHHHhheeeecccCCCCcccCCCcccccccccCCCCEEcccccccccCC
Q psy12980        452 AYSGKNTPANASVEYLITFVAPLVVIVIMLLCAALIACLLYRRRHTGKMSVGNGVDSGILIRSKGIPVIFQDELEERLEV  531 (609)
Q Consensus       452 ~~~~~~~~~na~~~~Llv~ViPaVIIa~mllia~iIaC~cyRrKRkgK~~i~d~deE~~tF~~KGiPvIF~dElde~~~~  531 (609)
                                  -||++..+.|.++++.++..|.||+|+|  .++|||++.++     ..|.+||.||||.+|+++    
T Consensus       447 ------------PD~~l~~~~~~l~l~tl~t~a~Ii~~~~--~lk~Gk~ks~~-----~~~~kk~lpIvf~lel~d----  503 (559)
T KOG3781|consen  447 ------------PDYLLSSNPPQLVLITLRTGATIISWIC--ALKKGKSKSLK-----WCPNKKILPIVFKLELPD----  503 (559)
T ss_pred             ------------chhhhccCCceeeehhhHhhhHHHHhHH--HHhhccccccc-----eeeccCceeEEEEeecCc----
Confidence                        1788889999999999999999999998  67789999875     799999999999999998    


Q ss_pred             CCCCCCCeeccCCCCCCCCCCCCCCCCCCCCCc---CccCCCCCCccccccCCCCCCCCCCCCCCC
Q psy12980        532 SSGTKSPIILKEEKPPLPPPEYNRDKNDFSRDK---NDYSRSQPNATTALLSDIDDTSPYHPPPPI  594 (609)
Q Consensus       532 ~~~~~~P~IlkeEkpplpppey~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~y~pppp~  594 (609)
                        ++.+|+++|||||||++|+|+|. -..+.+|   |+|.        .....++++++||+|+|+
T Consensus       504 --~~~~pmlvkeek~plk~~~~~N~-i~~t~pl~t~g~~~--------l~~~nden~p~~q~p~p~  558 (559)
T KOG3781|consen  504 --GNVHPMLVKEEKPPLKIPTVENI-ICLTLPLQTIGNSI--------LTSSNDENSPWIQNPIPA  558 (559)
T ss_pred             --cCCchhHHHhhcCCCCccccCcc-cccCCChhhcCCce--------ecccCCccCCcccCCCCC
Confidence              59999999999999999999987 5555555   5554        334456799999999986


No 3  
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=99.80  E-value=4.5e-19  Score=155.88  Aligned_cols=94  Identities=30%  Similarity=0.558  Sum_probs=87.8

Q ss_pred             CccceeeeeeCCeEEEEcCCCccccCCCCCeeeEEEEEeccCCCCCCCCccceeccCCceEEeecCCCCceeEEEEEEEE
Q psy12980        218 NPIDHLEAISGELLVYHVPEDTFFDHEDGGTRKLKLHLMTMDRTTIPPTHWLQFDAKNQEFYGIPQLTDLGRREYQLVCK  297 (609)
Q Consensus       218 n~i~~~~at~G~~f~f~IP~dTF~D~eDGdT~~L~lsl~~~Dg~~l~~~sWL~Fd~~t~tf~G~P~~~dvG~~~f~LtAT  297 (609)
                      |.|.+++|++|+.|+|+||.++|.|.| |+  +|+|++...+|.+||  +||+||+.+++|+|+|...++|.|+|+|+|+
T Consensus         2 ~~i~~~~a~~g~~f~~~i~~~tF~d~d-~~--~lty~~~~~~~~~lP--~Wl~fd~~~~~~~GtP~~~~~g~~~i~v~a~   76 (97)
T smart00736        2 NAIGDQTATEGESFSYTIPSSTFTDAD-GD--TLTYSATLSDGSALP--SWLSFDSDTGTLSGTPTNSDVGSLSLKVTAT   76 (97)
T ss_pred             CcccceEEeCCCeEEEEcChhceECCC-CC--eEEEEEEeCCCCCCC--CeEEEeCCCCEEEEECCCCCCcEEEEEEEEE
Confidence            678999999999999999999999996 77  888888888888887  8999999999999999999999999999999


Q ss_pred             cCCCCeeEeeEEEEEECCC
Q psy12980        298 DSNGLMANDGLEVIVKSPG  316 (609)
Q Consensus       298 Ds~G~sasd~F~V~V~~~p  316 (609)
                      |+.|.++++.|+|.|.+++
T Consensus        77 D~~g~~~~~~f~i~V~~~~   95 (97)
T smart00736       77 DSSGASASDTFTITVVNTN   95 (97)
T ss_pred             ECCCCEEEEEEEEEEeCCC
Confidence            9999999999999998753


No 4  
>PF05454 DAG1:  Dystroglycan (Dystrophin-associated glycoprotein 1);  InterPro: IPR008465 Dystroglycan is one of the dystrophin-associated glycoproteins, which is encoded by a 5.5 kb transcript in Homo sapiens. The protein product is cleaved into two non-covalently associated subunits, [alpha] (N-terminal) and [beta] (C-terminal). In skeletal muscle the dystroglycan complex works as a transmembrane linkage between the extracellular matrix and the cytoskeleton [alpha]-dystroglycan is extracellular and binds to merosin ([alpha]-2 laminin) in the basement membrane, while [beta]-dystroglycan is a transmembrane protein and binds to dystrophin, which is a large rod-like cytoskeletal protein, absent in Duchenne muscular dystrophy patients. Dystrophin binds to intracellular actin cables. In this way, the dystroglycan complex, which links the extracellular matrix to the intracellular actin cables, is thought to provide structural integrity in muscle tissues. The dystroglycan complex is also known to serve as an agrin receptor in muscle, where it may regulate agrin-induced acetylcholine receptor clustering at the neuromuscular junction. There is also evidence which suggests the function of dystroglycan as a part of the signal transduction pathway because it is shown that Grb2, a mediator of the Ras-related signal pathway, can interact with the cytoplasmic domain of dystroglycan. In general, aberrant expression of dystrophin-associated protein complex underlies the pathogenesis of Duchenne muscular dystrophy, Becker muscular dystrophy and severe childhood autosomal recessive muscular dystrophy. Interestingly, no genetic disease has been described for either [alpha]- or [beta]-dystroglycan. Dystroglycan is widely distributed in non-muscle tissues as well as in muscle tissues. During epithelial morphogenesis of kidney, the dystroglycan complex is shown to act as a receptor for the basement membrane. Dystroglycan expression in Mus musculus brain and neural retina has also been reported. However, the physiological role of dystroglycan in non-muscle tissues has remained unclear [].; PDB: 1EG4_P.
Probab=99.65  E-value=1.4e-17  Score=172.56  Aligned_cols=95  Identities=33%  Similarity=0.610  Sum_probs=0.0

Q ss_pred             cchhhHHHHHHHHHhCCCCCcccEEEEeeecCCCcEEEEEecCCCCCCCCCHHHHHHHHHHhccCCCCCchHHHHHhhcc
Q psy12980          2 VDWQIFVYNILLDLYGNSGNSSYITVRSVVTNKEPYVFTYTNESLPRNECPTAQINNLLEVISNTTTGQAKPVLTRALGR   81 (609)
Q Consensus         2 ~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~f~~~~~~~p~~~c~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~   81 (609)
                      ++|+++++++||++|||. |+++|+||+|+  ++.+.|+|+|.|||.+.||+++|++|.++|.+. .|+.+++|+++|+|
T Consensus        18 v~~ki~lVekLA~~~GD~-nts~ItV~sIt--~gstiVtwtNnTLp~~~CP~eeI~~L~~~L~~~-~g~~~~~f~~am~p   93 (290)
T PF05454_consen   18 VQRKILLVEKLARLFGDR-NTSSITVRSIT--SGSTIVTWTNNTLPTSPCPKEEIEKLRKRLVDD-DGKPSQEFVRAMGP   93 (290)
T ss_dssp             --------------------------------------------------------------------------------
T ss_pred             HHHHHHHHHHHHHHhCCC-CCCeEEEEEec--CCCEEEEEEcCCCCCCCCCHHHHHHHHHHHhcC-CCCcCHHHHHHhCC
Confidence            688999999999999998 99999999998  578999999999999999999999999999977 99999999999999


Q ss_pred             ceeeeeeeeecccccCCCC
Q psy12980         82 LMTVKNITWRGIGHCEDSP  100 (609)
Q Consensus        82 ~~~vk~~~~~~~g~~~~~~  100 (609)
                      ++.|+++++.++|+|....
T Consensus        94 ef~V~svsv~~~G~C~~~~  112 (290)
T PF05454_consen   94 EFKVKSVSVIPIGSCQDTS  112 (290)
T ss_dssp             -------------------
T ss_pred             CCceeEEEEEEeeccCCCc
Confidence            9999999999999999854


No 5  
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=99.00  E-value=7.2e-10  Score=97.55  Aligned_cols=90  Identities=22%  Similarity=0.266  Sum_probs=67.4

Q ss_pred             CcCccceeecCccEEEEcCCCCcccCCCCCeeeEEEEEecc---cceecccC--ccCcCccccccccceeeEEEeeeecc
Q psy12980        118 NPIDHLEAISGELLVYHVPEDTFFDHEDGGTRKLKLHLMTM---DRYVISNT--TTGQAKPVLTRALGRLMTVKNITWRG  192 (609)
Q Consensus       118 ~pi~~~~a~e~~~fs~~iP~~tF~D~d~Gdt~~ltlsl~~~---d~~~~~~t--~T~sgtp~~~d~~~~~v~v~~~~~~~  192 (609)
                      |.|.+++|++|+.|+|+||.++|.|.| |++.+|++++..+   -+|+.||.  ++++|+|..++.+..++.|+..+..+
T Consensus         2 ~~i~~~~a~~g~~f~~~i~~~tF~d~d-~~~lty~~~~~~~~~lP~Wl~fd~~~~~~~GtP~~~~~g~~~i~v~a~D~~g   80 (97)
T smart00736        2 NAIGDQTATEGESFSYTIPSSTFTDAD-GDTLTYSATLSDGSALPSWLSFDSDTGTLSGTPTNSDVGSLSLKVTATDSSG   80 (97)
T ss_pred             CcccceEEeCCCeEEEEcChhceECCC-CCeEEEEEEeCCCCCCCCeEEEeCCCCEEEEECCCCCCcEEEEEEEEEECCC
Confidence            678999999999999999999999998 8875555544332   25999976  66789999999988888888766654


Q ss_pred             cccccCCCCCCceecccCCC
Q psy12980        193 IGHCEDSPPPPVVKVKLENQ  212 (609)
Q Consensus       193 ~~~~~~~~~~~tv~v~~~N~  212 (609)
                      .. +..   .+++.|.+.|+
T Consensus        81 ~~-~~~---~f~i~V~~~~~   96 (97)
T smart00736       81 AS-ASD---TFTITVVNTND   96 (97)
T ss_pred             CE-EEE---EEEEEEeCCCC
Confidence            22 211   36666666665


No 6  
>PF05510 Sarcoglycan_2:  Sarcoglycan alpha/epsilon;  InterPro: IPR008908 Sarcoglycans are a subcomplex of transmembrane proteins which are part of the dystrophin-glycoprotein complex. They are expressed in the skeletal, cardiac and smooth muscle. Although numerous studies have been conducted on the sarcoglycan subcomplex in skeletal and cardiac muscle, the manner of the distribution and localisation of these proteins along the nonjunctional sarcolemma is not clear []. This family contains alpha and epsilon members.; GO: 0016012 sarcoglycan complex
Probab=98.76  E-value=3.1e-07  Score=99.08  Aligned_cols=259  Identities=15%  Similarity=0.220  Sum_probs=158.7

Q ss_pred             eeeeeeCCeEEEEcCCCccccCCCCCeeeEEEEEeccCCCCCCCCccceeccCC----ceEEeecCCCCcee-EEEEEEE
Q psy12980        222 HLEAISGELLVYHVPEDTFFDHEDGGTRKLKLHLMTMDRTTIPPTHWLQFDAKN----QEFYGIPQLTDLGR-REYQLVC  296 (609)
Q Consensus       222 ~~~at~G~~f~f~IP~dTF~D~eDGdT~~L~lsl~~~Dg~~l~~~sWL~Fd~~t----~tf~G~P~~~dvG~-~~f~LtA  296 (609)
                      .+++.+|..|.|+|=..+|-.-.++.+...+++....+-=.||  +||.|-..+    +-+||+|...++|+ ..+.|+|
T Consensus        22 ~~~~~~g~~Fv~~l~~~~F~~~~~~~~~~~~f~~sL~g~PDLP--~WLry~q~~~~~~GfLYGTP~~~~~g~~~~iEVia   99 (386)
T PF05510_consen   22 TLTALVGELFVHELEPETFNWTFQPSSDQFTFRASLQGYPDLP--SWLRYTQSHPYHPGFLYGTPTSEDAGKPVVIEVIA   99 (386)
T ss_pred             eeccccceEEEEEeCHHHCCCcCCCCcceEEEecccCCCCCCc--chhhhccCCCCCCeeEecCCCCccCCCceEEEEEE
Confidence            6789999999999999999555556666788888755443466  999998875    47999999999999 9999999


Q ss_pred             EcCCCC-eeEeeEEEEEECCCC-CCcceEEEEEecCCCCCcCchHHHHHHHHHHHHHhCCCCCCcEEEEeeecCCCC---
Q psy12980        297 KDSNGL-MANDGLEVIVKSPGP-AYYSASFSLTLARPSSTFAQPAAQRAFLEKLAGLFGDRNASKIVNIEFEDNPQE---  371 (609)
Q Consensus       297 TDs~G~-sasd~F~V~V~~~p~-~~~~vef~L~l~~~~~~ft~~~~q~~LVekLA~llGD~d~s~ItVrsI~~~~~~---  371 (609)
                      -++..- ++...+.|.|..... ..+.+||.+.- -+.+.+.....+..|...+..+...+ ..++.|.+|+.. .+   
T Consensus       100 ~Nr~tyet~r~~lvi~i~~~~~~~~~qae~~I~n-lnvEdll~~~~~~~l~~~~r~lW~~~-~~~L~v~~i~SA-ld~Gg  176 (386)
T PF05510_consen  100 LNRDTYETRRQVLVINIGEKEGPTPYQAEFKIDN-LNVEDLLDSERIERLKNIFRKLWPNS-PEDLYVVNITSA-LDRGG  176 (386)
T ss_pred             eecccceeeeEEEEEEeccCCCCCCeEEEEEEcC-CCHHHhcChhHHHHHHHHHHHHhhcc-cCCceEEeeeeh-hhccC
Confidence            998653 555577777766432 56777777652 33456665445677777888888865 479999888642 22   


Q ss_pred             --------CCCcEEEEEEecCCCCCCCCCHHHHHHHHHHHhhccc-hhhHHHHHhhccccee--eEEeeeeccccCC---
Q psy12980        372 --------PGKSTVVRWANRTLASNVECPETTIGLLREVLLTEDD-LLQESLVTSLEKDYGV--KHASVIPTGICEG---  437 (609)
Q Consensus       372 --------~~gSTvV~w~N~TLp~~~~CP~~eI~~L~~kL~~~dg-~~s~~f~~am~peF~V--~~Vs~~g~G~C~~---  437 (609)
                              +.| ++|  .+.|-..|+.|=.+-+.-|.+. .=..+ .+-....+.+.+.|+|  |++++...-.=..   
T Consensus       177 RvPL~~~~~EG-V~V--~vGS~~~FS~~L~~lv~pl~~~-~C~~k~t~v~~~~~~f~p~F~iDWC~f~Lv~~s~~~~~~~  252 (386)
T PF05510_consen  177 RVPLPPNQKEG-VYV--KVGSQAPFSPCLLELVRPLQKL-SCPYKRTSVESCFRNFAPGFQIDWCNFRLVDLSSPEPMPE  252 (386)
T ss_pred             CCCCCccccce-EEE--EecCCCCcCHHHHHHhChHhhc-cCCCCCCcceecccccCCCceeeeeeeEEEccCcCCcCCc
Confidence                    122 222  4555555677766554444444 22222 3333333445677775  4555432222100   


Q ss_pred             CCCC-CC---C-CCCCC-CCCCCCCCCCCCCCccEEEEeehHHHHHHHHHHHHHHhheee
Q psy12980        438 LKTP-LH---T-PGVER-KPAYSGKNTPANASVEYLITFVAPLVVIVIMLLCAALIACLL  491 (609)
Q Consensus       438 ~t~~-~~---~-pe~~~-p~~~~~~~~~~na~~~~Llv~ViPaVIIa~mllia~iIaC~c  491 (609)
                      .... +.   . .+..+ |+  .......+-.++++++++||++|.++++++.+.|-||.
T Consensus       253 ~~~~~~g~gi~~~~~~~~Pp--~~~~p~R~y~~d~~vtl~iPl~i~llL~llLs~Imc~r  310 (386)
T PF05510_consen  253 QPTQPYGSGILEEDGWFNPP--KESVPGRDYFPDFLVTLAIPLIIALLLLLLLSYIMCCR  310 (386)
T ss_pred             ccccccCCCccCCCCCCCCC--cccccccccHHHHHHHHHHHHHHHHHHHHHHHHHheec
Confidence            0000 00   0 00001 11  11112233455677778888877776666666665553


No 7  
>KOG3781|consensus
Probab=98.76  E-value=1.3e-10  Score=126.44  Aligned_cols=288  Identities=21%  Similarity=0.289  Sum_probs=189.2

Q ss_pred             CCcccCcCccceeecCccEEEEcCCCCcccCCCCCeeeEEEEEecccc---eecccCc--cCcCccccccccceeeEEEe
Q psy12980        113 VPLIRNPIDHLEAISGELLVYHVPEDTFFDHEDGGTRKLKLHLMTMDR---YVISNTT--TGQAKPVLTRALGRLMTVKN  187 (609)
Q Consensus       113 ~P~v~~pi~~~~a~e~~~fs~~iP~~tF~D~d~Gdt~~ltlsl~~~d~---~~~~~t~--T~sgtp~~~d~~~~~v~v~~  187 (609)
                      -|++.|++...++++|++|...||.|||.|.+++++.++.+.....||   |..|+..  -..|=|-.  .|...--++.
T Consensus       226 ~P~l~n~~~~~~~~~g~~~~m~IpSdTf~dsed~~~f~~kl~ilsld~aevwv~f~sNskd~~glPms--~gkf~y~lea  303 (559)
T KOG3781|consen  226 TPLLKNQVLRVNCLSGQHCHMHIPSDTFEDSEDITSFIIKLEILSLDGAEVWVKFVSNSKDSIGLPMS--CGKFHYVLEA  303 (559)
T ss_pred             CccccCcceeeeeecchhhheeCCccchhhhhhcccceeeeeeecCCCceeEEEeccCCccccccccc--ccceeeeeee
Confidence            389999999999999999999999999999999999999999999998   9999554  33466654  2333222221


Q ss_pred             eeecccccccCCCCCCceecccCCCCceecCccceeeeeeCCeEEEEcCCCccccCCCCCeeeEEEEEeccCCCCCCCCc
Q psy12980        188 ITWRGIGHCEDSPPPPVVKVKLENQVPLIRNPIDHLEAISGELLVYHVPEDTFFDHEDGGTRKLKLHLMTMDRTTIPPTH  267 (609)
Q Consensus       188 ~~~~~~~~~~~~~~~~tv~v~~~N~aPvv~n~i~~~~at~G~~f~f~IP~dTF~D~eDGdT~~L~lsl~~~Dg~~l~~~s  267 (609)
                      .+-+  |              ..+..|++-|.++...+..+..|.+..|.++|+|-++|..--|..+|..+-|.+.. ..
T Consensus       304 ~dk~--G--------------q~a~~pf~vnv~~~~s~n~~~~f~~~~p~d~~~~~np~~rt~Lv~~La~a~~~~~~-~i  366 (559)
T KOG3781|consen  304 SDKD--G--------------QSATDPFVVNVIPLPSANHLHTFHFPFPPDTFYDQNPGNRTHLVLELASALGYPSG-EI  366 (559)
T ss_pred             ecCC--C--------------CcccCCeEEeeeecCCCccceEEeccCCCchhhhcCCcchhHHHHHHHHhhcCCcc-ch
Confidence            1111  1              24567888888999999999999999999999999999843477777778888877 78


Q ss_pred             cceeccCCceEEeecCCCCceeEEEEEEEEcCCCCeeEeeEEEEEECCCCCCcceEEEEEecCCCCCcCchHHH---HHH
Q psy12980        268 WLQFDAKNQEFYGIPQLTDLGRREYQLVCKDSNGLMANDGLEVIVKSPGPAYYSASFSLTLARPSSTFAQPAAQ---RAF  344 (609)
Q Consensus       268 WL~Fd~~t~tf~G~P~~~dvG~~~f~LtATDs~G~sasd~F~V~V~~~p~~~~~vef~L~l~~~~~~ft~~~~q---~~L  344 (609)
                      ||||.-..+  ||++.+=+--..-|...++|..|+.+.--...+-.. |.-...+-+..++....-+|... .+   ..|
T Consensus       367 tLq~I~~~r--~~~~vsf~nnslp~e~C~s~qI~~va~kmi~~~~~r-pr~~f~~tm~~dfk~kt~s~~k~-gnc~hl~f  442 (559)
T KOG3781|consen  367 TLQFIMTIR--YGTPVSFDNNSLPYEFCDSDQIGLVAQKMITIELGR-PRSSFCHTMGLDFKVKTASFIKR-GNCIHLFF  442 (559)
T ss_pred             hhhhhhhhc--cceeEEeecccCccccCcHHHHHHHhhhhhHHhcCC-ccchhhhhccccceeeEEEeecc-ccccceee
Confidence            999997633  444433322334444477887777775555444433 11222222222222222223221 12   133


Q ss_pred             HHHHHHHhCCCCCCcEEEEeeecCCCCCCCcEEEEEEe-----cCCCCCCCCCHHHHHHHHHHHhhccchhhHHHHHhhc
Q psy12980        345 LEKLAGLFGDRNASKIVNIEFEDNPQEPGKSTVVRWAN-----RTLASNVECPETTIGLLREVLLTEDDLLQESLVTSLE  419 (609)
Q Consensus       345 VekLA~llGD~d~s~ItVrsI~~~~~~~~gSTvV~w~N-----~TLp~~~~CP~~eI~~L~~kL~~~dg~~s~~f~~am~  419 (609)
                      |.-.-+++.-.+...+.|+.+.      .+++++.|.+     ++-.. +.|+-++|-...=++.-++|....-+.+.+.
T Consensus       443 I~ltPD~~l~~~~~~l~l~tl~------t~a~Ii~~~~~lk~Gk~ks~-~~~~~kk~lpIvf~lel~d~~~~pmlvkeek  515 (559)
T KOG3781|consen  443 IKLTPDYLLSSNPPQLVLITLR------TGATIISWICALKKGKSKSL-KWCPNKKILPIVFKLELPDGNVHPMLVKEEK  515 (559)
T ss_pred             eeccchhhhccCCceeeehhhH------hhhHHHHhHHHHhhcccccc-ceeeccCceeEEEEeecCccCCchhHHHhhc
Confidence            3222144444556788888888      7888999988     11112 3577666654444454455666677788777


Q ss_pred             ccceeeEEeee
Q psy12980        420 KDYGVKHASVI  430 (609)
Q Consensus       420 peF~V~~Vs~~  430 (609)
                      +.+++-.+.-.
T Consensus       516 ~plk~~~~~N~  526 (559)
T KOG3781|consen  516 PPLKIPTVENI  526 (559)
T ss_pred             CCCCccccCcc
Confidence            77777665543


No 8  
>PF05345 He_PIG:  Putative Ig domain;  InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=98.31  E-value=1.4e-06  Score=68.18  Aligned_cols=47  Identities=21%  Similarity=0.372  Sum_probs=40.9

Q ss_pred             EEEeccCCCCCCCCccceeccCCceEEeecCCC-CceeEEEEEEEEcCCC
Q psy12980        253 LHLMTMDRTTIPPTHWLQFDAKNQEFYGIPQLT-DLGRREYQLVCKDSNG  301 (609)
Q Consensus       253 lsl~~~Dg~~l~~~sWL~Fd~~t~tf~G~P~~~-dvG~~~f~LtATDs~G  301 (609)
                      +++....+..||  +||.||+.+++|+|.|... ..|.|.|.|+|+|..|
T Consensus         2 ys~~~~~~~~LP--~gLs~d~~tG~isGtp~~~~~~G~y~~~vtatd~~G   49 (49)
T PF05345_consen    2 YSLTTPTGGGLP--SGLSLDPSTGTISGTPTSSVQPGTYTFTVTATDGSG   49 (49)
T ss_pred             EEEEcCCCCCCC--CcEEEeCCCCEEEeecCCCccccEEEEEEEEEcCCC
Confidence            445556677788  9999999999999999988 5699999999999987


No 9  
>KOG4482|consensus
Probab=97.90  E-value=8e-05  Score=79.51  Aligned_cols=205  Identities=15%  Similarity=0.213  Sum_probs=116.9

Q ss_pred             eeeeeeCCeEEEEcCCCccccCCCCCeeeEEEEEeccCCCCCCCCccceeccCC----ceEEeecCCCCceeEEEEEEEE
Q psy12980        222 HLEAISGELLVYHVPEDTFFDHEDGGTRKLKLHLMTMDRTTIPPTHWLQFDAKN----QEFYGIPQLTDLGRREYQLVCK  297 (609)
Q Consensus       222 ~~~at~G~~f~f~IP~dTF~D~eDGdT~~L~lsl~~~Dg~~l~~~sWL~Fd~~t----~tf~G~P~~~dvG~~~f~LtAT  297 (609)
                      .+.+++|..|.++|-..+|.-----.+-..++......--.+|  +||.+=++.    +-+||+|.....|+..+.|+|-
T Consensus        28 ~v~a~vG~lFvhtLe~e~F~~~h~p~~~~v~y~a~L~~yPdlP--~WLRy~~s~~yh~gfLyGtp~ae~~gk~~leVtA~  105 (449)
T KOG4482|consen   28 HVTATVGKLFVHTLESEHFFWTHQPSPVKVKYEATLNGYPDLP--NWLRYLQSRPYHIGFLYGTPTAEVQGKVTLEVTAY  105 (449)
T ss_pred             eeecccceEEEEeccHHHhcCcCcCCcceeEeehhccCCCCch--hHHHhcccCCCCcceEecCCCcccCceeEEEEEEe
Confidence            4689999999999999998832111223456666544333356  999998875    5899999999999999999998


Q ss_pred             cCCCCee-EeeEEEEE--ECCCCCCcceEEEEEecCCCCCcCchHHHHHHHHHHHH-HhCCCCCCcEEEEeeecC-----
Q psy12980        298 DSNGLMA-NDGLEVIV--KSPGPAYYSASFSLTLARPSSTFAQPAAQRAFLEKLAG-LFGDRNASKIVNIEFEDN-----  368 (609)
Q Consensus       298 Ds~G~sa-sd~F~V~V--~~~p~~~~~vef~L~l~~~~~~ft~~~~q~~LVekLA~-llGD~d~s~ItVrsI~~~-----  368 (609)
                      ....--. .--..|++  .......+.+||.+. .-+.+.+......-.|...|.. +-|-.+-+.|-+.+...-     
T Consensus       106 nrdnfdt~r~~vVini~~ae~fptp~v~e~~In-nldveell~s~~rs~l~nalrn~lw~~e~l~~i~~tsAld~ggRvp  184 (449)
T KOG4482|consen  106 NRDNFDTQRRMVVINIVEAELFPTPYVAEFPIN-NLDVEELLSSRDRSQLLNALRNTLWGPEVLPYINGTSALDRGGRVP  184 (449)
T ss_pred             cccchhhhhheEEEEEeccccCCCcceeeeeec-CCCHHHhcCcccHHHHHHHHhccccCcccccceeeeehhhcCCCCC
Confidence            8754322 21233333  333456677788755 2233444432122355555555 333223345554443210     


Q ss_pred             -CCCCCCcEEEEEEecCCCCCCCCCHHHHHHHHHHHhhccchhhHHHHHhhccccee--eEEeee
Q psy12980        369 -PQEPGKSTVVRWANRTLASNVECPETTIGLLREVLLTEDDLLQESLVTSLEKDYGV--KHASVI  430 (609)
Q Consensus       369 -~~~~~gSTvV~w~N~TLp~~~~CP~~eI~~L~~kL~~~dg~~s~~f~~am~peF~V--~~Vs~~  430 (609)
                       +.. +.+-=+-=++.+-..+++|=.+-.+.+...--.....+...-.+.++|.|+|  |+|.+.
T Consensus       185 l~i~-~rkeGv~V~VGs~~~Fsp~l~ev~~~~~~l~c~~~~psv~~~~k~fap~F~vDWCnv~lV  248 (449)
T KOG4482|consen  185 LPIF-KRKEGVYVYVGSQQPFSPCLMEVVHPLQTLRCFQSNPSVVVCNKYFAPTFPVDWCNVALV  248 (449)
T ss_pred             cCcc-cccceeEEEecccCCcchhHHHhhccccccchhccCCceEEhhhhcCCcceeeeeeeEee
Confidence             000 1111122366777778899775544333321222223344455677888886  566653


No 10 
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=96.93  E-value=0.018  Score=55.11  Aligned_cols=162  Identities=15%  Similarity=0.139  Sum_probs=94.2

Q ss_pred             cccCCCCCeeeEEEEEecccc--eecccCccCcC--cccc----ccccceeeEEEeeeecccccccCCCCCCceecccCC
Q psy12980        140 FFDHEDGGTRKLKLHLMTMDR--YVISNTTTGQA--KPVL----TRALGRLMTVKNITWRGIGHCEDSPPPPVVKVKLEN  211 (609)
Q Consensus       140 F~D~d~Gdt~~ltlsl~~~d~--~~~~~t~T~sg--tp~~----~d~~~~~v~v~~~~~~~~~~~~~~~~~~tv~v~~~N  211 (609)
                      -.|.|.|....+++++...+.  |..-+..||.-  +..+    .....+.|.+++.   +. ........++|.|.+.|
T Consensus        21 a~D~D~~~~~~~~y~i~~~~~~~~F~i~~~tG~l~~~~~lD~e~~~~~~l~v~a~D~---g~-~~~~~~~~v~I~V~d~N   96 (199)
T cd00031          21 ATDPDSGENGRVTYSILGGNEDGLFSIDPNTGVITTTKPLDREEQSEYTLTVVASDG---GG-PPLSSTATVTVTVLDVN   96 (199)
T ss_pred             EECCCCCCCceEEEEEeCCCCcccEEEeCCCCEEEECCCCCCcCCceEEEEEEEEEC---Cc-CcceeEEEEEEEEccCC
Confidence            356666643456667666653  44445555531  1111    1123334444432   11 11112234667777888


Q ss_pred             C-CceecCccceeeeeeCC---eEEEEcCCCccccCCCCCeeeEEEEEeccCCCCCCCCccceeccCCceEEee-cCCCC
Q psy12980        212 Q-VPLIRNPIDHLEAISGE---LLVYHVPEDTFFDHEDGGTRKLKLHLMTMDRTTIPPTHWLQFDAKNQEFYGI-PQLTD  286 (609)
Q Consensus       212 ~-aPvv~n~i~~~~at~G~---~f~f~IP~dTF~D~eDGdT~~L~lsl~~~Dg~~l~~~sWL~Fd~~t~tf~G~-P~~~d  286 (609)
                      + +|......-.+.+.++.   ....++-   ..|.+.|....++|++.....     ..|+++|..++.++-. ++.-.
T Consensus        97 d~~P~~~~~~~~~~v~e~~~~~~~i~~~~---a~D~D~~~~~~~~y~l~~~~~-----~~~f~i~~~~G~i~~~~~ld~e  168 (199)
T cd00031          97 DNPPVFEQSSYEASVPENAPPGTVVGTVT---ATDADSGENAKLTYSILSGND-----KELFSIDPNTGIITLAKPLDRE  168 (199)
T ss_pred             CCCCcccccceEEEEeCCCCCCCEEEEEE---EEcCCCCCCccEEEEEeCCCC-----CCEEEEeCCceEEEeCCccCCc
Confidence            7 67776655566665553   2223332   677766533478888865433     3799999988877766 34333


Q ss_pred             c-eeEEEEEEEEcCC--CCeeEeeEEEEEE
Q psy12980        287 L-GRREYQLVCKDSN--GLMANDGLEVIVK  313 (609)
Q Consensus       287 v-G~~~f~LtATDs~--G~sasd~F~V~V~  313 (609)
                      . +.|.+.|+|+|.+  +.+.+-.+.|.|.
T Consensus       169 ~~~~~~l~v~a~D~~~~~~~~~~~i~i~v~  198 (199)
T cd00031         169 EKSSYELTVVATDGGGPPLSSTATVTVTVL  198 (199)
T ss_pred             cCceEEEEEEEEECCCCCceeEEEEEEEEE
Confidence            3 5999999999998  6777777777664


No 11 
>PF12877 DUF3827:  Domain of unknown function (DUF3827);  InterPro: IPR024606 The function of the proteins in this entry is not currently known, but one of the human proteins (Q9HCM3 from SWISSPROT) has been implicated in pilocytic astrocytomas [, , ]. In the majority of cases of pilocytic astrocytomas a tandem duplication produces an in-frame fusion of the gene encoding this protein and the BRAF oncogene. The resulting fusion protein has constitutive BRAF kinase activity and is capable of transforming cells. 
Probab=95.83  E-value=0.064  Score=61.52  Aligned_cols=78  Identities=18%  Similarity=0.208  Sum_probs=50.1

Q ss_pred             HHHHHHHHHHhCCC-------------CCCcEEEEeeecCCCC-CCCcEEEEEEecCCCCCCCCCHHHHHHHHHHHhhcc
Q psy12980        342 RAFLEKLAGLFGDR-------------NASKIVNIEFEDNPQE-PGKSTVVRWANRTLASNVECPETTIGLLREVLLTED  407 (609)
Q Consensus       342 ~~LVekLA~llGD~-------------d~s~ItVrsI~~~~~~-~~gSTvV~w~N~TLp~~~~CP~~eI~~L~~kL~~~d  407 (609)
                      .++-+.||++|+..             +...+.+.+++  ... +++-+-++||...... ++.++..-+++++.|..+ 
T Consensus       161 r~laqll~e~~~~~~~~~~R~rRattvGn~tVQmV~~s--Rl~G~~nP~EL~YyV~~~~G-~pl~a~~AA~~Ln~ld~Q-  236 (684)
T PF12877_consen  161 RRLAQLLAELFGVAQRQRRRFRRATTVGNYTVQMVNMS--RLEGPDNPVELTYYVEGQNG-KPLPAVTAAKDLNLLDSQ-  236 (684)
T ss_pred             HHHHHHHHHHhhhhhcccceeeccccccceEEEEEEee--eccCCCCceEEEEEEEcCCC-cCCcHHHHHHHHhccCHH-
Confidence            34445667777665             22344444444  222 1334899999987777 899999999999988755 


Q ss_pred             chhhHHHHHhhcccceeeEEeee
Q psy12980        408 DLLQESLVTSLEKDYGVKHASVI  430 (609)
Q Consensus       408 g~~s~~f~~am~peF~V~~Vs~~  430 (609)
                           +.  ++...|.|..+--+
T Consensus       237 -----~~--Al~LGy~V~~~~Aq  252 (684)
T PF12877_consen  237 -----RM--ALILGYRVQGIVAQ  252 (684)
T ss_pred             -----HH--HHhcCceecccccc
Confidence                 22  45567777665443


No 12 
>PF15102 TMEM154:  TMEM154 protein family
Probab=95.50  E-value=0.026  Score=53.85  Aligned_cols=37  Identities=24%  Similarity=0.585  Sum_probs=27.8

Q ss_pred             CccEEEEeehHHHHHHHHHHHHHHhheeeecccCCCC
Q psy12980        463 SVEYLITFVAPLVVIVIMLLCAALIACLLYRRRHTGK  499 (609)
Q Consensus       463 ~~~~Llv~ViPaVIIa~mllia~iIaC~cyRrKRkgK  499 (609)
                      ....++..+||+|+++++||++.+++.+|+|||+|++
T Consensus        54 q~efiLmIlIP~VLLvlLLl~vV~lv~~~kRkr~K~~   90 (146)
T PF15102_consen   54 QLEFILMILIPLVLLVLLLLSVVCLVIYYKRKRTKQE   90 (146)
T ss_pred             CcceEEEEeHHHHHHHHHHHHHHHheeEEeecccCCC
Confidence            3344666889998888888888777777777777775


No 13 
>PF10873 DUF2668:  Protein of unknown function (DUF2668);  InterPro: IPR022640  Members in this family of proteins are annotated as cysteine and tyrosine-rich protein 1, however currently no function is known []. 
Probab=95.44  E-value=0.03  Score=53.21  Aligned_cols=22  Identities=18%  Similarity=0.383  Sum_probs=15.9

Q ss_pred             HHHHHHHHHHHHHhheeeeccc
Q psy12980        474 LVVIVIMLLCAALIACLLYRRR  495 (609)
Q Consensus       474 aVIIa~mllia~iIaC~cyRrK  495 (609)
                      +-||.+|..+|+|++|+|+.-|
T Consensus        68 fgiVfimgvva~i~icvCmc~k   89 (155)
T PF10873_consen   68 FGIVFIMGVVAGIAICVCMCMK   89 (155)
T ss_pred             hhhHHHHHHHHHHHHHHhhhhh
Confidence            4556667788988888777754


No 14 
>PF13908 Shisa:  Wnt and FGF inhibitory regulator
Probab=95.30  E-value=0.069  Score=52.19  Aligned_cols=9  Identities=33%  Similarity=0.017  Sum_probs=3.4

Q ss_pred             EEeehHHHH
Q psy12980        468 ITFVAPLVV  476 (609)
Q Consensus       468 lv~ViPaVI  476 (609)
                      ++.||-+||
T Consensus        81 ivgvi~~Vi   89 (179)
T PF13908_consen   81 IVGVICGVI   89 (179)
T ss_pred             eeehhhHHH
Confidence            333443333


No 15 
>PF12273 RCR:  Chitin synthesis regulation, resistance to Congo red;  InterPro: IPR020999  RCR proteins are ER membrane proteins that regulate chitin deposition in fungal cell walls. Although chitin, a linear polymer of beta-1,4-linked N-acetylglucosamine, constitutes only 2% of the cell wall it plays a vital role in the overall protection of the cell wall against stress, noxious chemicals and osmotic pressure changes. Congo red is a cell wall-disrupting benzidine-type dye extensively used in many cell wall mutant studies that specifically targets chitin in yeast cells and inhibits growth. RCR proteins render the yeasts resistant to Congo red by diminishing the content of chitin in the cell wall []. RCR proteins are probably regulating chitin synthase III interact directly with ubiquitin ligase Rsp5, and the VPEY motif is necessary for this, via interaction with the WW domains of Rsp5 []. 
Probab=93.81  E-value=0.031  Score=52.02  Aligned_cols=27  Identities=37%  Similarity=0.536  Sum_probs=15.0

Q ss_pred             HHHHHHHHHHHHHhheeeecccCCCCc
Q psy12980        474 LVVIVIMLLCAALIACLLYRRRHTGKM  500 (609)
Q Consensus       474 aVIIa~mllia~iIaC~cyRrKRkgK~  500 (609)
                      +|||++++|+..+++|+-+||||||--
T Consensus         6 ~iii~~i~l~~~~~~~~~rRR~r~G~~   32 (130)
T PF12273_consen    6 AIIIVAILLFLFLFYCHNRRRRRRGLQ   32 (130)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHhhcCCC
Confidence            455555555555555555566666544


No 16 
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=92.95  E-value=0.91  Score=37.47  Aligned_cols=68  Identities=16%  Similarity=0.255  Sum_probs=48.8

Q ss_pred             cCCCCCeeeEEEEEeccCCCCCCCCccceeccCCceEEee-cCCCCc-eeEEEEEEEEcCCC--CeeEeeEEEEEEC
Q psy12980        242 DHEDGGTRKLKLHLMTMDRTTIPPTHWLQFDAKNQEFYGI-PQLTDL-GRREYQLVCKDSNG--LMANDGLEVIVKS  314 (609)
Q Consensus       242 D~eDGdT~~L~lsl~~~Dg~~l~~~sWL~Fd~~t~tf~G~-P~~~dv-G~~~f~LtATDs~G--~sasd~F~V~V~~  314 (609)
                      |++.|....++|++...+..     .|+..|..++.+.-. ++.-.. ..|.+.|.|+|.++  .++...+.|.|..
T Consensus         2 D~D~g~n~~i~Y~i~~~~~~-----~~F~i~~~tg~i~~~~~LD~e~~~~y~l~v~a~D~~~~~~~~~~~v~I~V~D   73 (79)
T smart00112        2 DADSGENGKVTYSILSGNED-----GLFSIDPETGEITTTKPLDREEQPEYTLTVEATDGGGPPLSSTATVTVTVLD   73 (79)
T ss_pred             CCCCCcCcEEEEEEecCCCC-----CEEEEeCCccEEEeCCccCeeCCCeEEEEEEEEECCCCCcccEEEEEEEEEE
Confidence            56666434678888644322     789999998855543 554443 68999999999987  7888888888865


No 17 
>PF00028 Cadherin:  Cadherin domain;  InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=92.55  E-value=1.6  Score=37.20  Aligned_cols=68  Identities=12%  Similarity=0.120  Sum_probs=46.8

Q ss_pred             cccCCCCCeeeEEEEEeccCCCCCCCCccceeccCCceEEee-cCCC-CceeEEEEEEEEcC-CCCeeEeeEEEEE
Q psy12980        240 FFDHEDGGTRKLKLHLMTMDRTTIPPTHWLQFDAKNQEFYGI-PQLT-DLGRREYQLVCKDS-NGLMANDGLEVIV  312 (609)
Q Consensus       240 F~D~eDGdT~~L~lsl~~~Dg~~l~~~sWL~Fd~~t~tf~G~-P~~~-dvG~~~f~LtATDs-~G~sasd~F~V~V  312 (609)
                      -.|+|.|....+.+++...+.     ..|...|+.+++++=. ++.- ..+.|.+.+.|+|. ++..-+.+..|.|
T Consensus        20 a~D~D~~~n~~i~y~i~~~~~-----~~~F~I~~~tg~i~~~~~LD~E~~~~y~l~v~a~D~~~~~~~~~~~~V~I   90 (93)
T PF00028_consen   20 ATDPDSGPNSQITYSILGGNP-----DGLFSIDPNTGEISLKKPLDRETQSSYQLTVRATDSGGSPPLSSTATVTI   90 (93)
T ss_dssp             EEESSTSTTSSEEEEEEETTS-----TTSEEEETTTTEEEESSSSCTTTTSEEEEEEEEEETTTSSEEEEEEEEEE
T ss_pred             EEeCCCCCCceEEEEEecCcc-----cCceEEeeeeeccccceecCcccCCEEEEEEEEEECCCCCCCEEEEEEEE
Confidence            356655555578888875443     3799999999886654 4433 34899999999999 5555555555554


No 18 
>PF02439 Adeno_E3_CR2:  Adenovirus E3 region protein CR2;  InterPro: IPR003470 Early region 3 (E3) of human adenoviruses (Ads) codes for proteins that appear to control viral interactions with the host []. This region called CR1 (conserved region 1) [] is found three times in Human adenovirus 19 (a subgroup D adenovirus) 49 kDa protein in the E3 region. CR1 is also found in the 20.1 Kd protein of subgroup B adenoviruses. The function of this 80 amino acid region is unknown. This region is probably a divergent immunoglobulin domain.
Probab=91.64  E-value=0.13  Score=38.48  Aligned_cols=11  Identities=27%  Similarity=0.537  Sum_probs=5.9

Q ss_pred             eeecccCCCCc
Q psy12980        490 LLYRRRHTGKM  500 (609)
Q Consensus       490 ~cyRrKRkgK~  500 (609)
                      |||||.|.-|+
T Consensus        28 Ccykk~~~~k~   38 (38)
T PF02439_consen   28 CCYKKHRRQKM   38 (38)
T ss_pred             HHHcccccccC
Confidence            56666553453


No 19 
>PF04478 Mid2:  Mid2 like cell wall stress sensor;  InterPro: IPR007567 This family represents a region near the C terminus of Mid2, which contains a transmembrane region. The remainder of the protein sequence is serine-rich and of low complexity, and is therefore impossible to align accurately. Mid2 is thought to act as a mechanosensor of cell wall stress. The C-terminal cytoplasmic region of Mid2 is known to interact with Rom2, a guanine nucleotide exchange factor (GEF) for Rho1, which is part of the cell wall integrity signalling pathway [].
Probab=90.22  E-value=0.19  Score=48.45  Aligned_cols=31  Identities=29%  Similarity=0.178  Sum_probs=23.7

Q ss_pred             EEEeehHHHHHHHHHHHHHHhheeeecccCC
Q psy12980        467 LITFVAPLVVIVIMLLCAALIACLLYRRRHT  497 (609)
Q Consensus       467 Llv~ViPaVIIa~mllia~iIaC~cyRrKRk  497 (609)
                      .++=|+.+|-+++||++++|+.|||+||||+
T Consensus        50 IVIGvVVGVGg~ill~il~lvf~~c~r~kkt   80 (154)
T PF04478_consen   50 IVIGVVVGVGGPILLGILALVFIFCIRRKKT   80 (154)
T ss_pred             EEEEEEecccHHHHHHHHHhheeEEEecccC
Confidence            3445566777777788888999999998874


No 20 
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=89.92  E-value=3.2  Score=39.60  Aligned_cols=70  Identities=11%  Similarity=0.158  Sum_probs=48.9

Q ss_pred             cccCCCCCeeeEEEEEeccCCCCCCCCccceeccCCceEEeec-C-CCCceeEEEEEEEEcCCCCe--eEeeEEEEEEC
Q psy12980        240 FFDHEDGGTRKLKLHLMTMDRTTIPPTHWLQFDAKNQEFYGIP-Q-LTDLGRREYQLVCKDSNGLM--ANDGLEVIVKS  314 (609)
Q Consensus       240 F~D~eDGdT~~L~lsl~~~Dg~~l~~~sWL~Fd~~t~tf~G~P-~-~~dvG~~~f~LtATDs~G~s--asd~F~V~V~~  314 (609)
                      -.|++.|....++|++...+..     .|+..|..++.++-.. + ....+.|.+.+.|+|+++..  +.-.+.|.|..
T Consensus        21 a~D~D~~~~~~~~y~i~~~~~~-----~~F~i~~~tG~l~~~~~lD~e~~~~~~l~v~a~D~g~~~~~~~~~v~I~V~d   94 (199)
T cd00031          21 ATDPDSGENGRVTYSILGGNED-----GLFSIDPNTGVITTTKPLDREEQSEYTLTVVASDGGGPPLSSTATVTVTVLD   94 (199)
T ss_pred             EECCCCCCCceEEEEEeCCCCc-----ccEEEeCCCCEEEECCCCCCcCCceEEEEEEEEECCcCcceeEEEEEEEEcc
Confidence            4566555323788877644322     6999999877666553 3 33348999999999987766  77788888865


No 21 
>PF07495 Y_Y_Y:  Y_Y_Y domain;  InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=89.23  E-value=2.5  Score=33.83  Aligned_cols=44  Identities=18%  Similarity=0.397  Sum_probs=29.9

Q ss_pred             ccceeccCCceEEeecCCCCceeEEEEEEEEcCCCCeeEe--eEEEEE
Q psy12980        267 HWLQFDAKNQEFYGIPQLTDLGRREYQLVCKDSNGLMAND--GLEVIV  312 (609)
Q Consensus       267 sWL~Fd~~t~tf~G~P~~~dvG~~~f~LtATDs~G~sasd--~F~V~V  312 (609)
                      .|......+..+.=.++..  |+|+|+|.|.|..|....+  ++.|+|
T Consensus        20 ~W~~~~~~~~~~~~~~L~~--G~Y~l~V~a~~~~~~~~~~~~~l~i~I   65 (66)
T PF07495_consen   20 EWITLGSYSNSISYTNLPP--GKYTLEVRAKDNNGKWSSDEKSLTITI   65 (66)
T ss_dssp             SEEEESSTS-EEEEES--S--EEEEEEEEEEETTS-B-SS-EEEEEEE
T ss_pred             eEEECCCCcEEEEEEeCCC--EEEEEEEEEECCCCCcCcccEEEEEEE
Confidence            7998877654555555555  9999999999999998776  455554


No 22 
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Probab=88.60  E-value=0.96  Score=37.67  Aligned_cols=27  Identities=19%  Similarity=0.241  Sum_probs=24.0

Q ss_pred             CceeEEEEEEEEcCCCCeeEeeEEEEE
Q psy12980        286 DLGRREYQLVCKDSNGLMANDGLEVIV  312 (609)
Q Consensus       286 dvG~~~f~LtATDs~G~sasd~F~V~V  312 (609)
                      ..|.|.+.|+|+|..|.+.+++..|.|
T Consensus        55 ~~G~y~v~l~v~d~~g~~~~~~~~V~V   81 (81)
T cd00146          55 KPGTYTVTLTVTNAVGSSSTKTTTVVV   81 (81)
T ss_pred             CCcEEEEEEEEEeCCCCEEEEEEEEEC
Confidence            349999999999999999998888875


No 23 
>PF15050 SCIMP:  SCIMP protein
Probab=88.53  E-value=0.46  Score=44.10  Aligned_cols=28  Identities=18%  Similarity=0.366  Sum_probs=16.9

Q ss_pred             HHHHHHHHHHH-HHHhheeeecccCCCCc
Q psy12980        473 PLVVIVIMLLC-AALIACLLYRRRHTGKM  500 (609)
Q Consensus       473 PaVIIa~mlli-a~iIaC~cyRrKRkgK~  500 (609)
                      -||-|+++-++ +-|..|+|+|..|.||.
T Consensus        12 LAVaII~vS~~lglIlyCvcR~~lRqGkk   40 (133)
T PF15050_consen   12 LAVAIILVSVVLGLILYCVCRWQLRQGKK   40 (133)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHcccc
Confidence            34444444444 44466778877788887


No 24 
>PF15418 DUF4625:  Domain of unknown function (DUF4625)
Probab=87.19  E-value=4.3  Score=38.38  Aligned_cols=86  Identities=16%  Similarity=0.304  Sum_probs=54.6

Q ss_pred             cceeeeeeCCeEEEEcCCCccccCCCCCeeeEEEEE------eccCCCCCC--CCccceeccCC--------ceEEe---
Q psy12980        220 IDHLEAISGELLVYHVPEDTFFDHEDGGTRKLKLHL------MTMDRTTIP--PTHWLQFDAKN--------QEFYG---  280 (609)
Q Consensus       220 i~~~~at~G~~f~f~IP~dTF~D~eDGdT~~L~lsl------~~~Dg~~l~--~~sWL~Fd~~t--------~tf~G---  280 (609)
                      -..+.+..|.-|.|..   .|.|...=.  .+++.+      -. ++....  ...|. |+..=        -+|.=   
T Consensus        27 ~~~~~~~~G~~ihfe~---~i~d~~~i~--si~VeIH~nfd~H~-h~~~~~~~~~~~~-~~~~~~~~~g~~~~~~h~~i~   99 (132)
T PF15418_consen   27 ENCKVATRGDDIHFEA---DISDNSAIK--SIKVEIHNNFDHHT-HSTEAGECEKPWV-FEQDYDIYGGKKNYDFHEHID   99 (132)
T ss_pred             CCCeEEecCCcEEEEE---EEEccccee--EEEEEEecCcCccc-ccccccccccCcE-EEEEEcccCCcccEeEEEeee
Confidence            4567888888776653   255533333  555555      21 221111  33676 55321        12222   


Q ss_pred             ecCCCCceeEEEEEEEEcCCCCeeEeeEEEEE
Q psy12980        281 IPQLTDLGRREYQLVCKDSNGLMANDGLEVIV  312 (609)
Q Consensus       281 ~P~~~dvG~~~f~LtATDs~G~sasd~F~V~V  312 (609)
                      +|.+...|.|.|.+++||..|......-.|.|
T Consensus       100 IPa~a~~G~YH~~i~VtD~~Gn~~~~~~~i~I  131 (132)
T PF15418_consen  100 IPADAPAGDYHFMITVTDAAGNQTEEERSIKI  131 (132)
T ss_pred             CCCCCCCcceEEEEEEEECCCCEEEEEEEEEE
Confidence            58888889999999999999999988888776


No 25 
>PF01102 Glycophorin_A:  Glycophorin A;  InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Glycophorin A (PAS-2) and glycophorin B (PAS-3) belong to the MNS blood group system and are associated with antigens that include M/N, S/s, U, He, Mi(a), M(c), Vw, Mur, M(g), Vr, M(e), Mt(a), St(a), Ri(a), Cl(a), Ny(a), Hut, Hil, M(v), Far, Mit, Dantu, Hop, Nob, En(a), ENKT, amongst others. Glycophorin A is the major sialoglycoprotein of the erythrocyte membrane []. Structurally, glycophorin A consists of an N-terminal extracellular domain, heavily glycosylated on serine and threonine residues, followed by a transmembrane region and a C-terminal cytoplasmic domain. Other glycophorins in this entry such as Glycophorin B and Glycophorin E represent minor sialoglycoproteins in the erythrocyte membrane.; GO: 0016021 integral to membrane; PDB: 2KPF_B 1AFO_B 2KPE_A.
Probab=86.36  E-value=0.38  Score=44.89  Aligned_cols=28  Identities=14%  Similarity=0.128  Sum_probs=13.6

Q ss_pred             ehHHHHHHHHHHHHHHhh---eeeecccCCC
Q psy12980        471 VAPLVVIVIMLLCAALIA---CLLYRRRHTG  498 (609)
Q Consensus       471 ViPaVIIa~mllia~iIa---C~cyRrKRkg  498 (609)
                      ++-++|+++|++++++|+   .||+|||+|.
T Consensus        65 ~i~~Ii~gv~aGvIg~Illi~y~irR~~Kk~   95 (122)
T PF01102_consen   65 AIIGIIFGVMAGVIGIILLISYCIRRLRKKS   95 (122)
T ss_dssp             CHHHHHHHHHHHHHHHHHHHHHHHHHHS---
T ss_pred             ceeehhHHHHHHHHHHHHHHHHHHHHHhccC
Confidence            455666666666555544   3444444443


No 26 
>PF14991 MLANA:  Protein melan-A; PDB: 2GTZ_F 2GT9_F 3MRO_P 2GUO_C 3MRQ_P 2GTW_C 3L6F_C 3MRP_P.
Probab=84.23  E-value=0.32  Score=44.70  Aligned_cols=9  Identities=33%  Similarity=0.744  Sum_probs=0.9

Q ss_pred             CCCCCCCCC
Q psy12980        548 LPPPEYNRD  556 (609)
Q Consensus       548 lpppey~~~  556 (609)
                      =.||.|-+-
T Consensus        99 nAPPAYeKi  107 (118)
T PF14991_consen   99 NAPPAYEKI  107 (118)
T ss_dssp             ----B----
T ss_pred             CCCchhhhc
Confidence            348888865


No 27 
>PF12245 Big_3_2:  Bacterial Ig-like domain (group 3);  InterPro: IPR022038  This family of proteins is found in bacteria. They have two conserved sequence motifs: AGN and GMT. 
Probab=81.87  E-value=3.5  Score=33.52  Aligned_cols=40  Identities=13%  Similarity=0.145  Sum_probs=31.7

Q ss_pred             CCceEEeecCCCCceeEEEEEEEEcCCCCeeEeeEEEEEE
Q psy12980        274 KNQEFYGIPQLTDLGRREYQLVCKDSNGLMANDGLEVIVK  313 (609)
Q Consensus       274 ~t~tf~G~P~~~dvG~~~f~LtATDs~G~sasd~F~V~V~  313 (609)
                      ....-+++|.....|.|.+.++|+|..|...+..-...+.
T Consensus         9 ~~~~~~~~P~~~~dg~yt~~v~a~D~AGN~~~~~~~~~i~   48 (60)
T PF12245_consen    9 SGVWSTVIPENDADGEYTLTVTATDKAGNTSSSTTQIVIV   48 (60)
T ss_pred             ccceeccccCccCCccEEEEEEEEECCCCEEEeeeEEEEE
Confidence            3445678899977799999999999999988766555543


No 28 
>PF01299 Lamp:  Lysosome-associated membrane glycoprotein (Lamp);  InterPro: IPR002000 Lysosome-associated membrane glycoproteins (lamp) [] are integral membrane proteins, specific to lysosomes, and whose exact biological function is not yet clear. Structurally, the lamp proteins consist of two internally homologous lysosome-luminal domains separated by a proline-rich hinge region; at the C-terminal extremity there is a transmembrane region (TM) followed by a very short cytoplasmic tail (C). In each of the duplicated domains, there are two conserved disulphide bonds. This structure is schematically represented in the figure below.   +-----+ +-----+ +-----+ +-----+ | | | | | | | | xCxxxxxCxxxxxxxxxxxxCxxxxxCxxxxxxxxxCxxxxxCxxxxxxxxxxxxCxxxxxCxxxxxxxx +--------------------------++Hinge++--------------------------++TM++C+  In mammals, there are two closely related types of lamp: lamp-1 and lamp-2, which form major components of the lysosome membrane. In chicken lamp-1 is known as LEP100.  Also included in this entry is the macrophage protein CD68 (or macrosialin) [] is a heavily glycosylated integral membrane protein whose structure consists of a mucin-like domain followed by a proline-rich hinge; a single lamp-like domain; a transmembrane region and a short cytoplasmic tail.   Similar to CD68, mammalian lamp-3, which is expressed in lymphoid organs, dendritic cells and in lung, contains all the C-terminal regions but lacks the N-terminal lamp-like region []. In a lamp-family protein from nematodes [] only the part C-terminal to the hinge is conserved. ; GO: 0016020 membrane
Probab=77.77  E-value=2.2  Score=45.31  Aligned_cols=61  Identities=23%  Similarity=0.267  Sum_probs=33.1

Q ss_pred             ceeeEEeeeeccccCCCCCCCCCCCCCCCCCCCCCCCCCCCCccEEEEeehHHHHHHHH--HHHHHHhheeeecccCCCC
Q psy12980        422 YGVKHASVIPTGICEGLKTPLHTPGVERKPAYSGKNTPANASVEYLITFVAPLVVIVIM--LLCAALIACLLYRRRHTGK  499 (609)
Q Consensus       422 F~V~~Vs~~g~G~C~~~t~~~~~pe~~~p~~~~~~~~~~na~~~~Llv~ViPaVIIa~m--llia~iIaC~cyRrKRkgK  499 (609)
                      +.+.++.+++.+.=.+. +              +  ....|..|.- -.+||.+|=++|  |+|..|||+++.|||++.-
T Consensus       241 l~~~~lqvQaF~~~~~~-F--------------g--~a~~C~~D~~-~~~vPIaVG~~La~lvlivLiaYli~Rrr~~~g  302 (306)
T PF01299_consen  241 LNLSDLQVQAFRVKNNT-F--------------G--TAEECSSDDT-SDLVPIAVGAALAGLVLIVLIAYLIGRRRSRAG  302 (306)
T ss_pred             EEEeEEEEEEEEecCCC-C--------------C--ChhcCCcCCc-cchHHHHHHHHHHHHHHHHHHhheeEecccccc
Confidence            67888888876553222 1              1  1123543332 245555443332  3355678999999886543


Q ss_pred             c
Q psy12980        500 M  500 (609)
Q Consensus       500 ~  500 (609)
                      |
T Consensus       303 Y  303 (306)
T PF01299_consen  303 Y  303 (306)
T ss_pred             c
Confidence            3


No 29 
>PF11669 WBP-1:  WW domain-binding protein 1;  InterPro: IPR021684  This family of proteins represents WBP-1, a ligand of the WW domain of Yes-associated protein. This protein has a proline-rich domain. WBP-1 does not bind to the SH3 domain []. 
Probab=75.30  E-value=2.6  Score=38.16  Aligned_cols=19  Identities=26%  Similarity=0.670  Sum_probs=9.8

Q ss_pred             CCeeccCCCCCCCCCCCCCC
Q psy12980        537 SPIILKEEKPPLPPPEYNRD  556 (609)
Q Consensus       537 ~P~IlkeEkpplpppey~~~  556 (609)
                      .|+-++- -+.-+||.|...
T Consensus        72 ~p~~~~~-~~~~~pP~Y~ev   90 (102)
T PF11669_consen   72 APLDIRV-LSPYKPPSYSEV   90 (102)
T ss_pred             Ccccccc-CCCCCCCCcHHh
Confidence            3444433 334467788743


No 30 
>PF01835 A2M_N:  MG2 domain;  InterPro: IPR002890 The proteinase-binding alpha-macroglobulins (A2M) [] are large glycoproteins found in the plasma of vertebrates, in the hemolymph of some invertebrates and in reptilian and avian egg white. A2M-like proteins are able to inhibit all four classes of proteinases by a 'trapping' mechanism. They have a peptide stretch, called the 'bait region', which contains specific cleavage sites for different proteinases. When a proteinase cleaves the bait region, a conformational change is induced in the protein, thus trapping the proteinase. The entrapped enzyme remains active against low molecular weight substrates, whilst its activity toward larger substrates is greatly reduced, due to steric hindrance. Following cleavage in the bait region, a thiol ester bond, formed between the side chains of a cysteine and a glutamine, is cleaved and mediates the covalent binding of the A2M-like protein to the proteinase. This family includes the N-terminal region of the alpha-2-macroglobulin family. The inhibitor domains belong to MEROPS inhibitor family I39.; GO: 0004866 endopeptidase inhibitor activity; PDB: 2B39_B 3KLS_B 3PRX_C 3KM9_B 3PVM_C 3CU7_A 4E0S_A 4A5W_A 4ACQ_C 2P9R_B ....
Probab=75.12  E-value=7.4  Score=33.76  Aligned_cols=59  Identities=17%  Similarity=0.278  Sum_probs=43.6

Q ss_pred             eEEEEEeccCCCCCCCCccce-eccCCceEEee---cCCCCceeEEEEEEEEcCCCCeeEeeEEE
Q psy12980        250 KLKLHLMTMDRTTIPPTHWLQ-FDAKNQEFYGI---PQLTDLGRREYQLVCKDSNGLMANDGLEV  310 (609)
Q Consensus       250 ~L~lsl~~~Dg~~l~~~sWL~-Fd~~t~tf~G~---P~~~dvG~~~f~LtATDs~G~sasd~F~V  310 (609)
                      .+++.+...+|..+.  .|.. -....+.+++.   |.+...|.|.+.+...+.++...+.+|.|
T Consensus        37 ~~~v~i~dp~g~~v~--~~~~~~~~~~G~~~~~~~lp~~~~~G~y~i~~~~~~~~~~~~~~~F~V   99 (99)
T PF01835_consen   37 PVTVTIKDPSGNEVF--RWSVNTTNENGIFSGSFQLPDDAPLGTYTIRVKTDDDGGQSFSKTFQV   99 (99)
T ss_dssp             EEEEEEEETTSEEEE--EEEEEETTCTTEEEEEEE--SS---EEEEEEEEETTTTCEEEEEEEEE
T ss_pred             ceEEEEECCCCCEEE--EEEeeeeCCCCEEEEEEECCCCCCCEeEEEEEEEccCCCCEEEEEEEC
Confidence            677788877777765  7776 44567777774   66666799999999988889999999986


No 31 
>PF13750 Big_3_3:  Bacterial Ig-like domain (group 3)
Probab=74.58  E-value=25  Score=34.03  Aligned_cols=34  Identities=15%  Similarity=0.229  Sum_probs=28.3

Q ss_pred             cCCCCceeEEEEEEEEcCCCCeeEeeEEEEEECC
Q psy12980        282 PQLTDLGRREYQLVCKDSNGLMANDGLEVIVKSP  315 (609)
Q Consensus       282 P~~~dvG~~~f~LtATDs~G~sasd~F~V~V~~~  315 (609)
                      |.-..-+.|...+.|+|..|.....+......|.
T Consensus       117 psle~~~~YtLtV~a~D~aGN~~~~si~F~y~P~  150 (158)
T PF13750_consen  117 PSLEADDSYTLTVSATDKAGNQSTKSISFSYMPP  150 (158)
T ss_pred             CCcCCCCeEEEEEEEEecCCCEEEEEEEEEEeCC
Confidence            4333448999999999999999999999888864


No 32 
>PF02494 HYR:  HYR domain;  InterPro: IPR003410 This domain is known as the HYR (Hyalin Repeat) domain, after the protein hyalin that is composed exclusively of this repeat. This domain probably corresponds to a new superfamily in the immunoglobulin fold. The function of this domain is uncertain it may be involved in cell adhesion. In the Sushi repeat-containing protein (SrpX), this domain is found between two sushi repeats.
Probab=73.48  E-value=4.9  Score=33.91  Aligned_cols=26  Identities=23%  Similarity=0.300  Sum_probs=24.4

Q ss_pred             ceeEEEEEEEEcCCCCeeEeeEEEEE
Q psy12980        287 LGRREYQLVCKDSNGLMANDGLEVIV  312 (609)
Q Consensus       287 vG~~~f~LtATDs~G~sasd~F~V~V  312 (609)
                      +|.|.+..+|+|..|.++.=.|.|.|
T Consensus        56 ~G~t~V~ytA~D~~GN~a~C~f~V~V   81 (81)
T PF02494_consen   56 VGTTTVTYTATDAAGNSATCSFTVTV   81 (81)
T ss_pred             eceEEEEEEEEECCCCEEEEEEEEEC
Confidence            59999999999999999999999986


No 33 
>PF05478 Prominin:  Prominin;  InterPro: IPR008795 The prominins are an emerging family of proteins that, among the multispan membrane proteins, display a novel topology. Mouse and Homo sapiens prominin and (Mus musculus) prominin-like 1 (PROML1) are predicted to contain five membrane spanning domains, with an N-terminal domain exposed to the extracellular space followed by four, alternating small cytoplasmic and large extracellular, loops and a cytoplasmic C-terminal domain []. The exact function of prominin is unknown although in humans defects in PROM1, the gene coding for prominin, cause retinal degeneration [].; GO: 0016021 integral to membrane
Probab=72.83  E-value=1.4  Score=52.75  Aligned_cols=33  Identities=15%  Similarity=0.208  Sum_probs=21.7

Q ss_pred             eehHHHHHHHHHHHHHHhheeeecccC-CCCccc
Q psy12980        470 FVAPLVVIVIMLLCAALIACLLYRRRH-TGKMSV  502 (609)
Q Consensus       470 ~ViPaVIIa~mllia~iIaC~cyRrKR-kgK~~i  502 (609)
                      .++.++++++++-+.|+|.|||++++| .|||.-
T Consensus        96 ~~~i~ll~~il~P~vg~~fCcCRCc~~CGg~~~~  129 (806)
T PF05478_consen   96 CAVIGLLFIILMPLVGLCFCCCRCCGNCGGRMHQ  129 (806)
T ss_pred             HHHHHHHHHHHHHHHHHHHhccccCCCcCCcccc
Confidence            333355555555577899999976665 788843


No 34 
>PF06667 PspB:  Phage shock protein B;  InterPro: IPR009554 This family consists of several bacterial phage shock protein B (PspB) sequences. The phage shock protein (psp) operon is induced in response to heat, ethanol, osmotic shock and infection by filamentous bacteriophages []. Expression of the operon requires the alternative sigma factor sigma54 and the transcriptional activator PspF. In addition, PspA plays a negative regulatory role, and the integral-membrane proteins PspB and PspC play a positive one [].; GO: 0006355 regulation of transcription, DNA-dependent, 0009271 phage shock
Probab=72.60  E-value=1.3  Score=38.01  Aligned_cols=31  Identities=23%  Similarity=0.365  Sum_probs=21.3

Q ss_pred             hHHHHHHHHHHHHHHhheeeecccCCCCccc
Q psy12980        472 APLVVIVIMLLCAALIACLLYRRRHTGKMSV  502 (609)
Q Consensus       472 iPaVIIa~mllia~iIaC~cyRrKRkgK~~i  502 (609)
                      +-..+|++|+++|-+|.++-||.|++.--.+
T Consensus         6 l~~plivf~ifVap~WL~lHY~sk~~~~~gL   36 (75)
T PF06667_consen    6 LFVPLIVFMIFVAPIWLILHYRSKWKSSQGL   36 (75)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHhcccCCCC
Confidence            3455666778888888888888776443334


No 35 
>KOG4289|consensus
Probab=72.13  E-value=1.4e+02  Score=38.64  Aligned_cols=69  Identities=9%  Similarity=0.042  Sum_probs=37.2

Q ss_pred             cccCCCCCeeeEEEEEeccCCCCCCCCccceeccCCceEEeec--CCCCceeEEEEEEEEcCCCCeeEeeEEEEE
Q psy12980        240 FFDHEDGGTRKLKLHLMTMDRTTIPPTHWLQFDAKNQEFYGIP--QLTDLGRREYQLVCKDSNGLMANDGLEVIV  312 (609)
Q Consensus       240 F~D~eDGdT~~L~lsl~~~Dg~~l~~~sWL~Fd~~t~tf~G~P--~~~dvG~~~f~LtATDs~G~sasd~F~V~V  312 (609)
                      |.|+|-|---...+++...++.+    -=...++.++++--.-  ....+..|...+.|+|.+.-..+.+.+|+|
T Consensus       807 atDaD~g~Ng~v~y~~qg~~d~p----~~F~IEptSGviRtl~rLdRE~~avy~L~a~avDrg~p~ls~~~eItv  877 (2531)
T KOG4289|consen  807 ATDADSGPNGRVYYTFQGGDDGP----GDFYIEPTSGVIRTLRRLDRENVAVYVLAAYAVDRGNPPLSAPVEITV  877 (2531)
T ss_pred             EeccCCCCCceEEEEecCCCCCC----CceEEccCcceeehhhhhcchheeEEEEEEEEeeCCCCCcCCceEEEE
Confidence            66666655333444444444433    1122444444332221  223457889999999998776666665554


No 36 
>PF10577 UPF0560:  Uncharacterised protein family UPF0560;  InterPro: IPR018890  This family of proteins has no known function. 
Probab=71.69  E-value=1.7  Score=51.62  Aligned_cols=30  Identities=27%  Similarity=0.153  Sum_probs=14.4

Q ss_pred             EEEEeehHHHHHHHHHHHH--HHhheeeeccc
Q psy12980        466 YLITFVAPLVVIVIMLLCA--ALIACLLYRRR  495 (609)
Q Consensus       466 ~Llv~ViPaVIIa~mllia--~iIaC~cyRrK  495 (609)
                      ..|++++-..|++.|+||.  -+|.+||||||
T Consensus       268 ~~YHT~fLl~ILG~~~livl~lL~vLl~yCrr  299 (807)
T PF10577_consen  268 TTYHTVFLLAILGGTALIVLILLCVLLCYCRR  299 (807)
T ss_pred             hhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc
Confidence            3455665555555444422  22444555544


No 37 
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=69.38  E-value=17  Score=32.86  Aligned_cols=49  Identities=20%  Similarity=0.271  Sum_probs=31.3

Q ss_pred             EEeecCCCCceeEEEEEEEEc------CCCCeeEeeEEEEEECCCCCCcceEEEEEecC
Q psy12980        278 FYGIPQLTDLGRREYQLVCKD------SNGLMANDGLEVIVKSPGPAYYSASFSLTLAR  330 (609)
Q Consensus       278 f~G~P~~~dvG~~~f~LtATD------s~G~sasd~F~V~V~~~p~~~~~vef~L~l~~  330 (609)
                      -||.-.-+.-|.|.|.|--.+      ..|.+..++|+|.|...  .  ...+.+++..
T Consensus        25 ~yGtlti~~~G~wtYtl~n~~~avq~L~~Ge~~tdsFtvtv~DG--t--t~~vtItI~G   79 (99)
T TIGR01965        25 QYGTFSIDADGQWTYQADNSQTAVQALKAGETLTDTFTVTSADG--T--SQTVTITITG   79 (99)
T ss_pred             CcEEEEECCCCcEEEEeCCCcHHHHhhcCCCEEEEEEEEEEeCC--C--eEEEEEEEEc
Confidence            445443333488999885332      37889999999998753  1  4455555554


No 38 
>PHA03283 envelope glycoprotein E; Provisional
Probab=69.07  E-value=13  Score=42.49  Aligned_cols=25  Identities=40%  Similarity=0.577  Sum_probs=15.6

Q ss_pred             HHHHHHH--hheeeecccCCCCcccCC
Q psy12980        480 MLLCAAL--IACLLYRRRHTGKMSVGN  504 (609)
Q Consensus       480 mllia~i--IaC~cyRrKRkgK~~i~d  504 (609)
                      ++++++|  |+|+.|||+|+-.|.|++
T Consensus       411 ~~~~~~l~vw~c~~~r~~~~~~y~iln  437 (542)
T PHA03283        411 AALLVALVVWGCILYRRSNRKPYEVLN  437 (542)
T ss_pred             HHHHHHHhhhheeeehhhcCCcccccC
Confidence            3334444  777777777777776655


No 39 
>PF02480 Herpes_gE:  Alphaherpesvirus glycoprotein E;  InterPro: IPR003404 Glycoprotein E (gE) of Alphaherpesvirus forms a complex with glycoprotein I (gI), functioning as an immunoglobulin G (IgG) Fc binding protein. gE is involved in virus spread but is not essential for propagation [].; GO: 0016020 membrane; PDB: 2GJ7_F 2GIY_B.
Probab=68.19  E-value=1.7  Score=48.79  Aligned_cols=49  Identities=18%  Similarity=0.266  Sum_probs=16.3

Q ss_pred             eeeCCeEEEEcCCCccccCCCCCeeeEEEEEecc--CCCC-CCCCccceeccCC
Q psy12980        225 AISGELLVYHVPEDTFFDHEDGGTRKLKLHLMTM--DRTT-IPPTHWLQFDAKN  275 (609)
Q Consensus       225 at~G~~f~f~IP~dTF~D~eDGdT~~L~lsl~~~--Dg~~-l~~~sWL~Fd~~t  275 (609)
                      -..|-.|.++-+.---|  -.|+|..|..+|...  |..+ --.-.|+-|+...
T Consensus       143 ~~~~~~~~~~~~~s~vf--~~Gdtf~~~v~l~~~~~d~~~f~~~i~W~~~~~~~  194 (439)
T PF02480_consen  143 HRHGATFHLKNYHSHVF--SPGDTFHLSVHLQSEAHDDPPFSLEIDWYYMPTDP  194 (439)
T ss_dssp             ---SEEEEEE--SEEE----TT--EE---EEEEEESSS--EEEEEEEEEE---T
T ss_pred             cccccEEEEeccceEEe--cCCCcEEEeEEEEeccCCCCCeeEEEEEEEecCCC
Confidence            45566777766644323  257766666665532  2111 0112577777654


No 40 
>PRK09458 pspB phage shock protein B; Provisional
Probab=66.76  E-value=2.6  Score=36.24  Aligned_cols=30  Identities=20%  Similarity=0.461  Sum_probs=21.4

Q ss_pred             HHHHHHHHHHHHHhheeeecccCCCCcccC
Q psy12980        474 LVVIVIMLLCAALIACLLYRRRHTGKMSVG  503 (609)
Q Consensus       474 aVIIa~mllia~iIaC~cyRrKRkgK~~i~  503 (609)
                      +-+|++|+++|=+|.++-||.||++.-.+.
T Consensus         8 ~PliiF~ifVaPiWL~LHY~sk~~~~~~Ls   37 (75)
T PRK09458          8 IPLTIFVLFVAPIWLWLHYRSKRQGSQGLS   37 (75)
T ss_pred             HhHHHHHHHHHHHHHHHhhcccccCCCCCC
Confidence            445666788888888888998876655554


No 41 
>KOG1094|consensus
Probab=64.02  E-value=15  Score=42.82  Aligned_cols=30  Identities=30%  Similarity=0.564  Sum_probs=19.1

Q ss_pred             ehHHHHHHHHHHHHHHhheeeecccC-CCCc
Q psy12980        471 VAPLVVIVIMLLCAALIACLLYRRRH-TGKM  500 (609)
Q Consensus       471 ViPaVIIa~mllia~iIaC~cyRrKR-kgK~  500 (609)
                      ++..+++++++++|.||+.|+||+|| +.|.
T Consensus       392 ~~~~~f~~if~iva~ii~~~L~R~rr~~~ka  422 (807)
T KOG1094|consen  392 ILIIIFVAIFLIVALIIALMLWRWRRLLSKA  422 (807)
T ss_pred             ehHHHHHHHHHHHHHHHHHHHHHHHHHHhhh
Confidence            33345555556678888889999654 3433


No 42 
>smart00089 PKD Repeats in polycystic kidney disease 1 (PKD1) and other proteins. Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
Probab=63.28  E-value=12  Score=30.88  Aligned_cols=25  Identities=24%  Similarity=0.335  Sum_probs=21.8

Q ss_pred             ceeEEEEEEEEcCCCCeeEeeEEEEE
Q psy12980        287 LGRREYQLVCKDSNGLMANDGLEVIV  312 (609)
Q Consensus       287 vG~~~f~LtATDs~G~sasd~F~V~V  312 (609)
                      .|.|.++|+|+|..| +++...+|.|
T Consensus        54 ~G~y~v~l~v~n~~g-~~~~~~~i~v   78 (79)
T smart00089       54 PGTYTVTLTVTNAVG-SASATVTVVV   78 (79)
T ss_pred             CcEEEEEEEEEcCCC-cEEEEEEEEE
Confidence            399999999999999 7777887776


No 43 
>PF02439 Adeno_E3_CR2:  Adenovirus E3 region protein CR2;  InterPro: IPR003470 Early region 3 (E3) of human adenoviruses (Ads) codes for proteins that appear to control viral interactions with the host []. This region called CR1 (conserved region 1) [] is found three times in Human adenovirus 19 (a subgroup D adenovirus) 49 kDa protein in the E3 region. CR1 is also found in the 20.1 Kd protein of subgroup B adenoviruses. The function of this 80 amino acid region is unknown. This region is probably a divergent immunoglobulin domain.
Probab=63.24  E-value=4.3  Score=30.59  Aligned_cols=27  Identities=15%  Similarity=0.435  Sum_probs=17.9

Q ss_pred             HHHHHHHHHHHHHhheeee--cccCCCCc
Q psy12980        474 LVVIVIMLLCAALIACLLY--RRRHTGKM  500 (609)
Q Consensus       474 aVIIa~mllia~iIaC~cy--RrKRkgK~  500 (609)
                      ++|++++..++-++.|++|  +++||.+.
T Consensus         7 aIIv~V~vg~~iiii~~~~YaCcykk~~~   35 (38)
T PF02439_consen    7 AIIVAVVVGMAIIIICMFYYACCYKKHRR   35 (38)
T ss_pred             hHHHHHHHHHHHHHHHHHHHHHHHccccc
Confidence            5566666667767666555  78888764


No 44 
>PF05345 He_PIG:  Putative Ig domain;  InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=62.55  E-value=9.6  Score=29.87  Aligned_cols=27  Identities=19%  Similarity=0.146  Sum_probs=18.9

Q ss_pred             ceecccCccC--cCcccccc-ccceeeEEE
Q psy12980        160 RYVISNTTTG--QAKPVLTR-ALGRLMTVK  186 (609)
Q Consensus       160 ~~~~~~t~T~--sgtp~~~d-~~~~~v~v~  186 (609)
                      .|+.||..|+  +|+|...+ .+...++|+
T Consensus        14 ~gLs~d~~tG~isGtp~~~~~~G~y~~~vt   43 (49)
T PF05345_consen   14 SGLSLDPSTGTISGTPTSSVQPGTYTFTVT   43 (49)
T ss_pred             CcEEEeCCCCEEEeecCCCccccEEEEEEE
Confidence            3999987776  69998773 355555554


No 45 
>PF05568 ASFV_J13L:  African swine fever virus J13L protein;  InterPro: IPR008385 This family consists of several African swine fever virus (ASFV) j13L proteins [, , ].
Probab=62.38  E-value=5.3  Score=38.51  Aligned_cols=55  Identities=11%  Similarity=0.152  Sum_probs=29.9

Q ss_pred             HHHHHhheeeecccCCCCcccCCCcccccccccCCCCEEcccccccccCCCCCCCCCeeccCC
Q psy12980        482 LCAALIACLLYRRRHTGKMSVGNGVDSGILIRSKGIPVIFQDELEERLEVSSGTKSPIILKEE  544 (609)
Q Consensus       482 lia~iIaC~cyRrKRkgK~~i~d~deE~~tF~~KGiPvIF~dElde~~~~~~~~~~P~IlkeE  544 (609)
                      ++.|+.+.+|-|||||.-..|+   ||..-|+    ++.=.-|--+ ..|.|+.+.|.=-.+-
T Consensus        43 iiiivli~lcssRKkKaaAAi~---eediQfi----npyqDqqw~~-v~pqPgt~kPAgAtt~   97 (189)
T PF05568_consen   43 IIIIVLIYLCSSRKKKAAAAIE---EEDIQFI----NPYQDQQWAG-VTPQPGTSKPAGATTG   97 (189)
T ss_pred             HHHHHHHHHHhhhhHHHHhhhh---hhccccc----Ccccchhhcc-CCCCCCCCCCCCCccc
Confidence            3444445567777766665563   5566676    3322222222 2477777776655443


No 46 
>PRK13211 N-acetylglucosamine-binding protein A; Reviewed
Probab=61.06  E-value=30  Score=39.47  Aligned_cols=64  Identities=9%  Similarity=0.133  Sum_probs=44.3

Q ss_pred             eEEEEEeccCCCCCCCCccceeccCCceEEeecCCCCceeEEEEEEEEcCCCCee-EeeEEEEEEC
Q psy12980        250 KLKLHLMTMDRTTIPPTHWLQFDAKNQEFYGIPQLTDLGRREYQLVCKDSNGLMA-NDGLEVIVKS  314 (609)
Q Consensus       250 ~L~lsl~~~Dg~~l~~~sWL~Fd~~t~tf~G~P~~~dvG~~~f~LtATDs~G~sa-sd~F~V~V~~  314 (609)
                      ++++.|...+|+.+...+ ...+..++.|.=.-.+...|.|.+.++|+|+.|... ++++.|.|..
T Consensus       342 ~vta~V~d~~g~~~~~~~-~~v~d~s~~vtL~Ls~~~AG~y~Lvv~~t~~dG~~~~q~~~~~~v~~  406 (478)
T PRK13211        342 NVEATVYNHDGEALGSKS-QTVNDGSQSVSLDLSKLKAGHHMLVVKAKPKDGELIKQQTLDFMLEA  406 (478)
T ss_pred             EEEEEEEcCCCCeeeeee-EEecCCceeEEEecccCCCceEEEEEEEEeCCCceeeeeeEEEEEEe
Confidence            677777888888776332 222333344443333334599999999999999988 8899999863


No 47 
>PF14610 DUF4448:  Protein of unknown function (DUF4448)
Probab=60.26  E-value=8.4  Score=38.04  Aligned_cols=29  Identities=24%  Similarity=0.477  Sum_probs=15.0

Q ss_pred             EEEEeehHHHHHHHHHHHHHHhheeeecccC
Q psy12980        466 YLITFVAPLVVIVIMLLCAALIACLLYRRRH  496 (609)
Q Consensus       466 ~Llv~ViPaVIIa~mllia~iIaC~cyRrKR  496 (609)
                      +.+...||.++++++++++  +.|+++||+|
T Consensus       158 ~~laI~lPvvv~~~~~~~~--~~~~~~R~~R  186 (189)
T PF14610_consen  158 YALAIALPVVVVVLALIMY--GFFFWNRKKR  186 (189)
T ss_pred             eeEEEEccHHHHHHHHHHH--hhheeeccce
Confidence            3444558888777333332  3334455554


No 48 
>PF00974 Rhabdo_glycop:  Rhabdovirus spike glycoprotein;  InterPro: IPR001903 Different families of ssRNA negative-strand viruses contain glycoproteins responsible for forming spikes on the surface of the virion. The glycoprotein spike is made up of a trimer of glycoproteins. These proteins are frequently abbreviated to G protein. Channel formed by glycoprotein spike is thought to function in a similar manner to Influenza virus M2 protein channel, thus allowing a signal to pass across the viral membrane to signal for viral uncoating [, ].; GO: 0019031 viral envelope; PDB: 2CMZ_C 2J6J_A 3EGD_D.
Probab=59.81  E-value=3  Score=47.57  Aligned_cols=17  Identities=24%  Similarity=0.403  Sum_probs=7.7

Q ss_pred             eEEEEcCCCccccCCCC
Q psy12980        230 LLVYHVPEDTFFDHEDG  246 (609)
Q Consensus       230 ~f~f~IP~dTF~D~eDG  246 (609)
                      .|....|-.-.+|+=+|
T Consensus       146 ~f~~i~ph~V~~DpYs~  162 (501)
T PF00974_consen  146 TFVIITPHPVLLDPYSG  162 (501)
T ss_dssp             EEEEEEE--EEEETTTT
T ss_pred             EEEEEecccceeccccC
Confidence            55555555555554433


No 49 
>PF13754 Big_3_4:  Bacterial Ig-like domain (group 3)
Probab=58.78  E-value=28  Score=27.59  Aligned_cols=27  Identities=11%  Similarity=0.214  Sum_probs=21.0

Q ss_pred             CceeEEEEEEEEcCCCCeeEe-eEEEEE
Q psy12980        286 DLGRREYQLVCKDSNGLMAND-GLEVIV  312 (609)
Q Consensus       286 dvG~~~f~LtATDs~G~sasd-~F~V~V  312 (609)
                      .-|.|.|.++|+|..|..... ...|.|
T Consensus        22 ~dG~y~itv~a~D~AGN~s~~~~~~~ti   49 (54)
T PF13754_consen   22 ADGTYTITVTATDAAGNTSTSSSVTFTI   49 (54)
T ss_pred             CCccEEEEEEEEeCCCCCCCccceeEEE
Confidence            359999999999999987665 344544


No 50 
>TIGR02976 phageshock_pspB phage shock protein B. This model describes the PspB protein of the psp (phage shock protein) operon, as found in Escherichia coli and many related species. Expression of a phage protein called secretin protein IV, and a number of other stresses including ethanol, heat shock, and defects in protein secretion trigger sigma-54-dependent expression of the phage shock regulon. PspB is both a regulator and an effector protein of the phage shock response.
Probab=58.50  E-value=3.5  Score=35.44  Aligned_cols=27  Identities=26%  Similarity=0.536  Sum_probs=17.8

Q ss_pred             ehHHHHHHHHHHHHHHhheeeecccCCCC
Q psy12980        471 VAPLVVIVIMLLCAALIACLLYRRRHTGK  499 (609)
Q Consensus       471 ViPaVIIa~mllia~iIaC~cyRrKRkgK  499 (609)
                      ++|++|+  |+++|-+|-++-||+|+++.
T Consensus         7 ~~Pliif--~ifVap~wl~lHY~~k~~~~   33 (75)
T TIGR02976         7 AIPLIIF--VIFVAPLWLILHYRSKRKTA   33 (75)
T ss_pred             HHHHHHH--HHHHHHHHHHHHHHhhhccC
Confidence            4555444  67778888888888665443


No 51 
>PF12273 RCR:  Chitin synthesis regulation, resistance to Congo red;  InterPro: IPR020999  RCR proteins are ER membrane proteins that regulate chitin deposition in fungal cell walls. Although chitin, a linear polymer of beta-1,4-linked N-acetylglucosamine, constitutes only 2% of the cell wall it plays a vital role in the overall protection of the cell wall against stress, noxious chemicals and osmotic pressure changes. Congo red is a cell wall-disrupting benzidine-type dye extensively used in many cell wall mutant studies that specifically targets chitin in yeast cells and inhibits growth. RCR proteins render the yeasts resistant to Congo red by diminishing the content of chitin in the cell wall []. RCR proteins are probably regulating chitin synthase III interact directly with ubiquitin ligase Rsp5, and the VPEY motif is necessary for this, via interaction with the WW domains of Rsp5 []. 
Probab=58.36  E-value=3.1  Score=38.68  Aligned_cols=26  Identities=8%  Similarity=0.211  Sum_probs=15.2

Q ss_pred             HHHHHHHHHHHHhheeeecccCCCCc
Q psy12980        475 VVIVIMLLCAALIACLLYRRRHTGKM  500 (609)
Q Consensus       475 VIIa~mllia~iIaC~cyRrKRkgK~  500 (609)
                      +++++|++|..+|+++|+++||+-|.
T Consensus         4 l~~iii~~i~l~~~~~~~~~rRR~r~   29 (130)
T PF12273_consen    4 LFAIIIVAILLFLFLFYCHNRRRRRR   29 (130)
T ss_pred             eHHHHHHHHHHHHHHHHHHHHHHhhc
Confidence            34444444555566667777777666


No 52 
>TIGR00864 PCC polycystin cation channel protein. Note: this model has been restricted to the amino half because for technical reasons.
Probab=56.96  E-value=1.7e+02  Score=40.13  Aligned_cols=61  Identities=11%  Similarity=0.072  Sum_probs=39.2

Q ss_pred             CCCeeeEEEEEeccCCCCCCCCccceeccCC----ceEEeecCCCCceeEEEEEEEEcC-CCCeeEeeEEEE
Q psy12980        245 DGGTRKLKLHLMTMDRTTIPPTHWLQFDAKN----QEFYGIPQLTDLGRREYQLVCKDS-NGLMANDGLEVI  311 (609)
Q Consensus       245 DGdT~~L~lsl~~~Dg~~l~~~sWL~Fd~~t----~tf~G~P~~~dvG~~~f~LtATDs-~G~sasd~F~V~  311 (609)
                      .|.  ++.|.....+|+++ ...|- |+..+    +...=++...  |.|.+.|.|++. ++.+++-.|.|+
T Consensus      1799 ag~--~v~F~a~vstGsnV-sw~W~-f~~g~s~~gk~v~~Tf~~a--G~ytV~L~AsN~vs~~~~s~~~~VQ 1864 (2740)
T TIGR00864      1799 ADS--SVCFQGELATGTNV-SWCWA-IDGGSSKMGKHACMTFPDA--GTFAIRLNASNAVSGKSASREFFAE 1864 (2740)
T ss_pred             cCc--EEEEEEEccCCCee-EEEEE-eCCCCccccceeEEecCCC--eEEEEEEEEEcccCcceeeeeEEEE
Confidence            466  56666666788887 45675 55332    3333344443  999999999998 555666666654


No 53 
>KOG1219|consensus
Probab=56.03  E-value=1.2e+02  Score=41.22  Aligned_cols=186  Identities=16%  Similarity=0.154  Sum_probs=90.5

Q ss_pred             CCCcccCcCccceeecCccE---EEEcCCCCcccCCCCCeeeEEEEEecccceecc--cCccCcCc-cccccc-cc--ee
Q psy12980        112 QVPLIRNPIDHLEAISGELL---VYHVPEDTFFDHEDGGTRKLKLHLMTMDRYVIS--NTTTGQAK-PVLTRA-LG--RL  182 (609)
Q Consensus       112 ~~P~v~~pi~~~~a~e~~~f---s~~iP~~tF~D~d~Gdt~~ltlsl~~~d~~~~~--~t~T~sgt-p~~~d~-~~--~~  182 (609)
                      ++|+...=+....+.|+..-   -.+|-|   .|.|.|-.-.|+|.+..+|||..|  |..||+-. -..=|. -.  .-
T Consensus       949 hpp~F~~~v~e~~V~EnapiGT~vi~i~A---~dedsgldg~l~Y~I~~gdg~g~FsId~~tG~irTl~~lDrE~ks~Yw 1025 (4289)
T KOG1219|consen  949 HPPEFISFVTEGHVLENAPIGTIVIRIQA---RDEDSGLDGELSYKIRTGDGDGIFSIDSTTGSIRTLKALDREKKSSYW 1025 (4289)
T ss_pred             CCcchheeeeeeeEeecCCcceEEEEEEE---ecCCCCccceEEEEEEcCCcceeEEecCCcceEeechhhchhhcceEE
Confidence            45666555555566665432   122332   577888888899999999999999  66666521 000010 00  01


Q ss_pred             eEEEeeeecccccccCC-CCCCceecccCCC-CceecCccceeeeeeCCeE---EEEcCCCccccCCCCCeeeEEEEEec
Q psy12980        183 MTVKNITWRGIGHCEDS-PPPPVVKVKLENQ-VPLIRNPIDHLEAISGELL---VYHVPEDTFFDHEDGGTRKLKLHLMT  257 (609)
Q Consensus       183 v~v~~~~~~~~~~~~~~-~~~~tv~v~~~N~-aPvv~n~i~~~~at~G~~f---~f~IP~dTF~D~eDGdT~~L~lsl~~  257 (609)
                      +||.+.+.   |+..-+ --...+.|..+|+ +|....+.=--.+.+...-   ..++  + =+|+|.-.-.+|.|.++.
T Consensus      1026 ltveA~D~---gt~~~ssv~~vyI~ieDvNDn~Pq~s~pvy~asI~enSp~~vsivq~--e-a~D~Dsssn~kLmykI~s 1099 (4289)
T KOG1219|consen 1026 LTVEAKDL---GTVPLSSVCEVYIEIEDVNDNVPQFSSPVYYASISENSPETVSIVQA--E-ANDPDSSSNQKLMYKITS 1099 (4289)
T ss_pred             EEEEEEec---CCCccccceeEEEEEEecCCCCcccCCceEeeeeccCCCCceEEEEe--c-cCCCCcccCcceEEEEcc
Confidence            22221111   111000 0011122333333 2222222111111111111   1112  1 235543333377777764


Q ss_pred             cCCCCCCCCccceeccCCceEEeecCCCC--ceeEEEEEEEEcCCCCeeEeeEEEE
Q psy12980        258 MDRTTIPPTHWLQFDAKNQEFYGIPQLTD--LGRREYQLVCKDSNGLMANDGLEVI  311 (609)
Q Consensus       258 ~Dg~~l~~~sWL~Fd~~t~tf~G~P~~~d--vG~~~f~LtATDs~G~sasd~F~V~  311 (609)
                      .|  +   ....+.|+.++.+.|.-.-+.  -.-|...|+.+|.+--+-...--|.
T Consensus      1100 Gn--y---q~FF~Id~~TG~iTt~r~LDRE~qdEHiLeVTi~D~gep~l~s~~rvi 1150 (4289)
T KOG1219|consen 1100 GN--Y---QGFFQIDPETGLITTIRRLDREKQDEHILEVTIQDNGEPWLCSNQRVI 1150 (4289)
T ss_pred             CC--c---cceEEEccccceeeeehhhcccccccceEEEEEecCCCCccccceEEE
Confidence            43  2   368899999998887643222  2468888999997655533333333


No 54 
>PF15069 FAM163:  FAM163 family
Probab=55.63  E-value=9  Score=36.81  Aligned_cols=18  Identities=17%  Similarity=0.231  Sum_probs=10.2

Q ss_pred             EeehHHHHHHHHHHHHHH
Q psy12980        469 TFVAPLVVIVIMLLCAAL  486 (609)
Q Consensus       469 v~ViPaVIIa~mllia~i  486 (609)
                      ++||-+.|+|.++||..|
T Consensus         5 TvVItGgILAtVILLcII   22 (143)
T PF15069_consen    5 TVVITGGILATVILLCII   22 (143)
T ss_pred             eEEEechHHHHHHHHHHH
Confidence            355666666666554433


No 55 
>PF08693 SKG6:  Transmembrane alpha-helix domain;  InterPro: IPR014805 SKG6 and AXL2 are membrane proteins that show polarised intracellular localisation [, ]. This entry represents the highly conserved transmembrane alpha-helical domain found in these proteins [, ]. The full-length AXL2 protein has a negative regulatory function in cytokinesis [].
Probab=55.07  E-value=10  Score=28.99  Aligned_cols=23  Identities=35%  Similarity=0.616  Sum_probs=10.7

Q ss_pred             eehHHHHHHHHHHHHHHhheeeeccc
Q psy12980        470 FVAPLVVIVIMLLCAALIACLLYRRR  495 (609)
Q Consensus       470 ~ViPaVIIa~mllia~iIaC~cyRrK  495 (609)
                      .++|.++|++   +.+++-++-||||
T Consensus        17 VvVPV~vI~~---vl~~~l~~~~rR~   39 (40)
T PF08693_consen   17 VVVPVGVIII---VLGAFLFFWYRRK   39 (40)
T ss_pred             EEechHHHHH---HHHHHhheEEecc
Confidence            4566655542   2223333445555


No 56 
>PF14979 TMEM52:  Transmembrane 52
Probab=54.75  E-value=6.1  Score=38.08  Aligned_cols=26  Identities=23%  Similarity=0.313  Sum_probs=17.2

Q ss_pred             CccEEEEeehHHHHHHHHHHHHHHhhe
Q psy12980        463 SVEYLITFVAPLVVIVIMLLCAALIAC  489 (609)
Q Consensus       463 ~~~~Llv~ViPaVIIa~mllia~iIaC  489 (609)
                      +|.-|. ||.-.+++++++|++|+.+.
T Consensus        15 ~W~~LW-yIwLill~~~llLLCG~ta~   40 (154)
T PF14979_consen   15 RWSSLW-YIWLILLIGFLLLLCGLTAS   40 (154)
T ss_pred             ceehhh-HHHHHHHHHHHHHHHHHHHH
Confidence            444443 66667777888888888543


No 57 
>PF08374 Protocadherin:  Protocadherin;  InterPro: IPR013585 The structure of protocadherins is similar to that of classic cadherins (IPR002126 from INTERPRO), but they also have some unique features associated with the cytoplasmic domains. They are expressed in a variety of organisms and are found in high concentrations in the brain where they seem to be localised mainly at cell-cell contact sites. Their expression seems to be developmentally regulated []. 
Probab=54.70  E-value=11  Score=38.54  Aligned_cols=30  Identities=7%  Similarity=0.163  Sum_probs=18.7

Q ss_pred             ccEEEEeehHHHHHHHHHHHHHHhheeeec
Q psy12980        464 VEYLITFVAPLVVIVIMLLCAALIACLLYR  493 (609)
Q Consensus       464 ~~~Llv~ViPaVIIa~mllia~iIaC~cyR  493 (609)
                      -...++-||.+++.|++++++..++++|+.
T Consensus        36 ~~~I~iaiVAG~~tVILVI~i~v~vR~CRq   65 (221)
T PF08374_consen   36 YVKIMIAIVAGIMTVILVIFIVVLVRYCRQ   65 (221)
T ss_pred             ceeeeeeeecchhhhHHHHHHHHHHHHHhh
Confidence            344555667677766666666666666663


No 58 
>PF14575 EphA2_TM:  Ephrin type-A receptor 2 transmembrane domain; PDB: 3KUL_A 2XVD_A 2VX1_A 2VWV_A 2VX0_A 2VWY_A 2VWZ_A 2VWW_A 2VWU_A 2VWX_A ....
Probab=53.63  E-value=9.7  Score=32.56  Aligned_cols=13  Identities=23%  Similarity=0.340  Sum_probs=3.4

Q ss_pred             heeeecccCCCCc
Q psy12980        488 ACLLYRRRHTGKM  500 (609)
Q Consensus       488 aC~cyRrKRkgK~  500 (609)
                      ..+|.||++.+|.
T Consensus        20 ~~~~~rr~~~~~~   32 (75)
T PF14575_consen   20 VIVCFRRCKYSKA   32 (75)
T ss_dssp             HHCCCTT------
T ss_pred             EEEEEeeEcCCCC
Confidence            3355555544444


No 59 
>KOG4331|consensus
Probab=52.45  E-value=3.9  Score=48.68  Aligned_cols=37  Identities=19%  Similarity=0.214  Sum_probs=16.1

Q ss_pred             CccEEEEeehHHHHHHHHHHHHHH--hheeeecccCCCCc
Q psy12980        463 SVEYLITFVAPLVVIVIMLLCAAL--IACLLYRRRHTGKM  500 (609)
Q Consensus       463 ~~~~Llv~ViPaVIIa~mllia~i--IaC~cyRrKRkgK~  500 (609)
                      +|+|++-.|+-+++|+.+ -++++  |.|+||||+++++.
T Consensus       103 ~~g~lv~sV~~v~~iil~-p~~~~~yccc~C~~rc~~r~~  141 (865)
T KOG4331|consen  103 EAGMLVCSVILVLYIILK-PSEGLCYCCCRCCRRCGGRIK  141 (865)
T ss_pred             HHHHHHHHHHHHHHHHHH-HHHhHHhheeeeHhhhcCCCC
Confidence            445544334433333322 24455  44455556544444


No 60 
>PF01034 Syndecan:  Syndecan domain;  InterPro: IPR001050 The syndecans are transmembrane proteoglycans which are involved in the organisation of cytoskeleton and/or actin microfilaments, and have important roles as cell surface receptors during cell-cell and/or cell-matrix interactions [, ]. Structurally, these proteins consist of four separate domains:   A signal sequence; An extracellular domain (ectodomain) of variable length whose sequence is not evolutionary conserved in the various forms of syndecans. The ectodomain contains the sites of attachment of the heparan sulphate glycosaminoglycan side chains;  A transmembrane region;  A highly conserved cytoplasmic domain of about 30 to 35 residues, which could interact with cytoskeletal proteins.    The proteins known to belong to this family are:    Syndecan 1.  Syndecan 2 or fibroglycan.  Syndecan 3 or neuroglycan or N-syndecan.  Syndecan 4 or amphiglycan or ryudocan.  Drosophila syndecan.   Caenorhabditis elegans probable syndecan (F57C7.3).    Syndecan-4, a transmembrane heparan sulphate proteoglycan, is a coreceptor with integrins in cell adhesion. It has been suggested to form a ternary signalling complex with protein kinase Calpha and phosphatidylinositol 4,5-bisphosphate (PIP2). Structural studies have demonstrated that the cytoplasmic domain undergoes a conformational transition and forms a symmetric dimer in the presence of phospholipid activator PIP2, and whose overall structure in solution exhibits a twisted clamp shape having a cavity in the centre of dimeric interface. In addition, it has been observed that the syndecan-4 variable domain interacts, strongly, not only with fatty acyl groups but also the anionic head group of PIP2. These findings indicate that PIP2 promotes oligomerisation of the syndecan-4 cytoplasmic domain for transmembrane signalling and cell-matrix adhesion [, ].; GO: 0008092 cytoskeletal protein binding, 0016020 membrane; PDB: 1EJQ_B 1EJP_B 1YBO_C 1OBY_Q.
Probab=51.89  E-value=5  Score=33.55  Aligned_cols=35  Identities=29%  Similarity=0.425  Sum_probs=1.8

Q ss_pred             EEeehHHHHHHHHHHHHHHhheeeecccC--CCCcccC
Q psy12980        468 ITFVAPLVVIVIMLLCAALIACLLYRRRH--TGKMSVG  503 (609)
Q Consensus       468 lv~ViPaVIIa~mllia~iIaC~cyRrKR--kgK~~i~  503 (609)
                      +..+|.++|++ +++...+|.+++||-|+  .|-+.++
T Consensus        11 laavIaG~Vvg-ll~ailLIlf~iyR~rkkdEGSY~l~   47 (64)
T PF01034_consen   11 LAAVIAGGVVG-LLFAILLILFLIYRMRKKDEGSYDLD   47 (64)
T ss_dssp             ---------------------------S------SS--
T ss_pred             HHHHHHHHHHH-HHHHHHHHHHHHHHHHhcCCCCccCC
Confidence            34555555543 33333456677777443  6666663


No 61 
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=51.72  E-value=25  Score=31.85  Aligned_cols=17  Identities=12%  Similarity=0.210  Sum_probs=13.6

Q ss_pred             CCceecccCCCCceecC
Q psy12980        202 PPVVKVKLENQVPLIRN  218 (609)
Q Consensus       202 ~~tv~v~~~N~aPvv~n  218 (609)
                      .++|+|...|++|++..
T Consensus        72 ~vtItI~GtNDapvi~~   88 (99)
T TIGR01965        72 TVTITITGANDAAVIGG   88 (99)
T ss_pred             EEEEEEEccCCCCEEec
Confidence            36777889999998765


No 62 
>PF06365 CD34_antigen:  CD34/Podocalyxin family;  InterPro: IPR013836 This family consists of several mammalian CD34 antigen proteins. The CD34 antigen is a human leukocyte membrane protein expressed specifically by lymphohematopoietic progenitor cells. CD34 is a phosphoprotein. Activation of protein kinase C (PKC) has been found to enhance CD34 phosphorylation [, ]. This family contains several eukaryotic podocalyxin proteins. Podocalyxin is a major membrane protein of the glomerular epithelium and is thought to be involved in maintenance of the architecture of the foot processes and filtration slits characteristic of this unique epithelium by virtue of its high negative charge. Podocalyxin functions as an anti-adhesin that maintains an open filtration pathway between neighbouring foot processes in the glomerular epithelium by charge repulsion [].
Probab=50.49  E-value=13  Score=37.81  Aligned_cols=23  Identities=13%  Similarity=0.159  Sum_probs=10.2

Q ss_pred             CcEEEEeeecCCCCCCCcEEEEEEecC
Q psy12980        358 SKIVNIEFEDNPQEPGKSTVVRWANRT  384 (609)
Q Consensus       358 s~ItVrsI~~~~~~~~gSTvV~w~N~T  384 (609)
                      =.|.+-.-.   .+ ..-.+++-++.+
T Consensus        38 C~l~Laq~~---~~-~q~Lll~l~~~t   60 (202)
T PF06365_consen   38 CSLSLAQSE---EN-QQCLLLVLTNET   60 (202)
T ss_pred             cEEEEecCC---CC-cceEEEEeeccc
Confidence            456665433   11 233444445544


No 63 
>KOG4289|consensus
Probab=49.65  E-value=1.9e+02  Score=37.50  Aligned_cols=84  Identities=12%  Similarity=0.191  Sum_probs=47.9

Q ss_pred             ccCCCCCeeeEEEEEeccCCCCCCCCccceeccCCceEE--eecCCCCceeEEEEEEEEcCCCC--eeEeeEEEEEECCC
Q psy12980        241 FDHEDGGTRKLKLHLMTMDRTTIPPTHWLQFDAKNQEFY--GIPQLTDLGRREYQLVCKDSNGL--MANDGLEVIVKSPG  316 (609)
Q Consensus       241 ~D~eDGdT~~L~lsl~~~Dg~~l~~~sWL~Fd~~t~tf~--G~P~~~dvG~~~f~LtATDs~G~--sasd~F~V~V~~~p  316 (609)
                      .|++.|.--.|.|++...   .     =.|.|..++.+.  +.-....+-.|.|.|+|+|.+--  +++.+..|.|..-.
T Consensus       503 idadsg~na~l~y~laG~---~-----pf~I~~~SG~Itvtk~ldrEt~~~ysl~V~ard~gtp~l~tstsI~Vtv~dvn  574 (2531)
T KOG4289|consen  503 IDADSGENARLHYSLAGV---G-----PFQINNGSGWITVTKELDRETVEHYSLGVEARDHGTPPLSTSTSISVTVLDVN  574 (2531)
T ss_pred             ccCCCCcccceeeeeccC---C-----CeeEecCCceEEEeecccccccceEEEEEEEcCCCCCcccccceEEEEecccC
Confidence            467778744566665422   2     125666666444  34455567889999999998654  45556666664411


Q ss_pred             ---CCCcceEEEEEecCCC
Q psy12980        317 ---PAYYSASFSLTLARPS  332 (609)
Q Consensus       317 ---~~~~~vef~L~l~~~~  332 (609)
                         +.-..-|..++++.++
T Consensus       575 dndP~Ft~~eytl~inED~  593 (2531)
T KOG4289|consen  575 DNDPTFTQKEYTLRINEDA  593 (2531)
T ss_pred             CCCCccccCceEEEecCCc
Confidence               1112224556665543


No 64 
>PHA03286 envelope glycoprotein E; Provisional
Probab=49.04  E-value=7.1  Score=43.78  Aligned_cols=26  Identities=38%  Similarity=0.423  Sum_probs=16.7

Q ss_pred             ehHHHHHHHHHHHHHHhheeeecccC
Q psy12980        471 VAPLVVIVIMLLCAALIACLLYRRRH  496 (609)
Q Consensus       471 ViPaVIIa~mllia~iIaC~cyRrKR  496 (609)
                      -+.++.|+++++++.+|||+++||||
T Consensus       395 s~~~~~~~~~~~~~~~~~~~~~r~~~  420 (492)
T PHA03286        395 SMAAGAILVVLLFALCIAGLYRRRRR  420 (492)
T ss_pred             HHHHHHHHHHHHHHHHhHhHhhhhhh
Confidence            34455555566677778888777664


No 65 
>PHA03291 envelope glycoprotein I; Provisional
Probab=49.01  E-value=8.6  Score=41.88  Aligned_cols=29  Identities=7%  Similarity=0.005  Sum_probs=24.0

Q ss_pred             cCCCCceeEEEEEEEEcCCCCeeEeeEEEEEE
Q psy12980        282 PQLTDLGRREYQLVCKDSNGLMANDGLEVIVK  313 (609)
Q Consensus       282 P~~~dvG~~~f~LtATDs~G~sasd~F~V~V~  313 (609)
                      |.-.|.|.|.++|.+   +|++..|-|-+-+.
T Consensus       136 P~~~DsGiYvLRV~v---dgat~adlF~lg~~  164 (401)
T PHA03291        136 AARAVVGLYVLRVWV---EGATNASLFPLGLA  164 (401)
T ss_pred             CCcccceeEEEEEEe---CCCcccceEEEeee
Confidence            666678999999998   67788888988876


No 66 
>PF00558 Vpu:  Vpu protein;  InterPro: IPR008187 The Human immunodeficiency virus 1 (HIV-1) Vpu protein acts in the degradation of CD4 in the endoplasmic reticulum and in the enhancement of virion release from the plasma membrane of infected cells [].; GO: 0019076 release of virus from host; PDB: 2JPX_A 1PI8_A 2GOH_A 2GOF_A 1PI7_A 1PJE_A 1VPU_A 2K7Y_A.
Probab=48.04  E-value=13  Score=32.51  Aligned_cols=30  Identities=20%  Similarity=0.374  Sum_probs=10.3

Q ss_pred             eehHHHHHHHHHHHHHHhheeeecccCCCC
Q psy12980        470 FVAPLVVIVIMLLCAALIACLLYRRRHTGK  499 (609)
Q Consensus       470 ~ViPaVIIa~mllia~iIaC~cyRrKRkgK  499 (609)
                      .++.++|..++..+.|.++.+.|||-++.|
T Consensus         8 ~iialiv~~iiaIvvW~iv~ieYrk~~rqr   37 (81)
T PF00558_consen    8 AIIALIVALIIAIVVWTIVYIEYRKIKRQR   37 (81)
T ss_dssp             HHHHHHHHHHHHHHHHHHH-----------
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            334344444444577888888888654433


No 67 
>PF05393 Hum_adeno_E3A:  Human adenovirus early E3A glycoprotein;  InterPro: IPR008652 This family consists of several early glycoproteins (E3A), from human adenovirus type 2.; GO: 0016021 integral to membrane
Probab=44.72  E-value=13  Score=33.03  Aligned_cols=12  Identities=25%  Similarity=0.401  Sum_probs=4.6

Q ss_pred             eeeeccc-CCCCc
Q psy12980        489 CLLYRRR-HTGKM  500 (609)
Q Consensus       489 C~cyRrK-RkgK~  500 (609)
                      ||..||| |..=|
T Consensus        54 CC~kRkrsRrPIY   66 (94)
T PF05393_consen   54 CCKKRKRSRRPIY   66 (94)
T ss_pred             HHHHhhhccCCcc
Confidence            3333433 43333


No 68 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=44.32  E-value=17  Score=34.41  Aligned_cols=10  Identities=10%  Similarity=0.202  Sum_probs=6.1

Q ss_pred             CCCCCCCCCH
Q psy12980        384 TLASNVECPE  393 (609)
Q Consensus       384 TLp~~~~CP~  393 (609)
                      ..|....||.
T Consensus        38 ~~~~i~~Cp~   47 (139)
T PHA03099         38 DIPAIRLCGP   47 (139)
T ss_pred             CCcccccCCh
Confidence            3455577876


No 69 
>PF14914 LRRC37AB_C:  LRRC37A/B like protein 1 C-terminal domain
Probab=44.27  E-value=10  Score=36.69  Aligned_cols=21  Identities=33%  Similarity=0.491  Sum_probs=11.9

Q ss_pred             HHHHHHHHHHHHhheeeeccc
Q psy12980        475 VVIVIMLLCAALIACLLYRRR  495 (609)
Q Consensus       475 VIIa~mllia~iIaC~cyRrK  495 (609)
                      +|+.+++++.++|-+||+||.
T Consensus       129 vv~~iliii~CLiei~shr~a  149 (154)
T PF14914_consen  129 VVVMILIIIFCLIEICSHRRA  149 (154)
T ss_pred             HHHHHHHHHHHHHHHHhcccc
Confidence            333333345556777777776


No 70 
>PF13908 Shisa:  Wnt and FGF inhibitory regulator
Probab=43.11  E-value=15  Score=35.90  Aligned_cols=22  Identities=18%  Similarity=0.324  Sum_probs=13.4

Q ss_pred             EEEEeehHHHHHHHHHHHHHHh
Q psy12980        466 YLITFVAPLVVIVIMLLCAALI  487 (609)
Q Consensus       466 ~Llv~ViPaVIIa~mllia~iI  487 (609)
                      ++.+.++-+||+.+++++.+|+
T Consensus        75 ~~~~~iivgvi~~Vi~Iv~~Iv   96 (179)
T PF13908_consen   75 YFITGIIVGVICGVIAIVVLIV   96 (179)
T ss_pred             cceeeeeeehhhHHHHHHHhHh
Confidence            4566677777776665555443


No 71 
>PF15065 NCU-G1:  Lysosomal transcription factor, NCU-G1
Probab=42.18  E-value=7.2  Score=42.63  Aligned_cols=29  Identities=24%  Similarity=0.305  Sum_probs=23.8

Q ss_pred             ceEEeecCCCCceeEEEEEEEEcCCCCee
Q psy12980        276 QEFYGIPQLTDLGRREYQLVCKDSNGLMA  304 (609)
Q Consensus       276 ~tf~G~P~~~dvG~~~f~LtATDs~G~sa  304 (609)
                      .+|.|.+..+.-|...|+|.|-+..|..+
T Consensus       117 ~~~~g~~~~~~ngsi~~~~~af~~~gr~~  145 (350)
T PF15065_consen  117 ATFRGHPYSDPNGSIAFKLQAFSTSGRDA  145 (350)
T ss_pred             EEEEecccCCCCCeEEEEEEEecCCCCCc
Confidence            47888887766799999999999988754


No 72 
>PRK00269 zipA cell division protein ZipA; Reviewed
Probab=42.17  E-value=14  Score=39.41  Aligned_cols=27  Identities=19%  Similarity=0.345  Sum_probs=17.1

Q ss_pred             HHHHHHHHHHHHhheeeeccc-CCCCcc
Q psy12980        475 VVIVIMLLCAALIACLLYRRR-HTGKMS  501 (609)
Q Consensus       475 VIIa~mllia~iIaC~cyRrK-RkgK~~  501 (609)
                      +||+.+++|+||+.--.+|+| ||||++
T Consensus         9 livig~i~i~~il~~~~~r~r~~~gk~~   36 (293)
T PRK00269          9 LIVIGIIVIAGILFDGWRRMRGGKGKLK   36 (293)
T ss_pred             HHHHHHHHHHHHHHHHHHHHhccccccc
Confidence            345555667777665555554 688885


No 73 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=41.48  E-value=24  Score=33.54  Aligned_cols=16  Identities=19%  Similarity=0.262  Sum_probs=7.0

Q ss_pred             hheeeecccCCCCccc
Q psy12980        487 IACLLYRRRHTGKMSV  502 (609)
Q Consensus       487 IaC~cyRrKRkgK~~i  502 (609)
                      ++.+.||+-||.|+-+
T Consensus       118 ~~~~~yr~~r~~~~~~  133 (139)
T PHA03099        118 CLLSVYRFTRRTKLPL  133 (139)
T ss_pred             HHHhhheeeecccCch
Confidence            3333444444445544


No 74 
>PF02038 ATP1G1_PLM_MAT8:  ATP1G1/PLM/MAT8 family;  InterPro: IPR000272  The FXYD protein family contains at least seven members in mammals []. Two other family members that are not obvious orthologs of any identified mammalian FXYD protein exist in zebrafish. All these proteins share a signature sequence of six conserved amino acids comprising the FXYD motif in the NH2-terminus, and two glycines and one serine residue in the transmembrane domain. FXYD proteins are widely distributed in mammalian tissues with prominent expression in tissues that perform fluid and solute transport or that are electrically excitable.   Initial functional characterisation suggested that FXYD proteins act as channels or as modulators of ion channels however studies have revealed that most FXYD proteins have another specific function and act as tissue-specific regulatory subunits of the Na,K-ATPase. Each of these auxiliary subunits produces a distinct functional effect on the transport characteristics of the Na,K-ATPase that is adjusted to the specific functional demands of the tissue in which the FXYD protein is expressed. FXYD proteins appear to preferentially associate with Na,K-ATPase alpha1-beta isozymes, and affect their function in a way that render them operationally complementary or supplementary to coexisting isozymes.; GO: 0005216 ion channel activity, 0006811 ion transport, 0016020 membrane; PDB: 2JO1_A 2JP3_A 2ZXE_G 3A3Y_G 3N23_E 3B8E_H 3KDP_G 3N2F_E.
Probab=41.34  E-value=11  Score=30.14  Aligned_cols=25  Identities=8%  Similarity=0.046  Sum_probs=19.3

Q ss_pred             ehHHHHHHHHHHHHHHhheeeeccc
Q psy12980        471 VAPLVVIVIMLLCAALIACLLYRRR  495 (609)
Q Consensus       471 ViPaVIIa~mllia~iIaC~cyRrK  495 (609)
                      -+-++|+|++|+++||+...+.|-|
T Consensus        15 rigGLi~A~vlfi~Gi~iils~kck   39 (50)
T PF02038_consen   15 RIGGLIFAGVLFILGILIILSGKCK   39 (50)
T ss_dssp             HHHHHHHHHHHHHHHHHHHCTTHHH
T ss_pred             hccchHHHHHHHHHHHHHHHcCccc
Confidence            4678999999999999777655544


No 75 
>PF07204 Orthoreo_P10:  Orthoreovirus membrane fusion protein p10;  InterPro: IPR009854 This family consists of several Orthoreovirus membrane fusion protein p10 sequences. p10 is thought to be a multifunctional protein that plays a key role in virus-host interaction [].
Probab=40.94  E-value=12  Score=33.58  Aligned_cols=7  Identities=43%  Similarity=1.146  Sum_probs=4.0

Q ss_pred             ccccCCC
Q psy12980        432 TGICEGL  438 (609)
Q Consensus       432 ~G~C~~~  438 (609)
                      .|+|++.
T Consensus         6 ~g~c~g~   12 (98)
T PF07204_consen    6 SGSCNGA   12 (98)
T ss_pred             CCCcccH
Confidence            4666654


No 76 
>PF09394 Inhibitor_I42:  Chagasin family peptidase inhibitor I42;  InterPro: IPR018990 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties.   Chagasin reversible inhibitor of papain-like cysteine proteases []. Chagasin has a beta-barrel structure, which is a unique variant of the immunoglobulin fold with homology to human CD8alpha [, ].; PDB: 2NQD_A 2NNR_A 2H7W_B 3E1Z_A 3CBK_B 3CBJ_B 2OUL_B 2FO8_A 2WGN_B 2C34_A ....
Probab=40.54  E-value=59  Score=27.71  Aligned_cols=74  Identities=11%  Similarity=0.224  Sum_probs=38.7

Q ss_pred             eeeeeCCeEEEEcCCCccccCCCCCeeeEEEEEeccCCCCCCCCccceeccCCceEEeecCCCCceeEEEEEEEEcCCCC
Q psy12980        223 LEAISGELLVYHVPEDTFFDHEDGGTRKLKLHLMTMDRTTIPPTHWLQFDAKNQEFYGIPQLTDLGRREYQLVCKDSNGL  302 (609)
Q Consensus       223 ~~at~G~~f~f~IP~dTF~D~eDGdT~~L~lsl~~~Dg~~l~~~sWL~Fd~~t~tf~G~P~~~dvG~~~f~LtATDs~G~  302 (609)
                      +++..|+.|...||+|.    ..|.  .-.++. ..++-.+....++..+...+      +-+.-|.+.|.+.|...+  
T Consensus         1 I~v~~g~~~~I~L~~np----stGY--~W~~~~-~~~~l~l~~~~~~~~~~~~~------~vG~~g~~~f~f~a~~~G--   65 (92)
T PF09394_consen    1 ITVKVGDTFEIELPENP----STGY--SWSLSS-DSDGLQLVSEEYIPDNSPSG------LVGAPGTRTFTFKALKPG--   65 (92)
T ss_dssp             -EEETTSEEEEEEEEBC----CGTB--EEEECT-STTTEEEEEEEEEESSTSST------SSTSSEEEEEEEEESSSE--
T ss_pred             CeecCCCEEEEEECCCC----CCCe--EEEEec-CCCeEEEcCCcEEeCCCCcC------CCCCCcEEEEEEEEecCe--
Confidence            46788999999999873    2233  122211 12221122223444333322      333348899999998873  


Q ss_pred             eeEeeEEEEEE
Q psy12980        303 MANDGLEVIVK  313 (609)
Q Consensus       303 sasd~F~V~V~  313 (609)
                        ...+.+.-.
T Consensus        66 --~~~i~~~y~   74 (92)
T PF09394_consen   66 --TTTIKFEYR   74 (92)
T ss_dssp             --EEEEEEEEE
T ss_pred             --eEEEEEEEE
Confidence              455555543


No 77 
>TIGR00864 PCC polycystin cation channel protein. Note: this model has been restricted to the amino half because for technical reasons.
Probab=39.93  E-value=2.7e+02  Score=38.42  Aligned_cols=61  Identities=18%  Similarity=0.160  Sum_probs=39.7

Q ss_pred             ccccCCCCCeeeEEEEEeccCCCCCCCCccceeccCCceEEeecCCCCceeEEEEEEEEcCCCCeeEeeEEEEEEC
Q psy12980        239 TFFDHEDGGTRKLKLHLMTMDRTTIPPTHWLQFDAKNQEFYGIPQLTDLGRREYQLVCKDSNGLMANDGLEVIVKS  314 (609)
Q Consensus       239 TF~D~eDGdT~~L~lsl~~~Dg~~l~~~sWL~Fd~~t~tf~G~P~~~dvG~~~f~LtATDs~G~sasd~F~V~V~~  314 (609)
                      +|.|..+|+  ..++.=.-.||..      .    ..+.+  .......|.|.+.|+|+|..|.+ +.++.|.|..
T Consensus      1449 ~Ftd~StGt--~~Sy~WdFGDGst------s----t~qnp--tHtY~~~GtYtVtLTvtN~~Gss-t~T~~VtV~~ 1509 (2740)
T TIGR00864      1449 LFSAFGRAR--NASYLWDFGDGGL------L----EGPEI--LHAFNSPGDFNIRLAAANEVGKN-EATLNVAVKA 1509 (2740)
T ss_pred             EEeecCCCC--ceEEEEECCCCCc------c----Cccce--EEEcCCCceEEEEEEEECCCCce-EEEEEEEEec
Confidence            477776776  6666666677742      1    12222  22333449999999999999985 5577787754


No 78 
>PF07523 Big_3:  Bacterial Ig-like domain (group 3);  InterPro: IPR011080 This entry represents bacterial domains with an Ig-like fold. These domains are found in a variety of bacterial surface proteins.; PDB: 2L7Y_A 2KPN_A.
Probab=39.39  E-value=81  Score=25.65  Aligned_cols=50  Identities=18%  Similarity=0.201  Sum_probs=33.2

Q ss_pred             eEEEEEeccCCCCCCCCccceeccCCceEEeecCCCCceeEEEEEEEEcCCCCeeEeeEEEEE
Q psy12980        250 KLKLHLMTMDRTTIPPTHWLQFDAKNQEFYGIPQLTDLGRREYQLVCKDSNGLMANDGLEVIV  312 (609)
Q Consensus       250 ~L~lsl~~~Dg~~l~~~sWL~Fd~~t~tf~G~P~~~dvG~~~f~LtATDs~G~sasd~F~V~V  312 (609)
                      .|.++++..||..++        ...-+++|.......|.|.+.++..+     .+.+|.|.|
T Consensus        18 ~~~v~at~~dG~~~~--------~~~~~vs~~~d~~~~G~y~Vt~~y~~-----~t~t~~VtV   67 (67)
T PF07523_consen   18 GLFVTATYSDGTSLP--------LSDVTVSGTVDTSKAGTYTVTYTYKG-----VTATFTVTV   67 (67)
T ss_dssp             CHEEEEEETTS-ES---------GCCSEEES---TTS-CCEEEEEEECT-----EEEEEEEEE
T ss_pred             CCEEEEEEcCCCEec--------eeeeEEEeeeecCCCceEEEEEEECC-----EEEEEEEEC
Confidence            356777888888632        23468888555666799999999866     778888876


No 79 
>PF08329 ChitinaseA_N:  Chitinase A, N-terminal domain;  InterPro: IPR013540 This domain is found in a number of bacterial chitinases and similar viral proteins. It is organised into a fibronectin III module domain-like fold, comprising only beta strands. Its function is not known, but it may be involved in interaction with the enzyme substrate, chitin [, ]. It is separated by a hinge region from the catalytic domain (IPR001223 from INTERPRO); this hinge region is probably mobile, allowing the N-terminal domain to have different relative positions in solution []. ; GO: 0004568 chitinase activity; PDB: 2WLY_A 1EDQ_A 2WM0_A 1X6N_A 1NH6_A 2WK2_A 1EHN_A 2WLZ_A 1EIB_A 1FFR_A ....
Probab=37.31  E-value=55  Score=31.20  Aligned_cols=49  Identities=16%  Similarity=0.232  Sum_probs=27.4

Q ss_pred             ceeEEEEEEEEcCCCCeeEeeEEEEEECCCCCCcceEEEEEecCCCCCcC
Q psy12980        287 LGRREYQLVCKDSNGLMANDGLEVIVKSPGPAYYSASFSLTLARPSSTFA  336 (609)
Q Consensus       287 vG~~~f~LtATDs~G~sasd~F~V~V~~~p~~~~~vef~L~l~~~~~~ft  336 (609)
                      -|+|+|++.+-|..|-+.+++.+|.|... ...|+--+.+.+....+.|.
T Consensus        82 gG~y~~~VeLCN~~GCS~S~~~~V~VaDT-DGsHl~pL~~~~~~nN~~y~  130 (133)
T PF08329_consen   82 GGRYQMQVELCNADGCSTSAPVEVVVADT-DGSHLAPLPYNWDENNKPYT  130 (133)
T ss_dssp             -EEEEEEEEEEETTEEEE---EEEEEE-T-TSTTS--------TTS--BE
T ss_pred             CCEEEEEEEEECCCCcccCCCEEEEEeCC-CccccccccCcccccCCCcc
Confidence            49999999999999999999999999863 23444445555555545554


No 80 
>PF15471 TMEM171:  Transmembrane protein family 171
Probab=36.04  E-value=1.2e+02  Score=32.30  Aligned_cols=27  Identities=22%  Similarity=0.294  Sum_probs=17.7

Q ss_pred             CccEEEEeehHHHHHHHHHHHHHHhhee
Q psy12980        463 SVEYLITFVAPLVVIVIMLLCAALIACL  490 (609)
Q Consensus       463 ~~~~Llv~ViPaVIIa~mllia~iIaC~  490 (609)
                      -|+.|- .-|-+-+|+++.|...+||++
T Consensus       156 ~CGFLs-lQImGPlIVl~GLCFFVVAHv  182 (319)
T PF15471_consen  156 ICGFLS-LQIMGPLIVLVGLCFFVVAHV  182 (319)
T ss_pred             ccceee-hhhhhhHHHHHhhhhhheeee
Confidence            344443 455566677777777888888


No 81 
>PF15298 AJAP1_PANP_C:  AJAP1/PANP C-terminus
Probab=33.77  E-value=63  Score=32.80  Aligned_cols=54  Identities=17%  Similarity=0.200  Sum_probs=30.5

Q ss_pred             cEEEEeehHHHHHHHHHHHHHHhhe-eeec----ccCCCCcccCCCcccccc---cccCCCC
Q psy12980        465 EYLITFVAPLVVIVIMLLCAALIAC-LLYR----RRHTGKMSVGNGVDSGIL---IRSKGIP  518 (609)
Q Consensus       465 ~~Llv~ViPaVIIa~mllia~iIaC-~cyR----rKRkgK~~i~d~deE~~t---F~~KGiP  518 (609)
                      -.+|++|-.-+|+++..||.+||.= ||-|    ||-++--++..|||.++.   |.-.++|
T Consensus        98 vh~~iTITvSlImViaAliTtlvlK~C~~~s~~~r~~s~qr~~~qqeeS~Q~Ltd~~p~~~p  159 (205)
T PF15298_consen   98 VHQIITITVSLIMVIAALITTLVLKNCCAQSQNRRRNSHQRKINQQEESCQNLTDFTPARVP  159 (205)
T ss_pred             ceEEEEEeeehhHHHHHhhhhhhhhhhhhhhcccCCCccccccccchhhccccccCCcccCc
Confidence            3566666545555555556666543 4433    333555567777777775   6655555


No 82 
>PF05337 CSF-1:  Macrophage colony stimulating factor-1 (CSF-1);  InterPro: IPR008001 Colony stimulating factor 1 (CSF-1) is a homodimeric polypeptide growth factor whose primary function is to regulate the survival, proliferation, differentiation, and function of cells of the mononuclear phagocytic lineage. This lineage includes mononuclear phagocytic precursors, blood monocytes, tissue macrophages, osteoclasts, and microglia of the brain, all of which possess cell surface receptors for CSF-1. The protein has also been linked with male fertility [] and mutations in the Csf-1 gene have been found to cause osteopetrosis and failure of tooth eruption [].; GO: 0005125 cytokine activity, 0008083 growth factor activity, 0016021 integral to membrane; PDB: 3EJJ_A.
Probab=33.72  E-value=14  Score=39.16  Aligned_cols=27  Identities=26%  Similarity=0.581  Sum_probs=0.0

Q ss_pred             eehHHHHHHHHHHHHHHhheeeecccCCCC
Q psy12980        470 FVAPLVVIVIMLLCAALIACLLYRRRHTGK  499 (609)
Q Consensus       470 ~ViPaVIIa~mllia~iIaC~cyRrKRkgK  499 (609)
                      +++..-||.++|.++|   .++|||||+..
T Consensus       229 ~lLVPSiILVLLaVGG---LLfYr~rrRs~  255 (285)
T PF05337_consen  229 YLLVPSIILVLLAVGG---LLFYRRRRRSH  255 (285)
T ss_dssp             ------------------------------
T ss_pred             cccccchhhhhhhccc---eeeeccccccc
Confidence            3333444444444444   46788776543


No 83 
>PF13753 SWM_repeat:  Putative flagellar system-associated repeat
Probab=33.57  E-value=5.9e+02  Score=26.81  Aligned_cols=207  Identities=14%  Similarity=0.151  Sum_probs=96.1

Q ss_pred             ccceeeeeeeeecccccCCCCCCCccccccCCCCCcccC-cCccce------eecCccEEEEcCCCCcccCCCCCeeeEE
Q psy12980         80 GRLMTVKNITWRGIGHCEDSPPPPVVKVKLENQVPLIRN-PIDHLE------AISGELLVYHVPEDTFFDHEDGGTRKLK  152 (609)
Q Consensus        80 ~~~~~vk~~~~~~~g~~~~~~~~~i~~v~~~N~~P~v~~-pi~~~~------a~e~~~fs~~iP~~tF~D~d~Gdt~~lt  152 (609)
                      ...+.|.....+..|+=..-+    ..+.-.+.+|.|.- ++.+..      +..+..|+.++     .+.+.|++..++
T Consensus        11 d~~~~v~vt~tD~aGN~~~~t----~~~~vDt~~P~v~i~~~~~~~~~~~~~~~~~~t~s~tv-----s~~~~g~~v~v~   81 (317)
T PF13753_consen   11 DGTYTVSVTVTDAAGNTSTAT----QSITVDTTAPTVTITSIADDDIINGDEATNTVTFSGTV-----SGAEPGSTVTVT   81 (317)
T ss_pred             CCcEEEEEEEEeCCCCeeeee----EEEEEecCCCceeeecccCCCccccceeeeeeEEEEEe-----cCCCCCCEEEEE
Confidence            567888888999998876522    22333556885533 344433      33444455443     334556653332


Q ss_pred             E-------EEecccc-eecccCccCcCccc-cccccceeeEEEeeeecccccccCCCCCCceecccC-CCCceecCc-cc
Q psy12980        153 L-------HLMTMDR-YVISNTTTGQAKPV-LTRALGRLMTVKNITWRGIGHCEDSPPPPVVKVKLE-NQVPLIRNP-ID  221 (609)
Q Consensus       153 l-------sl~~~d~-~~~~~t~T~sgtp~-~~d~~~~~v~v~~~~~~~~~~~~~~~~~~tv~v~~~-N~aPvv~n~-i~  221 (609)
                      .       ++. ++| |..-      .++. .-..+...|+|+ ...+..|+.... .+..+++... +.+|.+.-. +.
T Consensus        82 ~~g~~~t~~~~-~~G~ws~t------~~~~~~l~~g~~ti~v~-~~tD~aGN~~t~-~s~~~~vDt~~~~~p~vti~~~~  152 (317)
T PF13753_consen   82 INGTTGTLTAD-ADGNWSVT------VTPSDDLPDGDYTITVT-TVTDAAGNTSTA-ASQTFTVDTTAPTAPTVTITGIS  152 (317)
T ss_pred             ECCEEEEEEEe-cCCcEEEe------eccccccccCcceeEEE-EEEccCCccccc-cccccccccccccccccceeccc
Confidence            2       111 122 2221      1121 112234344444 122233333322 1234445444 455655332 32


Q ss_pred             eeeeeeCC--eEEEEcCCCccccCCCCCeeeEEEEEeccCCCCCCCCccceeccCCc-eEEeec--CCCC-ceeEEEEEE
Q psy12980        222 HLEAISGE--LLVYHVPEDTFFDHEDGGTRKLKLHLMTMDRTTIPPTHWLQFDAKNQ-EFYGIP--QLTD-LGRREYQLV  295 (609)
Q Consensus       222 ~~~at~G~--~f~f~IP~dTF~D~eDGdT~~L~lsl~~~Dg~~l~~~sWL~Fd~~t~-tf~G~P--~~~d-vG~~~f~Lt  295 (609)
                      .=....+.  ...+.+-.. ..+.+.|+  .+++++   +|...   .+ ......+ ++...|  +... -|.|.+.+.
T Consensus       153 ~~~~~~~~~~~~t~t~sg~-v~~~~~~d--~v~vt~---~G~~~---~~-~~~~~g~~t~~~~~~~~~~~~d~~~~v~v~  222 (317)
T PF13753_consen  153 DDNIINGAESTVTVTFSGT-VTGFDAGD--TVTVTI---NGTTY---TT-TVGADGTWTVTVTPSDLAGLADGTYTVTVT  222 (317)
T ss_pred             CCceeeccceeeccccccc-ceeeeece--eEEEee---ccccc---ce-eecCCCcccccccccccccccCceEEEEEE
Confidence            32222222  344444433 35566666  344444   33321   11 1111111 222233  3222 279999999


Q ss_pred             EEcCCCCee-EeeEEEEEEC
Q psy12980        296 CKDSNGLMA-NDGLEVIVKS  314 (609)
Q Consensus       296 ATDs~G~sa-sd~F~V~V~~  314 (609)
                      ++|..|... +.+..+.|..
T Consensus       223 ~tD~AGN~~~t~s~~~tvDt  242 (317)
T PF13753_consen  223 VTDAAGNTAGTASQTFTVDT  242 (317)
T ss_pred             eeecccCcccccceeEEEec
Confidence            999999988 6766776654


No 84 
>PF00801 PKD:  PKD domain;  InterPro: IPR000601 The PKD (Polycystic Kidney Disease) domain was first identified in the Polycystic Kidney Disease protein, polycystin-1 (PDK1 gene), and contains an Ig-like fold consisting of a beta-sandwich of seven strands in two sheets with a Greek key topology, although some members have additional strands []. Polycystin-1 is a large cell-surface glycoprotein involved in adhesive protein-protein and protein-carbohydrate interactions; however it is not clear if the PKD domain mediates any of these interactions.  PKD domains are also found in other proteins, usually in the extracellular parts of proteins involved in interactions with other proteins. For example, domains with a PKD-type fold are found in archaeal surface layer proteins that protect the cell from extreme environments [], and in the human VPS10 domain-containing receptor SorCS2 [].; PDB: 1B4R_A 2KZW_A 2C4X_A 2C26_A 2Y72_B 3JQU_A 3JS7_B 1WGO_A 1L0Q_A.
Probab=33.39  E-value=2.5e+02  Score=22.49  Aligned_cols=18  Identities=17%  Similarity=0.187  Sum_probs=15.5

Q ss_pred             ceeEEEEEEEEcCCCCee
Q psy12980        287 LGRREYQLVCKDSNGLMA  304 (609)
Q Consensus       287 vG~~~f~LtATDs~G~sa  304 (609)
                      .|.|.++|+|+|..|...
T Consensus        51 ~G~y~V~ltv~n~~g~~t   68 (69)
T PF00801_consen   51 PGTYTVTLTVTNGVGSAT   68 (69)
T ss_dssp             SEEEEEEEEEEETTSEEE
T ss_pred             CeEEEEEEEEEECCCCcC
Confidence            499999999999988653


No 85 
>PF11770 GAPT:  GRB2-binding adapter (GAPT);  InterPro: IPR021082  This entry represents a family of transmembrane proteins which bind the growth factor receptor-bound protein 2 (GRB2) in B cells []. In contrast to other transmembrane adaptor proteins, GAPT, which this entry represents, is not phosphorylated upon BCR ligation. It associates with GRB2 constitutively through its proline-rich region []. 
Probab=33.10  E-value=38  Score=32.94  Aligned_cols=17  Identities=24%  Similarity=0.421  Sum_probs=11.4

Q ss_pred             HHHHHHHhheeeecccC
Q psy12980        480 MLLCAALIACLLYRRRH  496 (609)
Q Consensus       480 mllia~iIaC~cyRrKR  496 (609)
                      +||+++.|+|+.|+++|
T Consensus        20 ~lLl~cgiGcvwhwkhr   36 (158)
T PF11770_consen   20 LLLLLCGIGCVWHWKHR   36 (158)
T ss_pred             HHHHHHhcceEEEeecc
Confidence            34566668898886554


No 86 
>PF02009 Rifin_STEVOR:  Rifin/stevor family;  InterPro: IPR002858 Malaria is still a major cause of mortality in many areas of the world. Plasmodium falciparum causes the most severe human form of the disease and is responsible for most fatalities. Severe cases of malaria can occur when the parasite invades and then proliferates within red blood cell erythrocytes. The parasite produces many variant antigenic proteins, encoded by multigene families, which are present on the surface of the infected erythrocyte and play important roles in virulence. A crucial survival mechanism for the malaria parasite is its ability to evade the immune response by switching these variant surface antigens. The high virulence of P. falciparum relative to other malarial parasites is in large part due to the fact that in this organism many of these surface antigens mediate the binding of infected erythrocytes to the vascular endothelium (cytoadherence) and non-infected erythrocytes (rosetting). This can lead to the accumulation of infected cells in the vasculature of a variety of organs, blocking the blood flow and reducing the oxygen supply. Clinical symptoms of severe infection can include fever, progressive anaemia, multi-organ dysfunction and coma. For more information see []. Several multicopy gene families have been described in Plasmodium falciparum, including the stevor family of subtelomeric open reading frames and the rif interspersed repetitive elements. Both families contain three predicted transmembrane segments. It has been proposed that stevor and rif are members of a larger superfamily that code for variant surface antigens [].
Probab=32.22  E-value=11  Score=40.28  Aligned_cols=20  Identities=35%  Similarity=0.715  Sum_probs=8.4

Q ss_pred             HHHHHHHHHhheee-ecccCC
Q psy12980        478 VIMLLCAALIACLL-YRRRHT  497 (609)
Q Consensus       478 a~mllia~iIaC~c-yRrKRk  497 (609)
                      ++++||-.||-.|+ ||||||
T Consensus       266 liIVLIMvIIYLILRYRRKKK  286 (299)
T PF02009_consen  266 LIIVLIMVIIYLILRYRRKKK  286 (299)
T ss_pred             HHHHHHHHHHHHHHHHHHHhh
Confidence            33334444444433 444443


No 87 
>KOG1219|consensus
Probab=31.96  E-value=3.6e+02  Score=37.20  Aligned_cols=185  Identities=14%  Similarity=0.129  Sum_probs=109.9

Q ss_pred             cccCC-CCCcccCcCccceeecCccEEEEcCCCCcccCCCCCeeeEEEEEeccc--ceecccCccCcCc------ccccc
Q psy12980        107 VKLEN-QVPLIRNPIDHLEAISGELLVYHVPEDTFFDHEDGGTRKLKLHLMTMD--RYVISNTTTGQAK------PVLTR  177 (609)
Q Consensus       107 v~~~N-~~P~v~~pi~~~~a~e~~~fs~~iP~~tF~D~d~Gdt~~ltlsl~~~d--~~~~~~t~T~sgt------p~~~d  177 (609)
                      |..+| ++|+...|.=--+..|++.=...|=.--=.|+|...-.+|++.++.++  |.-.-+..||--|      .-..|
T Consensus      1048 ieDvNDn~Pq~s~pvy~asI~enSp~~vsivq~ea~D~Dsssn~kLmykI~sGnyq~FF~Id~~TG~iTt~r~LDRE~qd 1127 (4289)
T KOG1219|consen 1048 IEDVNDNVPQFSSPVYYASISENSPETVSIVQAEANDPDSSSNQKLMYKITSGNYQGFFQIDPETGLITTIRRLDREKQD 1127 (4289)
T ss_pred             EEecCCCCcccCCceEeeeeccCCCCceEEEEeccCCCCcccCcceEEEEccCCccceEEEccccceeeeehhhcccccc
Confidence            34455 478888888888888888776665444445888666778999999876  3333377777544      22334


Q ss_pred             ccceeeEEEeeeecccccccCCCCCCceecccCCC-CceecCccceeeeeeCCeEEEEcCCCc--------cccCCCCCe
Q psy12980        178 ALGRLMTVKNITWRGIGHCEDSPPPPVVKVKLENQ-VPLIRNPIDHLEAISGELLVYHVPEDT--------FFDHEDGGT  248 (609)
Q Consensus       178 ~~~~~v~v~~~~~~~~~~~~~~~~~~tv~v~~~N~-aPvv~n~i~~~~at~G~~f~f~IP~dT--------F~D~eDGdT  248 (609)
                      .--|+||+.+.--.    --.+...+.|.|..+|+ +|.....+-          .|.||+..        =.|.|.|. 
T Consensus      1128 EHiLeVTi~D~gep----~l~s~~rviV~IldvNdnsp~Flqk~~----------~~~v~~r~s~plyRl~a~d~DeG~- 1192 (4289)
T KOG1219|consen 1128 EHILEVTIQDNGEP----WLCSNQRVIVSILDVNDNSPRFLQKKT----------FLRVPERSSPPLYRLAAQDNDEGN- 1192 (4289)
T ss_pred             cceEEEEEecCCCC----ccccceEEEEEEeeccCCchhhhhhee----------EEEeeeccCCceeEEEEEecCCCc-
Confidence            55567776633211    11111123344445544 344433333          33444332        23566777 


Q ss_pred             eeEEEEEeccCCCCCCCCccceeccCCceEEeecCCCCceeEEEE-EEEEcCCCCeeEeeEEEEEE
Q psy12980        249 RKLKLHLMTMDRTTIPPTHWLQFDAKNQEFYGIPQLTDLGRREYQ-LVCKDSNGLMANDGLEVIVK  313 (609)
Q Consensus       249 ~~L~lsl~~~Dg~~l~~~sWL~Fd~~t~tf~G~P~~~dvG~~~f~-LtATDs~G~sasd~F~V~V~  313 (609)
                       +-+++....||.+.     ...|..+...|-.-... .|.|.+. +.|||.+--.++.++.|++.
T Consensus      1193 -narityniedgde~-----FsID~~t~vVsss~~~~-~~eydi~~Ikatd~g~pq~sa~trl~lt 1251 (4289)
T KOG1219|consen 1193 -NARITYNIEDGDEV-----FSIDIATGVVSSSTLDP-AGEYDILGIKATDRGAPQASAGTRLHLT 1251 (4289)
T ss_pred             -ceEEEEecccCceE-----EEEeccCceEEeeeccC-CcceeEeeEEEecCCCCcccceeEEEEE
Confidence             55555555666552     45677776666544332 3778777 89999988888888888774


No 88 
>PF15347 PAG:  Phosphoprotein associated with glycosphingolipid-enriched
Probab=31.95  E-value=70  Score=35.45  Aligned_cols=36  Identities=25%  Similarity=0.309  Sum_probs=18.8

Q ss_pred             EEeehHHHHHHHHHHHHHHhhee--eecccCCCCcccCC
Q psy12980        468 ITFVAPLVVIVIMLLCAALIACL--LYRRRHTGKMSVGN  504 (609)
Q Consensus       468 lv~ViPaVIIa~mllia~iIaC~--cyRrKRkgK~~i~d  504 (609)
                      ++.+.-+.-|+.++||..||..|  |.|.| |+|---||
T Consensus        15 ivlwgsLaav~~f~lis~LifLCsSC~reK-K~~~~~GD   52 (428)
T PF15347_consen   15 IVLWGSLAAVTTFLLISFLIFLCSSCDREK-KPKQQSGD   52 (428)
T ss_pred             EEeehHHHHHHHHHHHHHHHHHhhcccccc-cCCCCCCc
Confidence            34444444455556666665554  66655 55533333


No 89 
>PF10164 DUF2367:  Uncharacterized conserved protein (DUF2367);  InterPro: IPR019317  This is a highly conserved set of proteins which contains three pairs of cysteine residues within a length of 42 amino acids and is rich in proline residues towards the N terminus. It includes a membrane protein that has been found to be highly expressed in the mouse brain and consequently, several members have been assigned as brain protein i3 (Bri3). Their function is unknown.
Probab=31.57  E-value=33  Score=31.03  Aligned_cols=23  Identities=22%  Similarity=0.527  Sum_probs=14.5

Q ss_pred             HHHHHHHHHHHHHhheeeecccC
Q psy12980        474 LVVIVIMLLCAALIACLLYRRRH  496 (609)
Q Consensus       474 aVIIa~mllia~iIaC~cyRrKR  496 (609)
                      +++.|++++=.||+.|++.|+||
T Consensus        68 Gi~~AI~fFPiGilcc~~~r~~r   90 (98)
T PF10164_consen   68 GILCAIFFFPIGILCCLAMRERR   90 (98)
T ss_pred             HHHHHHHHHhhHHHHhhhcCccc
Confidence            55555555556777777777665


No 90 
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=31.13  E-value=3.4e+02  Score=25.90  Aligned_cols=31  Identities=16%  Similarity=0.161  Sum_probs=26.2

Q ss_pred             CceeEEEEEEEEcCCCCeeEeeEEEEEECCC
Q psy12980        286 DLGRREYQLVCKDSNGLMANDGLEVIVKSPG  316 (609)
Q Consensus       286 dvG~~~f~LtATDs~G~sasd~F~V~V~~~p  316 (609)
                      +.-...|.++|+|..|.+.+..|.|+|..+-
T Consensus        83 d~l~l~~~v~a~D~DGD~s~~~l~VtI~DD~  113 (137)
T TIGR03660        83 DELTLNFPIIATDFDGDTSSITLPVTIVDDV  113 (137)
T ss_pred             ceEEEeeeEEEEeCCCCccccEEEEEEECCC
Confidence            3456888999999999999999999998753


No 91 
>PTZ00382 Variant-specific surface protein (VSP); Provisional
Probab=30.42  E-value=40  Score=30.19  Aligned_cols=11  Identities=18%  Similarity=0.174  Sum_probs=4.2

Q ss_pred             HHhheeeeccc
Q psy12980        485 ALIACLLYRRR  495 (609)
Q Consensus       485 ~iIaC~cyRrK  495 (609)
                      .++-.+++|||
T Consensus        85 ~l~w~f~~r~k   95 (96)
T PTZ00382         85 FLCWWFVCRGK   95 (96)
T ss_pred             HHhheeEEeec
Confidence            33333344433


No 92 
>PF04234 CopC:  CopC domain;  InterPro: IPR007348 CopC is a bacterial blue copper protein that binds 1 atom of copper per protein molecule. Along with CopA, CopC mediates copper resistance by sequestration of copper in the periplasm [].; GO: 0005507 copper ion binding, 0046688 response to copper ion, 0042597 periplasmic space; PDB: 1IX2_B 1LYQ_A 2C9P_C 2C9R_A 2C9Q_A 1M42_A 1OT4_A 1NM4_A.
Probab=29.95  E-value=3.2e+02  Score=23.94  Aligned_cols=62  Identities=10%  Similarity=0.012  Sum_probs=41.1

Q ss_pred             EEEEEeccCCCCCCCCccceeccCCceEEeecCC-CCceeEEEEEEEEcCCCCeeEeeEEEEEE
Q psy12980        251 LKLHLMTMDRTTIPPTHWLQFDAKNQEFYGIPQL-TDLGRREYQLVCKDSNGLMANDGLEVIVK  313 (609)
Q Consensus       251 L~lsl~~~Dg~~l~~~sWL~Fd~~t~tf~G~P~~-~dvG~~~f~LtATDs~G~sasd~F~V~V~  313 (609)
                      -++.++..+|..+.. .....+....++...... -.-|.|.+..++.-..|-..+..|...|+
T Consensus        35 s~v~v~~~~g~~v~~-~~~~~~~~~~~~~~~l~~~l~~G~YtV~wrvvs~DGH~~~G~~~F~V~   97 (97)
T PF04234_consen   35 SSVTVTDPDGKRVDL-GEPTVDGDGKTLTVPLPPPLPPGTYTVSWRVVSADGHPVSGSFSFTVK   97 (97)
T ss_dssp             -EEEEEEEEETTSCT-CEEEEEESTTEEEEEESS---SEEEEEEEEEEETTSCEEEEEEEEEE-
T ss_pred             cEEEEEcCCCceeec-CcceecCCceEEEEECCCCCCCceEEEEEEEEecCCCCcCCEEEEEEC
Confidence            455667777776542 345555544566655444 34599999999988999999988887774


No 93 
>cd07912 Tweety_N N-terminal domain of the protein encoded by the Drosophila tweety gene and related proteins, a family of chloride ion channels. The protein product of the Drosophila tweety (tty) gene is thought to form a trans-membrane protein with five membrane-spanning regions and a cytoplasmic C-terminus. This N-terminal domain contains the putative transmembrane spanning regions. Tweety has been suggested as a candidate for a large conductance chloride channel, both in vertebrate and insect cells. Three human homologs have been identified and designated TTYH1-3. TTYH2 has been associated with the progression of cancer, and Drosophila melanogaster tweety has been assumed to play a role in development. TTYH2, and TTYH3 bind to and are ubiquinated by Nedd4-2, a HECT type E3 ubiquitin ligase, which most likely plays a role in controlling the cellular levels of tweety family proteins.
Probab=29.65  E-value=41  Score=37.78  Aligned_cols=11  Identities=18%  Similarity=0.335  Sum_probs=5.3

Q ss_pred             EEeehHHHHHH
Q psy12980        468 ITFVAPLVVIV  478 (609)
Q Consensus       468 lv~ViPaVIIa  478 (609)
                      ++-.++++..+
T Consensus        41 ~~a~iaavwli   51 (418)
T cd07912          41 ILASIPAACLI   51 (418)
T ss_pred             HHHHHHHHHHH
Confidence            33446666443


No 94 
>PF05915 DUF872:  Eukaryotic protein of unknown function (DUF872);  InterPro: IPR008590 This entry represents several uncharacterised eukaryotic transmembrane proteins. The function of this currently unknown.
Probab=29.11  E-value=25  Score=32.61  Aligned_cols=25  Identities=20%  Similarity=0.340  Sum_probs=18.1

Q ss_pred             hHHHHHHHHHHHHHH----hheeeecccC
Q psy12980        472 APLVVIVIMLLCAAL----IACLLYRRRH  496 (609)
Q Consensus       472 iPaVIIa~mllia~i----IaC~cyRrKR  496 (609)
                      +++.|+.+|++|-|+    |++||+|++|
T Consensus        77 ~~llilG~L~fIPG~Y~~~i~y~a~rg~~  105 (115)
T PF05915_consen   77 WALLILGILCFIPGFYHTRIAYYAWRGYK  105 (115)
T ss_pred             chHHHHHHHHHhccHHHHHHHHHHHcCCC
Confidence            467777878887765    7777777764


No 95 
>PF11857 DUF3377:  Domain of unknown function (DUF3377);  InterPro: IPR021805  This domain is functionally uncharacterised and found at the C terminus of peptidases belonging to MEROPS peptidase family M10A, membrane-type matrix metallopeptidases (clan MA). ; GO: 0004222 metalloendopeptidase activity
Probab=28.88  E-value=88  Score=27.04  Aligned_cols=31  Identities=26%  Similarity=0.482  Sum_probs=21.1

Q ss_pred             EEEeehHHHHHHHHHHHHHHhheeeecccCCCC
Q psy12980        467 LITFVAPLVVIVIMLLCAALIACLLYRRRHTGK  499 (609)
Q Consensus       467 Llv~ViPaVIIa~mllia~iIaC~cyRrKRkgK  499 (609)
                      ....|||.+++.|+|.+...|.-  .|||-..|
T Consensus        31 avaVviPl~L~LCiLvl~yai~~--fkrkGtPr   61 (74)
T PF11857_consen   31 AVAVVIPLVLLLCILVLIYAIFQ--FKRKGTPR   61 (74)
T ss_pred             EEEEeHHHHHHHHHHHHHHHhhe--eeecCCCc
Confidence            45578999999988887766655  25553333


No 96 
>PF05545 FixQ:  Cbb3-type cytochrome oxidase component FixQ;  InterPro: IPR008621 This family consists of several Cbb3-type cytochrome oxidase components (FixQ/CcoQ). FixQ is found in nitrogen fixing bacteria. Since nitrogen fixation is an energy-consuming process, effective symbioses depend on operation of a respiratory chain with a high affinity for O2, closely coupled to ATP production. This requirement is fulfilled by a special three-subunit terminal oxidase (cytochrome terminal oxidase cbb3), which was first identified in Bradyrhizobium japonicum as the product of the fixNOQP operon [].
Probab=28.65  E-value=36  Score=26.44  Aligned_cols=18  Identities=17%  Similarity=0.456  Sum_probs=9.2

Q ss_pred             HHHHHhheeeecccCCCCc
Q psy12980        482 LCAALIACLLYRRRHTGKM  500 (609)
Q Consensus       482 lia~iIaC~cyRrKRkgK~  500 (609)
                      ++.|++.. .||+|||.++
T Consensus        21 ~F~gi~~w-~~~~~~k~~~   38 (49)
T PF05545_consen   21 FFIGIVIW-AYRPRNKKRF   38 (49)
T ss_pred             HHHHHHHH-HHcccchhhH
Confidence            33344433 3466666666


No 97 
>PF13860 FlgD_ig:  FlgD Ig-like domain; PDB: 3C12_A 3OSV_A.
Probab=28.38  E-value=1.2e+02  Score=25.64  Aligned_cols=12  Identities=17%  Similarity=0.517  Sum_probs=9.9

Q ss_pred             eeEEEEEEEEcC
Q psy12980        288 GRREYQLVCKDS  299 (609)
Q Consensus       288 G~~~f~LtATDs  299 (609)
                      |.|.|.|.|+|.
T Consensus        69 G~Y~~~v~a~~~   80 (81)
T PF13860_consen   69 GTYTFRVTATDG   80 (81)
T ss_dssp             EEEEEEEEEEET
T ss_pred             CCEEEEEEEEeC
Confidence            888888888875


No 98 
>PRK12633 flgD flagellar basal body rod modification protein; Provisional
Probab=28.20  E-value=2.1e+02  Score=29.57  Aligned_cols=25  Identities=16%  Similarity=0.252  Sum_probs=19.4

Q ss_pred             EeecCCCCceeEEEEEEEEcCCCCeeE
Q psy12980        279 YGIPQLTDLGRREYQLVCKDSNGLMAN  305 (609)
Q Consensus       279 ~G~P~~~dvG~~~f~LtATDs~G~sas  305 (609)
                      .|.+...  |.|.|.+.|+|..|....
T Consensus       166 ~G~~~~~--G~Y~~~V~a~~~~G~~~~  190 (230)
T PRK12633        166 GGQPLAD--GKYSITVSASDADAKPVK  190 (230)
T ss_pred             CCCcCCC--cceEEEEEEEeCCCcEEe
Confidence            4555544  999999999999987553


No 99 
>PTZ00364 dipeptidyl-peptidase I precursor; Provisional
Probab=27.91  E-value=48  Score=38.48  Aligned_cols=38  Identities=13%  Similarity=0.287  Sum_probs=20.6

Q ss_pred             CccEEEE-----eehHHHHHHHHHH----HHHHhheeeecccCCCCc
Q psy12980        463 SVEYLIT-----FVAPLVVIVIMLL----CAALIACLLYRRRHTGKM  500 (609)
Q Consensus       463 ~~~~Llv-----~ViPaVIIa~mll----ia~iIaC~cyRrKRkgK~  500 (609)
                      ..++.++     +|++|+++++..|    +.+++.|||-||||+.+.
T Consensus       468 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  514 (548)
T PTZ00364        468 PEEYFLVVRCSTIILMAILLLVAGLISISIIGLYIVIRIRKKRETLK  514 (548)
T ss_pred             CCceEEEEeehhhHHHHHHHHHHHHhhhhhheeeEEEEEEeecCcee
Confidence            3456565     5555555444433    344556677777655544


No 100
>COG4736 CcoQ Cbb3-type cytochrome oxidase, subunit 3 [Posttranslational modification, protein turnover, chaperones]
Probab=27.60  E-value=30  Score=28.64  Aligned_cols=28  Identities=14%  Similarity=0.369  Sum_probs=16.4

Q ss_pred             eeecccCCCCcccCCCcccccccccCCCCEEcccccccc
Q psy12980        490 LLYRRRHTGKMSVGNGVDSGILIRSKGIPVIFQDELEER  528 (609)
Q Consensus       490 ~cyRrKRkgK~~i~d~deE~~tF~~KGiPvIF~dElde~  528 (609)
                      ..||++||+.+   |  ++      +.-++.+.||.+++
T Consensus        28 ~ayr~~~K~~~---d--~a------a~~~l~l~Dd~q~~   55 (60)
T COG4736          28 FAYRPGKKGEF---D--EA------ARGILPLNDDAQDA   55 (60)
T ss_pred             HHhcccchhhH---H--HH------hccCCCCCcchhhh
Confidence            45677777777   1  21      23355677777765


No 101
>PF06679 DUF1180:  Protein of unknown function (DUF1180);  InterPro: IPR009565 This entry consists of several hypothetical eukaryotic proteins thought to be membrane proteins. Their function is unknown.
Probab=26.62  E-value=47  Score=32.71  Aligned_cols=16  Identities=13%  Similarity=0.030  Sum_probs=8.4

Q ss_pred             CCcccCCCcccccccc
Q psy12980        498 GKMSVGNGVDSGILIR  513 (609)
Q Consensus       498 gK~~i~d~deE~~tF~  513 (609)
                      -||-+++.-+|+....
T Consensus       127 RkYgvl~~~~~~~Em~  142 (163)
T PF06679_consen  127 RKYGVLTTRAENVEMA  142 (163)
T ss_pred             eeecccCCCcccceec
Confidence            4565655444555544


No 102
>PF15415 DUF4622:  Protein of unknown function (DUF4622)
Probab=26.27  E-value=1.5e+02  Score=31.28  Aligned_cols=147  Identities=18%  Similarity=0.204  Sum_probs=79.0

Q ss_pred             cccCCCCceecCccceeeeeeCCeEEEEcCCCccccCCCCCeeeEEEEEeccCCCCCCCCccce--eccCCc--eEEeec
Q psy12980        207 VKLENQVPLIRNPIDHLEAISGELLVYHVPEDTFFDHEDGGTRKLKLHLMTMDRTTIPPTHWLQ--FDAKNQ--EFYGIP  282 (609)
Q Consensus       207 v~~~N~aPvv~n~i~~~~at~G~~f~f~IP~dTF~D~eDGdT~~L~lsl~~~Dg~~l~~~sWL~--Fd~~t~--tf~G~P  282 (609)
                      |...|-.-+..|+|-+|.|    +|.|+|=..      +|   ..+|+.. ..|-+|   +=||  ||..++  --.|.-
T Consensus       148 i~~~nvq~i~LnPii~qta----Rm~Fti~~g------~g---v~tLe~l-~aGiEI---sGiQ~p~d~t~~~nw~~gD~  210 (310)
T PF15415_consen  148 INENNVQYIYLNPIINQTA----RMQFTIHKG------EG---VHTLEML-QAGIEI---SGIQQPFDHTSQFNWSNGDN  210 (310)
T ss_pred             eccCCeeEEEeChhhhhee----EEEEEEecC------Cc---eeeehhh-hcccee---ecccccCCCccccccccccc
Confidence            4445666788899988887    566666432      34   3555554 234443   3445  555443  233544


Q ss_pred             CCCCceeEEEEEEE------EcCCCCeeEeeEEEEEECCCCCCcceEEEEEecCCCCCcCchHHH-HHHHHHHHHHhCCC
Q psy12980        283 QLTDLGRREYQLVC------KDSNGLMANDGLEVIVKSPGPAYYSASFSLTLARPSSTFAQPAAQ-RAFLEKLAGLFGDR  355 (609)
Q Consensus       283 ~~~dvG~~~f~LtA------TDs~G~sasd~F~V~V~~~p~~~~~vef~L~l~~~~~~ft~~~~q-~~LVekLA~llGD~  355 (609)
                      +..-+|.-.=.++.      ||+.|.-.-++   -|-|..-+.+.|...|.+.+.     ..+-| ..+|..+-.+-|-+
T Consensus       211 lp~k~GDk~~~v~ip~~~~~t~a~g~L~g~t---GvLPTd~rStpv~Vl~N~~VN-----gvPTQy~~li~g~~ll~ghS  282 (310)
T PF15415_consen  211 LPMKMGDKSSWVRIPGSHFETNANGSLQGHT---GVLPTDCRSTPVSVLMNMKVN-----GVPTQYMMLINGKHLLHGHS  282 (310)
T ss_pred             cceecccccceEEEecceeeecCCCcEEeec---ccccccccCCceEEEEEEEec-----CCchheeEeeeeeEeccccc
Confidence            44445533222332      45555443332   244432334444444444332     22344 34555566666665


Q ss_pred             CCCcEEEEeeecCCCCCCCcEEEEEEecCC
Q psy12980        356 NASKIVNIEFEDNPQEPGKSTVVRWANRTL  385 (609)
Q Consensus       356 d~s~ItVrsI~~~~~~~~gSTvV~w~N~TL  385 (609)
                      - +.-..-+|+      +|-+++.|-|+|-
T Consensus       283 Y-Nyt~Tvki~------~GitV~tWQn~SW  305 (310)
T PF15415_consen  283 Y-NYTWTVKIS------NGITVVTWQNQSW  305 (310)
T ss_pred             c-ceEEEEEec------CCEEEEEeccccc
Confidence            4 555666787      8999999999774


No 103
>PF09680 Tiny_TM_bacill:  Protein of unknown function (Tiny_TM_bacill);  InterPro: IPR010070 This entry represents a family of hypothetical proteins, half of which are 40 residues or less in length. Members are found only in spore-forming species. A Gly-rich variable region is followed by a strongly conserved, highly hydrophobic region, predicted to form a transmembrane helix, ending with an invariant Gly. The consensus for this stretch is FALLVVFILLIIV.
Probab=25.31  E-value=66  Score=22.06  Aligned_cols=16  Identities=25%  Similarity=0.607  Sum_probs=7.2

Q ss_pred             HHHHHHHHHHHHHhheee
Q psy12980        474 LVVIVIMLLCAALIACLL  491 (609)
Q Consensus       474 aVIIa~mllia~iIaC~c  491 (609)
                      ++++.++||+  |+.|.|
T Consensus         8 livVLFILLi--IvG~s~   23 (24)
T PF09680_consen    8 LIVVLFILLI--IVGASC   23 (24)
T ss_pred             hHHHHHHHHH--Hhccee
Confidence            3444444444  445544


No 104
>PF09315 DUF1973:  Domain of unknown function (DUF1973);  InterPro: IPR015394 These functionally uncharacterised domains are found in various eukaryotic calcium-dependent chloride channels. 
Probab=25.11  E-value=4.7e+02  Score=25.97  Aligned_cols=76  Identities=17%  Similarity=0.263  Sum_probs=46.4

Q ss_pred             eEEEEEeccCCCCCCCCccceeccCCce-EEeecCCCCceeEEEEEEEEcCCCCeeEeeEEEEEE--CCCCCCcceEEEE
Q psy12980        250 KLKLHLMTMDRTTIPPTHWLQFDAKNQE-FYGIPQLTDLGRREYQLVCKDSNGLMANDGLEVIVK--SPGPAYYSASFSL  326 (609)
Q Consensus       250 ~L~lsl~~~Dg~~l~~~sWL~Fd~~t~t-f~G~P~~~dvG~~~f~LtATDs~G~sasd~F~V~V~--~~p~~~~~vef~L  326 (609)
                      ...+.|+.-+|.....   ...|...+. -.-+|.....|.|.|.|..+.+.    .+.+.|.|.  +.....+.+.+..
T Consensus        42 ~p~i~L~~P~G~~~~~---~~~d~~~~~~~i~ipg~ae~G~W~y~i~~~~~~----~q~v~vtVtS~~~~~~~~Pitv~~  114 (179)
T PF09315_consen   42 PPSITLTDPSGTVYTT---FTTDSNSKTARIQIPGTAEVGTWTYSITNTSSS----SQTVTVTVTSRASSPTVPPITVTA  114 (179)
T ss_pred             CceEEEECCCCCEEee---eEEcccccEEEEECCCCcccccEEEEEecCCCC----cceEEEEEEeeECCCCCCCEEEEE
Confidence            4566677777777653   335544432 12256666789999999765533    344566554  3334566777777


Q ss_pred             EecCCC
Q psy12980        327 TLARPS  332 (609)
Q Consensus       327 ~l~~~~  332 (609)
                      .+..+.
T Consensus       115 ~~~~~~  120 (179)
T PF09315_consen  115 WLSQSS  120 (179)
T ss_pred             EECCCC
Confidence            776643


No 105
>PF12768 Rax2:  Cortical protein marker for cell polarity
Probab=24.92  E-value=88  Score=33.26  Aligned_cols=22  Identities=27%  Similarity=0.442  Sum_probs=10.4

Q ss_pred             HHHHHHHHHHHHHhheeeeccc
Q psy12980        474 LVVIVIMLLCAALIACLLYRRR  495 (609)
Q Consensus       474 aVIIa~mllia~iIaC~cyRrK  495 (609)
                      |+=++++|.+++||....+|||
T Consensus       237 ALG~v~ll~l~Gii~~~~~r~~  258 (281)
T PF12768_consen  237 ALGTVFLLVLIGIILAYIRRRR  258 (281)
T ss_pred             HHHHHHHHHHHHHHHHHHHhhh
Confidence            3444444445555555444443


No 106
>PRK06439 NADH dehydrogenase subunit J; Provisional
Probab=24.59  E-value=66  Score=27.67  Aligned_cols=29  Identities=10%  Similarity=0.311  Sum_probs=23.0

Q ss_pred             EEEeehHHHHHHHHHHHHHHhheeeeccc
Q psy12980        467 LITFVAPLVVIVIMLLCAALIACLLYRRR  495 (609)
Q Consensus       467 Llv~ViPaVIIa~mllia~iIaC~cyRrK  495 (609)
                      +-.|++|+-++.++|++|-+-|..+-||+
T Consensus        43 ft~YllpFEl~SiLLLvAmIGAIvLarke   71 (72)
T PRK06439         43 FNTYIVPFELLSVIIVGAVIGVMYVAWGE   71 (72)
T ss_pred             HhhHHHHHHHHHHHHHHHHHHHheeeecC
Confidence            34578999999999999988887776653


No 107
>TIGR03370 PEPCTERM_Roseo variant PEP-CTERM putative exosortase signal, Roseobacter type. A probable protein export sorting signal, PEP-CTERM, was described by Haft, et al. (PubMed:16930487). It is predicted to interact with a putative transpeptidase we designate exosortase. Most examples of this signal are recognized by model TIGR02595, but some unusual clades require different models. This model describes a variant with conserved motif VPLPA, rather than VPEP. This variant is found prominently in two members of the Rhodobacterales, namely Jannaschia sp. CCS1 and Roseobacter denitrificans OCh 114. One interesting member protein has a full-length duplication and therefore two copies of this putative sorting domain.
Probab=24.57  E-value=33  Score=23.92  Aligned_cols=18  Identities=39%  Similarity=0.360  Sum_probs=10.8

Q ss_pred             HHHHHHHhheeeecccCC
Q psy12980        480 MLLCAALIACLLYRRRHT  497 (609)
Q Consensus       480 mllia~iIaC~cyRrKRk  497 (609)
                      .|+++++-++-..|||||
T Consensus         8 ~LLl~gLggl~~~rRRrk   25 (26)
T TIGR03370         8 LLLLAGLGGLGAMRRRRR   25 (26)
T ss_pred             HHHHHHHHHHHHHHHhhc
Confidence            356666666655566654


No 108
>COG5513 Predicted secreted protein [Function unknown]
Probab=24.48  E-value=5.9e+02  Score=23.81  Aligned_cols=90  Identities=17%  Similarity=0.216  Sum_probs=47.0

Q ss_pred             eeeeCCeEEEEcCCCccccCCCCCeeeEEEEEeccCCCCCCCCccceeccCCceEEeecCCCCceeEEEEEEEEcCCCCe
Q psy12980        224 EAISGELLVYHVPEDTFFDHEDGGTRKLKLHLMTMDRTTIPPTHWLQFDAKNQEFYGIPQLTDLGRREYQLVCKDSNGLM  303 (609)
Q Consensus       224 ~at~G~~f~f~IP~dTF~D~eDGdT~~L~lsl~~~Dg~~l~~~sWL~Fd~~t~tf~G~P~~~dvG~~~f~LtATDs~G~s  303 (609)
                      ....|+.|.+++|+|.    -.|.  .-  .|....|..|-...--+.-+..++ -|.|     |.|...+.|.+++-+.
T Consensus        22 kL~~ge~f~L~l~~NP----tTGy--sW--eL~~s~gl~l~~~d~~~~~~~e~~-~G~~-----G~~~W~i~av~~G~q~   87 (113)
T COG5513          22 KLNNGEKFTLQLPENP----TTGY--SW--ELNTSEGLVLLREDYTVDPAEEGT-VGSP-----GIHTWEIRAVKQGTQQ   87 (113)
T ss_pred             EEecCCceEEEecCCC----ccce--EE--EEecCCccEEEeccCccCCCcccc-cCCC-----ceEEEEEEeecCCceE
Confidence            5667889999999884    3344  22  222233433221111111111222 3444     8899999999987665


Q ss_pred             eEeeEEEEEECCCCCCcceEEEEEec
Q psy12980        304 ANDGLEVIVKSPGPAYYSASFSLTLA  329 (609)
Q Consensus       304 asd~F~V~V~~~p~~~~~vef~L~l~  329 (609)
                      +.-.....=.+.  .-+...|++.++
T Consensus        88 v~g~Y~R~we~~--~g~e~~f~l~Ve  111 (113)
T COG5513          88 VEGTYRRLWEKE--AGPEINFSLIVE  111 (113)
T ss_pred             EEEEEEEeeccc--cCcceeEEEEEE
Confidence            554444333222  234445666554


No 109
>PLN03150 hypothetical protein; Provisional
Probab=24.45  E-value=29  Score=40.58  Aligned_cols=7  Identities=14%  Similarity=0.657  Sum_probs=3.9

Q ss_pred             EEEEEEC
Q psy12980        308 LEVIVKS  314 (609)
Q Consensus       308 F~V~V~~  314 (609)
                      +.|.+.+
T Consensus       340 l~isl~p  346 (623)
T PLN03150        340 LTIVLQP  346 (623)
T ss_pred             EEEEEee
Confidence            5555555


No 110
>PF08496 Peptidase_S49_N:  Peptidase family S49 N-terminal;  InterPro: IPR013703 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain is found to the N terminus of bacterial signal peptidases that belong to the MEROPS peptidase family S49 (protease IV family, clan SK) (see also IPR002142 from INTERPRO) [, ]. ; GO: 0004252 serine-type endopeptidase activity, 0005886 plasma membrane
Probab=24.28  E-value=59  Score=31.65  Aligned_cols=32  Identities=19%  Similarity=0.403  Sum_probs=18.9

Q ss_pred             hHHHHHHHHHHHHHHhheeeecccCCCCcccC
Q psy12980        472 APLVVIVIMLLCAALIACLLYRRRHTGKMSVG  503 (609)
Q Consensus       472 iPaVIIa~mllia~iIaC~cyRrKRkgK~~i~  503 (609)
                      +.-+|+++++++++|++..-+.|+++|++.+-
T Consensus        14 ~vTvVvaI~~vv~~I~~~~~k~k~~~G~L~Vt   45 (155)
T PF08496_consen   14 IVTVVVAILAVVGLIVAAAQKKKKKKGELEVT   45 (155)
T ss_pred             HHHHHHHHHHHHHHHHHHhccCCCCCCeEEEe
Confidence            33455555555666666665555568888543


No 111
>TIGR00868 hCaCC calcium-activated chloride channel protein 1. distributions. found a row in 1A13.INFO that was not parsed out
Probab=23.96  E-value=6.2e+02  Score=31.37  Aligned_cols=102  Identities=18%  Similarity=0.258  Sum_probs=58.3

Q ss_pred             eeeeeCCeEEEEcCCCccccCC-----CCCeeeEEEEEeccCCCCCCCCccceeccCCceEE-eecCCCCceeEEEEEEE
Q psy12980        223 LEAISGELLVYHVPEDTFFDHE-----DGGTRKLKLHLMTMDRTTIPPTHWLQFDAKNQEFY-GIPQLTDLGRREYQLVC  296 (609)
Q Consensus       223 ~~at~G~~f~f~IP~dTF~D~e-----DGdT~~L~lsl~~~Dg~~l~~~sWL~Fd~~t~tf~-G~P~~~dvG~~~f~LtA  296 (609)
                      +.+..|+.+.-+|+-|.+...+     -....+..+.|++.+|....  +- +.|...+.-+ =+|....+|+|+|.|++
T Consensus       495 ~~l~~~~~~~~tv~iDstvG~~T~F~vtw~~~~p~i~l~dP~G~~~~--~~-~~d~~~~~a~l~ipg~a~~G~W~y~l~~  571 (863)
T TIGR00868       495 LTLQNNAWMNGTVPVDSTVGKDTFFLITWEFLKPEIFLQDPSGKSTS--DF-LVDKLNKMAYLQIPGTAKVGTWTYSLQA  571 (863)
T ss_pred             eEecCCcEEeeeEEecCCCCCCcEEEEEcCCCCceEEEECCCCCccc--cc-eEeecceEEEEEcCCCCccceeEEEEec
Confidence            4555555565555544322111     11112566778888887632  32 3555444322 35766678999999998


Q ss_pred             EcCCCCeeEeeEEEEEE--CCCCCCcceEEEEEecCCC
Q psy12980        297 KDSNGLMANDGLEVIVK--SPGPAYYSASFSLTLARPS  332 (609)
Q Consensus       297 TDs~G~sasd~F~V~V~--~~p~~~~~vef~L~l~~~~  332 (609)
                      + ..    .+.++|.|.  ......+.+.....|+...
T Consensus       572 ~-~~----~q~i~vtVtsra~s~~~~pi~~~a~~~~~~  604 (863)
T TIGR00868       572 S-AN----PQTLTLTVTSRARSPTLPPVTVTAKMNKDT  604 (863)
T ss_pred             c-CC----CceEEEEEEeccCCCCCCCEEEEEEEcccc
Confidence            8 22    344666665  3334567788888877644


No 112
>PF06365 CD34_antigen:  CD34/Podocalyxin family;  InterPro: IPR013836 This family consists of several mammalian CD34 antigen proteins. The CD34 antigen is a human leukocyte membrane protein expressed specifically by lymphohematopoietic progenitor cells. CD34 is a phosphoprotein. Activation of protein kinase C (PKC) has been found to enhance CD34 phosphorylation [, ]. This family contains several eukaryotic podocalyxin proteins. Podocalyxin is a major membrane protein of the glomerular epithelium and is thought to be involved in maintenance of the architecture of the foot processes and filtration slits characteristic of this unique epithelium by virtue of its high negative charge. Podocalyxin functions as an anti-adhesin that maintains an open filtration pathway between neighbouring foot processes in the glomerular epithelium by charge repulsion [].
Probab=23.94  E-value=73  Score=32.46  Aligned_cols=26  Identities=38%  Similarity=0.382  Sum_probs=11.0

Q ss_pred             EeehHHHHHH-HHHHHHHHhh-eeeeccc
Q psy12980        469 TFVAPLVVIV-IMLLCAALIA-CLLYRRR  495 (609)
Q Consensus       469 v~ViPaVIIa-~mllia~iIa-C~cyRrK  495 (609)
                      ++|. .|+++ ++||++.+++ +||+.||
T Consensus       101 ~lI~-lv~~g~~lLla~~~~~~Y~~~~Rr  128 (202)
T PF06365_consen  101 TLIA-LVTSGSFLLLAILLGAGYCCHQRR  128 (202)
T ss_pred             EEEe-hHHhhHHHHHHHHHHHHHHhhhhc
Confidence            3443 34444 3444444444 4444444


No 113
>PRK12634 flgD flagellar basal body rod modification protein; Reviewed
Probab=23.77  E-value=1.5e+02  Score=30.36  Aligned_cols=26  Identities=15%  Similarity=0.139  Sum_probs=19.5

Q ss_pred             eecCCCCceeEEEEEEEEcCCCCeeEee
Q psy12980        280 GIPQLTDLGRREYQLVCKDSNGLMANDG  307 (609)
Q Consensus       280 G~P~~~dvG~~~f~LtATDs~G~sasd~  307 (609)
                      |.+...  |.|.|.+.|+|..|....-.
T Consensus       160 G~~~~~--G~Yt~~v~a~~~~G~~~~~~  185 (221)
T PRK12634        160 GNRMAA--GKYGVTATQTDTAGSKSKLS  185 (221)
T ss_pred             CCcCCC--eeeEEEEEEEeCCCcEEeee
Confidence            444444  99999999999999855443


No 114
>PF05140 ResB:  ResB-like family ;  InterPro: IPR007816 This domain is found in a number of known and suspected cytochrome c biogenesis proteins, including ResB []. Mutations in ResB indicate that they are essential for growth []. ResB is predicted to be a transmembrane protein.
Probab=23.71  E-value=42  Score=37.73  Aligned_cols=33  Identities=24%  Similarity=0.326  Sum_probs=23.1

Q ss_pred             ccEEEEee---hHHHHHHHHHHHHHHhheeeecccC
Q psy12980        464 VEYLITFV---APLVVIVIMLLCAALIACLLYRRRH  496 (609)
Q Consensus       464 ~~~Llv~V---iPaVIIa~mllia~iIaC~cyRrKR  496 (609)
                      +..|-+..   +|.|.+.|+++++|++.+++.|+||
T Consensus       396 ~tGLqV~~DPG~~~V~~G~~llv~Gl~~sfy~~hRR  431 (464)
T PF05140_consen  396 YTGLQVKKDPGVPIVYLGFILLVVGLFLSFYIRHRR  431 (464)
T ss_pred             EEEEEEEeCCCchHHHHHHHHHHHHHHHhheecceE
Confidence            44455554   6788888999999997775555554


No 115
>PF00974 Rhabdo_glycop:  Rhabdovirus spike glycoprotein;  InterPro: IPR001903 Different families of ssRNA negative-strand viruses contain glycoproteins responsible for forming spikes on the surface of the virion. The glycoprotein spike is made up of a trimer of glycoproteins. These proteins are frequently abbreviated to G protein. Channel formed by glycoprotein spike is thought to function in a similar manner to Influenza virus M2 protein channel, thus allowing a signal to pass across the viral membrane to signal for viral uncoating [, ].; GO: 0019031 viral envelope; PDB: 2CMZ_C 2J6J_A 3EGD_D.
Probab=23.71  E-value=26  Score=40.06  Aligned_cols=9  Identities=0%  Similarity=0.021  Sum_probs=0.0

Q ss_pred             CCccEEEEe
Q psy12980        462 ASVEYLITF  470 (609)
Q Consensus       462 a~~~~Llv~  470 (609)
                      .+|+.++..
T Consensus       449 ~~W~~~~~~  457 (501)
T PF00974_consen  449 SNWGEWLSI  457 (501)
T ss_dssp             ---------
T ss_pred             ccHHHHHHH
Confidence            456554443


No 116
>PF08372 PRT_C:  Plant phosphoribosyltransferase C-terminal;  InterPro: IPR013583 This domain is found at the C terminus of phosphoribosyltransferases and phosphoribosyltransferase-like proteins. It contains putative transmembrane regions. It often appears together with calcium-ion dependent C2 domains (IPR000008 from INTERPRO). 
Probab=23.45  E-value=51  Score=32.22  Aligned_cols=24  Identities=25%  Similarity=0.444  Sum_probs=17.3

Q ss_pred             EcccccccccCCCCCCCCCeeccC
Q psy12980        520 IFQDELEERLEVSSGTKSPIILKE  543 (609)
Q Consensus       520 IF~dElde~~~~~~~~~~P~Ilke  543 (609)
                      +-.||+||+....|+++.+=+|++
T Consensus        36 ~~~deldEEfD~~ps~~~~~~lr~   59 (156)
T PF08372_consen   36 AHPDELDEEFDTFPSSRPPDSLRM   59 (156)
T ss_pred             CCcchhhhhhcccccccccHHHHH
Confidence            346899887666677777777765


No 117
>PHA03289 envelope glycoprotein I; Provisional
Probab=23.13  E-value=2.6e+02  Score=30.58  Aligned_cols=29  Identities=21%  Similarity=0.230  Sum_probs=23.3

Q ss_pred             cCCCCceeEEEEEEEEcCCCCeeEeeEEEEEE
Q psy12980        282 PQLTDLGRREYQLVCKDSNGLMANDGLEVIVK  313 (609)
Q Consensus       282 P~~~dvG~~~f~LtATDs~G~sasd~F~V~V~  313 (609)
                      |.-.|.|.|.++|..   +|+...|-|-+.|.
T Consensus       129 Pr~~DSGiYyLRV~L---dga~~sDvF~lsv~  157 (352)
T PHA03289        129 PRMSDTGIYYIRVSL---AGRNKSDIFRTSVF  157 (352)
T ss_pred             CCcccCeeEEEEEEe---CCCccCceEEEEEE
Confidence            667788999999998   45678888888764


No 118
>KOG3521|consensus
Probab=22.80  E-value=1.5e+02  Score=35.39  Aligned_cols=38  Identities=37%  Similarity=0.494  Sum_probs=26.7

Q ss_pred             CCcccccccccCCCCEEcccccccccCCCCCCCCC--eeccCCCCCCCCC
Q psy12980        504 NGVDSGILIRSKGIPVIFQDELEERLEVSSGTKSP--IILKEEKPPLPPP  551 (609)
Q Consensus       504 d~deE~~tF~~KGiPvIF~dElde~~~~~~~~~~P--~IlkeEkpplppp  551 (609)
                      +++++-.+|.++|.++        .  +-.-+.+|  +-|+|+.|||.++
T Consensus       768 n~~~~~~~~~~a~th~--------s--p~s~~rSpg~~~lse~~p~l~~~  807 (846)
T KOG3521|consen  768 NNDPSGETLTSAGTHI--------S--PISPNRSPGTLSLSEEAPPLALE  807 (846)
T ss_pred             ccccccchhcccCCCC--------C--CCCcccCccchhhhhhcccccCC
Confidence            3356666888888664        1  34446677  8899999999765


No 119
>PF01601 Corona_S2:  Coronavirus S2 glycoprotein;  InterPro: IPR002552 The type I glycoprotein S of Coronavirus, trimers of which constitute the typical viral spikes, is assembled into virions through noncovalent interactions with the M protein. The spike glycoprotein is translated as a large polypeptide that is subsequently cleaved to S1 IPR002551 from INTERPRO and S2 []. Both chimeric S proteins appeared to cause cell fusion when expressed individually, suggesting that they were biologically fully active []. The spike is a type I membrane glycoprotein that possesses a conserved transmembrane anchor and an unusual cysteine-rich (cys) domain that bridges the putative junction of the anchor and the cytoplasmic tail [].; GO: 0006944 cellular membrane fusion, 0046813 virion attachment, binding of host cell surface receptor, 0016021 integral to membrane, 0019031 viral envelope; PDB: 2BEQ_B 2FXP_A 1ZVB_A 1WNC_D 1ZV8_H 1ZV7_B 1WYY_B 1ZVA_A 2BEZ_F 1WDG_A ....
Probab=22.35  E-value=29  Score=40.40  Aligned_cols=24  Identities=21%  Similarity=0.137  Sum_probs=0.0

Q ss_pred             ccceeecCccEEEEcCCCC-cccCC
Q psy12980        121 DHLEAISGELLVYHVPEDT-FFDHE  144 (609)
Q Consensus       121 ~~~~a~e~~~fs~~iP~~t-F~D~d  144 (609)
                      ..-+|.||-.|+-.+=+|- |.|.|
T Consensus       146 ~~RS~IEDLLF~KV~t~glGtvd~d  170 (610)
T PF01601_consen  146 SGRSAIEDLLFNKVVTSGLGTVDQD  170 (610)
T ss_dssp             -------------------------
T ss_pred             CcchHHHHhhhceEEecCCCcchHH
Confidence            3467777777777665543 33544


No 120
>KOG4221|consensus
Probab=22.10  E-value=64  Score=40.65  Aligned_cols=34  Identities=21%  Similarity=0.319  Sum_probs=16.1

Q ss_pred             CCCCccEEEEeehHHHHHHHHHHHHHHhheeeecc
Q psy12980        460 ANASVEYLITFVAPLVVIVIMLLCAALIACLLYRR  494 (609)
Q Consensus       460 ~na~~~~Llv~ViPaVIIa~mllia~iIaC~cyRr  494 (609)
                      .+..++-|+..++.++.++ .+++.-+|++||+||
T Consensus      1052 ~~~~~~~ll~i~l~~~~vi-~i~~~~~v~v~C~~r 1085 (1381)
T KOG4221|consen 1052 TNLVDSNLLDIILVAVAVI-LILVLLLVLVTCRRR 1085 (1381)
T ss_pred             ccccccceeeeeeehhhhH-HHHHHhhheeEEecc
Confidence            3444444443443333332 233444567777776


No 121
>PF05808 Podoplanin:  Podoplanin;  InterPro: IPR008783 This family consists of several mammalian podoplanin-like proteins which are thought to control specifically the unique shape of podocytes [].; GO: 0016021 integral to membrane; PDB: 3IET_X.
Probab=22.00  E-value=30  Score=34.00  Aligned_cols=35  Identities=20%  Similarity=0.375  Sum_probs=0.0

Q ss_pred             cEEEEeehHHHHHHHHHH---HHHHhheeeecccCCCCcc
Q psy12980        465 EYLITFVAPLVVIVIMLL---CAALIACLLYRRRHTGKMS  501 (609)
Q Consensus       465 ~~Llv~ViPaVIIa~mll---ia~iIaC~cyRrKRkgK~~  501 (609)
                      +=|-+--|.++||.+++.   |+|||+.+.  ||=+|||+
T Consensus       124 ~GL~T~tLVGIIVGVLlaIG~igGIIivvv--RKmSGRys  161 (162)
T PF05808_consen  124 DGLSTVTLVGIIVGVLLAIGFIGGIIIVVV--RKMSGRYS  161 (162)
T ss_dssp             ----------------------------------------
T ss_pred             CCcceeeeeeehhhHHHHHHHHhheeeEEe--ehhccccC
Confidence            334434444555555444   555555553  33357774


No 122
>PF12669 P12:  Virus attachment protein p12 family
Probab=21.68  E-value=64  Score=26.32  Aligned_cols=8  Identities=13%  Similarity=0.571  Sum_probs=4.0

Q ss_pred             eecccCCC
Q psy12980        491 LYRRRHTG  498 (609)
Q Consensus       491 cyRrKRkg  498 (609)
                      .+|++|+|
T Consensus        20 ~~k~~K~G   27 (58)
T PF12669_consen   20 FIKDKKKG   27 (58)
T ss_pred             HHHHhhcC
Confidence            34555555


No 123
>PF15102 TMEM154:  TMEM154 protein family
Probab=21.55  E-value=60  Score=31.43  Aligned_cols=40  Identities=15%  Similarity=0.266  Sum_probs=26.1

Q ss_pred             CCCccEEEEeehHHHHHHHHHHHHHHhheeeecccCCCCc
Q psy12980        461 NASVEYLITFVAPLVVIVIMLLCAALIACLLYRRRHTGKM  500 (609)
Q Consensus       461 na~~~~Llv~ViPaVIIa~mllia~iIaC~cyRrKRkgK~  500 (609)
                      +.+.+.+-..++.+|-.++++|+..+++|+.++.|||...
T Consensus        49 ~~~~~q~efiLmIlIP~VLLvlLLl~vV~lv~~~kRkr~K   88 (146)
T PF15102_consen   49 EEDSSQLEFILMILIPLVLLVLLLLSVVCLVIYYKRKRTK   88 (146)
T ss_pred             CCCCCCcceEEEEeHHHHHHHHHHHHHHHheeEEeecccC
Confidence            4566766643343333455556677789999999888775


No 124
>PF14967 FAM70:  FAM70 protein
Probab=21.46  E-value=2.2e+02  Score=30.80  Aligned_cols=41  Identities=15%  Similarity=0.071  Sum_probs=24.8

Q ss_pred             CCCCCCCCCCCCCCCc-cEEEEeehHHHHHHHHHHHHHHhhe
Q psy12980        449 RKPAYSGKNTPANASV-EYLITFVAPLVVIVIMLLCAALIAC  489 (609)
Q Consensus       449 ~p~~~~~~~~~~na~~-~~Llv~ViPaVIIa~mllia~iIaC  489 (609)
                      .+..|-+..+-.+|.| -.||+-.+...|+=++.|+.|||.-
T Consensus       175 ~~~~YyEf~gV~SCqDVvhLy~LLWastvLNilgLfLGIiTA  216 (327)
T PF14967_consen  175 YSGSYYEFVGVSSCQDVVHLYRLLWASTVLNILGLFLGIITA  216 (327)
T ss_pred             CCCcceEEeccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            3444434445455633 3366666778888778888777654


No 125
>PF10577 UPF0560:  Uncharacterised protein family UPF0560;  InterPro: IPR018890  This family of proteins has no known function. 
Probab=21.39  E-value=32  Score=41.37  Aligned_cols=30  Identities=10%  Similarity=0.249  Sum_probs=20.4

Q ss_pred             CCCCccEEEEeehHHHHHHHHHHHHHH-hhe
Q psy12980        460 ANASVEYLITFVAPLVVIVIMLLCAAL-IAC  489 (609)
Q Consensus       460 ~na~~~~Llv~ViPaVIIa~mllia~i-IaC  489 (609)
                      ....-.+||..|+-++++++++|++.| |.|
T Consensus       267 i~~YHT~fLl~ILG~~~livl~lL~vLl~yC  297 (807)
T PF10577_consen  267 ITTYHTVFLLAILGGTALIVLILLCVLLCYC  297 (807)
T ss_pred             chhHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence            344456888899887777777776665 556


No 126
>PF13750 Big_3_3:  Bacterial Ig-like domain (group 3)
Probab=20.54  E-value=2.1e+02  Score=27.74  Aligned_cols=36  Identities=11%  Similarity=0.104  Sum_probs=27.4

Q ss_pred             eeEEEEE-EEEcCCCCeeEeeEEEEEECCCCCCcceEE
Q psy12980        288 GRREYQL-VCKDSNGLMANDGLEVIVKSPGPAYYSASF  324 (609)
Q Consensus       288 G~~~f~L-tATDs~G~sasd~F~V~V~~~p~~~~~vef  324 (609)
                      |.|.+.+ +|+|..|...+..+...+.-+ .-.|.+.+
T Consensus        15 G~Y~l~~~~a~D~agN~~~~~~~~~~~iD-~T~Ptisi   51 (158)
T PF13750_consen   15 GSYTLTVVTATDAAGNTSTSTVSETFTID-NTPPTISI   51 (158)
T ss_pred             ccEEEEEEEEEecCCCEEEEEEeeEEEEc-CCCCEEEE
Confidence            9999999 899999999998887554443 34455555


Done!