Query         psy15726
Match_columns 171
No_of_seqs    169 out of 663
Neff          7.6 
Searched_HMMs 46136
Date          Fri Aug 16 19:50:57 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy15726.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/15726hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG0518|consensus              100.0 2.3E-28   5E-33  221.8  16.6  154   15-170   722-880 (1113)
  2 smart00557 IG_FLMN Filamin-typ 100.0 1.5E-27 3.3E-32  166.0  13.8   92   50-141     1-92  (93)
  3 KOG0518|consensus              100.0   2E-27 4.3E-32  215.8  17.6  149   15-171   635-787 (1113)
  4 PF00630 Filamin:  Filamin/ABP2  99.9 1.1E-23 2.5E-28  147.4  11.7   89   48-136     2-101 (101)
  5 smart00557 IG_FLMN Filamin-typ  98.6 4.2E-08 9.2E-13   67.8   4.3   31   16-46     62-92  (93)
  6 PF00630 Filamin:  Filamin/ABP2  98.2   1E-06 2.3E-11   61.0   2.7   26   16-41     76-101 (101)
  7 PRK14081 triple tyrosine motif  97.2   0.015 3.3E-07   53.2  14.1  147   16-169   344-508 (667)
  8 KOG1428|consensus               96.9   0.009 1.9E-07   58.5  10.5   46   98-144  2099-2144(3738)
  9 PRK14081 triple tyrosine motif  96.9    0.06 1.3E-06   49.4  15.3  148   16-169   442-603 (667)
 10 PF01835 A2M_N:  MG2 domain;  I  96.7    0.01 2.3E-07   40.8   7.1   64   63-126    11-86  (99)
 11 TIGR03503 conserved hypothetic  95.8    0.21 4.7E-06   42.9  11.8  110   16-141   178-302 (374)
 12 PF06312 Neurexophilin:  Neurex  95.8   0.041 8.9E-07   43.9   6.9   45   81-125    70-115 (219)
 13 PF13115 YtkA:  YtkA-like        93.7    0.64 1.4E-05   30.9   7.6   43   81-123    38-85  (86)
 14 COG2373 Large extracellular al  93.4    0.67 1.5E-05   46.8  10.2  106   63-168   405-522 (1621)
 15 PF02369 Big_1:  Bacterial Ig-l  93.3     1.1 2.4E-05   31.1   8.5   63   67-129    23-92  (100)
 16 PF13860 FlgD_ig:  FlgD Ig-like  92.5     1.6 3.5E-05   28.9   8.1   57   68-125    13-77  (81)
 17 TIGR00864 PCC polycystin catio  91.4      12 0.00026   40.0  16.1  144   10-164  2001-2153(2740)
 18 PF04234 CopC:  CopC domain;  I  89.7    0.27 5.8E-06   34.0   2.2   53   84-138    36-95  (97)
 19 PF13473 Cupredoxin_1:  Cupredo  89.4     2.8 6.1E-05   28.9   7.3   59   62-127    38-96  (104)
 20 PF11896 DUF3416:  Domain of un  89.1     4.1 8.8E-05   31.8   8.6   63   62-124    22-87  (187)
 21 TIGR00868 hCaCC calcium-activa  88.9      12 0.00027   35.8  13.1  108   18-129   559-684 (863)
 22 PF07705 CARDB:  CARDB;  InterP  87.8     3.7 8.1E-05   27.3   6.9   66   62-127    14-85  (101)
 23 smart00736 CADG Dystroglycan-t  87.7     3.9 8.5E-05   27.9   7.0   81   62-143     8-95  (97)
 24 PRK06655 flgD flagellar basal   87.0     5.3 0.00012   32.0   8.3   59   66-125   112-178 (225)
 25 PRK10301 hypothetical protein;  86.6     2.3 4.9E-05   30.9   5.5   25  114-139    96-123 (124)
 26 cd02858 Esterase_N_term Estera  85.4     8.7 0.00019   25.6   8.5   60   67-130     6-65  (85)
 27 COG2372 CopC Uncharacterized p  84.6     7.1 0.00015   28.6   7.2   72   67-139    45-124 (127)
 28 PF05404 TRAP-delta:  Transloco  83.5     5.8 0.00013   30.4   6.6   54   67-124    43-101 (167)
 29 KOG1692|consensus               83.3     8.5 0.00018   30.2   7.5   33   63-95     37-69  (201)
 30 PF01483 P_proprotein:  Proprot  83.0     6.5 0.00014   26.1   6.1   54   70-125     5-70  (87)
 31 COG1470 Predicted membrane pro  82.7      12 0.00026   33.3   9.0  120   19-139   347-482 (513)
 32 PRK09619 flgD flagellar basal   82.5     7.6 0.00017   30.9   7.3   70   52-124    97-170 (218)
 33 PRK12812 flgD flagellar basal   82.2      10 0.00022   31.1   8.0   58   67-125   128-193 (259)
 34 PF04151 PPC:  Bacterial pre-pe  82.2      10 0.00022   24.0   7.4   52   71-124    15-69  (70)
 35 TIGR03096 nitroso_cyanin nitro  81.8     7.3 0.00016   28.8   6.4   65   62-133    64-130 (135)
 36 PF13620 CarboxypepD_reg:  Carb  81.7       9 0.00019   24.6   6.4   46   81-129    15-60  (82)
 37 PF07523 Big_3:  Bacterial Ig-l  80.2     4.6 9.9E-05   25.6   4.4   44   82-130    18-61  (67)
 38 PF13584 BatD:  Oxygen toleranc  79.9      11 0.00024   33.1   8.2   62  106-167    74-151 (484)
 39 PRK12633 flgD flagellar basal   78.2      12 0.00027   29.9   7.3   66   68-134   117-193 (230)
 40 PF07495 Y_Y_Y:  Y_Y_Y domain;   77.9     3.7   8E-05   25.5   3.4   19  108-126    30-48  (66)
 41 KOG1428|consensus               77.8     8.3 0.00018   39.3   6.9   36   11-47   2107-2142(3738)
 42 PF00017 SH2:  SH2 domain;  Int  77.1     6.7 0.00015   25.2   4.6   27  102-128    19-45  (77)
 43 PF14545 DBB:  Dof, BCAP, and B  75.3      32 0.00068   25.7   8.7   75   62-139     7-88  (142)
 44 PF06016 Reovirus_L2:  Reovirus  74.4       4 8.7E-05   40.2   4.0  103   63-167  1035-1149(1289)
 45 PF13584 BatD:  Oxygen toleranc  73.5      63  0.0014   28.3  15.0   61   16-76     79-152 (484)
 46 PF14524 Wzt_C:  Wzt C-terminal  72.5      31 0.00066   24.3   7.6   64   62-125    30-107 (142)
 47 TIGR00864 PCC polycystin catio  72.2 1.3E+02  0.0029   32.7  14.1  139   17-167  1749-1891(2740)
 48 PRK12634 flgD flagellar basal   72.1      37  0.0008   27.1   8.5   55   81-135   122-187 (221)
 49 smart00252 SH2 Src homology 2   71.9     6.1 0.00013   25.8   3.4   28  100-127    18-45  (84)
 50 PF10670 DUF4198:  Domain of un  71.5      31 0.00068   26.3   7.9   62   64-127   148-213 (215)
 51 PF11589 DUF3244:  Domain of un  71.3      10 0.00022   26.4   4.6   53   81-133    48-102 (106)
 52 smart00634 BID_1 Bacterial Ig-  70.1      30 0.00065   23.2   8.5   28  100-127    57-84  (92)
 53 KOG4088|consensus               69.4      22 0.00047   26.6   6.0   53   68-124    44-101 (167)
 54 PF09134 Invasin_D3:  Invasin,   68.2      38 0.00083   23.7   6.9   70   67-138    21-95  (99)
 55 PRK13211 N-acetylglucosamine-b  67.9      92   0.002   27.9  11.2   86   52-139   312-404 (478)
 56 PF01105 EMP24_GP25L:  emp24/gp  67.5     1.8 3.8E-05   32.2   0.0   61   64-126    19-82  (183)
 57 PF09315 DUF1973:  Domain of un  67.4      37 0.00079   26.2   7.3   59   64-124    26-85  (179)
 58 KOG2996|consensus               67.2     5.1 0.00011   36.6   2.8   30   99-128   701-730 (865)
 59 PRK15095 FKBP-type peptidyl-pr  67.1      37 0.00081   25.4   7.2   79   37-118    32-132 (156)
 60 PRK12813 flgD flagellar basal   66.7      65  0.0014   25.8   9.0   57   66-125   110-173 (223)
 61 PF05751 FixH:  FixH;  InterPro  65.9      48   0.001   23.9  10.1   61   65-125    67-132 (146)
 62 TIGR03769 P_ac_wall_RPT actino  64.8     7.5 0.00016   22.6   2.3   24   15-38      6-29  (41)
 63 COG4856 Uncharacterized protei  63.7      71  0.0015   27.8   8.9   74    3-85     16-108 (403)
 64 KOG0544|consensus               63.7      40 0.00088   23.6   6.1   57  105-166    21-78  (108)
 65 cd00173 SH2 Src homology 2 dom  62.5      17 0.00037   23.9   4.2   28  100-127    17-44  (94)
 66 cd05771 IgC_Tapasin_R Tapasin-  61.9      56  0.0012   23.3  11.7   74   12-90      4-77  (139)
 67 PF14874 PapD-like:  Flagellar-  59.9      50  0.0011   22.1   9.8   74   63-139    16-97  (102)
 68 PF12988 DUF3872:  Domain of un  58.4      64  0.0014   23.9   6.8   82   33-126    25-120 (137)
 69 PF10648 Gmad2:  Immunoglobulin  57.5      50  0.0011   22.3   5.8   46   81-126    29-81  (88)
 70 PF00801 PKD:  PKD domain;  Int  57.4      27 0.00058   21.8   4.3   55   62-126     7-61  (69)
 71 PF03896 TRAP_alpha:  Transloco  54.7 1.1E+02  0.0024   25.4   8.5   81   61-141    93-191 (285)
 72 PF13754 Big_3_4:  Bacterial Ig  52.5      32  0.0007   20.8   3.8   26  100-125     7-32  (54)
 73 PF06832 BiPBP_C:  Penicillin-B  52.0      67  0.0015   21.2   6.8   20  105-124    59-78  (89)
 74 PRK05842 flgD flagellar basal   51.9   1E+02  0.0023   25.8   7.9   73   52-125   135-220 (295)
 75 PF12975 DUF3859:  Domain of un  51.2      91   0.002   22.5   6.9   70   62-131    37-119 (125)
 76 KOG1693|consensus               50.8      77  0.0017   25.1   6.5   51   69-123    47-98  (209)
 77 KOG4637|consensus               50.1      15 0.00033   31.8   2.7   34   95-128    35-68  (464)
 78 TIGR03503 conserved hypothetic  49.4 1.8E+02  0.0038   25.3  13.6   77   63-139   102-216 (374)
 79 PF07703 A2M_N_2:  Alpha-2-macr  48.9      92   0.002   21.9  10.2  102   62-167     9-118 (136)
 80 PF08308 PEGA:  PEGA domain;  I  46.7      71  0.0015   20.0   5.0   14  114-127    32-45  (71)
 81 PF10633 NPCBM_assoc:  NPCBM-as  46.4      78  0.0017   20.3   7.8   62   64-125     2-75  (78)
 82 PF09912 DUF2141:  Uncharacteri  46.0   1E+02  0.0022   21.6   6.6   28   99-126    35-62  (112)
 83 PF04571 Lipin_N:  lipin, N-ter  45.9      91   0.002   22.2   5.7   44   95-141    28-79  (110)
 84 PF09430 DUF2012:  Protein of u  45.9 1.1E+02  0.0023   21.8   6.4   37  100-140    25-61  (123)
 85 PF13954 PapC_N:  PapC N-termin  45.0      25 0.00055   25.8   3.0   20   17-36     25-44  (146)
 86 PF00116 COX2:  Cytochrome C ox  44.5 1.1E+02  0.0025   21.7   8.0   53   62-122    49-101 (120)
 87 PF12863 DUF3821:  Domain of un  43.8 1.6E+02  0.0035   23.3   7.7   32   62-93    101-142 (209)
 88 PF05738 Cna_B:  Cna protein B-  43.1      79  0.0017   19.5   7.8   61   85-147     6-69  (70)
 89 cd03866 M14_CPM Peptidase M14   41.0 1.1E+02  0.0023   26.5   6.5   60   65-127   254-350 (376)
 90 KOG1690|consensus               39.2      89  0.0019   24.8   5.2   46   81-127    65-110 (215)
 91 COG5489 Uncharacterized conser  39.0      66  0.0014   22.7   4.0   40   76-115    63-104 (107)
 92 PF09315 DUF1973:  Domain of un  37.2 1.9E+02  0.0041   22.2   7.1   85   18-104    74-166 (179)
 93 KOG3287|consensus               37.1 1.8E+02  0.0038   23.5   6.7   58   63-123    52-110 (236)
 94 PF09394 Inhibitor_I42:  Chagas  36.3      68  0.0015   21.1   3.8   22  105-126    53-74  (92)
 95 KOG1366|consensus               34.3 1.9E+02  0.0041   29.7   7.8   71   64-134   127-203 (1436)
 96 PF13753 SWM_repeat:  Putative   33.8 2.7E+02  0.0058   22.9   7.9   27  100-126   193-224 (317)
 97 PF03345 DDOST_48kD:  Oligosacc  33.7 1.8E+02  0.0038   25.7   6.8   61   63-126   279-354 (423)
 98 PF13750 Big_3_3:  Bacterial Ig  33.7   2E+02  0.0044   21.5  10.5  109   17-127    10-133 (158)
 99 PF13860 FlgD_ig:  FlgD Ig-like  33.4 1.4E+02   0.003   19.4   6.8   50   82-133    13-64  (81)
100 PRK10737 FKBP-type peptidyl-pr  33.4 1.3E+02  0.0029   23.5   5.5   56   62-118    51-128 (196)
101 KOG0393|consensus               33.3      45 0.00097   26.3   2.8   32    5-36     16-54  (198)
102 PF02922 CBM_48:  Carbohydrate-  32.8 1.3E+02  0.0029   19.1   8.9   59   67-127    11-74  (85)
103 PF03422 CBM_6:  Carbohydrate b  31.7 1.7E+02  0.0037   20.1   7.5   46   81-127    60-110 (125)
104 PF07563 DUF1541:  Protein of u  31.6   1E+02  0.0022   19.0   3.6   16  105-120    30-45  (53)
105 COG1622 CyoA Heme/copper-type   31.4 2.6E+02  0.0056   22.7   7.0   69   63-139   141-210 (247)
106 PF13715 DUF4480:  Domain of un  29.3 1.6E+02  0.0035   18.9   7.2   26  100-128    30-55  (88)
107 PF11763 DIPSY:  Cell-wall adhe  29.3 2.2E+02  0.0048   20.6   6.8   44   83-128    50-94  (123)
108 KOG1691|consensus               29.2 2.3E+02   0.005   22.6   6.1   42   81-125    60-101 (210)
109 PF04465 DUF499:  Protein of un  29.0   2E+02  0.0044   28.4   6.9   65   63-128   836-903 (1035)
110 PF00254 FKBP_C:  FKBP-type pep  28.3      12 0.00027   24.8  -1.0   39  131-169    31-69  (94)
111 MTH00047 COX2 cytochrome c oxi  26.7 3.1E+02  0.0067   21.3   8.5   71   63-142   120-192 (194)
112 PF09264 Sial-lect-inser:  Vibr  25.6   2E+02  0.0043   22.6   5.1  105   56-166    44-153 (198)
113 TIGR02148 Fibro_Slime fibro-sl  25.3      96  0.0021   21.3   3.0   23  117-139    64-86  (90)
114 PLN00115 pollen allergen group  25.3 2.6E+02  0.0057   20.1   5.7   64   64-139    21-88  (118)
115 cd07696 IgC_CH3 CH3 domain (th  24.8 2.2E+02  0.0048   19.2   4.9   72   59-139     7-82  (96)
116 PF07691 PA14:  PA14 domain;  I  24.5 1.5E+02  0.0032   20.8   4.2   21  107-127    50-70  (145)
117 PF12690 BsuPI:  Intracellular   24.4 1.7E+02  0.0038   19.3   4.1   17  106-122    63-81  (82)
118 PF04225 OapA:  Opacity-associa  24.1 1.8E+02  0.0039   19.3   4.2   46   58-109    38-83  (85)
119 PF14347 DUF4399:  Domain of un  24.1 1.4E+02   0.003   20.2   3.6   22   19-40     59-81  (87)
120 cd03858 M14_CP_N-E_like Carbox  23.8 3.1E+02  0.0068   23.3   6.6   43   82-127   299-351 (374)
121 PF08400 phage_tail_N:  Prophag  23.2 1.7E+02  0.0036   21.6   4.1   25  102-129    44-68  (134)
122 PF13895 Ig_2:  Immunoglobulin   22.8      81  0.0017   19.2   2.2   14   26-39     31-44  (80)
123 PF10980 DUF2787:  Protein of u  22.6 1.5E+02  0.0032   21.7   3.7   37    5-43     22-58  (128)
124 PF08770 SoxZ:  Sulphur oxidati  22.4 2.7E+02  0.0058   19.2   7.8   25   68-94     69-93  (100)
125 COG3354 FlaG Putative archaeal  22.2 3.3E+02  0.0072   20.5   5.5   44  103-146    66-116 (154)
126 cd03868 M14_CPD_I The first ca  22.1 3.2E+02  0.0069   23.3   6.3   61   64-127   254-349 (372)
127 PF05949 DUF881:  Bacterial pro  22.0      58  0.0013   24.2   1.6   21  123-144    81-102 (149)
128 PRK06655 flgD flagellar basal   21.7 4.2E+02   0.009   21.1   6.8   74   64-139    95-176 (225)
129 PF10651 DUF2479:  Domain of un  21.7 3.5E+02  0.0076   20.2   6.3   57   84-140    58-124 (170)
130 cd05767 IgC_MHC_II_alpha Class  21.6 2.6E+02  0.0057   18.8   8.2   43   66-109    15-62  (94)
131 PRK09937 stationary phase/star  21.5 2.3E+02  0.0051   18.4   4.2   37   55-91     34-70  (74)
132 cd06245 M14_CPD_III The third   21.0 3.6E+02  0.0077   23.1   6.4   43   83-128   289-340 (363)
133 KOG4278|consensus               21.0 2.1E+02  0.0045   27.3   5.1   42  103-147   172-213 (1157)
134 KOG3342|consensus               20.9 1.2E+02  0.0027   23.1   3.1   28    8-36     67-94  (180)
135 PHA03376 BARF1; Provisional     20.8 1.7E+02  0.0037   23.3   3.9   24   99-122    94-117 (221)
136 smart00720 calpain_III calpain  20.6 1.6E+02  0.0035   21.1   3.7   26  101-126   109-137 (143)
137 KOG3392|consensus               20.6 1.2E+02  0.0026   22.0   2.8   21   16-36     79-99  (147)
138 PHA03290 envelope glycoprotein  20.5 1.9E+02   0.004   24.8   4.4   35    8-43    128-162 (357)
139 PF10342 GPI-anchored:  Ser-Thr  20.5 2.5E+02  0.0054   18.1   8.1   65   63-127     9-80  (93)
140 PF08842 Mfa2:  Fimbrillin-A as  20.5 3.8E+02  0.0083   20.9   6.2   65   62-126     3-77  (283)
141 PF07965 Integrin_B_tail:  Inte  20.4 2.6E+02  0.0057   18.6   4.5   30   95-124    47-77  (87)
142 PF01067 Calpain_III:  Calpain   20.3 1.4E+02  0.0029   21.3   3.3   26  101-126   113-141 (147)
143 PF02792 Mago_nashi:  Mago nash  20.3 1.3E+02  0.0027   22.5   2.9   20   16-35     75-94  (143)
144 PF00927 Transglut_C:  Transglu  20.1 1.9E+02  0.0041   19.6   3.8   20  120-139     2-21  (107)

No 1  
>KOG0518|consensus
Probab=99.96  E-value=2.3e-28  Score=221.82  Aligned_cols=154  Identities=19%  Similarity=0.246  Sum_probs=141.0

Q ss_pred             ccCCCCceeEEEEEEECCEEcCCCCeEEEecCC-CCCCCcEEEcCCCcceEeCCeEEEEEEeCCCcCCceEEEEECCCCC
Q psy15726         15 APSWISYGLVQARTLREETSFRSCPMEVPVVDP-AVGREPSGSGLGLYQARSGIVNSFTLETCGVASSEFDVIVTSPQGA   93 (171)
Q Consensus        15 ~~~P~~~G~~~i~V~~~g~~I~gSPf~v~V~~~-~d~~~v~v~G~Gl~~~~vg~~~~F~V~~~~ag~~~l~v~i~~p~g~   93 (171)
                      +|.|+|+|+|.|+|+.+|+||++|||.+.|... .|+++++++|.|+.....-++++|.||++++|.+-|++.+++|+  
T Consensus       722 tF~P~e~GeH~I~Vk~~G~hVpgsPf~i~V~~~e~dAsk~~v~g~g~~~G~t~ep~~fivDtr~agyGgLsi~~~Gps--  799 (1113)
T KOG0518|consen  722 TFTPREVGEHKINVKVAGKHVPGSPFSIKVSESEIDASKVRVSGQGLKEGHTFEPAEFIVDTRKAGYGGLSISVQGPS--  799 (1113)
T ss_pred             EECCCcCcceEEEEEEcceECCCCCeEEEeccccccceeEEEecccccccccccchheEeccccCCCCceEEEEeCCc--
Confidence            489999999999999999999999999999876 57999999999998876669999999999999999999999999  


Q ss_pred             ccceEEEEcCCCEEEEEEEcCCCeeEEEEEEECCEEcCCCCeEEEecCCC----ceEEecCCcccEEcCCCEEEEEEccC
Q psy15726         94 AVPVRCYQQKFANLLAEFTPTTTGVYKIDVLQGARPVRGSPYLCQVYDAS----KVKIEHKGLSNIVVNDPISFKCKSTE  169 (171)
Q Consensus        94 ~~~~~v~~~~dg~y~v~y~P~~~G~~~i~V~~~g~~I~gSPF~v~V~d~s----kv~~~G~gl~~~~~g~~~~F~Vd~~~  169 (171)
                      .++..+.|+.||+++|+|+|+++|.|.|+|+|+++||+||||.+++.+++    -+..+..++.-+.+|..++|.+++.+
T Consensus       800 kvd~~~~d~~dGt~kV~ytPtepG~Y~I~i~Fad~~I~gSPftVkv~~~~~vvesi~~~~~~~~va~~g~~~~l~lk~~~  879 (1113)
T KOG0518|consen  800 KVDLNVEDREDGTCKVSYTPTEPGTYIINIKFADEHIKGSPFTVKVTGESRVVESITRDREAPSVARVGHTCSLDLKATE  879 (1113)
T ss_pred             ccccceeecCCCeEEEEEeCCCCceEEEEEEEcCccCCCCceEEEecCCeeEeeeeeecccccceecccceeeeeeecCC
Confidence            46788999999999999999999999999999999999999999999988    34456666667999999999998875


Q ss_pred             C
Q psy15726        170 E  170 (171)
Q Consensus       170 a  170 (171)
                      +
T Consensus       880 ~  880 (1113)
T KOG0518|consen  880 A  880 (1113)
T ss_pred             C
Confidence            4


No 2  
>smart00557 IG_FLMN Filamin-type immunoglobulin domains. These form a rod-like structure in the actin-binding cytoskeleton protein, filamin. The C-terminal repeats of filamin bind beta1-integrin (CD29).
Probab=99.95  E-value=1.5e-27  Score=165.98  Aligned_cols=92  Identities=32%  Similarity=0.496  Sum_probs=89.0

Q ss_pred             CCCcEEEcCCCcceEeCCeEEEEEEeCCCcCCceEEEEECCCCCccceEEEEcCCCEEEEEEEcCCCeeEEEEEEECCEE
Q psy15726         50 GREPSGSGLGLYQARSGIVNSFTLETCGVASSEFDVIVTSPQGAAVPVRCYQQKFANLLAEFTPTTTGVYKIDVLQGARP  129 (171)
Q Consensus        50 ~~~v~v~G~Gl~~~~vg~~~~F~V~~~~ag~~~l~v~i~~p~g~~~~~~v~~~~dg~y~v~y~P~~~G~~~i~V~~~g~~  129 (171)
                      +++|+++|+||+.+.+|++++|+|+++++|+++|+|.|.+|+|+.+++++.|++||+|.|+|+|.++|.|+|+|+|+|+|
T Consensus         1 p~~~~v~G~Gl~~~~vg~~~~f~v~~~d~G~~~~~v~i~~p~g~~~~~~v~d~~dGty~v~y~P~~~G~~~i~V~~~g~~   80 (93)
T smart00557        1 ASKVKASGPGLEKGVVGEPAEFTIDTRGAGGGELEVEVTGPSGKKVPVEVKDNGDGTYTVSYTPTEPGDYTVTVKFGGEH   80 (93)
T ss_pred             CCeEEEECCCcCceecCCCEEEEEEcCCCCCCcEEEEEECCCCCeeEeEEEeCCCCEEEEEEEeCCCEeEEEEEEECCEE
Confidence            47899999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             cCCCCeEEEecC
Q psy15726        130 VRGSPYLCQVYD  141 (171)
Q Consensus       130 I~gSPF~v~V~d  141 (171)
                      |+||||.++|.|
T Consensus        81 I~gSPF~v~V~~   92 (93)
T smart00557       81 IPGSPFTVKVGP   92 (93)
T ss_pred             CCCCCEEEEEeC
Confidence            999999999865


No 3  
>KOG0518|consensus
Probab=99.95  E-value=2e-27  Score=215.76  Aligned_cols=149  Identities=21%  Similarity=0.242  Sum_probs=135.6

Q ss_pred             ccCCCCceeEEEEEEECCEEcCCCCeEEEecCCCCCCCcEEEcCCCcceEeCCeEEEEEEeCCCcCCceEEEEECCCCCc
Q psy15726         15 APSWISYGLVQARTLREETSFRSCPMEVPVVDPAVGREPSGSGLGLYQARSGIVNSFTLETCGVASSEFDVIVTSPQGAA   94 (171)
Q Consensus        15 ~~~P~~~G~~~i~V~~~g~~I~gSPf~v~V~~~~d~~~v~v~G~Gl~~~~vg~~~~F~V~~~~ag~~~l~v~i~~p~g~~   94 (171)
                      +|.|..||.|+|.|+|+|+||++|||...|... +      ++.+ ....||...++.|.-+.++...++++|.+|+|++
T Consensus       635 sy~P~vpG~y~i~V~~~~k~vpgSPF~~~Vt~~-~------s~r~-s~~~Vgs~~ev~in~~~~~~s~ltaeI~~PsGn~  706 (1113)
T KOG0518|consen  635 SYTPTVPGVYKIEVKFGDKHVPGSPFVALVTGD-E------SIRT-SHLSVGSIHEVTINITEADLSVLTAEIVDPSGNP  706 (1113)
T ss_pred             ccCCCCCceEEEEEEECCeeCCCCCceEEEecc-c------cccc-ccceecceeeEEeeccccCCceeEEEEECCCCCc
Confidence            489999999999999999999999999998752 1      1111 2478999999999988888899999999999999


Q ss_pred             cceEEEEcCCCEEEEEEEcCCCeeEEEEEEECCEEcCCCCeEEEe----cCCCceEEecCCcccEEcCCCEEEEEEccCC
Q psy15726         95 VPVRCYQQKFANLLAEFTPTTTGVYKIDVLQGARPVRGSPYLCQV----YDASKVKIEHKGLSNIVVNDPISFKCKSTEE  170 (171)
Q Consensus        95 ~~~~v~~~~dg~y~v~y~P~~~G~~~i~V~~~g~~I~gSPF~v~V----~d~skv~~~G~gl~~~~~g~~~~F~Vd~~~a  170 (171)
                      .+|.++...+|.|.++|+|+++|+|+|+|+++|+||+||||.|.|    .|++|++++|.||..+..-||++|.||+|+|
T Consensus       707 ~~c~~r~l~~g~~~itF~P~e~GeH~I~Vk~~G~hVpgsPf~i~V~~~e~dAsk~~v~g~g~~~G~t~ep~~fivDtr~a  786 (1113)
T KOG0518|consen  707 EPCLVRRLPNGHDGITFTPREVGEHKINVKVAGKHVPGSPFSIKVSESEIDASKVRVSGQGLKEGHTFEPAEFIVDTRKA  786 (1113)
T ss_pred             cceeeEecCCCceeEEECCCcCcceEEEEEEcceECCCCCeEEEeccccccceeEEEecccccccccccchheEeccccC
Confidence            999999999999999999999999999999999999999999999    5799999999999998777999999999998


Q ss_pred             C
Q psy15726        171 P  171 (171)
Q Consensus       171 ~  171 (171)
                      +
T Consensus       787 g  787 (1113)
T KOG0518|consen  787 G  787 (1113)
T ss_pred             C
Confidence            5


No 4  
>PF00630 Filamin:  Filamin/ABP280 repeat;  InterPro: IPR017868 The many different actin cross-linking proteins share a common architecture, consisting of a globular actin-binding domain and an extended rod. Whereas their actin-binding domains consist of two calponin homology domains (see IPR001715 from INTERPRO), their rods fall into three families. The rod domain of the family including the Dictyostelium discoideum (Slime mould) gelation factor (ABP120) and human filamin (ABP280) is constructed from tandem repeats of a 100-residue motif that is glycine and proline rich []. The gelation factor's rod contains 6 copies of the repeat, whereas filamin has a rod constructed from 24 repeats. The resolution of the 3D structure of rod repeats from the gelation factor has shown that they consist of a beta-sandwich, formed by two beta-sheets arranged in an immunoglobulin-like fold [, ]. Because conserved residues that form the core of the repeats are preserved in filamin, the repeat structure should be common to the members of the gelation factor/filamin family. The head to tail homodimerisation is crucial to the function of the ABP120 and ABP280 proteins. This interaction involves a small portion at the distal end of the rod domains. For the gelation factor it has been shown that the carboxy-terminal repeat 6 dimerises through a double edge-to-edge extension of the beta-sheet and that repeat 5 contributes to dimerisation to some extent [, , ].; PDB: 2DI9_A 2EEC_A 2DIC_A 2EEA_A 2DMC_A 2EE9_A 2D7O_A 2D7N_A 2K7P_A 2NQC_A ....
Probab=99.91  E-value=1.1e-23  Score=147.42  Aligned_cols=89  Identities=35%  Similarity=0.524  Sum_probs=82.3

Q ss_pred             CCCCCcEEEcCCCcceEeCCeEEEEEEeCCCcCC-------ceEEEEECCCCC----ccceEEEEcCCCEEEEEEEcCCC
Q psy15726         48 AVGREPSGSGLGLYQARSGIVNSFTLETCGVASS-------EFDVIVTSPQGA----AVPVRCYQQKFANLLAEFTPTTT  116 (171)
Q Consensus        48 ~d~~~v~v~G~Gl~~~~vg~~~~F~V~~~~ag~~-------~l~v~i~~p~g~----~~~~~v~~~~dg~y~v~y~P~~~  116 (171)
                      .||++|.++|+||+.+.+|++++|+|+++++++.       .++|.|.+|++.    .+++++.+++||+|.++|+|.++
T Consensus         2 ~dp~~~~v~G~gl~~~~~g~~~~F~V~~~d~~g~~~~~~~~~~~v~i~~p~~~~~~~~~~~~v~~~~~G~y~v~y~p~~~   81 (101)
T PF00630_consen    2 IDPSKCKVSGPGLEPAVVGEPATFTVDTRDAGGNPVSSGGDEFQVTITSPDGKEEPVPVPVEVIDNGDGTYTVSYTPTEP   81 (101)
T ss_dssp             CSGCGEEEESGGGTEEETTSEEEEEEEETTTTSSBEESTSSEEEEEEESSSSESS--EEEEEEEEESSSEEEEEEEESSS
T ss_pred             CCCCEEEEECCccCCeECCCcEEEEEEEccCCCCccccCCceeEEEEeCCCCCccccccceEEEECCCCEEEEEEEeCcc
Confidence            4789999999999999999999999999998543       678999999997    68899999999999999999999


Q ss_pred             eeEEEEEEECCEEcCCCCeE
Q psy15726        117 GVYKIDVLQGARPVRGSPYL  136 (171)
Q Consensus       117 G~~~i~V~~~g~~I~gSPF~  136 (171)
                      |.|+|+|+|+|++|+||||+
T Consensus        82 G~y~i~V~~~g~~I~gSPf~  101 (101)
T PF00630_consen   82 GKYKISVKINGQPIPGSPFT  101 (101)
T ss_dssp             EEEEEEEEESSEB-TTSSEE
T ss_pred             EeEEEEEEECCEECcCCCcC
Confidence            99999999999999999995


No 5  
>smart00557 IG_FLMN Filamin-type immunoglobulin domains. These form a rod-like structure in the actin-binding cytoskeleton protein, filamin. The C-terminal repeats of filamin bind beta1-integrin (CD29).
Probab=98.62  E-value=4.2e-08  Score=67.85  Aligned_cols=31  Identities=13%  Similarity=-0.040  Sum_probs=29.6

Q ss_pred             cCCCCceeEEEEEEECCEEcCCCCeEEEecC
Q psy15726         16 PSWISYGLVQARTLREETSFRSCPMEVPVVD   46 (171)
Q Consensus        16 ~~P~~~G~~~i~V~~~g~~I~gSPf~v~V~~   46 (171)
                      |.|.++|.|.|+|+|+|++|++|||.+.|.+
T Consensus        62 y~P~~~G~~~i~V~~~g~~I~gSPF~v~V~~   92 (93)
T smart00557       62 YTPTEPGDYTVTVKFGGEHIPGSPFTVKVGP   92 (93)
T ss_pred             EEeCCCEeEEEEEEECCEECCCCCEEEEEeC
Confidence            7899999999999999999999999999875


No 6  
>PF00630 Filamin:  Filamin/ABP280 repeat;  InterPro: IPR017868 The many different actin cross-linking proteins share a common architecture, consisting of a globular actin-binding domain and an extended rod. Whereas their actin-binding domains consist of two calponin homology domains (see IPR001715 from INTERPRO), their rods fall into three families. The rod domain of the family including the Dictyostelium discoideum (Slime mould) gelation factor (ABP120) and human filamin (ABP280) is constructed from tandem repeats of a 100-residue motif that is glycine and proline rich []. The gelation factor's rod contains 6 copies of the repeat, whereas filamin has a rod constructed from 24 repeats. The resolution of the 3D structure of rod repeats from the gelation factor has shown that they consist of a beta-sandwich, formed by two beta-sheets arranged in an immunoglobulin-like fold [, ]. Because conserved residues that form the core of the repeats are preserved in filamin, the repeat structure should be common to the members of the gelation factor/filamin family. The head to tail homodimerisation is crucial to the function of the ABP120 and ABP280 proteins. This interaction involves a small portion at the distal end of the rod domains. For the gelation factor it has been shown that the carboxy-terminal repeat 6 dimerises through a double edge-to-edge extension of the beta-sheet and that repeat 5 contributes to dimerisation to some extent [, , ].; PDB: 2DI9_A 2EEC_A 2DIC_A 2EEA_A 2DMC_A 2EE9_A 2D7O_A 2D7N_A 2K7P_A 2NQC_A ....
Probab=98.19  E-value=1e-06  Score=61.05  Aligned_cols=26  Identities=8%  Similarity=-0.004  Sum_probs=24.1

Q ss_pred             cCCCCceeEEEEEEECCEEcCCCCeE
Q psy15726         16 PSWISYGLVQARTLREETSFRSCPME   41 (171)
Q Consensus        16 ~~P~~~G~~~i~V~~~g~~I~gSPf~   41 (171)
                      |.|.++|.|+|+|+|+|++|++|||+
T Consensus        76 y~p~~~G~y~i~V~~~g~~I~gSPf~  101 (101)
T PF00630_consen   76 YTPTEPGKYKISVKINGQPIPGSPFT  101 (101)
T ss_dssp             EEESSSEEEEEEEEESSEB-TTSSEE
T ss_pred             EEeCccEeEEEEEEECCEECcCCCcC
Confidence            78999999999999999999999995


No 7  
>PRK14081 triple tyrosine motif-containing protein; Provisional
Probab=97.17  E-value=0.015  Score=53.15  Aligned_cols=147  Identities=12%  Similarity=0.076  Sum_probs=86.5

Q ss_pred             cCCCCceeEEEEEEECCEEcCCC-----CeEEEecCC-CCC---CCcEEEcCCCcceEeCCeEEEEEEeCCCcCCceEEE
Q psy15726         16 PSWISYGLVQARTLREETSFRSC-----PMEVPVVDP-AVG---REPSGSGLGLYQARSGIVNSFTLETCGVASSEFDVI   86 (171)
Q Consensus        16 ~~P~~~G~~~i~V~~~g~~I~gS-----Pf~v~V~~~-~d~---~~v~v~G~Gl~~~~vg~~~~F~V~~~~ag~~~l~v~   86 (171)
                      |+|.++|.|+|.|..-+..-.+.     -....+... ..|   ..+.+.  .-+...+|++..+.+.+.+..+-..+..
T Consensus       344 w~P~~~G~Y~I~v~VKDk~S~~eyD~~~~v~f~i~~~~~~p~~I~~vl~d--~~~~~lvG~~i~i~v~a~gg~~~lY~f~  421 (667)
T PRK14081        344 WKPKIAGKYSITLWVKDISSKGEYEDKSSIDYTIEEKSKEPIKIEDVILD--KGKHILKGEEIKIRVIAEGGTNLRYSFI  421 (667)
T ss_pred             EeeCCCceEEEEEEEEcCcCcccccceEEEEEEEcccCCCCeEEEEEEEC--CCCceEeCCeEEEEEEecCCCeEEEEEE
Confidence            79999999999998766532211     112223221 112   223332  2246889999999999887643333333


Q ss_pred             EECCCCCccceEEEEcCC-CEEEEEEEcCCCeeEEEEEEECCEEcC-----CCCeEEEecCCCceEEec---CCcccEEc
Q psy15726         87 VTSPQGAAVPVRCYQQKF-ANLLAEFTPTTTGVYKIDVLQGARPVR-----GSPYLCQVYDASKVKIEH---KGLSNIVV  157 (171)
Q Consensus        87 i~~p~g~~~~~~v~~~~d-g~y~v~y~P~~~G~~~i~V~~~g~~I~-----gSPF~v~V~d~skv~~~G---~gl~~~~~  157 (171)
                      |.. +|..    +.+.+= -.-.+.|+|.++|.|+|.|...+.-=.     -.=+.+.|.++..+...-   +-.....+
T Consensus       422 ik~-ng~~----ve~~~Y~~~~~~~f~P~~~G~Y~IeV~vKdk~S~~~yD~~k~v~l~V~e~~P~~i~~il~~~~~~~~v  496 (667)
T PRK14081        422 IKK-DGKE----EEKIDYGKNNWVNFIPEEKGNYELEVRVKDKYSDKEYDAHTIVYIKVHEYIPAEIDYILLPVKEYYLV  496 (667)
T ss_pred             EEE-CCEE----EEEeecccccEEEEEECCCeeEEEEEEEecccCchhcccceEEEEEEeccCceeeeeEEecccccEEe
Confidence            333 4542    222211 245899999999999999988653211     111445555544332221   22334799


Q ss_pred             CCCEEEEEEccC
Q psy15726        158 NDPISFKCKSTE  169 (171)
Q Consensus       158 g~~~~F~Vd~~~  169 (171)
                      |++++|++.+.+
T Consensus       497 g~~i~~~~~~~~  508 (667)
T PRK14081        497 GDDIEIEVIIQN  508 (667)
T ss_pred             CCEEEEEEEEeC
Confidence            999999998765


No 8  
>KOG1428|consensus
Probab=96.92  E-value=0.009  Score=58.54  Aligned_cols=46  Identities=20%  Similarity=0.244  Sum_probs=38.9

Q ss_pred             EEEEcCCCEEEEEEEcCCCeeEEEEEEECCEEcCCCCeEEEecCCCc
Q psy15726         98 RCYQQKFANLLAEFTPTTTGVYKIDVLQGARPVRGSPYLCQVYDASK  144 (171)
Q Consensus        98 ~v~~~~dg~y~v~y~P~~~G~~~i~V~~~g~~I~gSPF~v~V~d~sk  144 (171)
                      -+.-++||+|-.+|||...|.|+|+++++|..|.- -+.++|.|+-|
T Consensus      2099 ~ir~nnd~~y~~nWTP~a~G~Y~l~~tiDG~eId~-~~eVkVkd~Pk 2144 (3738)
T KOG1428|consen 2099 LIRVNNDGTYCANWTPGAIGLYTLHVTIDGIEIDA-GLEVKVKDPPK 2144 (3738)
T ss_pred             eEEecCCCceecccCCccceeEEEEEEecceeecc-ceEEEeccCCC
Confidence            35567889999999999999999999999999974 38888866543


No 9  
>PRK14081 triple tyrosine motif-containing protein; Provisional
Probab=96.89  E-value=0.06  Score=49.38  Aligned_cols=148  Identities=14%  Similarity=-0.007  Sum_probs=83.4

Q ss_pred             cCCCCceeEEEEEEECCEEc---CCC--CeEEEecCCCCCCCc-EEEcCCCcceEeCCeEEEEEEeCCCcCCceEEE-EE
Q psy15726         16 PSWISYGLVQARTLREETSF---RSC--PMEVPVVDPAVGREP-SGSGLGLYQARSGIVNSFTLETCGVASSEFDVI-VT   88 (171)
Q Consensus        16 ~~P~~~G~~~i~V~~~g~~I---~gS--Pf~v~V~~~~d~~~v-~v~G~Gl~~~~vg~~~~F~V~~~~ag~~~l~v~-i~   88 (171)
                      |.|..+|.|+|.|..-+.-=   -++  =+.+.|.... |... .+-=+-.+...+|.+.+|++.+.+..  ++..+ ..
T Consensus       442 f~P~~~G~Y~IeV~vKdk~S~~~yD~~k~v~l~V~e~~-P~~i~~il~~~~~~~~vg~~i~~~~~~~~~k--~v~y~y~~  518 (667)
T PRK14081        442 FIPEEKGNYELEVRVKDKYSDKEYDAHTIVYIKVHEYI-PAEIDYILLPVKEYYLVGDDIEIEVIIQNTK--DVLIKYIL  518 (667)
T ss_pred             EEECCCeeEEEEEEEecccCchhcccceEEEEEEeccC-ceeeeeEEecccccEEeCCEEEEEEEEeCCC--eEEEEEEE
Confidence            78999999999998654311   111  1233443321 1111 11112234478999999999988743  22222 33


Q ss_pred             CCCCCccceEEEEcCCCEEEEEEEcCCCeeEEEEEEECCEEc---C--CCCeEEEecCCCceEEecCCcc--cEEcCCCE
Q psy15726         89 SPQGAAVPVRCYQQKFANLLAEFTPTTTGVYKIDVLQGARPV---R--GSPYLCQVYDASKVKIEHKGLS--NIVVNDPI  161 (171)
Q Consensus        89 ~p~g~~~~~~v~~~~dg~y~v~y~P~~~G~~~i~V~~~g~~I---~--gSPF~v~V~d~skv~~~G~gl~--~~~~g~~~  161 (171)
                      .-+|..+  .+++=. -.-..+|+|..+|.|+|.|+..+.-=   -  -.=+.+.|.+|..++-.-.-..  ...+|++.
T Consensus       519 ~~NG~~v--~~t~Ys-~~~~ysf~P~~~GkY~I~V~aKn~~s~~~~D~~k~v~~~V~e~~pi~nt~~~~~~~~~~~n~~~  595 (667)
T PRK14081        519 KINGHKV--EETDYI-KNKKYKFIPKCSGKYTIEVLAKNIKSTEEYDSKKEVKFYVREALPITNTKIKTSKKKFKCNEEV  595 (667)
T ss_pred             EECCEEE--EEeecc-ccceEEEeecCCceEEEEEEEcccccccccccceEEEEEEcCCCCceeeEEEeecceEEcCCeE
Confidence            4455432  122211 13367889999999999999765320   0  1114445556544332221111  37889999


Q ss_pred             EEEEEccC
Q psy15726        162 SFKCKSTE  169 (171)
Q Consensus       162 ~F~Vd~~~  169 (171)
                      +|++++++
T Consensus       596 t~~~~~~g  603 (667)
T PRK14081        596 TFSVKSEG  603 (667)
T ss_pred             EEEEEccC
Confidence            99999875


No 10 
>PF01835 A2M_N:  MG2 domain;  InterPro: IPR002890 The proteinase-binding alpha-macroglobulins (A2M) [] are large glycoproteins found in the plasma of vertebrates, in the hemolymph of some invertebrates and in reptilian and avian egg white. A2M-like proteins are able to inhibit all four classes of proteinases by a 'trapping' mechanism. They have a peptide stretch, called the 'bait region', which contains specific cleavage sites for different proteinases. When a proteinase cleaves the bait region, a conformational change is induced in the protein, thus trapping the proteinase. The entrapped enzyme remains active against low molecular weight substrates, whilst its activity toward larger substrates is greatly reduced, due to steric hindrance. Following cleavage in the bait region, a thiol ester bond, formed between the side chains of a cysteine and a glutamine, is cleaved and mediates the covalent binding of the A2M-like protein to the proteinase. This family includes the N-terminal region of the alpha-2-macroglobulin family. The inhibitor domains belong to MEROPS inhibitor family I39.; GO: 0004866 endopeptidase inhibitor activity; PDB: 2B39_B 3KLS_B 3PRX_C 3KM9_B 3PVM_C 3CU7_A 4E0S_A 4A5W_A 4ACQ_C 2P9R_B ....
Probab=96.68  E-value=0.01  Score=40.80  Aligned_cols=64  Identities=22%  Similarity=0.288  Sum_probs=47.7

Q ss_pred             eEeCCeEEEEEEeCCCc-------CCceEEEEECCCCCccceEEE--EcCCCEEEEEEE-cCC--CeeEEEEEEEC
Q psy15726         63 ARSGIVNSFTLETCGVA-------SSEFDVIVTSPQGAAVPVRCY--QQKFANLLAEFT-PTT--TGVYKIDVLQG  126 (171)
Q Consensus        63 ~~vg~~~~F~V~~~~ag-------~~~l~v~i~~p~g~~~~~~v~--~~~dg~y~v~y~-P~~--~G~~~i~V~~~  126 (171)
                      .+.|+.+.|.+-.++..       ...+.+.|.+|+|+.+.-...  .+.+|.+..+|. |..  .|.|+|.+..+
T Consensus        11 YrPGetV~~~~~~~~~~~~~~~~~~~~~~v~i~dp~g~~v~~~~~~~~~~~G~~~~~~~lp~~~~~G~y~i~~~~~   86 (99)
T PF01835_consen   11 YRPGETVHFRAIVRDLDNDFKPPANSPVTVTIKDPSGNEVFRWSVNTTNENGIFSGSFQLPDDAPLGTYTIRVKTD   86 (99)
T ss_dssp             E-TTSEEEEEEEEEEECTTCSCESSEEEEEEEEETTSEEEEEEEEEETTCTTEEEEEEE--SS---EEEEEEEEET
T ss_pred             cCCCCEEEEEEEEeccccccccccCCceEEEEECCCCCEEEEEEeeeeCCCCEEEEEEECCCCCCCEeEEEEEEEc
Confidence            56788888888865443       257889999999987654433  357799999997 665  49999999994


No 11 
>TIGR03503 conserved hypothetical protein TIGR03503. This set of conserved hypothetical protein has a phylogenetic range that closely matches that of TIGR03501, a putative C-terminal protein targeting signal.
Probab=95.82  E-value=0.21  Score=42.87  Aligned_cols=110  Identities=8%  Similarity=0.043  Sum_probs=67.3

Q ss_pred             cCCCCceeEEEEEEECCE----------EcCCCCeEEEecCCCCCCCcEEEcCCCcceEeCCeEEEEEEeCC----CcCC
Q psy15726         16 PSWISYGLVQARTLREET----------SFRSCPMEVPVVDPAVGREPSGSGLGLYQARSGIVNSFTLETCG----VASS   81 (171)
Q Consensus        16 ~~P~~~G~~~i~V~~~g~----------~I~gSPf~v~V~~~~d~~~v~v~G~Gl~~~~vg~~~~F~V~~~~----ag~~   81 (171)
                      ..-..+|.|.+.+.-++.          .|...|+.+.+..+.             .  .+.+..+.|....    .+.-
T Consensus       178 ~l~~~~G~Y~~~v~~~n~vF~R~~~q~v~V~p~Pi~~~~~~~~-------------~--~~~~h~l~v~~d~~~i~~~s~  242 (374)
T TIGR03503       178 NLDVAPGEYRPTYQSRNPVFLREVEQPVLVYPLPVSYTVIQSE-------------D--ESGAHQLMVDADAGHIDPGSL  242 (374)
T ss_pred             eccCCCceEEEEEEEcCceEEEEEEEeEEEECCCeeEEEEccC-------------C--CCCceEEEEEcccceeccccE
Confidence            346789999999987763          223344444333221             1  1334455555432    2445


Q ss_pred             ceEEEEECCCCCccceEEEEcCC-CEEEEEEEcCCCeeEEEEEEECCEEcCCCCeEEEecC
Q psy15726         82 EFDVIVTSPQGAAVPVRCYQQKF-ANLLAEFTPTTTGVYKIDVLQGARPVRGSPYLCQVYD  141 (171)
Q Consensus        82 ~l~v~i~~p~g~~~~~~v~~~~d-g~y~v~y~P~~~G~~~i~V~~~g~~I~gSPF~v~V~d  141 (171)
                      .+++++.+|+|. ......+... .+...-=...+.|.|+++.+.-+..+.|.||.+...+
T Consensus       243 ~~~~~~~~P~g~-~~~~~~~~~~~~~~~~l~~~~~~G~Y~i~~~~~~Tt~~GRe~~~~lpe  302 (374)
T TIGR03503       243 VIHGELVFPNGQ-IQQFSIELEEPETRVDLPANYEFGKYRVKGTVFGTTITGRELQLTLPE  302 (374)
T ss_pred             EEEEEEECCCCc-eEEecccCccCceEEeccCcCCCeEEEEEEEEEEecCCCCEEEEEcCC
Confidence            667788999996 4433333222 2222222344899999999999999999999998843


No 12 
>PF06312 Neurexophilin:  Neurexophilin
Probab=95.80  E-value=0.041  Score=43.91  Aligned_cols=45  Identities=16%  Similarity=0.161  Sum_probs=38.7

Q ss_pred             CceEEEEECCCC-CccceEEEEcCCCEEEEEEEcCCCeeEEEEEEE
Q psy15726         81 SEFDVIVTSPQG-AAVPVRCYQQKFANLLAEFTPTTTGVYKIDVLQ  125 (171)
Q Consensus        81 ~~l~v~i~~p~g-~~~~~~v~~~~dg~y~v~y~P~~~G~~~i~V~~  125 (171)
                      |++.++|.+++- .-+..+|+|.+||||+|.|+-.-+|.-.|.|.+
T Consensus        70 GDfrAri~s~~l~ag~~G~V~Dh~NGTYtv~F~L~W~G~v~vsV~L  115 (219)
T PF06312_consen   70 GDFRARIFSPKLKAGAAGKVTDHGNGTYTVSFPLLWPGQVSVSVSL  115 (219)
T ss_pred             CCcEEEEecCCccccceEEEEECCCCeEEEEEEeecCceEEEEEEE
Confidence            777999998874 336789999999999999999999998888875


No 13 
>PF13115 YtkA:  YtkA-like
Probab=93.67  E-value=0.64  Score=30.92  Aligned_cols=43  Identities=21%  Similarity=0.247  Sum_probs=31.0

Q ss_pred             CceEEEEECCC--CC---ccceEEEEcCCCEEEEEEEcCCCeeEEEEE
Q psy15726         81 SEFDVIVTSPQ--GA---AVPVRCYQQKFANLLAEFTPTTTGVYKIDV  123 (171)
Q Consensus        81 ~~l~v~i~~p~--g~---~~~~~v~~~~dg~y~v~y~P~~~G~~~i~V  123 (171)
                      +.+++.+..|.  |.   ...+.....++|.|.++.....+|.|.|.|
T Consensus        38 a~V~~~~~m~~~~g~~~~~~~~~~~~~~~G~Y~~~~~f~m~G~W~i~v   85 (86)
T PF13115_consen   38 ADVQFEIWMPDMEGMEPMTSKVELEETGPGVYEAEVTFSMAGTWQITV   85 (86)
T ss_pred             CEEEEEEEeCCCCCCCCCceeeeeecCCCCeEEEEeecCCCeeEEEEE
Confidence            45666666664  21   233445556789999999999999999987


No 14 
>COG2373 Large extracellular alpha-helical protein [General function prediction only]
Probab=93.43  E-value=0.67  Score=46.83  Aligned_cols=106  Identities=14%  Similarity=0.165  Sum_probs=75.1

Q ss_pred             eEeCCeEEEEEEeCCCc------CCceEEEEECCCCCccceE-EEEcCCCEEEEEEEcCC---CeeEEEEEEECCE-EcC
Q psy15726         63 ARSGIVNSFTLETCGVA------SSEFDVIVTSPQGAAVPVR-CYQQKFANLLAEFTPTT---TGVYKIDVLQGAR-PVR  131 (171)
Q Consensus        63 ~~vg~~~~F~V~~~~ag------~~~l~v~i~~p~g~~~~~~-v~~~~dg~y~v~y~P~~---~G~~~i~V~~~g~-~I~  131 (171)
                      .+.|+..++.+-+|+..      ...+++.+.+|+|..+.-. +.....|.|+++|.-.+   .|.|+|.+.+.|. .+-
T Consensus       405 YRpGE~v~~~~~~R~~~~~~a~~~~p~~l~v~~PdG~~~~~~~~~~~~~G~~~~~~~l~~na~tG~w~l~~~~~~~~~~~  484 (1621)
T COG2373         405 YRPGETVHVNALLRDFDGKTALDNQPLKLRVLDPDGSVLRTLTITLDEEGLYELSFPLPENALTGGYTLELYTGGKSAVI  484 (1621)
T ss_pred             CCCCceeeeeeeehhhcccccccCCCeEEEEECCCCcEEEEEEEeccccCceEEeeeCCCCCCcceEEEEEEeCCcccee
Confidence            56799999999888643      3578999999999765543 44457799999997443   5999999999986 666


Q ss_pred             CCCeEEEecCCCceEEecCCccc-EEcCCCEEEEEEcc
Q psy15726        132 GSPYLCQVYDASKVKIEHKGLSN-IVVNDPISFKCKST  168 (171)
Q Consensus       132 gSPF~v~V~d~skv~~~G~gl~~-~~~g~~~~F~Vd~~  168 (171)
                      ...|.|..+-+.+.++.=..-+. ...|++..++|+.+
T Consensus       485 s~~f~V~df~p~r~~i~l~~~k~~~~~g~~v~~~v~~~  522 (1621)
T COG2373         485 SMSFRVEDFIPDRFKINLTLDKTEWVPGKDVKIKVDLR  522 (1621)
T ss_pred             eeeEEhhHhCCceEEEecccccccccCCCcEEEEEEEE
Confidence            67788876555544432222112 47777777777654


No 15 
>PF02369 Big_1:  Bacterial Ig-like domain (group 1);  InterPro: IPR003344 Proteins that contain this domain are found in a variety of bacterial and phage surface proteins such as intimins. Intimin is a bacterial cell-adhesion molecule that mediates the intimate bacterial host-cell interaction. It contains three domains; two immunoglobulin-like domains and a C-type lectin-like module implying that carbohydrate recognition may be important in intimin-mediated cell adhesion [].; PDB: 1CWV_A 4E9L_A 1F02_I 1F00_I.
Probab=93.30  E-value=1.1  Score=31.07  Aligned_cols=63  Identities=16%  Similarity=0.181  Sum_probs=37.4

Q ss_pred             CeEEEEEEeCCCcC---CceEEEE--ECCCCCccce--EEEEcCCCEEEEEEEcCCCeeEEEEEEECCEE
Q psy15726         67 IVNSFTLETCGVAS---SEFDVIV--TSPQGAAVPV--RCYQQKFANLLAEFTPTTTGVYKIDVLQGARP  129 (171)
Q Consensus        67 ~~~~F~V~~~~ag~---~~l~v~i--~~p~g~~~~~--~v~~~~dg~y~v~y~P~~~G~~~i~V~~~g~~  129 (171)
                      ....+++..+|+.+   ....|.+  ....+...+.  ...-+.+|.+.++|+...+|.|.|...+++..
T Consensus        23 ~~~tltatV~D~~gnpv~g~~V~f~~~~~~~~l~~~~~~~~Td~~G~a~~tltst~aG~~~VtA~~~~~~   92 (100)
T PF02369_consen   23 DTNTLTATVTDANGNPVPGQPVTFSSSSSGGTLSPTNTSATTDSNGIATVTLTSTKAGTYTVTATVDGGS   92 (100)
T ss_dssp             S-EEEEEEEEETTSEB-TS-EEEE--EESSSEES-CEE-EEE-TTSEEEEEEE-SS-EEEEEEEEETTEE
T ss_pred             CcEEEEEEEEcCCCCCCCCCEEEEEEcCCCcEEecCccccEECCCEEEEEEEEecCceEEEEEEEECCcc
Confidence            44555555556543   3344555  3333333233  35566789999999999999999999998753


No 16 
>PF13860 FlgD_ig:  FlgD Ig-like domain; PDB: 3C12_A 3OSV_A.
Probab=92.50  E-value=1.6  Score=28.93  Aligned_cols=57  Identities=16%  Similarity=0.173  Sum_probs=33.6

Q ss_pred             eEEEEEEeCCCcCCceEEEEECCCCCccce-EEEEcCCCEEEEEEE-------cCCCeeEEEEEEE
Q psy15726         68 VNSFTLETCGVASSEFDVIVTSPQGAAVPV-RCYQQKFANLLAEFT-------PTTTGVYKIDVLQ  125 (171)
Q Consensus        68 ~~~F~V~~~~ag~~~l~v~i~~p~g~~~~~-~v~~~~dg~y~v~y~-------P~~~G~~~i~V~~  125 (171)
                      +..|.+..... ...+.+.|.+.+|+.+.. .+.....|.+.+.|-       |...|.|++.|..
T Consensus        13 ~~~~~~~l~~~-a~~v~v~I~d~~G~~V~t~~~~~~~~G~~~~~WdG~d~~G~~~~~G~Y~~~v~a   77 (81)
T PF13860_consen   13 KGSIEYTLPED-ADNVTVTIYDSNGQVVRTISLGSQSAGEHSFTWDGKDDDGNPVPDGTYTFRVTA   77 (81)
T ss_dssp             EEEEEEEECSS-CEEEEEEEEETTS-EEEEEEEEECSSEEEEEEE-SB-TTS-B--SEEEEEEEEE
T ss_pred             EEEEEEeCCCc-ccEEEEEEEcCCCCEEEEEEcCCcCCceEEEEECCCCCCcCCCCCCCEEEEEEE
Confidence            45555555432 246788888888876543 333445577777777       6667777777653


No 17 
>TIGR00864 PCC polycystin cation channel protein. Note: this model has been restricted to the amino half because for technical reasons.
Probab=91.44  E-value=12  Score=39.95  Aligned_cols=144  Identities=13%  Similarity=0.051  Sum_probs=85.0

Q ss_pred             Eeeccc--cCCCCceeEEEEEE-ECCEEcCCCCeEEEecCCCCCCCcEEE-cCCCcceEeCCeEEEEEEeCCCcCCceEE
Q psy15726         10 LLSEKA--PSWISYGLVQARTL-REETSFRSCPMEVPVVDPAVGREPSGS-GLGLYQARSGIVNSFTLETCGVASSEFDV   85 (171)
Q Consensus        10 ~~~~~~--~~P~~~G~~~i~V~-~~g~~I~gSPf~v~V~~~~d~~~v~v~-G~Gl~~~~vg~~~~F~V~~~~ag~~~l~v   85 (171)
                      .+++++  |+|.++|.|+|.|+ +|...-..-+-.+.|..+..  ...++ ++  ..+.+|+...|.+...- ++.++..
T Consensus      2001 ~~~G~~vty~~~~~G~~~I~v~A~N~l~s~n~t~~i~VQ~~i~--~~~L~~~p--~~~~~n~~v~f~~~~~~-~~~~v~~ 2075 (2740)
T TIGR00864      2001 IHMGKDVSYTAEAAGLLEIQLGAFNALGAENITLQLEAQDALM--DAALQAGP--QDCFTNKMAQFEAATSP-KPNFMAC 2075 (2740)
T ss_pred             eecCCceEEccCCCcEEEEEEEEEcccCcccccEEEEEEehee--eEEEeecC--cceecCceEEEEEEEcc-CCCcEEE
Confidence            355665  79999999999997 77654445566777776642  22333 34  35778999999998743 3345666


Q ss_pred             EEECCCCCccceEEEEcCCCEEEEEEEcCCCeeEEEEEEEC-CEEcCCCCeEEEe----cCCCceEEecCCcccEEcCCC
Q psy15726         86 IVTSPQGAAVPVRCYQQKFANLLAEFTPTTTGVYKIDVLQG-ARPVRGSPYLCQV----YDASKVKIEHKGLSNIVVNDP  160 (171)
Q Consensus        86 ~i~~p~g~~~~~~v~~~~dg~y~v~y~P~~~G~~~i~V~~~-g~~I~gSPF~v~V----~d~skv~~~G~gl~~~~~g~~  160 (171)
                      ...=-||....  .+    ...+++++-..+|+|.+.|... .....--=-.+.|    +..-.|.+-|+.-....-.+|
T Consensus      2076 ~WDFGDGt~~~--~T----~~~~~sHtY~r~G~Y~V~V~a~N~VS~~~a~~~i~V~vl~C~~P~V~i~~~~~~~i~RSq~ 2149 (2740)
T TIGR00864      2076 HWDFGDGSAGQ--DT----DEPRAEHEYLHPGDYRVQVNASNLVSFFSAHAEINVQVLACEEPEVDVVLALQLAIRRSQP 2149 (2740)
T ss_pred             EEEecCCCCcc--cc----CCceeEEEecCCccEEEEEEEEcceeeeeeEEEEEEEEecCCCCCcEeccCceeEEEecCc
Confidence            65544554221  11    2467888888889999998843 3221111112222    344445555554333444555


Q ss_pred             EEEE
Q psy15726        161 ISFK  164 (171)
Q Consensus       161 ~~F~  164 (171)
                      +.|.
T Consensus      2150 ~~lE 2153 (2740)
T TIGR00864      2150 NLLE 2153 (2740)
T ss_pred             ceEE
Confidence            5554


No 18 
>PF04234 CopC:  CopC domain;  InterPro: IPR007348 CopC is a bacterial blue copper protein that binds 1 atom of copper per protein molecule. Along with CopA, CopC mediates copper resistance by sequestration of copper in the periplasm [].; GO: 0005507 copper ion binding, 0046688 response to copper ion, 0042597 periplasmic space; PDB: 1IX2_B 1LYQ_A 2C9P_C 2C9R_A 2C9Q_A 1M42_A 1OT4_A 1NM4_A.
Probab=89.71  E-value=0.27  Score=33.97  Aligned_cols=53  Identities=26%  Similarity=0.416  Sum_probs=25.5

Q ss_pred             EEEEECCCCCccceE---EEEcCCCEEEEEEEc-CCCeeEEEEE---EECCEEcCCCCeEEE
Q psy15726         84 DVIVTSPQGAAVPVR---CYQQKFANLLAEFTP-TTTGVYKIDV---LQGARPVRGSPYLCQ  138 (171)
Q Consensus        84 ~v~i~~p~g~~~~~~---v~~~~dg~y~v~y~P-~~~G~~~i~V---~~~g~~I~gSPF~v~  138 (171)
                      .+.|.+++|+.++..   +.+ +...+.+.-.+ ..+|.|.|.-   .-+|.++.|+ |.+.
T Consensus        36 ~v~v~~~~g~~v~~~~~~~~~-~~~~~~~~l~~~l~~G~YtV~wrvvs~DGH~~~G~-~~F~   95 (97)
T PF04234_consen   36 SVTVTDPDGKRVDLGEPTVDG-DGKTLTVPLPPPLPPGTYTVSWRVVSADGHPVSGS-FSFT   95 (97)
T ss_dssp             EEEEEEEEETTSCTCEEEEEE-STTEEEEEESS---SEEEEEEEEEEETTSCEEEEE-EEEE
T ss_pred             EEEEEcCCCceeecCcceecC-CceEEEEECCCCCCCceEEEEEEEEecCCCCcCCE-EEEE
Confidence            344555555443322   222 22455555555 5666666652   3466777666 4443


No 19 
>PF13473 Cupredoxin_1:  Cupredoxin-like domain; PDB: 1IBZ_D 1IC0_E 1IBY_D.
Probab=89.45  E-value=2.8  Score=28.89  Aligned_cols=59  Identities=12%  Similarity=0.192  Sum_probs=33.3

Q ss_pred             ceEeCCeEEEEEEeCCCcCCceEEEEECCCCCccceEEEEcCCCEEEEEEEcCCCeeEEEEEEECC
Q psy15726         62 QARSGIVNSFTLETCGVASSEFDVIVTSPQGAAVPVRCYQQKFANLLAEFTPTTTGVYKIDVLQGA  127 (171)
Q Consensus        62 ~~~vg~~~~F~V~~~~ag~~~l~v~i~~p~g~~~~~~v~~~~dg~y~v~y~P~~~G~~~i~V~~~g  127 (171)
                      .+..|++..+.+.-.|...  -++.+.+     ...+..-..+.+.+++|+|.++|+|.+.=.+..
T Consensus        38 ~v~~G~~v~l~~~N~~~~~--h~~~i~~-----~~~~~~l~~g~~~~~~f~~~~~G~y~~~C~~~~   96 (104)
T PF13473_consen   38 TVKAGQPVTLTFTNNDSRP--HEFVIPD-----LGISKVLPPGETATVTFTPLKPGEYEFYCTMHP   96 (104)
T ss_dssp             EEETTCEEEEEEEE-SSS---EEEEEGG-----GTEEEEE-TT-EEEEEEEE-S-EEEEEB-SSS-
T ss_pred             EEcCCCeEEEEEEECCCCc--EEEEECC-----CceEEEECCCCEEEEEEcCCCCEEEEEEcCCCC
Confidence            4778999999998776654  3333433     122222223468899999999999987654433


No 20 
>PF11896 DUF3416:  Domain of unknown function (DUF3416);  InterPro: IPR021828  This presumed domain is functionally uncharacterised. This domain is found in bacteria and archaea. This domain is about 190 amino acids in length. This domain is found associated with PF00128 from PFAM. ; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds; PDB: 3ZT7_A 3ZST_B 3ZT6_A 3ZSS_D 3ZT5_B.
Probab=89.08  E-value=4.1  Score=31.76  Aligned_cols=63  Identities=16%  Similarity=0.112  Sum_probs=38.1

Q ss_pred             ceEeCCeEEEEEEeCCCcCCceEEE--EECCCCCc-cceEEEEcCCCEEEEEEEcCCCeeEEEEEE
Q psy15726         62 QARSGIVNSFTLETCGVASSEFDVI--VTSPQGAA-VPVRCYQQKFANLLAEFTPTTTGVYKIDVL  124 (171)
Q Consensus        62 ~~~vg~~~~F~V~~~~ag~~~l~v~--i~~p~g~~-~~~~v~~~~dg~y~v~y~P~~~G~~~i~V~  124 (171)
                      +..+|+...+.-+.=..|-+.+.+.  ...|++.. -.+.....+|..|..+|+|...|.|...|.
T Consensus        22 KrvvGe~v~V~Adif~DGHD~l~A~l~~r~~~~~~w~~vpM~~~gnDrW~a~f~~~~~G~~~f~Ve   87 (187)
T PF11896_consen   22 KRVVGEPVPVSADIFRDGHDALAAELLWRHPGEREWQEVPMTPLGNDRWEASFTPDRPGRYEFRVE   87 (187)
T ss_dssp             EEETT-EEEEEEEE--SSSS-EEEEEEEE-TTS-B----B-EESTS-EEEEEEE--SSEEEEEEEE
T ss_pred             eeecCCeEEEEEEEEecCCCcEEEEEEEECCCCCcceeeccccCCCCEEEEEEECCCceeEEEEEE
Confidence            3678998888877755566666655  55777644 234455668889999999999999988874


No 21 
>TIGR00868 hCaCC calcium-activated chloride channel protein 1. distributions. found a row in 1A13.INFO that was not parsed out
Probab=88.94  E-value=12  Score=35.76  Aligned_cols=108  Identities=10%  Similarity=0.015  Sum_probs=66.5

Q ss_pred             CCCceeEEEEEEECCEEcCCCCeEEEecCC---CCCCCcEEEcCCCc--ceEeCCeEEEEEEeCCCc----CCceEEEEE
Q psy15726         18 WISYGLVQARTLREETSFRSCPMEVPVVDP---AVGREPSGSGLGLY--QARSGIVNSFTLETCGVA----SSEFDVIVT   88 (171)
Q Consensus        18 P~~~G~~~i~V~~~g~~I~gSPf~v~V~~~---~d~~~v~v~G~Gl~--~~~vg~~~~F~V~~~~ag----~~~l~v~i~   88 (171)
                      .-+.|.++.+|.-.+   ...+..+.|...   .+..-+++.. -+.  ...-..+..+..+.++..    +..+.+.|+
T Consensus       559 ~a~~G~W~y~l~~~~---~~q~i~vtVtsra~s~~~~pi~~~a-~~~~~~~~~psp~ivya~V~kG~~pVlgA~Vta~i~  634 (863)
T TIGR00868       559 TAKVGTWTYSLQASA---NPQTLTLTVTSRARSPTLPPVTVTA-KMNKDTAKFPSPMIVYAKISQGFLPVLGANVTALIE  634 (863)
T ss_pred             CCccceeEEEEeccC---CCceEEEEEEeccCCCCCCCEEEEE-EEccccccCCCceEEEEEEeeCCceeeccEEEEEEE
Confidence            457888888874331   123345555522   1122233321 111  122344444444444321    367888999


Q ss_pred             CCCCCccceEEEEcC--------CCEEEEEEEc-CCCeeEEEEEEECCEE
Q psy15726         89 SPQGAAVPVRCYQQK--------FANLLAEFTP-TTTGVYKIDVLQGARP  129 (171)
Q Consensus        89 ~p~g~~~~~~v~~~~--------dg~y~v~y~P-~~~G~~~i~V~~~g~~  129 (171)
                      .|+|+.+..++.|++        ||.|.--|+. ...|.|.|.|..+|..
T Consensus       635 ~~~g~~~~l~l~DnGaGaD~~knDGIYS~yft~f~~nGrYsl~V~v~~~~  684 (863)
T TIGR00868       635 SENGHTVTLELLDNGAGADTVKNDGIYSRYFTAYDGNGRYSLKVRALGGV  684 (863)
T ss_pred             CCCCCEEEEEEeecCCCCCccccCceeeeeeecccCCceEEEEEEEeCCC
Confidence            999998888888874        6999999875 4689999999988754


No 22 
>PF07705 CARDB:  CARDB;  InterPro: IPR011635 The APHP (acidic peptide-dependent hydrolases/peptidase) domain is found in a variety of different proteins.; PDB: 2KUT_A 2L0D_A 3IDU_A 2KL6_A.
Probab=87.85  E-value=3.7  Score=27.32  Aligned_cols=66  Identities=15%  Similarity=0.156  Sum_probs=37.6

Q ss_pred             ceEeCCeEEEEEEeCCCcC---CceEEEEECCCCCccceEEEEcCC---CEEEEEEEcCCCeeEEEEEEECC
Q psy15726         62 QARSGIVNSFTLETCGVAS---SEFDVIVTSPQGAAVPVRCYQQKF---ANLLAEFTPTTTGVYKIDVLQGA  127 (171)
Q Consensus        62 ~~~vg~~~~F~V~~~~ag~---~~l~v~i~~p~g~~~~~~v~~~~d---g~y~v~y~P~~~G~~~i~V~~~g  127 (171)
                      .+..|++..|++.-+..|.   ..+.+.+...+.......+.....   .++.+.|++..+|.|.|.+.++.
T Consensus        14 ~~~~g~~~~i~~~V~N~G~~~~~~~~v~~~~~~~~~~~~~i~~L~~g~~~~v~~~~~~~~~G~~~i~~~iD~   85 (101)
T PF07705_consen   14 NVVPGEPVTITVTVKNNGTADAENVTVRLYLDGNSVSTVTIPSLAPGESETVTFTWTPPSPGSYTIRVVIDP   85 (101)
T ss_dssp             EEETTSEEEEEEEEEE-SSS-BEEEEEEEEETTEEEEEEEESEB-TTEEEEEEEEEE-SS-CEEEEEEEEST
T ss_pred             cccCCCEEEEEEEEEECCCCCCCCEEEEEEECCceeccEEECCcCCCcEEEEEEEEEeCCCCeEEEEEEEee
Confidence            4667888888877765544   345566554332211222322222   35567778889999999998864


No 23 
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=87.66  E-value=3.9  Score=27.88  Aligned_cols=81  Identities=17%  Similarity=0.199  Sum_probs=53.3

Q ss_pred             ceEeCCeEEEEEEe---CCCcCCceEEEEECCCCCccceEEEEcCCCEEEEEEEcCC--CeeEEEEEEECCE--EcCCCC
Q psy15726         62 QARSGIVNSFTLET---CGVASSEFDVIVTSPQGAAVPVRCYQQKFANLLAEFTPTT--TGVYKIDVLQGAR--PVRGSP  134 (171)
Q Consensus        62 ~~~vg~~~~F~V~~---~~ag~~~l~v~i~~p~g~~~~~~v~~~~dg~y~v~y~P~~--~G~~~i~V~~~g~--~I~gSP  134 (171)
                      .+.+|+...|+|..   .+..+..+...+...+|..+|..+.=.. .+..+.-+|..  .|.|.|.|.-.+.  .-..-.
T Consensus         8 ~a~~g~~f~~~i~~~tF~d~d~~~lty~~~~~~~~~lP~Wl~fd~-~~~~~~GtP~~~~~g~~~i~v~a~D~~g~~~~~~   86 (97)
T smart00736        8 TATEGESFSYTIPSSTFTDADGDTLTYSATLSDGSALPSWLSFDS-DTGTLSGTPTNSDVGSLSLKVTATDSSGASASDT   86 (97)
T ss_pred             EEeCCCeEEEEcChhceECCCCCeEEEEEEeCCCCCCCCeEEEeC-CCCEEEEECCCCCCcEEEEEEEEEECCCCEEEEE
Confidence            47788888888764   2344567777777667767787765332 24556668887  8999988875432  223346


Q ss_pred             eEEEecCCC
Q psy15726        135 YLCQVYDAS  143 (171)
Q Consensus       135 F~v~V~d~s  143 (171)
                      |.+.|.+++
T Consensus        87 f~i~V~~~~   95 (97)
T smart00736       87 FTITVVNTN   95 (97)
T ss_pred             EEEEEeCCC
Confidence            888776543


No 24 
>PRK06655 flgD flagellar basal body rod modification protein; Reviewed
Probab=87.05  E-value=5.3  Score=31.99  Aligned_cols=59  Identities=19%  Similarity=0.222  Sum_probs=39.6

Q ss_pred             CCeEEEEEEeCCCcCCceEEEEECCCCCccce-EEEEcCCCEEEEEE-------EcCCCeeEEEEEEE
Q psy15726         66 GIVNSFTLETCGVASSEFDVIVTSPQGAAVPV-RCYQQKFANLLAEF-------TPTTTGVYKIDVLQ  125 (171)
Q Consensus        66 g~~~~F~V~~~~ag~~~l~v~i~~p~g~~~~~-~v~~~~dg~y~v~y-------~P~~~G~~~i~V~~  125 (171)
                      |....|.++..+. ...+.+.|.+.+|+.+.. .+-....|.+.++|       .+..+|.|++.|..
T Consensus       112 ~~~~~~~~~l~~~-a~~vti~I~D~~G~~Vrt~~lg~~~aG~~~f~WDG~d~~G~~lp~G~Yt~~V~A  178 (225)
T PRK06655        112 GGTTPFGVELPSA-ADNVTVTITDSAGQVVRTIDLGAQSAGVVSFTWDGTDTDGNALPDGNYTIKASA  178 (225)
T ss_pred             CCceEEEEEcCCC-CcEEEEEEEcCCCCEEEEEecCCcCCCceeEEECCCCCCCCcCCCeeEEEEEEE
Confidence            3466777775543 246889999999987653 33345567887777       24557899888853


No 25 
>PRK10301 hypothetical protein; Provisional
Probab=86.62  E-value=2.3  Score=30.87  Aligned_cols=25  Identities=28%  Similarity=0.534  Sum_probs=15.8

Q ss_pred             CCCeeEEEE---EEECCEEcCCCCeEEEe
Q psy15726        114 TTTGVYKID---VLQGARPVRGSPYLCQV  139 (171)
Q Consensus       114 ~~~G~~~i~---V~~~g~~I~gSPF~v~V  139 (171)
                      ..+|.|+|.   +.-+|.++.|+ |.+.|
T Consensus        96 L~~G~YtV~Wrvvs~DGH~~~G~-~~F~V  123 (124)
T PRK10301         96 LKPGTYTVDWHVVSVDGHKTKGH-YTFSV  123 (124)
T ss_pred             CCCccEEEEEEEEecCCCccCCe-EEEEE
Confidence            456666665   34578888887 55544


No 26 
>cd02858 Esterase_N_term Esterase N-terminal domain. Esterases catalyze the hydrolysis of organic esters to release an alcohol or thiol and acid. The term can be applied to enzymes that hydrolyze carboxylate, phosphate and sulphate esters, but is more often restricted to the first class of substrate. The N-terminus of esterase may be related to the immunoglobulin and/or fibronectin type III superfamilies.  These domains are associated with different types of catalytic domains at  either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions.  Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitobiase, and chitinase.
Probab=85.37  E-value=8.7  Score=25.59  Aligned_cols=60  Identities=15%  Similarity=0.072  Sum_probs=40.5

Q ss_pred             CeEEEEEEeCCCcCCceEEEEECCCCCccceEEEEcCCCEEEEEEEcCCCeeEEEEEEECCEEc
Q psy15726         67 IVNSFTLETCGVASSEFDVIVTSPQGAAVPVRCYQQKFANLLAEFTPTTTGVYKIDVLQGARPV  130 (171)
Q Consensus        67 ~~~~F~V~~~~ag~~~l~v~i~~p~g~~~~~~v~~~~dg~y~v~y~P~~~G~~~i~V~~~g~~I  130 (171)
                      ..++|.+-+..|  ..+.+.+...+.  ....+...++|.|+++-.+..+|.|.-.+..+|..+
T Consensus         6 ~~v~F~vwAP~A--~~V~L~~~~~~~--~~~~m~~~~~G~W~~~v~~l~~g~Y~Y~~~vdg~~~   65 (85)
T cd02858           6 RTVTFRLFAPKA--NEVQVRGSWGGA--GSHPMTKDEAGVWSVTTGPLAPGIYTYSFLVDGVRV   65 (85)
T ss_pred             CcEEEEEECCCC--CEEEEEeecCCC--ccEeCeECCCeEEEEEECCCCCcEEEEEEEECCeEe
Confidence            457888888776  355555544221  223455567899999885577888999999998443


No 27 
>COG2372 CopC Uncharacterized protein, homolog of Cu resistance protein CopC [General function prediction only]
Probab=84.63  E-value=7.1  Score=28.55  Aligned_cols=72  Identities=26%  Similarity=0.332  Sum_probs=40.1

Q ss_pred             CeEEEEEEeCCCcC-CceEEEEECCCCCccceE---EEEcCCCEEEEEE-EcCCCeeEEEE---EEECCEEcCCCCeEEE
Q psy15726         67 IVNSFTLETCGVAS-SEFDVIVTSPQGAAVPVR---CYQQKFANLLAEF-TPTTTGVYKID---VLQGARPVRGSPYLCQ  138 (171)
Q Consensus        67 ~~~~F~V~~~~ag~-~~l~v~i~~p~g~~~~~~---v~~~~dg~y~v~y-~P~~~G~~~i~---V~~~g~~I~gSPF~v~  138 (171)
                      -|..|++.-.+.=. +-..+.+.+|+|..+...   ..+.++-.++|.- .|.++|.|.+.   |.-+|..+.|| +.+.
T Consensus        45 aP~~i~L~Fse~ve~~fs~~~l~~~d~~~v~t~~~~~~~~~~~~l~v~l~~~L~aG~Y~v~WrvvS~DGH~v~G~-~sFs  123 (127)
T COG2372          45 APAAITLEFSEGVEPGFSGAKLTGPDGEEVATAGTKLDEQNHTQLEVPLPQPLKAGVYTVDWRVVSSDGHVVKGS-ISFS  123 (127)
T ss_pred             CceeEEEecCCccCCCcceeEEECCCCCccccCcccccccCCcEEEecCcccCCCCcEEEEEEEEecCCcEeccE-EEEE
Confidence            35556665554322 224567788887654332   2222222255544 35667788776   55688888887 4444


Q ss_pred             e
Q psy15726        139 V  139 (171)
Q Consensus       139 V  139 (171)
                      |
T Consensus       124 V  124 (127)
T COG2372         124 V  124 (127)
T ss_pred             e
Confidence            4


No 28 
>PF05404 TRAP-delta:  Translocon-associated protein, delta subunit precursor (TRAP-delta);  InterPro: IPR008855 This family consists of several eukaryotic translocon-associated protein, delta subunit precursors (TRAP-delta or SSR-delta). The exact function of this protein is unknown [].; GO: 0005783 endoplasmic reticulum, 0016021 integral to membrane
Probab=83.49  E-value=5.8  Score=30.38  Aligned_cols=54  Identities=17%  Similarity=0.356  Sum_probs=36.5

Q ss_pred             CeEEEEEEeCC-CcCCceEEEEECCCCCccceEEEEcCCCEEEEEEEcC----CCeeEEEEEE
Q psy15726         67 IVNSFTLETCG-VASSEFDVIVTSPQGAAVPVRCYQQKFANLLAEFTPT----TTGVYKIDVL  124 (171)
Q Consensus        67 ~~~~F~V~~~~-ag~~~l~v~i~~p~g~~~~~~v~~~~dg~y~v~y~P~----~~G~~~i~V~  124 (171)
                      =-++|++.|++ +..-.|.+.+.   |+..|+..... .+.|+|+|+-.    ..|.|.|.+.
T Consensus        43 fi~EFsl~C~n~~~~~~LyAeV~---Gk~~PVar~~~-~nkYQVSW~~e~k~a~sG~y~V~~f  101 (167)
T PF05404_consen   43 FIAEFSLKCSNGAKNISLYAEVN---GKILPVARSGD-TNKYQVSWTEEHKKASSGTYEVKFF  101 (167)
T ss_pred             EEEEEEEEeCCCCcCccEEEEEC---CEEEEEEEcCC-CCceEEEEEechhhccCCceEEEEe
Confidence            34689999985 33456777775   67777654332 27999999843    2577877764


No 29 
>KOG1692|consensus
Probab=83.26  E-value=8.5  Score=30.19  Aligned_cols=33  Identities=24%  Similarity=0.263  Sum_probs=24.5

Q ss_pred             eEeCCeEEEEEEeCCCcCCceEEEEECCCCCcc
Q psy15726         63 ARSGIVNSFTLETCGVASSEFDVIVTSPQGAAV   95 (171)
Q Consensus        63 ~~vg~~~~F~V~~~~ag~~~l~v~i~~p~g~~~   95 (171)
                      +..|.+.+......+.|...+.+.|.+|+|+.+
T Consensus        37 ~~~gd~~~vsF~v~~gg~~~vd~~I~gP~~~~i   69 (201)
T KOG1692|consen   37 LEEGDKLSVSFEVIDGGFLGVDVEITGPDGKII   69 (201)
T ss_pred             hccCCEEEEEEEEecCCccceeEEEECCCCchh
Confidence            456666666666666666789999999999654


No 30 
>PF01483 P_proprotein:  Proprotein convertase P-domain;  InterPro: IPR002884 This domain, termed the P domain is approximately 150 amino acids in length and C-terminal to a serine endopeptidase domain which belong to MEROPS peptidase family S8 (clan SB), subfamily S8B (kexin). The domain is primarily associated with the calcium-dependent serine endopeptidases, kex2/subtilisin proprotein convertases (PCs), which have been identified in all eukaryotes [] and in the gammaproteobacteria, Nostoc (cyanobacteria) and in Streptomyces avermitilis. The P domain appears necessary for folding and maintaining the endopeptidase catalytic domain and to regulate its calcium and acidic pH dependence. In addition, contained within the middle of the P domain in most PC family members is the cognate integrin binding RGD sequence [], which may be required for intracellular compartmentalization and maintenance of enzyme stability within the ER. The integrity of the RGD sequence of proprotein convertase PC1 is critical for its zymogen and C-terminal processing and for its cellular trafficking [, ]. The carboxy-terminal tail provides uniqueness to each PC family member being the least conserved region of all convertases [].; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis; PDB: 1P8J_E 1OT5_B 2ID4_A 1R64_A 3HJR_A.
Probab=82.95  E-value=6.5  Score=26.14  Aligned_cols=54  Identities=15%  Similarity=0.274  Sum_probs=28.4

Q ss_pred             EEEEEeCCCcCCceEEEEECCCCCccceEEEEcC------CCEEEEEEEcC------CCeeEEEEEEE
Q psy15726         70 SFTLETCGVASSEFDVIVTSPQGAAVPVRCYQQK------FANLLAEFTPT------TTGVYKIDVLQ  125 (171)
Q Consensus        70 ~F~V~~~~ag~~~l~v~i~~p~g~~~~~~v~~~~------dg~y~v~y~P~------~~G~~~i~V~~  125 (171)
                      .+.++..-...++|.+.+.+|+|..  +.+.+..      ++.+..+|...      ..|.++|.|.=
T Consensus         5 ~v~v~i~H~~~gdL~i~L~SP~Gt~--~~L~~~~~~d~~~~~~~~~~f~~~~f~Ge~~~G~W~L~v~D   70 (87)
T PF01483_consen    5 QVSVDITHPYRGDLRITLISPSGTR--STLKDRRGSDDSGSGFLNWTFTSVAFWGESANGTWTLRVTD   70 (87)
T ss_dssp             EEEEEEEESSGGGEEEEEE-TT--E--EEEE-SSTTHHHSTSEEEEEEEESTTTT-B--EEEEEEEEE
T ss_pred             EEEEEEEcCCcCCEEEEEECCCCCE--EEEECCcCCCcccccccccEEEEEeecCccCCCEEEEEEEE
Confidence            4445444445579999999999974  3344332      34566666632      14555555543


No 31 
>COG1470 Predicted membrane protein [Function unknown]
Probab=82.69  E-value=12  Score=33.29  Aligned_cols=120  Identities=11%  Similarity=0.119  Sum_probs=61.3

Q ss_pred             CCceeEEEEEEECCE-EcCCCC-eEEEecCCCCCCCcEEEcCCCc-ceEeCCeEEEEEEeCCCcC---CceEEEEECCCC
Q psy15726         19 ISYGLVQARTLREET-SFRSCP-MEVPVVDPAVGREPSGSGLGLY-QARSGIVNSFTLETCGVAS---SEFDVIVTSPQG   92 (171)
Q Consensus        19 ~~~G~~~i~V~~~g~-~I~gSP-f~v~V~~~~d~~~v~v~G~Gl~-~~~vg~~~~F~V~~~~ag~---~~l~v~i~~p~g   92 (171)
                      -+||.|.+.|.-.+- .+..-+ ..+.+... ....+.+.+--++ .+..|+.....+.-.+.|.   .++.+.|.+|+|
T Consensus       347 a~pG~Ynv~I~A~s~s~v~~e~~lki~~~g~-~~~~v~l~~g~~~lt~taGee~~i~i~I~NsGna~LtdIkl~v~~Pqg  425 (513)
T COG1470         347 ATPGTYNVTITASSSSGVTRELPLKIKNTGS-YNELVKLDNGPYRLTITAGEEKTIRISIENSGNAPLTDIKLTVNGPQG  425 (513)
T ss_pred             CCCCceeEEEEEeccccceeeeeEEEEeccc-cceeEEccCCcEEEEecCCccceEEEEEEecCCCccceeeEEecCCcc
Confidence            467888887776553 222222 22222221 2223333221121 2667777776666655554   567799999998


Q ss_pred             Cccce---EE---EEcCCCEEEEEEE-cC--CCeeEEEEEEECCEEc-CCCCeEEEe
Q psy15726         93 AAVPV---RC---YQQKFANLLAEFT-PT--TTGVYKIDVLQGARPV-RGSPYLCQV  139 (171)
Q Consensus        93 ~~~~~---~v---~~~~dg~y~v~y~-P~--~~G~~~i~V~~~g~~I-~gSPF~v~V  139 (171)
                      =.+.+   .|   ..-+.++-.++-+ |.  .+|+|.|.|+--.-.. ...=+++.|
T Consensus       426 Wei~Vd~~~I~sL~pge~~tV~ltI~vP~~a~aGdY~i~i~~ksDq~s~e~tlrV~V  482 (513)
T COG1470         426 WEIEVDESTIPSLEPGESKTVSLTITVPEDAGAGDYRITITAKSDQASSEDTLRVVV  482 (513)
T ss_pred             ceEEECcccccccCCCCcceEEEEEEcCCCCCCCcEEEEEEEeeccccccceEEEEE
Confidence            32221   11   1223345555555 33  3788999887543222 223355555


No 32 
>PRK09619 flgD flagellar basal body rod modification protein; Reviewed
Probab=82.50  E-value=7.6  Score=30.94  Aligned_cols=70  Identities=11%  Similarity=0.057  Sum_probs=42.7

Q ss_pred             CcEEEcCCCcceEeCCeEEEEEEeCCCcCCceEEEEECCCCCccceEEEEcCCCEEEEEEEc----CCCeeEEEEEE
Q psy15726         52 EPSGSGLGLYQARSGIVNSFTLETCGVASSEFDVIVTSPQGAAVPVRCYQQKFANLLAEFTP----TTTGVYKIDVL  124 (171)
Q Consensus        52 ~v~v~G~Gl~~~~vg~~~~F~V~~~~ag~~~l~v~i~~p~g~~~~~~v~~~~dg~y~v~y~P----~~~G~~~i~V~  124 (171)
                      .|.+.|..+.-  .+....|.+...++ ...+.+.|.+.+|+......-....|.+.+.|-+    ..+|.|++.|.
T Consensus        97 ~V~~~g~~~~l--~~~~~~~~~~L~~~-a~~v~v~I~D~~G~v~t~~l~~~~aG~~~f~WDG~~~~lp~G~Y~~~V~  170 (218)
T PRK09619         97 QVMVESHEITL--GEDPVAGRLTLKHP-APTLTLHITDILGQEKKIDLGKQPAGPVNFTLDPAALGLQPGQYQLSVV  170 (218)
T ss_pred             EEEEecceEEe--cCCeeEEEEecCCc-CcEEEEEEEeCCCCEEEEecCCcCCCceeEEECCCCCCCCCceeEEEEE
Confidence            34454444321  23455666664332 3568899999999742222223455888888876    45788999986


No 33 
>PRK12812 flgD flagellar basal body rod modification protein; Reviewed
Probab=82.21  E-value=10  Score=31.11  Aligned_cols=58  Identities=14%  Similarity=0.053  Sum_probs=41.0

Q ss_pred             CeEEEEEEeCCCcCCceEEEEECCCCCccce-EEEEcCCCEEEEEEEc-------CCCeeEEEEEEE
Q psy15726         67 IVNSFTLETCGVASSEFDVIVTSPQGAAVPV-RCYQQKFANLLAEFTP-------TTTGVYKIDVLQ  125 (171)
Q Consensus        67 ~~~~F~V~~~~ag~~~l~v~i~~p~g~~~~~-~v~~~~dg~y~v~y~P-------~~~G~~~i~V~~  125 (171)
                      ....|.++.... ...+.+.|.+.+|+.++. .+.+...|.+.+.|--       ...|.|++.|..
T Consensus       128 ~~~~~~~~l~~~-a~~v~v~I~D~~G~~V~t~~lg~~~aG~~~f~WDG~d~~G~~~~~G~Yt~~v~A  193 (259)
T PRK12812        128 ELIALKLYFPED-SDEGTLEIYDSNNKLVEKIDFKEISQGLFTMEWDGRDNDGVYAGDGEYTIKAVY  193 (259)
T ss_pred             ceeEEEEecCCc-CceEEEEEEeCCCCEEEEEecCCCCCcceeEEECCCCCCCCcCCCeeeEEEEEE
Confidence            467777776433 357899999999986643 3335556888888865       347999999974


No 34 
>PF04151 PPC:  Bacterial pre-peptidase C-terminal domain;  InterPro: IPR007280 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  This domain is normally found at the C terminus of secreted archaeal and bacterial peptidases, the majority of which belong to MEROPS peptidase families M4 (vibriolysin, IPR001570 from INTERPRO), M9A amd M9B (microbial collangenase, IPR002169 from INTERPRO), M28 (aminopeptidase Ap1, IPR007484 from INTERPRO) and S8 (subtilisin family peptidases, IPR000209 from INTERPRO).; GO: 0008233 peptidase activity, 0006508 proteolysis; PDB: 4DY5_B 4DXZ_A 4DY3_B 3JQW_A 3JQX_C 1NQJ_B 1NQD_A 2O8O_A 1WMF_A 1WME_A ....
Probab=82.20  E-value=10  Score=24.00  Aligned_cols=52  Identities=19%  Similarity=0.313  Sum_probs=32.5

Q ss_pred             EEEEeCCCcCCceEEEEECCCCCccceEEEEcC--CC-EEEEEEEcCCCeeEEEEEE
Q psy15726         71 FTLETCGVASSEFDVIVTSPQGAAVPVRCYQQK--FA-NLLAEFTPTTTGVYKIDVL  124 (171)
Q Consensus        71 F~V~~~~ag~~~l~v~i~~p~g~~~~~~v~~~~--dg-~y~v~y~P~~~G~~~i~V~  124 (171)
                      ++|++.+.. +++.+.+.+++|..+... .+..  .+ .=.++|+..++|.|.|.|.
T Consensus        15 l~i~l~~~~-~d~dl~l~~~~g~~~~~~-d~~~~~~~~~~~i~~~~~~~GtYyi~V~   69 (70)
T PF04151_consen   15 LTIDLSGGS-GDADLYLYDSNGNSLASY-DDSSQSGGNDESITFTAPAAGTYYIRVY   69 (70)
T ss_dssp             EEEEECETT-SSEEEEEEETTSSSCEEC-CCCTCETTSEEEEEEEESSSEEEEEEEE
T ss_pred             EEEEEcCCC-CCeEEEEEcCCCCchhhh-eecCCCCCCccEEEEEcCCCEEEEEEEE
Confidence            445554332 367788888887532211 1111  12 4678899999999999985


No 35 
>TIGR03096 nitroso_cyanin nitrosocyanin. Nitrosocyanin, as described from the obligate chemolithoautotroph Nitrosomonas europaea, is a red copper protein of unknown function with sequence similarity to a number of blue copper redox proteins.
Probab=81.76  E-value=7.3  Score=28.82  Aligned_cols=65  Identities=17%  Similarity=0.185  Sum_probs=39.0

Q ss_pred             ceEeCCeEEEEEEeCCCcCCceEEEEECCCCCccceEEEEcCCCEEEEEEEcCCCeeEEEEEE--ECCEEcCCC
Q psy15726         62 QARSGIVNSFTLETCGVASSEFDVIVTSPQGAAVPVRCYQQKFANLLAEFTPTTTGVYKIDVL--QGARPVRGS  133 (171)
Q Consensus        62 ~~~vg~~~~F~V~~~~ag~~~l~v~i~~p~g~~~~~~v~~~~dg~y~v~y~P~~~G~~~i~V~--~~g~~I~gS  133 (171)
                      .+..|.++++++.-.+.-...+  .+.+ -|  +...+  ..+.+.+++|++.++|.|...=.  -...|++||
T Consensus        64 ~VkaGD~Vtl~vtN~d~~~H~f--~i~~-~g--is~~I--~pGet~TitF~adKpG~Y~y~C~~HP~~~H~~~~  130 (135)
T TIGR03096        64 VVKKGTPVKVTVENKSPISEGF--SIDA-YG--ISEVI--KAGETKTISFKADKAGAFTIWCQLHPKNIHLPGS  130 (135)
T ss_pred             EECCCCEEEEEEEeCCCCccce--EECC-CC--cceEE--CCCCeEEEEEECCCCEEEEEeCCCCChhhcCCCc
Confidence            3667888888776333322222  2222 11  12222  22468999999999999986533  336788887


No 36 
>PF13620 CarboxypepD_reg:  Carboxypeptidase regulatory-like domain; PDB: 3MN8_D 3P0D_I 3KCP_A 2B59_B 1UWY_A 1H8L_A 1QMU_A 2NSM_A.
Probab=81.66  E-value=9  Score=24.65  Aligned_cols=46  Identities=17%  Similarity=0.165  Sum_probs=28.6

Q ss_pred             CceEEEEECCCCCccceEEEEcCCCEEEEEEEcCCCeeEEEEEEECCEE
Q psy15726         81 SEFDVIVTSPQGAAVPVRCYQQKFANLLAEFTPTTTGVYKIDVLQGARP  129 (171)
Q Consensus        81 ~~l~v~i~~p~g~~~~~~v~~~~dg~y~v~y~P~~~G~~~i~V~~~g~~  129 (171)
                      ....|.+.+.++.. .....-+.+|.|.+.-.|.  |.|.|.+...|..
T Consensus        15 ~~a~V~l~~~~~~~-~~~~~Td~~G~f~~~~l~~--g~Y~l~v~~~g~~   60 (82)
T PF13620_consen   15 PGATVTLTDQDGGT-VYTTTTDSDGRFSFEGLPP--GTYTLRVSAPGYQ   60 (82)
T ss_dssp             TT-EEEET--TTTE-CCEEE--TTSEEEEEEE-S--EEEEEEEEBTTEE
T ss_pred             CCEEEEEEEeeCCC-EEEEEECCCceEEEEccCC--EeEEEEEEECCcc
Confidence            45566666655543 3445555779999997776  9999999887743


No 37 
>PF07523 Big_3:  Bacterial Ig-like domain (group 3);  InterPro: IPR011080 This entry represents bacterial domains with an Ig-like fold. These domains are found in a variety of bacterial surface proteins.; PDB: 2L7Y_A 2KPN_A.
Probab=80.24  E-value=4.6  Score=25.60  Aligned_cols=44  Identities=11%  Similarity=0.150  Sum_probs=25.4

Q ss_pred             ceEEEEECCCCCccceEEEEcCCCEEEEEEEcCCCeeEEEEEEECCEEc
Q psy15726         82 EFDVIVTSPQGAAVPVRCYQQKFANLLAEFTPTTTGVYKIDVLQGARPV  130 (171)
Q Consensus        82 ~l~v~i~~p~g~~~~~~v~~~~dg~y~v~y~P~~~G~~~i~V~~~g~~I  130 (171)
                      .+.+.....+|..+     +..+-+..-.|....+|.|.|.++|.+...
T Consensus        18 ~~~v~at~~dG~~~-----~~~~~~vs~~~d~~~~G~y~Vt~~y~~~t~   61 (67)
T PF07523_consen   18 GLFVTATYSDGTSL-----PLSDVTVSGTVDTSKAGTYTVTYTYKGVTA   61 (67)
T ss_dssp             CHEEEEEETTS-ES------GCCSEEES---TTS-CCEEEEEEECTEEE
T ss_pred             CCEEEEEEcCCCEe-----ceeeeEEEeeeecCCCceEEEEEEECCEEE
Confidence            45566777777652     222333333678899999999999999443


No 38 
>PF13584 BatD:  Oxygen tolerance
Probab=79.90  E-value=11  Score=33.08  Aligned_cols=62  Identities=21%  Similarity=0.367  Sum_probs=45.0

Q ss_pred             EEEEEEEcCCCeeEEEE---EEECCEEcCCCCeEEEecCCCc------------eEEecC-CcccEEcCCCEEEEEEc
Q psy15726        106 NLLAEFTPTTTGVYKID---VLQGARPVRGSPYLCQVYDASK------------VKIEHK-GLSNIVVNDPISFKCKS  167 (171)
Q Consensus       106 ~y~v~y~P~~~G~~~i~---V~~~g~~I~gSPF~v~V~d~sk------------v~~~G~-gl~~~~~g~~~~F~Vd~  167 (171)
                      +|.+.+.|...|.|+|-   |.++|+...--|..++|.+++.            +.+.-. .-+.+.+||++.+++..
T Consensus        74 ~~~~~l~p~~~G~~~IP~~~v~v~Gk~~~S~pi~i~V~~~~~~~~~~~~~~~~~~~l~~~v~~~~~Yvge~v~lt~~l  151 (484)
T PF13584_consen   74 TYTYTLQPKKTGTFTIPPFTVEVDGKTYKSQPITIEVSKASQSPSQPPSNADDDVFLEAEVSKKSVYVGEPVILTLRL  151 (484)
T ss_pred             EEEEEEEecccceEEEceEEEEECCEEEeecCEEEEEEecccCCccccccccccEEEEEEeCCCceecCCcEEEEEEE
Confidence            67888899999999875   6779999999999999944332            111111 12347889998888753


No 39 
>PRK12633 flgD flagellar basal body rod modification protein; Provisional
Probab=78.17  E-value=12  Score=29.95  Aligned_cols=66  Identities=24%  Similarity=0.361  Sum_probs=42.8

Q ss_pred             eEEEEEEeCCCcCCceEEEEECCCCCccce-EEEEcCCCEEEEEEEcC-------CCeeEEEEEEE---CCEEcCCCC
Q psy15726         68 VNSFTLETCGVASSEFDVIVTSPQGAAVPV-RCYQQKFANLLAEFTPT-------TTGVYKIDVLQ---GARPVRGSP  134 (171)
Q Consensus        68 ~~~F~V~~~~ag~~~l~v~i~~p~g~~~~~-~v~~~~dg~y~v~y~P~-------~~G~~~i~V~~---~g~~I~gSP  134 (171)
                      ...|.++..++ ...+.+.|.+.+|+.++. .+-....|.+.+.|-..       ..|.|++.|..   +|+.+.-.+
T Consensus       117 ~~~~~~~l~~~-a~~v~v~I~D~~G~vV~t~~lg~~~aG~~~f~WDG~d~~G~~~~~G~Y~~~V~a~~~~G~~~~~~~  193 (230)
T PRK12633        117 ATPFGIDLQGD-ATKVTVKVLDPSGAVVRTMELGDLKTGVHTLQWDGNNDGGQPLADGKYSITVSASDADAKPVKAEA  193 (230)
T ss_pred             ceeEEEecCCc-CcEEEEEEEeCCCCEEEEEecCCCCCCceeEEECCCCCCCCcCCCcceEEEEEEEeCCCcEEeeee
Confidence            45566654332 357889999999987653 33345567888888654       46889988865   345554333


No 40 
>PF07495 Y_Y_Y:  Y_Y_Y domain;  InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=77.89  E-value=3.7  Score=25.50  Aligned_cols=19  Identities=21%  Similarity=0.371  Sum_probs=15.7

Q ss_pred             EEEEEcCCCeeEEEEEEEC
Q psy15726        108 LAEFTPTTTGVYKIDVLQG  126 (171)
Q Consensus       108 ~v~y~P~~~G~~~i~V~~~  126 (171)
                      .++|++..+|.|+|.|+..
T Consensus        30 ~~~~~~L~~G~Y~l~V~a~   48 (66)
T PF07495_consen   30 SISYTNLPPGKYTLEVRAK   48 (66)
T ss_dssp             EEEEES--SEEEEEEEEEE
T ss_pred             EEEEEeCCCEEEEEEEEEE
Confidence            9999999999999999864


No 41 
>KOG1428|consensus
Probab=77.75  E-value=8.3  Score=39.26  Aligned_cols=36  Identities=19%  Similarity=0.100  Sum_probs=30.9

Q ss_pred             eeccccCCCCceeEEEEEEECCEEcCCCCeEEEecCC
Q psy15726         11 LSEKAPSWISYGLVQARTLREETSFRSCPMEVPVVDP   47 (171)
Q Consensus        11 ~~~~~~~P~~~G~~~i~V~~~g~~I~gSPf~v~V~~~   47 (171)
                      +++-+|+|...|.|+++++.+|..|.. -+.|.|.++
T Consensus      2107 ~y~~nWTP~a~G~Y~l~~tiDG~eId~-~~eVkVkd~ 2142 (3738)
T KOG1428|consen 2107 TYCANWTPGAIGLYTLHVTIDGIEIDA-GLEVKVKDP 2142 (3738)
T ss_pred             ceecccCCccceeEEEEEEecceeecc-ceEEEeccC
Confidence            456789999999999999999999954 478888876


No 42 
>PF00017 SH2:  SH2 domain;  InterPro: IPR000980 The Src homology 2 (SH2) domain is a protein domain of about 100 amino-acid residues first identified as a conserved sequence region between the oncoproteins Src and Fps []. Similar sequences were later found in many other intracellular signal-transducing proteins []. SH2 domains function as regulatory modules of intracellular signalling cascades by interacting with high affinity to phosphotyrosine-containing target peptides in a sequence-specific, SH2 domains recognise between 3-6 residues C-terminal to the phosphorylated tyrosine in a fashion that differs from one SH2 domain to another, and strictly phosphorylation-dependent manner [, , , ]. They are found in a wide variety of protein contexts e.g., in association with catalytic domains of phospholipase Cy (PLCy) and the non-receptor protein tyrosine kinases; within structural proteins such as fodrin and tensin; and in a group of small adaptor molecules, i.e Crk and Nck. The domains are frequently found as repeats in a single protein sequence and will then often bind both mono- and di-phosphorylated substrates.  The structure of the SH2 domain belongs to the alpha+beta class, its overall shape forming a compact flattened hemisphere. The core structural elements comprise a central hydrophobic anti-parallel beta-sheet, flanked by 2 short alpha-helices. The loop between strands 2 and 3 provides many of the binding interactions with the phosphate group of its phosphopeptide ligand, and is hence designated the phosphate binding loop, the phosphorylated ligand binds perpendicular to the beta-sheet and typically interacts with the phosphate binding loop and a hydrophobic binding pocket that interacts with a pY+3 side chain. The N- and C-termini of the domain are close together in space and on the opposite face from the phosphopeptide binding surface and it has been speculated that this has facilitated their integration into surface-exposed regions of host proteins [].; GO: 0005515 protein binding; PDB: 1M27_A 1KA6_A 1D4W_B 1D4T_A 1D1Z_B 1KA7_A 1UUR_A 1UUS_A 1BLJ_A 1BLK_A ....
Probab=77.13  E-value=6.7  Score=25.16  Aligned_cols=27  Identities=15%  Similarity=0.247  Sum_probs=24.7

Q ss_pred             cCCCEEEEEEEcCCCeeEEEEEEECCE
Q psy15726        102 QKFANLLAEFTPTTTGVYKIDVLQGAR  128 (171)
Q Consensus       102 ~~dg~y~v~y~P~~~G~~~i~V~~~g~  128 (171)
                      ..+|+|.|......+|.|.|.|+++++
T Consensus        19 ~~~G~FLvR~s~~~~~~~~Lsv~~~~~   45 (77)
T PF00017_consen   19 KPDGTFLVRPSSSKPGKYVLSVRFDGK   45 (77)
T ss_dssp             SSTTEEEEEEESSSTTSEEEEEEETTE
T ss_pred             CCCCeEEEEeccccccccccccccccc
Confidence            568999999999999999999999993


No 43 
>PF14545 DBB:  Dof, BCAP, and BANK (DBB) motif,
Probab=75.27  E-value=32  Score=25.69  Aligned_cols=75  Identities=12%  Similarity=0.087  Sum_probs=50.4

Q ss_pred             ceEeCCeEEEEEEeCC-C-cCCceEEEEECCCCCccceEEEEcCCCEEEEEEE-cCC----CeeEEEEEEECCEEcCCCC
Q psy15726         62 QARSGIVNSFTLETCG-V-ASSEFDVIVTSPQGAAVPVRCYQQKFANLLAEFT-PTT----TGVYKIDVLQGARPVRGSP  134 (171)
Q Consensus        62 ~~~vg~~~~F~V~~~~-a-g~~~l~v~i~~p~g~~~~~~v~~~~dg~y~v~y~-P~~----~G~~~i~V~~~g~~I~gSP  134 (171)
                      +++.|++ ++.|-.++ - +++.++|.+...++........-.  .-|++++. |..    +|...++|..||..+.-.|
T Consensus         7 rI~cg~~-~vfIiL~~~l~~~~~~eVef~~~n~~~~~~~~~~~--N~yt~~~~aPd~~~~pag~V~v~v~~~g~~~~~~~   83 (142)
T PF14545_consen    7 RIRCGQP-EVFIILRDPLDEEDTVEVEFESNNKPIRRVPAKWE--NPYTLQFKAPDFCLEPAGSVNVRVYCDGVSLGTRQ   83 (142)
T ss_pred             eeecCCC-EEEEEEeCCCCCCCeEEEEEEeCCCeeEeccceEE--CCEEEEEECchhcCCCCceEEEEEEECCEEEEEEe
Confidence            4677888 55555544 3 246899999887764322222221  26999997 443    6999999999998887777


Q ss_pred             eEEEe
Q psy15726        135 YLCQV  139 (171)
Q Consensus       135 F~v~V  139 (171)
                      .++.-
T Consensus        84 ikyys   88 (142)
T PF14545_consen   84 IKYYS   88 (142)
T ss_pred             EEEEe
Confidence            66654


No 44 
>PF06016 Reovirus_L2:  Reovirus core-spike protein lambda-2 (L2);  InterPro: IPR010311 This family consists of several Reovirus core-spike protein lambda-2 (L2) sequences. The reovirus L2 genome segment encodes the core spike protein lambda-2, which mediates enzymatic reactions in 5' capping of the viral plus-strand transcripts [].; GO: 0004482 mRNA (guanine-N7-)-methyltransferase activity, 0004484 mRNA guanylyltransferase activity, 0005524 ATP binding, 0006370 mRNA capping, 0019028 viral capsid; PDB: 1EJ6_A 3IYL_W 3K1Q_A.
Probab=74.42  E-value=4  Score=40.24  Aligned_cols=103  Identities=21%  Similarity=0.284  Sum_probs=54.6

Q ss_pred             eEeCCeEEEEEEeCCCcCCceEEE------EECCCCC----ccceEE-EEcCCCEEEEEEEcCCCeeEEEEEEECCEEcC
Q psy15726         63 ARSGIVNSFTLETCGVASSEFDVI------VTSPQGA----AVPVRC-YQQKFANLLAEFTPTTTGVYKIDVLQGARPVR  131 (171)
Q Consensus        63 ~~vg~~~~F~V~~~~ag~~~l~v~------i~~p~g~----~~~~~v-~~~~dg~y~v~y~P~~~G~~~i~V~~~g~~I~  131 (171)
                      .+||++++++|..-+.. ..+.+.      ++...|+    .-++.. .+..+.-+.++|+|..+|.|.+.+..++.+|+
T Consensus      1035 ~~VGq~~~i~I~~~~~~-~s~~~~~~gV~v~~~~~~~~~s~~~~v~~q~~~~~~~~~l~~tp~~aGi~~vv~~~~~~~v~ 1113 (1289)
T PF06016_consen 1035 LVVGQQCSITIPGPNPQ-ASVSVTHDGVEVFTGTAGNLVSMVGPVALQYDPANNEWTLTWTPSTAGILDVVVDDGPVPVP 1113 (1289)
T ss_dssp             SCTTT-EEEEEESS-TT-EEEEEECTTCEEEECCCTTTCESSSEEEEEEETTTTEEEEEEE-SS-CEEEEEECTTTCCEE
T ss_pred             ccccceEEEEEccCCCC-ceEEEEECCEEEEEecCCcccccccceeEEEeccCceEEEEEccCCCceEEEEeccCCcccc
Confidence            56777777777765432 111111      1112222    112332 35666799999999999999997777777775


Q ss_pred             -CCCeEEEecCCCceEEecCCcccEEcCCCEEEEEEc
Q psy15726        132 -GSPYLCQVYDASKVKIEHKGLSNIVVNDPISFKCKS  167 (171)
Q Consensus       132 -gSPF~v~V~d~skv~~~G~gl~~~~~g~~~~F~Vd~  167 (171)
                       || |.+...|+.-...++.-++--.+|-.+..++|.
T Consensus      1114 ~GS-~~i~~Pd~~i~v~~Pa~~D~T~aGtd~~I~~d~ 1149 (1289)
T PF06016_consen 1114 LGS-FTIEPPDPTITVTWPANLDYTDAGTDVNITCDP 1149 (1289)
T ss_dssp             EEE-EEEB-----EEEE--TT--SSTT-EEEEEEE-T
T ss_pred             cce-eEeeCCCCceEEecccceeeeccCceeeEEecc
Confidence             55 777778887777777666666677666666653


No 45 
>PF13584 BatD:  Oxygen tolerance
Probab=73.53  E-value=63  Score=28.35  Aligned_cols=61  Identities=11%  Similarity=0.034  Sum_probs=42.6

Q ss_pred             cCCCCceeEEE---EEEECCEEcCCCCeEEEecCCCCC---------CCcEEEcC-CCcceEeCCeEEEEEEeC
Q psy15726         16 PSWISYGLVQA---RTLREETSFRSCPMEVPVVDPAVG---------REPSGSGL-GLYQARSGIVNSFTLETC   76 (171)
Q Consensus        16 ~~P~~~G~~~i---~V~~~g~~I~gSPf~v~V~~~~d~---------~~v~v~G~-Gl~~~~vg~~~~F~V~~~   76 (171)
                      +.|.+.|.|.|   .|++||+...-.|..+.|.+....         ..+.+.-. .-..+-+|++..+++..-
T Consensus        79 l~p~~~G~~~IP~~~v~v~Gk~~~S~pi~i~V~~~~~~~~~~~~~~~~~~~l~~~v~~~~~Yvge~v~lt~~ly  152 (484)
T PF13584_consen   79 LQPKKTGTFTIPPFTVEVDGKTYKSQPITIEVSKASQSPSQPPSNADDDVFLEAEVSKKSVYVGEPVILTLRLY  152 (484)
T ss_pred             EEecccceEEEceEEEEECCEEEeecCEEEEEEecccCCccccccccccEEEEEEeCCCceecCCcEEEEEEEE
Confidence            57999999985   467799998899999999876321         12222211 223478898888777763


No 46 
>PF14524 Wzt_C:  Wzt C-terminal domain; PDB: 2R5O_B.
Probab=72.47  E-value=31  Score=24.25  Aligned_cols=64  Identities=19%  Similarity=0.167  Sum_probs=38.0

Q ss_pred             ceEeCCeEEEEEEeCCC---cCCceEEEEECCCCCcc--------ceEEEEcCCCEEEEEEE---cCCCeeEEEEEEE
Q psy15726         62 QARSGIVNSFTLETCGV---ASSEFDVIVTSPQGAAV--------PVRCYQQKFANLLAEFT---PTTTGVYKIDVLQ  125 (171)
Q Consensus        62 ~~~vg~~~~F~V~~~~a---g~~~l~v~i~~p~g~~~--------~~~v~~~~dg~y~v~y~---P~~~G~~~i~V~~  125 (171)
                      ....|++..|.++.+-.   ..-.+.+.|.+.+|..+        ...+.....|+|+++|+   +..+|.|.|++.+
T Consensus        30 ~~~~ge~~~i~i~~~~~~~i~~~~~~~~i~~~~g~~v~~~~t~~~~~~~~~~~~g~~~~~~~i~~~L~~G~Y~i~v~l  107 (142)
T PF14524_consen   30 SFESGEPIRIRIDYEVNEDIDDPVFGFAIRDSDGQRVFGTNTYDSGFPIPLSEGGTYEVTFTIPKPLNPGEYSISVGL  107 (142)
T ss_dssp             SEETTSEEEEEEEEEESS-EEEEEEEEEEEETT--EEEEEEHHHHT--EEE-TT-EEEEEEEEE--B-SEEEEEEEEE
T ss_pred             EEeCCCEEEEEEEEEECCCCCccEEEEEEEcCCCCEEEEECccccCccccccCCCEEEEEEEEcCccCCCeEEEEEEE
Confidence            36678888777776522   22346677778777432        12333333788888876   3569999999999


No 47 
>TIGR00864 PCC polycystin cation channel protein. Note: this model has been restricted to the amino half because for technical reasons.
Probab=72.22  E-value=1.3e+02  Score=32.70  Aligned_cols=139  Identities=14%  Similarity=0.094  Sum_probs=71.3

Q ss_pred             CCCCceeEEEEEEECCE-EcCCCCeEEEecCCCCCCCcEEEcCCCc-ceEeCCeEEEEEEeCCCcCCceEEEEECCCCCc
Q psy15726         17 SWISYGLVQARTLREET-SFRSCPMEVPVVDPAVGREPSGSGLGLY-QARSGIVNSFTLETCGVASSEFDVIVTSPQGAA   94 (171)
Q Consensus        17 ~P~~~G~~~i~V~~~g~-~I~gSPf~v~V~~~~d~~~v~v~G~Gl~-~~~vg~~~~F~V~~~~ag~~~l~v~i~~p~g~~   94 (171)
                      +-..+|.|.|.|+..++ --......+.|..+...-.++...++-. -..+|....|+=...  .+..+.-...=.+|..
T Consensus      1749 tf~~~G~y~VtVta~N~vss~~~s~~V~VqepV~GL~i~~~~~~~~~~~~ag~~v~F~a~vs--tGsnVsw~W~f~~g~s 1826 (2740)
T TIGR00864      1749 SFPSAGLHLVTMKAFNELGSANASEEVDVQEPISGLKIRAADAGEQNFFAADSSVCFQGELA--TGTNVSWCWAIDGGSS 1826 (2740)
T ss_pred             EccCCceEEEEEEEEccccccceeEEEEEEeccccceEecCCCCcccceecCcEEEEEEEcc--CCCeeEEEEEeCCCCc
Confidence            45679999999985443 2223446666766542222222222111 244565555544322  1233333332222210


Q ss_pred             cceEEEEcCCCEEEEEEEcCCCeeEEEEEEEC-CEEcCCCCeEEEecCCC-ceEEecCCcccEEcCCCEEEEEEc
Q psy15726         95 VPVRCYQQKFANLLAEFTPTTTGVYKIDVLQG-ARPVRGSPYLCQVYDAS-KVKIEHKGLSNIVVNDPISFKCKS  167 (171)
Q Consensus        95 ~~~~v~~~~dg~y~v~y~P~~~G~~~i~V~~~-g~~I~gSPF~v~V~d~s-kv~~~G~gl~~~~~g~~~~F~Vd~  167 (171)
                      .        . ...++|++.++|.|.|++..- +..=..-...+.|-|+= ...+. .+...+.+|+...|+++.
T Consensus      1827 ~--------~-gk~v~~Tf~~aG~ytV~L~AsN~vs~~~~s~~~~VQe~I~~L~L~-as~~~~~~n~~v~fsa~l 1891 (2740)
T TIGR00864      1827 K--------M-GKHACMTFPDAGTFAIRLNASNAVSGKSASREFFAEEPIFGLELK-ASKKIAAIGEKVEFQILL 1891 (2740)
T ss_pred             c--------c-cceeEEecCCCeEEEEEEEEEcccCcceeeeeEEEEEecceEEEe-cccccccCCCEEEEEEEe
Confidence            0        1 367889999999999999843 22211123445554432 22222 223346778888887754


No 48 
>PRK12634 flgD flagellar basal body rod modification protein; Reviewed
Probab=72.12  E-value=37  Score=27.11  Aligned_cols=55  Identities=18%  Similarity=0.205  Sum_probs=37.6

Q ss_pred             CceEEEEECCCCCccce-EEEEcCCCEEEEEEEcC-------CCeeEEEEEEE---CCEEcCCCCe
Q psy15726         81 SEFDVIVTSPQGAAVPV-RCYQQKFANLLAEFTPT-------TTGVYKIDVLQ---GARPVRGSPY  135 (171)
Q Consensus        81 ~~l~v~i~~p~g~~~~~-~v~~~~dg~y~v~y~P~-------~~G~~~i~V~~---~g~~I~gSPF  135 (171)
                      ..+.+.|.+.+|+.+.. .+.....|.+.++|-..       ..|.|++.|..   +|+.+.-.+.
T Consensus       122 ~~v~i~I~d~~G~~V~t~~lg~~~aG~~~f~WDG~d~~G~~~~~G~Yt~~v~a~~~~G~~~~~~~~  187 (221)
T PRK12634        122 GFVNFEITDANGAFVKQISVPASAAGEVSFAWDGTDANGNRMAAGKYGVTATQTDTAGSKSKLSTY  187 (221)
T ss_pred             CeEEEEEEcCCCCEEEEEecCCcCCCceeEEECCCCCCCCcCCCeeeEEEEEEEeCCCcEEeeeee
Confidence            57889999999986643 23345568888888663       35899999864   4555554443


No 49 
>smart00252 SH2 Src homology 2 domains. Src homology 2 domains bind phosphotyrosine-containing polypeptides via 2 surface pockets. Specificity is provided via interaction with residues that are distinct from the phosphotyrosine. Only a single occurrence of a SH2 domain has been found in S. cerevisiae.
Probab=71.89  E-value=6.1  Score=25.78  Aligned_cols=28  Identities=14%  Similarity=0.170  Sum_probs=24.1

Q ss_pred             EEcCCCEEEEEEEcCCCeeEEEEEEECC
Q psy15726        100 YQQKFANLLAEFTPTTTGVYKIDVLQGA  127 (171)
Q Consensus       100 ~~~~dg~y~v~y~P~~~G~~~i~V~~~g  127 (171)
                      .+..+|+|.+......+|.|.|.|++++
T Consensus        18 ~~~~~G~FLvR~s~~~~~~~~Lsv~~~~   45 (84)
T smart00252       18 KNEGDGDFLVRDSESEPGDYVLSVRVKG   45 (84)
T ss_pred             hcCCCcEEEEEcCCCCCCCEEEEEEECC
Confidence            3445899999999998999999999974


No 50 
>PF10670 DUF4198:  Domain of unknown function (DUF4198)
Probab=71.45  E-value=31  Score=26.30  Aligned_cols=62  Identities=19%  Similarity=0.130  Sum_probs=42.4

Q ss_pred             EeCCeEEEEEEeCCCcCCceEEEEECCCCCcc----ceEEEEcCCCEEEEEEEcCCCeeEEEEEEECC
Q psy15726         64 RSGIVNSFTLETCGVASSEFDVIVTSPQGAAV----PVRCYQQKFANLLAEFTPTTTGVYKIDVLQGA  127 (171)
Q Consensus        64 ~vg~~~~F~V~~~~ag~~~l~v~i~~p~g~~~----~~~v~~~~dg~y~v~y~P~~~G~~~i~V~~~g  127 (171)
                      .+|++..|.|.-.+..-....|.+..+++...    ...++-+.+|  .++|++..+|.|.|.+.+.+
T Consensus       148 ~~g~~~~~~vl~~GkPl~~a~V~~~~~~~~~~~~~~~~~~~TD~~G--~~~~~~~~~G~wli~a~~~~  213 (215)
T PF10670_consen  148 KAGDPLPFQVLFDGKPLAGAEVEAFSPGGWYDVEHEAKTLKTDANG--RATFTLPRPGLWLIRASHKD  213 (215)
T ss_pred             cCCCEEEEEEEECCeEcccEEEEEEECCCccccccceEEEEECCCC--EEEEecCCCEEEEEEEEEec
Confidence            47888888888777655567777776664321    2333333445  57788999999999988754


No 51 
>PF11589 DUF3244:  Domain of unknown function (DUF3244);  InterPro: IPR021638  This family of proteins with unknown function appear to be restricted to Bacteroidetes. The protein may have an immunoglobulin-like beta-sandwich fold however this cannot be confirmed. ; PDB: 3D33_B 3SD2_A.
Probab=71.33  E-value=10  Score=26.36  Aligned_cols=53  Identities=11%  Similarity=0.303  Sum_probs=32.4

Q ss_pred             CceEEEEECCCCCccceEEEEcCC-CEEEEEEEcCCCeeEEEEEEECCE-EcCCC
Q psy15726         81 SEFDVIVTSPQGAAVPVRCYQQKF-ANLLAEFTPTTTGVYKIDVLQGAR-PVRGS  133 (171)
Q Consensus        81 ~~l~v~i~~p~g~~~~~~v~~~~d-g~y~v~y~P~~~G~~~i~V~~~g~-~I~gS  133 (171)
                      +++.++|.+..|..+--+...+.. ..+.+.-.=...|.|.|.+..+.- .+-|.
T Consensus        48 ~~vtI~I~d~~G~vVy~~~~~~~~~~~~~I~L~~~~~G~Y~l~i~~~~g~~l~G~  102 (106)
T PF11589_consen   48 GDVTITIKDSTGNVVYSETVSNSAGQSITIDLNGLPSGEYTLEITNGNGTYLYGE  102 (106)
T ss_dssp             SEEEEEEEETT--EEEEEEESCGGTTEEEEE-TTS-SEEEEEEEEECTC-EEEEE
T ss_pred             CCEEEEEEeCCCCEEEEEEccCCCCcEEEEEeCCCCCccEEEEEEeCCCCEEEEE
Confidence            567888888777665444433322 356666655678999999988765 65554


No 52 
>smart00634 BID_1 Bacterial Ig-like domain (group 1).
Probab=70.09  E-value=30  Score=23.16  Aligned_cols=28  Identities=21%  Similarity=0.210  Sum_probs=24.0

Q ss_pred             EEcCCCEEEEEEEcCCCeeEEEEEEECC
Q psy15726        100 YQQKFANLLAEFTPTTTGVYKIDVLQGA  127 (171)
Q Consensus       100 ~~~~dg~y~v~y~P~~~G~~~i~V~~~g  127 (171)
                      ..+.+|...+.++...+|.+.|.+..++
T Consensus        57 ~Td~~G~a~~~l~~~~~G~~~vta~~~~   84 (92)
T smart00634       57 TTDANGIATVTLTSTTAGVYTVTASLEN   84 (92)
T ss_pred             eeCCCCEEEEEEECCCCcEEEEEEEECC
Confidence            4455899999999999999999998876


No 53 
>KOG4088|consensus
Probab=69.41  E-value=22  Score=26.62  Aligned_cols=53  Identities=19%  Similarity=0.398  Sum_probs=33.7

Q ss_pred             eEEEEEEeCC-CcCCceEEEEECCCCCccceEEEEcCCCEEEEEEEcC--C--CeeEEEEEE
Q psy15726         68 VNSFTLETCG-VASSEFDVIVTSPQGAAVPVRCYQQKFANLLAEFTPT--T--TGVYKIDVL  124 (171)
Q Consensus        68 ~~~F~V~~~~-ag~~~l~v~i~~p~g~~~~~~v~~~~dg~y~v~y~P~--~--~G~~~i~V~  124 (171)
                      -.+|++++++ .....+.+.+.   |+.+|+.+.+. -+.|.|+|+-.  .  +|.|.+.+.
T Consensus        44 i~EftLqCsn~~~n~~l~Aev~---Gk~~PVs~~~d-~~kyQVSW~ldhK~a~agt~~vr~F  101 (167)
T KOG4088|consen   44 ITEFTLQCSNNPKNIQLTAEVN---GKLIPVSISDD-TAKYQVSWTLDHKDAGAGTFNVRIF  101 (167)
T ss_pred             EEEEEEEeCCCCcceEEEEecC---CEEEeEEecCc-cceEEEEEEEeecccCCceEEEEEe
Confidence            3478888876 33345555543   78888776543 27999999854  2  455555543


No 54 
>PF09134 Invasin_D3:  Invasin, domain 3;  InterPro: IPR015217 This domain adopts a structure consisting of an immunoglobulin-like beta-sandwich, with seven strands in two beta-sheets, arranged in a Greek-key topology. It forms part of the extracellular region of the protein, which can be expressed as a soluble protein (Inv497) that binds integrins and promotes subsequent uptake by cells when attached to bacteria []. ; PDB: 1CWV_A.
Probab=68.18  E-value=38  Score=23.67  Aligned_cols=70  Identities=11%  Similarity=0.115  Sum_probs=36.0

Q ss_pred             CeEEEEEEeCCCcCCc---eE-EEEECCCCCccce-EEEEcCCCEEEEEEEcCCCeeEEEEEEECCEEcCCCCeEEE
Q psy15726         67 IVNSFTLETCGVASSE---FD-VIVTSPQGAAVPV-RCYQQKFANLLAEFTPTTTGVYKIDVLQGARPVRGSPYLCQ  138 (171)
Q Consensus        67 ~~~~F~V~~~~ag~~~---l~-v~i~~p~g~~~~~-~v~~~~dg~y~v~y~P~~~G~~~i~V~~~g~~I~gSPF~v~  138 (171)
                      ..+.+++.++|+.+..   +. +... .+|..+.. .+.+ ..|.|+++.+=..+|.+.|.=.++|+++..-|-++.
T Consensus        21 dta~LTfta~D~~g~pvsGl~~l~f~-~~g~~vtls~itE-~~GvYtATl~Gt~~G~vtI~P~v~G~~l~~~~~k~t   95 (99)
T PF09134_consen   21 DTATLTFTARDANGNPVSGLSVLAFV-QDGVPVTLSPITE-NPGVYTATLKGTSAGEVTITPQVNGQPLATLQKKLT   95 (99)
T ss_dssp             --EEEEEE-B-TTS-B--S--SEEEE-EEES--EE---EE--SS-EEEEEE-SS-EEEEEEEEETTEE-GGG-EEEE
T ss_pred             ceEEEEEEEEcCCCCcccCceeEEEE-ccCCCcEEccccc-cCCEEEEEEEcccCceEEEEEeeCCEEccccceeEE
Confidence            3455677777764421   11 2221 23332221 2334 458999999999999999999999999987665543


No 55 
>PRK13211 N-acetylglucosamine-binding protein A; Reviewed
Probab=67.89  E-value=92  Score=27.92  Aligned_cols=86  Identities=13%  Similarity=0.101  Sum_probs=52.4

Q ss_pred             CcEEEcCCCcceEeCCeE--EEEEEeCCCcCCceEEEEECCCCCccceEEEEcCCC--EEEEEEEcCCCeeEEEEEEE--
Q psy15726         52 EPSGSGLGLYQARSGIVN--SFTLETCGVASSEFDVIVTSPQGAAVPVRCYQQKFA--NLLAEFTPTTTGVYKIDVLQ--  125 (171)
Q Consensus        52 ~v~v~G~Gl~~~~vg~~~--~F~V~~~~ag~~~l~v~i~~p~g~~~~~~v~~~~dg--~y~v~y~P~~~G~~~i~V~~--  125 (171)
                      ...++|..-+-...+..+  .|+|.+.  +...+++.|.+.+|+.+--.-...+|+  .+++...-.++|.|.|.|+.  
T Consensus       312 ~lsVsGL~~eY~I~dG~~~i~ftv~a~--g~~~vta~V~d~~g~~~~~~~~~v~d~s~~vtL~Ls~~~AG~y~Lvv~~t~  389 (478)
T PRK13211        312 ELEVSGLAKEYKIGDGAATLDFTVTAT--GDMNVEATVYNHDGEALGSKSQTVNDGSQSVSLDLSKLKAGHHMLVVKAKP  389 (478)
T ss_pred             ceEEecCcceeEEcCCcEEEEEEEEec--cceEEEEEEEcCCCCeeeeeeEEecCCceeEEEecccCCCceEEEEEEEEe
Confidence            345555433223333333  4555554  456888999999987654332333333  56666777789999999885  


Q ss_pred             -CCEEcCCCCeEEEe
Q psy15726        126 -GARPVRGSPYLCQV  139 (171)
Q Consensus       126 -~g~~I~gSPF~v~V  139 (171)
                       +|+.+.-.-+.+.|
T Consensus       390 ~dG~~~~q~~~~~~v  404 (478)
T PRK13211        390 KDGELIKQQTLDFML  404 (478)
T ss_pred             CCCceeeeeeEEEEE
Confidence             45666445566666


No 56 
>PF01105 EMP24_GP25L:  emp24/gp25L/p24 family/GOLD;  InterPro: IPR009038  The GOLD (for Golgi dynamics) domain is a protein module found in several eukaryotic Golgi and lipid-traffic proteins. It is typically between 90 and 150 amino acids long. Most of the size difference observed in the GOLD-domain superfamily is traceable to a single large low-complexity insert that is seen in some versions of the domain. With the exception of the p24 proteins, which have a simple architecture with the GOLD domain as their only globular domain, all other GOLD-domain proteins contain additional conserved globular domains. In these proteins, the GOLD domain co-occurs with lipid-, sterol- or fatty acid-binding domains such as PH, CRAL-TRIO, FYVE oxysterol binding- and acyl CoA-binding domains, suggesting that these proteins may interact with membranes. The GOLD domain can also be found associated with a RUN domain, which may have a role in the interaction of various proteins with cytoskeletal filaments. The GOLD domain is predicted to mediate diverse protein-protein interactions []. A secondary structure prediction for the GOLD domain reveals that it is likely to adopt a compact all-beta-fold structure with six to seven strands. Most of the sequence conservation is centred on the hydrophobic cores that support these predicted strands. The predicted secondary-structure elements and the size of the conserved core of the domain suggests that it may form a beta- sandwich fold with the strands arranged in two beta sheets stacked on each other [].  Some proteins known to contain a GOLD domain are listed below:   Eukaryotic proteins of the p24 family.  Animal Sec14-like proteins. They are involved in secretion.  Human Golgi resident protein GCP60. It interacts with the Golgi integral membrane protein Giantin. Yeast oxysterol-binding protein homologue 3 (OSH3).  ; GO: 0006810 transport, 0016021 integral to membrane; PDB: 1P23_A 1M23_A.
Probab=67.50  E-value=1.8  Score=32.19  Aligned_cols=61  Identities=15%  Similarity=0.146  Sum_probs=0.0

Q ss_pred             EeCCeEEEEEEeCCCc-CCceEEEEECCCC--CccceEEEEcCCCEEEEEEEcCCCeeEEEEEEEC
Q psy15726         64 RSGIVNSFTLETCGVA-SSEFDVIVTSPQG--AAVPVRCYQQKFANLLAEFTPTTTGVYKIDVLQG  126 (171)
Q Consensus        64 ~vg~~~~F~V~~~~ag-~~~l~v~i~~p~g--~~~~~~v~~~~dg~y~v~y~P~~~G~~~i~V~~~  126 (171)
                      ..|....+.+...+.+ ...+.+.|.+|++  ..+--  .......-..+|++.+.|.|++.+.-.
T Consensus        19 ~~~~~i~~~y~v~~~~~~~~v~~~i~~~~~~~~~i~~--~~~~~~~~~f~f~~~~~G~y~iCf~n~   82 (183)
T PF01105_consen   19 PKGTTIRGSYRVTDGGGAYDVDFTIRDPDPNGEVIYS--KSDKESEGSFSFTAKESGEYQICFDNS   82 (183)
T ss_dssp             ------------------------------------------------------------------
T ss_pred             CCCcEEEEEEEEeeccccceEEEEEEecccCCceeee--ecccccCCcEEEEeccCCCEEEEEEcC
Confidence            3366666666666555 5788899995554  42221  111111235667778899988877643


No 57 
>PF09315 DUF1973:  Domain of unknown function (DUF1973);  InterPro: IPR015394 These functionally uncharacterised domains are found in various eukaryotic calcium-dependent chloride channels. 
Probab=67.42  E-value=37  Score=26.18  Aligned_cols=59  Identities=14%  Similarity=0.169  Sum_probs=35.7

Q ss_pred             EeCCeEEEEEEeCCCcCCceEEEEECCCCCccceEEEEcCCCEEEEEEE-cCCCeeEEEEEE
Q psy15726         64 RSGIVNSFTLETCGVASSEFDVIVTSPQGAAVPVRCYQQKFANLLAEFT-PTTTGVYKIDVL  124 (171)
Q Consensus        64 ~vg~~~~F~V~~~~ag~~~l~v~i~~p~g~~~~~~v~~~~dg~y~v~y~-P~~~G~~~i~V~  124 (171)
                      .+|....|.|.-...  ..+.+.+.+|+|+....-..|.......+.-- ..++|.+++.|.
T Consensus        26 tvG~~T~f~v~w~~~--~~p~i~L~~P~G~~~~~~~~d~~~~~~~i~ipg~ae~G~W~y~i~   85 (179)
T PF09315_consen   26 TVGNNTVFTVTWQNS--SPPSITLTDPSGTVYTTFTTDSNSKTARIQIPGTAEVGTWTYSIT   85 (179)
T ss_pred             CCCCCeEEEEEECCC--CCceEEEECCCCCEEeeeEEcccccEEEEECCCCcccccEEEEEe
Confidence            478888888876654  46788899999976544233433233333321 145666666663


No 58 
>KOG2996|consensus
Probab=67.19  E-value=5.1  Score=36.64  Aligned_cols=30  Identities=7%  Similarity=0.135  Sum_probs=26.6

Q ss_pred             EEEcCCCEEEEEEEcCCCeeEEEEEEECCE
Q psy15726         99 CYQQKFANLLAEFTPTTTGVYKIDVLQGAR  128 (171)
Q Consensus        99 v~~~~dg~y~v~y~P~~~G~~~i~V~~~g~  128 (171)
                      +....||||.|.|.+.+++++-|+|+|+.+
T Consensus       701 Lk~~~ngT~LVR~r~kea~e~AISikynne  730 (865)
T KOG2996|consen  701 LKNRPNGTYLVRYRTKEAKEFAISIKYNNE  730 (865)
T ss_pred             hhcCCCceEEEEecccchhheeEEEEeccc
Confidence            445678999999999999999999999973


No 59 
>PRK15095 FKBP-type peptidyl-prolyl cis-trans isomerase; Provisional
Probab=67.10  E-value=37  Score=25.42  Aligned_cols=79  Identities=18%  Similarity=0.183  Sum_probs=45.6

Q ss_pred             CCCeEEEecCCCCCCCcEEEcCCCcceEeCCeEEEEEEeCCCcC------------------Cce----EEEEECCCCCc
Q psy15726         37 SCPMEVPVVDPAVGREPSGSGLGLYQARSGIVNSFTLETCGVAS------------------SEF----DVIVTSPQGAA   94 (171)
Q Consensus        37 gSPf~v~V~~~~d~~~v~v~G~Gl~~~~vg~~~~F~V~~~~ag~------------------~~l----~v~i~~p~g~~   94 (171)
                      +.|+.+.+...   .-+.-.-.+|....+|....|+|....+-+                  +.+    .+.+.+++|+.
T Consensus        32 ~~P~~f~~G~g---~vi~gle~aL~gm~~Ge~~~v~ipp~~ayG~~d~~~v~~vp~~~f~~~~~~~~G~~~~~~~~~G~~  108 (156)
T PRK15095         32 GKPALFRLGDG---SLSEGLEQQLLGLKVGDKKTFSLEPEAAFGVPSPDLIQYFSRRDFMDAGEPEIGAIMLFTAMDGSE  108 (156)
T ss_pred             CCCEEEEeCCC---CccHHHHHHHcCCCCCCEEEEEEChHHhcCCCChHHEEEecHHHCCcccCCCCCCEEEEECCCCCE
Confidence            46777666532   111112234555788999999998765411                  011    13345666666


Q ss_pred             cceEEEEcCCCEEEEEEEcCCCee
Q psy15726         95 VPVRCYQQKFANLLAEFTPTTTGV  118 (171)
Q Consensus        95 ~~~~v~~~~dg~y~v~y~P~~~G~  118 (171)
                      .++.|.+..+.+.++.|.-.-+|.
T Consensus       109 ~~~~V~~i~~~~v~vD~NHPLAGk  132 (156)
T PRK15095        109 MPGVIREINGDSITVDFNHPLAGQ  132 (156)
T ss_pred             EEEEEEEEcCCEEEEECCCcCCCC
Confidence            666777777777777776555554


No 60 
>PRK12813 flgD flagellar basal body rod modification protein; Reviewed
Probab=66.71  E-value=65  Score=25.78  Aligned_cols=57  Identities=16%  Similarity=0.215  Sum_probs=39.2

Q ss_pred             CCeEEEEEEeCCCcCCceEEEEECCCCCccceEEEEcCCCEEEEEEEc-------CCCeeEEEEEEE
Q psy15726         66 GIVNSFTLETCGVASSEFDVIVTSPQGAAVPVRCYQQKFANLLAEFTP-------TTTGVYKIDVLQ  125 (171)
Q Consensus        66 g~~~~F~V~~~~ag~~~l~v~i~~p~g~~~~~~v~~~~dg~y~v~y~P-------~~~G~~~i~V~~  125 (171)
                      |....|.+....+ ...+.+.|.+.+|+.++..  +...|.+.++|-.       ..+|.|++.|..
T Consensus       110 g~~~~~~~~l~~~-a~~v~v~I~D~~G~vV~t~--~~~~G~~~f~WDG~d~~G~~l~~G~Yt~~V~A  173 (223)
T PRK12813        110 GTPVTISPNPAAD-ADKAELVVRDAAGAEVARE--TVPVGAGPVEWAGEDADGNPLPNGAYSFVVES  173 (223)
T ss_pred             CceeEEEEeccCC-CceEEEEEEcCCCCEEEEE--eeCCCceeEEeCCcCCCCCcCCCccEEEEEEE
Confidence            5566777765443 3578899999999876543  2345677777753       446899999875


No 61 
>PF05751 FixH:  FixH;  InterPro: IPR008620 This family consists of several Rhizobium FixH like proteins. It has been suggested that the four proteins FixG, FixH, FixI, and FixS may participate in a membrane-bound complex coupling the FixI cation pump with a redox process catalysed by FixG [].
Probab=65.92  E-value=48  Score=23.93  Aligned_cols=61  Identities=10%  Similarity=0.072  Sum_probs=42.6

Q ss_pred             eCCeEEEEEEeCC---CcCCceEEEEECCCCCcc--ceEEEEcCCCEEEEEEEcCCCeeEEEEEEE
Q psy15726         65 SGIVNSFTLETCG---VASSEFDVIVTSPQGAAV--PVRCYQQKFANLLAEFTPTTTGVYKIDVLQ  125 (171)
Q Consensus        65 vg~~~~F~V~~~~---ag~~~l~v~i~~p~g~~~--~~~v~~~~dg~y~v~y~P~~~G~~~i~V~~  125 (171)
                      .+....+.+.-.+   .....+++.+..|+....  ...+...++|.|.........|.+.|.|.+
T Consensus        67 ~~~~~~i~~~d~~g~~~~~~~~~l~l~rp~~~~~D~~~~l~~~~~g~y~~~~~~~~~G~W~l~l~~  132 (146)
T PF05751_consen   67 DDNSLTIRLTDPNGAPVSGAKLTLSLYRPTDAKLDFTLTLTESAPGVYRAPVPLLKKGRWYLRLDW  132 (146)
T ss_pred             cCCeEEEEEEcCCCCcCcCceEEEEEECCCCccCCeeEEeeECCCceEEEEcCCCCCccEEEEEEE
Confidence            3455555553222   123678888888886543  445667788999999988899999999955


No 62 
>TIGR03769 P_ac_wall_RPT actinobacterial surface-anchored protein domain. This model describes a repeat domain that one to three times in Actinobacterial proteins, some of which have LPXTG-type sortase recognition motifs for covalent attachment to the Gram-positive cell wall. Where it occurs with duplication in an LPXTG-anchored protein, it tends to be adjacent to the substrate-binding protein of the gene trio of an ABC transporter system, where that substrate-binding protein has a single copy of this same domain. This arrangement suggests a substrate-binding relay system, with the LPXTG protein acting as a substrate receptor.
Probab=64.78  E-value=7.5  Score=22.60  Aligned_cols=24  Identities=8%  Similarity=-0.140  Sum_probs=18.1

Q ss_pred             ccCCCCceeEEEEEEECCEEcCCC
Q psy15726         15 APSWISYGLVQARTLREETSFRSC   38 (171)
Q Consensus        15 ~~~P~~~G~~~i~V~~~g~~I~gS   38 (171)
                      +|...+||.|.+.++..+....|.
T Consensus         6 nW~FT~PG~Y~l~~~a~~~~~~G~   29 (41)
T TIGR03769         6 NWVFTKPGTYTLTVQATATLTDGK   29 (41)
T ss_pred             ceeeCCCeEEEEEEEEEEEeCCCc
Confidence            577889999999998865544443


No 63 
>COG4856 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=63.74  E-value=71  Score=27.77  Aligned_cols=74  Identities=12%  Similarity=0.070  Sum_probs=43.5

Q ss_pred             eeeeEEEEeecccc----------CCCCceeE-----EEEEEECCE--EcCCCCeEEEecCCCCCCCcEEEcC-CCc-ce
Q psy15726          3 FFTKSCILLSEKAP----------SWISYGLV-----QARTLREET--SFRSCPMEVPVVDPAVGREPSGSGL-GLY-QA   63 (171)
Q Consensus         3 ~~~~~~~~~~~~~~----------~P~~~G~~-----~i~V~~~g~--~I~gSPf~v~V~~~~d~~~v~v~G~-Gl~-~~   63 (171)
                      ||.-+||+|+=++.          .+.+.+.+     .++|+||.+  +|.|-|+++.|.         ++|+ ++. ..
T Consensus        16 ff~A~~Lfl~vn~~n~~N~~~t~~~~~~t~~~tltdvpvdvlYD~~~y~isg~~etV~Vt---------l~G~ns~~~~~   86 (403)
T COG4856          16 FFFAILLFLYVNNNNFNNPISTIKPTSETYTLTLTDVPVDVLYDSDKYFISGQPETVTVT---------LKGPNSIVLKS   86 (403)
T ss_pred             HHHHHHhheeecccccCCcccccccccccccceeeeceeEEEEccccccccCCceEEEEE---------EeCCcceeeee
Confidence            55667887775553          34455444     467889874  899999999874         4443 111 13


Q ss_pred             EeCCeEEEEEEeCCCcCCceEE
Q psy15726         64 RSGIVNSFTLETCGVASSEFDV   85 (171)
Q Consensus        64 ~vg~~~~F~V~~~~ag~~~l~v   85 (171)
                      +..+......|...++.|..++
T Consensus        87 ~~~~dFkV~ADLt~a~~Gt~ev  108 (403)
T COG4856          87 EKPEDFKVVADLTHAGVGTHEV  108 (403)
T ss_pred             ecCcCeEEEEEhhhcCCCceEe
Confidence            3444444455556665554443


No 64 
>KOG0544|consensus
Probab=63.66  E-value=40  Score=23.58  Aligned_cols=57  Identities=16%  Similarity=0.268  Sum_probs=37.6

Q ss_pred             CEEEEEEEcC-CCeeEEEEEEECCEEcCCCCeEEEecCCCceEEecCCcccEEcCCCEEEEEE
Q psy15726        105 ANLLAEFTPT-TTGVYKIDVLQGARPVRGSPYLCQVYDASKVKIEHKGLSNIVVNDPISFKCK  166 (171)
Q Consensus       105 g~y~v~y~P~-~~G~~~i~V~~~g~~I~gSPF~v~V~d~skv~~~G~gl~~~~~g~~~~F~Vd  166 (171)
                      .+.++.||=. ..|     =+++..-=+++||++++---.-++-+-.|+..-.+|+.+.++|.
T Consensus        21 qtvt~hYtg~L~dG-----~kfDSs~dr~kPfkf~IGkgeVIkGwdegv~qmsvGekakLti~   78 (108)
T KOG0544|consen   21 QTVTVHYTGTLQDG-----KKFDSSRDRGKPFKFKIGKGEVIKGWDEGVAQMSVGEKAKLTIS   78 (108)
T ss_pred             CEEEEEEEeEecCC-----cEeecccccCCCeeEEecCcceeechhhcchhccccccceeeec
Confidence            3667777632 333     24455555789999999655556666667666677777777764


No 65 
>cd00173 SH2 Src homology 2 domains; Signal transduction, involved in recognition of phosphorylated tyrosine (pTyr). SH2 domains typically bind pTyr-containing ligands via two surface pockets, a pTyr and hydrophobic binding pocket, allowing proteins with SH2 domains to localize to tyrosine phosphorylated sites.
Probab=62.49  E-value=17  Score=23.93  Aligned_cols=28  Identities=14%  Similarity=0.182  Sum_probs=24.8

Q ss_pred             EEcCCCEEEEEEEcCCCeeEEEEEEECC
Q psy15726        100 YQQKFANLLAEFTPTTTGVYKIDVLQGA  127 (171)
Q Consensus       100 ~~~~dg~y~v~y~P~~~G~~~i~V~~~g  127 (171)
                      .+..+|+|.+......+|.|.|.|++++
T Consensus        17 ~~~~~G~FLiR~s~~~~~~~~Lsv~~~~   44 (94)
T cd00173          17 KKKPDGTFLVRDSESSPGDYVLSVRVKG   44 (94)
T ss_pred             hcCCCceEEEEecCCCCCCEEEEEEECC
Confidence            3457899999999999999999999988


No 66 
>cd05771 IgC_Tapasin_R Tapasin-R immunoglobulin-like domain. IgC_Tapasin_R: Immunoglobulin-like domain on Tapasin-R. Tapasin is a V-C1 (variable-constant) immunoglobulin superfamily molecule present in the endoplasmic reticulum (ER), where it links MHC class I molecules to the transporter associated with antigen processing (TAP). Tapasin-R is a tapasin-related protein that contains similar structural motifs to Tapasin, with some marked differences, especially in the V domain, transmembrane and cytoplasmic regions. The majority of Tapasin-R is located within the ER; however, there may be some expression of Tapasin-R at the cell surface. Tapasin-R lacks an obvious ER retention signal.
Probab=61.86  E-value=56  Score=23.31  Aligned_cols=74  Identities=11%  Similarity=-0.036  Sum_probs=42.1

Q ss_pred             eccccCCCCceeEEEEEEECCEEcCCCCeEEEecCCCCCCCcEEEcCCCcceEeCCeEEEEEEeCCCcCCceEEEEECC
Q psy15726         12 SEKAPSWISYGLVQARTLREETSFRSCPMEVPVVDPAVGREPSGSGLGLYQARSGIVNSFTLETCGVASSEFDVIVTSP   90 (171)
Q Consensus        12 ~~~~~~P~~~G~~~i~V~~~g~~I~gSPf~v~V~~~~d~~~v~v~G~Gl~~~~vg~~~~F~V~~~~ag~~~l~v~i~~p   90 (171)
                      +=.+.++...|.|...+.. +..-...-.++.|..+   .++.+.-.... ...|....++-.+.+.-..++.+.....
T Consensus         4 ~i~~v~~~D~G~Y~C~~~~-~~~~~~~~~~l~v~~~---P~v~~~~~~~~-~~~~~~~~L~C~a~g~~P~~~~i~W~~~   77 (139)
T cd05771           4 TLPGLTVHDEGTYICSVST-PPHQAQQIIQLSVSEP---PRVRLSLEKLV-SMIEEPQTLICHIAGYYPLDVQVEWTRE   77 (139)
T ss_pred             EECCCCHHHCEEEEEEEEc-cCcceeeEEEEEEecC---CceEEeCCCce-eecCCcEEEEEEECCCCCCCeEEEEEEC
Confidence            3356788889999999863 3222334445666543   35555544332 2246666666666554344567776553


No 67 
>PF14874 PapD-like:  Flagellar-associated PapD-like
Probab=59.90  E-value=50  Score=22.15  Aligned_cols=74  Identities=16%  Similarity=0.142  Sum_probs=39.8

Q ss_pred             eEeCCeEEEEEEeCCCcCCceEEEEECCCCCccceE-------EEEcCCCEEEEEEE-cCCCeeEEEEEEECCEEcCCCC
Q psy15726         63 ARSGIVNSFTLETCGVASSEFDVIVTSPQGAAVPVR-------CYQQKFANLLAEFT-PTTTGVYKIDVLQGARPVRGSP  134 (171)
Q Consensus        63 ~~vg~~~~F~V~~~~ag~~~l~v~i~~p~g~~~~~~-------v~~~~dg~y~v~y~-P~~~G~~~i~V~~~g~~I~gSP  134 (171)
                      +.+|+....+|..++.+.-..+.++..|+.......       +.....-...|+|. +...|.|.-.+.+   ...++.
T Consensus        16 v~~g~~~~~~v~l~N~s~~p~~f~v~~~~~~~~~~~v~~~~g~l~PG~~~~~~V~~~~~~~~g~~~~~l~i---~~e~~~   92 (102)
T PF14874_consen   16 VFVGQTYSRTVTLTNTSSIPARFRVRQPESLSSFFSVEPPSGFLAPGESVELEVTFSPTKPLGDYEGSLVI---TTEGGS   92 (102)
T ss_pred             EccCCEEEEEEEEEECCCCCEEEEEEeCCcCCCCEEEECCCCEECCCCEEEEEEEEEeCCCCceEEEEEEE---EECCeE
Confidence            556666666666665554444444444431111111       11122246788999 6677988766654   344566


Q ss_pred             eEEEe
Q psy15726        135 YLCQV  139 (171)
Q Consensus       135 F~v~V  139 (171)
                      |.+.+
T Consensus        93 ~~i~v   97 (102)
T PF14874_consen   93 FEIPV   97 (102)
T ss_pred             EEEEE
Confidence            66655


No 68 
>PF12988 DUF3872:  Domain of unknown function, B. Theta Gene description (DUF3872);  InterPro: IPR024355 This entry represents proteins of unknown function found primarily in Bacteroides species. The Bacteroides thetaiotaomicron gene coding for this protein is located in a conjugate transposon and appears to be upregulated in the presence of host or other bacterial species compared to growth in pure culture [, ].; PDB: 2L3B_A 2L7Q_A.
Probab=58.39  E-value=64  Score=23.90  Aligned_cols=82  Identities=15%  Similarity=0.216  Sum_probs=40.1

Q ss_pred             EEcC-CCCeEEEecCCCCCCCcEEEcCCCcceEeCCeEEEEEEeCCCc---CCceEEEEECCCC---------Cc-cceE
Q psy15726         33 TSFR-SCPMEVPVVDPAVGREPSGSGLGLYQARSGIVNSFTLETCGVA---SSEFDVIVTSPQG---------AA-VPVR   98 (171)
Q Consensus        33 ~~I~-gSPf~v~V~~~~d~~~v~v~G~Gl~~~~vg~~~~F~V~~~~ag---~~~l~v~i~~p~g---------~~-~~~~   98 (171)
                      ..|. .=||++.+-|-.            ..+..|+.+++.......|   +....++-.-|+|         .. +|=+
T Consensus        25 LDIQQ~YpF~v~tmPVp------------k~I~~GeTvEIR~~l~reG~y~~t~Y~iRYFQ~dGkG~L~~~~g~~~~pND   92 (137)
T PF12988_consen   25 LDIQQAYPFTVETMPVP------------KKIKKGETVEIRCELKREGNYADTRYTIRYFQPDGKGTLRMDDGTVLLPND   92 (137)
T ss_dssp             S------SEEEEE----------------SS--TTEEEEEEEEEEESS--SS---EEEEE-SSS-EEEEETTS-EE-TTS
T ss_pred             cceeecCCcEEEEeccc------------cccCCCCEEEEEEEEecCceecccEEEEEEEeecCCEEEEecCCcEecccc
Confidence            3554 578988876641            2356677666655554322   1334444444444         22 3444


Q ss_pred             EEEcCCCEEEEEEEcCCCeeEEEEEEEC
Q psy15726         99 CYQQKFANLLAEFTPTTTGVYKIDVLQG  126 (171)
Q Consensus        99 v~~~~dg~y~v~y~P~~~G~~~i~V~~~  126 (171)
                      ..+...++|+.-||......+.|+|.+.
T Consensus        93 ~Y~L~~~~FRLYYTS~s~~~q~idv~ve  120 (137)
T PF12988_consen   93 RYPLEKEVFRLYYTSRSDDQQTIDVYVE  120 (137)
T ss_dssp             EEE-S-SEEEEEEEE-SSS-EEEEEEEE
T ss_pred             ceecCcCEEEEEEecCCCCCceeEEEEE
Confidence            5666678999999999999999998875


No 69 
>PF10648 Gmad2:  Immunoglobulin-like domain of bacterial spore germination;  InterPro: IPR018911  This domain is found linked to IPR019606 from INTERPRO in some bacterial proteins. It is predicted to contain an immunoglobulin-like all-beta fold. 
Probab=57.55  E-value=50  Score=22.33  Aligned_cols=46  Identities=13%  Similarity=0.226  Sum_probs=28.7

Q ss_pred             CceEEEEECCCCCcc-ceEEEE-cCC---CEE--EEEEEcCCCeeEEEEEEEC
Q psy15726         81 SEFDVIVTSPQGAAV-PVRCYQ-QKF---ANL--LAEFTPTTTGVYKIDVLQG  126 (171)
Q Consensus        81 ~~l~v~i~~p~g~~~-~~~v~~-~~d---g~y--~v~y~P~~~G~~~i~V~~~  126 (171)
                      +.+.++|.|.+|..+ .-.+.- .+.   |.|  ++.|.+..++.+.|.|...
T Consensus        29 gtv~~rv~D~~g~vl~e~~~~a~~g~~~~g~F~~tv~~~~~~~~~g~l~v~~~   81 (88)
T PF10648_consen   29 GTVNIRVRDGHGEVLAEGFVTATGGAPSWGPFEGTVSFPPPPPGKGTLEVFED   81 (88)
T ss_pred             eEEEEEEEcCCCcEEEEeeEEeccCCCcccceEEEEEeCCCCCCceEEEEEEe
Confidence            567777777777654 222221 111   455  5678877788999988754


No 70 
>PF00801 PKD:  PKD domain;  InterPro: IPR000601 The PKD (Polycystic Kidney Disease) domain was first identified in the Polycystic Kidney Disease protein, polycystin-1 (PDK1 gene), and contains an Ig-like fold consisting of a beta-sandwich of seven strands in two sheets with a Greek key topology, although some members have additional strands []. Polycystin-1 is a large cell-surface glycoprotein involved in adhesive protein-protein and protein-carbohydrate interactions; however it is not clear if the PKD domain mediates any of these interactions.  PKD domains are also found in other proteins, usually in the extracellular parts of proteins involved in interactions with other proteins. For example, domains with a PKD-type fold are found in archaeal surface layer proteins that protect the cell from extreme environments [], and in the human VPS10 domain-containing receptor SorCS2 [].; PDB: 1B4R_A 2KZW_A 2C4X_A 2C26_A 2Y72_B 3JQU_A 3JS7_B 1WGO_A 1L0Q_A.
Probab=57.36  E-value=27  Score=21.81  Aligned_cols=55  Identities=13%  Similarity=0.088  Sum_probs=33.0

Q ss_pred             ceEeCCeEEEEEEeCCCcCCceEEEEECCCCCccceEEEEcCCCEEEEEEEcCCCeeEEEEEEEC
Q psy15726         62 QARSGIVNSFTLETCGVASSEFDVIVTSPQGAAVPVRCYQQKFANLLAEFTPTTTGVYKIDVLQG  126 (171)
Q Consensus        62 ~~~vg~~~~F~V~~~~ag~~~l~v~i~~p~g~~~~~~v~~~~dg~y~v~y~P~~~G~~~i~V~~~  126 (171)
                      .+.+|+++.|.....  .+..+.....=.+    ....  ..  .-.++|+...+|.|.|.|...
T Consensus         7 ~~~~g~~v~f~~~~~--~g~~~~y~W~fgd----~~~~--~~--~~~~t~ty~~~G~y~V~ltv~   61 (69)
T PF00801_consen    7 YVPTGQPVTFTASSS--DGSPVTYSWDFGD----NGTV--ST--GSSVTHTYSSPGTYTVTLTVT   61 (69)
T ss_dssp             CEBTTSEEEEEETTT--TSSECEEEEE-SS----ESEE--EC--SSEEEEEESSSEEEEEEEEEE
T ss_pred             EEcCCCEEEEEEEcc--CCCCeEEEEEECC----CCcc--cc--CCCEEEEcCCCeEEEEEEEEE
Confidence            366788888888762  2344554444322    1111  11  125677888899999998764


No 71 
>PF03896 TRAP_alpha:  Translocon-associated protein (TRAP), alpha subunit;  InterPro: IPR005595  The alpha-subunit of the TRAP complex (TRAP alpha) is a single-spanning membrane protein of the endoplasmic reticulum (ER) which is found in proximity of nascent polypeptide chains translocating across the membrane [].; GO: 0005783 endoplasmic reticulum
Probab=54.73  E-value=1.1e+02  Score=25.41  Aligned_cols=81  Identities=17%  Similarity=0.155  Sum_probs=53.7

Q ss_pred             cceEeCCeEEEEEEeCCCcCCceEEEEE-----CCCCC----------ccceEEEEcCCCEEEEEEEc---CCCeeEEEE
Q psy15726         61 YQARSGIVNSFTLETCGVASSEFDVIVT-----SPQGA----------AVPVRCYQQKFANLLAEFTP---TTTGVYKID  122 (171)
Q Consensus        61 ~~~~vg~~~~F~V~~~~ag~~~l~v~i~-----~p~g~----------~~~~~v~~~~dg~y~v~y~P---~~~G~~~i~  122 (171)
                      .....|++++|.|.-++.|..++.|...     -|..-          .....|.....-++..+|+|   ..++.|.|.
T Consensus        93 ~~l~aG~~~~~LvgftN~g~~~~~V~~i~aSl~~p~d~~~~iqNfTa~~y~~~V~pg~~aT~~YsF~~~~~l~pr~f~L~  172 (285)
T PF03896_consen   93 KKLPAGEPVKFLVGFTNKGSEPFTVESIEASLRYPQDYSYYIQNFTAVRYNREVPPGEEATFPYSFTPSEELAPRPFGLV  172 (285)
T ss_pred             ccccCCCeEEEEEEEEeCCCCCEEEEEEeeeecCccccceEEEeecccccCcccCCCCeEEEEEEEecchhcCCcceEEE
Confidence            4578899999999998877766665533     23211          01111222233577888888   457888888


Q ss_pred             EEECCEEcCCCCeEEEecC
Q psy15726        123 VLQGARPVRGSPYLCQVYD  141 (171)
Q Consensus       123 V~~~g~~I~gSPF~v~V~d  141 (171)
                      |...=+...|.+|...+++
T Consensus       173 i~l~y~d~~g~~y~~~~fN  191 (285)
T PF03896_consen  173 INLIYEDSDGNQYQVTVFN  191 (285)
T ss_pred             EEEEEEeCCCCEEEEEEec
Confidence            8776566788899988854


No 72 
>PF13754 Big_3_4:  Bacterial Ig-like domain (group 3)
Probab=52.47  E-value=32  Score=20.83  Aligned_cols=26  Identities=19%  Similarity=0.161  Sum_probs=18.6

Q ss_pred             EEcCCCEEEEEEEcCCCeeEEEEEEE
Q psy15726        100 YQQKFANLLAEFTPTTTGVYKIDVLQ  125 (171)
Q Consensus       100 ~~~~dg~y~v~y~P~~~G~~~i~V~~  125 (171)
                      ....+|.|.+++.-...|.|.|.|..
T Consensus         7 t~~~~G~Ws~t~~~~~dG~y~itv~a   32 (54)
T PF13754_consen    7 TVDSDGNWSFTVPALADGTYTITVTA   32 (54)
T ss_pred             EECCCCcEEEeCCCCCCccEEEEEEE
Confidence            33566888888765578888877765


No 73 
>PF06832 BiPBP_C:  Penicillin-Binding Protein C-terminus Family;  InterPro: IPR009647 This conserved region of approximately 90 residues is found in a sub-group of bacterial Penicillin-Binding Proteins (PBPs). A variable length loop region separates this region from the transpeptidase unit (IPR001460 from INTERPRO). It is predicted to be a beta fold.
Probab=52.03  E-value=67  Score=21.21  Aligned_cols=20  Identities=15%  Similarity=0.232  Sum_probs=16.9

Q ss_pred             CEEEEEEEcCCCeeEEEEEE
Q psy15726        105 ANLLAEFTPTTTGVYKIDVL  124 (171)
Q Consensus       105 g~y~v~y~P~~~G~~~i~V~  124 (171)
                      ..+.+.|.|..+|.|+|.|.
T Consensus        59 ~~~~~~~~~~~~G~h~l~vv   78 (89)
T PF06832_consen   59 PGHQLFWQPDRPGEHTLTVV   78 (89)
T ss_pred             CCCeEEeCCCCCeeEEEEEE
Confidence            35688899999999999983


No 74 
>PRK05842 flgD flagellar basal body rod modification protein; Reviewed
Probab=51.93  E-value=1e+02  Score=25.77  Aligned_cols=73  Identities=14%  Similarity=0.017  Sum_probs=41.1

Q ss_pred             CcEEEcCCCcceEeCCeEEEEEEeC---CCcCCceEEEEECCCCCccceE-EEEc--CCCEEEEEEEc-------CCCee
Q psy15726         52 EPSGSGLGLYQARSGIVNSFTLETC---GVASSEFDVIVTSPQGAAVPVR-CYQQ--KFANLLAEFTP-------TTTGV  118 (171)
Q Consensus        52 ~v~v~G~Gl~~~~vg~~~~F~V~~~---~ag~~~l~v~i~~p~g~~~~~~-v~~~--~dg~y~v~y~P-------~~~G~  118 (171)
                      .|.+.|..+. ..-|....|.+...   .+....+.+.|.+.+|+.+... +...  ..|.+.++|--       ...|.
T Consensus       135 ~V~v~g~~~~-l~~g~~~~~~~~l~~~~~~~a~~v~I~I~Da~G~vVrTi~l~~~~~~aG~~~f~WDG~d~~G~~~p~G~  213 (295)
T PRK05842        135 IAETDVSGAN-FDGNNKLSFSLFFDEKIDASKGVPAIQILNENNELVKTIPLKDYNGQKGYINFEWDGLNEKGEKVPKGN  213 (295)
T ss_pred             EEEEecceEE-ecCCCceEEEEeccccccccCceEEEEEEcCCCCEEEEEecCcccCCCcceeEEECCCCCCCCcCCCcc
Confidence            4455444431 11233556665443   2233578899999999866432 2222  23677777743       34688


Q ss_pred             EEEEEEE
Q psy15726        119 YKIDVLQ  125 (171)
Q Consensus       119 ~~i~V~~  125 (171)
                      |++.|..
T Consensus       214 Yt~~V~a  220 (295)
T PRK05842        214 YKIKAEY  220 (295)
T ss_pred             eEEEEEE
Confidence            8888754


No 75 
>PF12975 DUF3859:  Domain of unknown function (DUF3859);  InterPro: IPR024331 This entry represents a functionally uncharacterised domain, which is found in bacterial proteins.; PDB: 3N6Y_B.
Probab=51.24  E-value=91  Score=22.50  Aligned_cols=70  Identities=17%  Similarity=0.193  Sum_probs=36.0

Q ss_pred             ceEeCCeEEEEEEeCCCcCCc-eEEEEECC---CCCc---cceEEEEc-CCCEEEEEEE-----cCCCeeEEEEEEECCE
Q psy15726         62 QARSGIVNSFTLETCGVASSE-FDVIVTSP---QGAA---VPVRCYQQ-KFANLLAEFT-----PTTTGVYKIDVLQGAR  128 (171)
Q Consensus        62 ~~~vg~~~~F~V~~~~ag~~~-l~v~i~~p---~g~~---~~~~v~~~-~dg~y~v~y~-----P~~~G~~~i~V~~~g~  128 (171)
                      .+.+|..--|.+....+.+.. |++.+..|   .+..   .+...... .+......|+     ...+|.+++.|..+|+
T Consensus        37 pa~lg~~FG~r~~~~~~~~~~~lt~~i~HP~ip~~~~g~~~~~~~~~v~~~~~~~~~~~f~~~~e~v~G~Wr~~i~~~g~  116 (125)
T PF12975_consen   37 PAELGIEFGFRYNIKKAKGEKPLTVCITHPGIPDGDKGDTVEPWQGEVYVGSNDWDFYTFDTPWEPVPGEWRFTIELDGK  116 (125)
T ss_dssp             E--TT-EEEEEEEES--TTS--EEEEEEEEEETTS-EEEEEEEEE---TT-S-EEEEEE--SGGG---EEEEEEEEETTE
T ss_pred             chhcCCEEEEEEEeecCCCCccEEEEEeCCCCCCCCCCCEECcEEEEecCCCcceEEEEcCcccccCCceEEEEEEECCE
Confidence            366777777888777665555 99999996   4422   11111111 1123333332     3347999999999998


Q ss_pred             EcC
Q psy15726        129 PVR  131 (171)
Q Consensus       129 ~I~  131 (171)
                      .|.
T Consensus       117 ~i~  119 (125)
T PF12975_consen  117 VIA  119 (125)
T ss_dssp             EEE
T ss_pred             EEE
Confidence            774


No 76 
>KOG1693|consensus
Probab=50.77  E-value=77  Score=25.08  Aligned_cols=51  Identities=18%  Similarity=0.247  Sum_probs=30.5

Q ss_pred             EEEEEEeCCCcCCceEEEEECCCCCccceEEEEcCCCEE-EEEEEcCCCeeEEEEE
Q psy15726         69 NSFTLETCGVASSEFDVIVTSPQGAAVPVRCYQQKFANL-LAEFTPTTTGVYKIDV  123 (171)
Q Consensus        69 ~~F~V~~~~ag~~~l~v~i~~p~g~~~~~~v~~~~dg~y-~v~y~P~~~G~~~i~V  123 (171)
                      +.|.......|.=+++..|++|+|+.+--   +.+. .| .+.|+....|.|+..-
T Consensus        47 ~~~~fqV~tGG~fDVD~~I~aPdgkvI~~---~~kk-~~~~~~f~ae~~G~Y~fCF   98 (209)
T KOG1693|consen   47 TSFEFQVQTGGHFDVDYDIEAPDGKVIYS---EKKK-RYDSFLFKAEGKGEYTFCF   98 (209)
T ss_pred             EEEEEEEEeCCceeeEEEEECCCCCEEee---cccc-ccccEEEEEecceEEEEEe
Confidence            44444443335567778899999975432   2222 23 5677777777777653


No 77 
>KOG4637|consensus
Probab=50.14  E-value=15  Score=31.77  Aligned_cols=34  Identities=21%  Similarity=0.275  Sum_probs=29.2

Q ss_pred             cceEEEEcCCCEEEEEEEcCCCeeEEEEEEECCE
Q psy15726         95 VPVRCYQQKFANLLAEFTPTTTGVYKIDVLQGAR  128 (171)
Q Consensus        95 ~~~~v~~~~dg~y~v~y~P~~~G~~~i~V~~~g~  128 (171)
                      +.-.+.|..||+|.|.=.-+..|+|++.++.+|.
T Consensus        35 v~~~L~d~PDGsFlVRdAstm~GdYTLtl~k~g~   68 (464)
T KOG4637|consen   35 VNKKLRDQPDGSFLVRDASTMQGDYTLTLRKGGN   68 (464)
T ss_pred             HHHHhcCCCCCcEEeeccccCCCceEEEEecCCc
Confidence            3445788999999999888899999999999985


No 78 
>TIGR03503 conserved hypothetical protein TIGR03503. This set of conserved hypothetical protein has a phylogenetic range that closely matches that of TIGR03501, a putative C-terminal protein targeting signal.
Probab=49.39  E-value=1.8e+02  Score=25.30  Aligned_cols=77  Identities=18%  Similarity=0.107  Sum_probs=48.7

Q ss_pred             eEeCCeEEEEEEeCCCcC----------CceEEEEECCCCC------ccceEE---EE--------cCCCEEEEEEE-cC
Q psy15726         63 ARSGIVNSFTLETCGVAS----------SEFDVIVTSPQGA------AVPVRC---YQ--------QKFANLLAEFT-PT  114 (171)
Q Consensus        63 ~~vg~~~~F~V~~~~ag~----------~~l~v~i~~p~g~------~~~~~v---~~--------~~dg~y~v~y~-P~  114 (171)
                      ...|++..|+....+.+.          -+++|++.+.+..      ..+..+   .|        -+||.|+..+. -.
T Consensus       102 l~~gE~lk~ta~L~~d~~~i~~~~~l~~v~~~v~f~~~~~~~~e~~~~~~~vvg~f~DdG~g~DE~p~DGvFT~~l~l~~  181 (374)
T TIGR03503       102 LFQGETLKVTAKLLNDGEPLKLPDFRDVVKLTVRFVSTNNAEDENFGADPIVVGEFEDDGEGLDERPGDGIFTGEFNLDV  181 (374)
T ss_pred             ccCCCeEEEEEEEecCCEEeechhhhheEEEEEEEEecCCccccccccCCEEEEeeccCCccCCCCCCCceEEEEeeccC
Confidence            556888888887765432          1344555543221      112212   12        26899999996 44


Q ss_pred             CCeeEEEEEEECC----------EEcCCCCeEEEe
Q psy15726        115 TTGVYKIDVLQGA----------RPVRGSPYLCQV  139 (171)
Q Consensus       115 ~~G~~~i~V~~~g----------~~I~gSPF~v~V  139 (171)
                      .+|.|.+.|..++          ..|.-.|+.+.+
T Consensus       182 ~~G~Y~~~v~~~n~vF~R~~~q~v~V~p~Pi~~~~  216 (374)
T TIGR03503       182 APGEYRPTYQSRNPVFLREVEQPVLVYPLPVSYTV  216 (374)
T ss_pred             CCceEEEEEEEcCceEEEEEEEeEEEECCCeeEEE
Confidence            6999999999876          356677888777


No 79 
>PF07703 A2M_N_2:  Alpha-2-macroglobulin family N-terminal region;  InterPro: IPR011625 This is a domain of the alpha-2-macroglobulin family. The alpha-macroglobulin (aM) family of proteins includes protease inhibitors [], typified by the human tetrameric a2-macroglobulin (a2M); they belong to the MEROPS proteinase inhibitor family I39, clan IL. These protease inhibitors share several defining properties, which include (i) the ability to inhibit proteases from all catalytic classes, (ii) the presence of a 'bait region' and a thiol ester, (iii) a similar protease inhibitory mechanism and (iv) the inactivation of the inhibitory capacity by reaction of the thiol ester with small primary amines. aM protease inhibitors inhibit by steric hindrance []. The mechanism involves protease cleavage of the bait region, a segment of the aM that is particularly susceptible to proteolytic cleavage, which initiates a conformational change such that the aM collapses about the protease. In the resulting aM-protease complex, the active site of the protease is sterically shielded, thus substantially decreasing access to protein substrates. Two additional events occur as a consequence of bait region cleavage, namely (i) the h-cysteinyl-g-glutamyl thiol ester becomes highly reactive and (ii) a major conformational change exposes a conserved COOH-terminal receptor binding domain [] (RBD). RBD exposure allows the aM protease complex to bind to clearance receptors and be removed from circulation []. Tetrameric, dimeric, and, more recently, monomeric aM protease inhibitors have been identified [, ].; PDB: 2QKI_D 3L3O_D 3NMS_A 2ICF_A 2A73_A 2ICE_D 2HR0_A 2A74_A 2XWJ_G 3OHX_A ....
Probab=48.92  E-value=92  Score=21.88  Aligned_cols=102  Identities=15%  Similarity=0.128  Sum_probs=63.3

Q ss_pred             ceEeCCeEEEEEEeCCCcCCceEEEEECCCCCccceEEEEcCCC--EEEEEEEcCCCeeEEEEEEE----CCEEcCCCCe
Q psy15726         62 QARSGIVNSFTLETCGVASSEFDVIVTSPQGAAVPVRCYQQKFA--NLLAEFTPTTTGVYKIDVLQ----GARPVRGSPY  135 (171)
Q Consensus        62 ~~~vg~~~~F~V~~~~ag~~~l~v~i~~p~g~~~~~~v~~~~dg--~y~v~y~P~~~G~~~i~V~~----~g~~I~gSPF  135 (171)
                      ...+|+.+.+.|.+... .+.+...|.+.+ +.+.....+..++  ++.+.-++.......|.+.+    +|+.+.+| -
T Consensus         9 ~~~~Ge~~~v~v~~~~~-~~~~~~~v~s~g-~I~~~~~~~~~~~~~~~~~~v~~~~~P~~~v~~~~v~~~~g~~~~~s-~   85 (136)
T PF07703_consen    9 SYKPGETAKVTVQSPFP-NGTFLYLVESRG-KIVSTGSVELKNGSTTFEFPVTPDMAPNFYVLAYYVRPADGEVVADS-V   85 (136)
T ss_dssp             SB-TTSEEEEEEEEESC-ESEEEEEEEETT-EEEEEEEEECTTTSSEEEEEE-GGGTSEEEEEEEEETTCTCEEEEEE-E
T ss_pred             CcCCCCEEEEEEEcCCC-ccEEEEEEEECC-eEEEEEEEEecCCcEEEEEecchhcCCcEEEEEEEEcCCCCeEEEEE-E
Confidence            47899999999998876 356777777644 4445444444443  56777777766666666654    55666655 3


Q ss_pred             EEEecC--CCceEEecCCcccEEcCCCEEEEEEc
Q psy15726        136 LCQVYD--ASKVKIEHKGLSNIVVNDPISFKCKS  167 (171)
Q Consensus       136 ~v~V~d--~skv~~~G~gl~~~~~g~~~~F~Vd~  167 (171)
                      .+.|..  ..++.+.-.- +...-|++++++|.+
T Consensus        86 ~i~V~~~~~~~v~l~~~~-~~~~Pg~~~~~~i~~  118 (136)
T PF07703_consen   86 WIEVEPCFELKVELTASP-DEYKPGEEVTLRIKA  118 (136)
T ss_dssp             EEEBGCSGSSSEEEEESS-SSBTTTSEEEEEEEE
T ss_pred             EEEecccccceEEEEEec-ceeCCCCEEEEEEEe
Confidence            444422  3456655421 356778888888876


No 80 
>PF08308 PEGA:  PEGA domain;  InterPro: IPR013229 This domain is found in both archaea and bacteria and has similarity to S-layer (surface layer) proteins. It is named after the characteristic PEGA sequence motif found in this domain. The secondary structure of this domain is predicted to be beta-strands.
Probab=46.73  E-value=71  Score=19.96  Aligned_cols=14  Identities=14%  Similarity=0.135  Sum_probs=8.2

Q ss_pred             CCCeeEEEEEEECC
Q psy15726        114 TTTGVYKIDVLQGA  127 (171)
Q Consensus       114 ~~~G~~~i~V~~~g  127 (171)
                      ..+|.|.|.|+..|
T Consensus        32 l~~G~~~v~v~~~G   45 (71)
T PF08308_consen   32 LPPGEHTVTVEKPG   45 (71)
T ss_pred             cCCccEEEEEEECC
Confidence            34566666666555


No 81 
>PF10633 NPCBM_assoc:  NPCBM-associated, NEW3 domain of alpha-galactosidase;  InterPro: IPR018905 This domain has been named NEW3, but its function is not known. It is found on proteins which are bacterial galactosidases [].; PDB: 1EUT_A 2BZD_A 1WCQ_C 2BER_A 1W8O_A 1EUU_A 1W8N_A.
Probab=46.37  E-value=78  Score=20.31  Aligned_cols=62  Identities=16%  Similarity=0.268  Sum_probs=31.2

Q ss_pred             EeCCeEEEEEEeCCCcC---CceEEEEECCCCCc---cceEEEEcCCC-EEEE--EEEcCC---CeeEEEEEEE
Q psy15726         64 RSGIVNSFTLETCGVAS---SEFDVIVTSPQGAA---VPVRCYQQKFA-NLLA--EFTPTT---TGVYKIDVLQ  125 (171)
Q Consensus        64 ~vg~~~~F~V~~~~ag~---~~l~v~i~~p~g~~---~~~~v~~~~dg-~y~v--~y~P~~---~G~~~i~V~~  125 (171)
                      ..|+...|++...+.|.   .++.+++..|.|=.   -+..+.....| +-.+  .-+|.+   +|.|.|.+..
T Consensus         2 ~~G~~~~~~~tv~N~g~~~~~~v~~~l~~P~GW~~~~~~~~~~~l~pG~s~~~~~~V~vp~~a~~G~y~v~~~a   75 (78)
T PF10633_consen    2 TPGETVTVTLTVTNTGTAPLTNVSLSLSLPEGWTVSASPASVPSLPPGESVTVTFTVTVPADAAPGTYTVTVTA   75 (78)
T ss_dssp             -TTEEEEEEEEEE--SSS-BSS-EEEEE--TTSE---EEEEE--B-TTSEEEEEEEEEE-TT--SEEEEEEEEE
T ss_pred             CCCCEEEEEEEEEECCCCceeeEEEEEeCCCCccccCCccccccCCCCCEEEEEEEEECCCCCCCceEEEEEEE
Confidence            35777778777765543   46778888898733   23333333332 3344  444432   6999888764


No 82 
>PF09912 DUF2141:  Uncharacterized protein conserved in bacteria (DUF2141);  InterPro: IPR018673  This family of conserved hypothetical proteins has no known function. 
Probab=45.98  E-value=1e+02  Score=21.63  Aligned_cols=28  Identities=21%  Similarity=0.285  Sum_probs=21.9

Q ss_pred             EEEcCCCEEEEEEEcCCCeeEEEEEEEC
Q psy15726         99 CYQQKFANLLAEFTPTTTGVYKIDVLQG  126 (171)
Q Consensus        99 v~~~~dg~y~v~y~P~~~G~~~i~V~~~  126 (171)
                      .....+|..+++|.-..+|.|.|.|.++
T Consensus        35 ~~~~~~~~~~~~f~~lp~G~YAi~v~hD   62 (112)
T PF09912_consen   35 KVPAKGGTVTITFEDLPPGTYAIAVFHD   62 (112)
T ss_pred             EEEcCCCcEEEEECCCCCccEEEEEEEe
Confidence            3444557888888877899999999886


No 83 
>PF04571 Lipin_N:  lipin, N-terminal conserved region;  InterPro: IPR007651 Mutations in the lipin gene lead to fatty liver dystrophy in mice. The protein has been shown to be phosphorylated by the TOR Ser/Thr protein kinases in response to insulin stimulation. This entry represents a conserved domain found at the N terminus of the member proteins [, ].
Probab=45.89  E-value=91  Score=22.24  Aligned_cols=44  Identities=16%  Similarity=0.208  Sum_probs=29.1

Q ss_pred             cceEEEEcCCCEEEE-----EE---EcCCCeeEEEEEEECCEEcCCCCeEEEecC
Q psy15726         95 VPVRCYQQKFANLLA-----EF---TPTTTGVYKIDVLQGARPVRGSPYLCQVYD  141 (171)
Q Consensus        95 ~~~~v~~~~dg~y~v-----~y---~P~~~G~~~i~V~~~g~~I~gSPF~v~V~d  141 (171)
                      +++-+.++.||+|..     +|   .-..+.+-.|+|..||+.+   +|.+++.+
T Consensus        28 iDVIVV~q~DGs~~sSPFhVRFGk~~vl~~~ek~V~I~VNG~~~---~~~MkLg~   79 (110)
T PF04571_consen   28 IDVIVVEQPDGSLKSSPFHVRFGKLGVLRPREKVVDIEVNGKPV---DFHMKLGE   79 (110)
T ss_pred             eeEEEEecCCCCEecCccEEEEcceeeecccCcEEEEEECCEEc---ceEEEECC
Confidence            455566666776543     33   2223556789999999997   58888744


No 84 
>PF09430 DUF2012:  Protein of unknown function (DUF2012);  InterPro: IPR019008  This domain is found in different proteins, including uncharacterised protein family UPF0480 and nodal modulators. A nodal modulator has been identified as part of a protein complex that participates in the nodal signaling pathway during vertebrate development []. 
Probab=45.85  E-value=1.1e+02  Score=21.81  Aligned_cols=37  Identities=19%  Similarity=0.341  Sum_probs=25.2

Q ss_pred             EEcCCCEEEEEEEcCCCeeEEEEEEECCEEcCCCCeEEEec
Q psy15726        100 YQQKFANLLAEFTPTTTGVYKIDVLQGARPVRGSPYLCQVY  140 (171)
Q Consensus       100 ~~~~dg~y~v~y~P~~~G~~~i~V~~~g~~I~gSPF~v~V~  140 (171)
                      .-+.||+|.+.=.|  +|.|.|+|..-+-.-  +|++|.|.
T Consensus        25 ~v~~dG~F~f~~Vp--~GsY~L~V~s~~~~F--~~~RVdV~   61 (123)
T PF09430_consen   25 FVRSDGSFVFHNVP--PGSYLLEVHSPDYVF--PPYRVDVS   61 (123)
T ss_pred             EecCCCEEEeCCCC--CceEEEEEECCCccc--cCEEEEEe
Confidence            34556777766655  589999988765443  45888875


No 85 
>PF13954 PapC_N:  PapC N-terminal domain; PDB: 2VQI_B 3FIP_A 3RFZ_E 3OHN_A 1ZDV_A 1ZE3_D 3BWU_D 1ZDX_A.
Probab=45.02  E-value=25  Score=25.77  Aligned_cols=20  Identities=10%  Similarity=-0.052  Sum_probs=16.7

Q ss_pred             CCCCceeEEEEEEECCEEcC
Q psy15726         17 SWISYGLVQARTLREETSFR   36 (171)
Q Consensus        17 ~P~~~G~~~i~V~~~g~~I~   36 (171)
                      +...||.|.++|.+||+.+.
T Consensus        25 ~~~~pG~Y~vdv~vN~~~~~   44 (146)
T PF13954_consen   25 NAIPPGEYSVDVYVNGKFIG   44 (146)
T ss_dssp             -SS-SEEEEEEEEETTEEEE
T ss_pred             CCCCCeEEEEEEEECCeeee
Confidence            57889999999999999885


No 86 
>PF00116 COX2:  Cytochrome C oxidase subunit II, periplasmic domain This family corresponds to chains b and o.;  InterPro: IPR002429 Cytochrome c oxidase (1.9.3.1 from EC) [, ] is an oligomeric enzymatic complex which is a component of the respiratory chain and is involved in the transfer of electrons from cytochrome c to oxygen. In eukaryotes this enzyme complex is located in the mitochondrial inner membrane; in aerobic prokaryotes it is found in the plasma membrane. The number of polypeptides in the complex ranges from 3-4 (prokaryotes), up to 13(mammals). Subunit 2 (CO II) transfers the electrons from cytochrome c to the catalytic subunit 1. It contains two adjacent transmembrane regions in its N terminus and the major part of the protein is exposed to the periplasmic or to the mitochondrial intermembrane space, respectively. CO II provides the substrate-binding site and contains a copper centre called Cu(A), probably the primary acceptor in cytochrome c oxidase. An exception is the corresponding subunit of the cbb3-type oxidase which lacks the copper A redox-centre. Several bacterial CO II have a C-terminal extension that contains a covalently bound haem c.; GO: 0004129 cytochrome-c oxidase activity, 0005507 copper ion binding, 0016020 membrane; PDB: 3OMN_D 3OMA_B 3OMI_D 3OM3_B 3EHB_B 1AR1_B 1QLE_B 3HB3_B 2IWK_B 2IWF_A ....
Probab=44.53  E-value=1.1e+02  Score=21.68  Aligned_cols=53  Identities=21%  Similarity=0.248  Sum_probs=34.0

Q ss_pred             ceEeCCeEEEEEEeCCCcCCceEEEEECCCCCccceEEEEcCCCEEEEEEEcCCCeeEEEE
Q psy15726         62 QARSGIVNSFTLETCGVASSEFDVIVTSPQGAAVPVRCYQQKFANLLAEFTPTTTGVYKID  122 (171)
Q Consensus        62 ~~~vg~~~~F~V~~~~ag~~~l~v~i~~p~g~~~~~~v~~~~dg~y~v~y~P~~~G~~~i~  122 (171)
                      .++.|++..|.+.+.|.- ..+.+    |+   ...++.-....+-.+.|+|.++|.|.+.
T Consensus        49 ~lp~g~~v~~~ltS~DVi-Hsf~i----p~---~~~k~d~~PG~~~~~~~~~~~~G~y~~~  101 (120)
T PF00116_consen   49 VLPAGQPVRFHLTSEDVI-HSFWI----PE---LGIKMDAIPGRTNSVTFTPDKPGTYYGQ  101 (120)
T ss_dssp             EEETTSEEEEEEEESSS--EEEEE----TT---CTEEEEEBTTCEEEEEEEESSSEEEEEE
T ss_pred             cccccceEeEEEEcCCcc-ccccc----cc---cCcccccccccceeeeeeeccCCcEEEc
Confidence            378899999999987752 22222    22   1233333334566788999999998764


No 87 
>PF12863 DUF3821:  Domain of unknown function (DUF3821);  InterPro: IPR024277 This is a domain largely confined to sequences from Methanomicrobiales. It is found in putative lipases but the function is unknown.
Probab=43.79  E-value=1.6e+02  Score=23.32  Aligned_cols=32  Identities=28%  Similarity=0.384  Sum_probs=23.7

Q ss_pred             ceEeCCeEEEEEEeCC--------CcCC--ceEEEEECCCCC
Q psy15726         62 QARSGIVNSFTLETCG--------VASS--EFDVIVTSPQGA   93 (171)
Q Consensus        62 ~~~vg~~~~F~V~~~~--------ag~~--~l~v~i~~p~g~   93 (171)
                      .+..|....|.|++.=        ....  -+.+.|++|+|.
T Consensus       101 ~V~~G~~v~FrI~tNL~~~~~R~g~~~~~~~v~I~V~~P~G~  142 (209)
T PF12863_consen  101 TVPRGDNVNFRIDTNLYSIFQRGGYTPGDGPVDIKVTTPSGA  142 (209)
T ss_pred             eeccCCeEEEEEcccHHHHhhcCCCCCCcceEEEEEeCCCCc
Confidence            3778999999998851        1122  388999999985


No 88 
>PF05738 Cna_B:  Cna protein B-type domain;  InterPro: IPR008454 This entry represents a repeated B region domain found in the collagen-binding surface protein Cna in Staphylococcus aureus, as well as other related domains. The B region domain of Cna has a prealbumin-like beta-sandwich fold of seven strands in two sheets with a Greek key topology []. However, this domain does not mediate collagen binding, the IPR008456 from INTERPRO region carries out that function; instead it appears to form a stalk that presents the ligand binding domain away from the bacterial cell surface. Cna is a collagen-binding MSCRAMM (Microbial Surface Component Recognizing Adhesive Matrix Molecules), and is necessary and sufficient for S. aureus cells to adhere to cartilage.; PDB: 2X5P_A 3RKP_A 3KPT_A 1VLF_T 1TI2_F 1TI6_D 1TI4_J 1VLE_V 1VLD_X 3PF2_A ....
Probab=43.14  E-value=79  Score=19.50  Aligned_cols=61  Identities=11%  Similarity=0.197  Sum_probs=34.6

Q ss_pred             EEEECCCCCccce-EEEEcCCCEEEEEEEcCCCeeEEEEEEE--CCEEcCCCCeEEEecCCCceEE
Q psy15726         85 VIVTSPQGAAVPV-RCYQQKFANLLAEFTPTTTGVYKIDVLQ--GARPVRGSPYLCQVYDASKVKI  147 (171)
Q Consensus        85 v~i~~p~g~~~~~-~v~~~~dg~y~v~y~P~~~G~~~i~V~~--~g~~I~gSPF~v~V~d~skv~~  147 (171)
                      +.+.+.++..... ...-..+|.|.+.-.  .+|.|.|.-.-  .|-.+...++.+.+.+-+++++
T Consensus         6 f~L~~~~~~~~~~~~~~Td~~G~~~f~~L--~~G~Y~l~E~~aP~GY~~~~~~~~~~i~~~~~~~v   69 (70)
T PF05738_consen    6 FELYDEDGNEVIEVTVTTDENGKYTFKNL--PPGTYTLKETKAPDGYQLDDTPYEFTITEDGDVTV   69 (70)
T ss_dssp             EEEEETTSEEEEEEEEEGGTTSEEEEEEE--ESEEEEEEEEETTTTEEEEECEEEEEECTTSCEEE
T ss_pred             EEEEECCCCEEEEEEEEECCCCEEEEeec--CCeEEEEEEEECCCCCEECCCceEEEEecCCEEEE
Confidence            3444444432221 344455676655543  57888887655  5666777777777655555443


No 89 
>cd03866 M14_CPM Peptidase M14 Carboxypeptidase (CP) M (CPM) belongs to the N/E subfamily of the M14 family of metallocarboxypeptidases (MCPs).The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPM is an extracellular glycoprotein, bound to cell membranes via a glycosyl-phosphatidylinositol on the C-terminus of the protein. It specifically removes C-terminal basic residues such as lysine and arginine from peptides and proteins. The highest levels of CPM have been found in human lung and placenta, but significant amounts are present in kidney, blood vessels, intestine, brain, and peripheral nerves. CPM has also been found in soluble form in various body fluids, including amniotic fluid, seminal plasma and urine. Due to its wide distribution in a variety of tissues, it is believed that it plays an important role in the cont
Probab=40.97  E-value=1.1e+02  Score=26.47  Aligned_cols=60  Identities=17%  Similarity=0.099  Sum_probs=37.0

Q ss_pred             eCCeEEEEEEeCCCc-------------------------CCceEEEEECCCCCccce-E--E---------EEcCCCEE
Q psy15726         65 SGIVNSFTLETCGVA-------------------------SSEFDVIVTSPQGAAVPV-R--C---------YQQKFANL  107 (171)
Q Consensus        65 vg~~~~F~V~~~~ag-------------------------~~~l~v~i~~p~g~~~~~-~--v---------~~~~dg~y  107 (171)
                      .+.+.+|+++.+...                         ...+.-.|.|.+|..++. .  +         ....+|.|
T Consensus       254 ~~~~~~~T~El~~~k~p~~~~l~~~w~~n~~~ll~~i~q~~~gI~G~V~D~~g~pi~~A~V~v~g~~~~~~~~T~~~G~y  333 (376)
T cd03866         254 WAQCFEITLELSCCKYPPEEQLPAFWEDNKAALIEYIKQVHLGVKGQVFDSNGNPIPNAIVEVKGRKHICPYRTNVNGEY  333 (376)
T ss_pred             hCceEEEEEEecCCCCCCHHHHHHHHHHhHHHHHHHHHHhcCceEEEEECCCCCccCCeEEEEEcCCceeEEEECCCceE
Confidence            467778888887531                         134666788888876652 2  2         23345666


Q ss_pred             EEEEEcCCCeeEEEEEEECC
Q psy15726        108 LAEFTPTTTGVYKIDVLQGA  127 (171)
Q Consensus       108 ~v~y~P~~~G~~~i~V~~~g  127 (171)
                      ..   ...+|.|.|.|...|
T Consensus       334 ~~---~l~pG~Y~v~vsa~G  350 (376)
T cd03866         334 FL---LLLPGKYMINVTAPG  350 (376)
T ss_pred             EE---ecCCeeEEEEEEeCC
Confidence            22   255777777777666


No 90 
>KOG1690|consensus
Probab=39.25  E-value=89  Score=24.79  Aligned_cols=46  Identities=22%  Similarity=0.066  Sum_probs=28.9

Q ss_pred             CceEEEEECCCCCccceEEEEcCCCEEEEEEEcCCCeeEEEEEEECC
Q psy15726         81 SEFDVIVTSPQGAAVPVRCYQQKFANLLAEFTPTTTGVYKIDVLQGA  127 (171)
Q Consensus        81 ~~l~v~i~~p~g~~~~~~v~~~~dg~y~v~y~P~~~G~~~i~V~~~g  127 (171)
                      -.+.|.|.+|-.+.- .-+.+.....=.++|+...+|+|+|.+.-+-
T Consensus        65 ~gm~VeV~e~fdnnh-~Vl~q~~ss~G~ftFta~~~GeH~IC~~s~s  110 (215)
T KOG1690|consen   65 IGMHVEVKETFDNNH-VVLSQQYSSEGDFTFTALTPGEHRICIQSNS  110 (215)
T ss_pred             ceEEEEeecCCCCce-EEEeecCCCCCceEEEccCCCceEEEEeccc
Confidence            346777877765421 1122222233468899999999999987543


No 91 
>COG5489 Uncharacterized conserved protein [Function unknown]
Probab=39.00  E-value=66  Score=22.74  Aligned_cols=40  Identities=8%  Similarity=0.060  Sum_probs=28.5

Q ss_pred             CCCcCCceEEEEECCCC-CccceEEEEc-CCCEEEEEEEcCC
Q psy15726         76 CGVASSEFDVIVTSPQG-AAVPVRCYQQ-KFANLLAEFTPTT  115 (171)
Q Consensus        76 ~~ag~~~l~v~i~~p~g-~~~~~~v~~~-~dg~y~v~y~P~~  115 (171)
                      ++.|..-|+|++.+|+- ..+.+.+..- ++++|.+-|.+..
T Consensus        63 s~~G~dYlsvkLddP~f~~~i~A~L~~~e~~~~~~liW~rp~  104 (107)
T COG5489          63 SNSGRDYLSVKLDDPSFGAPIYANLFPAEGEGTYALIWNRPK  104 (107)
T ss_pred             cccCcceEEEEecCCcCCCeeEeeeeecCCCCcEEEEecCCC
Confidence            45566789999999994 4555665443 4788998887543


No 92 
>PF09315 DUF1973:  Domain of unknown function (DUF1973);  InterPro: IPR015394 These functionally uncharacterised domains are found in various eukaryotic calcium-dependent chloride channels. 
Probab=37.16  E-value=1.9e+02  Score=22.19  Aligned_cols=85  Identities=14%  Similarity=0.026  Sum_probs=44.7

Q ss_pred             CCCceeEEEEEEECCEEcCCCCeEEEecCC---CCCCCcEEEc-CCCcceEeCCeEEEEEEeCCC----cCCceEEEEEC
Q psy15726         18 WISYGLVQARTLREETSFRSCPMEVPVVDP---AVGREPSGSG-LGLYQARSGIVNSFTLETCGV----ASSEFDVIVTS   89 (171)
Q Consensus        18 P~~~G~~~i~V~~~g~~I~gSPf~v~V~~~---~d~~~v~v~G-~Gl~~~~vg~~~~F~V~~~~a----g~~~l~v~i~~   89 (171)
                      .-|+|.+.+.|......  -.+..+.|...   .+..-+++.- .+-....-.++.-+..+-++.    -++.+.+.|.+
T Consensus        74 ~ae~G~W~y~i~~~~~~--~q~v~vtVtS~~~~~~~~Pitv~~~~~~~~~~~~~p~~IyA~V~qg~~PVlgA~V~A~V~~  151 (179)
T PF09315_consen   74 TAEVGTWTYSITNTSSS--SQTVTVTVTSRASSPTVPPITVTAWLSQSSVNFPSPVIIYAEVQQGYLPVLGANVTATVES  151 (179)
T ss_pred             CcccccEEEEEecCCCC--cceEEEEEEeeECCCCCCCEEEEEEECCCCcCCCCcEEEEEEEEECCeeeeCCEEEEEEcC
Confidence            37899999988533321  23456666532   1111222221 110111112343333333332    23788899999


Q ss_pred             CCCCccceEEEEcCC
Q psy15726         90 PQGAAVPVRCYQQKF  104 (171)
Q Consensus        90 p~g~~~~~~v~~~~d  104 (171)
                      |+|.....++.|++.
T Consensus       152 ~~g~~~~l~L~DnG~  166 (179)
T PF09315_consen  152 PSGNTVTLELLDNGA  166 (179)
T ss_pred             CCCCEEEEEEEcCCC
Confidence            999988888888754


No 93 
>KOG3287|consensus
Probab=37.12  E-value=1.8e+02  Score=23.46  Aligned_cols=58  Identities=17%  Similarity=0.236  Sum_probs=34.0

Q ss_pred             eEeCCeEEEEEEeC-CCcCCceEEEEECCCCCccceEEEEcCCCEEEEEEEcCCCeeEEEEE
Q psy15726         63 ARSGIVNSFTLETC-GVASSEFDVIVTSPQGAAVPVRCYQQKFANLLAEFTPTTTGVYKIDV  123 (171)
Q Consensus        63 ~~vg~~~~F~V~~~-~ag~~~l~v~i~~p~g~~~~~~v~~~~dg~y~v~y~P~~~G~~~i~V  123 (171)
                      +..|...++..+-. ++|..++..++.+|-|..+--+ ....+|.|+++  ++++|.|++..
T Consensus        52 v~~~~tle~eyQVi~G~GDl~i~Ftl~~P~G~~lv~~-q~k~dg~ht~e--~~e~GdY~~Cf  110 (236)
T KOG3287|consen   52 VPQGATLEVEYQVIDGAGDLDIDFTLLNPAGEVLVSD-QRKVDGVHTVE--VTETGDYQVCF  110 (236)
T ss_pred             ccCCeEEEEEEEEEecCCccceeeEEeCCCccEEeec-ccccCceeEee--ccCCcceEEEE
Confidence            34443334444333 4566788888999998432211 12345666654  67789998765


No 94 
>PF09394 Inhibitor_I42:  Chagasin family peptidase inhibitor I42;  InterPro: IPR018990 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties.   Chagasin reversible inhibitor of papain-like cysteine proteases []. Chagasin has a beta-barrel structure, which is a unique variant of the immunoglobulin fold with homology to human CD8alpha [, ].; PDB: 2NQD_A 2NNR_A 2H7W_B 3E1Z_A 3CBK_B 3CBJ_B 2OUL_B 2FO8_A 2WGN_B 2C34_A ....
Probab=36.34  E-value=68  Score=21.08  Aligned_cols=22  Identities=14%  Similarity=0.083  Sum_probs=19.0

Q ss_pred             CEEEEEEEcCCCeeEEEEEEEC
Q psy15726        105 ANLLAEFTPTTTGVYKIDVLQG  126 (171)
Q Consensus       105 g~y~v~y~P~~~G~~~i~V~~~  126 (171)
                      |+....|.+..+|...|...|.
T Consensus        53 g~~~f~f~a~~~G~~~i~~~y~   74 (92)
T PF09394_consen   53 GTRTFTFKALKPGTTTIKFEYR   74 (92)
T ss_dssp             EEEEEEEEESSSEEEEEEEEEE
T ss_pred             cEEEEEEEEecCeeEEEEEEEE
Confidence            6788899999999999998774


No 95 
>KOG1366|consensus
Probab=34.31  E-value=1.9e+02  Score=29.72  Aligned_cols=71  Identities=17%  Similarity=0.180  Sum_probs=50.0

Q ss_pred             EeCCeEEEEEEeCCC--cCCceEEEEECCCCCccceEE-EEcCCCEEEEEEE---cCCCeeEEEEEEECCEEcCCCC
Q psy15726         64 RSGIVNSFTLETCGV--ASSEFDVIVTSPQGAAVPVRC-YQQKFANLLAEFT---PTTTGVYKIDVLQGARPVRGSP  134 (171)
Q Consensus        64 ~vg~~~~F~V~~~~a--g~~~l~v~i~~p~g~~~~~~v-~~~~dg~y~v~y~---P~~~G~~~i~V~~~g~~I~gSP  134 (171)
                      ..|+...|.|-+.+.  ...++.|.+.+|+|+.+.-.. .++..|.|.-+|.   ....|.++|.|.+.+++=..++
T Consensus       127 ~pgqtV~~Rvv~~~~~~~~~~~~v~~~dp~~~~ikq~~~~~~~~gv~~~~f~ls~~~~~G~w~I~v~~~~~~~~~~~  203 (1436)
T KOG1366|consen  127 KPGQTVRYRVVTLNLKLKPFKVTVYIEDPEGNRIKQWLSVQSDGGVYSLSFQLSSEPILGDWKIRVQYKNGLQTKSS  203 (1436)
T ss_pred             CCCCEEEEEEEEeccccccccceEEEECCCCCchhhhhhccccCceEEeeecccCCCcCCceEEEEEEcCCCccccc
Confidence            346666777766432  123388999999998765433 3455688988886   3346999999999988777776


No 96 
>PF13753 SWM_repeat:  Putative flagellar system-associated repeat
Probab=33.83  E-value=2.7e+02  Score=22.89  Aligned_cols=27  Identities=19%  Similarity=0.203  Sum_probs=20.1

Q ss_pred             EEcCCCEEEEEEEc-----CCCeeEEEEEEEC
Q psy15726        100 YQQKFANLLAEFTP-----TTTGVYKIDVLQG  126 (171)
Q Consensus       100 ~~~~dg~y~v~y~P-----~~~G~~~i~V~~~  126 (171)
                      ....+|.|.+.|+|     ...|.|.+.+..-
T Consensus       193 ~~~~~g~~t~~~~~~~~~~~~d~~~~v~v~~t  224 (317)
T PF13753_consen  193 TVGADGTWTVTVTPSDLAGLADGTYTVTVTVT  224 (317)
T ss_pred             eecCCCcccccccccccccccCceEEEEEEee
Confidence            33467899999988     4568888887754


No 97 
>PF03345 DDOST_48kD:  Oligosaccharyltransferase 48 kDa subunit beta;  InterPro: IPR005013 During N-linked glycosylation of proteins, oligosaccharide chains are assembled on the carrier molecule dolichyl pyrophosphate in the following order: 2 molecules of N-acetylglucosamine (GlcNAc), 9 molecules of mannose, and 3 molecules of glucose. These 14-residue oligosaccharide cores are then transferred to asparagine residues on nascent polypeptide chains in the endoplasmic reticulum (ER). As proteins progress through the Golgi apparatus, the oligosaccharide cores are modified by trimming and extension to generate a diverse array of glycosylated proteins [, ]. The oligosaccharyl transferase complex (OST complex) 2.4.1.119 from EC transfers 14-sugar branched oligosaccharides from dolichyl pyrophosphate to asparagine residues []. The complex contains nine protein subunits: Ost1p, Ost2p, Ost3p, Ost4p, Ost5p, Ost6p, Stt3p, Swp1p, and Wbp1p, all of which are integral membrane proteins of the ER. The OST complex interacts with the Sec61p pore complex [] involved in protein import into the ER. This entry represents subunits OST3 and OST6. OST3 is homologous to OST6 [], and several lines of evidence indicate that they are alternative members of the OST complex. Disruption of both OST3 and OST6 causes severe underglycosylation of soluble and membrane-bound glycoproteins and a defect in the assembly of the complex. Hence, the function of these genes seems to be essential for recruiting a fully active complex necessary for efficient N-glycosylation []. This entry also includes the magnesium transporter protein 1, also known as OST3 homologue B, which might be involved in N-glycosylation through its association with the oligosaccharyl transferase (OST) complex. Wbp1p is the beta subunit of the OST complex, one of the original six subunits purified []. Wbp1 is essential [, ], but conditional mutants have decreased transferase activity [, ]. Wbp1p is homologous to mammalian OST48 [].; GO: 0004579 dolichyl-diphosphooligosaccharide-protein glycotransferase activity, 0018279 protein N-linked glycosylation via asparagine, 0005789 endoplasmic reticulum membrane
Probab=33.73  E-value=1.8e+02  Score=25.71  Aligned_cols=61  Identities=15%  Similarity=0.229  Sum_probs=38.7

Q ss_pred             eEeCCeEEEEEEeCCCc--------CCceE--EEEECCCCCccceEEEE----cCCCEEEEEEE-cCCCeeEEEEEEEC
Q psy15726         63 ARSGIVNSFTLETCGVA--------SSEFD--VIVTSPQGAAVPVRCYQ----QKFANLLAEFT-PTTTGVYKIDVLQG  126 (171)
Q Consensus        63 ~~vg~~~~F~V~~~~ag--------~~~l~--v~i~~p~g~~~~~~v~~----~~dg~y~v~y~-P~~~G~~~i~V~~~  126 (171)
                      .+++..+.|.|+...-.        ..+++  ..+.||=   ++..+..    .+.|+|..+|+ |..=|.|+..|.|.
T Consensus       279 Y~I~D~v~ysi~i~e~~~~~WvPf~~dDiQlEf~mLDPy---~R~~L~~~~~~~~~~~Y~~~FklPD~hGVF~F~vdY~  354 (423)
T PF03345_consen  279 YRIKDDVEYSIEISEWNNGKWVPFKADDIQLEFVMLDPY---VRLTLKPSYSTDDNGTYSTTFKLPDVHGVFTFKVDYK  354 (423)
T ss_pred             eEECCcEEEEEEEEEEeCCeEecCCCCcEEEEEEEcCcE---EEcccccccccCCCCEEEEEEECCCccceEEEEEEEe
Confidence            55666666666654311        13444  4455553   2222322    46699999997 88899999999986


No 98 
>PF13750 Big_3_3:  Bacterial Ig-like domain (group 3)
Probab=33.71  E-value=2e+02  Score=21.47  Aligned_cols=109  Identities=13%  Similarity=0.116  Sum_probs=55.9

Q ss_pred             CCCCceeEEEEE-EE----CCEEcCCCCeEEEecCCCCCCCcEE-EcCCCcc-eEeCCeEEEEEEeCCCcCC-ce-EEEE
Q psy15726         17 SWISYGLVQART-LR----EETSFRSCPMEVPVVDPAVGREPSG-SGLGLYQ-ARSGIVNSFTLETCGVASS-EF-DVIV   87 (171)
Q Consensus        17 ~P~~~G~~~i~V-~~----~g~~I~gSPf~v~V~~~~d~~~v~v-~G~Gl~~-~~vg~~~~F~V~~~~ag~~-~l-~v~i   87 (171)
                      +-...|.|.+.+ +-    ++..-....+.+.+ +.. +..+.+ .|..... ..+.......|...|.... .+ .+.+
T Consensus        10 ~~l~dG~Y~l~~~~a~D~agN~~~~~~~~~~~i-D~T-~Ptisi~~~~~~~~g~~v~~~~~i~i~~tD~~~~~~i~sv~l   87 (158)
T PF13750_consen   10 STLPDGSYTLTVVTATDAAGNTSTSTVSETFTI-DNT-PPTISISDGASVANGSTVYGLVNISINVTDNSDDSKITSVSL   87 (158)
T ss_pred             CcCCCccEEEEEEEEEecCCCEEEEEEeeEEEE-cCC-CCEEEEecCCccCCCccccceeeeEEEEEeCCCCceEEEEEE
Confidence            445679999988 32    22221112223333 221 223444 2223322 3444455666766665332 23 3445


Q ss_pred             EC-CCCCccceEEEEcCCCEEEEEEEcC-----CCeeEEEEEEECC
Q psy15726         88 TS-PQGAAVPVRCYQQKFANLLAEFTPT-----TTGVYKIDVLQGA  127 (171)
Q Consensus        88 ~~-p~g~~~~~~v~~~~dg~y~v~y~P~-----~~G~~~i~V~~~g  127 (171)
                      .+ |....+.......++|.|...|...     .-+.|.|.|.-.+
T Consensus        88 ~Gg~~~d~v~ls~~~~~~~~~~~~yp~~fpsle~~~~YtLtV~a~D  133 (158)
T PF13750_consen   88 TGGPASDSVSLSWTNKGNGVYTLEYPRIFPSLEADDSYTLTVSATD  133 (158)
T ss_pred             ECCcccceEEEeeEeccCceEEeecccccCCcCCCCeEEEEEEEEe
Confidence            43 3333344445577889999988632     3457999988644


No 99 
>PF13860 FlgD_ig:  FlgD Ig-like domain; PDB: 3C12_A 3OSV_A.
Probab=33.42  E-value=1.4e+02  Score=19.42  Aligned_cols=50  Identities=14%  Similarity=0.119  Sum_probs=29.4

Q ss_pred             ceEEEEECCCC-CccceEEEEcCCC-EEEEEEEcCCCeeEEEEEEECCEEcCCC
Q psy15726         82 EFDVIVTSPQG-AAVPVRCYQQKFA-NLLAEFTPTTTGVYKIDVLQGARPVRGS  133 (171)
Q Consensus        82 ~l~v~i~~p~g-~~~~~~v~~~~dg-~y~v~y~P~~~G~~~i~V~~~g~~I~gS  133 (171)
                      ...+.+.=|.. ..+.+.|.|...- .+++...+..+|.|.  +.|+|+.-.|.
T Consensus        13 ~~~~~~~l~~~a~~v~v~I~d~~G~~V~t~~~~~~~~G~~~--~~WdG~d~~G~   64 (81)
T PF13860_consen   13 KGSIEYTLPEDADNVTVTIYDSNGQVVRTISLGSQSAGEHS--FTWDGKDDDGN   64 (81)
T ss_dssp             EEEEEEEECSSCEEEEEEEEETTS-EEEEEEEEECSSEEEE--EEE-SB-TTS-
T ss_pred             EEEEEEeCCCcccEEEEEEEcCCCCEEEEEEcCCcCCceEE--EEECCCCCCcC
Confidence            45555333333 3467888887443 367888888999865  67886554444


No 100
>PRK10737 FKBP-type peptidyl-prolyl cis-trans isomerase; Provisional
Probab=33.37  E-value=1.3e+02  Score=23.49  Aligned_cols=56  Identities=14%  Similarity=0.156  Sum_probs=33.6

Q ss_pred             ceEeCCeEEEEEEeCCCcC------------------Cc----eEEEEECCCCCccceEEEEcCCCEEEEEEEcCCCee
Q psy15726         62 QARSGIVNSFTLETCGVAS------------------SE----FDVIVTSPQGAAVPVRCYQQKFANLLAEFTPTTTGV  118 (171)
Q Consensus        62 ~~~vg~~~~F~V~~~~ag~------------------~~----l~v~i~~p~g~~~~~~v~~~~dg~y~v~y~P~~~G~  118 (171)
                      ...+|...+|+|...+|-+                  ..    ..+....++| .+++.|++..+...++-|.-.-+|.
T Consensus        51 G~~~Gd~~~v~l~peeAyGe~d~~lV~~vpr~~F~~~~~l~~G~~~~~~~~~G-~~~~~V~ev~~d~V~vD~NHPLAG~  128 (196)
T PRK10737         51 GHEVGDKFDVAVGANDAYGQYDENLVQRVPKDVFMGVDELQVGMRFLAETDQG-PVPVEITAVEDDHVVVDGNHMLAGQ  128 (196)
T ss_pred             CCCCCCEEEEEEChHHhcCCCChHHEEEecHHHCCCccCCCCCCEEEEeCCCC-cEEEEEEEEcCCEEEEECCCcCCCC
Confidence            4678899999998766521                  00    1233344555 3566666666666677776555554


No 101
>KOG0393|consensus
Probab=33.35  E-value=45  Score=26.28  Aligned_cols=32  Identities=16%  Similarity=0.181  Sum_probs=25.7

Q ss_pred             eeEEEEeecc------ccCCCCceeEEEEEEEC-CEEcC
Q psy15726          5 TKSCILLSEK------APSWISYGLVQARTLRE-ETSFR   36 (171)
Q Consensus         5 ~~~~~~~~~~------~~~P~~~G~~~i~V~~~-g~~I~   36 (171)
                      -|+||+++..      +|.|+.-..|..+|..+ |.+|.
T Consensus        16 GKT~ll~~~t~~~fp~~yvPTVFdnys~~v~V~dg~~v~   54 (198)
T KOG0393|consen   16 GKTCLLISYTTNAFPEEYVPTVFDNYSANVTVDDGKPVE   54 (198)
T ss_pred             CceEEEEEeccCcCcccccCeEEccceEEEEecCCCEEE
Confidence            3788887766      58999999999999985 77664


No 102
>PF02922 CBM_48:  Carbohydrate-binding module 48 (Isoamylase N-terminal domain);  InterPro: IPR004193 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Enzymes containing this domain belong to family 13 (GH13 from CAZY) of the glycosyl hydrolases. This domain is found in a range of enzymes that act on branched substrates ie. isoamylase, pullulanase and branching enzyme. Isoamylase hydrolyses 1,6-alpha-D-glucosidic branch linkages in glycogen, amylopectin and dextrin; 1,4-alpha-glucan branching enzyme functions in the formation of 1,6-glucosidic linkages of glycogen; and pullulanase is a starch-debranching enzyme.; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 2BHZ_A 2BY2_A 2BY3_A 2BXY_A 2BY1_A 2BHY_A 2BHU_A 2BXZ_A 2BY0_A 2FHB_A ....
Probab=32.76  E-value=1.3e+02  Score=19.13  Aligned_cols=59  Identities=8%  Similarity=-0.037  Sum_probs=40.3

Q ss_pred             CeEEEEEEeCCCcCCceEEEEECCC-CCccceEEE-EcCCCEEEEEEE-cCCCe--eEEEEEEECC
Q psy15726         67 IVNSFTLETCGVASSEFDVIVTSPQ-GAAVPVRCY-QQKFANLLAEFT-PTTTG--VYKIDVLQGA  127 (171)
Q Consensus        67 ~~~~F~V~~~~ag~~~l~v~i~~p~-g~~~~~~v~-~~~dg~y~v~y~-P~~~G--~~~i~V~~~g  127 (171)
                      ..+.|.+=+.+|.  .+.+.+...+ ......... ...+|.|.++-. ....|  .|...|...+
T Consensus        11 ~~~~F~vwaP~A~--~V~l~~~~~~~~~~~~~~m~~~~~~G~w~~~~~~~~~~g~~~Y~y~i~~~~   74 (85)
T PF02922_consen   11 GGVTFRVWAPNAK--SVELVLYFNGSWPAEEYPMTRKDDDGVWEVTVPGDLPPGGYYYKYRIDGDD   74 (85)
T ss_dssp             TEEEEEEE-TTES--EEEEEEETTTSSEEEEEEEEEECTTTEEEEEEEGCGTTTT-EEEEEEEETT
T ss_pred             CEEEEEEECCCCC--EEEEEEEeeecCCCceEEeeecCCCCEEEEEEcCCcCCCCEEEEEEEEeCC
Confidence            5788999988774  6777777766 222334455 467899999987 34444  7888888776


No 103
>PF03422 CBM_6:  Carbohydrate binding module (family 6);  InterPro: IPR005084 A carbohydrate-binding module (CBM) is defined as a contiguous amino acid sequence within a carbohydrate-active enzyme with a discreet fold having carbohydrate-binding activity. A few exceptions are CBMs in cellulosomal scaffolding proteins and rare instances of independent putative CBMs. The requirement of CBMs existing as modules within larger enzymes sets this class of carbohydrate-binding protein apart from other non-catalytic sugar binding proteins such as lectins and sugar transport proteins. CBMs were previously classified as cellulose-binding domains (CBDs) based on the initial discovery of several modules that bound cellulose [, ]. However, additional modules in carbohydrate-active enzymes are continually being found that bind carbohydrates other than cellulose yet otherwise meet the CBM criteria, hence the need to reclassify these polypeptides using more inclusive terminology. Previous classification of cellulose-binding domains were based on amino acid similarity. Groupings of CBDs were called "Types" and numbered with roman numerals (e.g. Type I or Type II CBDs). In keeping with the glycoside hydrolase classification, these groupings are now called families and numbered with Arabic numerals. Families 1 to 13 are the same as Types I to XIII. For a detailed review on the structure and binding modes of CBMs see [].  This entry represents CBM6 from CAZY which was previously known as cellulose-binding domain family VI (CBD VI). CBM6 bind to amorphous cellulose, xylan, mixed beta-(1,3)(1,4)glucan and beta-1,3-glucan[, , ]. CBM6 adopts a classic lectin-like beta-jelly roll fold, predominantly consisting of five antiparallel beta-strands on one face and four antiparallel beta-strands on the other face. It contains two potential ligand binding sites, named respectively cleft A and B. These clefts include aromatic residues which are probably involved in the substrate binding. The cleft B is located on the concave surface of one beta-sheet, and the cleft A on one edge of the protein between the loop that connects the inner and outer beta-sheets of the jellyroll fold []. The multiple binding clefts confer the extensive range of specificities displayed by the domain [, , ].; GO: 0030246 carbohydrate binding; PDB: 1UY1_A 1UY3_A 1UY4_A 1UY2_A 1UYY_A 1UXZ_B 1UYZ_A 1UY0_B 1UYX_A 1UZ0_A ....
Probab=31.74  E-value=1.7e+02  Score=20.06  Aligned_cols=46  Identities=11%  Similarity=0.141  Sum_probs=25.5

Q ss_pred             CceEEEEECCCCCcc-ceEEEEcCC-CEE---EEEEEcCCCeeEEEEEEECC
Q psy15726         81 SEFDVIVTSPQGAAV-PVRCYQQKF-ANL---LAEFTPTTTGVYKIDVLQGA  127 (171)
Q Consensus        81 ~~l~v~i~~p~g~~~-~~~v~~~~d-g~y---~v~y~P~~~G~~~i~V~~~g  127 (171)
                      +.+++.+.+++|..+ .+.+...++ ..|   .+.. ....|.|.|.+.+.+
T Consensus        60 ~~~~l~id~~~g~~~~~~~~~~tg~w~~~~~~~~~v-~l~~G~h~i~l~~~~  110 (125)
T PF03422_consen   60 GTIELRIDGPDGTLIGTVSLPPTGGWDTWQTVSVSV-KLPAGKHTIYLVFNG  110 (125)
T ss_dssp             EEEEEEETTTTSEEEEEEEEE-ESSTTEEEEEEEEE-EEESEEEEEEEEESS
T ss_pred             cEEEEEECCCCCcEEEEEEEcCCCCccccEEEEEEE-eeCCCeeEEEEEEEC
Confidence            567777777666432 233322222 233   2222 334499999998876


No 104
>PF07563 DUF1541:  Protein of unknown function (DUF1541);  InterPro: IPR011438 This domain is found in several hypothetical bacterial proteins as a tandem repeat.; PDB: 2KY9_A 4FIB_B.
Probab=31.65  E-value=1e+02  Score=19.00  Aligned_cols=16  Identities=31%  Similarity=0.501  Sum_probs=11.2

Q ss_pred             CEEEEEEEcCCCeeEE
Q psy15726        105 ANLLAEFTPTTTGVYK  120 (171)
Q Consensus       105 g~y~v~y~P~~~G~~~  120 (171)
                      -.|.|.|+|+.-|+-.
T Consensus        30 tvY~V~ytPt~gg~~v   45 (53)
T PF07563_consen   30 TVYMVDYTPTTGGEKV   45 (53)
T ss_dssp             EEEEEEEE-TTT-SEE
T ss_pred             eEEEEEEEECCCCcEe
Confidence            3699999999988743


No 105
>COG1622 CyoA Heme/copper-type cytochrome/quinol oxidases, subunit 2 [Energy production and conversion]
Probab=31.42  E-value=2.6e+02  Score=22.71  Aligned_cols=69  Identities=16%  Similarity=0.004  Sum_probs=45.1

Q ss_pred             eEeCCeEEEEEEeCCCcCCceEEEEECCCCCccceEEEEcCCCEEEEEEEcCCCeeEEEE-EEECCEEcCCCCeEEEe
Q psy15726         63 ARSGIVNSFTLETCGVASSEFDVIVTSPQGAAVPVRCYQQKFANLLAEFTPTTTGVYKID-VLQGARPVRGSPYLCQV  139 (171)
Q Consensus        63 ~~vg~~~~F~V~~~~ag~~~l~v~i~~p~g~~~~~~v~~~~dg~y~v~y~P~~~G~~~i~-V~~~g~~I~gSPF~v~V  139 (171)
                      +.+|++..|.+.+.|.- ..|.+    |+-   ..++.-...-.+...++|.++|.|... =.|.|..-..=+|++.|
T Consensus       141 lPv~~~V~f~ltS~DVi-HsF~I----P~l---~~k~d~iPG~~~~~~~~~~~~G~Y~g~Cae~CG~gH~~M~~~v~v  210 (247)
T COG1622         141 LPVGRPVRFKLTSADVI-HSFWI----PQL---GGKIDAIPGMTTELWLTANKPGTYRGICAEYCGPGHSFMRFKVIV  210 (247)
T ss_pred             EeCCCeEEEEEEechhc-eeEEe----cCC---CceeeecCCceEEEEEecCCCeEEEEEcHhhcCCCcccceEEEEE
Confidence            78899999999988753 22221    211   112222222467788899999999776 34667666667788888


No 106
>PF13715 DUF4480:  Domain of unknown function (DUF4480)
Probab=29.30  E-value=1.6e+02  Score=18.91  Aligned_cols=26  Identities=12%  Similarity=0.176  Sum_probs=19.2

Q ss_pred             EEcCCCEEEEEEEcCCCeeEEEEEEECCE
Q psy15726        100 YQQKFANLLAEFTPTTTGVYKIDVLQGAR  128 (171)
Q Consensus       100 ~~~~dg~y~v~y~P~~~G~~~i~V~~~g~  128 (171)
                      .-+.+|.|.+. .|  .|.|.|.+.+-|-
T Consensus        30 ~Td~~G~F~i~-~~--~g~~~l~is~~Gy   55 (88)
T PF13715_consen   30 VTDENGRFSIK-LP--EGDYTLKISYIGY   55 (88)
T ss_pred             EECCCeEEEEE-Ec--CCCeEEEEEEeCE
Confidence            34457889888 44  6889999888774


No 107
>PF11763 DIPSY:  Cell-wall adhesin ligand-binding C-terminal;  InterPro: IPR021746  The DIPSY domain is characterised by the distinctive D*I*PSY motif at the very C terminus of yeast cell-wall glycoproteins. It appears not to be conserved in any other species, however. In fungi, cell adhesion is required for flocculation, mating and virulence, and is mediated by covalently bound cell wall proteins termed adhesins. Map4, an adhesin required for mating in Schizosaccharomyces pombe, is N-glycosylated and O-glycosylated, and is an endogenous substrate for the mannosyl transferase Oma4p. Map4 has a modular structure with an N-terminal signal peptide, a serine and threonine (S/T)-rich domain that includes nine repeats of 36 amino acids (rich in serine and threonine residues, but lacking glutamines), and a C-terminal DIPSY domain with no glycosyl-phosphatidyl inositol (GPI)-anchor signal. The N-terminal S/T-rich regions, are required for cell wall attachment, but the C-terminal DIPSY domain is required for agglutination and mating in liquid and solid media []. 
Probab=29.25  E-value=2.2e+02  Score=20.56  Aligned_cols=44  Identities=11%  Similarity=0.189  Sum_probs=30.6

Q ss_pred             eEEEEECCCCCccceEEEEcCCCEEEEEEEcCCCeeE-EEEEEECCE
Q psy15726         83 FDVIVTSPQGAAVPVRCYQQKFANLLAEFTPTTTGVY-KIDVLQGAR  128 (171)
Q Consensus        83 l~v~i~~p~g~~~~~~v~~~~dg~y~v~y~P~~~G~~-~i~V~~~g~  128 (171)
                      -++...+|.+.  .-.|.++.||+|+.+|+-...+++ .++++-+|+
T Consensus        50 Y~v~~~D~~~~--sf~I~kn~dG~~~Ft~~e~~~~ep~~l~~l~dgr   94 (123)
T PF11763_consen   50 YRVYHDDPNKS--SFNIIKNNDGTYQFTFVESSFSEPLDLHTLSDGR   94 (123)
T ss_pred             EEEeecCCCcc--eEEEEecCCCcEEEEEcccCCCCcEEEEEecCCc
Confidence            34555666543  456888999999999988777763 566666653


No 108
>KOG1691|consensus
Probab=29.18  E-value=2.3e+02  Score=22.58  Aligned_cols=42  Identities=24%  Similarity=0.282  Sum_probs=26.3

Q ss_pred             CceEEEEECCCCCccceEEEEcCCCEEEEEEEcCCCeeEEEEEEE
Q psy15726         81 SEFDVIVTSPQGAAVPVRCYQQKFANLLAEFTPTTTGVYKIDVLQ  125 (171)
Q Consensus        81 ~~l~v~i~~p~g~~~~~~v~~~~dg~y~v~y~P~~~G~~~i~V~~  125 (171)
                      ..+.+.|++|.|+... +..+...|  .+.|+..+.|.|.+...-
T Consensus        60 ~~~~~~Vts~~G~~~~-~~env~~g--qFaFta~e~~~y~~Cf~~  101 (210)
T KOG1691|consen   60 HKLSVKVTSPYGNNLH-SKENVTKG--QFAFTAEESGMYEACFTA  101 (210)
T ss_pred             ceEEEEEEcCCCceee-hhhccccc--eEEEEeccCCcEEEEEec
Confidence            4799999999996432 11122222  456777777777666554


No 109
>PF04465 DUF499:  Protein of unknown function (DUF499);  InterPro: IPR007555 This is a family of uncharacterised hypothetical prokaryotic proteins.
Probab=28.96  E-value=2e+02  Score=28.40  Aligned_cols=65  Identities=12%  Similarity=0.201  Sum_probs=47.2

Q ss_pred             eEeCCeEEEEEEeCCCcCCceEEEEECCCCCccceEEEEcCCCEEEEEEE---cCCCeeEEEEEEECCE
Q psy15726         63 ARSGIVNSFTLETCGVASSEFDVIVTSPQGAAVPVRCYQQKFANLLAEFT---PTTTGVYKIDVLQGAR  128 (171)
Q Consensus        63 ~~vg~~~~F~V~~~~ag~~~l~v~i~~p~g~~~~~~v~~~~dg~y~v~y~---P~~~G~~~i~V~~~g~  128 (171)
                      +..|+..+|.|...-.|...+.|.+....|....-++.- +.+.+.++|+   |.+.+.|.|.++.+|.
T Consensus       836 ~~pge~~~~~v~~~~~g~~~~~v~l~~~~g~l~~~~~~~-~~~~~~~~w~~~~~~~~~~~~i~~~~~~~  903 (1035)
T PF04465_consen  836 GKPGEVVEVKVVIKPLGNYRFEVELEVSYGELDSDEGIL-EGEPFKTTWRITIPKEKRIYRIEARSDGK  903 (1035)
T ss_pred             CCCCCceEEEEEEEEcCCCceeEEEEeccceeeecceee-cCCCceEEEEEeccCCCceEEEEEeccCc
Confidence            457888889998888887888888887766432222322 3456788884   7888999999999773


No 110
>PF00254 FKBP_C:  FKBP-type peptidyl-prolyl cis-trans isomerase;  InterPro: IPR001179 Synonym(s): Peptidylprolyl cis-trans isomerase FKBP-type peptidylprolyl isomerases (5.2.1.8 from EC) in vertebrates, are receptors for the two immunosuppressants, FK506 and rapamycin. The drugs inhibit T cell proliferation by arresting two distinct cytoplasmic signal transmission pathways. Peptidylprolyl isomerases accelerate protein folding by catalysing the cis-trans isomerisation of proline imidic peptide bonds in oligopeptides. These proteins are found in a variety of organisms.; GO: 0006457 protein folding; PDB: 1IX5_A 3JXV_A 3JYM_A 1T11_A 1PBK_A 1FD9_A 2VCD_A 3B7X_A 1Q6H_B 1Q6I_B ....
Probab=28.33  E-value=12  Score=24.78  Aligned_cols=39  Identities=13%  Similarity=0.184  Sum_probs=24.1

Q ss_pred             CCCCeEEEecCCCceEEecCCcccEEcCCCEEEEEEccC
Q psy15726        131 RGSPYLCQVYDASKVKIEHKGLSNIVVNDPISFKCKSTE  169 (171)
Q Consensus       131 ~gSPF~v~V~d~skv~~~G~gl~~~~~g~~~~F~Vd~~~  169 (171)
                      .+.||.+.+.+...+...-.+|..-.+|+...|.|....
T Consensus        31 ~~~~~~~~~g~~~~i~g~e~al~~m~~Ge~~~~~vp~~~   69 (94)
T PF00254_consen   31 EGEPFEFRLGSGQVIPGLEEALIGMKVGEKREFYVPPEL   69 (94)
T ss_dssp             TTSEEEEETTSSSSSHHHHHHHTTSBTTEEEEEEEEGGG
T ss_pred             cCcceeeeeccCccccchhhhcccccCCCEeeeEeCChh
Confidence            445898888655544444444555667777777776544


No 111
>MTH00047 COX2 cytochrome c oxidase subunit II; Provisional
Probab=26.73  E-value=3.1e+02  Score=21.35  Aligned_cols=71  Identities=17%  Similarity=0.154  Sum_probs=44.5

Q ss_pred             eEeCCeEEEEEEeCCCcCCceEEEEECCCCCccceEEEEcCCC-EEEEEEEcCCCeeEEEE-EEECCEEcCCCCeEEEec
Q psy15726         63 ARSGIVNSFTLETCGVASSEFDVIVTSPQGAAVPVRCYQQKFA-NLLAEFTPTTTGVYKID-VLQGARPVRGSPYLCQVY  140 (171)
Q Consensus        63 ~~vg~~~~F~V~~~~ag~~~l~v~i~~p~g~~~~~~v~~~~dg-~y~v~y~P~~~G~~~i~-V~~~g~~I~gSPF~v~V~  140 (171)
                      .++|++..|.+.+.|.= ..+.+    |+   ...++ |.-.| .-.+.++|.++|.|... -.+-|.--..=||.+.|.
T Consensus       120 lp~g~~v~~~ltS~DVi-Hsf~v----p~---l~~k~-d~~PG~~~~~~~~~~~~G~y~g~C~e~CG~~H~~M~~~v~v~  190 (194)
T MTH00047        120 LVYGVPYHLLVTSSDVI-HSFSV----PD---LNLKM-DAIPGRINHLFFCPDRHGVFVGYCSELCGVGHSYMPIVIEVV  190 (194)
T ss_pred             EeCCCEEEeeeecCccc-cceec----cc---cCcee-ecCCCceEEEEEEcCCCEEEEEEeehhhCcCcccCcEEEEEE
Confidence            67899999999988862 22222    11   01111 11223 34678889999998765 456677666677888775


Q ss_pred             CC
Q psy15726        141 DA  142 (171)
Q Consensus       141 d~  142 (171)
                      ++
T Consensus       191 ~~  192 (194)
T MTH00047        191 DV  192 (194)
T ss_pred             cC
Confidence            43


No 112
>PF09264 Sial-lect-inser:  Vibrio cholerae sialidase, lectin insertion;  InterPro: IPR015344 This domain is predominantly found in Vibrio cholerae sialidase, and adopt a beta sandwich structure consisting of 12-14 strands arranged in two beta-sheets. It binds to lectins with high affinity helping to target the protein to sialic acid-rich environments, thereby enhancing the catalytic efficiency of the enzyme []. ; PDB: 1W0P_A 1W0O_A 1KIT_A 2W68_B.
Probab=25.55  E-value=2e+02  Score=22.64  Aligned_cols=105  Identities=10%  Similarity=0.029  Sum_probs=54.9

Q ss_pred             EcCCCcceEeCCeEEEEEEeCCCcCCceEEEEECCCCCc-cceEEEEc-CCCEEEEEEEcCCCeeEEEEEEECCEEcCCC
Q psy15726         56 SGLGLYQARSGIVNSFTLETCGVASSEFDVIVTSPQGAA-VPVRCYQQ-KFANLLAEFTPTTTGVYKIDVLQGARPVRGS  133 (171)
Q Consensus        56 ~G~Gl~~~~vg~~~~F~V~~~~ag~~~l~v~i~~p~g~~-~~~~v~~~-~dg~y~v~y~P~~~G~~~i~V~~~g~~I~gS  133 (171)
                      .|--......+-..+|.+...=-..+.|.+.+.+.++.. ++....|. +=-.|.+.|.|..   +.-...++|..|. +
T Consensus        44 ~G~~n~~yyAnG~~r~l~~lsvn~sG~LvA~L~g~ss~~~~~~~~~di~gyH~Y~i~~~p~~---~tASfy~DG~lI~-t  119 (198)
T PF09264_consen   44 SGGCNTNYYANGSKRYLPILSVNESGSLVAELEGQSSNTLLATTGADIHGYHKYEIVFSPLT---NTASFYFDGTLIA-T  119 (198)
T ss_dssp             EES-EEEEEEESSEEEEEEEEE-TTS-EEEEETTS-S-EEEE-CHHHHCSEEEEEEEEETTT---TEEEEEETTEEEE-E
T ss_pred             cCcceeEEEcCCceEEEEEEEEcCCCCEEEEEecCCCcEEEecccccccceeEEEEEecCCC---CceEEEECCEEEe-e
Confidence            343333333333444444443223468888888877642 11111111 1146899999953   3788899999988 3


Q ss_pred             CeEEEecCCCceEEecCCccc---EEcCCCEEEEEE
Q psy15726        134 PYLCQVYDASKVKIEHKGLSN---IVVNDPISFKCK  166 (171)
Q Consensus       134 PF~v~V~d~skv~~~G~gl~~---~~~g~~~~F~Vd  166 (171)
                       +. --.-.+|..++|.|-+.   ..+=+.+.|+|+
T Consensus       120 -w~-G~~ssqn~I~wGNg~SstdG~~~yr~I~f~i~  153 (198)
T PF09264_consen  120 -WS-GEPSSQNMIVWGNGSSSTDGRAAYRDINFEIQ  153 (198)
T ss_dssp             -E---EEECEEEEEEEE--SSS-EEEEEEEEEEEET
T ss_pred             -cc-CccccCceEEEcCCccccccceeeeeEEEEEc
Confidence             21 11445678889988543   333466677663


No 113
>TIGR02148 Fibro_Slime fibro-slime domain. This model represents a conserved region of about 90 amino acids, shared in at least 4 distinct large putative proteins from the slime mold Dictyostelium discoideum and 10 proteins from the rumen bacterium Fibrobacter succinogenes, and in no other species so far. We propose here the name fibro-slime domain
Probab=25.31  E-value=96  Score=21.29  Aligned_cols=23  Identities=22%  Similarity=0.404  Sum_probs=20.5

Q ss_pred             eeEEEEEEECCEEcCCCCeEEEe
Q psy15726        117 GVYKIDVLQGARPVRGSPYLCQV  139 (171)
Q Consensus       117 G~~~i~V~~~g~~I~gSPF~v~V  139 (171)
                      ..|.+++.+..+|-.+|.|+++.
T Consensus        64 ~~Y~~d~F~~ERht~~S~~~i~T   86 (90)
T TIGR02148        64 KTYPFDIFYCERHTTGSNFRIRT   86 (90)
T ss_pred             cEeeEEEEEEeecCCCCCEEEEE
Confidence            45999999999999999999864


No 114
>PLN00115 pollen allergen group 3; Provisional
Probab=25.29  E-value=2.6e+02  Score=20.06  Aligned_cols=64  Identities=11%  Similarity=0.295  Sum_probs=37.2

Q ss_pred             EeCCeEEEEEEeCCCcCCceEEEEECCCCCccceEEEEcCCCEEEEEEE-cCC--Cee-EEEEEEECCEEcCCCCeEEEe
Q psy15726         64 RSGIVNSFTLETCGVASSEFDVIVTSPQGAAVPVRCYQQKFANLLAEFT-PTT--TGV-YKIDVLQGARPVRGSPYLCQV  139 (171)
Q Consensus        64 ~vg~~~~F~V~~~~ag~~~l~v~i~~p~g~~~~~~v~~~~dg~y~v~y~-P~~--~G~-~~i~V~~~g~~I~gSPF~v~V  139 (171)
                      ..|....|+|+... ....|.+.+   ++....++|.+.+..    .|. |..  -|. +.++   .+.+.+| ||.+++
T Consensus        21 ~~g~~v~F~V~~gS-np~yL~ll~---~~dI~~V~Ik~~g~~----~W~~~M~rswGavW~~~---s~~pl~G-PlS~R~   88 (118)
T PLN00115         21 SCATEVTFKVGKGS-SSTSLELVT---NVAISEVEIKEKGAK----DWVDDLKESSTNTWTLK---SKAPLKG-PFSVRF   88 (118)
T ss_pred             hcCCceEEEECCCC-CcceEEEEE---eCCEEEEEEeecCCC----cccCccccCccceeEec---CCCCCCC-ceEEEE
Confidence            46788899998532 334454333   555667777775432    355 443  243 2221   2567788 999998


No 115
>cd07696 IgC_CH3 CH3 domain (third constant Ig domain of the heavy chain) in immunoglobulin. IgC_CH3: The third immunoglobulin constant domain (IgC) of immunoglobulin (Ig) heavy chains. This domain is found on the Fc fragment. The basic structure of Ig molecules is a tetramer of two light chains and two heavy chains linked by disulfide bonds. There are two types of light chains: kappa and lambda; each composed of a constant domain and a variable domain. There are five types of heavy chains: alpha, delta, epsilon, gamma and mu, all consisting of a variable domain (VH) and three (in alpha, delta and gamma) or four (in epsilon and mu) constant domains (CH1 to CH4). Ig molecules are modular proteins, in which the variable and constant domains have clear, conserved sequence patterns.
Probab=24.82  E-value=2.2e+02  Score=19.18  Aligned_cols=72  Identities=13%  Similarity=0.107  Sum_probs=37.2

Q ss_pred             CCcceEeCCeEEEEEEeCCC-cCCceEEEEECCCCCccceEEEEcCCCEEEEEEEcCCCeeEEEEEEEC---CEEcCCCC
Q psy15726         59 GLYQARSGIVNSFTLETCGV-ASSEFDVIVTSPQGAAVPVRCYQQKFANLLAEFTPTTTGVYKIDVLQG---ARPVRGSP  134 (171)
Q Consensus        59 Gl~~~~vg~~~~F~V~~~~a-g~~~l~v~i~~p~g~~~~~~v~~~~dg~y~v~y~P~~~G~~~i~V~~~---g~~I~gSP  134 (171)
                      -.+....++.+.++-.+.+- -..++.|+....++.......         -...|...|.|.+.....   +.--.|.-
T Consensus         7 s~~~l~~~~~~tLtC~v~gf~yP~~i~v~W~r~~~~~~~~~~---------~~~~~~~dgty~~~s~l~v~~~~w~~g~~   77 (96)
T cd07696           7 SPKQLLLTKSVKLTCLVTNLTSPEGVNVTWTRGSGEPVKASI---------QPPESHNNATLSVTSTLNVSTDDWISGDT   77 (96)
T ss_pred             CHHHhccCCCeEEEEEEeCCCCCCCcEEEEEECCeEeeeEEe---------cccccCCCCCEEEEEEEEeCHHHccCCCc
Confidence            33445567777777777776 566777776654432111110         011344445555544432   11235667


Q ss_pred             eEEEe
Q psy15726        135 YLCQV  139 (171)
Q Consensus       135 F~v~V  139 (171)
                      |.|.|
T Consensus        78 ysC~V   82 (96)
T cd07696          78 FQCKV   82 (96)
T ss_pred             EEEEE
Confidence            77777


No 116
>PF07691 PA14:  PA14 domain;  InterPro: IPR011658 The PA14 domain forms an insert in bacterial beta-glucosidases, other glycosidases, glycosyltransferases, proteases, amidases, yeast adhesins and bacterial toxins, including anthrax protective antigen (PA). The domain also occurs in a Dictyostelium pre-spore cell-inducing factor Psi and in fibrocystin, the mammalian protein whose mutation leads to polycystic kidney and hepatic disease. The crystal structure of PA shows that this domain (named PA14 after its location in the PA20 pro-peptide) has a beta-barrel structure. The PA14 domain sequence suggests a binding function, rather than a catalytic role. The PA14 domain distribution is compatible with carbohydrate binding [].; PDB: 2XVG_A 2XVK_A 2XVL_A 2XJU_A 2XJT_A 2XJQ_A 2XJS_A 2XJV_A 2XJP_A 2XJR_A ....
Probab=24.47  E-value=1.5e+02  Score=20.78  Aligned_cols=21  Identities=24%  Similarity=0.469  Sum_probs=12.4

Q ss_pred             EEEEEEcCCCeeEEEEEEECC
Q psy15726        107 LLAEFTPTTTGVYKIDVLQGA  127 (171)
Q Consensus       107 y~v~y~P~~~G~~~i~V~~~g  127 (171)
                      ++-.++|.+.|.|++.+.-+|
T Consensus        50 ~~G~~~~~~~G~y~f~~~~~d   70 (145)
T PF07691_consen   50 WTGYFKPPETGTYTFSLTSDD   70 (145)
T ss_dssp             EEEEEEESSSEEEEEEEEESS
T ss_pred             EEEEEecccCceEEEEEEecc
Confidence            344556666677666665444


No 117
>PF12690 BsuPI:  Intracellular proteinase inhibitor;  InterPro: IPR020481 BsuPI is a intracellular proteinase inhibitor that directly regulates the major intracellular proteinase (ISP-1) activity in vivo. It inhibits ISP-1 in the early stages of sporulation and then may be inactivated by a membrane-bound proteinase [].; PDB: 3ISY_A.
Probab=24.40  E-value=1.7e+02  Score=19.31  Aligned_cols=17  Identities=12%  Similarity=0.339  Sum_probs=9.9

Q ss_pred             EEEEEEEcCC--CeeEEEE
Q psy15726        106 NLLAEFTPTT--TGVYKID  122 (171)
Q Consensus       106 ~y~v~y~P~~--~G~~~i~  122 (171)
                      +|..++-...  +|.|++.
T Consensus        63 ~~~~~~~~~~~~~G~Y~~~   81 (82)
T PF12690_consen   63 TYEETWDLKDLSPGEYTLE   81 (82)
T ss_dssp             EEEEEESS----SEEEEEE
T ss_pred             EEEEEECCCCCCCceEEEe
Confidence            4566665554  7888775


No 118
>PF04225 OapA:  Opacity-associated protein A LysM-like domain;  InterPro: IPR007340 This entry includes the Haemophilus influenzae opacity-associated protein. This protein is required for efficient nasopharyngeal mucosal colonization, and its expression is associated with a distinctive transparent colony phenotype. OapA is thought to be a secreted protein, and its expression exhibits high-frequency phase variation [].; PDB: 2GU1_A.
Probab=24.12  E-value=1.8e+02  Score=19.35  Aligned_cols=46  Identities=13%  Similarity=0.088  Sum_probs=20.8

Q ss_pred             CCCcceEeCCeEEEEEEeCCCcCCceEEEEECCCCCccceEEEEcCCCEEEE
Q psy15726         58 LGLYQARSGIVNSFTLETCGVASSEFDVIVTSPQGAAVPVRCYQQKFANLLA  109 (171)
Q Consensus        58 ~Gl~~~~vg~~~~F~V~~~~ag~~~l~v~i~~p~g~~~~~~v~~~~dg~y~v  109 (171)
                      .-|.+..+|+..+|.++..+. -..|.  +....   ..+......||.|.-
T Consensus        38 k~L~~L~pGq~l~f~~d~~g~-L~~L~--~~~~~---~~~~~~R~~DG~f~~   83 (85)
T PF04225_consen   38 KPLTRLKPGQTLEFQLDEDGQ-LTALR--YERSP---KTTLYTRQSDGSFSY   83 (85)
T ss_dssp             --GGG--TT-EEEEEE-TTS--EEEEE--EEEET---TEEEEEE-TTS-EEE
T ss_pred             chHhhCCCCCEEEEEECCCCC-EEEEE--EEcCC---cEEEEEEeCCCCEEe
Confidence            346678999999999985421 12232  32211   144555667788763


No 119
>PF14347 DUF4399:  Domain of unknown function (DUF4399)
Probab=24.06  E-value=1.4e+02  Score=20.18  Aligned_cols=22  Identities=9%  Similarity=-0.016  Sum_probs=17.2

Q ss_pred             CCceeEEEEEEECC-EEcCCCCe
Q psy15726         19 ISYGLVQARTLREE-TSFRSCPM   40 (171)
Q Consensus        19 ~~~G~~~i~V~~~g-~~I~gSPf   40 (171)
                      -+||.|.+.+.+++ .|++-+|-
T Consensus        59 L~PG~htLtl~~~d~~h~~~~~~   81 (87)
T PF14347_consen   59 LPPGKHTLTLQLGDGDHVPHDPP   81 (87)
T ss_pred             eCCCCEEEEEEeCCCCcccCCCc
Confidence            48999999988765 68777664


No 120
>cd03858 M14_CP_N-E_like Carboxypeptidase (CP) N/E-like subfamily of the M14 family of metallocarboxypeptidases (MCPs). The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. The N/E subfamily includes eight members, of which five (CPN, CPE, CPM, CPD, CPZ) are considered enzymatically active, while the other three are non-active (CPX1, PCX2, ACLP/AEBP1) and lack the critical active site and substrate-binding residues considered necessary for CP activity. These non-active members may function as binding proteins or display catalytic activity towards other substrates. Unlike the A/B CP subfamily, enzymes belonging to the N/E subfamily are not produced as inactive precursors that require proteolysis to produce the active form; rather, they rely on their substrate specificity and subcellular compartmentalization to prevent inappr
Probab=23.77  E-value=3.1e+02  Score=23.27  Aligned_cols=43  Identities=9%  Similarity=0.147  Sum_probs=27.2

Q ss_pred             ceEEEEECCCCCccc-eE---------EEEcCCCEEEEEEEcCCCeeEEEEEEECC
Q psy15726         82 EFDVIVTSPQGAAVP-VR---------CYQQKFANLLAEFTPTTTGVYKIDVLQGA  127 (171)
Q Consensus        82 ~l~v~i~~p~g~~~~-~~---------v~~~~dg~y~v~y~P~~~G~~~i~V~~~g  127 (171)
                      .+.-.|.+.+|..++ +.         ..-+.+|.|...   ..+|.|+|.|.+.|
T Consensus       299 ~i~G~V~d~~g~pl~~A~V~i~~~~~~~~Td~~G~f~~~---l~~G~y~l~vs~~G  351 (374)
T cd03858         299 GIKGFVRDANGNPIANATISVEGINHDVTTAEDGDYWRL---LLPGTYNVTASAPG  351 (374)
T ss_pred             ceEEEEECCCCCccCCeEEEEecceeeeEECCCceEEEe---cCCEeEEEEEEEcC
Confidence            455556666654432 11         223456888754   34999999999988


No 121
>PF08400 phage_tail_N:  Prophage tail fibre N-terminal;  InterPro: IPR013609 This entry represents the N terminus of phage 933W tail fibre protein. The characteristics of the protein distribution suggest prophage matches.
Probab=23.16  E-value=1.7e+02  Score=21.63  Aligned_cols=25  Identities=20%  Similarity=0.421  Sum_probs=19.2

Q ss_pred             cCCCEEEEEEEcCCCeeEEEEEEECCEE
Q psy15726        102 QKFANLLAEFTPTTTGVYKIDVLQGARP  129 (171)
Q Consensus       102 ~~dg~y~v~y~P~~~G~~~i~V~~~g~~  129 (171)
                      ..+|.|...-   ++|.|.|.++.+|++
T Consensus        44 ~~~G~Ys~~~---epG~Y~V~l~~~g~~   68 (134)
T PF08400_consen   44 GEAGEYSFDV---EPGVYRVTLKVEGRP   68 (134)
T ss_pred             CCCceEEEEe---cCCeEEEEEEECCCC
Confidence            3568887764   569999999998863


No 122
>PF13895 Ig_2:  Immunoglobulin domain; PDB: 2V5R_B 2V5M_A 2V5S_B 2GI7_A 3LAF_A 4DEP_C 3O4O_B 2EC8_A 2E9W_A 1J87_A ....
Probab=22.82  E-value=81  Score=19.19  Aligned_cols=14  Identities=7%  Similarity=0.150  Sum_probs=7.3

Q ss_pred             EEEEECCEEcCCCC
Q psy15726         26 ARTLREETSFRSCP   39 (171)
Q Consensus        26 i~V~~~g~~I~gSP   39 (171)
                      +.-..++..+..++
T Consensus        31 ~~w~~~~~~~~~~~   44 (80)
T PF13895_consen   31 VQWYKNGSPINSSQ   44 (80)
T ss_dssp             EEEEETTEEEEEES
T ss_pred             eeeeeeeeeeeeee
Confidence            44445666665443


No 123
>PF10980 DUF2787:  Protein of unknown function (DUF2787);  InterPro: IPR021248  This bacterial family of proteins has no known function. ; PDB: 2W56_A 2V1L_A.
Probab=22.56  E-value=1.5e+02  Score=21.68  Aligned_cols=37  Identities=11%  Similarity=-0.018  Sum_probs=24.4

Q ss_pred             eeEEEEeeccccCCCCceeEEEEEEECCEEcCCCCeEEE
Q psy15726          5 TKSCILLSEKAPSWISYGLVQARTLREETSFRSCPMEVP   43 (171)
Q Consensus         5 ~~~~~~~~~~~~~P~~~G~~~i~V~~~g~~I~gSPf~v~   43 (171)
                      .+..|+.-+.+|++++-|-|-|+|++.-  -.++++.+.
T Consensus        22 ~~ltlNFRD~~Ysae~GGfHPVEIrl~r--~~~~~W~i~   58 (128)
T PF10980_consen   22 NRLTLNFRDPSYSAERGGFHPVEIRLER--SSSDQWQIV   58 (128)
T ss_dssp             -EEEEEEE-TT-BTTTB-S--EEEEEEE---TTS-EEEE
T ss_pred             ceEEEEecCCCcccccCCcCCEEEEEEE--cCCCCEEEE
Confidence            3566888889999999999999999887  567888766


No 124
>PF08770 SoxZ:  Sulphur oxidation protein SoxZ;  InterPro: IPR014880 SoxZ forms an anti parallel beta structure and forms a complex with SoxY. Sulphur oxidation occurs at the thiol of a conserved cysteine residue of the SoxY subunit []. ; PDB: 1V8H_B 2OX5_E 2OXG_E 2OXH_C.
Probab=22.44  E-value=2.7e+02  Score=19.18  Aligned_cols=25  Identities=16%  Similarity=0.202  Sum_probs=14.7

Q ss_pred             eEEEEEEeCCCcCCceEEEEECCCCCc
Q psy15726         68 VNSFTLETCGVASSEFDVIVTSPQGAA   94 (171)
Q Consensus        68 ~~~F~V~~~~ag~~~l~v~i~~p~g~~   94 (171)
                      -..|.+...++|  .|++.++|-+|+.
T Consensus        69 ~l~F~~~~~~~g--~l~v~~~Dn~G~~   93 (100)
T PF08770_consen   69 YLRFSFKGKKSG--TLTVTWTDNKGNS   93 (100)
T ss_dssp             EEEEEEEESSSE--EEEEEEEETTS-E
T ss_pred             cEEEEEecCCCc--EEEEEEEECCCCE
Confidence            345666655443  7777777766653


No 125
>COG3354 FlaG Putative archaeal flagellar protein G [Cell motility and secretion]
Probab=22.21  E-value=3.3e+02  Score=20.47  Aligned_cols=44  Identities=18%  Similarity=0.107  Sum_probs=32.0

Q ss_pred             CCCEEEEEEEcCCCee-------EEEEEEECCEEcCCCCeEEEecCCCceE
Q psy15726        103 KFANLLAEFTPTTTGV-------YKIDVLQGARPVRGSPYLCQVYDASKVK  146 (171)
Q Consensus       103 ~dg~y~v~y~P~~~G~-------~~i~V~~~g~~I~gSPF~v~V~d~skv~  146 (171)
                      .+|.|+.+|--+-.|.       -.|.|..+|.-+.+.=-.+...+.+.++
T Consensus        66 ~~g~~t~t~yiKNtG~~~~~fd~~sitVliDG~iv~~a~~~~~~~~gs~i~  116 (154)
T COG3354          66 TDGPYTYTFYIKNTGSDSIAFDNTSITVLIDGNIVTPAYVTFTSVNGSSIR  116 (154)
T ss_pred             CCCceEEEEEEecCCCcccccCCCeEEEEEcCcEeccceEEEEecCCCeeE
Confidence            3477777776666554       4788999998888877777667777666


No 126
>cd03868 M14_CPD_I The first carboxypeptidase (CP)-like domain of  Carboxypeptidase D (CPD; EC 3.4.17.22), domain I. CPD differs from all other metallocarboxypeptidases in that it contains multiple CP-like domains. CPD belongs to the N/E-like subfamily of the M14 family of metallocarboxypeptidases (MCPs).The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPD is a single-chain protein containing a signal peptide, three tandem repeats of CP-like domains separated by short bridge regions, followed by a transmembrane domain, and a C-terminal cytosolic tail. The first two CP-like domains of CPD contain all of the essential active site and substrate-binding residues, the third CP-like domain lacks critical residues necessary for enzymatic activity and is inactive towards standard CP substrates. Domain I is optimally active at p
Probab=22.10  E-value=3.2e+02  Score=23.31  Aligned_cols=61  Identities=16%  Similarity=0.184  Sum_probs=40.4

Q ss_pred             EeCCeEEEEEEeCCCc-------------------------CCceEEEEECCCCCccc-eE--E-------EEcCCCEEE
Q psy15726         64 RSGIVNSFTLETCGVA-------------------------SSEFDVIVTSPQGAAVP-VR--C-------YQQKFANLL  108 (171)
Q Consensus        64 ~vg~~~~F~V~~~~ag-------------------------~~~l~v~i~~p~g~~~~-~~--v-------~~~~dg~y~  108 (171)
                      ..+.+.+|+++.++.-                         ...+.-.|.+++|..++ +.  +       ..+.+|.|.
T Consensus       254 ~~~~~~~~T~El~~~~~p~~~~l~~~w~~n~~al~~~~~~~~~~i~G~V~d~~g~pv~~A~V~v~~~~~~~~td~~G~y~  333 (372)
T cd03868         254 LHSNCFEITLELSCCKYPPASELPEEWNNNRESLLAYLEQVHIGVKGFVRDASGNPIEDATIMVAGIDHNVTTAKFGDYW  333 (372)
T ss_pred             hccCeeEEEEEecCCCCCCHHHHHHHHHHhHHHHHHHHHHhCCceEEEEEcCCCCcCCCcEEEEEecccceEeCCCceEE
Confidence            3567788888886520                         13467778888887655 22  2       233557785


Q ss_pred             EEEEcCCCeeEEEEEEECC
Q psy15726        109 AEFTPTTTGVYKIDVLQGA  127 (171)
Q Consensus       109 v~y~P~~~G~~~i~V~~~g  127 (171)
                       .  ...+|.|+|.|...|
T Consensus       334 -~--~l~~G~Y~l~vs~~G  349 (372)
T cd03868         334 -R--LLLPGTYTITAVAPG  349 (372)
T ss_pred             -e--cCCCEEEEEEEEecC
Confidence             2  356899999999877


No 127
>PF05949 DUF881:  Bacterial protein of unknown function (DUF881);  InterPro: IPR010273 This family consists of a series of hypothetical bacterial proteins. One of the family members Q45543 from SWISSPROT from Bacillus subtilis is thought to be involved in cell division and sporulation [].; PDB: 3GMG_B.
Probab=22.04  E-value=58  Score=24.20  Aligned_cols=21  Identities=29%  Similarity=0.653  Sum_probs=15.8

Q ss_pred             EEECCEEcCCCCeEEEe-cCCCc
Q psy15726        123 VLQGARPVRGSPYLCQV-YDASK  144 (171)
Q Consensus       123 V~~~g~~I~gSPF~v~V-~d~sk  144 (171)
                      ++.||+++. +||.+++ .||.+
T Consensus        81 i~Vng~~i~-~Py~I~AIGdp~~  102 (149)
T PF05949_consen   81 ILVNGRPIS-PPYVIKAIGDPET  102 (149)
T ss_dssp             EEETTEEE--SSEEEEEES-HHH
T ss_pred             EEECCEEcc-CCEEEEEEeCHHH
Confidence            889999997 8999998 45443


No 128
>PRK06655 flgD flagellar basal body rod modification protein; Reviewed
Probab=21.74  E-value=4.2e+02  Score=21.11  Aligned_cols=74  Identities=14%  Similarity=0.038  Sum_probs=42.1

Q ss_pred             EeCCeEEEEEEeCCCc-CCceEEEEECCCC-CccceEEEEcCCC-EEEEEEEcCCCeeEEEEEEECCE-----EcCCCCe
Q psy15726         64 RSGIVNSFTLETCGVA-SSEFDVIVTSPQG-AAVPVRCYQQKFA-NLLAEFTPTTTGVYKIDVLQGAR-----PVRGSPY  135 (171)
Q Consensus        64 ~vg~~~~F~V~~~~ag-~~~l~v~i~~p~g-~~~~~~v~~~~dg-~y~v~y~P~~~G~~~i~V~~~g~-----~I~gSPF  135 (171)
                      -+|+.+.+.=+.-... ++.....+.=++. ..+.++|.|.... .+++..-+..+|.|.  +.|+|.     +++..-|
T Consensus        95 lIGk~V~~~~~~~~~~~~~~~~~~~~l~~~a~~vti~I~D~~G~~Vrt~~lg~~~aG~~~--f~WDG~d~~G~~lp~G~Y  172 (225)
T PRK06655         95 LVGRGVLVPGDTVLVGTGGTTPFGVELPSAADNVTVTITDSAGQVVRTIDLGAQSAGVVS--FTWDGTDTDGNALPDGNY  172 (225)
T ss_pred             hcCCeEEEecceEEecCCCceEEEEEcCCCCcEEEEEEEcCCCCEEEEEecCCcCCCcee--EEECCCCCCCCcCCCeeE
Confidence            3666665544332111 2344555553443 3567788876433 345666678899875  557764     5555558


Q ss_pred             EEEe
Q psy15726        136 LCQV  139 (171)
Q Consensus       136 ~v~V  139 (171)
                      .++|
T Consensus       173 t~~V  176 (225)
T PRK06655        173 TIKA  176 (225)
T ss_pred             EEEE
Confidence            8887


No 129
>PF10651 DUF2479:  Domain of unknown function (DUF2479);  InterPro: IPR018913  This domain is found in phage from a number of different bacteria including (Listeria phage A118 (Bacteriophage A118)). It is thought to be a putative long tail fibre protein.; PDB: 4DIV_L 3UH8_A.
Probab=21.70  E-value=3.5e+02  Score=20.19  Aligned_cols=57  Identities=7%  Similarity=0.125  Sum_probs=36.5

Q ss_pred             EEEEECCCC---CccceEEEEcCCCEEEEEEEcC---CCeeE--EEEEEE--CCEEcCCCCeEEEec
Q psy15726         84 DVIVTSPQG---AAVPVRCYQQKFANLLAEFTPT---TTGVY--KIDVLQ--GARPVRGSPYLCQVY  140 (171)
Q Consensus        84 ~v~i~~p~g---~~~~~~v~~~~dg~y~v~y~P~---~~G~~--~i~V~~--~g~~I~gSPF~v~V~  140 (171)
                      .+.+..++|   ...++++.|..+|.++..+.-.   ..|.+  .+.+..  +++.+.-.-|.+.|.
T Consensus        58 ~l~l~~~dg~~~~~~~~~i~d~~~G~i~y~lp~~~l~~~G~v~a~~~i~~~~~~~~~~t~~F~F~V~  124 (170)
T PF10651_consen   58 KLVLKMSDGSIFIQDDVEIVDPTNGIIQYTLPDEFLKHVGKVKAQLFIYYNGDGQSISTANFTFEVE  124 (170)
T ss_dssp             EEE-TTTTSSSSEEEE-SEEETTTTEEEEE--TTTTHSSEEEEEEEEEEEESSSEEEE---EEEEEE
T ss_pred             EEEEEcCCCcEEEEecceEEcCcCcEEEEEECHHHhccCcEEEEEEEEEECCCCCEEEEeeEEEEEe
Confidence            334446666   2346778898899998888654   47887  567778  788888778999984


No 130
>cd05767 IgC_MHC_II_alpha Class II major histocompatibility complex (MHC) alpha chain immunoglobulin domain. IgC_MHC_II_alpha: Immunoglobulin (Ig) domain of major histocompatibility complex (MHC) class II alpha chain.  MHC class II molecules play a key role in the initiation of the antigen-specific immune reponse. In both humans and in mice these molecules have been shown to be expressed constitutively on the cell surface of professional antigen-presenting cells (APCs), for example on B-lymphocytes, monocytes, and macrophages. The expression of these molecules has been shown to be induced in nonprofessional APCs such as keratinocyctes, and they are expressed on the surface of activated human T cells and on T cells from other species. The MHC II molecules present antigenic peptides to CD4(+) T-lymphocytes. These peptides derive mostly from protelytic processing via the endocytic pathway, of antigens internalized by the APC. These peptides bind to the MHC class II molecules in the endosom
Probab=21.59  E-value=2.6e+02  Score=18.76  Aligned_cols=43  Identities=16%  Similarity=0.170  Sum_probs=20.2

Q ss_pred             CCeEEEEEEeCCCcCCceEEEEECCCCCccc-----eEEEEcCCCEEEE
Q psy15726         66 GIVNSFTLETCGVASSEFDVIVTSPQGAAVP-----VRCYQQKFANLLA  109 (171)
Q Consensus        66 g~~~~F~V~~~~ag~~~l~v~i~~p~g~~~~-----~~v~~~~dg~y~v  109 (171)
                      ++...++-.+.+=-..++.++... +|..+.     ..+..++||+|..
T Consensus        15 ~~~~tL~C~a~gFyP~~I~v~W~~-~g~~~~~~~~~~~~~~~~Dgty~~   62 (94)
T cd05767          15 GEPNTLICFVDNFFPPVLNVTWLK-NGVPVTDGVSETRYYPRQDLSFQK   62 (94)
T ss_pred             CCCeEEEEEEEEEECCcCEEEEEE-CCeEccCceEEeeeEeCCCCcEEE
Confidence            444445554444334567776652 333221     1234455666543


No 131
>PRK09937 stationary phase/starvation inducible regulatory protein CspD; Provisional
Probab=21.48  E-value=2.3e+02  Score=18.37  Aligned_cols=37  Identities=8%  Similarity=0.013  Sum_probs=25.6

Q ss_pred             EEcCCCcceEeCCeEEEEEEeCCCcCCceEEEEECCC
Q psy15726         55 GSGLGLYQARSGIVNSFTLETCGVASSEFDVIVTSPQ   91 (171)
Q Consensus        55 v~G~Gl~~~~vg~~~~F~V~~~~ag~~~l~v~i~~p~   91 (171)
                      +.+.|+.....|+.++|.+.....|.....|...++.
T Consensus        34 i~~~g~~~l~~G~~V~f~~~~~~~G~~A~~V~~~~~~   70 (74)
T PRK09937         34 IQMDGYRTLKAGQSVQFDVHQGPKGNHASVIVPVEVE   70 (74)
T ss_pred             ccccCCCCCCCCCEEEEEEEECCCCceeeEEEECCcc
Confidence            3456667788999999999877666555555555444


No 132
>cd06245 M14_CPD_III The third carboxypeptidase (CP)-like domain of Carboxypeptidase D (CPD; EC 3.4.17.22), domain III. CPD differs from all other metallocarboxypeptidases in that it contains multiple CP-like domains. CPD belongs to the N/E-like subfamily of the M14 family of metallocarboxypeptidases (MCPs).The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPD is a single-chain protein containing a signal peptide, three tandem repeats of CP-like domains separated by short bridge regions, followed by a transmembrane domain, and a C-terminal cytosolic tail. The first two CP-like domains of CPD contain all of the essential active site and substrate-binding residues, the third CP-like domain lacks critical residues necessary for enzymatic activity and is inactive towards standard CP substrates. Domain I is optimally active a
Probab=21.01  E-value=3.6e+02  Score=23.12  Aligned_cols=43  Identities=16%  Similarity=0.240  Sum_probs=25.9

Q ss_pred             eEEEEECCCCCccc-eE--------EEEcCCCEEEEEEEcCCCeeEEEEEEECCE
Q psy15726         83 FDVIVTSPQGAAVP-VR--------CYQQKFANLLAEFTPTTTGVYKIDVLQGAR  128 (171)
Q Consensus        83 l~v~i~~p~g~~~~-~~--------v~~~~dg~y~v~y~P~~~G~~~i~V~~~g~  128 (171)
                      +.-.|.+.+|..++ +.        +.-+.+|.|...   ..+|.|+|.|.+.|-
T Consensus       289 I~G~V~d~~g~pi~~A~V~v~g~~~~~T~~~G~y~~~---L~pG~y~v~vs~~Gy  340 (363)
T cd06245         289 VHGVVTDKAGKPISGATIVLNGGHRVYTKEGGYFHVL---LAPGQHNINVIAEGY  340 (363)
T ss_pred             EEEEEEcCCCCCccceEEEEeCCCceEeCCCcEEEEe---cCCceEEEEEEEeCc
Confidence            44456666665443 21        223345777764   447889999888773


No 133
>KOG4278|consensus
Probab=20.96  E-value=2.1e+02  Score=27.30  Aligned_cols=42  Identities=17%  Similarity=0.247  Sum_probs=32.3

Q ss_pred             CCCEEEEEEEcCCCeeEEEEEEECCEEcCCCCeEEEecCCCceEE
Q psy15726        103 KFANLLAEFTPTTTGVYKIDVLQGARPVRGSPYLCQVYDASKVKI  147 (171)
Q Consensus       103 ~dg~y~v~y~P~~~G~~~i~V~~~g~~I~gSPF~v~V~d~skv~~  147 (171)
                      -+|.|.|.=.-..+|.|.|.+.|.|+--.   |+|++..-.|+.+
T Consensus       172 InGSFLVRESEsSpgQ~sISlRyeGRVyH---YRINt~~dgK~yv  213 (1157)
T KOG4278|consen  172 INGSFLVRESESSPGQYSISLRYEGRVYH---YRINTDNDGKMYV  213 (1157)
T ss_pred             cccceEEeeccCCCcceeEEEEecceEEE---EEeeccCCccEEE
Confidence            46889998888899999999999998655   7777644444444


No 134
>KOG3342|consensus
Probab=20.91  E-value=1.2e+02  Score=23.09  Aligned_cols=28  Identities=14%  Similarity=0.118  Sum_probs=20.1

Q ss_pred             EEEeeccccCCCCceeEEEEEEECCEEcC
Q psy15726          8 CILLSEKAPSWISYGLVQARTLREETSFR   36 (171)
Q Consensus         8 ~~~~~~~~~~P~~~G~~~i~V~~~g~~I~   36 (171)
                      .|.|++.+..|.+.|+-.+- ++.|.+||
T Consensus        67 lLfL~N~~~~p~~vGdivVf-~vegR~IP   94 (180)
T KOG3342|consen   67 LLFLTNRNEDPIRVGDIVVF-KVEGREIP   94 (180)
T ss_pred             EEEEecCCCCcceeccEEEE-EECCccCc
Confidence            46677777888888876653 67777764


No 135
>PHA03376 BARF1; Provisional
Probab=20.76  E-value=1.7e+02  Score=23.29  Aligned_cols=24  Identities=21%  Similarity=0.115  Sum_probs=19.1

Q ss_pred             EEEcCCCEEEEEEEcCCCeeEEEE
Q psy15726         99 CYQQKFANLLAEFTPTTTGVYKID  122 (171)
Q Consensus        99 v~~~~dg~y~v~y~P~~~G~~~i~  122 (171)
                      ++-.+||+|++.|.-.+-|.|+..
T Consensus        94 l~lSDdGtY~C~fQkge~~~~k~e  117 (221)
T PHA03376         94 ANISHDGNYLCRMKLGETEVTKQE  117 (221)
T ss_pred             eeecCCceEEEEEEcCCCceeeee
Confidence            445677999999998888888765


No 136
>smart00720 calpain_III calpain_III.
Probab=20.59  E-value=1.6e+02  Score=21.07  Aligned_cols=26  Identities=15%  Similarity=0.344  Sum_probs=19.2

Q ss_pred             EcCCCEEEE---EEEcCCCeeEEEEEEEC
Q psy15726        101 QQKFANLLA---EFTPTTTGVYKIDVLQG  126 (171)
Q Consensus       101 ~~~dg~y~v---~y~P~~~G~~~i~V~~~  126 (171)
                      ....|.|.+   +|.|.+.|.|.|.|.-.
T Consensus       109 ~L~~G~Y~iVPsT~~p~~~g~F~LrV~s~  137 (143)
T smart00720      109 RLPPGEYVIVPSTFEPNQEGDFLLRVFSE  137 (143)
T ss_pred             EcCCCCEEEEEeecCCCCccCEEEEEEec
Confidence            345577855   68888889999888654


No 137
>KOG3392|consensus
Probab=20.58  E-value=1.2e+02  Score=21.97  Aligned_cols=21  Identities=10%  Similarity=0.048  Sum_probs=19.0

Q ss_pred             cCCCCceeEEEEEEECCEEcC
Q psy15726         16 PSWISYGLVQARTLREETSFR   36 (171)
Q Consensus        16 ~~P~~~G~~~i~V~~~g~~I~   36 (171)
                      |-|...|.-++.|.++++||.
T Consensus        79 p~pd~~gkqelei~~~~ehis   99 (147)
T KOG3392|consen   79 PPPDRVGRQELEIVIGDEHIS   99 (147)
T ss_pred             cCCCcccceeEEEEECCeEEE
Confidence            478899999999999999993


No 138
>PHA03290 envelope glycoprotein I; Provisional
Probab=20.54  E-value=1.9e+02  Score=24.83  Aligned_cols=35  Identities=9%  Similarity=-0.022  Sum_probs=27.5

Q ss_pred             EEEeeccccCCCCceeEEEEEEECCEEcCCCCeEEE
Q psy15726          8 CILLSEKAPSWISYGLVQARTLREETSFRSCPMEVP   43 (171)
Q Consensus         8 ~~~~~~~~~~P~~~G~~~i~V~~~g~~I~gSPf~v~   43 (171)
                      .++|.=.++.|...|.|.+.|+++|.. .-+=|.+.
T Consensus       128 ~vLL~I~~P~~~DSGiY~LRV~Ldga~-~sDvF~ls  162 (357)
T PHA03290        128 EIIFKINKPGIEDAGIYLLLVQLDHSR-LFDGFFLG  162 (357)
T ss_pred             ceEEEeCCCCcccCeeEEEEEEeCCCc-ccceEEEE
Confidence            456777889999999999999999986 34446544


No 139
>PF10342 GPI-anchored:  Ser-Thr-rich glycosyl-phosphatidyl-inositol-anchored membrane family;  InterPro: IPR018466 This entry represents glycoproteins involved in cell wall (1-->6)-beta-glucan assembly. In yeast a null mutation leads to severe growth defects, aberrant multi-budded morphology, and mating defects [, ]. The entry includes DRMIP and Hesp-379, which are involved in both fruiting body formation and in host attack respectively. Hesp-379 is a haustorially expressed secreted protein; the haustorium being the small sucker that penetrates host tissue []. 
Probab=20.48  E-value=2.5e+02  Score=18.07  Aligned_cols=65  Identities=9%  Similarity=0.108  Sum_probs=34.3

Q ss_pred             eEeCCeEEEEEEeCCCcCCceEEEEECCCCCc--cceEEE---EcCCCEEEEEEEcC--CCeeEEEEEEECC
Q psy15726         63 ARSGIVNSFTLETCGVASSEFDVIVTSPQGAA--VPVRCY---QQKFANLLAEFTPT--TTGVYKIDVLQGA  127 (171)
Q Consensus        63 ~~vg~~~~F~V~~~~ag~~~l~v~i~~p~g~~--~~~~v~---~~~dg~y~v~y~P~--~~G~~~i~V~~~g  127 (171)
                      ..+|++..++.+..+...+.+.+.+...+...  ....|.   +..+|.|.+.--+.  ..+.|.|.+.-.+
T Consensus         9 ~~~g~~~~I~W~~~~~~~~~~~I~L~~g~~~~~~~~~~ia~~v~~~~gs~~~~~p~~l~~~~~Y~i~~~~~~   80 (93)
T PF10342_consen    9 WTAGQPITITWTSDGTDPGNVTIYLCNGNNTNLNFVQTIASNVSNSDGSYTWTIPSDLPSGGDYFIQIVNSS   80 (93)
T ss_pred             EECCCcEEEEEeCCCCCCcEEEEEEEcCCCCCcceeEEEEecccCCCCEEEEEcCCCCCCCCcEEEEEEECC
Confidence            45677777777776555567777755443311  112232   23335665444221  1356888877443


No 140
>PF08842 Mfa2:  Fimbrillin-A associated anchor proteins Mfa1 and Mfa2;  InterPro: IPR014941 This family of proteins may be lipoproteins principally from bacilli. They are between 300 and 400 residues. Many Bacteroides-like bacterial species, including Porphyromonas gingivalis, the causal agent of periodontal infection, carry at least two types of fimbriae, namely FimA and Mfa1 fimbriae, following the names of their major subunit proteins []. Normally, FimA fimbriae are long filaments that are easily detached from cells, whereas Mfa1 fimbriae are short filaments that are tightly bound to cells; however, in the absence of Mfa2 protein, the Mfa1 fimbriae are also very long and are not attached. Mfa2 and Mfa1 are associated with each other in whole P. gingivalis cells to the extent that Mfa2 is located on the cell surface and probably associated with Mfa1 fimbriae in such a way that it anchors the Mfa1 fimbriae to the cell surface and regulates Mfa1 filament length [].; PDB: 3PAY_C 3GF8_A.
Probab=20.45  E-value=3.8e+02  Score=20.94  Aligned_cols=65  Identities=14%  Similarity=0.143  Sum_probs=32.6

Q ss_pred             ceEeCCeEEEEEEeCCCc-------CCceEEEEECCCCCccceEEEEcC---CCEEEEEEEcCCCeeEEEEEEEC
Q psy15726         62 QARSGIVNSFTLETCGVA-------SSEFDVIVTSPQGAAVPVRCYQQK---FANLLAEFTPTTTGVYKIDVLQG  126 (171)
Q Consensus        62 ~~~vg~~~~F~V~~~~ag-------~~~l~v~i~~p~g~~~~~~v~~~~---dg~y~v~y~P~~~G~~~i~V~~~  126 (171)
                      .|..+-...|..+.....       -..+++-|.+.+|+.+.....+..   .+.|.+......+|.|++-+.-|
T Consensus         3 dC~~~~~l~f~y~~~~~~~~~~~~~v~~v~lyvFd~~g~~v~~~~~~~~~~~~~~y~~~~~~l~~G~Y~~va~~n   77 (283)
T PF08842_consen    3 DCPGGLTLKFSYDYNMGNADAFEDEVKRVDLYVFDEDGKLVKQRTIDSEELEGGGYTMFLLDLPPGTYTFVAWGN   77 (283)
T ss_dssp             -----EEEEEE---STT-S--HHHH--EEEEEEE-TTSBEEEEEEEECGGCCTTTEEE-CCT--SEEEEEEEEES
T ss_pred             CCCceEEEEEEEeCCccccccccceEeEEEEEEEeCCCeEEEEEEcccccccCCceEEeeccCCCCcEEEEEEEC
Confidence            344455556665554221       157888899988875444433332   24588766677889999877665


No 141
>PF07965 Integrin_B_tail:  Integrin beta tail domain;  InterPro: IPR012896 Integrins are the major metazoan receptors for cell adhesion to extracellular matrix proteins and, in vertebrates, also play important roles in certain cell-cell adhesions, make transmembrane connections to the cytoskeleton and activate many intracellular signalling pathways [, ]. The integrin receptors are composed of alpha and beta subunit heterodimers. Each subunit crosses the membrane once, with most of the polypeptide residing in the extracellular space, and has two short cytoplasmic domains. Some members of this family have EGF repeats at the C terminus and also have a vWA domain inserted within the integrin domain at the N terminus.  Most integrins recognise relatively short peptide motifs, and in general require an acidic amino acid to be present. Ligand specificity depends upon both the alpha and beta subunits []. There are at least 18 types of alpha and 8 types of beta subunits recognised in humans []. Each alpha subunit tends to associate only with one type of beta subunit, but there are exceptions to this rule []. Each association of alpha and beta subunits has its own binding specificity and signalling properties. Many integrins require activation on the cell surface before they can bind ligands. Integrins frequently intercommunicate, and binding at one integrin receptor activate or inhibit another.  The structure of unliganded alphaV beta3 showed the molecule to be folded, with the head bent over towards the C termini of the legs which would normally be inserted into the membrane []. The head comprises a beta propeller domain at the end terminus of the alphaV subunit and an I/A domain inserted into a loop on the top of the hybrid domain in the beta subunit. The I/A domain consists of a Rossman fold with a core of beta parallel sheets surrounded by amphipathic alpha helices.  This entry represents the tail domain of the integrin beta subunit. It forms a four-stranded beta-sheet that contains parallel and anttparallel strands and faces an alpha helix found at the N terminus of this domain []. Interactions between the alpha-helix and the beta-sheet are mostly hydrophobic and involve a disulphide bond. The rear of the beta sheet is covered with a long A-B loop.; GO: 0004872 receptor activity, 0005488 binding, 0007155 cell adhesion, 0007160 cell-matrix adhesion, 0007229 integrin-mediated signaling pathway, 0008305 integrin complex; PDB: 3K71_H 3K6S_H 3K72_D 3IJE_B 1M1X_B 1U8C_B 2RN0_A 3FCS_B 1JV2_B 1L5G_B ....
Probab=20.44  E-value=2.6e+02  Score=18.61  Aligned_cols=30  Identities=13%  Similarity=0.090  Sum_probs=24.2

Q ss_pred             cceEEEEcCCCEEEEEEE-cCCCeeEEEEEE
Q psy15726         95 VPVRCYQQKFANLLAEFT-PTTTGVYKIDVL  124 (171)
Q Consensus        95 ~~~~v~~~~dg~y~v~y~-P~~~G~~~i~V~  124 (171)
                      +.+...|.+|-.|..+|. -...|.|.|.|.
T Consensus        47 ~~C~~~De~dC~~~F~y~~~~~~~~~~v~V~   77 (87)
T PF07965_consen   47 VRCSFRDEDDCWFTFTYEQYDGNGKYVVYVQ   77 (87)
T ss_dssp             EEEEEEETTSEEEEEEEE-ECTTTEEEEEEE
T ss_pred             CccccCCCCCcEEEEEEEEeCCCCeEEEEEc
Confidence            368888888888888888 677888988875


No 142
>PF01067 Calpain_III:  Calpain large subunit, domain III;  InterPro: IPR022682 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad [].  This group of cysteine peptidases belong to the MEROPS peptidase family C2 (calpain family, clan CA). A type example is calpain, which is an intracellular protease involved in many important cellular functions that are regulated by calcium []. The protein is a complex of 2 polypeptide chains (light and heavy), with three known forms in mammals [, ]: a highly calcium-sensitive (i.e., micro-molar range) form known as mu-calpain, mu-CANP or calpain I; a form sensitive to calcium in the milli-molar range, known as m-calpain, m-CANP or calpain II; and a third form, known as p94, which is found in skeletal muscle only [].  All forms have identical light but different heavy chains. Both mu- and m-calpain are heterodimers containing an identical 28kDa subunit and an 80kDa subunit that shares 55-65% sequence homology between the two proteases [, ]. The crystallographic structure of m-calpain reveals six "domains" in the 80kDa subunit:    A 19-amino acid NH2-terminal sequence; Active site domain IIa; Active site domain IIb.  Domain 2 shows low levels of sequence similarity to papain; although the catalytic His has not been located by biochemical means, it is likely that calpain and papain are related [].  Domain III; An 18-amino acid extended sequence linking domain III to domain IV; Domain IV, which resembles the penta EF-hand family of polypeptides, binds calcium and regulates activity []. />]. Ca2+-binding causes a rearrangement of the protein backbone, the net effect of which is that a Trp side chain, which acts as a wedge between catalytic domains IIa and IIb in the apo state, moves away from the active site cleft allowing for the proper formation of the catalytic triad [].   Calpain-like mRNAs have been identified in other organisms including bacteria, but the molecules encoded by these mRNAs have not been isolated, so little is known about their properties. How calpain activity is regulated in these organisms cells is still unclear In metazoans, the activity of calpain is controlled by a single proteinase inhibitor, calpastatin (IPR001259 from INTERPRO). The calpastatin gene can produce eight or more calpastatin polypeptides ranging from 17 to 85 kDa by use of different promoters and alternative splicing events. The physiological significance of these different calpastatins is unclear, although all bind to three different places on the calpain molecule; binding to at least two of the sites is Ca2+ dependent. The calpains ostensibly participate in a variety of cellular processes including remodelling of cytoskeletal/membrane attachments, different signal transduction pathways, and apoptosis. Deregulated calpain activity following loss of Ca2+ homeostasis results in tissue damage in response to events such as myocardial infarcts, stroke, and brain trauma [].   This entry represents domain III. It is found in association with PF00648 from PFAM. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated by calcium ions. ; PDB: 1QXP_B 2QFE_A 1DF0_A 1U5I_A 3DF0_A 3BOW_A 1KFU_L 1KFX_L.
Probab=20.30  E-value=1.4e+02  Score=21.29  Aligned_cols=26  Identities=12%  Similarity=0.327  Sum_probs=17.3

Q ss_pred             EcCCCEEEE---EEEcCCCeeEEEEEEEC
Q psy15726        101 QQKFANLLA---EFTPTTTGVYKIDVLQG  126 (171)
Q Consensus       101 ~~~dg~y~v---~y~P~~~G~~~i~V~~~  126 (171)
                      ....|.|.+   +|.|...|.|.|.|..+
T Consensus       113 ~L~~G~YvIVPsT~~~~~~g~F~L~v~s~  141 (147)
T PF01067_consen  113 TLPPGTYVIVPSTYEPGQEGEFTLRVFSD  141 (147)
T ss_dssp             EE-SEEEEEEEEESSTT--EEEEEEEEES
T ss_pred             EcCCCCEEEEEecCCCCCeeeEEEEEEEC
Confidence            445678876   56688899999998864


No 143
>PF02792 Mago_nashi:  Mago nashi protein;  InterPro: IPR004023 This family was originally identified in drosophila and called mago nashi, it is a strict maternal effect, grandchildless-like, gene []. The human homologue has been shown to interact with an RNA binding protein, ribonucleoprotein rbm8 (Q9Y5S9 from SWISSPROT) []. An RNAi knockout of the Caenorhabditis elegans homologue causes masculinization of the germ line (Mog phenotype) hermaphrodites, suggesting it is involved in hermaphrodite germ-line sex determination [] but the protein is also found in hermaphrodites and other organisms without a sexual differentiation.; GO: 0005634 nucleus; PDB: 2XB2_Y 2J0S_C 3EX7_A 2J0Q_F 1P27_C 2HYI_A 2X1G_B 1HL6_B 1RK8_B 1OO0_A ....
Probab=20.27  E-value=1.3e+02  Score=22.50  Aligned_cols=20  Identities=10%  Similarity=0.024  Sum_probs=17.6

Q ss_pred             cCCCCceeEEEEEEECCEEc
Q psy15726         16 PSWISYGLVQARTLREETSF   35 (171)
Q Consensus        16 ~~P~~~G~~~i~V~~~g~~I   35 (171)
                      |.|...|.-++.|.++++||
T Consensus        75 P~pd~~GrQELEI~lg~~Hi   94 (143)
T PF02792_consen   75 PEPDRVGRQELEIVLGNEHI   94 (143)
T ss_dssp             --SBTTBEEEEEEEETTEEE
T ss_pred             CCCCCccceeEEEEECCEEE
Confidence            68999999999999999998


No 144
>PF00927 Transglut_C:  Transglutaminase family, C-terminal ig like domain;  InterPro: IPR008958 Synonym(s): Protein-glutamine gamma-glutamyltransferase, Fibrinoligase, TGase  Transglutaminases catalyse the post-translational modification of proteins at glutamine residues, with formation of isopeptide bonds. Members of the transglutaminase family usually have three domains: N-terminal (IPR001102 from INTERPRO), middle (IPR013808 from INTERPRO) and C-terminal. The middle domain is usually well conserved, but family members can display major differences in their N- and C-terminal domains, although their overall structure is conserved []. This entry represents the C-terminal domain found in transglutaminases, which consists of an immunoglobulin-like beta-sandwich consisting of seven strands in two sheets with a Greek key topology. The best known transglutaminase is blood coagulation factor XIII, a plasma tetrameric protein composed of two catalytic A subunits and two non-catalytic B subunits. Factor XIII is responsible for cross-linking fibrin chains, thus stabilising the fibrin clot. Protein-glutamine gamma-glutamyltransferases (2.3.2.13 from EC) are calcium-dependent enzymes that catalyse the cross-linking of proteins by promoting the formation of isopeptide bonds between the gamma-carboxyl group of a glutamine in one polypeptide chain and the epsilon-amino group of a lysine in a second polypeptide chain. TGases also catalyse the conjugation of polyamines to proteins [, ].; GO: 0003810 protein-glutamine gamma-glutamyltransferase activity, 0018149 peptide cross-linking; PDB: 2XZZ_A 1GGY_B 1FIE_B 1GGU_B 1GGT_B 1F13_A 1QRK_B 1EVU_A 1EX0_B 1L9N_B ....
Probab=20.08  E-value=1.9e+02  Score=19.58  Aligned_cols=20  Identities=25%  Similarity=0.438  Sum_probs=17.2

Q ss_pred             EEEEEECCEEcCCCCeEEEe
Q psy15726        120 KIDVLQGARPVRGSPYLCQV  139 (171)
Q Consensus       120 ~i~V~~~g~~I~gSPF~v~V  139 (171)
                      .+.|+..+..+-|++|.+.+
T Consensus         2 ~~~i~~~~~~~vG~d~~v~v   21 (107)
T PF00927_consen    2 EIKIKLPGDPVVGQDFTVSV   21 (107)
T ss_dssp             EEEEEEESEEBTTSEEEEEE
T ss_pred             eEEEEECCCccCCCCEEEEE
Confidence            36788889999999999988


Done!