Query         psy5522
Match_columns 1397
No_of_seqs    428 out of 1892
Neff          7.8 
Searched_HMMs 46136
Date          Fri Aug 16 19:41:25 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy5522.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/5522hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG1366|consensus              100.0  7E-157  1E-161 1521.3  91.1 1180    1-1339   22-1433(1436)
  2 COG2373 Large extracellular al 100.0 1.7E-64 3.8E-69  666.6  73.5  812   97-997   393-1269(1621)
  3 cd02896 complement_C3_C4_C5 Pr 100.0 2.6E-56 5.6E-61  516.2  13.5  279  880-1279    2-287 (297)
  4 cd02897 A2M_2 Proteins similar 100.0 1.4E-50 3.1E-55  469.2  15.1  280  879-1279    1-282 (292)
  5 PF07678 A2M_comp:  A-macroglob 100.0 1.3E-49 2.8E-54  447.5   1.5  233  931-1279    1-234 (246)
  6 cd02891 A2M_like Proteins simi 100.0   2E-36 4.4E-41  350.2  13.8  270  880-1279    2-272 (282)
  7 PF07703 A2M_N_2:  Alpha-2-macr  99.9 8.7E-23 1.9E-27  209.5  18.3  135  419-555     1-136 (136)
  8 PF00207 A2M:  Alpha-2-macroglo  99.9 1.8E-22 3.9E-27  191.6  14.5   91  660-751     1-92  (92)
  9 PF07677 A2M_recep:  A-macroglo  99.8   7E-22 1.5E-26  186.2   5.0   49 1091-1141    1-51  (91)
 10 KOG1366|consensus               99.8 4.2E-20 9.2E-25  243.4  20.6  199  954-1157 1154-1390(1436)
 11 PF01835 A2M_N:  MG2 domain;  I  99.8 3.6E-19 7.9E-24  171.9  12.6   93   99-191     1-99  (99)
 12 cd00688 ISOPREN_C2_like This g  99.7 1.4E-17   3E-22  193.2   8.9  262  881-1279    3-290 (300)
 13 cd02889 SQCY Squalene cyclase   99.5 6.6E-14 1.4E-18  167.3  13.6  259  899-1279   23-322 (348)
 14 PF10569 Thiol-ester_cl:  Alpha  99.4 2.4E-13 5.2E-18   98.6   2.1   31  882-912     1-31  (31)
 15 COG2373 Large extracellular al  99.3   1E-07 2.2E-12  129.5  50.9  282   95-394   285-599 (1621)
 16 cd02892 SQCY_1 Squalene cyclas  99.1 7.5E-10 1.6E-14  141.7  14.6  118 1157-1279  475-608 (634)
 17 cd02896 complement_C3_C4_C5 Pr  98.6 7.2E-08 1.6E-12  112.3   8.7   71  936-1011  214-296 (297)
 18 TIGR01507 hopene_cyclase squal  98.6 9.7E-07 2.1E-11  112.2  16.8  150 1156-1353  469-633 (635)
 19 PF07678 A2M_comp:  A-macroglob  98.3   5E-07 1.1E-11  102.3   5.3   51  956-1011  190-243 (246)
 20 cd02897 A2M_2 Proteins similar  98.0 2.8E-05 6.1E-10   90.7  10.1   54  955-1011  235-291 (292)
 21 TIGR03463 osq_cycl 2,3-oxidosq  97.9 4.9E-05 1.1E-09   97.0  12.5  114 1157-1275  476-604 (634)
 22 TIGR01787 squalene_cyclas squa  97.9 9.8E-05 2.1E-09   94.2  13.7  119 1156-1279  458-592 (621)
 23 PF05326 SVA:  Seminal vesicle   97.8 0.00017 3.6E-09   70.2  10.5   84  711-809    30-113 (124)
 24 PLN03012 Camelliol C synthase   97.6 0.00017 3.8E-09   91.7  10.1   70  931-1001  517-611 (759)
 25 PLN02993 lupeol synthase        97.4  0.0011 2.4E-08   84.8  13.2  114 1157-1274  589-721 (763)
 26 cd00688 ISOPREN_C2_like This g  97.4 0.00031 6.6E-09   81.3   7.5   85 1175-1276  138-228 (300)
 27 cd02889 SQCY Squalene cyclase   97.3 0.00067 1.5E-08   81.3   9.4   91  904-1004  222-321 (348)
 28 cd02890 PTase Protein prenyltr  97.0    0.01 2.2E-07   69.1  15.0   91  902-1002   28-120 (286)
 29 PF13249 Prenyltrans_2:  Prenyl  97.0   0.001 2.2E-08   65.6   5.8   91  898-1000   20-112 (113)
 30 PF13249 Prenyltrans_2:  Prenyl  96.9 0.00057 1.2E-08   67.4   2.9   60  936-1003    4-65  (113)
 31 PF07703 A2M_N_2:  Alpha-2-macr  96.7   0.044 9.5E-07   56.1  14.8  122  101-238     2-128 (136)
 32 cd02894 GGTase-II Geranylgeran  96.6   0.024 5.2E-07   66.0  13.9  104  889-1002   18-122 (287)
 33 PRK14081 triple tyrosine motif  96.3     5.6 0.00012   50.7  54.2  417   68-544   155-608 (667)
 34 COG1470 Predicted membrane pro  96.2    0.25 5.4E-06   58.7  18.7  158  707-876   259-472 (513)
 35 cd02893 FTase Protein farnesyl  96.1     0.1 2.3E-06   60.9  15.3   77  917-1002   44-120 (299)
 36 cd02892 SQCY_1 Squalene cyclas  95.9   0.023   5E-07   73.5   9.6   90  904-1004  507-607 (634)
 37 TIGR03503 conserved hypothetic  95.7     1.1 2.4E-05   53.0  21.4  190    6-207    98-311 (374)
 38 TIGR01507 hopene_cyclase squal  95.7   0.029 6.4E-07   71.9   9.3   86  906-1002  504-599 (635)
 39 COG1470 Predicted membrane pro  95.6     1.1 2.3E-05   53.6  20.3  171   10-182   280-469 (513)
 40 PLN02710 farnesyltranstransfer  95.5     0.2 4.3E-06   60.8  14.7   73  917-1001   89-164 (439)
 41 PF13243 Prenyltrans_1:  Prenyl  95.3  0.0063 1.4E-07   59.6   0.8   65  932-1002    4-68  (109)
 42 PLN02993 lupeol synthase        95.1   0.037 8.1E-07   71.2   7.3   70  921-1002  591-661 (763)
 43 PLN03201 RAB geranylgeranyl tr  95.1     0.2 4.2E-06   59.1  12.5   84  904-1002   43-126 (316)
 44 TIGR01787 squalene_cyclas squa  95.0   0.073 1.6E-06   68.4   9.4   71  920-1002  460-531 (621)
 45 TIGR03463 osq_cycl 2,3-oxidosq  94.8   0.082 1.8E-06   68.0   9.3   88  904-1001  508-604 (634)
 46 PF10633 NPCBM_assoc:  NPCBM-as  94.7    0.15 3.3E-06   46.7   8.4   72  725-811     1-76  (78)
 47 TIGR03503 conserved hypothetic  93.8      19 0.00042   42.9  27.6  234  136-397    32-287 (374)
 48 PLN03012 Camelliol C synthase   93.7    0.22 4.8E-06   64.3   9.4   88  906-1001  623-722 (759)
 49 PF10670 DUF4198:  Domain of un  93.1     9.3  0.0002   42.1  20.4  174   98-293     4-213 (215)
 50 cd02890 PTase Protein prenyltr  93.0     0.3 6.5E-06   56.8   8.6   81  908-1000  134-215 (286)
 51 cd02894 GGTase-II Geranylgeran  92.4     0.5 1.1E-05   55.0   9.3   74  915-1000  143-216 (287)
 52 cd02891 A2M_like Proteins simi  92.1    0.49 1.1E-05   54.7   8.9   52  955-1011  227-281 (282)
 53 PRK13211 N-acetylglucosamine-b  92.1      34 0.00073   42.5  24.5  222   60-308   158-404 (478)
 54 PF01835 A2M_N:  MG2 domain;  I  92.1     2.9 6.4E-05   40.0  12.8   78  319-399     6-86  (99)
 55 PF07919 Gryzun:  Gryzun, putat  91.9    0.48   1E-05   60.6   9.2   79  712-805   468-546 (554)
 56 PF07705 CARDB:  CARDB;  InterP  91.6     2.5 5.5E-05   40.2  11.8   76  718-812     8-85  (101)
 57 PF05753 TRAP_beta:  Translocon  91.1     1.9 4.2E-05   46.3  11.2   85  705-808    19-107 (181)
 58 COG5029 CAL1 Prenyltransferase  90.5    0.79 1.7E-05   52.0   7.8   97  890-1002   93-196 (342)
 59 cd02895 GGTase-I Geranylgerany  89.6     1.2 2.5E-05   52.5   8.8  110  888-1003   14-135 (307)
 60 PF13115 YtkA:  YtkA-like        89.1     3.2 6.9E-05   38.7   9.7   75   98-178     7-85  (86)
 61 PF10670 DUF4198:  Domain of un  89.0     1.8   4E-05   47.7   9.5   76  314-399   135-212 (215)
 62 PLN03201 RAB geranylgeranyl tr  88.3     1.5 3.3E-05   51.7   8.6   73  917-1001  149-221 (316)
 63 cd02895 GGTase-I Geranylgerany  88.0     1.1 2.4E-05   52.6   7.1   70  922-1000  167-236 (307)
 64 PLN02710 farnesyltranstransfer  87.9     1.1 2.4E-05   54.6   7.1   67  922-1002  145-213 (439)
 65 cd02893 FTase Protein farnesyl  87.9       2 4.2E-05   50.4   9.1   74  916-1001  142-216 (299)
 66 smart00557 IG_FLMN Filamin-typ  87.5     3.6 7.7E-05   39.1   9.0   64  110-182    15-78  (93)
 67 COG5029 CAL1 Prenyltransferase  87.5     2.2 4.8E-05   48.6   8.6   94  892-1001  151-244 (342)
 68 KOG0367|consensus               87.2     1.1 2.3E-05   50.4   5.9  105  886-1001  136-249 (347)
 69 PF00207 A2M:  Alpha-2-macroglo  87.1       6 0.00013   37.5  10.3   75  153-234    10-90  (92)
 70 smart00634 BID_1 Bacterial Ig-  85.9     8.5 0.00018   36.5  10.6   67  328-400    18-84  (92)
 71 PF00630 Filamin:  Filamin/ABP2  84.3     6.1 0.00013   37.8   9.1   70  110-182    18-92  (101)
 72 KOG0366|consensus               83.4     3.3 7.2E-05   45.8   7.3   90  899-1002   93-184 (329)
 73 PF00927 Transglut_C:  Transglu  82.8     6.7 0.00014   38.3   8.7   79  717-808     3-86  (107)
 74 PF13243 Prenyltrans_1:  Prenyl  81.2    0.58 1.3E-05   45.7   0.6   60  922-994    48-107 (109)
 75 PF02369 Big_1:  Bacterial Ig-l  79.8     9.7 0.00021   36.8   8.5   69  326-401    21-91  (100)
 76 PF06030 DUF916:  Bacterial pro  79.5      11 0.00023   37.9   8.9   75  725-799    23-105 (121)
 77 PF09315 DUF1973:  Domain of un  78.8      27 0.00059   37.6  12.3   46  133-181    42-87  (179)
 78 PF14545 DBB:  Dof, BCAP, and B  76.0      18  0.0004   37.1   9.4   84    3-95      1-88  (142)
 79 PF10633 NPCBM_assoc:  NPCBM-as  75.1      15 0.00033   33.5   8.0   54  817-871    22-75  (78)
 80 PF14524 Wzt_C:  Wzt C-terminal  73.6      19  0.0004   36.6   9.2   73  106-180    28-107 (142)
 81 PF12690 BsuPI:  Intracellular   73.3      15 0.00032   34.2   7.4   64  115-178     2-82  (82)
 82 PF12975 DUF3859:  Domain of un  73.0      13 0.00029   37.5   7.6   58  134-191    59-125 (125)
 83 PF04734 Ceramidase_alk:  Neutr  72.8      16 0.00036   47.4  10.3   82   98-181   557-653 (674)
 84 KOG0366|consensus               71.7      11 0.00024   42.0   7.0   80  907-1001   56-135 (329)
 85 PF10648 Gmad2:  Immunoglobulin  69.7      39 0.00084   31.9   9.4   51  130-181    27-81  (88)
 86 PRK10301 hypothetical protein;  69.0      14  0.0003   37.3   6.7   61  132-192    59-124 (124)
 87 PF13620 CarboxypepD_reg:  Carb  68.5      30 0.00064   31.6   8.5   57  117-182     2-58  (82)
 88 PF11589 DUF3244:  Domain of un  67.9      30 0.00065   33.8   8.7   59  131-192    47-106 (106)
 89 smart00634 BID_1 Bacterial Ig-  67.8      41 0.00089   31.8   9.4   80  100-182     3-84  (92)
 90 KOG0497|consensus               67.7     5.7 0.00012   49.9   4.3   67  931-1002  593-660 (760)
 91 PF15418 DUF4625:  Domain of un  67.6      23  0.0005   36.1   8.0   74  105-182    28-117 (132)
 92 KOG0365|consensus               66.5      22 0.00048   41.2   8.2  100  891-1002  139-241 (423)
 93 PF13860 FlgD_ig:  FlgD Ig-like  66.1      24 0.00052   32.6   7.2   50  131-180    24-77  (81)
 94 PF01483 P_proprotein:  Proprot  64.6      20 0.00043   33.5   6.5   51  130-180    14-70  (87)
 95 PTZ00487 ceramidase; Provision  63.9      25 0.00055   45.6   9.2   73  108-181   607-692 (715)
 96 PF00432 Prenyltrans:  Prenyltr  63.0     6.9 0.00015   31.6   2.6   21  982-1002    3-23  (44)
 97 PRK06655 flgD flagellar basal   61.5      47   0.001   37.2   9.7   75  100-180   100-178 (225)
 98 PF04234 CopC:  CopC domain;  I  61.2     9.4  0.0002   36.6   3.7   60  132-192    33-97  (97)
 99 KOG4386|consensus               60.7      17 0.00036   44.1   6.2   86  706-806   685-770 (809)
100 PF13115 YtkA:  YtkA-like        59.8      87  0.0019   29.0   9.9   69  311-384     5-76  (86)
101 PF02369 Big_1:  Bacterial Ig-l  59.3 1.3E+02  0.0028   29.0  11.2   78  101-182    11-90  (100)
102 PRK14081 triple tyrosine motif  56.2 5.8E+02   0.013   33.3  44.6  340   68-463   253-625 (667)
103 PRK13211 N-acetylglucosamine-b  55.4 1.1E+02  0.0024   38.1  12.3   86   26-115   338-432 (478)
104 PF00432 Prenyltrans:  Prenyltr  54.6     8.5 0.00019   31.0   1.9   20 1257-1276    4-23  (44)
105 PF14326 DUF4384:  Domain of un  54.5      65  0.0014   29.9   8.0   63  107-178     1-65  (83)
106 PF13199 Glyco_hydro_66:  Glyco  53.9      64  0.0014   41.0  10.2   93  104-199     2-105 (559)
107 PF08400 phage_tail_N:  Prophag  51.9   1E+02  0.0022   31.4   9.3   96  328-430     1-100 (134)
108 TIGR03437 Soli_cterm Solibacte  51.6 3.4E+02  0.0074   30.2  14.2  119   59-182    51-204 (215)
109 PF11614 FixG_C:  IG-like fold   51.4 1.8E+02  0.0038   28.8  11.1   63  733-811    35-101 (118)
110 PF13620 CarboxypepD_reg:  Carb  50.4      35 0.00077   31.0   5.6   57  332-400     2-58  (82)
111 PF00868 Transglut_N:  Transglu  49.9 1.2E+02  0.0026   30.3   9.5   82   96-180    33-116 (118)
112 PRK12812 flgD flagellar basal   48.7 1.8E+02  0.0039   33.3  11.7   63  113-181   128-194 (259)
113 PF14900 DUF4493:  Domain of un  46.6 4.7E+02    0.01   29.4  19.1   44  131-182    32-75  (235)
114 PF05751 FixH:  FixH;  InterPro  46.1      90   0.002   32.1   8.4   63  113-180    69-132 (146)
115 PRK12634 flgD flagellar basal   45.2 1.2E+02  0.0026   33.9   9.6   51  131-181   121-175 (221)
116 COG2372 CopC Uncharacterized p  42.5      79  0.0017   31.8   6.7   62  131-192    59-125 (127)
117 PRK05842 flgD flagellar basal   41.7 1.5E+02  0.0032   34.6   9.8   69  110-180   146-220 (295)
118 PF15432 Sec-ASP3:  Accessory S  41.6 2.3E+02  0.0049   28.8  10.0   80  104-193    45-126 (128)
119 PF13715 DUF4480:  Domain of un  41.2 2.5E+02  0.0053   25.9   9.8   52  117-182     2-54  (88)
120 PF00775 Dioxygenase_C:  Dioxyg  41.1      66  0.0014   34.8   6.6   28  324-353    24-51  (183)
121 PRK12633 flgD flagellar basal   40.4      96  0.0021   34.9   7.9   51  131-181   128-182 (230)
122 KOG0367|consensus               39.6      91   0.002   35.8   7.4   55  939-1000  137-194 (347)
123 cd00421 intradiol_dioxygenase   38.8 1.1E+02  0.0023   31.9   7.5   22  325-346     7-28  (146)
124 COG1689 Uncharacterized protei  37.3      44 0.00094   36.4   4.3   41  955-1000  208-248 (274)
125 PF12584 TRAPPC10:  Trafficking  36.5 2.8E+02  0.0061   28.8  10.2   70  722-806    24-105 (147)
126 PF04151 PPC:  Bacterial pre-pe  35.5   1E+02  0.0023   27.3   6.0   44  133-179    25-69  (70)
127 KOG0365|consensus               35.4      85  0.0018   36.7   6.4   73  917-1001   69-141 (423)
128 PF05688 DUF824:  Salmonella re  34.4      62  0.0013   26.8   3.8   31  323-355     8-38  (47)
129 COG1657 SqhC Squalene cyclase   32.7   1E+02  0.0023   38.5   7.2  111 1158-1274  352-479 (517)
130 PF01122 Cobalamin_bind:  Eukar  31.5      96  0.0021   36.6   6.3   87 1174-1279  142-240 (326)
131 PF06280 DUF1034:  Fn3-like dom  31.1 2.4E+02  0.0051   27.7   8.3   66  728-798     7-80  (112)
132 PF07177 Neuralized:  Neuralize  30.8   1E+02  0.0022   27.7   5.0   28   98-126    29-56  (69)
133 TIGR01451 B_ant_repeat conserv  30.8      83  0.0018   26.6   4.2   37  104-144     3-41  (53)
134 KOG0497|consensus               30.4      54  0.0012   41.7   4.2  134  854-1000  572-720 (760)
135 PF07233 DUF1425:  Protein of u  30.2   3E+02  0.0064   26.3   8.4   62  720-794    15-78  (94)
136 PF09608 Alph_Pro_TM:  Putative  29.9 1.1E+02  0.0024   34.5   6.3  102   85-193    66-196 (236)
137 TIGR02186 alph_Pro_TM conserve  28.7 1.2E+02  0.0026   34.5   6.3   51  136-193   166-221 (261)
138 PF05751 FixH:  FixH;  InterPro  28.6   3E+02  0.0066   28.2   9.0   65  331-398    70-134 (146)
139 cd05822 TLP_HIUase HIUase (5-h  27.9 1.9E+02  0.0042   28.7   6.8   60  121-182     7-68  (112)
140 cd05790 S1_Rrp40 S1_Rrp40: Rrp  27.9      84  0.0018   29.6   4.1   32  106-143    47-78  (86)
141 KOG1948|consensus               27.6 1.6E+03   0.035   30.0  27.8   57  369-437   258-314 (1165)
142 PF07705 CARDB:  CARDB;  InterP  27.2 5.1E+02   0.011   24.0  10.6   75  422-498     9-88  (101)
143 PF01273 LBP_BPI_CETP:  LBP / B  27.2 4.4E+02  0.0095   27.5  10.1   46  186-231    41-87  (164)
144 PF12863 DUF3821:  Domain of un  26.2 1.9E+02  0.0041   32.0   7.1   39  109-150   102-148 (209)
145 PF11974 MG1:  Alpha-2-macroglo  26.0 1.3E+02  0.0029   28.8   5.3   45  332-382    15-60  (97)
146 cd03459 3,4-PCD Protocatechuat  25.7   2E+02  0.0043   30.4   6.9   49  325-380    11-78  (158)
147 TIGR02439 catechol_proteo cate  25.4 1.7E+02  0.0037   33.9   6.9   22  325-346   124-145 (285)
148 PF05688 DUF824:  Salmonella re  25.4 1.4E+02  0.0031   24.7   4.4   35  108-142     8-42  (47)
149 PF14874 PapD-like:  Flagellar-  25.2 5.9E+02   0.013   24.1  11.9   68  724-809    15-85  (102)
150 cd03463 3,4-PCD_alpha Protocat  25.1 1.9E+02  0.0042   31.3   6.9   48  326-380    33-98  (185)
151 PF00576 Transthyretin:  HIUase  25.1 2.1E+02  0.0046   28.4   6.5   62  121-182     7-69  (112)
152 PF01122 Cobalamin_bind:  Eukar  25.1      88  0.0019   36.9   4.5   76  923-1011  214-292 (326)
153 COG4935 Regulatory P domain of  25.0 5.4E+02   0.012   27.6   9.7   53  130-182    50-110 (177)
154 PRK12813 flgD flagellar basal   24.0 2.1E+02  0.0046   32.0   7.0   48  131-181   123-174 (223)
155 TIGR01451 B_ant_repeat conserv  23.8 2.2E+02  0.0047   24.0   5.5   40  722-761     5-45  (53)
156 TIGR02438 catachol_actin catec  23.5   2E+02  0.0042   33.4   6.8   23  325-347   128-150 (281)
157 cd03458 Catechol_intradiol_dio  23.2   2E+02  0.0044   32.8   6.9   22  325-346   100-121 (256)
158 PRK15036 hydroxyisourate hydro  23.2 3.6E+02  0.0079   27.7   8.1   64  116-182    28-93  (137)
159 COG5266 CbiK ABC-type Co2+ tra  23.2 2.1E+02  0.0046   32.3   6.7   57  320-381   164-228 (264)
160 cd03462 1,2-CCD chlorocatechol  23.1 2.1E+02  0.0046   32.5   6.9   22  325-346    95-116 (247)
161 PF05506 DUF756:  Domain of unk  23.0   4E+02  0.0088   24.9   7.9   58  732-810    21-78  (89)
162 cd05469 Transthyretin_like Tra  23.0 4.2E+02  0.0092   26.3   8.1   59  121-182     7-68  (113)
163 cd03460 1,2-CTD Catechol 1,2 d  22.9   2E+02  0.0043   33.4   6.8   22  325-346   120-141 (282)
164 PF14782 BBS2_C:  Ciliary BBSom  22.8 2.7E+02  0.0059   34.4   8.3   81  780-871    93-173 (431)
165 TIGR02962 hdxy_isourate hydrox  22.3 4.2E+02  0.0092   26.3   8.0   60  121-182     7-68  (112)
166 COG1689 Uncharacterized protei  22.1      76  0.0016   34.7   3.0   66  921-999     7-73  (274)
167 PF07718 Coatamer_beta_C:  Coat  21.9 2.6E+02  0.0056   28.8   6.6   67  714-799    57-124 (140)
168 smart00736 CADG Dystroglycan-t  21.8 5.1E+02   0.011   24.6   8.5   47  133-182    31-77  (97)
169 cd05703 S1_Rrp5_repeat_hs12_sc  21.7 1.2E+02  0.0026   27.3   3.9   18  108-125    46-63  (73)
170 PF01345 DUF11:  Domain of unkn  20.8 2.1E+02  0.0046   25.7   5.4   40  721-760    33-73  (76)
171 COG2372 CopC Uncharacterized p  20.8 2.2E+02  0.0048   28.7   5.7   58  452-510    68-126 (127)

No 1  
>KOG1366|consensus
Probab=100.00  E-value=6.8e-157  Score=1521.27  Aligned_cols=1180  Identities=35%  Similarity=0.543  Sum_probs=925.7

Q ss_pred             CEEEEeeCeeecCCeEEEEEEeecCCccEEEEEEEEeecCCCCeeEEeEEEEECCCceeEEEEEcCCCCC--ceeEEEEE
Q psy5522           1 YYTVVAPKVLRPNGEYHVAVSTQAVSEATQVSVEVGGKQDSGGEFRTKQLLSVDPFSTRIVKLDIGDLGP--GQYNLTVK   78 (1397)
Q Consensus         1 ~y~v~~P~~lr~g~~~~v~v~l~~~~~~v~v~v~L~~~~~~~~~~~~~~~~~~~~~~~~~~~f~vP~~~~--~~~~l~v~   78 (1397)
                      +|+|++|+++|+|+++++|.    ..+.+.+++++...+.+. .........-......|+.|++|....  ..|.....
T Consensus        22 ~y~v~~P~iir~~~~~~~~~----~~~~~~~~v~v~~~~~~~-~~~~~~~~~~~~~~~~~v~~~ip~~~~~~~~~~~~~~   96 (1436)
T KOG1366|consen   22 TYMLLAPSIIRVESPESLLE----VQSDVDVTVSVLLRPPSN-ARSIVTDTEGIFQDSSLVTFQIPSLPPNSAEYGRSEK   96 (1436)
T ss_pred             ceeeecceeEecCCCCceee----ecCCcceEEEEEecCccc-ccceeeccccccccccceeccccccCCCcceeEEeee
Confidence            59999999999999999444    344444444444332222 222333332334557899999996543  34788888


Q ss_pred             ecCceeeEeeEEEEEEecccEEEEEcCCCcccCCceeEEEEEEEcCCCccCCCcceEEEEECCCCCeEEeeeeeccCCce
Q psy5522          79 GSGSLNFYNSTGLVYVHKSYSVFIQLDKAIYKPGDLVRFRAIVLNSHLKPSVTGALEIFITDGKGNRIKQWTRALTTRGV  158 (1397)
Q Consensus        79 g~~~~~f~~~~~v~~~~~~~~vfIqTDKpIYkPGqtV~fRvv~ld~~l~P~~~~~~~v~I~DP~g~~I~qw~~~~~~~Gi  158 (1397)
                      | +++.|.+++.+++..+++++|||||||||+|||||+||+|+++.+++|. +  +.|+|+||+||+|+||+......|+
T Consensus        97 ~-~~l~f~~~~~l~~~~~~~~vfIqTDkpiY~pgqtV~~Rvv~~~~~~~~~-~--~~v~~~dp~~~~ikq~~~~~~~~gv  172 (1436)
T KOG1366|consen   97 G-PELEFENSTRLLFSQRRGSVFIQTDKPIYKPGQTVRYRVVTLNLKLKPF-K--VTVYIEDPEGNRIKQWLSVQSDGGV  172 (1436)
T ss_pred             c-ceEEEeeeeeeeeecccceEEEecCCCccCCCCEEEEEEEEeccccccc-c--ceEEEECCCCCchhhhhhccccCce
Confidence            8 8899999999999999999999999999999999999999999999995 4  9999999999999999999999999


Q ss_pred             eeeeeecCCCCCCeeEEEEEEEcCeEEEEEEEEeeecCCcEEEEEeCCCceEeeCCeEEEEEEEEEecCccccceEEEEE
Q psy5522         159 FSADLQLSKSPVLGDWNITINVLDQKFTKRFTVAEYVLPKFEVNVNVPPHATFKDSKVVISVNAKYTYGKPVKGEATITA  238 (1397)
Q Consensus       159 ~~~sf~L~~~~~lG~y~I~v~~~~~~~~~~F~VeEYvLPkFeV~v~~p~~i~~~~~~~~v~v~AkYtyGkPV~G~v~v~v  238 (1397)
                      ++++|+||++|.+|.|+|+|+..++..+..++|+|||||+|||.+++|+.+++. +++.++|||+|+|||||+|.+++++
T Consensus       173 ~~~~f~ls~~~~~G~w~I~v~~~~~~~~~~~~v~eyvlp~FeV~v~~~~~~~~~-~~~~~~v~a~y~ygkpV~g~~~v~~  251 (1436)
T KOG1366|consen  173 YSLSFQLSSEPILGDWKIRVQYKNGLQTKSSTVEEYVLPKFEVNVEPPSIITIN-KELEVTVCARYTYGKPVTGKATVSF  251 (1436)
T ss_pred             EEeeecccCCCcCCceEEEEEEcCCCccccceEEEEEecCceEeeecCcceeec-ceeEEEEeeeeecCCcccceEEEEE
Confidence            999999999999999999999988555555599999999999999999998887 9999999999999999999999999


Q ss_pred             EeccccCcc----ccccccceeEEeecCceEEEEEechhhhcccccc---ceEEEEEEEEEECCcceEEeeEEEEEEEEe
Q psy5522         239 YPTIFSGVI----QPLFQTPVRKVVPIDGKTVIEFDVVKELQLTDEY---ERNIHFDVAVEEALTGRRQNNTGSVVFHKH  311 (1397)
Q Consensus       239 ~~~~~~g~~----~~~~~~~~~~~~d~~G~~~~~~~~~~~~~~~~~~---~~~l~V~a~VtE~gTG~~~~~~~~i~i~~~  311 (1397)
                      ++....+..    ....|.....+...++|.....+... +......   .....+.+..+..-++...+.....+..+.
T Consensus       252 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~a~~~~i~~v~s~~~~~~~~~~~~~~~~~~~  330 (1436)
T KOG1366|consen  252 GLLDSDYKKCQPKIQSNCKNFEQQLNVEGCKVLNVDTQN-LQLYYNFEEGAEPVGITAVRSPFTTDIEETPLHLVPGGPA  330 (1436)
T ss_pred             ecccccccccccchhccccccccccceeeEEEEecCchh-heeehhhhccCcceeEEEEcccchhhhhhcceeecCCcce
Confidence            987432221    12234444555666677766655544 3332111   124555565565555666666667777788


Q ss_pred             eeEEEEEecCCcccCCCeEEEEEEEEccCCCccCCC-CceEEEEEeeeccccccceeeEeeCCCeeEEEE----------
Q psy5522         312 KYKMDLIKSSEYFKPGLKYTAYMKLTHHDGTPVTDN-NNMVQVRHGFSYDESKYEANQYKLDRNGMIKLV----------  380 (1397)
Q Consensus       312 ~~~i~f~~~~~~~kpG~p~~~~v~v~~~dG~P~~~~-~~~V~v~~~~~~~~~~~~~~~~ttd~~G~~~f~----------  380 (1397)
                      ..++.+.+....+|+|+|+.+++++.+.+|.|+... .....+.        .....+++++..+...+.          
T Consensus       331 ~~~v~v~~~~~~~k~gvp~~v~v~v~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~  402 (1436)
T KOG1366|consen  331 ELKVSVEDEDDSFKPGVPFTVQVKVVSKDGVPLDLDLPKEVQVD--------TSVQRPITTDRSKLVESNETMLATQNPS  402 (1436)
T ss_pred             eeeEEEeecCCcccCCcceEEEEEEecCCCcccccccccccccc--------ceEEEeeeccccccceeceeeccccCCc
Confidence            888999999999999999999999999999998732 1111111        111222223222222221          


Q ss_pred             -----EeC-CCCCceeEEEEEEEecCcceEEEEEEeecCCCCCeEEEEecCCCCCCCCeEEEEEEecC--------Ccce
Q psy5522         381 -----YYP-PANENVTTLGIEAEYLDIKEWFSTISASESPSNSFIQAALLTQNPKVNKDVELEINSTA--------PLKY  446 (1397)
Q Consensus       381 -----i~~-~~~~~~~~l~l~a~~~~~~~~~~~v~~~~S~s~s~L~i~~~~~~~~~G~~~~v~v~s~~--------~~~~  446 (1397)
                           ... +.+...  +....+..  ..+.+......+...++++.......+-++-...+.+.+..        ....
T Consensus       403 ~~~~~~~~~~~~~~~--~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  478 (1436)
T KOG1366|consen  403 LTLVRSLAPPSECPP--FLSDDETD--NRAVRLFRKTFNIVQSYIEKAKNSYKLYLQLNKGVKVLSGLGKRVTGILKSVS  478 (1436)
T ss_pred             ceeeEeeccCCCCcc--eEEeeecc--ccccccccccccccccccccccccceeeccccccceEEeeccceeeeEEEEec
Confidence                 111 111100  10000000  01111122222333344444333333333332222222110        0012


Q ss_pred             EEEEEEeCCeEEEE----EEEEecCCceeEEEEEeeCCCCCceeEEEEEEEecCCeEEEEeEEEEEcccccceeeEEecC
Q psy5522         447 ISYQVLGRGDVIMA----DTITVPGNKMSTVIRFLATYAMAPTAHVIVQYVREDGEVVADGLDLELEGGLQNFVSANVSP  522 (1397)
Q Consensus       447 ~~ylV~srG~Iv~~----~~~~~~~~~~~~~~~l~vt~~maP~a~l~vy~v~~~gevvads~~~~V~~~f~nkV~l~~s~  522 (1397)
                      ..|.++.++-++..    ..+..............++-.-.|.+++..+++.++|+++.|+..|.++.+++|+|.+.+++
T Consensus       479 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~v~~~~~n~v~l~~~~  558 (1436)
T KOG1366|consen  479 PLYHFTYSGMIVFTITHGSHMVARVPTETVNDSACVKVLDKPVGLLVSFGIDSSGEVVGDSLKFKVEGDLKNKVALVANK  558 (1436)
T ss_pred             ceeEEeeccceeEEEEEeeEEecccceeeeccceeEEeeeeceeeEEEEEeccccccccceeEEEeeccccceEEEEEcc
Confidence            33444444444333    33322222345566667777777877888889999999999999999999999999999999


Q ss_pred             CCCCCCceEEEEEEecCCcEEEEEEEeeecccccCCCCCChhHHhHhccccCCCCCCCCccccccccccC----------
Q psy5522         523 DETEPGSNIQINLEAKPNSYIGLLAVDQKVLLLKTGNDIGKEDVMRELRSYDETDTSKLPLVENLRERYP----------  592 (1397)
Q Consensus       523 ~~~~Pg~~v~l~V~a~p~S~v~L~aVD~sVllL~~~~~lt~~~v~~~l~~y~~~~~~~~~~~~~~~~~~p----------  592 (1397)
                      ....||+++.+++...|+|+||+++||+|+++|+..|+++.+++...|..|+......    .. .+++|          
T Consensus       559 ~~~~~~~~~~~~~~~~~~s~vgl~~vd~s~~~m~~~n~~~~~~~~~~l~~~~~~~~~~----~~-~~~~~~~~~~~~~~~  633 (1436)
T KOG1366|consen  559 SAKEPGNNVALEVTDSPDSFVGLSAVDQSVLLMDAANDLTLEGVTFLLSEYDLTNQGG----KS-RGRNPGELVNKKKVT  633 (1436)
T ss_pred             cccCCCCcceeeecCCCceeecchhhhhhhhhcccccchhhhcceeeecccccccccc----Cc-cccCchhhhhhhhhh
Confidence            9999999999999999999999999999999999999999999999999998665311    00 11111          


Q ss_pred             CCcchhhHhhhcCeEEEecccccccCcccccccC---------CCCCCCcccchhhccccccccccccceecccCcceeE
Q psy5522         593 GSFTAQATFEKAGAIVMTNGYVHERNPWVYYKSL---------NDPPDDMLDGEEQLLSQVTTSVTQLTVRKHFPETWLF  663 (1397)
Q Consensus       593 ~~~d~~~~f~~~Gl~v~Tn~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~vR~~FpETwiW  663 (1397)
                      .+.+.+..++.+|+.+++|..+..+..+..+...         .....++..........+......++||++|||||||
T Consensus       634 ~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~VRk~FPEtWlw  713 (1436)
T KOG1366|consen  634 NAKYKYSTSKGDGLRFFCNPGIRSPSDASRRILVGRECSSVRRDLSGAGMPILAHALTNEDLSVPSKPRVRKHFPETWLW  713 (1436)
T ss_pred             cccchhhccccccceeeecCCccchhhhhhhccccccceeeeccccCCCcchheeecccccccCCCCcchhccCCcceee
Confidence            2356678888999999999888765443211000         0001111110000000011234678999999999999


Q ss_pred             EEEEeccCceEEEEEEcCCccccEEEEEEEEeCCCceeeeeCCeeEEEeeeEEEEecCCceeeeCCEEEEEEEEEcCCCc
Q psy5522         664 QMEETGFDGKVMVNEKVPDSITSWVLSAFSVDSLYGLGLMDMPKKLRVFRPFFISLDLPYSVMRGEVVAIPVVVFNYLSQ  743 (1397)
Q Consensus       664 ~~~~~~~~G~a~v~~tvPDtiT~W~~~Afa~s~~~G~G~a~~~~~l~vfkpffv~l~LPysv~rGE~~~l~~~VfNyl~~  743 (1397)
                      +.+.++.. ..++.+++||+||+|.++|||+|++.|+|++ ++.++++||||||+++|||||+|||+|.|+++||||+++
T Consensus       714 ~~~~v~~~-~~~~~v~~PDtiT~W~~~~f~ls~~~Gl~v~-~~~~l~~fQpfFi~l~lPySV~RgE~i~l~~tv~NYl~k  791 (1436)
T KOG1366|consen  714 DDVNVGRR-VKELTVTVPDTITTWVASGFSLSEDKGLGVA-PTTSLRVFQPFFIELNLPYSVRRGEQIALRVTVFNYLTK  791 (1436)
T ss_pred             eeEecCCc-eEEEEEEcCCccceEEEEEEEEcCCceeEec-CCceEEEeeceeEEecCceeEEeCcEeEEeEEEecccCc
Confidence            99999888 9999999999999999999999999999999 577799999999999999999999999999999999999


Q ss_pred             cEEEEEEEeecCCeeeeecCCCccCCCCCCcceeeeEEEEEeCCCeeEEEEEEEeceeeeEEEEEEEEeCCCCceeEEEE
Q psy5522         744 DLVADVTLENVGQFDFADFSNEVDAAPQPKFEVFRRKKLTIKANSGSTTTFVITPKELGYIGIKVTATSNLAGDSMEGKL  823 (1397)
Q Consensus       744 ~~~v~v~L~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~a~~~~tv~f~v~p~~lG~v~itV~A~~~~~~D~v~k~L  823 (1397)
                      +++|.|.+..+++++....+..+.....  .....+.....++++.+.+.|+|+|..+|.++++++|.+..++|+|+|+|
T Consensus       792 ~~~v~V~l~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~v~~~~s~~~~~~~v~p~~~g~~~~~~~~~t~~~~dav~k~l  869 (1436)
T KOG1366|consen  792 ELDVSVILLSSEDFCYDATSKANEQVGV--SQRSVQVSRVTPAQSGKLVYFPVRPGVIGLILVTISALTLLGGDAVHKTL  869 (1436)
T ss_pred             ceEEEEEEccCCCeeeecccccCccccc--ccceEEEEEEcccccceEEEEEeccccccccceEEEEeecCCCccEEEEE
Confidence            9999999999998887654433221110  01222334455699999999999999999999999999999999999999


Q ss_pred             EecCCCeeeEEEEEEEEecCCCceeEEEEEecCCCCcCCCCcEEEEEEEcccccCcccCcchhhccCCCchhhhhcccch
Q psy5522         824 LVKPEGETQYKNKAIFVDLRKNKTFSVNVTLDMPKNIVPGSEHVEVSAVGDLLGPSIPNLANLIKMPFGCGEQNMLNFVP  903 (1397)
Q Consensus       824 ~V~peG~~~~~t~s~~l~~~~~~~~~~~i~l~~P~~~v~gs~~~~vsv~gd~l~~~~~~l~~Ll~~P~GCgEQ~m~~~~p  903 (1397)
                      +|+|||++++..++.+||++.  +.++++++.+|+++|+||+++++++.||++|++++|+++|++||||||||||+.|+|
T Consensus       870 ~V~~eG~~~~~~~~~~i~l~~--~~~~~~~~~~p~~~V~~S~~~~~~~vgD~lGps~~~l~~Ll~lP~GcGEQnMi~faP  947 (1436)
T KOG1366|consen  870 LVEPEGITQERNFSVLIDLSV--SASKELSLGVPNDVVPGSERARISVVGDVLGPSMNNLSNLLRLPYGCGEQNMINFAP  947 (1436)
T ss_pred             EEeccCccceeeeeEEeccCC--CcCcceeecCCCCcCccccceeeEEEecccccChhhhHHhccCCCCcccceeecccc
Confidence            999999999999999999874  578999999999999999999999999999999999999999999999999999999


Q ss_pred             HHHHHHHHHHhccccHHHHHHHHHHHHhccccceeeecCCCCccccCCCCCCCCcchhHHHHHHHHHccCCCccCHHHHH
Q psy5522         904 NIVVLEYLKKTYQLTDAIEAKASRYLETGYQQELTYRRPDGSFSAFGTTDPNGSTWLTAFVAKSFRQAASHTTIDESVIL  983 (1397)
Q Consensus       904 ~i~~l~yL~~~~~l~~~~~~~a~~~l~~gy~~~l~y~~~dGsfs~~~~~~~~~s~wlTAyvl~~~~~a~~~i~vd~~~~~  983 (1397)
                      |||+++||+.++|++++++++|+++|++||||+|+|+|+|||||+|+..+.++++||||||+++|.+|+++|.||+..+.
T Consensus       948 ni~v~~YL~~t~q~~~~~k~ka~~~l~~GyqrqL~yk~~DgSySaFg~~~~~~stWLtafvlr~f~~a~~~i~id~~~i~ 1027 (1436)
T KOG1366|consen  948 NIYVLKYLPKTNQLTPELKRKALKFLEQGYQRQLTYKRADGSYSAFGSSDRSGSTWLTAFVLRVFSQAKEYIFIDPNVIT 1027 (1436)
T ss_pred             hhhHHHHHhhhhccChhHHHHHHHHHHHHHHHHHhhhccCCChhhhcCCCCcccHHHHHHHHHHhhhccCceEecHHHHH
Confidence            99999999999999999999999999999999999999999999999977899999999999999999999999999999


Q ss_pred             HHHHHHHhcccCCCccccchHhHHHhhcCCCCccEEEEEEEEeccccceeEEEEeeeeEEEEEEEEEEeeccCCCCCcee
Q psy5522         984 EALAWLSSNQAVNGSFPEVGKVSHADMQGGAAKGLALTAYTLLPKKTRAVNMTATGSGFAVVYISYQYNINVTGAWPMFT 1063 (1397)
Q Consensus       984 ~~~~WL~~~q~~~G~F~stqdvl~al~~~~~~~~v~lt~~v~lp~~~~~~~v~~~G~G~~~~q~~~~Yn~~~~~~~~~f~ 1063 (1397)
                      ++++||.++|+.+|+|.+..+|++..+++++...+.+++....                                     
T Consensus      1028 ~a~~wl~~~Qk~~GsF~e~~~v~~~~~qg~l~~~~~l~~~~l~------------------------------------- 1070 (1436)
T KOG1366|consen 1028 QALNWLSQQQKENGSFKEVGEVLHNEMQGGLQDIVALTAIVLS------------------------------------- 1070 (1436)
T ss_pred             HHHHHHHHhhccCceEeccccccchhhhcCCCCcceeeeEeee-------------------------------------
Confidence            9999999999999999999999999999887666677776544                                     


Q ss_pred             eccccccCCCCCceEEEEEEeecCCCCccEEEEEecCCcccccCCCCchhhcccCceeEEEEeCCeEEEEEEeeecCCcc
Q psy5522        1064 LDPQVDKNSDSNHLQLSICSGFIGEGDSNMAVMEVSLPSGFTVDSDALPSLQVSQNVKRVETKNGNTMVVLYFAYHQVLP 1143 (1397)
Q Consensus      1064 l~v~~~~~~~~~~~~l~vc~~y~~~~~s~Mailev~l~SGf~~~~~~l~~l~~~~~i~~~e~~~~~~~v~lY~~~~~~~~ 1143 (1397)
                                                    +++++                             +               
T Consensus      1071 ------------------------------~~~~~-----------------------------~--------------- 1076 (1436)
T KOG1366|consen 1071 ------------------------------VILDV-----------------------------D--------------- 1076 (1436)
T ss_pred             ------------------------------eeccc-----------------------------h---------------
Confidence                                          55555                             1               


Q ss_pred             ceeeecccccccchhhHHHHHHHHHhhcCCC-CchHHHHHHHHHHhhCCCCchHHHHHHHhhcccccCC-cccccccCCC
Q psy5522        1144 WTLMHYLVSKYPRMNTINKAVDYIVKNLAGT-EDAYAIAICSYALHLAQHPVKDVAFNLLESKAHNEDG-KKWWKRAERP 1221 (1397)
Q Consensus      1144 ~~~~~~~~~~~~~~~~~~~a~~~l~~~~~~~-~~~y~~al~aYal~la~~~~~~~~~~~L~~~a~~~~~-~~~W~~~~~~ 1221 (1397)
                                   ...+.+|..|+++.++.. .++|+.|++||||.+++++.+..++.+|+.+|+.+++ ..||..+.+.
T Consensus      1077 -------------~~~~~~a~~~~~~~~~~~~~~~~~~a~~ayAl~l~~sp~a~~~~~~l~~~a~~~~d~~~~~~~~~~~ 1143 (1436)
T KOG1366|consen 1077 -------------NASIQKAVAYLEESLDSGSMDVYTVAITAYALQLAKSPQAAKALAKLKSLARVEGDRRYWWASALKA 1143 (1436)
T ss_pred             -------------hhHHHHHHHHHhhccCccccccchHHHHHHHHHhccCchHHHHHHhhhhhhcccCCceeeeeccccc
Confidence                         135788999999999865 7899999999999999999999988889999999877 5666665443


Q ss_pred             CCCCCCCCCCCCcccchhhHHHHHHhhccCCcccchhhhhhhhhcccCCCCccccccC----------------------
Q psy5522        1222 EDKKNPWAQVPNSVDVEMTSYALLSYLDRGLVEDTLPILTWLVTQQNDQGGFASTQMS---------------------- 1279 (1397)
Q Consensus      1222 ~~~~~~~~~~~~s~~vE~t~Y~lL~~~~~~~~~~a~~i~~Wl~~q~n~~Ggf~stQdT---------------------- 1279 (1397)
                      .+... |.+++++++|||||||||+++...+..++.||+|||++|||..|||+|||||                      
T Consensus      1144 ~~~~~-~~~q~~s~~VE~tsYaLL~~~~~~~~~~~~pivrWl~~qr~~~GGf~STQdTvvalqaLs~y~~~~~~~~~~~~ 1222 (1436)
T KOG1366|consen 1144 KNAVK-YSPQARSIDVETTAYALLAYLLLAQVDYALPIVRWLVEQRNALGGFSSTQDTVVALQALSEYAALSHTETLSLQ 1222 (1436)
T ss_pred             cCccc-CCCccchhhccchHHHHHHHHHhcccCcCchhHhhhhhhhcccCceeehHHHHHHHHHHHHhhhhcccceeEEE
Confidence            33212 5678899999999999999999888889999999999999999999999999                      


Q ss_pred             --------------------------------------------------------------------------------
Q psy5522        1280 -------------------------------------------------------------------------------- 1279 (1397)
Q Consensus      1280 -------------------------------------------------------------------------------- 1279 (1397)
                                                                                                      
T Consensus      1223 V~i~~~~~~~~~~~~~~~~~N~lllQ~~~~p~~~~~v~~ta~G~g~~~~q~~~~Ynv~~~~~~~~f~l~~~~~~~~~~~~ 1302 (1436)
T KOG1366|consen 1223 VTIQVPGKSSGTSQVDSTPNNLLLLQSVSLPSLHKTVCFTAKGQGCATVQISYTYNLLTPKSDPRFFLIVSTKPEPVEFD 1302 (1436)
T ss_pred             EEEEecccCCCceeeeccccchhhcceeecccCCCceEEEEecceeEEEEeeEEEeecCcccCCCceeeeeeccccccCc
Confidence                                                                                            


Q ss_pred             -----------------------------------------------------------------------CCcccccee
Q psy5522        1280 -----------------------------------------------------------------------KKELCPTVS 1288 (1397)
Q Consensus      1280 -----------------------------------------------------------------------~~~~c~~~~ 1288 (1397)
                                                                                             ++..|+.|+
T Consensus      1303 ~s~~~~~~~~~~~~~~~~~~SnMAIldVklpSGf~~~~~sl~~le~~~~v~~~~~~~~~~~v~~Y~D~v~~~~~~cv~F~ 1382 (1436)
T KOG1366|consen 1303 LSNIVVVYTTCCERTGSSGRSNMAILDVKLPSGFRADHESLGLLEQLEDVYRVEVENDHRRVLIYFDSVSGNETFCVRFE 1382 (1436)
T ss_pred             cccceEEEEEEEeecCCCCCCCceeeeeecccCcccChhhHHHhhccceEEEEEEecCCcceEEEeeccCCCCceEEEEE
Confidence                                                                                   567799999


Q ss_pred             ccccccccccCceeEEEeeccccchhheeeeccccccccccccCCCCCCcc
Q psy5522        1289 AFRTHKVAQQKPVPVSIYDYYDQSRRARMFYEPRVATLCDICEGEDCSKDT 1339 (1397)
Q Consensus      1289 ~~~~~~v~~~~p~~v~~ydyy~~~~~~~~~y~~~~~~~c~~c~~~~c~~~~ 1339 (1397)
                      |.|.++|+++|||+|+|||||+|+++|++||+++..++||+|++++|...|
T Consensus      1383 a~~~~~V~~~kpa~v~VYDYY~pee~~t~~Y~~~~~~~~~~c~~~~c~~~c 1433 (1436)
T KOG1366|consen 1383 AVQSFPVANLKPASVTVYDYYEPEEQATRFYNAESSSSCDVCEGAQCKCPC 1433 (1436)
T ss_pred             EEEEeehhccCCceEEEeeccCchhhhhhhhcccccchhhhcccccCCcCC
Confidence            999999999999999999999999999999999888899999999998755


No 2  
>COG2373 Large extracellular alpha-helical protein [General function prediction only]
Probab=100.00  E-value=1.7e-64  Score=666.61  Aligned_cols=812  Identities=18%  Similarity=0.239  Sum_probs=611.9

Q ss_pred             ccEEEEEcCCCcccCCceeEEEEEEEcCCCc-cCCCcceEEEEECCCCCeEEeeeeeccCCceeeeeeecCCCCCCeeEE
Q psy5522          97 SYSVFIQLDKAIYKPGDLVRFRAIVLNSHLK-PSVTGALEIFITDGKGNRIKQWTRALTTRGVFSADLQLSKSPVLGDWN  175 (1397)
Q Consensus        97 ~~~vfIqTDKpIYkPGqtV~fRvv~ld~~l~-P~~~~~~~v~I~DP~g~~I~qw~~~~~~~Gi~~~sf~L~~~~~lG~y~  175 (1397)
                      ...+|++|||+||||||+||+.++.++.+-+ +..+.++.+.+.+|+|..+++-.....+.|..+++|+||++++.|.|+
T Consensus       393 ~~k~y~ftDRglYRpGE~v~~~~~~R~~~~~~a~~~~p~~l~v~~PdG~~~~~~~~~~~~~G~~~~~~~l~~na~tG~w~  472 (1621)
T COG2373         393 GLKVYLFTDRGLYRPGETVHVNALLRDFDGKTALDNQPLKLRVLDPDGSVLRTLTITLDEEGLYELSFPLPENALTGGYT  472 (1621)
T ss_pred             ceEEEEecCcccCCCCceeeeeeeehhhcccccccCCCeEEEEECCCCcEEEEEEEeccccCceEEeeeCCCCCCcceEE
Confidence            5789999999999999999999999999887 667899999999999999999988889999999999999999999999


Q ss_pred             EEEEEcC--eEEEEEEEEeeecCCcEEEEEeCCCceEeeCCeEEEEEEEEEecCccccc-eEEEEEEec-----------
Q psy5522         176 ITINVLD--QKFTKRFTVAEYVLPKFEVNVNVPPHATFKDSKVVISVNAKYTYGKPVKG-EATITAYPT-----------  241 (1397)
Q Consensus       176 I~v~~~~--~~~~~~F~VeEYvLPkFeV~v~~p~~i~~~~~~~~v~v~AkYtyGkPV~G-~v~v~v~~~-----------  241 (1397)
                      |++...+  ...+.+|+|+||++|+|+|.++..+..+..++.+.++|.++|+||.|++| +++-.+..+           
T Consensus       473 l~~~~~~~~~~~s~~f~V~df~p~r~~i~l~~~k~~~~~g~~v~~~v~~~yL~GaPa~g~~~~~~l~lr~~~~~~~~~~~  552 (1621)
T COG2373         473 LELYTGGKSAVISMSFRVEDFIPDRFKINLTLDKTEWVPGKDVKIKVDLRYLYGAPAAGLTVQGELDLRPTRFSVPGFPG  552 (1621)
T ss_pred             EEEEeCCccceeeeeEEhhHhCCceEEEecccccccccCCCcEEEEEEEEecCCCcccCceeeeEEEeecccccccCCcc
Confidence            9999987  68899999999999999999999998888999999999999999999998 455323221           


Q ss_pred             cccCccccc-----cccce-eEEeecCceEEEEEechhhhccccccceEEEEEEE-EEECCcceEEeeEEEEEEEEeeeE
Q psy5522         242 IFSGVIQPL-----FQTPV-RKVVPIDGKTVIEFDVVKELQLTDEYERNIHFDVA-VEEALTGRRQNNTGSVVFHKHKYK  314 (1397)
Q Consensus       242 ~~~g~~~~~-----~~~~~-~~~~d~~G~~~~~~~~~~~~~~~~~~~~~l~V~a~-VtE~gTG~~~~~~~~i~i~~~~~~  314 (1397)
                      +.||...++     +.+.. ...+|.+|++.+.+++.+ ..    ..+.+.+.+. +.| +.|+.++.+.++.|.....-
T Consensus       553 ~~fg~~~~~~~~~~~~~~~~~~~~D~~G~a~~~l~~~~-~~----s~~~~~~~~~~~~~-~ggR~v~~s~~~~v~p~~~l  626 (1621)
T COG2373         553 FTFGLADEEPNSLTFEEELELTVTDGKGKASLDLDLAE-TP----SPYQLLVASSEVEE-DGGRAVTRSLSVRVSPLDEL  626 (1621)
T ss_pred             eeeeccccccccccccccccccccCCCCcEeEeccccc-CC----CceEEEeecceeec-cCCceeccceeEEEEcCcce
Confidence            122322111     11111 256889999999998866 21    1233333333 545 55999999999999999988


Q ss_pred             EEEEe--cCCcccCCCeEEEEEEEEccCCCccCCCCceEEEE-E--eeecc--cc------------ccceeeEeeCCCe
Q psy5522         315 MDLIK--SSEYFKPGLKYTAYMKLTHHDGTPVTDNNNMVQVR-H--GFSYD--ES------------KYEANQYKLDRNG  375 (1397)
Q Consensus       315 i~f~~--~~~~~kpG~p~~~~v~v~~~dG~P~~~~~~~V~v~-~--~~~~~--~~------------~~~~~~~ttd~~G  375 (1397)
                      +.+..  ...+.+-..+..+.+.+.+++|+.++..+..+.+. .  .+.|.  +.            .....+..+..++
T Consensus       627 iGvr~~f~~~~~~~~~~~~~~~i~v~~~g~~~a~~~l~~~li~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~  706 (1621)
T COG2373         627 IGVRPKFAADYVKDITPAKFSIIAVSADGSKVALKGLKLVLIEVKRNYNWTYSEGESWKYESVEIETVILEDTLDISANE  706 (1621)
T ss_pred             eeeecccchhhhcccCceEEEEEEecccccccccCCceeEEEEEeecceEEEecCcceecccccchhhhcccceeecCCC
Confidence            88776  34555555889999999999999998654444422 1  11221  11            0112223456777


Q ss_pred             eEEEEEeCCCCCceeEEEEEEEecC-cceE--EEEEEe-e---cCCCCCeEEEEecCCCCCCCCeEEEEEEecCCcceEE
Q psy5522         376 MIKLVYYPPANENVTTLGIEAEYLD-IKEW--FSTISA-S---ESPSNSFIQAALLTQNPKVNKDVELEINSTAPLKYIS  448 (1397)
Q Consensus       376 ~~~f~i~~~~~~~~~~l~l~a~~~~-~~~~--~~~v~~-~---~S~s~s~L~i~~~~~~~~~G~~~~v~v~s~~~~~~~~  448 (1397)
                      .+.|++++.  +|.|+|.|+  +.+ ...+  .|.+.. +   +..++..|.|..+++.|++||++++.|+.+.+.. -.
T Consensus       707 ~~~it~p~~--~G~Yrl~v~--~~d~~~~~~~~f~v~~~~~~~~~~~~d~l~i~ldK~~Y~~Gdtakl~I~~p~ag~-~~  781 (1621)
T COG2373         707 TTIITVPVG--WGDYRLEVE--DSDGGILASVSFDVGGSGNDTVTGTPDVLKLKLDKPEYQPGDTAKLLITTPYAGE-EA  781 (1621)
T ss_pred             ceEEEeccc--CCceEEEEE--cCCCCeEEEEEEeccccccccccCCccceEEecCccccCCCceEEEEeccCCCCc-ce
Confidence            788888664  477666555  444 2211  222211 1   1223678999999999999999999999999886 67


Q ss_pred             EEEEeCCeEEEEEEEEecCCceeEEEEEeeCCCCCceeEEEEEEEecCCeE--------EEEeEEEEEcccccceeeEEe
Q psy5522         449 YQVLGRGDVIMADTITVPGNKMSTVIRFLATYAMAPTAHVIVQYVREDGEV--------VADGLDLELEGGLQNFVSANV  520 (1397)
Q Consensus       449 ylV~srG~Iv~~~~~~~~~~~~~~~~~l~vt~~maP~a~l~vy~v~~~gev--------vads~~~~V~~~f~nkV~l~~  520 (1397)
                      ++++++.+++....+.+..  ....+++++++.|.+.+++.|-.++| |+.        -...++|.|... .+++.|++
T Consensus       782 litve~Dk~~~~~~~~v~~--~~~~~~i~v~~~~~~~~yV~v~f~r~-~~~~~~~~~~~~yG~~~~~v~~~-~r~L~v~l  857 (1621)
T COG2373         782 LITVESDKVIQVQLLKVPA--NGLTLEIPVPAAWEHNAYVSVSFVRP-GDASQSFTPARAYGFVPLPVDVK-DRRLDVQL  857 (1621)
T ss_pred             eEEEecccceeeEEEecCC--CCceEEEEccHHhcCCeEEEEEEEcC-CChhhhccCccceEEEecccCCc-cceeEEEe
Confidence            7888999999999888754  44689999999999999999999985 322        134567888766 67888887


Q ss_pred             c-CCCCCCCceEEEEEEecC-----CcEEEEEEEeeecccccCCCCCChhHHhHhccccCCCCCCCCccccccccccCCC
Q psy5522         521 S-PDETEPGSNIQINLEAKP-----NSYIGLLAVDQKVLLLKTGNDIGKEDVMRELRSYDETDTSKLPLVENLRERYPGS  594 (1397)
Q Consensus       521 s-~~~~~Pg~~v~l~V~a~p-----~S~v~L~aVD~sVllL~~~~~lt~~~v~~~l~~y~~~~~~~~~~~~~~~~~~p~~  594 (1397)
                      . +.+.+||+.++++|+..-     .+.+.+.|||++||.|.+....++.+.|-.-...                    +
T Consensus       858 ~~p~k~~Pg~~l~i~l~~~~~~~~~~a~vtV~aVDegIl~lt~y~~Pdp~~~ff~~r~l--------------------~  917 (1621)
T COG2373         858 KTPKKVKPGEPLTIKLSVSGAGGPKSAYVTVAAVDEGILNLTGYEAPDPVKFFFGQRRL--------------------G  917 (1621)
T ss_pred             cCccccCCCCeeEEEEEEcCCCCCcceEEEEEEeeeehhhccCCCCCCHHHhhhccccc--------------------c
Confidence            6 678889999999999643     7999999999999999886655554443111110                    1


Q ss_pred             cchhhHhhhcCeEEEecccccccCcccccccCCCCCCCcccchhhccccccccccccceecccCcceeEEE-EEeccCce
Q psy5522         595 FTAQATFEKAGAIVMTNGYVHERNPWVYYKSLNDPPDDMLDGEEQLLSQVTTSVTQLTVRKHFPETWLFQM-EETGFDGK  673 (1397)
Q Consensus       595 ~d~~~~f~~~Gl~v~Tn~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vR~~FpETwiW~~-~~~~~~G~  673 (1397)
                      .+.+++   .|..+--.       ..   .+...+.+|..+.        ......+..|+ |..+++|.. +.+|++|+
T Consensus       918 v~t~di---~~~ii~~~-------~~---~~~~~~~Ggd~~~--------~~~~~~~p~r~-~~~va~~~g~v~~d~~G~  975 (1621)
T COG2373         918 VETRDI---YGRIIDGF-------QG---VAKPLRTGGDGGR--------AAKQGEPPTRR-EKPVAAFSGIVETDAQGK  975 (1621)
T ss_pred             eEeeec---cccccccc-------cc---cccccccCCCccc--------cccccCCCccc-ccchhhhccceEEcCCCc
Confidence            111111   11110000       00   0000111111100        00112334555 666677775 56999999


Q ss_pred             EEEEEEcCCccccEEEEEEEEeCCCceeeeeCCeeEEEeeeEEEEecCCceeeeCCEEEEEEEEEcCCCccEEEEEEEee
Q psy5522         674 VMVNEKVPDSITSWVLSAFSVDSLYGLGLMDMPKKLRVFRPFFISLDLPYSVMRGEVVAIPVVVFNYLSQDLVADVTLEN  753 (1397)
Q Consensus       674 a~v~~tvPDtiT~W~~~Afa~s~~~G~G~a~~~~~l~vfkpffv~l~LPysv~rGE~~~l~~~VfNyl~~~~~v~v~L~~  753 (1397)
                      ++|+|.+||..|+||+||+|++ +..+|-++  ++++|++|+.++++||+|++.||++.+.+.|||..++...+   +..
T Consensus       976 a~vt~~iPdfng~~RvmavA~t-~~~vG~~~--~~~~v~~p~i~~~s~PrflapGD~~~~~l~v~N~~~~~g~~---~~~ 1049 (1621)
T COG2373         976 ATVTFPIPDFNGRLRVMAVAWT-DGAVGQAE--AELIVRKPVIVEASMPRFLAPGDRSSAGLDVFNLTGKAGDV---LTV 1049 (1621)
T ss_pred             EEEEEecCCcCceEEEEEEEec-Cccccccc--eeEEecCCeEEecCCCceecCCccceeEEEEeccCCCCcce---eec
Confidence            9999999999999999999999 67999994  99999999999999999999999999999999999999873   333


Q ss_pred             cCCeeeeecCCCccCCCCCCcceeeeEEEEEeCCCeeEEEEEEEece-eeeEEEEEEEEeCCCCceeEEEEEecCCCeee
Q psy5522         754 VGQFDFADFSNEVDAAPQPKFEVFRRKKLTIKANSGSTTTFVITPKE-LGYIGIKVTATSNLAGDSMEGKLLVKPEGETQ  832 (1397)
Q Consensus       754 s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~a~~~~tv~f~v~p~~-lG~v~itV~A~~~~~~D~v~k~L~V~peG~~~  832 (1397)
                      +..+.+..              ....+++.+..+++.++.|++.+.. .|.+.++++... ..+|.+++.+.|.|...++
T Consensus      1050 ~~~l~~~~--------------~~~~~~~~l~~g~~~~l~~~l~a~~g~g~~~~~~~~~~-~~~~~~~~~~~vr~~~~~~ 1114 (1621)
T COG2373        1050 EGTLKFNG--------------KEAPQTLTLAEGSATTLFFPLRALDGSGKIDATLSVSG-QVGDKLERHLNVRPAVPPQ 1114 (1621)
T ss_pred             cceecccc--------------cccceEEEecCCCceEEEEEeeccCCCceeeEEEEecC-cccceeeeeeccccCccce
Confidence            33343321              1246899999999999999999986 555554444432 2689999999999999999


Q ss_pred             EEEEEEEEecCCCceeEEEEEecCCCCcCCCCcEEEEEEEcccccCcccCcchhhccCCCchhhhhcccchHHHHHHHHH
Q psy5522         833 YKNKAIFVDLRKNKTFSVNVTLDMPKNIVPGSEHVEVSAVGDLLGPSIPNLANLIKMPFGCGEQNMLNFVPNIVVLEYLK  912 (1397)
Q Consensus       833 ~~t~s~~l~~~~~~~~~~~i~l~~P~~~v~gs~~~~vsv~gd~l~~~~~~l~~Ll~~P~GCgEQ~m~~~~p~i~~l~yL~  912 (1397)
                      +.++...+.+.      ..++..++.++.+++..+.+++++.+....-..++.|.+|||||.||+.|++.|.+|+.....
T Consensus      1115 ~~~~~~~l~~~------~~~~~~~l~~l~~~~~~~~l~ls~~~~~~~~~~~~~L~~YPygc~EQt~S~~~pll~~~~~~~ 1188 (1621)
T COG2373        1115 TQVKTVVLAPG------FTISGDLLAGLSPQAVNAQLQLSGTPAADIAALLDVLDDYPYGCAEQTASRLLPLLYAQKATA 1188 (1621)
T ss_pred             eeeEEEecccc------cccCchhhhhcCccccceEEEeccCccccHHHHHHHHHhCCccchhhhhhhHHHHHhhhhhhc
Confidence            99888877642      455677888999999999999999988877778999999999999999999999999887665


Q ss_pred             HhccccHHHHHHHHHHHHhccccceeeecCCCCccccCCCCCCCCcchhHHHHHHHHHccCC-CccCHHHHHHHHHHHHh
Q psy5522         913 KTYQLTDAIEAKASRYLETGYQQELTYRRPDGSFSAFGTTDPNGSTWLTAFVAKSFRQAASH-TTIDESVILEALAWLSS  991 (1397)
Q Consensus       913 ~~~~l~~~~~~~a~~~l~~gy~~~l~y~~~dGsfs~~~~~~~~~s~wlTAyvl~~~~~a~~~-i~vd~~~~~~~~~WL~~  991 (1397)
                      ..    ....++....|+..+.|++.+|..+|+|++|++. .+++.|+||||..++++|+.. +.++...+.++..|+.+
T Consensus      1189 ~~----~~~~~~~~~~l~~a~~rL~~~Q~~~G~F~~W~~~-~~~d~~ltaYa~~Fl~~A~e~g~~vp~~~~~~~~~~~~~ 1263 (1621)
T COG2373        1189 DP----GAADNDLRARLQDAIGRLLSLQGSNGAFGLWGGN-GSGDPWLTAYAVDFLLRAREQGYSVPSDALNQMLERLLE 1263 (1621)
T ss_pred             cc----cccchhHHHHHHHHHHHHHhhhhcCCceeecCCC-CCcchhhhHHHHHHHhhhhhcCcCCCHHHHHHHHHHHHH
Confidence            44    1223344445667788999999999999999886 689999999999999999877 89999999999898876


Q ss_pred             cccCCC
Q psy5522         992 NQAVNG  997 (1397)
Q Consensus       992 ~q~~~G  997 (1397)
                      .-...|
T Consensus      1264 ~l~n~~ 1269 (1621)
T COG2373        1264 YLQNPG 1269 (1621)
T ss_pred             HHhCcC
Confidence            544433


No 3  
>cd02896 complement_C3_C4_C5 Proteins similar to C3, C4 and C5 of vertebrate complement.  The vertebrate complement system, comprised of a large number of distinct plasma proteins, is an effector of both the acquired and innate immune systems.  The point of convergence of the classical, alternative and lectin pathways of the complement system is the proteolytic activation of C3. C4 plays a key role in propagating the classical and lectin pathways. C5 participates in the classical and alternative pathways. The thioester bond located within the structure of C3 and C4 is central to the function of complement. C5 does not contain an active thioester bond.
Probab=100.00  E-value=2.6e-56  Score=516.16  Aligned_cols=279  Identities=39%  Similarity=0.644  Sum_probs=242.6

Q ss_pred             ccCcchhhccCCCchhhhhcccchHHHHHHHHHHhcccc---HHHHHHHHHHHHhccccceeeecCCCCccccCCCCCCC
Q psy5522         880 IPNLANLIKMPFGCGEQNMLNFVPNIVVLEYLKKTYQLT---DAIEAKASRYLETGYQQELTYRRPDGSFSAFGTTDPNG  956 (1397)
Q Consensus       880 ~~~l~~Ll~~P~GCgEQ~m~~~~p~i~~l~yL~~~~~l~---~~~~~~a~~~l~~gy~~~l~y~~~dGsfs~~~~~~~~~  956 (1397)
                      .+||++|++|||||+||||++|+|++|+++||+.+++++   ++.++++.++|++||+|+++||++||||++|++.  .+
T Consensus         2 ~~~l~~L~~~P~GC~EQ~~~~~~p~~~~~~yl~~~~~~~~~~~~~~~~~~~~i~~g~~r~l~~q~~dGsf~~w~~~--~~   79 (297)
T cd02896           2 PEGLEKLIRLPTGCGEQTMIKLAPTVYALRYLDTTNQWEKLGPERRDEALKYIRQGYQRQLSYRKPDGSYAAWKNR--PS   79 (297)
T ss_pred             chhHHHHHhCCCCcHHHHHHHHHHHHHHHHHHHHhccccccchhHHHHHHHHHHHHHHHHHhccCCCCCccCCCCC--Cc
Confidence            468999999999999999999999999999999999865   4667899999999999999999999999999764  68


Q ss_pred             CcchhHHHHHHHHHccCCCccCHHHHHHHHHHHHhcccCCCccccchHhHHHhhcCCCC---ccEEEEEEEEecccccee
Q psy5522         957 STWLTAFVAKSFRQAASHTTIDESVILEALAWLSSNQAVNGSFPEVGKVSHADMQGGAA---KGLALTAYTLLPKKTRAV 1033 (1397)
Q Consensus       957 s~wlTAyvl~~~~~a~~~i~vd~~~~~~~~~WL~~~q~~~G~F~stqdvl~al~~~~~~---~~v~lt~~v~lp~~~~~~ 1033 (1397)
                      ++||||||+++|.+|++++.||+..++++++||.++|+++|||.+...+++.-+.++++   .++.+|+++.+       
T Consensus        80 s~wlTA~v~~~l~~a~~~~~v~~~~l~~a~~wL~~~Q~~dG~f~e~~~~~~~~m~gg~~~~~~~~~lTA~vl~-------  152 (297)
T cd02896          80 STWLTAFVVKVFSLARKYIPVDQNVICGSVNWLISNQKPDGSFQEPSPVIHREMTGGVEGSEGDVSLTAFVLI-------  152 (297)
T ss_pred             chhhHHHHHHHHHHHHHcCCCCHHHHHHHHHHHHhcCCCCCeeCCCCCccChhccCCccccCCCccchHHHHH-------
Confidence            99999999999999999999999999999999999999999999877777766666543   57889999877       


Q ss_pred             EEEEeeeeEEEEEEEEEEeeccCCCCCceeeccccccCCCCCceEEEEEEeecCCCCccEEEEEecCCcccccCCCCchh
Q psy5522        1034 NMTATGSGFAVVYISYQYNINVTGAWPMFTLDPQVDKNSDSNHLQLSICSGFIGEGDSNMAVMEVSLPSGFTVDSDALPS 1113 (1397)
Q Consensus      1034 ~v~~~G~G~~~~q~~~~Yn~~~~~~~~~f~l~v~~~~~~~~~~~~l~vc~~y~~~~~s~Mailev~l~SGf~~~~~~l~~ 1113 (1397)
                                                                                  +|+|.+-+.+          
T Consensus       153 ------------------------------------------------------------aL~~~~~~~~----------  162 (297)
T cd02896         153 ------------------------------------------------------------ALQEARSICP----------  162 (297)
T ss_pred             ------------------------------------------------------------HHHhhhcccc----------
Confidence                                                                        7777721100          


Q ss_pred             hcccCceeEEEEeCCeEEEEEEeeecCCccceeeecccccccchhhHHHHHHHHHhhcCCCCchHHHHHHHHHHhhCCCC
Q psy5522        1114 LQVSQNVKRVETKNGNTMVVLYFAYHQVLPWTLMHYLVSKYPRMNTINKAVDYIVKNLAGTEDAYAIAICSYALHLAQHP 1193 (1397)
Q Consensus      1114 l~~~~~i~~~e~~~~~~~v~lY~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~l~~~~~~~~~~y~~al~aYal~la~~~ 1193 (1397)
                                    ..         ...              ....+.+|+.||+++.+++.++|++||++|||+|+|++
T Consensus       163 --------------~~---------~~~--------------~~~~i~rA~~yL~~~~~~~~~~Y~~Al~ayALal~~~~  205 (297)
T cd02896         163 --------------PE---------VQN--------------LDQSIRKAISYLENQLPNLQRPYALAITAYALALADSP  205 (297)
T ss_pred             --------------cc---------chh--------------hHHHHHHHHHHHHHhcccCCChHHHHHHHHHHHHcCCh
Confidence                          00         011              45689999999999999999999999999999999999


Q ss_pred             chHHHHHHHhhcccccCCcccccccCCCCCCCCCCC-CCCCcccchhhHHHHHHhhccCCcccchhhhhhhhhcccCCCC
Q psy5522        1194 VKDVAFNLLESKAHNEDGKKWWKRAERPEDKKNPWA-QVPNSVDVEMTSYALLSYLDRGLVEDTLPILTWLVTQQNDQGG 1272 (1397)
Q Consensus      1194 ~~~~~~~~L~~~a~~~~~~~~W~~~~~~~~~~~~~~-~~~~s~~vE~t~Y~lL~~~~~~~~~~a~~i~~Wl~~q~n~~Gg 1272 (1397)
                      .+..++++|+++++.+++..||..+..     ..|+ ..+++++||+||||||++|..+++++|.|||+||++|||++||
T Consensus       206 ~~~~a~~~L~~~~~~d~~~~~~~~~~~-----~~~~~~~~~~~~vE~TAYALLall~~~~~~~a~~iv~WL~~qr~~~Gg  280 (297)
T cd02896         206 LSHAANRKLLSLAKRDGNGWYWWTIDS-----PYWPVPGPSAITVETTAYALLALLKLGDIEYANPIARWLTEQRNYGGG  280 (297)
T ss_pred             hhHHHHHHHHHHhhhCCCcceeccCcC-----ccCCCCCCchhhhHHHHHHHHHHHhcCCchhHHHHHHHHHhcCCCCCC
Confidence            888899999999999998877755311     1232 3567889999999999999999999999999999999999999


Q ss_pred             ccccccC
Q psy5522        1273 FASTQMS 1279 (1397)
Q Consensus      1273 f~stQdT 1279 (1397)
                      |+|||||
T Consensus       281 f~sTQdT  287 (297)
T cd02896         281 FGSTQDT  287 (297)
T ss_pred             eehHHHH
Confidence            9999999


No 4  
>cd02897 A2M_2 Proteins similar to alpha2-macroglobulin (alpha (2)-M). This group also contains the pregnancy zone protein (PZP).  Alpha(2)-M and PZP are broadly specific proteinase inhibitors. Alpha (2)-M is a major carrier protein in serum. The structural thioester of alpha (2)-M, is involved in the immobilization and entrapment of proteases.  PZP is a trace protein in the plasma of non-pregnant females and males which is elevated in pregnancy. Alpha (2)-M and PZ bind to placental protein-14 and may modulate its activity in T-cell growth and cytokine production contributing to fetal survival. It has been suggested that thioester bond cleavage promotes the binding of PZ and alpha (2)-M to the CD91 receptor clearing them from circulation.
Probab=100.00  E-value=1.4e-50  Score=469.24  Aligned_cols=280  Identities=49%  Similarity=0.798  Sum_probs=242.6

Q ss_pred             cccCcchhhccCCCchhhhhcccchHHHHHHHHHHhccccHHHHHHHHHHHHhccccceeeecCCCCccccCCCCCCCCc
Q psy5522         879 SIPNLANLIKMPFGCGEQNMLNFVPNIVVLEYLKKTYQLTDAIEAKASRYLETGYQQELTYRRPDGSFSAFGTTDPNGST  958 (1397)
Q Consensus       879 ~~~~l~~Ll~~P~GCgEQ~m~~~~p~i~~l~yL~~~~~l~~~~~~~a~~~l~~gy~~~l~y~~~dGsfs~~~~~~~~~s~  958 (1397)
                      ++++|++|+++||||+||||++++|++|+++||+.++++++++++++.++|++||+|+++||++||||++|++.+..+++
T Consensus         1 ~~~~l~~L~~~P~gc~Eq~~~~~~p~~~~~~yl~~~~~~~~~~~~~~~~~l~~g~~~~~~~q~~dGsf~~w~~~~~~~~~   80 (292)
T cd02897           1 ALQNLDNLLRMPYGCGEQNMVNFAPNIYVLDYLKATGQLTPEIESKALGFLRTGYQRQLTYKHSDGSYSAFGESDKSGST   80 (292)
T ss_pred             ChhhHHHHHhCCCCcHHHHHHHHHHHHHHHHHHHHhcCCCHHHHHHHHHHHHHHHHHHHhccCCCCCeecccCCCCCcch
Confidence            35789999999999999999999999999999999999999999999999999999999999999999999765457899


Q ss_pred             chhHHHHHHHHHccCCCccCHHHHHHHHHHHHhcccCCCccccchHhHHHhhcCCCCccEEEEEEEEeccccceeEEEEe
Q psy5522         959 WLTAFVAKSFRQAASHTTIDESVILEALAWLSSNQAVNGSFPEVGKVSHADMQGGAAKGLALTAYTLLPKKTRAVNMTAT 1038 (1397)
Q Consensus       959 wlTAyvl~~~~~a~~~i~vd~~~~~~~~~WL~~~q~~~G~F~stqdvl~al~~~~~~~~v~lt~~v~lp~~~~~~~v~~~ 1038 (1397)
                      |+||||+++|.+|++++.+|+..++++++||.++|+++|||.+...+++..+++.....+.+|+++.+            
T Consensus        81 wlTa~v~~~L~~a~~~~~v~~~~i~ra~~wL~~~Q~~dG~f~~~~~~~~~~~~~~~~~~~~~TA~vl~------------  148 (292)
T cd02897          81 WLTAFVLKSFAQARPFIYIDENVLQQALTWLSSHQKSNGCFREVGRVFHKAMQGGVDDEVALTAYVLI------------  148 (292)
T ss_pred             hhHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHhcCCCCCCCCCCcccChhhcCCCCCCcchHHHHHH------------
Confidence            99999999999999889999999999999999999999999876656555454443455677887766            


Q ss_pred             eeeEEEEEEEEEEeeccCCCCCceeeccccccCCCCCceEEEEEEeecCCCCccEEEEEecCCcccccCCCCchhhcccC
Q psy5522        1039 GSGFAVVYISYQYNINVTGAWPMFTLDPQVDKNSDSNHLQLSICSGFIGEGDSNMAVMEVSLPSGFTVDSDALPSLQVSQ 1118 (1397)
Q Consensus      1039 G~G~~~~q~~~~Yn~~~~~~~~~f~l~v~~~~~~~~~~~~l~vc~~y~~~~~s~Mailev~l~SGf~~~~~~l~~l~~~~ 1118 (1397)
                                                                             |+++.                    
T Consensus       149 -------------------------------------------------------aL~~~--------------------  153 (292)
T cd02897         149 -------------------------------------------------------ALLEA--------------------  153 (292)
T ss_pred             -------------------------------------------------------HHHhc--------------------
Confidence                                                                   77776                    


Q ss_pred             ceeEEEEeCCeEEEEEEeeecCCccceeeecccccccchhhHHHHHHHHHhhcCCCCchHHHHHHHHHHhhCCCCchHHH
Q psy5522        1119 NVKRVETKNGNTMVVLYFAYHQVLPWTLMHYLVSKYPRMNTINKAVDYIVKNLAGTEDAYAIAICSYALHLAQHPVKDVA 1198 (1397)
Q Consensus      1119 ~i~~~e~~~~~~~v~lY~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~l~~~~~~~~~~y~~al~aYal~la~~~~~~~~ 1198 (1397)
                               +.          +.              .++++.+|+.||++....+.++|.+||++|||++++++....+
T Consensus       154 ---------g~----------~~--------------~~~~i~~a~~yL~~~~~~~~~~y~~al~a~AL~~~~~~~~~~~  200 (292)
T cd02897         154 ---------GL----------PS--------------ERPVVEKALSCLEAALDSISDPYTLALAAYALTLAGSEKRPEA  200 (292)
T ss_pred             ---------CC----------cc--------------ccHHHHHHHHHHHHhcccCCCHHHHHHHHHHHHHcCCccHHHH
Confidence                     32          11              4578999999999998888999999999999999997666779


Q ss_pred             HHHHhhcccccCCcccccccCCCCCCCCCCCCCCCcccchhhHHHHHHhhccC--CcccchhhhhhhhhcccCCCCcccc
Q psy5522        1199 FNLLESKAHNEDGKKWWKRAERPEDKKNPWAQVPNSVDVEMTSYALLSYLDRG--LVEDTLPILTWLVTQQNDQGGFAST 1276 (1397)
Q Consensus      1199 ~~~L~~~a~~~~~~~~W~~~~~~~~~~~~~~~~~~s~~vE~t~Y~lL~~~~~~--~~~~a~~i~~Wl~~q~n~~Ggf~st 1276 (1397)
                      +++|+.+++.+++..||..... ......+++.+.+.+||+||||||++++.+  +++.+.+|++||.+|||++|||+||
T Consensus       201 ~~~l~~~~~~~~~~~~W~~~~~-~~~~~~~~~~~~~~~ve~TAyaLlall~~~~~~~~~~~~~v~WL~~~q~~~Ggf~sT  279 (292)
T cd02897         201 LKKLDELAISEDGTKHWSRPPP-SEEGPSYYWQAPSAEVEMTAYALLALLSAGGEDLAEALPIVKWLAKQRNSLGGFSST  279 (292)
T ss_pred             HHHHHHHHhccCCCCCCCcCCC-cccccccCCCCCcchHHHHHHHHHHHHHcCCccHhHHHHHHHHHHHcCCCCCCcccH
Confidence            9999999999889999987521 111223334556789999999999999877  8899999999999999999999999


Q ss_pred             ccC
Q psy5522        1277 QMS 1279 (1397)
Q Consensus      1277 QdT 1279 (1397)
                      |||
T Consensus       280 Qdt  282 (292)
T cd02897         280 QDT  282 (292)
T ss_pred             HHH
Confidence            999


No 5  
>PF07678 A2M_comp:  A-macroglobulin complement component;  InterPro: IPR011626 This domain covers the complement component region of the alpha-2-macroglobulin family. The alpha-macroglobulin (aM) family of proteins includes protease inhibitors [], typified by the human tetrameric a2-macroglobulin (a2M); they belong to the MEROPS proteinase inhibitor family I39, clan IL. These protease inhibitors share several defining properties, which include (i) the ability to inhibit proteases from all catalytic classes, (ii) the presence of a 'bait region' and a thiol ester, (iii) a similar protease inhibitory mechanism and (iv) the inactivation of the inhibitory capacity by reaction of the thiol ester with small primary amines. aM protease inhibitors inhibit by steric hindrance []. The mechanism involves protease cleavage of the bait region, a segment of the aM that is particularly susceptible to proteolytic cleavage, which initiates a conformational change such that the aM collapses about the protease. In the resulting aM-protease complex, the active site of the protease is sterically shielded, thus substantially decreasing access to protein substrates. Two additional events occur as a consequence of bait region cleavage, namely (i) the h-cysteinyl-g-glutamyl thiol ester becomes highly reactive and (ii) a major conformational change exposes a conserved COOH-terminal receptor binding domain [] (RBD). RBD exposure allows the aM protease complex to bind to clearance receptors and be removed from circulation []. Tetrameric, dimeric, and, more recently, monomeric aM protease inhibitors have been identified [, ].; GO: 0005615 extracellular space; PDB: 1QSJ_D 1QQF_A 4ACQ_C 2B39_B 2WIN_H 2I07_B 2ICF_B 2XWJ_D 3G6J_B 2NOJ_C ....
Probab=100.00  E-value=1.3e-49  Score=447.46  Aligned_cols=233  Identities=45%  Similarity=0.767  Sum_probs=199.7

Q ss_pred             hccccceeeecCCCCccccCCCCCCCCcchhHHHHHHHHHccCCCccCHHHHHHHHHHHHhcccCCCccccchHhHHHhh
Q psy5522         931 TGYQQELTYRRPDGSFSAFGTTDPNGSTWLTAFVAKSFRQAASHTTIDESVILEALAWLSSNQAVNGSFPEVGKVSHADM 1010 (1397)
Q Consensus       931 ~gy~~~l~y~~~dGsfs~~~~~~~~~s~wlTAyvl~~~~~a~~~i~vd~~~~~~~~~WL~~~q~~~G~F~stqdvl~al~ 1010 (1397)
                      +||||+|+||++||||++| ..+.++|+||||||+++|.+|+++++||+..|+++++||.++|+++|||.+.+.+++..+
T Consensus         1 ~GYqr~L~y~~~DGsfs~f-~~~~~~s~WLTAfv~k~f~~a~~~i~vd~~~i~~a~~wL~~~Q~~dG~F~e~~~~~~~~~   79 (246)
T PF07678_consen    1 QGYQRQLSYRRSDGSFSAF-SSDSPSSTWLTAFVVKVFSQAKKYIFVDENVICRAVKWLISQQQPDGSFEEDGPVIHREM   79 (246)
T ss_dssp             HHHHHHHTTB-TTSSBBSS-TTTSSBBHHHHHHHHHHHHHHTTTS-CEHHHHHHHHHHHHHHBETTSEB--SSS-SSGGG
T ss_pred             CchHHHhcCCCCCCCeecc-ccCCcccHHHHHHHHHHHHHHHHhhcCCHHHHHHHHHHHHHhhcCCCccccCCCcccccc
Confidence            5999999999999999999 555689999999999999999999999999999999999999999999999888988888


Q ss_pred             cCCCCccEEEEEEEEeccccceeEEEEeeeeEEEEEEEEEEeeccCCCCCceeeccccccCCCCCceEEEEEEeecCCCC
Q psy5522        1011 QGGAAKGLALTAYTLLPKKTRAVNMTATGSGFAVVYISYQYNINVTGAWPMFTLDPQVDKNSDSNHLQLSICSGFIGEGD 1090 (1397)
Q Consensus      1011 ~~~~~~~v~lt~~v~lp~~~~~~~v~~~G~G~~~~q~~~~Yn~~~~~~~~~f~l~v~~~~~~~~~~~~l~vc~~y~~~~~ 1090 (1397)
                      +++++.++.||+||.+                                                                
T Consensus        80 ~g~~~~~~~lTA~Vli----------------------------------------------------------------   95 (246)
T PF07678_consen   80 QGGVEDDIALTAYVLI----------------------------------------------------------------   95 (246)
T ss_dssp             SGGGTHHHHHHHHHHH----------------------------------------------------------------
T ss_pred             CCCCCCCeeehHHHHH----------------------------------------------------------------
Confidence            8887888999999988                                                                


Q ss_pred             ccEEEEEecCCcccccCCCCchhhcccCceeEEEEeCCeEEEEEEeeecCCccceeeecccccccchhhHHHHHHHHHhh
Q psy5522        1091 SNMAVMEVSLPSGFTVDSDALPSLQVSQNVKRVETKNGNTMVVLYFAYHQVLPWTLMHYLVSKYPRMNTINKAVDYIVKN 1170 (1397)
Q Consensus      1091 s~Mailev~l~SGf~~~~~~l~~l~~~~~i~~~e~~~~~~~v~lY~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~l~~~ 1170 (1397)
                         |++|.+=..+                        ..         .+.              ...+++||+.||+++
T Consensus        96 ---AL~e~~~~~~------------------------~~---------~~~--------------~~~~i~kA~~~L~~~  125 (246)
T PF07678_consen   96 ---ALLEAGSLCD------------------------SE---------KPE--------------YENAINKALNYLERH  125 (246)
T ss_dssp             ---HHHHCHCCHT------------------------TT---------HHC--------------HHHHHHHHHHHHHHH
T ss_pred             ---HHHhhhhhcc------------------------cc---------chh--------------hHHHHHHHHHHHHHh
Confidence               8888821000                        00         123              567999999999999


Q ss_pred             cCCCCchHHHHHHHHHHhhCC-CCchHHHHHHHhhcccccCCcccccccCCCCCCCCCCCCCCCcccchhhHHHHHHhhc
Q psy5522        1171 LAGTEDAYAIAICSYALHLAQ-HPVKDVAFNLLESKAHNEDGKKWWKRAERPEDKKNPWAQVPNSVDVEMTSYALLSYLD 1249 (1397)
Q Consensus      1171 ~~~~~~~y~~al~aYal~la~-~~~~~~~~~~L~~~a~~~~~~~~W~~~~~~~~~~~~~~~~~~s~~vE~t~Y~lL~~~~ 1249 (1397)
                      ++++.++|++||+||||+|++ ++.+..++++|++.|+.+++.+||+......+...+|+ .+++.+||+||||||+++.
T Consensus       126 ~~~~~~~Y~lAl~aYAL~la~~~~~~~~~~~~L~~~a~~~~~~~~W~~~~~~~~~~~~~~-~~~s~~vEtTaYaLLa~l~  204 (246)
T PF07678_consen  126 LDNIQDPYTLALVAYALALAGDSPQASKLLNKLNSMATTEGGLRYWSSDESSSSSSSPWS-RGSSLDVETTAYALLALLK  204 (246)
T ss_dssp             HGCTSSHHHHHHHHHHHHHTTTCHHHHHHHHHHHCHCEETTTTCEE-SSSSSSSSSSTTT--SHHHHHHHHHHHHHHHHH
T ss_pred             ccccCCHHHHHHHHHHHHhhcccchHHHHHHHHHHhhhhccccCcccCCccccccccccc-ccchHHHHHHHHHHHHHHh
Confidence            999999999999999999996 44457799999999999999999999876554433333 4789999999999999999


Q ss_pred             cCCcccchhhhhhhhhcccCCCCccccccC
Q psy5522        1250 RGLVEDTLPILTWLVTQQNDQGGFASTQMS 1279 (1397)
Q Consensus      1250 ~~~~~~a~~i~~Wl~~q~n~~Ggf~stQdT 1279 (1397)
                      .+++.+|.|||+||++|||++|||+|||||
T Consensus       205 ~~~~~~~~~iv~WL~~qr~~~Ggf~STQdT  234 (246)
T PF07678_consen  205 RGDLEEASPIVRWLISQRNSGGGFGSTQDT  234 (246)
T ss_dssp             HTCHHHHHHHHHHHHHCTTTTSSTSSHHHH
T ss_pred             cccHHHHHHHHHHHHHhcCCCCccCcHHHH
Confidence            999999999999999999999999999999


No 6  
>cd02891 A2M_like Proteins similar to alpha2-macroglobulin (alpha (2)-M).  Alpha (2)-M is a major carrier protein in serum. It is a broadly specific proteinase inhibitor.  The structural thioester of alpha (2)-M, is involved in the immobilization and entrapment of proteases. This group contains another broadly specific proteinase inhibitor:  pregnancy zone protein (PZP).  PZP is a trace protein in the plasma of non-pregnant females and males which is elevated in pregnancy. Alpha (2)-M and PZ bind to placental protein-14 and may modulate its activity in T-cell growth and cytokine production thereby protecting the allogeneic fetus from attack by the maternal immune system. This group also contains C3, C4 and C5 of vertebrate complement.  The vertebrate complement is an effector of both the acquired and innate immune systems The point of convergence of the classical, alternative and lectin pathways of the complement system is the proteolytic activation of C3. C4 plays a key role in propaga
Probab=100.00  E-value=2e-36  Score=350.22  Aligned_cols=270  Identities=45%  Similarity=0.718  Sum_probs=226.5

Q ss_pred             ccCcchhhccCCCchhhhhcccchHHHHHHHHHHhccccHHHHHHHHHHHHhccccceeeecCCCCccccCCCCCCCCcc
Q psy5522         880 IPNLANLIKMPFGCGEQNMLNFVPNIVVLEYLKKTYQLTDAIEAKASRYLETGYQQELTYRRPDGSFSAFGTTDPNGSTW  959 (1397)
Q Consensus       880 ~~~l~~Ll~~P~GCgEQ~m~~~~p~i~~l~yL~~~~~l~~~~~~~a~~~l~~gy~~~l~y~~~dGsfs~~~~~~~~~s~w  959 (1397)
                      .+++++|++|||||+||+|++++|++++.+||...+++++..++++...|++|+++++++|++||||++|++.+ .+++|
T Consensus         2 ~~~l~~L~~yPygC~EQ~~s~~~p~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~Q~~dGgf~~w~~~~-~~~~~   80 (282)
T cd02891           2 LGNLDYLLRYPYGCGEQTMSRAAPNLYVLKYLDATGQLTPEIREKALEYIRKGYQRLLTYQRSDGSFSAWGNSD-SGSTW   80 (282)
T ss_pred             cchHHHHHcCCCCcHHHHHHHHHHHHHHHHHHHHhCcCCHHHHHHHHHHHHHHHHHHHhhcCCCCCccccCCCC-CCchH
Confidence            46799999999999999999999999999999999999888888999999999999999999999999998753 68999


Q ss_pred             hhHHHHHHHHHccCCCccCHHHHHHHHHHHHhcccCCCccccchHhHHHhhcCCCCccEEEEEEEEeccccceeEEEEee
Q psy5522         960 LTAFVAKSFRQAASHTTIDESVILEALAWLSSNQAVNGSFPEVGKVSHADMQGGAAKGLALTAYTLLPKKTRAVNMTATG 1039 (1397)
Q Consensus       960 lTAyvl~~~~~a~~~i~vd~~~~~~~~~WL~~~q~~~G~F~stqdvl~al~~~~~~~~v~lt~~v~lp~~~~~~~v~~~G 1039 (1397)
                      +||||+++|..|++...+|+..+.++++||.++|+.+|+|.......+.-+.+.....+.+++++..             
T Consensus        81 ~Ta~~~~~L~~a~~~~~v~~~~i~ra~~~L~~~q~~~g~~~~~~~~~~~~~~~~~~~~~~~tA~al~-------------  147 (282)
T cd02891          81 LTAYVVKFLSQARKYIDVDENVLARALGWLVPQQKEDGSFRELGPVIHREMKGGVDDSVSLTAYVLI-------------  147 (282)
T ss_pred             HHHHHHHHHHHHHHcCCCCHHHHHHHHHHHHhccCCCCCcCCCCCccCHhhcCCcCCCcchHHHHHH-------------
Confidence            9999999999999887899999999999999999999999877654222112222233445555443             


Q ss_pred             eeEEEEEEEEEEeeccCCCCCceeeccccccCCCCCceEEEEEEeecCCCCccEEEEEecCCcccccCCCCchhhcccCc
Q psy5522        1040 SGFAVVYISYQYNINVTGAWPMFTLDPQVDKNSDSNHLQLSICSGFIGEGDSNMAVMEVSLPSGFTVDSDALPSLQVSQN 1119 (1397)
Q Consensus      1040 ~G~~~~q~~~~Yn~~~~~~~~~f~l~v~~~~~~~~~~~~l~vc~~y~~~~~s~Mailev~l~SGf~~~~~~l~~l~~~~~ 1119 (1397)
                                                                            ++.+.                     
T Consensus       148 ------------------------------------------------------~L~~~---------------------  152 (282)
T cd02891         148 ------------------------------------------------------ALAEA---------------------  152 (282)
T ss_pred             ------------------------------------------------------HHHHh---------------------
Confidence                                                                  44444                     


Q ss_pred             eeEEEEeCCeEEEEEEeeecCCccceeeecccccccchhhHHHHHHHHHhhcCCCCchHHHHHHHHHHhhCCCCc-hHHH
Q psy5522        1120 VKRVETKNGNTMVVLYFAYHQVLPWTLMHYLVSKYPRMNTINKAVDYIVKNLAGTEDAYAIAICSYALHLAQHPV-KDVA 1198 (1397)
Q Consensus      1120 i~~~e~~~~~~~v~lY~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~l~~~~~~~~~~y~~al~aYal~la~~~~-~~~~ 1198 (1397)
                              +..                         ....+.+|+.||.++.....++|.+|++||||++.|+.. +.++
T Consensus       153 --------g~~-------------------------~~~~~~~a~~~L~~~~~~~~~~~~~a~la~al~~~g~~~~~~~~  199 (282)
T cd02891         153 --------GKA-------------------------CDASIEKALAYLETQLDGLLDPYALAILAYALALAGDSTRADEA  199 (282)
T ss_pred             --------ccc-------------------------chHHHHHHHHHHHHhcccCCChHHHHHHHHHHHHcCccHHHHHH
Confidence                    321                         245789999999999887889999999999999999755 4679


Q ss_pred             HHHHhhcccccCCcccccccCCCCCCCCCCCCCCCcccchhhHHHHHHhhccCCcccchhhhhhhhhcccCCCCcccccc
Q psy5522        1199 FNLLESKAHNEDGKKWWKRAERPEDKKNPWAQVPNSVDVEMTSYALLSYLDRGLVEDTLPILTWLVTQQNDQGGFASTQM 1278 (1397)
Q Consensus      1199 ~~~L~~~a~~~~~~~~W~~~~~~~~~~~~~~~~~~s~~vE~t~Y~lL~~~~~~~~~~a~~i~~Wl~~q~n~~Ggf~stQd 1278 (1397)
                      +++|.+.+...++..+|...        .+.+..+++.+|+|+|+||+.+..++...+.+|++||.+||+..|+|.||||
T Consensus       200 l~~l~~~~~~~~~~~~~~~~--------~~~~~~s~~~~~a~a~all~~~~~~~~~~~~~~~~~L~~~~~~~~~~~sTq~  271 (282)
T cd02891         200 LKKLLEAAREKGGTAHWSLS--------WPGDYGSSLRVEATAYALLALLKLGDLEEAGPIAKWLAQQRNSGGGFLSTQD  271 (282)
T ss_pred             HHHHHHHhhhcCCcccccCC--------CCCCCCchhhHHHHHHHHHHHHhcCChhhHHHHHHHHHHcCCCCCCcccHHH
Confidence            99999999988888888211        2233446688999999999999888888999999999999999999999999


Q ss_pred             C
Q psy5522        1279 S 1279 (1397)
Q Consensus      1279 T 1279 (1397)
                      |
T Consensus       272 t  272 (282)
T cd02891         272 T  272 (282)
T ss_pred             H
Confidence            9


No 7  
>PF07703 A2M_N_2:  Alpha-2-macroglobulin family N-terminal region;  InterPro: IPR011625 This is a domain of the alpha-2-macroglobulin family. The alpha-macroglobulin (aM) family of proteins includes protease inhibitors [], typified by the human tetrameric a2-macroglobulin (a2M); they belong to the MEROPS proteinase inhibitor family I39, clan IL. These protease inhibitors share several defining properties, which include (i) the ability to inhibit proteases from all catalytic classes, (ii) the presence of a 'bait region' and a thiol ester, (iii) a similar protease inhibitory mechanism and (iv) the inactivation of the inhibitory capacity by reaction of the thiol ester with small primary amines. aM protease inhibitors inhibit by steric hindrance []. The mechanism involves protease cleavage of the bait region, a segment of the aM that is particularly susceptible to proteolytic cleavage, which initiates a conformational change such that the aM collapses about the protease. In the resulting aM-protease complex, the active site of the protease is sterically shielded, thus substantially decreasing access to protein substrates. Two additional events occur as a consequence of bait region cleavage, namely (i) the h-cysteinyl-g-glutamyl thiol ester becomes highly reactive and (ii) a major conformational change exposes a conserved COOH-terminal receptor binding domain [] (RBD). RBD exposure allows the aM protease complex to bind to clearance receptors and be removed from circulation []. Tetrameric, dimeric, and, more recently, monomeric aM protease inhibitors have been identified [, ].; PDB: 2QKI_D 3L3O_D 3NMS_A 2ICF_A 2A73_A 2ICE_D 2HR0_A 2A74_A 2XWJ_G 3OHX_A ....
Probab=99.90  E-value=8.7e-23  Score=209.48  Aligned_cols=135  Identities=36%  Similarity=0.517  Sum_probs=124.9

Q ss_pred             EEEEecCCCCCCCCeEEEEEEecCCcceEEEEEEeCCeEEEEEEEEecCCceeEEEEEeeCCCCCceeEEEEEEEec-CC
Q psy5522         419 IQAALLTQNPKVNKDVELEINSTAPLKYISYQVLGRGDVIMADTITVPGNKMSTVIRFLATYAMAPTAHVIVQYVRE-DG  497 (1397)
Q Consensus       419 L~i~~~~~~~~~G~~~~v~v~s~~~~~~~~ylV~srG~Iv~~~~~~~~~~~~~~~~~l~vt~~maP~a~l~vy~v~~-~g  497 (1397)
                      |+|.++++.|+|||++++.+.++.+...++|+|++||+|+..+.+++..  +..+++|+++++|+|++++++|++++ +|
T Consensus         1 l~i~~~~~~~~~Ge~~~v~v~~~~~~~~~~~~v~s~g~I~~~~~~~~~~--~~~~~~~~v~~~~~P~~~v~~~~v~~~~g   78 (136)
T PF07703_consen    1 LQISTDKDSYKPGETAKVTVQSPFPNGTFLYLVESRGKIVSTGSVELKN--GSTTFEFPVTPDMAPNFYVLAYYVRPADG   78 (136)
T ss_dssp             EEEEE-SSSB-TTSEEEEEEEEESCESEEEEEEEETTEEEEEEEEECTT--TSSEEEEEE-GGGTSEEEEEEEEETTCTC
T ss_pred             CEEEcCCCCcCCCCEEEEEEEcCCCccEEEEEEEECCeEEEEEEEEecC--CcEEEEEecchhcCCcEEEEEEEEcCCCC
Confidence            6899999999999999999999998667999999999999999999765  34589999999999999999999998 99


Q ss_pred             eEEEEeEEEEEcccccceeeEEecCCCCCCCceEEEEEEecCCcEEEEEEEeeecccc
Q psy5522         498 EVVADGLDLELEGGLQNFVSANVSPDETEPGSNIQINLEAKPNSYIGLLAVDQKVLLL  555 (1397)
Q Consensus       498 evvads~~~~V~~~f~nkV~l~~s~~~~~Pg~~v~l~V~a~p~S~v~L~aVD~sVllL  555 (1397)
                      |+++|+++|.|+.|+.++|++++++++++||++++++|++.++|+|+|+|||||||+|
T Consensus        79 ~~~~~s~~i~V~~~~~~~v~l~~~~~~~~Pg~~~~~~i~~~~~s~v~l~~VD~sv~~L  136 (136)
T PF07703_consen   79 EVVADSVWIEVEPCFELKVELTASPDEYKPGEEVTLRIKAPPNSLVGLSAVDESVLLL  136 (136)
T ss_dssp             EEEEEEEEEEBGCSGSSSEEEEESSSSBTTTSEEEEEEEESTTEEEEEEEEEHHHHHH
T ss_pred             eEEEEEEEEEecccccceEEEEEecceeCCCCEEEEEEEeCCCCEEEEEEeEhhhhcC
Confidence            9999999999999999999999999999999999999999999999999999999976


No 8  
>PF00207 A2M:  Alpha-2-macroglobulin family;  InterPro: IPR001599 This entry contains serum complement C3 and C4 precursors and alpha-macrogrobulins.  The alpha-macroglobulin (aM) family of proteins includes protease inhibitors [], typified by the human tetrameric a2-macroglobulin (a2M); they belong to the MEROPS proteinase inhibitor family I39, clan IL. These protease inhibitors share several defining properties, which include (i) the ability to inhibit proteases from all catalytic classes, (ii) the presence of a 'bait region' and a thiol ester, (iii) a similar protease inhibitory mechanism and (iv) the inactivation of the inhibitory capacity by reaction of the thiol ester with small primary amines. aM protease inhibitors inhibit by steric hindrance []. The mechanism involves protease cleavage of the bait region, a segment of the aM that is particularly susceptible to proteolytic cleavage, which initiates a conformational change such that the aM collapses about the protease. In the resulting aM-protease complex, the active site of the protease is sterically shielded, thus substantially decreasing access to protein substrates. Two additional events occur as a consequence of bait region cleavage, namely (i) the h-cysteinyl-g-glutamyl thiol ester becomes highly reactive and (ii) a major conformational change exposes a conserved COOH-terminal receptor binding domain [] (RBD). RBD exposure allows the aM protease complex to bind to clearance receptors and be removed from circulation []. Tetrameric, dimeric, and, more recently, monomeric aM protease inhibitors have been identified [, ].; GO: 0004866 endopeptidase inhibitor activity; PDB: 3KLS_B 3PRX_C 3KM9_B 3PVM_C 3CU7_A 4E0S_A 4A5W_A 2PN5_A 3FRP_G 3HRZ_B ....
Probab=99.89  E-value=1.8e-22  Score=191.56  Aligned_cols=91  Identities=44%  Similarity=0.804  Sum_probs=81.2

Q ss_pred             ceeEEEEEe-ccCceEEEEEEcCCccccEEEEEEEEeCCCceeeeeCCeeEEEeeeEEEEecCCceeeeCCEEEEEEEEE
Q psy5522         660 TWLFQMEET-GFDGKVMVNEKVPDSITSWVLSAFSVDSLYGLGLMDMPKKLRVFRPFFISLDLPYSVMRGEVVAIPVVVF  738 (1397)
Q Consensus       660 TwiW~~~~~-~~~G~a~v~~tvPDtiT~W~~~Afa~s~~~G~G~a~~~~~l~vfkpffv~l~LPysv~rGE~~~l~~~Vf  738 (1397)
                      ||+|.+..+ |++|++++++++||+||+|+++|||++++.|||++ ++.+++++||||++++||||+++||++.++++||
T Consensus         1 T~~w~~~~~~d~~G~~~~~~~lPd~it~w~v~a~a~s~~~~~g~~-~~~~~~v~~p~~i~~~lP~~l~~GD~~~i~v~v~   79 (92)
T PF00207_consen    1 TWFWSPIVVTDANGKATFSFTLPDSITSWRVTAFAVSPTGGFGIA-EPPEITVFKPFFIQLNLPRSLRRGDQIQIPVTVF   79 (92)
T ss_dssp             EEEEEEEEECCTTEEEEEEEE-SSSSSEEEEEEEEEETTTEEEEE-CCEEEEEB-SEEEEEE--SEEETTSEEEEEEEEE
T ss_pred             CeEEeeeeccCCCCEEEEEEECCCCccEEEEEEEEECCCCcceEe-cceEEEEEeeEEEEcCCCcEEecCCEEEEEEEEE
Confidence            799999975 69999999999999999999999999999999999 4559999999999999999999999999999999


Q ss_pred             cCCCccEEEEEEE
Q psy5522         739 NYLSQDLVADVTL  751 (1397)
Q Consensus       739 Nyl~~~~~v~v~L  751 (1397)
                      ||++++++|+|+|
T Consensus        80 N~~~~~~~v~V~l   92 (92)
T PF00207_consen   80 NYTDKDQEVTVTL   92 (92)
T ss_dssp             E-SSS-EEEEEEE
T ss_pred             eCCCCCEEEEEEC
Confidence            9999999999986


No 9  
>PF07677 A2M_recep:  A-macroglobulin receptor;  InterPro: IPR009048 This entry represents the receptor-binding domain (RBD) of alpha-2-macroglobulin proteins. The RBD is located at the C terminus, its structure having an immunoglobulin-like fold consists of a sandwich of nine strands in two sheets with a Greek-key topology [, ]. The alpha-macroglobulin (aM) family of proteins includes protease inhibitors [], typified by the human tetrameric a2-macroglobulin (a2M); they belong to the MEROPS proteinase inhibitor family I39, clan IL. These protease inhibitors share several defining properties, which include (i) the ability to inhibit proteases from all catalytic classes, (ii) the presence of a 'bait region' and a thiol ester, (iii) a similar protease inhibitory mechanism and (iv) the inactivation of the inhibitory capacity by reaction of the thiol ester with small primary amines. aM protease inhibitors inhibit by steric hindrance []. The mechanism involves protease cleavage of the bait region, a segment of the aM that is particularly susceptible to proteolytic cleavage, which initiates a conformational change such that the aM collapses about the protease. In the resulting aM-protease complex, the active site of the protease is sterically shielded, thus substantially decreasing access to protein substrates. Two additional events occur as a consequence of bait region cleavage, namely (i) the h-cysteinyl-g-glutamyl thiol ester becomes highly reactive and (ii) a major conformational change exposes a conserved COOH-terminal receptor binding domain [] (RBD). RBD exposure allows the aM protease complex to bind to clearance receptors and be removed from circulation []. Tetrameric, dimeric, and, more recently, monomeric aM protease inhibitors have been identified [, ].; GO: 0005576 extracellular region; PDB: 1AYO_A 2B39_B 3HS0_C 3PRX_B 3PVM_D 3HRZ_C 3FRP_B 1EDY_A 3KLS_B 3KM9_B ....
Probab=99.84  E-value=7e-22  Score=186.17  Aligned_cols=49  Identities=49%  Similarity=0.739  Sum_probs=43.7

Q ss_pred             ccEEEEEecCCcccccCCCCchhhcccCc--eeEEEEeCCeEEEEEEeeecCC
Q psy5522        1091 SNMAVMEVSLPSGFTVDSDALPSLQVSQN--VKRVETKNGNTMVVLYFAYHQV 1141 (1397)
Q Consensus      1091 s~Mailev~l~SGf~~~~~~l~~l~~~~~--i~~~e~~~~~~~v~lY~~~~~~ 1141 (1397)
                      ||||||||+|||||.|+.++|++|.....  |+|+|++++++  +||||+++.
T Consensus         1 S~Mai~eV~lpSGf~~~~~~l~~l~~~~~~~~~r~E~~~~~V--~lY~d~i~~   51 (91)
T PF07677_consen    1 SNMAIMEVGLPSGFVPDQDSLKQLQNSNDVKLKRYEVDDNKV--ILYFDEIPS   51 (91)
T ss_dssp             BSSEEEEEE-STTEEEEHHHHHHHHCSTTESESEEEEETTEE--EEEESEBCS
T ss_pred             CceEEEEEcCCCceeEChhHhhccccCCceeEEEEEEcCCEE--EEEEeeccc
Confidence            79999999999999999999999988654  56999999985  999999985


No 10 
>KOG1366|consensus
Probab=99.84  E-value=4.2e-20  Score=243.37  Aligned_cols=199  Identities=26%  Similarity=0.366  Sum_probs=165.1

Q ss_pred             CCCCcchhHHHHHHHHHccCCCccCHHHHHHHHHHHHhcccCCCccccchHh---HHHhhcCCC-------CccEEEEE-
Q psy5522         954 PNGSTWLTAFVAKSFRQAASHTTIDESVILEALAWLSSNQAVNGSFPEVGKV---SHADMQGGA-------AKGLALTA- 1022 (1397)
Q Consensus       954 ~~~s~wlTAyvl~~~~~a~~~i~vd~~~~~~~~~WL~~~q~~~G~F~stqdv---l~al~~~~~-------~~~v~lt~- 1022 (1397)
                      .+.++++|||||++++..++.     +.+.++++||.+||+..|||.||||.   ++||++...       ...+++.. 
T Consensus      1154 ~s~~VE~tsYaLL~~~~~~~~-----~~~~pivrWl~~qr~~~GGf~STQdTvvalqaLs~y~~~~~~~~~~~~V~i~~~ 1228 (1436)
T KOG1366|consen 1154 RSIDVETTAYALLAYLLLAQV-----DYALPIVRWLVEQRNALGGFSSTQDTVVALQALSEYAALSHTETLSLQVTIQVP 1228 (1436)
T ss_pred             chhhccchHHHHHHHHHhccc-----CcCchhHhhhhhhhcccCceeehHHHHHHHHHHHHhhhhcccceeEEEEEEEec
Confidence            357899999999999988764     27788999999999999999999983   889888421       11233221 


Q ss_pred             --------------------EEEeccccceeEEEEeeeeEEEEEEEEEEeeccCCCCCceeeccccccC-C----CCCce
Q psy5522        1023 --------------------YTLLPKKTRAVNMTATGSGFAVVYISYQYNINVTGAWPMFTLDPQVDKN-S----DSNHL 1077 (1397)
Q Consensus      1023 --------------------~v~lp~~~~~~~v~~~G~G~~~~q~~~~Yn~~~~~~~~~f~l~v~~~~~-~----~~~~~ 1077 (1397)
                                          .+.+|..++.+.++|+|.||+++|++++||+.++++++.|.|.+++.+. +    +....
T Consensus      1229 ~~~~~~~~~~~~~~N~lllQ~~~~p~~~~~v~~ta~G~g~~~~q~~~~Ynv~~~~~~~~f~l~~~~~~~~~~~~~s~~~~ 1308 (1436)
T KOG1366|consen 1229 GKSSGTSQVDSTPNNLLLLQSVSLPSLHKTVCFTAKGQGCATVQISYTYNLLTPKSDPRFFLIVSTKPEPVEFDLSNIVV 1308 (1436)
T ss_pred             ccCCCceeeeccccchhhcceeecccCCCceEEEEecceeEEEEeeEEEeecCcccCCCceeeeeeccccccCccccceE
Confidence                                2466788999999999999999999999999999999999999987432 2    23345


Q ss_pred             EEEEEEeec-CCCCccEEEEEecCCcccccCCCCchhhcccCceeEEEEeCCeEEEEEEeeecC-Cccceeeeccccccc
Q psy5522        1078 QLSICSGFI-GEGDSNMAVMEVSLPSGFTVDSDALPSLQVSQNVKRVETKNGNTMVVLYFAYHQ-VLPWTLMHYLVSKYP 1155 (1397)
Q Consensus      1078 ~l~vc~~y~-~~~~s~Mailev~l~SGf~~~~~~l~~l~~~~~i~~~e~~~~~~~v~lY~~~~~-~~~~~~~~~~~~~~~ 1155 (1397)
                      ....|..+. .+++||||||||+|||||.++.++|.+|....++.+.|.+.++..|++|||+++ .+++.+.|+...-+|
T Consensus      1309 ~~~~~~~~~~~~~~SnMAIldVklpSGf~~~~~sl~~le~~~~v~~~~~~~~~~~v~~Y~D~v~~~~~~cv~F~a~~~~~ 1388 (1436)
T KOG1366|consen 1309 VYTTCCERTGSSGRSNMAILDVKLPSGFRADHESLGLLEQLEDVYRVEVENDHRRVLIYFDSVSGNETFCVRFEAVQSFP 1388 (1436)
T ss_pred             EEEEEEeecCCCCCCCceeeeeecccCcccChhhHHHhhccceEEEEEEecCCcceEEEeeccCCCCceEEEEEEEEEee
Confidence            556666555 467899999999999999999999999999999999999999877899999999 999999998888776


Q ss_pred             ch
Q psy5522        1156 RM 1157 (1397)
Q Consensus      1156 ~~ 1157 (1397)
                      ..
T Consensus      1389 V~ 1390 (1436)
T KOG1366|consen 1389 VA 1390 (1436)
T ss_pred             hh
Confidence            43


No 11 
>PF01835 A2M_N:  MG2 domain;  InterPro: IPR002890 The proteinase-binding alpha-macroglobulins (A2M) [] are large glycoproteins found in the plasma of vertebrates, in the hemolymph of some invertebrates and in reptilian and avian egg white. A2M-like proteins are able to inhibit all four classes of proteinases by a 'trapping' mechanism. They have a peptide stretch, called the 'bait region', which contains specific cleavage sites for different proteinases. When a proteinase cleaves the bait region, a conformational change is induced in the protein, thus trapping the proteinase. The entrapped enzyme remains active against low molecular weight substrates, whilst its activity toward larger substrates is greatly reduced, due to steric hindrance. Following cleavage in the bait region, a thiol ester bond, formed between the side chains of a cysteine and a glutamine, is cleaved and mediates the covalent binding of the A2M-like protein to the proteinase. This family includes the N-terminal region of the alpha-2-macroglobulin family. The inhibitor domains belong to MEROPS inhibitor family I39.; GO: 0004866 endopeptidase inhibitor activity; PDB: 2B39_B 3KLS_B 3PRX_C 3KM9_B 3PVM_C 3CU7_A 4E0S_A 4A5W_A 4ACQ_C 2P9R_B ....
Probab=99.80  E-value=3.6e-19  Score=171.94  Aligned_cols=93  Identities=37%  Similarity=0.716  Sum_probs=80.3

Q ss_pred             EEEEEcCCCcccCCceeEEEEEEEcCC--CccCCCcceEEEEECCCCCeEEeeee-eccCCceeeeeeecCCCCCCeeEE
Q psy5522          99 SVFIQLDKAIYKPGDLVRFRAIVLNSH--LKPSVTGALEIFITDGKGNRIKQWTR-ALTTRGVFSADLQLSKSPVLGDWN  175 (1397)
Q Consensus        99 ~vfIqTDKpIYkPGqtV~fRvv~ld~~--l~P~~~~~~~v~I~DP~g~~I~qw~~-~~~~~Gi~~~sf~L~~~~~lG~y~  175 (1397)
                      ++|||||||+|||||+|+||+++++.+  +.|..+.++.|+|.||+|+.|.+|.. ..+..|+++++|+||+++.+|.|+
T Consensus         1 ~~~i~TDr~iYrPGetV~~~~~~~~~~~~~~~~~~~~~~v~i~dp~g~~v~~~~~~~~~~~G~~~~~~~lp~~~~~G~y~   80 (99)
T PF01835_consen    1 KIFIQTDRPIYRPGETVHFRAIVRDLDNDFKPPANSPVTVTIKDPSGNEVFRWSVNTTNENGIFSGSFQLPDDAPLGTYT   80 (99)
T ss_dssp             EEEEEESSSEE-TTSEEEEEEEEEEECTTCSCESSEEEEEEEEETTSEEEEEEEEEETTCTTEEEEEEE--SS---EEEE
T ss_pred             CEEEECCccCcCCCCEEEEEEEEeccccccccccCCceEEEEECCCCCEEEEEEeeeeCCCCEEEEEEECCCCCCCEeEE
Confidence            589999999999999999999999887  66667899999999999999999999 789999999999999999999999


Q ss_pred             EEEEEc---CeEEEEEEEE
Q psy5522         176 ITINVL---DQKFTKRFTV  191 (1397)
Q Consensus       176 I~v~~~---~~~~~~~F~V  191 (1397)
                      |++...   +...+++|+|
T Consensus        81 i~~~~~~~~~~~~~~~F~V   99 (99)
T PF01835_consen   81 IRVKTDDDGGQSFSKTFQV   99 (99)
T ss_dssp             EEEEETTTTCEEEEEEEEE
T ss_pred             EEEEEccCCCCEEEEEEEC
Confidence            999993   3778999987


No 12 
>cd00688 ISOPREN_C2_like This group contains class II terpene cyclases, protein prenyltransferases beta subunit, two broadly specific proteinase inhibitors alpha2-macroglobulin (alpha (2)-M) and pregnancy zone protein (PZP) and, the C3 C4 and C5 components of vertebrate complement. Class II terpene cyclases include squalene cyclase (SQCY) and 2,3-oxidosqualene cyclase (OSQCY), these integral membrane proteins catalyze a cationic cyclization cascade converting linear triterpenes to fused ring compounds.  The protein prenyltransferases include protein farnesyltransferase (FTase) and geranylgeranyltransferase types I and II (GGTase-I and GGTase-II) which catalyze the carboxyl-terminal lipidation of Ras, Rab, and several other cellular signal transduction proteins, facilitating membrane associations and specific protein-protein interactions. Alpha (2)-M is a major carrier protein in serum and involved in the immobilization and entrapment of proteases. PZP is a pregnancy associated protein. 
Probab=99.71  E-value=1.4e-17  Score=193.16  Aligned_cols=262  Identities=34%  Similarity=0.497  Sum_probs=180.2

Q ss_pred             cCcchhhccCCC--------chhhhhcccchHHHHHHHHHHhccccHHHHHHHHHHHHhccccceeeecCCCCccccCCC
Q psy5522         881 PNLANLIKMPFG--------CGEQNMLNFVPNIVVLEYLKKTYQLTDAIEAKASRYLETGYQQELTYRRPDGSFSAFGTT  952 (1397)
Q Consensus       881 ~~l~~Ll~~P~G--------CgEQ~m~~~~p~i~~l~yL~~~~~l~~~~~~~a~~~l~~gy~~~l~y~~~dGsfs~~~~~  952 (1397)
                      .++++|+++|++        |+||+++.++|.+....+....+     .+.+....++.+++.++++|+.||+|++|...
T Consensus         3 ~~~~~L~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~l~~~q~~dG~~~~~~~~   77 (300)
T cd00688           3 KHLKYLLRYPYGDGHWYQSLCGEQTWSTAWPLLALLLLLAATG-----IRDKADENIEKGIQRLLSYQLSDGGFSGWGGN   77 (300)
T ss_pred             hHHHHHHHccccCCCccccCCcCcHHHHHHHHHHHHHhCCccc-----ccccchHHHHHHHHHHHhccCCCCCccCCCCC
Confidence            468899999999        99999999999888776654321     11122233445555666788999999999653


Q ss_pred             CCCCCcchhHHHHHHHHHccCCCccCHHHHHHHHHHHHhcccCCCccccchHhHHHhhcCCCCccEEEEEEEEeccccce
Q psy5522         953 DPNGSTWLTAFVAKSFRQAASHTTIDESVILEALAWLSSNQAVNGSFPEVGKVSHADMQGGAAKGLALTAYTLLPKKTRA 1032 (1397)
Q Consensus       953 ~~~~s~wlTAyvl~~~~~a~~~i~vd~~~~~~~~~WL~~~q~~~G~F~stqdvl~al~~~~~~~~v~lt~~v~lp~~~~~ 1032 (1397)
                      . .+++|.|+|++++|..+++...++...+.++++||.++|+.+|+|.........  ......++..++++..      
T Consensus        78 ~-~~~~~~T~~~~~~l~~~~~~~~~~~~~~~~~~~~l~~~q~~dG~~~~~~~~~~~--~~~~~~~~~~t~~al~------  148 (300)
T cd00688          78 D-YPSLWLTAYALKALLLAGDYIAVDRIDLARALNWLLSLQNEDGGFREDGPGNHR--IGGDESDVRLTAYALI------  148 (300)
T ss_pred             C-CcchHhHHHHHHHHHHcCCccccCHHHHHHHHHHHHHccCCCCCeeeecCCCCc--ccCCCCcccHHHHHHH------
Confidence            2 679999999999999887654567788999999999999999999753211000  0000111222222111      


Q ss_pred             eEEEEeeeeEEEEEEEEEEeeccCCCCCceeeccccccCCCCCceEEEEEEeecCCCCccEEEEEecCCcccccCCCCch
Q psy5522        1033 VNMTATGSGFAVVYISYQYNINVTGAWPMFTLDPQVDKNSDSNHLQLSICSGFIGEGDSNMAVMEVSLPSGFTVDSDALP 1112 (1397)
Q Consensus      1033 ~~v~~~G~G~~~~q~~~~Yn~~~~~~~~~f~l~v~~~~~~~~~~~~l~vc~~y~~~~~s~Mailev~l~SGf~~~~~~l~ 1112 (1397)
                                                                                   ++.+.              
T Consensus       149 -------------------------------------------------------------aL~~~--------------  153 (300)
T cd00688         149 -------------------------------------------------------------ALALL--------------  153 (300)
T ss_pred             -------------------------------------------------------------HHHHc--------------
Confidence                                                                         22221              


Q ss_pred             hhcccCceeEEEEeCCeEEEEEEeeecCCccceeeecccccccchhhHHHHHHHHHhhcCC--------CCchHHHHHHH
Q psy5522        1113 SLQVSQNVKRVETKNGNTMVVLYFAYHQVLPWTLMHYLVSKYPRMNTINKAVDYIVKNLAG--------TEDAYAIAICS 1184 (1397)
Q Consensus      1113 ~l~~~~~i~~~e~~~~~~~v~lY~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~l~~~~~~--------~~~~y~~al~a 1184 (1397)
                                     +.          ..              ....+.+++.||.++...        ..++|..|.++
T Consensus       154 ---------------~~----------~~--------------~~~~~~~~~~~l~~~q~~~g~~~~~~~~~~~~t~~~~  194 (300)
T cd00688         154 ---------------GK----------LD--------------PDPLIEKALDYLLSCQNYDGGFGPGGESHGYGTACAA  194 (300)
T ss_pred             ---------------CC----------CC--------------CcHHHHHHHHHHHHHhcCCCCcCCCccccHHHHHHHH
Confidence                           11          00              135688999999887642        47899999999


Q ss_pred             HHHhhCCCC---chHHHHHHHhhcccccCCcccccccCCCCCCCCCCCCCCCcccchhhHHHHHHhhccCCcccchhhhh
Q psy5522        1185 YALHLAQHP---VKDVAFNLLESKAHNEDGKKWWKRAERPEDKKNPWAQVPNSVDVEMTSYALLSYLDRGLVEDTLPILT 1261 (1397)
Q Consensus      1185 Yal~la~~~---~~~~~~~~L~~~a~~~~~~~~W~~~~~~~~~~~~~~~~~~s~~vE~t~Y~lL~~~~~~~~~~a~~i~~ 1261 (1397)
                      +||++++..   ..+.+++.|.+....+++..+|....         ...+......++.|+|++.....+.....++++
T Consensus       195 ~aL~~~~~~~~~~~~~~~~~L~~~q~~~g~~~~~~~~~---------~~~~~~~~~~~~~~aL~~~~~~~~~~~~~~~~~  265 (300)
T cd00688         195 AALALLGDLDSPDAKKALRWLLSRQRPDGGWGEGRDRT---------NKLSDSCYTEWAAYALLALGKLGDLEDAEKLVK  265 (300)
T ss_pred             HHHHHcCCcchHHHHHHHHHHHHhcCCCCCcCccccCC---------CCcCchHHHHHHHHHHHHHhhhcCcccHHHHHH
Confidence            999999973   34668888888887777665543210         112233445566666666554446678899999


Q ss_pred             hhhhcccCCCCccc-------cccC
Q psy5522        1262 WLVTQQNDQGGFAS-------TQMS 1279 (1397)
Q Consensus      1262 Wl~~q~n~~Ggf~s-------tQdT 1279 (1397)
                      ||..||+..|||.+       ||+|
T Consensus       266 ~L~~~q~~~G~w~~~~~~~~~~~~t  290 (300)
T cd00688         266 WLLSQQNEDGGFSSKPGKSYDTQHT  290 (300)
T ss_pred             HHHhccCCCCCcCcCCCCCCcchhh
Confidence            99999999999998       7877


No 13 
>cd02889 SQCY Squalene cyclase (SQCY) domain; found in class II terpene cyclases that have an alpha 6 - alpha 6 barrel fold. Squalene cyclase (SQCY) and 2,3-oxidosqualene cyclase (OSQCY) are integral membrane proteins that catalyze a cationic cyclization cascade converting linear triterpenes to fused ring compounds. Bacterial SQCY catalyzes the convertion of squalene to hopene or diplopterol. Eukaryotic OSQCY transforms the 2,3-epoxide of squalene to compounds such as, lanosterol (a metabolic precursor of cholesterol and steroid hormones) in mammals and fungi or, cycloartenol in plants. Deletion of a single glycine residue of Alicyclobacillus acidocaldarius SQCY alters its substrate specificity into that of eukaryotic OSQCY. Both enzymes have a second minor domain, which forms an alpha-alpha barrel that is inserted into the major domain. This group also contains SQCY-like archael sequences and some bacterial SQCY's which lack this minor domain.
Probab=99.51  E-value=6.6e-14  Score=167.34  Aligned_cols=259  Identities=19%  Similarity=0.198  Sum_probs=163.4

Q ss_pred             cccchHHHHHHHHHHhcc--ccHHHHHHHHHHHHhccccce---------ee-ecCCCCccccCCCCCCCCcchhHHHHH
Q psy5522         899 LNFVPNIVVLEYLKKTYQ--LTDAIEAKASRYLETGYQQEL---------TY-RRPDGSFSAFGTTDPNGSTWLTAFVAK  966 (1397)
Q Consensus       899 ~~~~p~i~~l~yL~~~~~--l~~~~~~~a~~~l~~gy~~~l---------~y-~~~dGsfs~~~~~~~~~s~wlTAyvl~  966 (1397)
                      +.+....+++..|...++  ...+..+||++||.+.   ++         .| +.++|+|.+-.......+.-.||||++
T Consensus        23 ~~~~~Ta~~~~al~~~g~~~~~~~~~~ka~~~l~~~---q~~~~~~~~~~~~~~~~~Ggw~y~~~~~~~~~~~~Ta~~l~   99 (348)
T cd02889          23 SQVWDTALALQALLEAGLAPEFDPALKKALEWLLKS---QIRDNPDDWKVKYRHLRKGGWAFSTANQGYPDSDDTAEALK   99 (348)
T ss_pred             CchHHHHHHHHHHHHcCCCCccCHHHHHHHHHHHhc---CCCCCCCchhhcCCCCCCCcCcccCcCCCCCCCCChHHHHH
Confidence            444556677788877765  3456667999999873   21         12 256888764321111245569999999


Q ss_pred             HHHHccCCC----ccCHHHHHHHHHHHHhcccCCCccccchHh----H-H--HhhcCC--C-CccEEEEEEEEeccccce
Q psy5522         967 SFRQAASHT----TIDESVILEALAWLSSNQAVNGSFPEVGKV----S-H--ADMQGG--A-AKGLALTAYTLLPKKTRA 1032 (1397)
Q Consensus       967 ~~~~a~~~i----~vd~~~~~~~~~WL~~~q~~~G~F~stqdv----l-~--al~~~~--~-~~~v~lt~~v~lp~~~~~ 1032 (1397)
                      +|..++.+.    .++.+.+.++++||.++|+++|+|.+....    + .  ..+.+.  . ...+.+|+++.+      
T Consensus       100 al~~~~~~~~~~~~~~~~~i~~a~~~L~~~Q~~dG~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Ta~~l~------  173 (348)
T cd02889         100 ALLRLQKKPPDGKKVSRERLYDAVDWLLSMQNSNGGFAAFEPDNTYKYLELIPEVDGDIMIDPPYVECTGSVLE------  173 (348)
T ss_pred             HHHHhhccCcccchhhHHHHHHHHHHHHHhccCCCCEeeecCCccHHHHhcCchhhcCCccCCCCcchHHHHHH------
Confidence            999988774    678899999999999999999999753210    1 0  000000  0 011222222211      


Q ss_pred             eEEEEeeeeEEEEEEEEEEeeccCCCCCceeeccccccCCCCCceEEEEEEeecCCCCccEEEEEecCCcccccCCCCch
Q psy5522        1033 VNMTATGSGFAVVYISYQYNINVTGAWPMFTLDPQVDKNSDSNHLQLSICSGFIGEGDSNMAVMEVSLPSGFTVDSDALP 1112 (1397)
Q Consensus      1033 ~~v~~~G~G~~~~q~~~~Yn~~~~~~~~~f~l~v~~~~~~~~~~~~l~vc~~y~~~~~s~Mailev~l~SGf~~~~~~l~ 1112 (1397)
                                                                                   +|++.              
T Consensus       174 -------------------------------------------------------------aL~~~--------------  178 (348)
T cd02889         174 -------------------------------------------------------------ALGLF--------------  178 (348)
T ss_pred             -------------------------------------------------------------HHHHh--------------
Confidence                                                                         33332              


Q ss_pred             hhcccCceeEEEEeCCeEEEEEEeeecCCccceeeecccccccchhhHHHHHHHHHhhcCC---------CCchHHHHHH
Q psy5522        1113 SLQVSQNVKRVETKNGNTMVVLYFAYHQVLPWTLMHYLVSKYPRMNTINKAVDYIVKNLAG---------TEDAYAIAIC 1183 (1397)
Q Consensus      1113 ~l~~~~~i~~~e~~~~~~~v~lY~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~l~~~~~~---------~~~~y~~al~ 1183 (1397)
                                     +..         ..       ..|+  ...+.+.+|+.||.+....         ....|..|++
T Consensus       179 ---------------~~~---------~~-------~~~~--~~~~~i~~a~~~L~~~q~~dG~w~~~~~~~~~y~ta~a  225 (348)
T cd02889         179 ---------------GKL---------YP-------EHRR--EIDPAIRRAVKYLEREQEPDGSWYGRWGVCFIYGTWFA  225 (348)
T ss_pred             ---------------hhc---------CC-------chHH--HHHHHHHHHHHHHHHhCCCCCCccccCCCcchHHHHHH
Confidence                           110         00       0000  0236789999999876441         2468999999


Q ss_pred             HHHHhhCCCCc----hHHHHHHHhhcccccCCcccccccCCCCCCCCCCCCCCCcccchhhHHHHHHhhccC--Ccccch
Q psy5522        1184 SYALHLAQHPV----KDVAFNLLESKAHNEDGKKWWKRAERPEDKKNPWAQVPNSVDVEMTSYALLSYLDRG--LVEDTL 1257 (1397)
Q Consensus      1184 aYal~la~~~~----~~~~~~~L~~~a~~~~~~~~W~~~~~~~~~~~~~~~~~~s~~vE~t~Y~lL~~~~~~--~~~~a~ 1257 (1397)
                      .+||.++|...    .+.+++-|.+....+++   |.......  ....+..+...++++|||||++++..+  +...+.
T Consensus       226 ~~aL~~~g~~~~~~~~~~~~~~L~~~Q~~dG~---w~~~~~~~--~~~~~~~~~~~~~~~Ta~al~aL~~~g~~~~~~v~  300 (348)
T cd02889         226 LEALAAAGEDENSPYVRKACDWLLSKQNPDGG---WGESYESY--EDPSYAGGGRSTVVQTAWALLALMAAGEPDSEAVK  300 (348)
T ss_pred             HHHHHHcCCCcCcHHHHHHHHHHHHccCCCCC---cCCccccc--ccccccCCCCCcHHHHHHHHHHHHhcCCCCHHHHH
Confidence            99999999753    23466666666555554   54321100  000112344568999999999999876  345678


Q ss_pred             hhhhhhhhcccCCCCccccccC
Q psy5522        1258 PILTWLVTQQNDQGGFASTQMS 1279 (1397)
Q Consensus      1258 ~i~~Wl~~q~n~~Ggf~stQdT 1279 (1397)
                      ++++||.+|||+.|||.|+|++
T Consensus       301 ~a~~wL~~~Q~~dG~f~~~~~~  322 (348)
T cd02889         301 RGVKYLLNTQQEDGDWPQEEIT  322 (348)
T ss_pred             HHHHHHHHhcCCCCCcCCceee
Confidence            8899999999999999999987


No 14 
>PF10569 Thiol-ester_cl:  Alpha-macro-globulin thiol-ester bond-forming region;  InterPro: IPR019565 This entry contains serum complement C3 and C4 precursors and alpha-macrogrobulins.  The alpha-macroglobulin (aM) family of proteins includes protease inhibitors [], typified by the human tetrameric a2-macroglobulin (a2M); they belong to the MEROPS proteinase inhibitor family I39, clan IL. These protease inhibitors share several defining properties, which include (i) the ability to inhibit proteases from all catalytic classes, (ii) the presence of a 'bait region' and a thiol ester, (iii) a similar protease inhibitory mechanism and (iv) the inactivation of the inhibitory capacity by reaction of the thiol ester with small primary amines. aM protease inhibitors inhibit by steric hindrance []. The mechanism involves protease cleavage of the bait region, a segment of the aM that is particularly susceptible to proteolytic cleavage, which initiates a conformational change such that the aM collapses about the protease. In the resulting aM-protease complex, the active site of the protease is sterically shielded, thus substantially decreasing access to protein substrates. Two additional events occur as a consequence of bait region cleavage, namely (i) the h-cysteinyl-g-glutamyl thiol ester becomes highly reactive and (ii) a major conformational change exposes a conserved COOH-terminal receptor binding domain [] (RBD). RBD exposure allows the aM protease complex to bind to clearance receptors and be removed from circulation []. Tetrameric, dimeric, and, more recently, monomeric aM protease inhibitors have been identified [, ].  This short highly conserved region of proteinase-binding alpha-macro-globulins contains the cysteine and a glutamine of a thiol-ester bond that is cleaved at the moment of proteinase binding, and mediates the covalent binding of the alpha-macro-globulin to the proteinase. The GCGEQ motif is highly conserved. ; PDB: 2B39_B 2PN5_A 4ACQ_C 1HZF_A 2WIN_H 2I07_B 2ICF_B 2XWJ_D 3G6J_B 2NOJ_C ....
Probab=99.36  E-value=2.4e-13  Score=98.60  Aligned_cols=31  Identities=55%  Similarity=1.112  Sum_probs=27.7

Q ss_pred             CcchhhccCCCchhhhhcccchHHHHHHHHH
Q psy5522         882 NLANLIKMPFGCGEQNMLNFVPNIVVLEYLK  912 (1397)
Q Consensus       882 ~l~~Ll~~P~GCgEQ~m~~~~p~i~~l~yL~  912 (1397)
                      ||++|+++|+|||||||++|+|++++++||+
T Consensus         1 ~l~~Li~~P~GCgEQtm~~~~P~v~~~~YL~   31 (31)
T PF10569_consen    1 NLDSLIRYPYGCGEQTMSSFAPNVYALRYLD   31 (31)
T ss_dssp             TGGGGSSS--SSTTHHHHHHHHHHHHHHHHH
T ss_pred             CHHHHHhcCCCcHHHHHHHHhHHHHHHHHhC
Confidence            6899999999999999999999999999995


No 15 
>COG2373 Large extracellular alpha-helical protein [General function prediction only]
Probab=99.29  E-value=1e-07  Score=129.45  Aligned_cols=282  Identities=15%  Similarity=0.128  Sum_probs=187.6

Q ss_pred             ecccEEEEEcCCCcccCCceeEEEEEEEc-CCCccCCCcceEEEEECCCCCeEEeeeeeccCCc-------eeeeeeecC
Q psy5522          95 HKSYSVFIQLDKAIYKPGDLVRFRAIVLN-SHLKPSVTGALEIFITDGKGNRIKQWTRALTTRG-------VFSADLQLS  166 (1397)
Q Consensus        95 ~~~~~vfIqTDKpIYkPGqtV~fRvv~ld-~~l~P~~~~~~~v~I~DP~g~~I~qw~~~~~~~G-------i~~~sf~L~  166 (1397)
                      .....+|+.||.+++++..+.++.+++.. .+.+|+  ..+.+.+.|-+|..+.+-  .++..|       .....|.+.
T Consensus       285 ~~a~~~~~VSD~g~~~~~~~~~l~v~v~sl~sg~P~--~g~~v~~~~~ngevl~~~--~tds~G~~~~~~~~~~~~~l~~  360 (1621)
T COG2373         285 ARATRLFFVSDIGIVAKAYHDGLDVFVQSLSSGKPV--PGVEVSVLAGNGEVLVSG--TTDSQGHAKFSNAKKPAALLLA  360 (1621)
T ss_pred             ccceEEEEEeCceeEeecccccEEEEEEeccCCCCC--CCcEEEEEecCCcEEeee--EEccccceecccccCCceEEEE
Confidence            56777999999999999999999998875 456775  345677889999777654  455566       444456665


Q ss_pred             CCCCCeeEEEEEEEcCeEEEEEEEEeeecCCc--EEEEEeCCCceEeeCCeEEEEEEEEEecCc-cccc-eEEEEEEecc
Q psy5522         167 KSPVLGDWNITINVLDQKFTKRFTVAEYVLPK--FEVNVNVPPHATFKDSKVVISVNAKYTYGK-PVKG-EATITAYPTI  242 (1397)
Q Consensus       167 ~~~~lG~y~I~v~~~~~~~~~~F~VeEYvLPk--FeV~v~~p~~i~~~~~~~~v~v~AkYtyGk-PV~G-~v~v~v~~~~  242 (1397)
                      ...- +.|...-..++......|.|+.+..|-  +.+-+-+++-++..||.+.+.+-+|+..|+ ++.+ .+++++... 
T Consensus       361 ~~~~-~~~~~~d~~~~~~d~s~f~V~g~~~~~~~~k~y~ftDRglYRpGE~v~~~~~~R~~~~~~a~~~~p~~l~v~~P-  438 (1621)
T COG2373         361 RKED-GDFLGLDLTGGVFDLSDFDVEGRAAPGYGLKVYLFTDRGLYRPGETVHVNALLRDFDGKTALDNQPLKLRVLDP-  438 (1621)
T ss_pred             ecCC-CceEEEEcCCCccccccccccceecCCCceEEEEecCcccCCCCceeeeeeeehhhcccccccCCCeEEEEECC-
Confidence            5432 334433333334456789999998887  888889999999999999999999999999 5655 366666532 


Q ss_pred             ccCccccccccceeEEeecCceEEEEEechhhhccccccceEEEEEEEEEECCcceEEeeEEEE-EEEEeeeEEEEEecC
Q psy5522         243 FSGVIQPLFQTPVRKVVPIDGKTVIEFDVVKELQLTDEYERNIHFDVAVEEALTGRRQNNTGSV-VFHKHKYKMDLIKSS  321 (1397)
Q Consensus       243 ~~g~~~~~~~~~~~~~~d~~G~~~~~~~~~~~~~~~~~~~~~l~V~a~VtE~gTG~~~~~~~~i-~i~~~~~~i~f~~~~  321 (1397)
                       .|.    .......+++++|-+++.+++.++     .....+++++.+.+..  ...+.+..| .+.+..++|......
T Consensus       439 -dG~----~~~~~~~~~~~~G~~~~~~~l~~n-----a~tG~w~l~~~~~~~~--~~~s~~f~V~df~p~r~~i~l~~~k  506 (1621)
T COG2373         439 -DGS----VLRTLTITLDEEGLYELSFPLPEN-----ALTGGYTLELYTGGKS--AVISMSFRVEDFIPDRFKINLTLDK  506 (1621)
T ss_pred             -CCc----EEEEEEEeccccCceEEeeeCCCC-----CCcceEEEEEEeCCcc--ceeeeeEEhhHhCCceEEEeccccc
Confidence             121    112345678899999999998661     1223556666654322  333333333 334555666655555


Q ss_pred             CcccCCCeEEEEEEEEccCCCccCCCCce--EEEEEee----ecc---------cc-----ccceeeEeeCCCeeEEEEE
Q psy5522         322 EYFKPGLKYTAYMKLTHHDGTPVTDNNNM--VQVRHGF----SYD---------ES-----KYEANQYKLDRNGMIKLVY  381 (1397)
Q Consensus       322 ~~~kpG~p~~~~v~v~~~dG~P~~~~~~~--V~v~~~~----~~~---------~~-----~~~~~~~ttd~~G~~~f~i  381 (1397)
                      ..+.+|.++.++|.+.+..|.|+.+...+  +.++...    .|.         ..     .......++|.+|.+.|.+
T Consensus       507 ~~~~~g~~v~~~v~~~yL~GaPa~g~~~~~~l~lr~~~~~~~~~~~~~fg~~~~~~~~~~~~~~~~~~~~D~~G~a~~~l  586 (1621)
T COG2373         507 TEWVPGKDVKIKVDLRYLYGAPAAGLTVQGELDLRPTRFSVPGFPGFTFGLADEEPNSLTFEEELELTVTDGKGKASLDL  586 (1621)
T ss_pred             ccccCCCcEEEEEEEEecCCCcccCceeeeEEEeecccccccCCcceeeeccccccccccccccccccccCCCCcEeEec
Confidence            66899999999999999999999964322  2222110    111         00     0111236789999999999


Q ss_pred             eCCCCCceeEEEE
Q psy5522         382 YPPANENVTTLGI  394 (1397)
Q Consensus       382 ~~~~~~~~~~l~l  394 (1397)
                      +..+....+.+.+
T Consensus       587 ~~~~~~s~~~~~~  599 (1621)
T COG2373         587 DLAETPSPYQLLV  599 (1621)
T ss_pred             ccccCCCceEEEe
Confidence            9875543333333


No 16 
>cd02892 SQCY_1 Squalene cyclase (SQCY) domain subgroup 1; found in class II terpene cyclases that have an alpha 6 - alpha 6 barrel fold. Squalene cyclase (SQCY)  and 2,3-oxidosqualene cyclase (OSQCY) are integral membrane proteins that catalyze a cationic cyclization cascade converting linear triterpenes to fused ring compounds. This group contains bacterial SQCY which catalyzes the convertion of squalene to hopene or diplopterol and eukaryotic OSQCY which transforms the 2,3-epoxide of squalene to compounds such as, lanosterol in mammals and fungi or, cycloartenol in plants. Deletion of a single glycine residue of Alicyclobacillus acidocaldarius SQCY alters its substrate specificity into that of eukaryotic OSQCY. Both enzymes have a second minor domain, which forms an alpha-alpha barrel that is inserted into the major domain.
Probab=99.08  E-value=7.5e-10  Score=141.68  Aligned_cols=118  Identities=20%  Similarity=0.247  Sum_probs=85.9

Q ss_pred             hhhHHHHHHHHHhhcCC---------CCchHHHHHHHHHHhhCCCC-c----hHHHHHHHhhcccccCCcccccccCCCC
Q psy5522        1157 MNTINKAVDYIVKNLAG---------TEDAYAIAICSYALHLAQHP-V----KDVAFNLLESKAHNEDGKKWWKRAERPE 1222 (1397)
Q Consensus      1157 ~~~~~~a~~~l~~~~~~---------~~~~y~~al~aYal~la~~~-~----~~~~~~~L~~~a~~~~~~~~W~~~~~~~ 1222 (1397)
                      ...+++|+.||++.-..         ....|..+.+.+||..+|.. .    .+.+.+-|.+....+||   |......-
T Consensus       475 ~~~i~rAv~~L~~~Q~~DGsW~g~wg~~~~Y~T~~al~AL~~~G~~~~~~~~i~~a~~~L~s~Q~~DGg---Wge~~~s~  551 (634)
T cd02892         475 DPAIRRAVKYLLREQEPDGSWYGRWGVCYIYGTWFALEALAAAGEDYENSPYIRKACDFLLSKQNPDGG---WGESYLSY  551 (634)
T ss_pred             HHHHHHHHHHHHHccCCCCCccccCCCccHHHHHHHHHHHHHhCCcccCcHHHHHHHHHHHhcCCCCCC---CCCccccc
Confidence            46899999999986431         35578999999999998875 2    35678888887665555   54321100


Q ss_pred             CCCCCCCCCCCcccchhhHHHHHHhhccCCc--ccchhhhhhhhhcccCCCCccccccC
Q psy5522        1223 DKKNPWAQVPNSVDVEMTSYALLSYLDRGLV--EDTLPILTWLVTQQNDQGGFASTQMS 1279 (1397)
Q Consensus      1223 ~~~~~~~~~~~s~~vE~t~Y~lL~~~~~~~~--~~a~~i~~Wl~~q~n~~Ggf~stQdT 1279 (1397)
                      .  ...+..+...+++.||||||+++..++.  ....+.++||.++|++.|||.++|+|
T Consensus       552 ~--~~~~~~~~~s~~~~TA~AllaLl~~g~~~~~~i~r~i~wL~~~Q~~~G~w~~~~~~  608 (634)
T cd02892         552 E--DKSYAGGGRSTVVQTAWALLALMAAGEPDSEAVERGIKYLLNTQLPDGDWPQEEIT  608 (634)
T ss_pred             c--CcccCCCCCCcHHHHHHHHHHHHHcCCCChHHHHHHHHHHHHcCCCCCCCCCcccc
Confidence            0  0112223456789999999999987755  34667799999999999999999998


No 17 
>cd02896 complement_C3_C4_C5 Proteins similar to C3, C4 and C5 of vertebrate complement.  The vertebrate complement system, comprised of a large number of distinct plasma proteins, is an effector of both the acquired and innate immune systems.  The point of convergence of the classical, alternative and lectin pathways of the complement system is the proteolytic activation of C3. C4 plays a key role in propagating the classical and lectin pathways. C5 participates in the classical and alternative pathways. The thioester bond located within the structure of C3 and C4 is central to the function of complement. C5 does not contain an active thioester bond.
Probab=98.62  E-value=7.2e-08  Score=112.34  Aligned_cols=71  Identities=14%  Similarity=0.123  Sum_probs=55.2

Q ss_pred             ceeeecCCCCcccc---------CCCCCCCCcchhHHHHHHHHHccCCCccCHHHHHHHHHHHHhcccCCCccccchHh-
Q psy5522         936 ELTYRRPDGSFSAF---------GTTDPNGSTWLTAFVAKSFRQAASHTTIDESVILEALAWLSSNQAVNGSFPEVGKV- 1005 (1397)
Q Consensus       936 ~l~y~~~dGsfs~~---------~~~~~~~s~wlTAyvl~~~~~a~~~i~vd~~~~~~~~~WL~~~q~~~G~F~stqdv- 1005 (1397)
                      ++..++.||++..|         +....+.+++.|||||++++..+.     ...+.++++||.+||+++|+|+||||. 
T Consensus       214 L~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~vE~TAYALLall~~~~-----~~~a~~iv~WL~~qr~~~Ggf~sTQdTv  288 (297)
T cd02896         214 LLSLAKRDGNGWYWWTIDSPYWPVPGPSAITVETTAYALLALLKLGD-----IEYANPIARWLTEQRNYGGGFGSTQDTV  288 (297)
T ss_pred             HHHHhhhCCCcceeccCcCccCCCCCCchhhhHHHHHHHHHHHhcCC-----chhHHHHHHHHHhcCCCCCCeehHHHHH
Confidence            44456678876543         222123479999999999998874     357889999999999999999999983 


Q ss_pred             --HHHhhc
Q psy5522        1006 --SHADMQ 1011 (1397)
Q Consensus      1006 --l~al~~ 1011 (1397)
                        ||||++
T Consensus       289 val~AL~~  296 (297)
T cd02896         289 VALQALAE  296 (297)
T ss_pred             HHHHHHhc
Confidence              899986


No 18 
>TIGR01507 hopene_cyclase squalene-hopene cyclase. SHC is an essential prokaryotic gene in hopanoid (triterpenoid) biosynthesis. Squalene hopene cyclase, an integral membrane protein, directly cyclizes squalene into hopanoid products.
Probab=98.55  E-value=9.7e-07  Score=112.20  Aligned_cols=150  Identities=23%  Similarity=0.332  Sum_probs=93.9

Q ss_pred             chhhHHHHHHHHHhhcC---------CCCchHHHHHHHHHHhhCCCCc----hHHHHHHHhhcccccCCcccccccCCCC
Q psy5522        1156 RMNTINKAVDYIVKNLA---------GTEDAYAIAICSYALHLAQHPV----KDVAFNLLESKAHNEDGKKWWKRAERPE 1222 (1397)
Q Consensus      1156 ~~~~~~~a~~~l~~~~~---------~~~~~y~~al~aYal~la~~~~----~~~~~~~L~~~a~~~~~~~~W~~~~~~~ 1222 (1397)
                      .++.|++|+.||++.-+         +....|..+.+.-||+.+|-..    .+.+++-|.+.- +.||  -|......-
T Consensus       469 ~~~~i~rav~~L~~~Q~~dG~W~g~wg~~~~Y~T~~al~aL~~~g~~~~~~~i~rAv~wL~~~Q-~~DG--GWge~~~sy  545 (635)
T TIGR01507       469 AWPVIERAVEYLKREQEPDGSWFGRWGVNYLYGTGAVLSALKAVGIDTREPYIQKALAWLESHQ-NPDG--GWGEDCRSY  545 (635)
T ss_pred             hhHHHHHHHHHHHHccCCCCCCccCCCCccccHHHHHHHHHHHcCCCcccHHHHHHHHHHHHhc-CCCC--CCCCCCccc
Confidence            35689999999988532         2345777777777777776432    345666677653 3343  243221000


Q ss_pred             CCCCCCCCCCCcccchhhHHHHHHhhccCC--cccchhhhhhhhhcccCCCCccccccCCCccccceeccccccccccCc
Q psy5522        1223 DKKNPWAQVPNSVDVEMTSYALLSYLDRGL--VEDTLPILTWLVTQQNDQGGFASTQMSKKELCPTVSAFRTHKVAQQKP 1300 (1397)
Q Consensus      1223 ~~~~~~~~~~~s~~vE~t~Y~lL~~~~~~~--~~~a~~i~~Wl~~q~n~~Ggf~stQdT~~~~c~~~~~~~~~~v~~~~p 1300 (1397)
                      .  .+.+.....-++..||+||+++...++  -....+.++||.++||+.||+....-|.  +|            .-+-
T Consensus       546 ~--~~~~~g~g~s~~s~TA~AL~AL~~ag~~~~~~I~rav~~L~~~Q~~dG~W~e~~~~g--~g------------fp~~  609 (635)
T TIGR01507       546 E--DPAYAGKGASTASQTAWALIALIAAGRAESEAARRGVQYLVETQRPDGGWDEPYYTG--TG------------FPGD  609 (635)
T ss_pred             c--cccccCCCCCcHHHHHHHHHHHHHhCCCCcHHHHHHHHHHHHhcCCCCCCCCccccc--Cc------------ccce
Confidence            0  001111112378899999999997663  3456779999999999999997666652  11            1233


Q ss_pred             eeEEEeeccccchhheeeeccccccccccccCCCCCCccchhHHHHHHHhhhh
Q psy5522        1301 VPVSIYDYYDQSRRARMFYEPRVATLCDICEGEDCSKDTVIGLYALAQLAEKV 1353 (1397)
Q Consensus      1301 ~~v~~ydyy~~~~~~~~~y~~~~~~~c~~c~~~~c~~~~~~~~~~~~~~~~~~ 1353 (1397)
                      -+++ |.||.                            .+--|-||+.|....
T Consensus       610 ~yi~-Y~~Y~----------------------------~~fpl~ALg~y~~~~  633 (635)
T TIGR01507       610 FYLG-YHMYR----------------------------HVFPLLALARYKQAI  633 (635)
T ss_pred             eeec-ccchh----------------------------hHhHHHHHHHHHHhh
Confidence            4566 77774                            157788999998754


No 19 
>PF07678 A2M_comp:  A-macroglobulin complement component;  InterPro: IPR011626 This domain covers the complement component region of the alpha-2-macroglobulin family. The alpha-macroglobulin (aM) family of proteins includes protease inhibitors [], typified by the human tetrameric a2-macroglobulin (a2M); they belong to the MEROPS proteinase inhibitor family I39, clan IL. These protease inhibitors share several defining properties, which include (i) the ability to inhibit proteases from all catalytic classes, (ii) the presence of a 'bait region' and a thiol ester, (iii) a similar protease inhibitory mechanism and (iv) the inactivation of the inhibitory capacity by reaction of the thiol ester with small primary amines. aM protease inhibitors inhibit by steric hindrance []. The mechanism involves protease cleavage of the bait region, a segment of the aM that is particularly susceptible to proteolytic cleavage, which initiates a conformational change such that the aM collapses about the protease. In the resulting aM-protease complex, the active site of the protease is sterically shielded, thus substantially decreasing access to protein substrates. Two additional events occur as a consequence of bait region cleavage, namely (i) the h-cysteinyl-g-glutamyl thiol ester becomes highly reactive and (ii) a major conformational change exposes a conserved COOH-terminal receptor binding domain [] (RBD). RBD exposure allows the aM protease complex to bind to clearance receptors and be removed from circulation []. Tetrameric, dimeric, and, more recently, monomeric aM protease inhibitors have been identified [, ].; GO: 0005615 extracellular space; PDB: 1QSJ_D 1QQF_A 4ACQ_C 2B39_B 2WIN_H 2I07_B 2ICF_B 2XWJ_D 3G6J_B 2NOJ_C ....
Probab=98.32  E-value=5e-07  Score=102.35  Aligned_cols=51  Identities=18%  Similarity=0.180  Sum_probs=45.6

Q ss_pred             CCcchhHHHHHHHHHccCCCccCHHHHHHHHHHHHhcccCCCccccchHh---HHHhhc
Q psy5522         956 GSTWLTAFVAKSFRQAASHTTIDESVILEALAWLSSNQAVNGSFPEVGKV---SHADMQ 1011 (1397)
Q Consensus       956 ~s~wlTAyvl~~~~~a~~~i~vd~~~~~~~~~WL~~~q~~~G~F~stqdv---l~al~~ 1011 (1397)
                      .++++|||+|++++..+     |.....++++||.+||+.+|||.||||.   |+||++
T Consensus       190 ~~vEtTaYaLLa~l~~~-----~~~~~~~iv~WL~~qr~~~Ggf~STQdTvvaL~AL~~  243 (246)
T PF07678_consen  190 LDVETTAYALLALLKRG-----DLEEASPIVRWLISQRNSGGGFGSTQDTVVALQALAE  243 (246)
T ss_dssp             HHHHHHHHHHHHHHHHT-----CHHHHHHHHHHHHHCTTTTSSTSSHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHhcc-----cHHHHHHHHHHHHHhcCCCCccCcHHHHHHHHHHHHH
Confidence            35899999999999984     5688999999999999999999999983   899986


No 20 
>cd02897 A2M_2 Proteins similar to alpha2-macroglobulin (alpha (2)-M). This group also contains the pregnancy zone protein (PZP).  Alpha(2)-M and PZP are broadly specific proteinase inhibitors. Alpha (2)-M is a major carrier protein in serum. The structural thioester of alpha (2)-M, is involved in the immobilization and entrapment of proteases.  PZP is a trace protein in the plasma of non-pregnant females and males which is elevated in pregnancy. Alpha (2)-M and PZ bind to placental protein-14 and may modulate its activity in T-cell growth and cytokine production contributing to fetal survival. It has been suggested that thioester bond cleavage promotes the binding of PZ and alpha (2)-M to the CD91 receptor clearing them from circulation.
Probab=97.95  E-value=2.8e-05  Score=90.68  Aligned_cols=54  Identities=19%  Similarity=0.265  Sum_probs=46.4

Q ss_pred             CCCcchhHHHHHHHHHccCCCccCHHHHHHHHHHHHhcccCCCccccchHh---HHHhhc
Q psy5522         955 NGSTWLTAFVAKSFRQAASHTTIDESVILEALAWLSSNQAVNGSFPEVGKV---SHADMQ 1011 (1397)
Q Consensus       955 ~~s~wlTAyvl~~~~~a~~~i~vd~~~~~~~~~WL~~~q~~~G~F~stqdv---l~al~~ 1011 (1397)
                      .++++.|||+|++++.++.   -+...+.++++||.++|+++|+|.||||.   |+||++
T Consensus       235 ~~~ve~TAyaLlall~~~~---~~~~~~~~~v~WL~~~q~~~Ggf~sTQdt~~al~AL~~  291 (292)
T cd02897         235 SAEVEMTAYALLALLSAGG---EDLAEALPIVKWLAKQRNSLGGFSSTQDTVVALQALAK  291 (292)
T ss_pred             cchHHHHHHHHHHHHHcCC---ccHhHHHHHHHHHHHcCCCCCCcccHHHHHHHHHHHHc
Confidence            4689999999999998874   23467888999999999999999999983   889875


No 21 
>TIGR03463 osq_cycl 2,3-oxidosqualene cyclase. This model identifies 2,3-oxidosqualene cyclases from Stigmatella aurantiaca which produces cycloartenol, and Gemmata obscuriglobus and Methylococcus capsulatus which each produce the closely related sterol, lanosterol.
Probab=97.93  E-value=4.9e-05  Score=96.97  Aligned_cols=114  Identities=20%  Similarity=0.253  Sum_probs=73.4

Q ss_pred             hhhHHHHHHHHHhhcC---------CCCchHHHHHHHHHHhhCCCCc----hHHHHHHHhhcccccCCcccccccCCCCC
Q psy5522        1157 MNTINKAVDYIVKNLA---------GTEDAYAIAICSYALHLAQHPV----KDVAFNLLESKAHNEDGKKWWKRAERPED 1223 (1397)
Q Consensus      1157 ~~~~~~a~~~l~~~~~---------~~~~~y~~al~aYal~la~~~~----~~~~~~~L~~~a~~~~~~~~W~~~~~~~~ 1223 (1397)
                      ..++.+|+.||.+.-.         +....|..+.+.-||..+|...    .+.+.+-|.+.-. .||.  |...-..-.
T Consensus       476 ~~ai~rav~~L~~~Q~~dGsW~g~Wg~~~~Y~T~~al~aL~~~G~~~~~~~i~rA~~~Ll~~Q~-~DGg--Wg~~~~s~~  552 (634)
T TIGR03463       476 TRAISRGVRFLRSRQREDGSFPGSWGVCFTYGTFHGVMGLRAAGASPDDMALQRAAAWLRSYQR-ADGG--WGEVYESCL  552 (634)
T ss_pred             HHHHHHHHHHHHHhcCCCCCccccCCCCCcHHHHHHHHHHHHcCCCcCcHHHHHHHHHHHHccC-CCCC--ccCccCccc
Confidence            4679999999988532         2345788888888888877642    3557777776533 3442  432110000


Q ss_pred             CCCCCCCCCCcccchhhHHHHHHhhccCC--cccchhhhhhhhhcccCCCCccc
Q psy5522        1224 KKNPWAQVPNSVDVEMTSYALLSYLDRGL--VEDTLPILTWLVTQQNDQGGFAS 1275 (1397)
Q Consensus      1224 ~~~~~~~~~~s~~vE~t~Y~lL~~~~~~~--~~~a~~i~~Wl~~q~n~~Ggf~s 1275 (1397)
                        ...|-....-++..||+||++++..+.  -....+-++||.++|++.|||..
T Consensus       553 --~~~y~~~~~S~~~~TA~Al~aL~~~g~~~~~~i~rgi~~L~~~Q~~dG~W~~  604 (634)
T TIGR03463       553 --QARYVEGKQSQAVMTSWALLALAEAGEGGHDAVQRGVAWLRSRQQEDGRWPR  604 (634)
T ss_pred             --cccccCCCCCcHHHHHHHHHHHHHcCCcCCHHHHHHHHHHHHhCCCCCCCCC
Confidence              011111122457899999999998663  34567789999999999999953


No 22 
>TIGR01787 squalene_cyclas squalene/oxidosqualene cyclases. This family of enzymes catalyzes the cyclization of the triterpenes squalene or 2-3-oxidosqualene to a variety of products including hopene, lanosterol, cycloartenol, amyrin, lupeol and isomultiflorenol.
Probab=97.87  E-value=9.8e-05  Score=94.16  Aligned_cols=119  Identities=18%  Similarity=0.281  Sum_probs=77.7

Q ss_pred             chhhHHHHHHHHHhhcC--C-------CCchHHHHHHHHHHhhCCCC-----chHHHHHHHhhcccccCCcccccccCCC
Q psy5522        1156 RMNTINKAVDYIVKNLA--G-------TEDAYAIAICSYALHLAQHP-----VKDVAFNLLESKAHNEDGKKWWKRAERP 1221 (1397)
Q Consensus      1156 ~~~~~~~a~~~l~~~~~--~-------~~~~y~~al~aYal~la~~~-----~~~~~~~~L~~~a~~~~~~~~W~~~~~~ 1221 (1397)
                      ....+++|+.||.+.-.  +       ....|..+.+.-||+.+|..     ..+.+++-|.+.-..+||   |...-..
T Consensus       458 ~~~~i~rAl~~L~~~Q~~DGsw~g~wg~~y~YgT~~al~aL~~~G~~~~~~~~i~rA~~~L~~~Q~~DGG---Wge~~~s  534 (621)
T TIGR01787       458 IRNVLERALEYLRREQRADGSWFGRWGVNYTYGTGFVLSALAAAGRTYRNCPEVQKACDWLLSRQMPDGG---WGEDCFS  534 (621)
T ss_pred             HHHHHHHHHHHHHHhcCCCCCCcccCCCCCchhHHHHHHHHHHhCCcccCCHHHHHHHHHHHhhcCCCCC---CCcCCcc
Confidence            34689999999987532  2       23467888888888888753     235577777766444444   3211000


Q ss_pred             CCCCCCCCCCCCcccchhhHHHHHHhhccCC--cccchhhhhhhhhcccCCCCccccccC
Q psy5522        1222 EDKKNPWAQVPNSVDVEMTSYALLSYLDRGL--VEDTLPILTWLVTQQNDQGGFASTQMS 1279 (1397)
Q Consensus      1222 ~~~~~~~~~~~~s~~vE~t~Y~lL~~~~~~~--~~~a~~i~~Wl~~q~n~~Ggf~stQdT 1279 (1397)
                        .....|.....-++..|||||++++..++  .....+.++||.+.|++.|||.....|
T Consensus       535 --~~~~~y~~~~~S~~s~Ta~AL~AL~~ag~~~~~ai~rgv~~L~~~Q~~dG~w~~~~~~  592 (621)
T TIGR01787       535 --YEDPSYVGSGGSTPSQTGWALMALIAAGEADSEAIERGVKYLLETQRPDGDWPQEYIT  592 (621)
T ss_pred             --ccccccCCCCCCCHHHHHHHHHHHHHcCccchHHHHHHHHHHHHhCCCCCCCCCcccc
Confidence              00011111122478899999999997663  346778999999999999999776655


No 23 
>PF05326 SVA:  Seminal vesicle autoantigen (SVA);  InterPro: IPR007990 This family consists of seminal vesicle autoantigen and prolactin-inducible (PIP) proteins. Seminal vesicle autoantigen (SVA) is specifically present in the seminal plasma of mice. This 19 kDa secretory glycoprotein suppresses the motility of spermatozoa by interacting with phospholipid. PIP has several known functions. In saliva, this protein plays a role in host defence by binding to microorganisms such as Streptococcus. PIP is an aspartyl proteinase and it acts as a factor capable of suppressing T-cell apoptosis through its interaction with CD4 [].; GO: 0005576 extracellular region; PDB: 3ES6_B.
Probab=97.79  E-value=0.00017  Score=70.24  Aligned_cols=84  Identities=17%  Similarity=0.150  Sum_probs=50.0

Q ss_pred             EeeeEEEEecCCceeeeCCEEEEEEEEEcCCCccEEEEEEEeecCCeeeeecCCCccCCCCCCcceeeeEEEEEeCCCee
Q psy5522         711 VFRPFFISLDLPYSVMRGEVVAIPVVVFNYLSQDLVADVTLENVGQFDFADFSNEVDAAPQPKFEVFRRKKLTIKANSGS  790 (1397)
Q Consensus       711 vfkpffv~l~LPysv~rGE~~~l~~~VfNyl~~~~~v~v~L~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~a~~~~  790 (1397)
                      ..||+-+++.+|...++||+++++++|-|++.+|+.|++.|..+..++.               .....++-|+|.+..+
T Consensus        30 ~~k~l~l~l~~~~~~~~~eev~v~l~V~t~~~eCmvVk~yl~sn~~i~~---------------~fny~YTaCLC~d~~r   94 (124)
T PF05326_consen   30 SRKPLSLNLQVPQTAKANEEVTVTLTVTTELRECMVVKIYLESNPPIDG---------------SFNYKYTACLCDDYPR   94 (124)
T ss_dssp             -----EE-----SEE-SS--EEEEEEEEE--SS-EEEEEEEEESS---S---------------GGG-EEEEEE-SSS-E
T ss_pred             cCccEEEEeecCCCCCCCCEEEEEEEEEcchheeEEEEEEeccCCCccc---------------cccceEEEEeCCCCCc
Confidence            8999999999999999999999999999999999999999998876642               2244789999999999


Q ss_pred             EEEEEEEeceeeeEEEEEE
Q psy5522         791 TTTFVITPKELGYIGIKVT  809 (1397)
Q Consensus       791 tv~f~v~p~~lG~v~itV~  809 (1397)
                      ++.|.|...+-+.+...+.
T Consensus        95 ~FyWDi~~~~t~~i~~~ad  113 (124)
T PF05326_consen   95 TFYWDIQVNRTVTIAAVAD  113 (124)
T ss_dssp             EEEEEE--SS-EEEEEEEE
T ss_pred             cEEEEEEECceEEEEEEEE
Confidence            9999999877665544433


No 24 
>PLN03012 Camelliol C synthase
Probab=97.62  E-value=0.00017  Score=91.74  Aligned_cols=70  Identities=14%  Similarity=0.179  Sum_probs=52.6

Q ss_pred             hccccceeeecCCCCccccCCCC-------------------CCCCcchhHHHHHHHHHccCCCc------cCHHHHHHH
Q psy5522         931 TGYQQELTYRRPDGSFSAFGTTD-------------------PNGSTWLTAFVAKSFRQAASHTT------IDESVILEA  985 (1397)
Q Consensus       931 ~gy~~~l~y~~~dGsfs~~~~~~-------------------~~~s~wlTAyvl~~~~~a~~~i~------vd~~~~~~~  985 (1397)
                      .+..-+|++|+.||||++|....                   +....+.||-|+++|...+++..      | ...+.+|
T Consensus       517 ~av~wlL~mQn~dGGwaafe~~~~~~~le~lnp~E~F~d~mid~~y~dcTa~~l~aL~~f~~~~~~~r~~~i-~~~i~rA  595 (759)
T PLN03012        517 DAVNILLSLQSKNGGMTAWEPAGAPEWLELLNPTEMFADIVIEHEYNECTSSAIQALILFKQLYPDHRTEEI-NAFIKKA  595 (759)
T ss_pred             HHHHHHHhccCCCCCEeeecCCcchHHHHhcChhhhhcCeecCCCcccHHHHHHHHHHHHhhhCcccchhhh-HHHHHHH
Confidence            44445677999999999986421                   13445669999999987665422      2 4678999


Q ss_pred             HHHHHhcccCCCcccc
Q psy5522         986 LAWLSSNQAVNGSFPE 1001 (1397)
Q Consensus       986 ~~WL~~~q~~~G~F~s 1001 (1397)
                      ++||.+.|++||+|..
T Consensus       596 v~~L~~~Q~~DGsW~G  611 (759)
T PLN03012        596 AEYIENIQMLDGSWYG  611 (759)
T ss_pred             HHHHHHhcCCCCCCcc
Confidence            9999999999999973


No 25 
>PLN02993 lupeol synthase
Probab=97.40  E-value=0.0011  Score=84.79  Aligned_cols=114  Identities=19%  Similarity=0.294  Sum_probs=73.1

Q ss_pred             hhhHHHHHHHHHhhcC---------CCCchHHHHHHHHHHhhCCCC-----chHHHHHHHhhcccccCCcccccccCCCC
Q psy5522        1157 MNTINKAVDYIVKNLA---------GTEDAYAIAICSYALHLAQHP-----VKDVAFNLLESKAHNEDGKKWWKRAERPE 1222 (1397)
Q Consensus      1157 ~~~~~~a~~~l~~~~~---------~~~~~y~~al~aYal~la~~~-----~~~~~~~~L~~~a~~~~~~~~W~~~~~~~ 1222 (1397)
                      +..+++|+.||++.=.         ++.-.|..+-+--||+.+|..     ..+.+.+=|.+....+||   |..+-..-
T Consensus       589 ~~~i~rAv~yL~~~Q~~DGSW~G~Wgv~y~YgT~~aL~aL~a~G~~~~~~~~IrrAv~fLls~Q~~DGG---WGEs~~S~  665 (763)
T PLN02993        589 IKSIEKAVQFIESKQTPDGSWYGNWGICFIYATWFALGGLAAAGKTYNDCLAMRKGVHFLLTIQRDDGG---WGESYLSC  665 (763)
T ss_pred             HHHHHHHHHHHHHhcCCCCCcccccccccCcHHHHHHHHHHHcCCCCCCcHHHHHHHHHHHHhcCCCCC---cCcCcCcC
Confidence            4689999999987421         245678777777788888764     235577777776555544   43221000


Q ss_pred             CCCCCCCC-CCCcccchhhHHHHHHhhccCCcc----cchhhhhhhhhcccCCCCcc
Q psy5522        1223 DKKNPWAQ-VPNSVDVEMTSYALLSYLDRGLVE----DTLPILTWLVTQQNDQGGFA 1274 (1397)
Q Consensus      1223 ~~~~~~~~-~~~s~~vE~t~Y~lL~~~~~~~~~----~a~~i~~Wl~~q~n~~Ggf~ 1274 (1397)
                      .. ..+-. ....-++.-||+|||+++..|.-+    .-.+-++||.+.|...|+|-
T Consensus       666 ~~-~~y~~~~~~~St~~qTAwAllaL~~aG~~~~~~~~l~Rgi~~L~~~Q~~~G~W~  721 (763)
T PLN02993        666 PE-QRYIPLEGNRSNLVQTAWAMMGLIHAGQAERDLIPLHRAAKLIITSQLENGDFP  721 (763)
T ss_pred             CC-cccccCCCCCCchhhHHHHHHHHHHcCCCCCCcHHHHHHHHHHHhccCCCCCCC
Confidence            00 00111 112235678999999999887532    23468999999999999983


No 26 
>cd00688 ISOPREN_C2_like This group contains class II terpene cyclases, protein prenyltransferases beta subunit, two broadly specific proteinase inhibitors alpha2-macroglobulin (alpha (2)-M) and pregnancy zone protein (PZP) and, the C3 C4 and C5 components of vertebrate complement. Class II terpene cyclases include squalene cyclase (SQCY) and 2,3-oxidosqualene cyclase (OSQCY), these integral membrane proteins catalyze a cationic cyclization cascade converting linear triterpenes to fused ring compounds.  The protein prenyltransferases include protein farnesyltransferase (FTase) and geranylgeranyltransferase types I and II (GGTase-I and GGTase-II) which catalyze the carboxyl-terminal lipidation of Ras, Rab, and several other cellular signal transduction proteins, facilitating membrane associations and specific protein-protein interactions. Alpha (2)-M is a major carrier protein in serum and involved in the immobilization and entrapment of proteases. PZP is a pregnancy associated protein. 
Probab=97.38  E-value=0.00031  Score=81.28  Aligned_cols=85  Identities=22%  Similarity=0.185  Sum_probs=60.3

Q ss_pred             CchHHHHHHHHHHhhCCCC----chHHHHHHHhhcccccCCcccccccCCCCCCCCCCCCCCCcccchhhHHHHHHhhcc
Q psy5522        1175 EDAYAIAICSYALHLAQHP----VKDVAFNLLESKAHNEDGKKWWKRAERPEDKKNPWAQVPNSVDVEMTSYALLSYLDR 1250 (1397)
Q Consensus      1175 ~~~y~~al~aYal~la~~~----~~~~~~~~L~~~a~~~~~~~~W~~~~~~~~~~~~~~~~~~s~~vE~t~Y~lL~~~~~ 1250 (1397)
                      .+....|.+.+||++++..    ....+++-|.+.-..+++...|                 ...+...|+.++.++...
T Consensus       138 ~~~~~t~~al~aL~~~~~~~~~~~~~~~~~~l~~~q~~~g~~~~~-----------------~~~~~~~t~~~~~aL~~~  200 (300)
T cd00688         138 SDVRLTAYALIALALLGKLDPDPLIEKALDYLLSCQNYDGGFGPG-----------------GESHGYGTACAAAALALL  200 (300)
T ss_pred             CcccHHHHHHHHHHHcCCCCCcHHHHHHHHHHHHHhcCCCCcCCC-----------------ccccHHHHHHHHHHHHHc
Confidence            4566677888889998875    3355666666665544443111                 113557889999998877


Q ss_pred             CC--cccchhhhhhhhhcccCCCCcccc
Q psy5522        1251 GL--VEDTLPILTWLVTQQNDQGGFAST 1276 (1397)
Q Consensus      1251 ~~--~~~a~~i~~Wl~~q~n~~Ggf~st 1276 (1397)
                      ++  -......++||.+.|+..|||...
T Consensus       201 ~~~~~~~~~~~~~~L~~~q~~~g~~~~~  228 (300)
T cd00688         201 GDLDSPDAKKALRWLLSRQRPDGGWGEG  228 (300)
T ss_pred             CCcchHHHHHHHHHHHHhcCCCCCcCcc
Confidence            76  567889999999999999999654


No 27 
>cd02889 SQCY Squalene cyclase (SQCY) domain; found in class II terpene cyclases that have an alpha 6 - alpha 6 barrel fold. Squalene cyclase (SQCY) and 2,3-oxidosqualene cyclase (OSQCY) are integral membrane proteins that catalyze a cationic cyclization cascade converting linear triterpenes to fused ring compounds. Bacterial SQCY catalyzes the convertion of squalene to hopene or diplopterol. Eukaryotic OSQCY transforms the 2,3-epoxide of squalene to compounds such as, lanosterol (a metabolic precursor of cholesterol and steroid hormones) in mammals and fungi or, cycloartenol in plants. Deletion of a single glycine residue of Alicyclobacillus acidocaldarius SQCY alters its substrate specificity into that of eukaryotic OSQCY. Both enzymes have a second minor domain, which forms an alpha-alpha barrel that is inserted into the major domain. This group also contains SQCY-like archael sequences and some bacterial SQCY's which lack this minor domain.
Probab=97.31  E-value=0.00067  Score=81.26  Aligned_cols=91  Identities=19%  Similarity=0.248  Sum_probs=63.5

Q ss_pred             HHHHHHHHHHhcc-ccHHHHHHHHHHHHhccccceeeecCCCCccccCC--------CCCCCCcchhHHHHHHHHHccCC
Q psy5522         904 NIVVLEYLKKTYQ-LTDAIEAKASRYLETGYQQELTYRRPDGSFSAFGT--------TDPNGSTWLTAFVAKSFRQAASH  974 (1397)
Q Consensus       904 ~i~~l~yL~~~~~-l~~~~~~~a~~~l~~gy~~~l~y~~~dGsfs~~~~--------~~~~~s~wlTAyvl~~~~~a~~~  974 (1397)
                      +.+++.-|...+. ...+..++++++|.+       .|++||+|+....        ....+++..|||++++|+.++..
T Consensus       222 ta~a~~aL~~~g~~~~~~~~~~~~~~L~~-------~Q~~dG~w~~~~~~~~~~~~~~~~~~~~~~Ta~al~aL~~~g~~  294 (348)
T cd02889         222 TWFALEALAAAGEDENSPYVRKACDWLLS-------KQNPDGGWGESYESYEDPSYAGGGRSTVVQTAWALLALMAAGEP  294 (348)
T ss_pred             HHHHHHHHHHcCCCcCcHHHHHHHHHHHH-------ccCCCCCcCCcccccccccccCCCCCcHHHHHHHHHHHHhcCCC
Confidence            3444444444443 222333456665543       5778998764210        12356889999999999998764


Q ss_pred             CccCHHHHHHHHHHHHhcccCCCccccchH
Q psy5522         975 TTIDESVILEALAWLSSNQAVNGSFPEVGK 1004 (1397)
Q Consensus       975 i~vd~~~~~~~~~WL~~~q~~~G~F~stqd 1004 (1397)
                         |...+.++++||.++|+++|+|.|+|+
T Consensus       295 ---~~~~v~~a~~wL~~~Q~~dG~f~~~~~  321 (348)
T cd02889         295 ---DSEAVKRGVKYLLNTQQEDGDWPQEEI  321 (348)
T ss_pred             ---CHHHHHHHHHHHHHhcCCCCCcCCcee
Confidence               667899999999999999999999985


No 28 
>cd02890 PTase Protein prenyltransferase (PTase) domain, beta subunit (alpha 6 - alpha 6 barrel fold). The protein prenyltransferase family of lipid-modifying enzymes includes protein farnesyltransferase (FTase) and geranylgeranyltransferase types I and II (GGTase-I and GGTase-II). They catalyze the carboxyl-terminal lipidation of Ras, Rab, and several other cellular signal transduction proteins, facilitating membrane associations and specific protein-protein interactions. Prenyltransferases employ a Zn2+ ion to alkylate a thiol group catalyzing the formation of thioether linkages between the C1 atom of farnesyl (15-carbon by FTase) or geranylgeranyl (20-carbon by GGTase-I, II) isoprenoid lipids and cysteine residues at or near the C-terminus of protein acceptors. FTase and GGTase-I prenylate the cysteine in the terminal sequence, "CAAX"; and GGTase-II prenylates both cysteines in the "CC" (or "CXC") terminal sequence. These enzymes are heterodimeric with both alpha and beta subunits re
Probab=97.01  E-value=0.01  Score=69.14  Aligned_cols=91  Identities=18%  Similarity=0.097  Sum_probs=65.1

Q ss_pred             chHHHHHHHHHHhcc-ccHHHHHHHHHHHHhccccceeee-cCCCCccccCCCCCCCCcchhHHHHHHHHHccCCCccCH
Q psy5522         902 VPNIVVLEYLKKTYQ-LTDAIEAKASRYLETGYQQELTYR-RPDGSFSAFGTTDPNGSTWLTAFVAKSFRQAASHTTIDE  979 (1397)
Q Consensus       902 ~p~i~~l~yL~~~~~-l~~~~~~~a~~~l~~gy~~~l~y~-~~dGsfs~~~~~~~~~s~wlTAyvl~~~~~a~~~i~vd~  979 (1397)
                      .-..+++.-|.--+. +....+++++++|..       .| ++||+|+....  ..++++.|.+++.++...+.- ..+.
T Consensus        28 ~~~y~~l~~l~ll~~~~~~~~~~~~i~~l~~-------~q~~~~Ggf~~~~~--~~~~~~~T~~al~~l~llg~~-~~~~   97 (286)
T cd02890          28 WLLYWILSSLDLLGEDLDDENKDEIIDFIYS-------CQVNEDGGFGGGPG--QDPHLASTYAAVLSLAILGDD-ALSR   97 (286)
T ss_pred             HHHHHHHHHHHHhCCCcchHHHHHHHHHHHH-------hhcCCCCCCCCCCC--CCccHHHHHHHHHHHHHcCcc-ccch
Confidence            333444444443343 566778889988775       45 78999988643  367899999999999988752 1233


Q ss_pred             HHHHHHHHHHHhcccCCCccccc
Q psy5522         980 SVILEALAWLSSNQAVNGSFPEV 1002 (1397)
Q Consensus       980 ~~~~~~~~WL~~~q~~~G~F~st 1002 (1397)
                      ....++++||.+.|+++|+|...
T Consensus        98 ~~~~~~~~~l~~~q~~dGgf~~~  120 (286)
T cd02890          98 IDREKIYKFLSSLQNPDGSFRGD  120 (286)
T ss_pred             hhHHHHHHHHHHhcCCCCCcccC
Confidence            34567899999999999999753


No 29 
>PF13249 Prenyltrans_2:  Prenyltransferase-like; PDB: 1O6R_B 1O6Q_B 1H35_C 1H3A_C 1SQC_A 1UMP_A 1O6H_C 1O79_B 1GSZ_C 1H37_C ....
Probab=97.01  E-value=0.001  Score=65.58  Aligned_cols=91  Identities=19%  Similarity=0.230  Sum_probs=60.2

Q ss_pred             hcccchHHHHHHHHHHhccccHHHHHHH--HHHHHhccccceeeecCCCCccccCCCCCCCCcchhHHHHHHHHHccCCC
Q psy5522         898 MLNFVPNIVVLEYLKKTYQLTDAIEAKA--SRYLETGYQQELTYRRPDGSFSAFGTTDPNGSTWLTAFVAKSFRQAASHT  975 (1397)
Q Consensus       898 m~~~~p~i~~l~yL~~~~~l~~~~~~~a--~~~l~~gy~~~l~y~~~dGsfs~~~~~~~~~s~wlTAyvl~~~~~a~~~i  975 (1397)
                      .+....+.+++.-|...+...+  ++++  .++|.+       -|++||+|+..... ...+++.|++++.++...... 
T Consensus        20 ~~~~~~T~~al~aL~~~g~~~~--~~~~~~~~~L~~-------~q~~dGg~~~~~~~-~~~~~~~t~~~l~~l~~~~~~-   88 (113)
T PF13249_consen   20 PSDVWDTAFALLALAALGEEPD--RDRAAAVEWLLS-------QQNPDGGWGSNPDG-GPPDVYTTYVALAALELLGRP-   88 (113)
T ss_dssp             -BEHHHHHHHHHHHHHHTSHHC--HHHHHHHHHHHH-------HB-TTSGBBSSTTT-T-BSHHHHHHHHHHHHHHT-G-
T ss_pred             CCCHHHHHHHHHHHHHhCCccc--HHHHHHHHHHHH-------hCCCCCCccCCCCC-CCccHHHHHHHHHHHHHcCCC-
Confidence            3444555566666665554322  4455  777765       47799999887532 356789999999988766543 


Q ss_pred             ccCHHHHHHHHHHHHhcccCCCccc
Q psy5522         976 TIDESVILEALAWLSSNQAVNGSFP 1000 (1397)
Q Consensus       976 ~vd~~~~~~~~~WL~~~q~~~G~F~ 1000 (1397)
                       -+...+.++++||.++|+++|+|.
T Consensus        89 -~~~~~~~~a~~~l~~~Q~~dGg~~  112 (113)
T PF13249_consen   89 -DDEEAVRKAVDWLLSCQNPDGGWG  112 (113)
T ss_dssp             -GCHTTHCCHHHHHHHTB-TTSSB-
T ss_pred             -cccHHHHHHHHHHHHhcCCCCCCC
Confidence             224678999999999999999995


No 30 
>PF13249 Prenyltrans_2:  Prenyltransferase-like; PDB: 1O6R_B 1O6Q_B 1H35_C 1H3A_C 1SQC_A 1UMP_A 1O6H_C 1O79_B 1GSZ_C 1H37_C ....
Probab=96.91  E-value=0.00057  Score=67.43  Aligned_cols=60  Identities=25%  Similarity=0.417  Sum_probs=46.1

Q ss_pred             ceeeecCCCCccccCCCCCCCCcchhHHHHHHHHHccCCCccCHHHHHHH--HHHHHhcccCCCccccch
Q psy5522         936 ELTYRRPDGSFSAFGTTDPNGSTWLTAFVAKSFRQAASHTTIDESVILEA--LAWLSSNQAVNGSFPEVG 1003 (1397)
Q Consensus       936 ~l~y~~~dGsfs~~~~~~~~~s~wlTAyvl~~~~~a~~~i~vd~~~~~~~--~~WL~~~q~~~G~F~stq 1003 (1397)
                      +++.|++||+|+.+ .  ..++++.|++++.+|..+...     ....++  ++||.++|+++|+|.+..
T Consensus         4 L~~~Q~~dGgw~~~-~--~~~~~~~T~~al~aL~~~g~~-----~~~~~~~~~~~L~~~q~~dGg~~~~~   65 (113)
T PF13249_consen    4 LLSRQNPDGGWGGF-G--GPSDVWDTAFALLALAALGEE-----PDRDRAAAVEWLLSQQNPDGGWGSNP   65 (113)
T ss_dssp             HHHHB-TTSSBBSS-T--S-BEHHHHHHHHHHHHHHTSH-----HCHHHHHHHHHHHHHB-TTSGBBSST
T ss_pred             hHHHcCCCCCCcCC-C--CCCCHHHHHHHHHHHHHhCCc-----ccHHHHHHHHHHHHhCCCCCCccCCC
Confidence            45579999999886 2  367899999999999988753     225556  999999999999998754


No 31 
>PF07703 A2M_N_2:  Alpha-2-macroglobulin family N-terminal region;  InterPro: IPR011625 This is a domain of the alpha-2-macroglobulin family. The alpha-macroglobulin (aM) family of proteins includes protease inhibitors [], typified by the human tetrameric a2-macroglobulin (a2M); they belong to the MEROPS proteinase inhibitor family I39, clan IL. These protease inhibitors share several defining properties, which include (i) the ability to inhibit proteases from all catalytic classes, (ii) the presence of a 'bait region' and a thiol ester, (iii) a similar protease inhibitory mechanism and (iv) the inactivation of the inhibitory capacity by reaction of the thiol ester with small primary amines. aM protease inhibitors inhibit by steric hindrance []. The mechanism involves protease cleavage of the bait region, a segment of the aM that is particularly susceptible to proteolytic cleavage, which initiates a conformational change such that the aM collapses about the protease. In the resulting aM-protease complex, the active site of the protease is sterically shielded, thus substantially decreasing access to protein substrates. Two additional events occur as a consequence of bait region cleavage, namely (i) the h-cysteinyl-g-glutamyl thiol ester becomes highly reactive and (ii) a major conformational change exposes a conserved COOH-terminal receptor binding domain [] (RBD). RBD exposure allows the aM protease complex to bind to clearance receptors and be removed from circulation []. Tetrameric, dimeric, and, more recently, monomeric aM protease inhibitors have been identified [, ].; PDB: 2QKI_D 3L3O_D 3NMS_A 2ICF_A 2A73_A 2ICE_D 2HR0_A 2A74_A 2XWJ_G 3OHX_A ....
Probab=96.67  E-value=0.044  Score=56.05  Aligned_cols=122  Identities=15%  Similarity=0.217  Sum_probs=79.8

Q ss_pred             EEEcCCCcccCCceeEEEEEEEcCCCccCCCcceEEEEECCCCCeEEeeeeeccCCceeeeeeecCCCCCCeeEEEEEEE
Q psy5522         101 FIQLDKAIYKPGDLVRFRAIVLNSHLKPSVTGALEIFITDGKGNRIKQWTRALTTRGVFSADLQLSKSPVLGDWNITINV  180 (1397)
Q Consensus       101 fIqTDKpIYkPGqtV~fRvv~ld~~l~P~~~~~~~v~I~DP~g~~I~qw~~~~~~~Gi~~~sf~L~~~~~lG~y~I~v~~  180 (1397)
                      -|.+||..|+|||++++.+.+-..      ...+.+.|.. +|..+. +.......|.-..+|+++++...+-|-+....
T Consensus         2 ~i~~~~~~~~~Ge~~~v~v~~~~~------~~~~~~~v~s-~g~I~~-~~~~~~~~~~~~~~~~v~~~~~P~~~v~~~~v   73 (136)
T PF07703_consen    2 QISTDKDSYKPGETAKVTVQSPFP------NGTFLYLVES-RGKIVS-TGSVELKNGSTTFEFPVTPDMAPNFYVLAYYV   73 (136)
T ss_dssp             EEEE-SSSB-TTSEEEEEEEEESC------ESEEEEEEEE-TTEEEE-EEEEECTTTSSEEEEEE-GGGTSEEEEEEEEE
T ss_pred             EEEcCCCCcCCCCEEEEEEEcCCC------ccEEEEEEEE-CCeEEE-EEEEEecCCcEEEEEecchhcCCcEEEEEEEE
Confidence            478999999999999999887543      3555666654 444443 33444666777999999988777665555554


Q ss_pred             ---cC--eEEEEEEEEeeecCCcEEEEEeCCCceEeeCCeEEEEEEEEEecCccccceEEEEE
Q psy5522         181 ---LD--QKFTKRFTVAEYVLPKFEVNVNVPPHATFKDSKVVISVNAKYTYGKPVKGEATITA  238 (1397)
Q Consensus       181 ---~~--~~~~~~F~VeEYvLPkFeV~v~~p~~i~~~~~~~~v~v~AkYtyGkPV~G~v~v~v  238 (1397)
                         ++  ...+..|.|+.-.  +.+|+++..+..+..++++++++.+      |..+.+.+.+
T Consensus        74 ~~~~g~~~~~s~~i~V~~~~--~~~v~l~~~~~~~~Pg~~~~~~i~~------~~~s~v~l~~  128 (136)
T PF07703_consen   74 RPADGEVVADSVWIEVEPCF--ELKVELTASPDEYKPGEEVTLRIKA------PPNSLVGLSA  128 (136)
T ss_dssp             TTCTCEEEEEEEEEEBGCSG--SSSEEEEESSSSBTTTSEEEEEEEE------STTEEEEEEE
T ss_pred             cCCCCeEEEEEEEEEecccc--cceEEEEEecceeCCCCEEEEEEEe------CCCCEEEEEE
Confidence               23  3467789998822  4555555655556689999999998      4444454443


No 32 
>cd02894 GGTase-II Geranylgeranyltransferase type II (GGTase-II)_like proteins containing the protein prenyltransferase (PTase) domain, beta subunit (alpha 6 - alpha 6 barrel fold). GGTase-IIs are a subgroup of the protein prenyltransferase family of lipid-modifying enzymes. PTases catalyze the carboxyl-terminal lipidation of Ras, Rab, and several other cellular signal transduction proteins, facilitating membrane associations and specific protein-protein interactions. Prenyltransferases employ a Zn2+ ion to alkylate a thiol group catalyzing the formation of thioether linkages between cysteine residues at or near the C-terminus of protein acceptors and the C1 atom of isoprenoid lipids (geranylgeranyl (20-carbon) in the case of GGTase-II ). GGTase-II catalyzes alkylation of both cysteine residues in Rab proteins containing carboxy-terminal "CC", "CXCX" or "CXC" motifs. PTases are heterodimeric with both alpha and beta subunits required for catalytic activity. In contrast to other prenyltr
Probab=96.62  E-value=0.024  Score=65.95  Aligned_cols=104  Identities=15%  Similarity=0.074  Sum_probs=66.1

Q ss_pred             cCCCchhhhhcccchHHHHHHHHHHhccccHHHHHHHHHHHHhccccceeee-cCCCCccccCCCCCCCCcchhHHHHHH
Q psy5522         889 MPFGCGEQNMLNFVPNIVVLEYLKKTYQLTDAIEAKASRYLETGYQQELTYR-RPDGSFSAFGTTDPNGSTWLTAFVAKS  967 (1397)
Q Consensus       889 ~P~GCgEQ~m~~~~p~i~~l~yL~~~~~l~~~~~~~a~~~l~~gy~~~l~y~-~~dGsfs~~~~~~~~~s~wlTAyvl~~  967 (1397)
                      +|+.=..+..+++.-.-+++.-|.--+++....+++.++++.+       .+ ..+|+|......  .+++..|-+++.+
T Consensus        18 ~~~~~~~~~~~r~~~~y~~l~~l~lL~~~~~~~~~~~i~fl~~-------~q~~~~GgF~~~~~~--~~~~~~t~~a~~~   88 (287)
T cd02894          18 DDYEYILTEHLRMSGIYWGLTALDLLGQLERLNREEIIEFVKS-------CQDNEDGGFGGSPGH--DPHILSTLSAIQI   88 (287)
T ss_pred             CChhhhhHHhhHHHHHHHHHHHHHHhCCCchhhHHHHHHHHHH-------HhcCCCCCCCCCCCC--cchHHHHHHHHHH
Confidence            3554344444555555555544444455444445777777765       34 568999875432  4577778777777


Q ss_pred             HHHccCCCccCHHHHHHHHHHHHhcccCCCccccc
Q psy5522         968 FRQAASHTTIDESVILEALAWLSSNQAVNGSFPEV 1002 (1397)
Q Consensus       968 ~~~a~~~i~vd~~~~~~~~~WL~~~q~~~G~F~st 1002 (1397)
                      +...+.+-.||+ ..+++++||.+.|+++|||...
T Consensus        89 L~ll~~~~~i~~-~~~~~~~~i~~~q~~dGgf~~~  122 (287)
T cd02894          89 LALYDLLNKIDE-NKEKIAKFIKGLQNEDGSFSGD  122 (287)
T ss_pred             HHHhhhhhhccH-HHHHHHHHHHHHcCCCCCeecC
Confidence            776543323453 4678999999999999999854


No 33 
>PRK14081 triple tyrosine motif-containing protein; Provisional
Probab=96.27  E-value=5.6  Score=50.72  Aligned_cols=417  Identities=15%  Similarity=0.151  Sum_probs=212.9

Q ss_pred             CCCceeEEEEEec---CceeeEeeEEEEEEeccc-----EEEEEcCCCcccCCceeEEEEEEEcCCCccCCCcceEEEEE
Q psy5522          68 LGPGQYNLTVKGS---GSLNFYNSTGLVYVHKSY-----SVFIQLDKAIYKPGDLVRFRAIVLNSHLKPSVTGALEIFIT  139 (1397)
Q Consensus        68 ~~~~~~~l~v~g~---~~~~f~~~~~v~~~~~~~-----~vfIqTDKpIYkPGqtV~fRvv~ld~~l~P~~~~~~~v~I~  139 (1397)
                      ..+|.|.|.+...   +.-.|..++.+.+.-+..     .-|.....|+.. |+.|.|.+-+....-+-   --....+.
T Consensus       155 ~~~G~Y~ll~~~Kd~~S~~~fDD~~~v~y~Vk~~~~v~I~~F~~lns~~i~-~~eI~f~~~a~~~~g~~---~LYKF~~i  230 (667)
T PRK14081        155 NKPGKYELLVECKRIDSTKDFDDFKKVKFKVKEIDKVEITDFKCLNKELIC-DEELVFEVESVYEEDRT---ILYKFVKI  230 (667)
T ss_pred             cCCCcEEEEEEEecCCCccccCcceEEEEEcccCcceEEEeccccCcceec-CcEEEEEEEEEeCCCce---EEEEEEEE
Confidence            3567788877664   224566666666655433     233346667766 88888887665433211   12233455


Q ss_pred             CCCCCeEEeeeeecc--CCceeeeeeecCCCCCCeeEEEEEEEcC--------eEEEEEEEEeeecCCcEE-EEEeCCCc
Q psy5522         140 DGKGNRIKQWTRALT--TRGVFSADLQLSKSPVLGDWNITINVLD--------QKFTKRFTVAEYVLPKFE-VNVNVPPH  208 (1397)
Q Consensus       140 DP~g~~I~qw~~~~~--~~Gi~~~sf~L~~~~~lG~y~I~v~~~~--------~~~~~~F~VeEYvLPkFe-V~v~~p~~  208 (1397)
                      |++|....    .++  ...++  ++. |  ...|.|+|.+.+.+        ......|.|.-|..++.+ +.+...+.
T Consensus       231 ~~~G~~~~----~qdYst~n~~--~y~-~--~~~G~Y~i~~~VKD~~S~~eyDD~~~l~y~Vk~~~~vkI~~~~~d~~s~  301 (667)
T PRK14081        231 DSDGKQTC----IQDYSTKNIV--SYK-E--KKSGDYKLLCLVKDMYSNNEFDDRAVLVYTVKPYKDIKIRNFTTDLSSP  301 (667)
T ss_pred             CCCCCEEE----ecCccccceE--EEE-e--CCCccEEEEEEEeccCcccccccceEEEEEEecCCCcEEEEEEEcCCCC
Confidence            77774332    211  12222  222 2  25799999999976        345668999888877753 23333333


Q ss_pred             eEeeCCeEEEEEEEEEecCccccceEEEEEEeccccCccc--ccc--ccceeEEeecCceEEEEEechhhhccccccceE
Q psy5522         209 ATFKDSKVVISVNAKYTYGKPVKGEATITAYPTIFSGVIQ--PLF--QTPVRKVVPIDGKTVIEFDVVKELQLTDEYERN  284 (1397)
Q Consensus       209 i~~~~~~~~v~v~AkYtyGkPV~G~v~v~v~~~~~~g~~~--~~~--~~~~~~~~d~~G~~~~~~~~~~~~~~~~~~~~~  284 (1397)
                      + +.++.+.+++.|.  -|.    .+.++.-.+   |...  .++  +..+.=+-...|...+.+-+++ --..+.|...
T Consensus       302 q-l~g~~I~ika~a~--GG~----~llYrf~I~---G~~~e~~~Y~~~n~~~w~P~~~G~Y~I~v~VKD-k~S~~eyD~~  370 (667)
T PRK14081        302 Q-LTDTDIELKAVAE--GGK----ELLYRFIIK---GKESEDSGYIRNNIYTWKPKIAGKYSITLWVKD-ISSKGEYEDK  370 (667)
T ss_pred             e-ecCceEEEEEEec--CCC----ceEEEEEEC---CcEEeeccccccceEEEeeCCCceEEEEEEEEc-CcCcccccce
Confidence            3 4678888888776  232    222332211   1100  000  0111112223455544444433 1111122222


Q ss_pred             EEEEEEEEECCcceEEeeEEEEEEEEeeeEEEEE--ecCCcccCCCeEEEEEEEEccCCCccCCCCceEEEEEeeecccc
Q psy5522         285 IHFDVAVEEALTGRRQNNTGSVVFHKHKYKMDLI--KSSEYFKPGLKYTAYMKLTHHDGTPVTDNNNMVQVRHGFSYDES  362 (1397)
Q Consensus       285 l~V~a~VtE~gTG~~~~~~~~i~i~~~~~~i~f~--~~~~~~kpG~p~~~~v~v~~~dG~P~~~~~~~V~v~~~~~~~~~  362 (1397)
                      -.++-+|.|..              ..|.+|.-.  +....+--|.++.+.+.+.+.++  +.     ....+..+    
T Consensus       371 ~~v~f~i~~~~--------------~~p~~I~~vl~d~~~~~lvG~~i~i~v~a~gg~~--~l-----Y~f~ik~n----  425 (667)
T PRK14081        371 SSIDYTIEEKS--------------KEPIKIEDVILDKGKHILKGEEIKIRVIAEGGTN--LR-----YSFIIKKD----  425 (667)
T ss_pred             EEEEEEEcccC--------------CCCeEEEEEEECCCCceEeCCeEEEEEEecCCCe--EE-----EEEEEEEC----
Confidence            22222222211              013333322  22345556888877776655432  21     11111100    


Q ss_pred             ccceeeEeeCCCeeEEEEEeCCCCCceeEEEEEEEecCcce-----EEEEEEee-cCCCC-CeEEEEecCCCCCCCCeEE
Q psy5522         363 KYEANQYKLDRNGMIKLVYYPPANENVTTLGIEAEYLDIKE-----WFSTISAS-ESPSN-SFIQAALLTQNPKVNKDVE  435 (1397)
Q Consensus       363 ~~~~~~~ttd~~G~~~f~i~~~~~~~~~~l~l~a~~~~~~~-----~~~~v~~~-~S~s~-s~L~i~~~~~~~~~G~~~~  435 (1397)
                      +....  .++=.-.-.+.|.| ...|.|.|.|.|+++-+..     ....+..+ +.|-. +++ |.+.++.+.||+++.
T Consensus       426 g~~ve--~~~Y~~~~~~~f~P-~~~G~Y~IeV~vKdk~S~~~yD~~k~v~l~V~e~~P~~i~~i-l~~~~~~~~vg~~i~  501 (667)
T PRK14081        426 GKEEE--KIDYGKNNWVNFIP-EEKGNYELEVRVKDKYSDKEYDAHTIVYIKVHEYIPAEIDYI-LLPVKEYYLVGDDIE  501 (667)
T ss_pred             CEEEE--EeecccccEEEEEE-CCCeeEEEEEEEecccCchhcccceEEEEEEeccCceeeeeE-EecccccEEeCCEEE
Confidence            01001  11111111344544 3557888888887765432     11222222 34532 333 456677899999999


Q ss_pred             EEEEecCC-cceEEEEEEeCCeEEEEEEEEecCCceeEEEEEeeCCCCCceeEEEEEEEecCCeEEEE---eEEEEEccc
Q psy5522         436 LEINSTAP-LKYISYQVLGRGDVIMADTITVPGNKMSTVIRFLATYAMAPTAHVIVQYVREDGEVVAD---GLDLELEGG  511 (1397)
Q Consensus       436 v~v~s~~~-~~~~~ylV~srG~Iv~~~~~~~~~~~~~~~~~l~vt~~maP~a~l~vy~v~~~gevvad---s~~~~V~~~  511 (1397)
                      +.+.+... .-..-|.++.+|..+...-..-   +.  .++|  .|.-+-.-.+.||+-+.-.....|   .+.|.|...
T Consensus       502 ~~~~~~~~k~v~y~y~~~~NG~~v~~t~Ys~---~~--~ysf--~P~~~GkY~I~V~aKn~~s~~~~D~~k~v~~~V~e~  574 (667)
T PRK14081        502 IEVIIQNTKDVLIKYILKINGHKVEETDYIK---NK--KYKF--IPKCSGKYTIEVLAKNIKSTEEYDSKKEVKFYVREA  574 (667)
T ss_pred             EEEEEeCCCeEEEEEEEEECCEEEEEeeccc---cc--eEEE--eecCCceEEEEEEEcccccccccccceEEEEEEcCC
Confidence            99986532 2236688899998875332221   12  2333  244555566777765433333333   356777644


Q ss_pred             cc-ceeeEEecCCCCCCCceEEEEEEecCCcEEE
Q psy5522         512 LQ-NFVSANVSPDETEPGSNIQINLEAKPNSYIG  544 (1397)
Q Consensus       512 f~-nkV~l~~s~~~~~Pg~~v~l~V~a~p~S~v~  544 (1397)
                      ++ ....+..++....+|+.+++.+.+.-|.-|.
T Consensus       575 ~pi~nt~~~~~~~~~~~n~~~t~~~~~~gg~~v~  608 (667)
T PRK14081        575 LPITNTKIKTSKKKFKCNEEVTFSVKSEGGKDVC  608 (667)
T ss_pred             CCceeeEEEeecceEEcCCeEEEEEEccCCCcEE
Confidence            32 2356666778899999999999988766553


No 34 
>COG1470 Predicted membrane protein [Function unknown]
Probab=96.23  E-value=0.25  Score=58.72  Aligned_cols=158  Identities=18%  Similarity=0.259  Sum_probs=110.6

Q ss_pred             eeEEEeeeEEEEecCC--c-eeeeCCEEEEEEEEEcCCCccEEEEEEEe-ecCCeeeeecCCCccCCCCCCcceeeeEEE
Q psy5522         707 KKLRVFRPFFISLDLP--Y-SVMRGEVVAIPVVVFNYLSQDLVADVTLE-NVGQFDFADFSNEVDAAPQPKFEVFRRKKL  782 (1397)
Q Consensus       707 ~~l~vfkpffv~l~LP--y-sv~rGE~~~l~~~VfNyl~~~~~v~v~L~-~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~  782 (1397)
                      ..+.-.+.||.+++.-  | -+..++.+.+.+++-|...++.++.+.+. .++++...-.+.+           ...-++
T Consensus       259 ls~~~k~~~ftEl~~s~~~~~i~~~~t~sf~V~IeN~g~~~d~y~Le~~g~pe~w~~~Fteg~-----------~~vt~v  327 (513)
T COG1470         259 LSVTEKKSYFTELNSSDIYLEISPSTTASFTVSIENRGKQDDEYALELSGLPEGWTAEFTEGE-----------LRVTSV  327 (513)
T ss_pred             EEEEeccceEEEeecccceeEEccCCceEEEEEEccCCCCCceeEEEeccCCCCcceEEeeCc-----------eEEEEE
Confidence            4455567888888765  3 34568888999999999999999998888 5565543322221           335589


Q ss_pred             EEeCCCeeEEEEEEEec---eeeeEEEEEEEEeCC---------------------------------------------
Q psy5522         783 TIKANSGSTTTFVITPK---ELGYIGIKVTATSNL---------------------------------------------  814 (1397)
Q Consensus       783 ~v~a~~~~tv~f~v~p~---~lG~v~itV~A~~~~---------------------------------------------  814 (1397)
                      .|.+|+.+.+...|.|.   ..|.-+++|.|.+..                                             
T Consensus       328 kL~~gE~kdvtleV~ps~na~pG~Ynv~I~A~s~s~v~~e~~lki~~~g~~~~~v~l~~g~~~lt~taGee~~i~i~I~N  407 (513)
T COG1470         328 KLKPGEEKDVTLEVYPSLNATPGTYNVTITASSSSGVTRELPLKIKNTGSYNELVKLDNGPYRLTITAGEEKTIRISIEN  407 (513)
T ss_pred             EecCCCceEEEEEEecCCCCCCCceeEEEEEeccccceeeeeEEEEeccccceeEEccCCcEEEEecCCccceEEEEEEe
Confidence            99999999999999984   589999999987630                                             


Q ss_pred             -CCceeE-EEEEec-CCCeeeEEEEEEEEe-cCCCceeEEEEEecCCCCcCCCCcEEEEEEEcccc
Q psy5522         815 -AGDSME-GKLLVK-PEGETQYKNKAIFVD-LRKNKTFSVNVTLDMPKNIVPGSEHVEVSAVGDLL  876 (1397)
Q Consensus       815 -~~D~v~-k~L~V~-peG~~~~~t~s~~l~-~~~~~~~~~~i~l~~P~~~v~gs~~~~vsv~gd~l  876 (1397)
                       ..+.++ -.|.|+ |-|++.+...+ .|+ ++.+...+-++++++|+++.+|--++++++..|.+
T Consensus       408 sGna~LtdIkl~v~~PqgWei~Vd~~-~I~sL~pge~~tV~ltI~vP~~a~aGdY~i~i~~ksDq~  472 (513)
T COG1470         408 SGNAPLTDIKLTVNGPQGWEIEVDES-TIPSLEPGESKTVSLTITVPEDAGAGDYRITITAKSDQA  472 (513)
T ss_pred             cCCCccceeeEEecCCccceEEECcc-cccccCCCCcceEEEEEEcCCCCCCCcEEEEEEEeeccc
Confidence             111111 233443 66666665554 332 22223456777889999999999999999999943


No 35 
>cd02893 FTase Protein farnesyltransferase (FTase)_like proteins containing the protein prenyltransferase (PTase) domain, beta subunit (alpha 6 - alpha 6 barrel fold). FTases are a subgroup of PTase family of lipid-modifying enzymes. PTases catalyze the carboxyl-terminal lipidation of Ras, Rab, and several other cellular signal transduction proteins, facilitating membrane associations and specific protein-protein interactions. These proteins are heterodimers of alpha and beta subunits. Both subunits are required for catalytic activity. Prenyltransferases employ a Zn2+ ion to alkylate a thiol group catalyzing the formation of thioether linkages between cysteine residues at or near the C-terminus of protein acceptors and the C1 atom of isoprenoid lipids. Ftase attaches a 15-carbon farnesyl group to the cysteine within the C-terminal CaaX motif of substrate proteins when X is Ala, Met, Ser, Cys or Gln. Protein farnesylation has been shown to play critical roles in a variety of cellular pro
Probab=96.10  E-value=0.1  Score=60.93  Aligned_cols=77  Identities=16%  Similarity=0.122  Sum_probs=53.7

Q ss_pred             ccHHHHHHHHHHHHhccccceeeecCCCCccccCCCCCCCCcchhHHHHHHHHHccCCCccCHHHHHHHHHHHHhcccCC
Q psy5522         917 LTDAIEAKASRYLETGYQQELTYRRPDGSFSAFGTTDPNGSTWLTAFVAKSFRQAASHTTIDESVILEALAWLSSNQAVN  996 (1397)
Q Consensus       917 l~~~~~~~a~~~l~~gy~~~l~y~~~dGsfs~~~~~~~~~s~wlTAyvl~~~~~a~~~i~vd~~~~~~~~~WL~~~q~~~  996 (1397)
                      +....+++.+++|.+       .|++||+|+.-..  ..++++.|=+++.++...+.--..+.-..+++++||.+.|+++
T Consensus        44 ~~~~~~~~~i~~i~~-------~q~~~GgF~~~~~--~~~h~~~Ty~A~~~L~ll~~~~~~~~id~~~~~~~l~~~q~~d  114 (299)
T cd02893          44 LDQSYADDVISFLRR-------CQNPSGGFGGGPG--QLPHLATTYAAVNALAIIGTEEAYDVIDREALYKFLLSLKQPD  114 (299)
T ss_pred             ccHHHHHHHHHHHHH-------hcCCCCCCCCCCC--CCccHHHHHHHHHHHHHhCCchhhhHhhHHHHHHHHHHhcCCC
Confidence            455667788887765       4678999976322  3678899988888887766410011122345899999999999


Q ss_pred             Cccccc
Q psy5522         997 GSFPEV 1002 (1397)
Q Consensus       997 G~F~st 1002 (1397)
                      |+|...
T Consensus       115 Ggf~~~  120 (299)
T cd02893         115 GSFRMH  120 (299)
T ss_pred             CCeeCC
Confidence            999865


No 36 
>cd02892 SQCY_1 Squalene cyclase (SQCY) domain subgroup 1; found in class II terpene cyclases that have an alpha 6 - alpha 6 barrel fold. Squalene cyclase (SQCY)  and 2,3-oxidosqualene cyclase (OSQCY) are integral membrane proteins that catalyze a cationic cyclization cascade converting linear triterpenes to fused ring compounds. This group contains bacterial SQCY which catalyzes the convertion of squalene to hopene or diplopterol and eukaryotic OSQCY which transforms the 2,3-epoxide of squalene to compounds such as, lanosterol in mammals and fungi or, cycloartenol in plants. Deletion of a single glycine residue of Alicyclobacillus acidocaldarius SQCY alters its substrate specificity into that of eukaryotic OSQCY. Both enzymes have a second minor domain, which forms an alpha-alpha barrel that is inserted into the major domain.
Probab=95.91  E-value=0.023  Score=73.45  Aligned_cols=90  Identities=19%  Similarity=0.291  Sum_probs=63.9

Q ss_pred             HHHHHHHHHHhccc--cHHHHHHHHHHHHhccccceeeecCCCCccc---------cCCCCCCCCcchhHHHHHHHHHcc
Q psy5522         904 NIVVLEYLKKTYQL--TDAIEAKASRYLETGYQQELTYRRPDGSFSA---------FGTTDPNGSTWLTAFVAKSFRQAA  972 (1397)
Q Consensus       904 ~i~~l~yL~~~~~l--~~~~~~~a~~~l~~gy~~~l~y~~~dGsfs~---------~~~~~~~~s~wlTAyvl~~~~~a~  972 (1397)
                      +.+++.-|...+..  ..+..++|.++|.+       .|+.||+|+.         +.+ ...+++..|||++++|+.++
T Consensus       507 T~~al~AL~~~G~~~~~~~~i~~a~~~L~s-------~Q~~DGgWge~~~s~~~~~~~~-~~~s~~~~TA~AllaLl~~g  578 (634)
T cd02892         507 TWFALEALAAAGEDYENSPYIRKACDFLLS-------KQNPDGGWGESYLSYEDKSYAG-GGRSTVVQTAWALLALMAAG  578 (634)
T ss_pred             HHHHHHHHHHhCCcccCcHHHHHHHHHHHh-------cCCCCCCCCCccccccCcccCC-CCCCcHHHHHHHHHHHHHcC
Confidence            33455555554432  23344567776654       5788998752         111 13467889999999999887


Q ss_pred             CCCccCHHHHHHHHHHHHhcccCCCccccchH
Q psy5522         973 SHTTIDESVILEALAWLSSNQAVNGSFPEVGK 1004 (1397)
Q Consensus       973 ~~i~vd~~~~~~~~~WL~~~q~~~G~F~stqd 1004 (1397)
                      ..   |...+.++++||.++|+++|+|.+++.
T Consensus       579 ~~---~~~~i~r~i~wL~~~Q~~~G~w~~~~~  607 (634)
T cd02892         579 EP---DSEAVERGIKYLLNTQLPDGDWPQEEI  607 (634)
T ss_pred             CC---ChHHHHHHHHHHHHcCCCCCCCCCccc
Confidence            63   667899999999999999999998875


No 37 
>TIGR03503 conserved hypothetical protein TIGR03503. This set of conserved hypothetical protein has a phylogenetic range that closely matches that of TIGR03501, a putative C-terminal protein targeting signal.
Probab=95.75  E-value=1.1  Score=53.04  Aligned_cols=190  Identities=16%  Similarity=0.229  Sum_probs=111.8

Q ss_pred             eeCeeecCCeEEEEEEeecCCcc---------EEEEEEEEeecCCCC-eeEEeEEEE---------EC-CCceeEEEEEc
Q psy5522           6 APKVLRPNGEYHVAVSTQAVSEA---------TQVSVEVGGKQDSGG-EFRTKQLLS---------VD-PFSTRIVKLDI   65 (1397)
Q Consensus         6 ~P~~lr~g~~~~v~v~l~~~~~~---------v~v~v~L~~~~~~~~-~~~~~~~~~---------~~-~~~~~~~~f~v   65 (1397)
                      .|..|..|...++...|++.+++         +.+++++...++.+. +......++         ++ +..-..++-.+
T Consensus        98 lP~~l~~gE~lk~ta~L~~d~~~i~~~~~l~~v~~~v~f~~~~~~~~e~~~~~~~vvg~f~DdG~g~DE~p~DGvFT~~l  177 (374)
T TIGR03503        98 LPSPLFQGETLKVTAKLLNDGEPLKLPDFRDVVKLTVRFVSTNNAEDENFGADPIVVGEFEDDGEGLDERPGDGIFTGEF  177 (374)
T ss_pred             CCccccCCCeEEEEEEEecCCEEeechhhhheEEEEEEEEecCCccccccccCCEEEEeeccCCccCCCCCCCceEEEEe
Confidence            58899999999999999986665         345555544322111 111111111         00 01112333344


Q ss_pred             C-CCCCceeEEEEEecCc-eeeEeeEEEEEEec-ccEEEEEcCCCcccCCceeEEEEEEEcCCCccCCCcceEEEEECCC
Q psy5522          66 G-DLGPGQYNLTVKGSGS-LNFYNSTGLVYVHK-SYSVFIQLDKAIYKPGDLVRFRAIVLNSHLKPSVTGALEIFITDGK  142 (1397)
Q Consensus        66 P-~~~~~~~~l~v~g~~~-~~f~~~~~v~~~~~-~~~vfIqTDKpIYkPGqtV~fRvv~ld~~l~P~~~~~~~v~I~DP~  142 (1397)
                      | ..++|.|++.+...++ +.=..+..+.+.+. =...|+|.+.|    ++.=++.+-.-+..++| .+..+++++..|+
T Consensus       178 ~l~~~~G~Y~~~v~~~n~vF~R~~~q~v~V~p~Pi~~~~~~~~~~----~~~h~l~v~~d~~~i~~-~s~~~~~~~~~P~  252 (374)
T TIGR03503       178 NLDVAPGEYRPTYQSRNPVFLREVEQPVLVYPLPVSYTVIQSEDE----SGAHQLMVDADAGHIDP-GSLVIHGELVFPN  252 (374)
T ss_pred             eccCCCceEEEEEEEcCceEEEEEEEeEEEECCCeeEEEEccCCC----CCceEEEEEcccceecc-ccEEEEEEEECCC
Confidence            5 4467899999888544 22233456777654 34678898854    55434444444456778 5889999999999


Q ss_pred             CCeEEeeeeeccCCceeeeeeecCCCCCCeeEEEEEEEcC-eEEEEEEEEeeecCCcEEEEEeCCC
Q psy5522         143 GNRIKQWTRALTTRGVFSADLQLSKSPVLGDWNITINVLD-QKFTKRFTVAEYVLPKFEVNVNVPP  207 (1397)
Q Consensus       143 g~~I~qw~~~~~~~Gi~~~sf~L~~~~~lG~y~I~v~~~~-~~~~~~F~VeEYvLPkFeV~v~~p~  207 (1397)
                      |. +.+-....   .-....+.||.....|.|++....-+ ....+.|.   +.||.+...+..|.
T Consensus       253 g~-~~~~~~~~---~~~~~~~~l~~~~~~G~Y~i~~~~~~Tt~~GRe~~---~~lpe~~F~~~~p~  311 (374)
T TIGR03503       253 GQ-IQQFSIEL---EEPETRVDLPANYEFGKYRVKGTVFGTTITGRELQ---LTLPEFSFNVEEPP  311 (374)
T ss_pred             Cc-eEEecccC---ccCceEEeccCcCCCeEEEEEEEEEEecCCCCEEE---EEcCCEeEEecCCC
Confidence            97 55433221   22344777888888999999988754 22233332   34566655555543


No 38 
>TIGR01507 hopene_cyclase squalene-hopene cyclase. SHC is an essential prokaryotic gene in hopanoid (triterpenoid) biosynthesis. Squalene hopene cyclase, an integral membrane protein, directly cyclizes squalene into hopanoid products.
Probab=95.72  E-value=0.029  Score=71.94  Aligned_cols=86  Identities=21%  Similarity=0.296  Sum_probs=62.1

Q ss_pred             HHHHHHHHhcc-ccHHHHHHHHHHHHhccccceeeecCCCCccc---------cCCCCCCCCcchhHHHHHHHHHccCCC
Q psy5522         906 VVLEYLKKTYQ-LTDAIEAKASRYLETGYQQELTYRRPDGSFSA---------FGTTDPNGSTWLTAFVAKSFRQAASHT  975 (1397)
Q Consensus       906 ~~l~yL~~~~~-l~~~~~~~a~~~l~~gy~~~l~y~~~dGsfs~---------~~~~~~~~s~wlTAyvl~~~~~a~~~i  975 (1397)
                      .++.-|...+. ...+..++|+++|.+       -|++||+|+.         +.+. ..+++..|||++.+|..++.. 
T Consensus       504 ~al~aL~~~g~~~~~~~i~rAv~wL~~-------~Q~~DGGWge~~~sy~~~~~~g~-g~s~~s~TA~AL~AL~~ag~~-  574 (635)
T TIGR01507       504 AVLSALKAVGIDTREPYIQKALAWLES-------HQNPDGGWGEDCRSYEDPAYAGK-GASTASQTAWALIALIAAGRA-  574 (635)
T ss_pred             HHHHHHHHcCCCcccHHHHHHHHHHHH-------hcCCCCCCCCCCcccccccccCC-CCCcHHHHHHHHHHHHHhCCC-
Confidence            44445554442 233455678888765       4889999752         1111 246788999999999988753 


Q ss_pred             ccCHHHHHHHHHHHHhcccCCCccccc
Q psy5522         976 TIDESVILEALAWLSSNQAVNGSFPEV 1002 (1397)
Q Consensus       976 ~vd~~~~~~~~~WL~~~q~~~G~F~st 1002 (1397)
                        +...+.++++||.++|+++|+|.+.
T Consensus       575 --~~~~I~rav~~L~~~Q~~dG~W~e~  599 (635)
T TIGR01507       575 --ESEAARRGVQYLVETQRPDGGWDEP  599 (635)
T ss_pred             --CcHHHHHHHHHHHHhcCCCCCCCCc
Confidence              5577999999999999999999765


No 39 
>COG1470 Predicted membrane protein [Function unknown]
Probab=95.59  E-value=1.1  Score=53.56  Aligned_cols=171  Identities=20%  Similarity=0.311  Sum_probs=105.4

Q ss_pred             eecCCeEEEEEEeecCC-ccEEEEEEEEeecC-CCCee----EEeEEEEECCCceeEEEEEc-C--CCCCceeEEEEEec
Q psy5522          10 LRPNGEYHVAVSTQAVS-EATQVSVEVGGKQD-SGGEF----RTKQLLSVDPFSTRIVKLDI-G--DLGPGQYNLTVKGS   80 (1397)
Q Consensus        10 lr~g~~~~v~v~l~~~~-~~v~v~v~L~~~~~-~~~~~----~~~~~~~~~~~~~~~~~f~v-P--~~~~~~~~l~v~g~   80 (1397)
                      ++++....+.|.+.+.. ++-++.+.+.+-++ .+..+    .....+.++++....+.++| |  +..+|.|.+.|...
T Consensus       280 i~~~~t~sf~V~IeN~g~~~d~y~Le~~g~pe~w~~~Fteg~~~vt~vkL~~gE~kdvtleV~ps~na~pG~Ynv~I~A~  359 (513)
T COG1470         280 ISPSTTASFTVSIENRGKQDDEYALELSGLPEGWTAEFTEGELRVTSVKLKPGEEKDVTLEVYPSLNATPGTYNVTITAS  359 (513)
T ss_pred             EccCCceEEEEEEccCCCCCceeEEEeccCCCCcceEEeeCceEEEEEEecCCCceEEEEEEecCCCCCCCceeEEEEEe
Confidence            55666656666665522 34455555553222 11111    12344567888888888877 3  33567776666554


Q ss_pred             CceeeEeeEEEEEEecc-cEEEEEcCCCcc----cCCceeEEEEEEEcCCCccCCCcceEEEEECCCCCeEEeee----e
Q psy5522          81 GSLNFYNSTGLVYVHKS-YSVFIQLDKAIY----KPGDLVRFRAIVLNSHLKPSVTGALEIFITDGKGNRIKQWT----R  151 (1397)
Q Consensus        81 ~~~~f~~~~~v~~~~~~-~~vfIqTDKpIY----kPGqtV~fRvv~ld~~l~P~~~~~~~v~I~DP~g~~I~qw~----~  151 (1397)
                      ++-....+..|.+.... .-..+-+|-..|    +||+..-+|+..-+..--|.  ..+.++|.+|+|=.++--.    .
T Consensus       360 s~s~v~~e~~lki~~~g~~~~~v~l~~g~~~lt~taGee~~i~i~I~NsGna~L--tdIkl~v~~PqgWei~Vd~~~I~s  437 (513)
T COG1470         360 SSSGVTRELPLKIKNTGSYNELVKLDNGPYRLTITAGEEKTIRISIENSGNAPL--TDIKLTVNGPQGWEIEVDESTIPS  437 (513)
T ss_pred             ccccceeeeeEEEEeccccceeEEccCCcEEEEecCCccceEEEEEEecCCCcc--ceeeEEecCCccceEEECcccccc
Confidence            32233444455555443 235555666555    59998777777766654453  6788999999995554111    1


Q ss_pred             -eccCCceeeeeeecCCCCCCeeEEEEEEEcC
Q psy5522         152 -ALTTRGVFSADLQLSKSPVLGDWNITINVLD  182 (1397)
Q Consensus       152 -~~~~~Gi~~~sf~L~~~~~lG~y~I~v~~~~  182 (1397)
                       ...+.+-+......|+++..|+|.|.++...
T Consensus       438 L~pge~~tV~ltI~vP~~a~aGdY~i~i~~ks  469 (513)
T COG1470         438 LEPGESKTVSLTITVPEDAGAGDYRITITAKS  469 (513)
T ss_pred             cCCCCcceEEEEEEcCCCCCCCcEEEEEEEee
Confidence             2335678999999999999999999998753


No 40 
>PLN02710 farnesyltranstransferase subunit beta
Probab=95.51  E-value=0.2  Score=60.82  Aligned_cols=73  Identities=15%  Similarity=0.221  Sum_probs=52.8

Q ss_pred             ccHHHHHHHHHHHHhccccceeeecCCCCccccCCCCCCCCcchhHHHHHHHHHccC---CCccCHHHHHHHHHHHHhcc
Q psy5522         917 LTDAIEAKASRYLETGYQQELTYRRPDGSFSAFGTTDPNGSTWLTAFVAKSFRQAAS---HTTIDESVILEALAWLSSNQ  993 (1397)
Q Consensus       917 l~~~~~~~a~~~l~~gy~~~l~y~~~dGsfs~~~~~~~~~s~wlTAyvl~~~~~a~~---~i~vd~~~~~~~~~WL~~~q  993 (1397)
                      ++++.+++.+++|.+       .|++||+|+-..+  ..++...|=.++.++...+.   +-.||.   .++++||.+.|
T Consensus        89 l~~~~~~~ii~~l~~-------cQ~~dGGFgg~pg--~~~hl~~TY~Av~~L~iLg~~~~l~~Idr---~~l~~fl~s~q  156 (439)
T PLN02710         89 LDDELENDTIDFLSR-------CQDPNGGYGGGPG--QLPHLATTYAAVNTLVTIGGERALSSINR---EKLYTFLLRMK  156 (439)
T ss_pred             ccHHHHHHHHHHHHH-------hcCCCcCCCCCCC--CCccHHHHHHHHHHHHHcCCchhhcccCH---HHHHHHHHHcC
Confidence            667778888888775       5778999976543  24677777777777766653   223443   55799999999


Q ss_pred             cCCCcccc
Q psy5522         994 AVNGSFPE 1001 (1397)
Q Consensus       994 ~~~G~F~s 1001 (1397)
                      +++|+|..
T Consensus       157 ~~dGgF~~  164 (439)
T PLN02710        157 DPSGGFRM  164 (439)
T ss_pred             CCCCCccc
Confidence            99999975


No 41 
>PF13243 Prenyltrans_1:  Prenyltransferase-like; PDB: 3SDR_A 3SAE_A 3SDV_A 3SDT_A 3SDQ_A 3SDU_A.
Probab=95.26  E-value=0.0063  Score=59.64  Aligned_cols=65  Identities=23%  Similarity=0.324  Sum_probs=16.5

Q ss_pred             ccccceeeecCCCCccccCCCCCCCCcchhHHHHHHHHHccCCCccCHHHHHHHHHHHHhcccCCCccccc
Q psy5522         932 GYQQELTYRRPDGSFSAFGTTDPNGSTWLTAFVAKSFRQAASHTTIDESVILEALAWLSSNQAVNGSFPEV 1002 (1397)
Q Consensus       932 gy~~~l~y~~~dGsfs~~~~~~~~~s~wlTAyvl~~~~~a~~~i~vd~~~~~~~~~WL~~~q~~~G~F~st 1002 (1397)
                      ...-++.-|++||||..-    ...+...|++++.++..+..-  .....+.++++||.++|+++|+|...
T Consensus         4 ~~~~l~~~Q~~dG~W~~~----~~~~~~~t~~~~~al~~~~~~--~~~~ai~ka~~~l~~~Q~~dG~w~~~   68 (109)
T PF13243_consen    4 AAEWLLSQQNPDGSWGYN----WGSDVFVTAALILALAAAGDA--AVDEAIKKAIDWLLSHQNPDGGWGYS   68 (109)
T ss_dssp             -----------------------------------------TS---SSBSSHHHHHHHHH---TTS--S-T
T ss_pred             cccccccccccccccccc----ccccccccccccccccccCCC--CcHHHHHHHHHHHHHhcCCCCCCCCc
Confidence            334556679999998532    124577788888888877643  33456799999999999999999754


No 42 
>PLN02993 lupeol synthase
Probab=95.15  E-value=0.037  Score=71.24  Aligned_cols=70  Identities=23%  Similarity=0.345  Sum_probs=53.5

Q ss_pred             HHHHHHHHHHhccccceeeecCCCCc-cccCCCCCCCCcchhHHHHHHHHHccCCCccCHHHHHHHHHHHHhcccCCCcc
Q psy5522         921 IEAKASRYLETGYQQELTYRRPDGSF-SAFGTTDPNGSTWLTAFVAKSFRQAASHTTIDESVILEALAWLSSNQAVNGSF  999 (1397)
Q Consensus       921 ~~~~a~~~l~~gy~~~l~y~~~dGsf-s~~~~~~~~~s~wlTAyvl~~~~~a~~~i~vd~~~~~~~~~WL~~~q~~~G~F  999 (1397)
                      ..++|++||.+       -|++|||| +.|+.    .-+.-|++||.+|..++.-.. +...+.++++||.+.|+.||||
T Consensus       591 ~i~rAv~yL~~-------~Q~~DGSW~G~Wgv----~y~YgT~~aL~aL~a~G~~~~-~~~~IrrAv~fLls~Q~~DGGW  658 (763)
T PLN02993        591 SIEKAVQFIES-------KQTPDGSWYGNWGI----CFIYATWFALGGLAAAGKTYN-DCLAMRKGVHFLLTIQRDDGGW  658 (763)
T ss_pred             HHHHHHHHHHH-------hcCCCCCccccccc----ccCcHHHHHHHHHHHcCCCCC-CcHHHHHHHHHHHHhcCCCCCc
Confidence            44567776654       59999998 55654    346789999999987765322 3467999999999999999999


Q ss_pred             ccc
Q psy5522        1000 PEV 1002 (1397)
Q Consensus      1000 ~st 1002 (1397)
                      +++
T Consensus       659 GEs  661 (763)
T PLN02993        659 GES  661 (763)
T ss_pred             CcC
Confidence            864


No 43 
>PLN03201 RAB geranylgeranyl transferase beta-subunit; Provisional
Probab=95.06  E-value=0.2  Score=59.11  Aligned_cols=84  Identities=15%  Similarity=0.203  Sum_probs=56.5

Q ss_pred             HHHHHHHHHHhccccHHHHHHHHHHHHhccccceeeecCCCCccccCCCCCCCCcchhHHHHHHHHHccCCCccCHHHHH
Q psy5522         904 NIVVLEYLKKTYQLTDAIEAKASRYLETGYQQELTYRRPDGSFSAFGTTDPNGSTWLTAFVAKSFRQAASHTTIDESVIL  983 (1397)
Q Consensus       904 ~i~~l~yL~~~~~l~~~~~~~a~~~l~~gy~~~l~y~~~dGsfs~~~~~~~~~s~wlTAyvl~~~~~a~~~i~vd~~~~~  983 (1397)
                      .+-.++.|..   +....+++++++|.+       .|++||+|+.+...  .++.-.|-.++.++....++-.||   .+
T Consensus        43 ~l~~L~lL~~---~~~~~~~~~i~~l~~-------cq~~~GGF~~~~~~--~~h~~~Ty~al~~L~ll~~~~~id---~~  107 (316)
T PLN03201         43 GLTALDLLGK---LDDVDRDEVVSWVMR-------CQHESGGFGGNTGH--DPHILYTLSAVQILALFDRLDLLD---AD  107 (316)
T ss_pred             HHHHHHHhCC---CccccHHHHHHHHHH-------hcCCCCCcCCCCCC--cccHHHHHHHHHHHHHhhhhhhhh---HH
Confidence            3344555542   334445677777665       47789999887543  456777777777776655442233   45


Q ss_pred             HHHHHHHhcccCCCccccc
Q psy5522         984 EALAWLSSNQAVNGSFPEV 1002 (1397)
Q Consensus       984 ~~~~WL~~~q~~~G~F~st 1002 (1397)
                      ++++||.+.|+++|||...
T Consensus       108 ~~~~~l~s~Q~~dGgF~~~  126 (316)
T PLN03201        108 KVASYVAGLQNEDGSFSGD  126 (316)
T ss_pred             HHHHHHHHhcCCCCCccCC
Confidence            6899999999999999853


No 44 
>TIGR01787 squalene_cyclas squalene/oxidosqualene cyclases. This family of enzymes catalyzes the cyclization of the triterpenes squalene or 2-3-oxidosqualene to a variety of products including hopene, lanosterol, cycloartenol, amyrin, lupeol and isomultiflorenol.
Probab=94.98  E-value=0.073  Score=68.36  Aligned_cols=71  Identities=28%  Similarity=0.441  Sum_probs=53.2

Q ss_pred             HHHHHHHHHHHhccccceeeecCCCCc-cccCCCCCCCCcchhHHHHHHHHHccCCCccCHHHHHHHHHHHHhcccCCCc
Q psy5522         920 AIEAKASRYLETGYQQELTYRRPDGSF-SAFGTTDPNGSTWLTAFVAKSFRQAASHTTIDESVILEALAWLSSNQAVNGS  998 (1397)
Q Consensus       920 ~~~~~a~~~l~~gy~~~l~y~~~dGsf-s~~~~~~~~~s~wlTAyvl~~~~~a~~~i~vd~~~~~~~~~WL~~~q~~~G~  998 (1397)
                      +..++|++||..       .|++||+| +.|+.    ..+.-|++|+.+|..++.... +...+.++++||.++|+++||
T Consensus       460 ~~i~rAl~~L~~-------~Q~~DGsw~g~wg~----~y~YgT~~al~aL~~~G~~~~-~~~~i~rA~~~L~~~Q~~DGG  527 (621)
T TIGR01787       460 NVLERALEYLRR-------EQRADGSWFGRWGV----NYTYGTGFVLSALAAAGRTYR-NCPEVQKACDWLLSRQMPDGG  527 (621)
T ss_pred             HHHHHHHHHHHH-------hcCCCCCCcccCCC----CCchhHHHHHHHHHHhCCccc-CCHHHHHHHHHHHhhcCCCCC
Confidence            344577777664       58899995 45543    235669999999998876432 237789999999999999999


Q ss_pred             cccc
Q psy5522         999 FPEV 1002 (1397)
Q Consensus       999 F~st 1002 (1397)
                      |.++
T Consensus       528 Wge~  531 (621)
T TIGR01787       528 WGED  531 (621)
T ss_pred             CCcC
Confidence            9754


No 45 
>TIGR03463 osq_cycl 2,3-oxidosqualene cyclase. This model identifies 2,3-oxidosqualene cyclases from Stigmatella aurantiaca which produces cycloartenol, and Gemmata obscuriglobus and Methylococcus capsulatus which each produce the closely related sterol, lanosterol.
Probab=94.84  E-value=0.082  Score=68.00  Aligned_cols=88  Identities=19%  Similarity=0.397  Sum_probs=62.4

Q ss_pred             HHHHHHHHHHhcc-ccHHHHHHHHHHHHhccccceeeecCCCCccc-cCC-------CCCCCCcchhHHHHHHHHHccCC
Q psy5522         904 NIVVLEYLKKTYQ-LTDAIEAKASRYLETGYQQELTYRRPDGSFSA-FGT-------TDPNGSTWLTAFVAKSFRQAASH  974 (1397)
Q Consensus       904 ~i~~l~yL~~~~~-l~~~~~~~a~~~l~~gy~~~l~y~~~dGsfs~-~~~-------~~~~~s~wlTAyvl~~~~~a~~~  974 (1397)
                      +.+++.-|...+. ......++|+++|.+       .|++||||+- |..       ....+++..||+++.+|..++..
T Consensus       508 T~~al~aL~~~G~~~~~~~i~rA~~~Ll~-------~Q~~DGgWg~~~~s~~~~~y~~~~~S~~~~TA~Al~aL~~~g~~  580 (634)
T TIGR03463       508 TFHGVMGLRAAGASPDDMALQRAAAWLRS-------YQRADGGWGEVYESCLQARYVEGKQSQAVMTSWALLALAEAGEG  580 (634)
T ss_pred             HHHHHHHHHHcCCCcCcHHHHHHHHHHHH-------ccCCCCCccCccCccccccccCCCCCcHHHHHHHHHHHHHcCCc
Confidence            4455555555453 233455677777764       5899999842 221       11246788999999999988742


Q ss_pred             CccCHHHHHHHHHHHHhcccCCCcccc
Q psy5522         975 TTIDESVILEALAWLSSNQAVNGSFPE 1001 (1397)
Q Consensus       975 i~vd~~~~~~~~~WL~~~q~~~G~F~s 1001 (1397)
                         +...+.++++||.++|+++|+|..
T Consensus       581 ---~~~~i~rgi~~L~~~Q~~dG~W~~  604 (634)
T TIGR03463       581 ---GHDAVQRGVAWLRSRQQEDGRWPR  604 (634)
T ss_pred             ---CCHHHHHHHHHHHHhCCCCCCCCC
Confidence               556789999999999999999975


No 46 
>PF10633 NPCBM_assoc:  NPCBM-associated, NEW3 domain of alpha-galactosidase;  InterPro: IPR018905 This domain has been named NEW3, but its function is not known. It is found on proteins which are bacterial galactosidases [].; PDB: 1EUT_A 2BZD_A 1WCQ_C 2BER_A 1W8O_A 1EUU_A 1W8N_A.
Probab=94.75  E-value=0.15  Score=46.74  Aligned_cols=72  Identities=21%  Similarity=0.205  Sum_probs=45.4

Q ss_pred             eeeCCEEEEEEEEEcCCCcc-EEEEEEEeecCCeeeeecCCCccCCCCCCcceeeeEEEEEeCCCeeEEEEEEEec---e
Q psy5522         725 VMRGEVVAIPVVVFNYLSQD-LVADVTLENVGQFDFADFSNEVDAAPQPKFEVFRRKKLTIKANSGSTTTFVITPK---E  800 (1397)
Q Consensus       725 v~rGE~~~l~~~VfNyl~~~-~~v~v~L~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~a~~~~tv~f~v~p~---~  800 (1397)
                      |.+||.+.+.++|.|..+.. ..+++.|...++.... .              .......|++|++.++.|.|+|-   .
T Consensus         1 v~~G~~~~~~~tv~N~g~~~~~~v~~~l~~P~GW~~~-~--------------~~~~~~~l~pG~s~~~~~~V~vp~~a~   65 (78)
T PF10633_consen    1 VTPGETVTVTLTVTNTGTAPLTNVSLSLSLPEGWTVS-A--------------SPASVPSLPPGESVTVTFTVTVPADAA   65 (78)
T ss_dssp             --TTEEEEEEEEEE--SSS-BSS-EEEEE--TTSE------------------EEEEE--B-TTSEEEEEEEEEE-TT--
T ss_pred             CCCCCEEEEEEEEEECCCCceeeEEEEEeCCCCcccc-C--------------CccccccCCCCCEEEEEEEEECCCCCC
Confidence            46899999999999998765 3578888877777521 0              11223389999999999999973   4


Q ss_pred             eeeEEEEEEEE
Q psy5522         801 LGYIGIKVTAT  811 (1397)
Q Consensus       801 lG~v~itV~A~  811 (1397)
                      .|.-.|++.|.
T Consensus        66 ~G~y~v~~~a~   76 (78)
T PF10633_consen   66 PGTYTVTVTAR   76 (78)
T ss_dssp             SEEEEEEEEEE
T ss_pred             CceEEEEEEEE
Confidence            69988888875


No 47 
>TIGR03503 conserved hypothetical protein TIGR03503. This set of conserved hypothetical protein has a phylogenetic range that closely matches that of TIGR03501, a putative C-terminal protein targeting signal.
Probab=93.76  E-value=19  Score=42.91  Aligned_cols=234  Identities=12%  Similarity=0.096  Sum_probs=119.2

Q ss_pred             EEEECCCCCeEEeee-eeccCCceeeeeeec--CCCCCCeeEEEEEEEcCeEEEEEEEEeeecCCcEEEEEeCCCceEee
Q psy5522         136 IFITDGKGNRIKQWT-RALTTRGVFSADLQL--SKSPVLGDWNITINVLDQKFTKRFTVAEYVLPKFEVNVNVPPHATFK  212 (1397)
Q Consensus       136 v~I~DP~g~~I~qw~-~~~~~~Gi~~~sf~L--~~~~~lG~y~I~v~~~~~~~~~~F~VeEYvLPkFeV~v~~p~~i~~~  212 (1397)
                      +.|..|+|.+...|. ...+-.-.-..++.|  -++|+.|.|.+.-.....+   ...    ++-..++.+++-+..++.
T Consensus        32 vvLV~PDGsk~ya~~~~p~~V~W~~~~~~DlItI~~PmpGpWq~~G~v~p~s---ri~----viS~L~L~v~plP~~l~~  104 (374)
T TIGR03503        32 VILVRPDGSKYYAWRVHPEDVKWYDESTMDIISIKNPMPGPWQAIGKITPGN---RVK----VISNLRLEVEPLPSPLFQ  104 (374)
T ss_pred             eEEECCCCcEEeccCCCCCCceEEecCCceEEEeCCCCCCCcEEeeeeCCCC---eEE----EEeccEEEEecCCccccC
Confidence            567789999988887 433333333344444  7899999999988875411   111    244566667665555567


Q ss_pred             CCeEEEEEEEEEec-Cccccc-----eEEEEEEeccccCccccccc------ccee--E----EeecCceEEEEEechhh
Q psy5522         213 DSKVVISVNAKYTY-GKPVKG-----EATITAYPTIFSGVIQPLFQ------TPVR--K----VVPIDGKTVIEFDVVKE  274 (1397)
Q Consensus       213 ~~~~~v~v~AkYty-GkPV~G-----~v~v~v~~~~~~g~~~~~~~------~~~~--~----~~d~~G~~~~~~~~~~~  274 (1397)
                      ++.+  +++|+-+. |+|+..     .+++++.........++++.      .++.  +    ....||.++..+++.. 
T Consensus       105 gE~l--k~ta~L~~d~~~i~~~~~l~~v~~~v~f~~~~~~~~e~~~~~~~vvg~f~DdG~g~DE~p~DGvFT~~l~l~~-  181 (374)
T TIGR03503       105 GETL--KVTAKLLNDGEPLKLPDFRDVVKLTVRFVSTNNAEDENFGADPIVVGEFEDDGEGLDERPGDGIFTGEFNLDV-  181 (374)
T ss_pred             CCeE--EEEEEEecCCEEeechhhhheEEEEEEEEecCCccccccccCCEEEEeeccCCccCCCCCCCceEEEEeeccC-
Confidence            7755  45555444 788874     23344322111110011110      0110  0    1235788888877633 


Q ss_pred             hccccccceEEEEEEEEEECCcceEEee-EEEEEEEEeeeEEEEEecCCcccCCCeEEEEEEEEccCCCccCCCCceEEE
Q psy5522         275 LQLTDEYERNIHFDVAVEEALTGRRQNN-TGSVVFHKHKYKMDLIKSSEYFKPGLKYTAYMKLTHHDGTPVTDNNNMVQV  353 (1397)
Q Consensus       275 ~~~~~~~~~~l~V~a~VtE~gTG~~~~~-~~~i~i~~~~~~i~f~~~~~~~kpG~p~~~~v~v~~~dG~P~~~~~~~V~v  353 (1397)
                        .    +..+.+.+.+   +++.-... +..+.+++.|..++|.....   ++.+..+.|.+-+..=.|-     .+.+
T Consensus       182 --~----~G~Y~~~v~~---~n~vF~R~~~q~v~V~p~Pi~~~~~~~~~---~~~~h~l~v~~d~~~i~~~-----s~~~  244 (374)
T TIGR03503       182 --A----PGEYRPTYQS---RNPVFLREVEQPVLVYPLPVSYTVIQSED---ESGAHQLMVDADAGHIDPG-----SLVI  244 (374)
T ss_pred             --C----CceEEEEEEE---cCceEEEEEEEeEEEECCCeeEEEEccCC---CCCceEEEEEcccceeccc-----cEEE
Confidence              1    1233443333   22433333 45788899999988876543   4666666555322221221     2322


Q ss_pred             EEeeeccccccceeeEeeCCCeeEEEEEeCCCCCceeEEEEEEE
Q psy5522         354 RHGFSYDESKYEANQYKLDRNGMIKLVYYPPANENVTTLGIEAE  397 (1397)
Q Consensus       354 ~~~~~~~~~~~~~~~~ttd~~G~~~f~i~~~~~~~~~~l~l~a~  397 (1397)
                      ........ +.........+.+...+.+......|.|++...+.
T Consensus       245 ~~~~~~P~-g~~~~~~~~~~~~~~~~~l~~~~~~G~Y~i~~~~~  287 (374)
T TIGR03503       245 HGELVFPN-GQIQQFSIELEEPETRVDLPANYEFGKYRVKGTVF  287 (374)
T ss_pred             EEEEECCC-CceEEecccCccCceEEeccCcCCCeEEEEEEEEE
Confidence            22111111 11111112223444556665555677877766653


No 48 
>PLN03012 Camelliol C synthase
Probab=93.66  E-value=0.22  Score=64.26  Aligned_cols=88  Identities=17%  Similarity=0.198  Sum_probs=60.7

Q ss_pred             HHHHHHHHhccc--cHHHHHHHHHHHHhccccceeeecCCCCccc-cCCC---------CCCCCcchhHHHHHHHHHccC
Q psy5522         906 VVLEYLKKTYQL--TDAIEAKASRYLETGYQQELTYRRPDGSFSA-FGTT---------DPNGSTWLTAFVAKSFRQAAS  973 (1397)
Q Consensus       906 ~~l~yL~~~~~l--~~~~~~~a~~~l~~gy~~~l~y~~~dGsfs~-~~~~---------~~~~s~wlTAyvl~~~~~a~~  973 (1397)
                      .++.-|...+..  +.+..+||.++|.+       -|++||||+- |..-         ...+.+.-||||+++|+.|+.
T Consensus       623 ~aL~aL~a~g~~~~~~~~Irrav~fLls-------~Q~~DGGWGEs~~Sc~~~~y~~~~~~~S~~~qTaWAl~aLi~ag~  695 (759)
T PLN03012        623 FALAGLAAAGKTFNDCEAIRKGVHFLLA-------AQKDNGGWGESYLSCPKKIYIAQEGEISNLVQTAWALMGLIHAGQ  695 (759)
T ss_pred             HHHHHHHHhCccCCCcHHHHHHHHHHHH-------hcCCCCCcCCCCCCCCCccccCCCCCCCcHHHHHHHHHHHHHcCC
Confidence            344444444431  22445678888775       4889999752 2210         113567889999999998864


Q ss_pred             CCccCHHHHHHHHHHHHhcccCCCcccc
Q psy5522         974 HTTIDESVILEALAWLSSNQAVNGSFPE 1001 (1397)
Q Consensus       974 ~i~vd~~~~~~~~~WL~~~q~~~G~F~s 1001 (1397)
                       ..-|+..+.++++||.++|.++|+|..
T Consensus       696 -~~~~~~~i~Rg~~~Ll~~Q~~dG~W~q  722 (759)
T PLN03012        696 -AERDPIPLHRAAKLIINSQLENGDFPQ  722 (759)
T ss_pred             -CCCCcHHHHHHHHHHHHcccCCCCCCC
Confidence             234566899999999999999999964


No 49 
>PF10670 DUF4198:  Domain of unknown function (DUF4198)
Probab=93.08  E-value=9.3  Score=42.11  Aligned_cols=174  Identities=13%  Similarity=0.128  Sum_probs=91.2

Q ss_pred             cEEEEEcCCCcccCCceeEEEEEEEcCCCccC--CCcceEEEEECCCCCeEEeeeeecc----------CCceeeeeeec
Q psy5522          98 YSVFIQLDKAIYKPGDLVRFRAIVLNSHLKPS--VTGALEIFITDGKGNRIKQWTRALT----------TRGVFSADLQL  165 (1397)
Q Consensus        98 ~~vfIqTDKpIYkPGqtV~fRvv~ld~~l~P~--~~~~~~v~I~DP~g~~I~qw~~~~~----------~~Gi~~~sf~L  165 (1397)
                      ..++|.+++...++|+.+.+++..=. .+...  .... .+.+.+|+|.++........          ..|.+...+.+
T Consensus         4 H~~Wl~~~~~~~~~g~~~~v~~~~Gh-~~~~~~~~~~~-~l~~~~~dg~~~~l~~~~~~~~~~~~~~~~~~g~~~~~~~~   81 (215)
T PF10670_consen    4 HELWLEPDAFQVSPGEEVTVELYFGH-PFPGDEFPPDK-SLQVISPDGKRIDLPPTVEDGKYALVFDVEQPGTYKLTVAL   81 (215)
T ss_pred             ccEEEECCccccCCCCEEEEEEEECC-CCCCccccccC-ceEEECCCCeeecccccccccccceeEeecCCCcEEEEEEc
Confidence            46799999999999999999887532 22110  0111 78889999984443221111          23333333322


Q ss_pred             CCCCCCeeEEEEEEEcC----eE------EEEEEEE-eeec----------CC---cEEEEEeCCCceEeeCCeEEEEEE
Q psy5522         166 SKSPVLGDWNITINVLD----QK------FTKRFTV-AEYV----------LP---KFEVNVNVPPHATFKDSKVVISVN  221 (1397)
Q Consensus       166 ~~~~~lG~y~I~v~~~~----~~------~~~~F~V-eEYv----------LP---kFeV~v~~p~~i~~~~~~~~v~v~  221 (1397)
                      ..   .|.|+.......    ..      ..+.++. +-|+          .|   .||+.-.....-+..++.+.+.|-
T Consensus        82 ~~---~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~aK~~v~~~~~~~~~~~~~g~~lEIvPl~~P~~l~~g~~~~~~vl  158 (215)
T PF10670_consen   82 RD---PGYYSFTPEGWKRYSKREGPDAESSERYQKYAKTLVNVGGPSEDWSKPVGLPLEIVPLTNPYKLKAGDPLPFQVL  158 (215)
T ss_pred             cC---CEEEEEeccCcccccccccCccceEEEEEEEEEEEEEccCCccccccccCCcEEEEECcCcccccCCCEEEEEEE
Confidence            21   355555444321    00      1111211 1122          12   234433222233346777777775


Q ss_pred             EEEecCccccceEEEEEEeccccCccccccccceeEEeecCceEEEEEechhhhccccccceEEEEEEEEEE
Q psy5522         222 AKYTYGKPVKGEATITAYPTIFSGVIQPLFQTPVRKVVPIDGKTVIEFDVVKELQLTDEYERNIHFDVAVEE  293 (1397)
Q Consensus       222 AkYtyGkPV~G~v~v~v~~~~~~g~~~~~~~~~~~~~~d~~G~~~~~~~~~~~~~~~~~~~~~l~V~a~VtE  293 (1397)
                         +-|+|++| +.|.+.........   .......++|.+|.++|.++-+.          ...|.|...+
T Consensus       159 ---~~GkPl~~-a~V~~~~~~~~~~~---~~~~~~~~TD~~G~~~~~~~~~G----------~wli~a~~~~  213 (215)
T PF10670_consen  159 ---FDGKPLAG-AEVEAFSPGGWYDV---EHEAKTLKTDANGRATFTLPRPG----------LWLIRASHKD  213 (215)
T ss_pred             ---ECCeEccc-EEEEEEECCCcccc---ccceEEEEECCCCEEEEecCCCE----------EEEEEEEEec
Confidence               77999999 44444332111000   00144678999999998876533          4566666544


No 50 
>cd02890 PTase Protein prenyltransferase (PTase) domain, beta subunit (alpha 6 - alpha 6 barrel fold). The protein prenyltransferase family of lipid-modifying enzymes includes protein farnesyltransferase (FTase) and geranylgeranyltransferase types I and II (GGTase-I and GGTase-II). They catalyze the carboxyl-terminal lipidation of Ras, Rab, and several other cellular signal transduction proteins, facilitating membrane associations and specific protein-protein interactions. Prenyltransferases employ a Zn2+ ion to alkylate a thiol group catalyzing the formation of thioether linkages between the C1 atom of farnesyl (15-carbon by FTase) or geranylgeranyl (20-carbon by GGTase-I, II) isoprenoid lipids and cysteine residues at or near the C-terminus of protein acceptors. FTase and GGTase-I prenylate the cysteine in the terminal sequence, "CAAX"; and GGTase-II prenylates both cysteines in the "CC" (or "CXC") terminal sequence. These enzymes are heterodimeric with both alpha and beta subunits re
Probab=92.97  E-value=0.3  Score=56.81  Aligned_cols=81  Identities=23%  Similarity=0.232  Sum_probs=57.2

Q ss_pred             HHHHHHhccccHHHHHHHHHHHHhccccceeeecCCCCccccCCCCCCCCcchhHHHHHHHHHccCCCccCHHHHHHHHH
Q psy5522         908 LEYLKKTYQLTDAIEAKASRYLETGYQQELTYRRPDGSFSAFGTTDPNGSTWLTAFVAKSFRQAASHTTIDESVILEALA  987 (1397)
Q Consensus       908 l~yL~~~~~l~~~~~~~a~~~l~~gy~~~l~y~~~dGsfs~~~~~~~~~s~wlTAyvl~~~~~a~~~i~vd~~~~~~~~~  987 (1397)
                      +.-|...+......+++++++|.+       .|.+||+|+.-++  .++....|++++.++..++..   +...+.++++
T Consensus       134 l~~l~ll~~~~~~~~~~~~~~l~~-------~Q~~dGGf~~~~~--~es~~~~t~~av~sL~~l~~~---~~~~~~~~~~  201 (286)
T cd02890         134 LSILSLLNILTDIDKEKLIDYILS-------CQNYDGGFGGVPG--AESHGGYTFCAVASLALLGRL---DLIDKERLLR  201 (286)
T ss_pred             HHHHHHhCCchhhhHHHHHHHHHH-------hCCCCCCcCCCCC--CCCCccHhHHHHHHHHHcCCC---cccCHHHHHH
Confidence            333333344334456677777764       4788999976543  356788999999999888764   3445788899


Q ss_pred             HHHhcccCCC-ccc
Q psy5522         988 WLSSNQAVNG-SFP 1000 (1397)
Q Consensus       988 WL~~~q~~~G-~F~ 1000 (1397)
                      ||.++|+.+| ||.
T Consensus       202 ~L~~~q~~~ggGf~  215 (286)
T cd02890         202 WLVERQLASGGGFN  215 (286)
T ss_pred             HHHHhCCCCCCCcC
Confidence            9999998777 785


No 51 
>cd02894 GGTase-II Geranylgeranyltransferase type II (GGTase-II)_like proteins containing the protein prenyltransferase (PTase) domain, beta subunit (alpha 6 - alpha 6 barrel fold). GGTase-IIs are a subgroup of the protein prenyltransferase family of lipid-modifying enzymes. PTases catalyze the carboxyl-terminal lipidation of Ras, Rab, and several other cellular signal transduction proteins, facilitating membrane associations and specific protein-protein interactions. Prenyltransferases employ a Zn2+ ion to alkylate a thiol group catalyzing the formation of thioether linkages between cysteine residues at or near the C-terminus of protein acceptors and the C1 atom of isoprenoid lipids (geranylgeranyl (20-carbon) in the case of GGTase-II ). GGTase-II catalyzes alkylation of both cysteine residues in Rab proteins containing carboxy-terminal "CC", "CXCX" or "CXC" motifs. PTases are heterodimeric with both alpha and beta subunits required for catalytic activity. In contrast to other prenyltr
Probab=92.38  E-value=0.5  Score=55.03  Aligned_cols=74  Identities=19%  Similarity=0.081  Sum_probs=54.9

Q ss_pred             ccccHHHHHHHHHHHHhccccceeeecCCCCccccCCCCCCCCcchhHHHHHHHHHccCCCccCHHHHHHHHHHHHhccc
Q psy5522         915 YQLTDAIEAKASRYLETGYQQELTYRRPDGSFSAFGTTDPNGSTWLTAFVAKSFRQAASHTTIDESVILEALAWLSSNQA  994 (1397)
Q Consensus       915 ~~l~~~~~~~a~~~l~~gy~~~l~y~~~dGsfs~~~~~~~~~s~wlTAyvl~~~~~a~~~i~vd~~~~~~~~~WL~~~q~  994 (1397)
                      +.+...-.++++++|.+       .|++||+|+.-++  .+.++..|.+++-++..++..-.+   .++++++||.++|.
T Consensus       143 ~~~~~i~~~~~~~~l~~-------~q~~dGGF~~~~~--~es~~~~t~cavasL~llg~~~~~---~~~~~~~~L~~~q~  210 (287)
T cd02894         143 GKLDLIDVDKAVDYLLS-------CYNFDGGFGCRPG--AESHAGQIFCCVGALAILGSLDLI---DRDRLGWWLCERQL  210 (287)
T ss_pred             CCcchhhHHHHHHHHHH-------cCCCCCCcCCCCC--CCCchhHHHHHHHHHHHcCccccc---CHHHHHHHHHHhCC
Confidence            33433345788888775       3678999976433  356788899999999888765233   37778999999999


Q ss_pred             CCCccc
Q psy5522         995 VNGSFP 1000 (1397)
Q Consensus       995 ~~G~F~ 1000 (1397)
                      .+|||.
T Consensus       211 ~~GGf~  216 (287)
T cd02894         211 PSGGLN  216 (287)
T ss_pred             CCCCcC
Confidence            999996


No 52 
>cd02891 A2M_like Proteins similar to alpha2-macroglobulin (alpha (2)-M).  Alpha (2)-M is a major carrier protein in serum. It is a broadly specific proteinase inhibitor.  The structural thioester of alpha (2)-M, is involved in the immobilization and entrapment of proteases. This group contains another broadly specific proteinase inhibitor:  pregnancy zone protein (PZP).  PZP is a trace protein in the plasma of non-pregnant females and males which is elevated in pregnancy. Alpha (2)-M and PZ bind to placental protein-14 and may modulate its activity in T-cell growth and cytokine production thereby protecting the allogeneic fetus from attack by the maternal immune system. This group also contains C3, C4 and C5 of vertebrate complement.  The vertebrate complement is an effector of both the acquired and innate immune systems The point of convergence of the classical, alternative and lectin pathways of the complement system is the proteolytic activation of C3. C4 plays a key role in propaga
Probab=92.14  E-value=0.49  Score=54.73  Aligned_cols=52  Identities=15%  Similarity=0.132  Sum_probs=42.8

Q ss_pred             CCCcchhHHHHHHHHHccCCCccCHHHHHHHHHHHHhcccCCCccccchHh---HHHhhc
Q psy5522         955 NGSTWLTAFVAKSFRQAASHTTIDESVILEALAWLSSNQAVNGSFPEVGKV---SHADMQ 1011 (1397)
Q Consensus       955 ~~s~wlTAyvl~~~~~a~~~i~vd~~~~~~~~~WL~~~q~~~G~F~stqdv---l~al~~ 1011 (1397)
                      ...++.|||++++.....     +.....++++||.++++.+|+|.|||+.   ++||.+
T Consensus       227 ~~~~~a~a~all~~~~~~-----~~~~~~~~~~~L~~~~~~~~~~~sTq~t~~al~AL~~  281 (282)
T cd02891         227 SLRVEATAYALLALLKLG-----DLEEAGPIAKWLAQQRNSGGGFLSTQDTVVALQALAA  281 (282)
T ss_pred             hhhHHHHHHHHHHHHhcC-----ChhhHHHHHHHHHHcCCCCCCcccHHHHHHHHHHHHh
Confidence            457899999999998654     4567788999999999999999999983   777753


No 53 
>PRK13211 N-acetylglucosamine-binding protein A; Reviewed
Probab=92.11  E-value=34  Score=42.51  Aligned_cols=222  Identities=18%  Similarity=0.236  Sum_probs=112.3

Q ss_pred             EEEEEcCCCCCcee-EEEEE--ecCceeeEeeEEEEEEecccE------EEEEcCCCcccCCceeEEEEEEEcCCCccCC
Q psy5522          60 IVKLDIGDLGPGQY-NLTVK--GSGSLNFYNSTGLVYVHKSYS------VFIQLDKAIYKPGDLVRFRAIVLNSHLKPSV  130 (1397)
Q Consensus        60 ~~~f~vP~~~~~~~-~l~v~--g~~~~~f~~~~~v~~~~~~~~------vfIqTDKpIYkPGqtV~fRvv~ld~~l~P~~  130 (1397)
                      ..+..+|. .+|.. -+.+.  +...-.|.+-..|.|......      +--+....--+.||+|+||++--.... +  
T Consensus       158 ~~~~~lP~-RsG~hVI~~vWq~~Dt~~aFY~c~DV~f~~~~~~~p~w~~~g~i~~~~~l~~Gd~V~~RvFd~~Ge~-~--  233 (478)
T PRK13211        158 SHECNVPE-RTGYQVILAVWEVGDTANSFYNVIDVNFDGGGGVVPEWSKVGQINPSMDLKAGDKVMTRVFDANGEN-P--  233 (478)
T ss_pred             EEEEECCC-CCccEEEEEEEEecCCcCCceEEEEEEEcCCCCCCCccccCccCccCCCCCCCCEEEEEEecCCCcc-c--
Confidence            33556775 55542 22222  212236888888888753211      001122225678999999999665321 2  


Q ss_pred             CcceEEEEECCCCCeEEeeeeec----------cCCceeee--eeecCCCCCCeeEEEEEEEcC--eEEEEEEEEeeecC
Q psy5522         131 TGALEIFITDGKGNRIKQWTRAL----------TTRGVFSA--DLQLSKSPVLGDWNITINVLD--QKFTKRFTVAEYVL  196 (1397)
Q Consensus       131 ~~~~~v~I~DP~g~~I~qw~~~~----------~~~Gi~~~--sf~L~~~~~lG~y~I~v~~~~--~~~~~~F~VeEYvL  196 (1397)
                      .....+.|.+.....-.+|....          ..-|....  .+.    |..|.-.|-++.++  ...+..|+.+. .-
T Consensus       234 ~~~~~l~I~~~~~~~~~~W~~~LA~~iN~~~~~~~~G~~~~~g~i~----p~~g~N~iy~~~~s~~~~vei~~~~~~-p~  308 (478)
T PRK13211        234 ALQTELTIDSATQGKKNNWSHALASKINQEQQQLRAGQLNADGQIE----PVYGKNPIYLKAGSGLERVEIGYDIEA-PA  308 (478)
T ss_pred             cceeEEEEeccccCChhhhHHHHHHHHhhcchhhheeeecCCCCEe----eccCccceEEeCCCCceEEEEEEEecC-CC
Confidence            33456777764323334565211          11232222  222    67888888888776  44455555444 46


Q ss_pred             CcEEEEEeC-CCceEeeCCeEEEEEEEEEecCccccceEEEEEEeccccCccccccccceeEEeecCceEEEEEechhhh
Q psy5522         197 PKFEVNVNV-PPHATFKDSKVVISVNAKYTYGKPVKGEATITAYPTIFSGVIQPLFQTPVRKVVPIDGKTVIEFDVVKEL  275 (1397)
Q Consensus       197 PkFeV~v~~-p~~i~~~~~~~~v~v~AkYtyGkPV~G~v~v~v~~~~~~g~~~~~~~~~~~~~~d~~G~~~~~~~~~~~~  275 (1397)
                      |.+++.|.- .....+.+...++++...      ..|.+++++......|..     ......+-.++...+++.+.+ +
T Consensus       309 p~~~lsVsGL~~eY~I~dG~~~i~ftv~------a~g~~~vta~V~d~~g~~-----~~~~~~~v~d~s~~vtL~Ls~-~  376 (478)
T PRK13211        309 PDYELEVSGLAKEYKIGDGAATLDFTVT------ATGDMNVEATVYNHDGEA-----LGSKSQTVNDGSQSVSLDLSK-L  376 (478)
T ss_pred             CCCceEEecCcceeEEcCCcEEEEEEEE------eccceEEEEEEEcCCCCe-----eeeeeEEecCCceeEEEeccc-C
Confidence            778888774 233344566676666555      345555554432211111     011223333455777777765 3


Q ss_pred             ccccccceEEEEEEEEEECCcceEE-eeEEEEEE
Q psy5522         276 QLTDEYERNIHFDVAVEEALTGRRQ-NNTGSVVF  308 (1397)
Q Consensus       276 ~~~~~~~~~l~V~a~VtE~gTG~~~-~~~~~i~i  308 (1397)
                      .. +  .+.|.|.|++  +. |... ..+..+.+
T Consensus       377 ~A-G--~y~Lvv~~t~--~d-G~~~~q~~~~~~v  404 (478)
T PRK13211        377 KA-G--HHMLVVKAKP--KD-GELIKQQTLDFML  404 (478)
T ss_pred             CC-c--eEEEEEEEEe--CC-CceeeeeeEEEEE
Confidence            31 1  2345555554  33 4443 34444443


No 54 
>PF01835 A2M_N:  MG2 domain;  InterPro: IPR002890 The proteinase-binding alpha-macroglobulins (A2M) [] are large glycoproteins found in the plasma of vertebrates, in the hemolymph of some invertebrates and in reptilian and avian egg white. A2M-like proteins are able to inhibit all four classes of proteinases by a 'trapping' mechanism. They have a peptide stretch, called the 'bait region', which contains specific cleavage sites for different proteinases. When a proteinase cleaves the bait region, a conformational change is induced in the protein, thus trapping the proteinase. The entrapped enzyme remains active against low molecular weight substrates, whilst its activity toward larger substrates is greatly reduced, due to steric hindrance. Following cleavage in the bait region, a thiol ester bond, formed between the side chains of a cysteine and a glutamine, is cleaved and mediates the covalent binding of the A2M-like protein to the proteinase. This family includes the N-terminal region of the alpha-2-macroglobulin family. The inhibitor domains belong to MEROPS inhibitor family I39.; GO: 0004866 endopeptidase inhibitor activity; PDB: 2B39_B 3KLS_B 3PRX_C 3KM9_B 3PVM_C 3CU7_A 4E0S_A 4A5W_A 4ACQ_C 2P9R_B ....
Probab=92.06  E-value=2.9  Score=40.05  Aligned_cols=78  Identities=14%  Similarity=0.152  Sum_probs=47.5

Q ss_pred             ecCCcccCCCeEEEEEEEEccCC--CccCCCCceEEEEEeeeccccccceeeE-eeCCCeeEEEEEeCCCCCceeEEEEE
Q psy5522         319 KSSEYFKPGLKYTAYMKLTHHDG--TPVTDNNNMVQVRHGFSYDESKYEANQY-KLDRNGMIKLVYYPPANENVTTLGIE  395 (1397)
Q Consensus       319 ~~~~~~kpG~p~~~~v~v~~~dG--~P~~~~~~~V~v~~~~~~~~~~~~~~~~-ttd~~G~~~f~i~~~~~~~~~~l~l~  395 (1397)
                      .....|+||....+++.+.+.++  .+..+  ..+.+.+... +......... +++++|.+.+++.+|++...-.++|+
T Consensus         6 TDr~iYrPGetV~~~~~~~~~~~~~~~~~~--~~~~v~i~dp-~g~~v~~~~~~~~~~~G~~~~~~~lp~~~~~G~y~i~   82 (99)
T PF01835_consen    6 TDRPIYRPGETVHFRAIVRDLDNDFKPPAN--SPVTVTIKDP-SGNEVFRWSVNTTNENGIFSGSFQLPDDAPLGTYTIR   82 (99)
T ss_dssp             ESSSEE-TTSEEEEEEEEEEECTTCSCESS--EEEEEEEEET-TSEEEEEEEEEETTCTTEEEEEEE--SS---EEEEEE
T ss_pred             CCccCcCCCCEEEEEEEEeccccccccccC--CceEEEEECC-CCCEEEEEEeeeeCCCCEEEEEEECCCCCCCEeEEEE
Confidence            35578999999999999999984  44443  3555554321 1111233445 67899999999998877633446666


Q ss_pred             EEec
Q psy5522         396 AEYL  399 (1397)
Q Consensus       396 a~~~  399 (1397)
                      +...
T Consensus        83 ~~~~   86 (99)
T PF01835_consen   83 VKTD   86 (99)
T ss_dssp             EEET
T ss_pred             EEEc
Confidence            6664


No 55 
>PF07919 Gryzun:  Gryzun, putative trafficking through Golgi;  InterPro: IPR012880 The proteins featured in this family are all hypothetical eukaryotic proteins of unknown function. The region in question is approximately 150 residues long. 
Probab=91.86  E-value=0.48  Score=60.63  Aligned_cols=79  Identities=16%  Similarity=0.226  Sum_probs=70.0

Q ss_pred             eeeEEEEecCCceeeeCCEEEEEEEEEcCCCccEEEEEEEeecCCeeeeecCCCccCCCCCCcceeeeEEEEEeCCCeeE
Q psy5522         712 FRPFFISLDLPYSVMRGEVVAIPVVVFNYLSQDLVADVTLENVGQFDFADFSNEVDAAPQPKFEVFRRKKLTIKANSGST  791 (1397)
Q Consensus       712 fkpffv~l~LPysv~rGE~~~l~~~VfNyl~~~~~v~v~L~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~a~~~~t  791 (1397)
                      --|+-|.+.+|.+...|+.|.+..+|.|....-++..++|+.+++|-|.               +.++.++.|-+.++.+
T Consensus       468 ~~~~~v~~~~p~~~~~~~~~~l~~~I~N~T~~~~~~~~~me~s~~F~fs---------------G~k~~~~~llP~s~~~  532 (554)
T PF07919_consen  468 SSPLRVLASVPPSAIVGEPFTLSYTIENPTNHFQTFELSMEPSDDFMFS---------------GPKQTTFSLLPFSRHT  532 (554)
T ss_pred             CCCcEEEEecCCccccCcEEEEEEEEECCCCccEEEEEEEccCCCEEEE---------------CCCcCceEECCCCcEE
Confidence            3467788889999999999999999999999999999999999998773               2335688999999999


Q ss_pred             EEEEEEeceeeeEE
Q psy5522         792 TTFVITPKELGYIG  805 (1397)
Q Consensus       792 v~f~v~p~~lG~v~  805 (1397)
                      +.|.+.|...|...
T Consensus       533 ~~y~l~pl~~G~~~  546 (554)
T PF07919_consen  533 VRYNLLPLVAGWWI  546 (554)
T ss_pred             EEEEEEEccCCcEE
Confidence            99999999999764


No 56 
>PF07705 CARDB:  CARDB;  InterPro: IPR011635 The APHP (acidic peptide-dependent hydrolases/peptidase) domain is found in a variety of different proteins.; PDB: 2KUT_A 2L0D_A 3IDU_A 2KL6_A.
Probab=91.58  E-value=2.5  Score=40.21  Aligned_cols=76  Identities=28%  Similarity=0.257  Sum_probs=54.6

Q ss_pred             EecCCceeeeCCEEEEEEEEEcCCCcc-EEEEEEEeecCCeeeeecCCCccCCCCCCcceeeeEEE-EEeCCCeeEEEEE
Q psy5522         718 SLDLPYSVMRGEVVAIPVVVFNYLSQD-LVADVTLENVGQFDFADFSNEVDAAPQPKFEVFRRKKL-TIKANSGSTTTFV  795 (1397)
Q Consensus       718 ~l~LPysv~rGE~~~l~~~VfNyl~~~-~~v~v~L~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~v~a~~~~tv~f~  795 (1397)
                      ....|..+..|+.+.+.++|.|..... -.+.|.+...+...                   ....+ .|.+|+..++.|.
T Consensus         8 ~~~~~~~~~~g~~~~i~~~V~N~G~~~~~~~~v~~~~~~~~~-------------------~~~~i~~L~~g~~~~v~~~   68 (101)
T PF07705_consen    8 ITVSPSNVVPGEPVTITVTVKNNGTADAENVTVRLYLDGNSV-------------------STVTIPSLAPGESETVTFT   68 (101)
T ss_dssp             EEEC-SEEETTSEEEEEEEEEE-SSS-BEEEEEEEEETTEEE-------------------EEEEESEB-TTEEEEEEEE
T ss_pred             EeeCCCcccCCCEEEEEEEEEECCCCCCCCEEEEEEECCcee-------------------ccEEECCcCCCcEEEEEEE
Confidence            566799999999999999999987654 55677766554331                   13445 7899999999999


Q ss_pred             EEeceeeeEEEEEEEEe
Q psy5522         796 ITPKELGYIGIKVTATS  812 (1397)
Q Consensus       796 v~p~~lG~v~itV~A~~  812 (1397)
                      +.+...|...|++.+..
T Consensus        69 ~~~~~~G~~~i~~~iD~   85 (101)
T PF07705_consen   69 WTPPSPGSYTIRVVIDP   85 (101)
T ss_dssp             EE-SS-CEEEEEEEEST
T ss_pred             EEeCCCCeEEEEEEEee
Confidence            99999999988887743


No 57 
>PF05753 TRAP_beta:  Translocon-associated protein beta (TRAPB);  InterPro: IPR008856 This family consists of several eukaryotic translocon-associated protein beta (TRAPB) or signal sequence receptor beta subunit (SSR-beta) proteins. The normal translocation of nascent polypeptides into the lumen of the endoplasmic reticulum (ER) is thought to be aided in part by a translocon-associated protein (TRAP) complex consisting of 4 protein subunits. The association of mature proteins with the ER and Golgi, or other intracellular locales, such as lysosomes, depends on the initial targeting of the nascent polypeptide to the ER membrane. A similar scenario must also exist for proteins destined for secretion [].; GO: 0005783 endoplasmic reticulum, 0016021 integral to membrane
Probab=91.10  E-value=1.9  Score=46.32  Aligned_cols=85  Identities=24%  Similarity=0.375  Sum_probs=62.7

Q ss_pred             CCeeEEEeeeEEEEecCCceeeeCCEEEEEEEEEcCCCccEEEEEEEee----cCCeeeeecCCCccCCCCCCcceeeeE
Q psy5522         705 MPKKLRVFRPFFISLDLPYSVMRGEVVAIPVVVFNYLSQDLVADVTLEN----VGQFDFADFSNEVDAAPQPKFEVFRRK  780 (1397)
Q Consensus       705 ~~~~l~vfkpffv~l~LPysv~rGE~~~l~~~VfNyl~~~~~v~v~L~~----s~~~~~~~~~~~~~~~~~~~~~~~~~~  780 (1397)
                      .+..+.+.|-+     ++..+..|+.+.+..+|+|..+... ..|+|..    +++|+++...             ....
T Consensus        19 ~~a~llv~K~i-----l~~~~v~g~~v~V~~~iyN~G~~~A-~dV~l~D~~fp~~~F~lvsG~-------------~s~~   79 (181)
T PF05753_consen   19 SPARLLVSKQI-----LNKYLVEGEDVTVTYTIYNVGSSAA-YDVKLTDDSFPPEDFELVSGS-------------LSAS   79 (181)
T ss_pred             CCcEEEEEEee-----ccccccCCcEEEEEEEEEECCCCeE-EEEEEECCCCCccccEeccCc-------------eEEE
Confidence            46777777754     6888999999999999999987643 3444444    2467765321             1122


Q ss_pred             EEEEeCCCeeEEEEEEEeceeeeEEEEE
Q psy5522         781 KLTIKANSGSTTTFVITPKELGYIGIKV  808 (1397)
Q Consensus       781 ~~~v~a~~~~tv~f~v~p~~lG~v~itV  808 (1397)
                      --.|++|+..+..+.++|++.|..+++-
T Consensus        80 ~~~i~pg~~vsh~~vv~p~~~G~f~~~~  107 (181)
T PF05753_consen   80 WERIPPGENVSHSYVVRPKKSGYFNFTP  107 (181)
T ss_pred             EEEECCCCeEEEEEEEeeeeeEEEEccC
Confidence            3489999999999999999999888763


No 58 
>COG5029 CAL1 Prenyltransferase, beta subunit [Posttranslational modification, protein turnover, chaperones]
Probab=90.48  E-value=0.79  Score=52.03  Aligned_cols=97  Identities=21%  Similarity=0.307  Sum_probs=59.0

Q ss_pred             CCCchhhhhcccchHHHHHHHHHH---hccccHHHHHHHHHHHHhccccceeeecCCCCcc--ccCCCCCCCCcch--hH
Q psy5522         890 PFGCGEQNMLNFVPNIVVLEYLKK---TYQLTDAIEAKASRYLETGYQQELTYRRPDGSFS--AFGTTDPNGSTWL--TA  962 (1397)
Q Consensus       890 P~GCgEQ~m~~~~p~i~~l~yL~~---~~~l~~~~~~~a~~~l~~gy~~~l~y~~~dGsfs--~~~~~~~~~s~wl--TA  962 (1397)
                      ++||++-..++++.++++.+-|.-   ..-++...|....+|+..       -|++||+|.  .|+.    .++-.  +|
T Consensus        93 ~f~~~~g~~~hL~sT~~Ai~~L~~~d~~~~~~~idr~~l~~fi~~-------lk~pdGsF~~~~~ge----vDtr~~Y~a  161 (342)
T COG5029          93 GFGGGPGQDSHLASTVFAIQSLAMLDSLDVLSRIDRDSLASFISG-------LKNPDGSFRSDLEGE----VDTRFLYIA  161 (342)
T ss_pred             CcCCCCccchhHHHHHHHHHHHHHhccccccchhhHHHHHHHHHh-------ccCCCCceecccCCc----chHHHHHHH
Confidence            456666666677777766654443   333343344445555543       589999985  3332    22222  33


Q ss_pred             HHHHHHHHccCCCccCHHHHHHHHHHHHhcccCCCccccc
Q psy5522         963 FVAKSFRQAASHTTIDESVILEALAWLSSNQAVNGSFPEV 1002 (1397)
Q Consensus       963 yvl~~~~~a~~~i~vd~~~~~~~~~WL~~~q~~~G~F~st 1002 (1397)
                      -.+++++.  .   .|....+.+++||.+.|+.+|||+.-
T Consensus       162 l~ilsllg--~---~~~~~~e~~vdyl~kCqnyeGGFg~~  196 (342)
T COG5029         162 LSILSLLG--D---LDKELFEGAVDYLKKCQNYEGGFGLC  196 (342)
T ss_pred             HHHHHHHh--h---cchhhhHHHHHHHHHhhccCCcccCC
Confidence            33333332  2   36677888999999999999999853


No 59 
>cd02895 GGTase-I Geranylgeranyltransferase types I (GGTase-I)-like proteins containing the protein prenyltransferase (PTase) domain, beta subunit (alpha 6 - alpha 6 barrel fold). GGTase-I s are a subgroup of the protein prenyltransferase family of lipid-modifying enzymes PTases catalyze the carboxyl-terminal lipidation of Ras, Rab, and several other cellular signal transduction proteins, facilitating membrane associations and specific protein-protein interactions. Prenyltransferases employ a Zn2+ ion to alkylate a thiol group catalyzing the formation of thioether linkages between cysteine residues at or near the C-terminus of protein acceptors and the C1 atom of isoprenoid lipids (geranylgeranyl (20-carbon) in the case of GGTase-I ). GGTase-I prenylates the cysteine in the terminal sequence, "CAAX" when X is Leu or Phe. Substrates for GTTase-I include the gamma subunit of neural G-proteins and several Ras-related G-proteins.  PTases are heterodimeric with both alpha and beta subunits r
Probab=89.62  E-value=1.2  Score=52.45  Aligned_cols=110  Identities=20%  Similarity=0.149  Sum_probs=69.0

Q ss_pred             ccCCCchhhhhcccchHHHHHHHHHHhccc---cHHHHHHHHHHHHhccccceeeecCCCCccccCCC--------CCCC
Q psy5522         888 KMPFGCGEQNMLNFVPNIVVLEYLKKTYQL---TDAIEAKASRYLETGYQQELTYRRPDGSFSAFGTT--------DPNG  956 (1397)
Q Consensus       888 ~~P~GCgEQ~m~~~~p~i~~l~yL~~~~~l---~~~~~~~a~~~l~~gy~~~l~y~~~dGsfs~~~~~--------~~~~  956 (1397)
                      .+|......-..+++-..+++.-|.--+..   +...+++.+++|.+- |..  -.++||+|+.-...        ...+
T Consensus        14 ~lp~~~~~~d~~r~~~~yf~l~~L~lL~~~~~~~~~~r~~~i~~i~~~-q~~--~~~~~GgF~~~~~~~~~~~~~~~~~~   90 (307)
T cd02895          14 LLPSSYQSLDTNRLTIAFFALSGLDLLGALDSILVEEKDDIIEWIYSL-QVL--SNLPRGGFRGSSTLGLPGTASKYDTG   90 (307)
T ss_pred             hCCchhhhhhhhhHHHHHHHHHHHHHhCCCccccHHHHHHHHHHHHHH-hcc--cCCCCCCCCCCCCCccccccccCCcc
Confidence            357776666666655555555433333322   355667777777652 100  12789999764321        1257


Q ss_pred             CcchhHHHHHHHHHccCC-CccCHHHHHHHHHHHHhcccCCCccccch
Q psy5522         957 STWLTAFVAKSFRQAASH-TTIDESVILEALAWLSSNQAVNGSFPEVG 1003 (1397)
Q Consensus       957 s~wlTAyvl~~~~~a~~~-i~vd~~~~~~~~~WL~~~q~~~G~F~stq 1003 (1397)
                      +++.|=+++.++...+.- -.+|   -+++++||.+.|+++|+|.+..
T Consensus        91 ~l~~ty~Al~~L~lL~~~~~~id---r~~i~~~l~~~q~~dGgF~~~~  135 (307)
T cd02895          91 NLAMTYFALLSLLILGDDLSRVD---RKAILNFLSKLQLPDGSFGSVL  135 (307)
T ss_pred             cHHHHHHHHHHHHHhCCchhhhh---HHHHHHHHHHhCCCCCCccCCc
Confidence            899999998888776542 1233   3567999999999999998763


No 60 
>PF13115 YtkA:  YtkA-like
Probab=89.10  E-value=3.2  Score=38.72  Aligned_cols=75  Identities=23%  Similarity=0.344  Sum_probs=53.7

Q ss_pred             cEEEEEcCCCcccCCceeEEEEEEEcCCCccCCCcceEEEEECCC--C-CeEEeeeeec-cCCceeeeeeecCCCCCCee
Q psy5522          98 YSVFIQLDKAIYKPGDLVRFRAIVLNSHLKPSVTGALEIFITDGK--G-NRIKQWTRAL-TTRGVFSADLQLSKSPVLGD  173 (1397)
Q Consensus        98 ~~vfIqTDKpIYkPGqtV~fRvv~ld~~l~P~~~~~~~v~I~DP~--g-~~I~qw~~~~-~~~Gi~~~sf~L~~~~~lG~  173 (1397)
                      ..+-++++.++ +.|+. -|.+- ++.+.+|+.+..+.+.+..|.  | ..-..+.... ...|.|..+..|+.   .|.
T Consensus         7 ~~v~l~~~~~~-~~g~~-~i~v~-~~~~g~pv~~a~V~~~~~m~~~~g~~~~~~~~~~~~~~~G~Y~~~~~f~m---~G~   80 (86)
T PF13115_consen    7 YTVELVSPEPP-KVGEN-TITVT-VDQGGKPVTDADVQFEIWMPDMEGMEPMTSKVELEETGPGVYEAEVTFSM---AGT   80 (86)
T ss_pred             EEEEEecCCCC-cCCce-EEEEE-ECCCCCCCCCCEEEEEEEeCCCCCCCCCceeeeeecCCCCeEEEEeecCC---Cee
Confidence            34455555444 88888 57666 888999997778999998885  3 2223333333 47899999988865   699


Q ss_pred             EEEEE
Q psy5522         174 WNITI  178 (1397)
Q Consensus       174 y~I~v  178 (1397)
                      |.|++
T Consensus        81 W~i~v   85 (86)
T PF13115_consen   81 WQITV   85 (86)
T ss_pred             EEEEE
Confidence            99987


No 61 
>PF10670 DUF4198:  Domain of unknown function (DUF4198)
Probab=88.95  E-value=1.8  Score=47.74  Aligned_cols=76  Identities=20%  Similarity=0.232  Sum_probs=48.8

Q ss_pred             EEEEEe--cCCcccCCCeEEEEEEEEccCCCccCCCCceEEEEEeeeccccccceeeEeeCCCeeEEEEEeCCCCCceeE
Q psy5522         314 KMDLIK--SSEYFKPGLKYTAYMKLTHHDGTPVTDNNNMVQVRHGFSYDESKYEANQYKLDRNGMIKLVYYPPANENVTT  391 (1397)
Q Consensus       314 ~i~f~~--~~~~~kpG~p~~~~v~v~~~dG~P~~~~~~~V~v~~~~~~~~~~~~~~~~ttd~~G~~~f~i~~~~~~~~~~  391 (1397)
                      .+++++  .|..++.|.++.++|.   .||+|+++  ..|.+.....|.........++||++|.+.|.+..+   |.  
T Consensus       135 ~lEIvPl~~P~~l~~g~~~~~~vl---~~GkPl~~--a~V~~~~~~~~~~~~~~~~~~~TD~~G~~~~~~~~~---G~--  204 (215)
T PF10670_consen  135 PLEIVPLTNPYKLKAGDPLPFQVL---FDGKPLAG--AEVEAFSPGGWYDVEHEAKTLKTDANGRATFTLPRP---GL--  204 (215)
T ss_pred             cEEEEECcCcccccCCCEEEEEEE---ECCeEccc--EEEEEEECCCccccccceEEEEECCCCEEEEecCCC---EE--
Confidence            345555  4556789999887774   79999995  356554332222111226789999999999887554   45  


Q ss_pred             EEEEEEec
Q psy5522         392 LGIEAEYL  399 (1397)
Q Consensus       392 l~l~a~~~  399 (1397)
                      .-|+|.+.
T Consensus       205 wli~a~~~  212 (215)
T PF10670_consen  205 WLIRASHK  212 (215)
T ss_pred             EEEEEEEe
Confidence            44555544


No 62 
>PLN03201 RAB geranylgeranyl transferase beta-subunit; Provisional
Probab=88.26  E-value=1.5  Score=51.65  Aligned_cols=73  Identities=18%  Similarity=0.183  Sum_probs=51.1

Q ss_pred             ccHHHHHHHHHHHHhccccceeeecCCCCccccCCCCCCCCcchhHHHHHHHHHccCCCccCHHHHHHHHHHHHhcccCC
Q psy5522         917 LTDAIEAKASRYLETGYQQELTYRRPDGSFSAFGTTDPNGSTWLTAFVAKSFRQAASHTTIDESVILEALAWLSSNQAVN  996 (1397)
Q Consensus       917 l~~~~~~~a~~~l~~gy~~~l~y~~~dGsfs~~~~~~~~~s~wlTAyvl~~~~~a~~~i~vd~~~~~~~~~WL~~~q~~~  996 (1397)
                      ++....+++++||.+       -|+.||+|+.-++.  ++....|..++-++..++..-.+|.   +...+||.++|...
T Consensus       149 ~~~i~~~~~~~~i~s-------cq~~dGGF~~~p~~--esh~g~T~caiaaL~llg~~~~~d~---~~l~~wL~~rQ~~~  216 (316)
T PLN03201        149 LDKINVEKAVDYIVS-------CKNFDGGFGCTPGG--ESHAGQIFCCVGALAITGSLHHVDK---DLLGWWLCERQVKS  216 (316)
T ss_pred             cchhHHHHHHHHHHH-------hcCCCCCcCCCCCC--CCccceehHHHHHHHHcCccccCCH---HHHHHHHHHhCCCC
Confidence            333335678888764       46789999875443  4566668777777777765544564   44589999999989


Q ss_pred             Ccccc
Q psy5522         997 GSFPE 1001 (1397)
Q Consensus       997 G~F~s 1001 (1397)
                      |||..
T Consensus       217 GGf~g  221 (316)
T PLN03201        217 GGLNG  221 (316)
T ss_pred             CCcCC
Confidence            99974


No 63 
>cd02895 GGTase-I Geranylgeranyltransferase types I (GGTase-I)-like proteins containing the protein prenyltransferase (PTase) domain, beta subunit (alpha 6 - alpha 6 barrel fold). GGTase-I s are a subgroup of the protein prenyltransferase family of lipid-modifying enzymes PTases catalyze the carboxyl-terminal lipidation of Ras, Rab, and several other cellular signal transduction proteins, facilitating membrane associations and specific protein-protein interactions. Prenyltransferases employ a Zn2+ ion to alkylate a thiol group catalyzing the formation of thioether linkages between cysteine residues at or near the C-terminus of protein acceptors and the C1 atom of isoprenoid lipids (geranylgeranyl (20-carbon) in the case of GGTase-I ). GGTase-I prenylates the cysteine in the terminal sequence, "CAAX" when X is Leu or Phe. Substrates for GTTase-I include the gamma subunit of neural G-proteins and several Ras-related G-proteins.  PTases are heterodimeric with both alpha and beta subunits r
Probab=88.01  E-value=1.1  Score=52.64  Aligned_cols=70  Identities=17%  Similarity=0.196  Sum_probs=46.9

Q ss_pred             HHHHHHHHHhccccceeeecCCCCccccCCCCCCCCcchhHHHHHHHHHccCCCccCHHHHHHHHHHHHhcccCCCccc
Q psy5522         922 EAKASRYLETGYQQELTYRRPDGSFSAFGTTDPNGSTWLTAFVAKSFRQAASHTTIDESVILEALAWLSSNQAVNGSFP 1000 (1397)
Q Consensus       922 ~~~a~~~l~~gy~~~l~y~~~dGsfs~~~~~~~~~s~wlTAyvl~~~~~a~~~i~vd~~~~~~~~~WL~~~q~~~G~F~ 1000 (1397)
                      +++++++|.+       -|..||+|+.-+..  ++....|-.++-++......-..+.....++++||.++|+.+|||.
T Consensus       167 ~~~li~~l~s-------~Q~~dGGF~~~~~~--Esh~g~Tyca~asL~lL~~~~~~~~~~~~~l~~wL~~rQ~~~GGF~  236 (307)
T cd02895         167 KEKLIDYIKS-------SQSYDGGFGQGPGL--ESHGGSTFCAIASLSLLGKLEELSEKFLERLKRWLVHRQVSGTGFN  236 (307)
T ss_pred             HHHHHHHHHH-------ccCCCCCccCCCCC--CccccHHHHHHHHHHHcCCccccccccHHHHHHHHHHhcCCCCCcC
Confidence            4567777664       47779999865543  4456666666666655554311113446778999999999999996


No 64 
>PLN02710 farnesyltranstransferase subunit beta
Probab=87.93  E-value=1.1  Score=54.59  Aligned_cols=67  Identities=16%  Similarity=0.231  Sum_probs=43.5

Q ss_pred             HHHHHHHHHhccccceeeecCCCCccccCCCCCCCCcchh--HHHHHHHHHccCCCccCHHHHHHHHHHHHhcccCCCcc
Q psy5522         922 EAKASRYLETGYQQELTYRRPDGSFSAFGTTDPNGSTWLT--AFVAKSFRQAASHTTIDESVILEALAWLSSNQAVNGSF  999 (1397)
Q Consensus       922 ~~~a~~~l~~gy~~~l~y~~~dGsfs~~~~~~~~~s~wlT--Ayvl~~~~~a~~~i~vd~~~~~~~~~WL~~~q~~~G~F  999 (1397)
                      +++..+||.+       .|++||||+.-...  +.+++.|  |-+++.++..     ++....+++++||.+.|+.+|||
T Consensus       145 r~~l~~fl~s-------~q~~dGgF~~~~~g--E~D~R~tYcAlail~LL~~-----l~~~~~e~~~~~I~scQ~~dGGF  210 (439)
T PLN02710        145 REKLYTFLLR-------MKDPSGGFRMHDGG--EMDVRACYTAISVASLLNI-----LDDELVKGVGDYILSCQTYEGGI  210 (439)
T ss_pred             HHHHHHHHHH-------cCCCCCCcccCCCC--CCCcCCcHHHHHHHHHhCc-----CchhhHHHHHHHHHHhCCCCCCC
Confidence            4566666654       57789999763322  3344433  4444444432     24556788999999999999999


Q ss_pred             ccc
Q psy5522        1000 PEV 1002 (1397)
Q Consensus      1000 ~st 1002 (1397)
                      ...
T Consensus       211 ~g~  213 (439)
T PLN02710        211 GGE  213 (439)
T ss_pred             CCC
Confidence            855


No 65 
>cd02893 FTase Protein farnesyltransferase (FTase)_like proteins containing the protein prenyltransferase (PTase) domain, beta subunit (alpha 6 - alpha 6 barrel fold). FTases are a subgroup of PTase family of lipid-modifying enzymes. PTases catalyze the carboxyl-terminal lipidation of Ras, Rab, and several other cellular signal transduction proteins, facilitating membrane associations and specific protein-protein interactions. These proteins are heterodimers of alpha and beta subunits. Both subunits are required for catalytic activity. Prenyltransferases employ a Zn2+ ion to alkylate a thiol group catalyzing the formation of thioether linkages between cysteine residues at or near the C-terminus of protein acceptors and the C1 atom of isoprenoid lipids. Ftase attaches a 15-carbon farnesyl group to the cysteine within the C-terminal CaaX motif of substrate proteins when X is Ala, Met, Ser, Cys or Gln. Protein farnesylation has been shown to play critical roles in a variety of cellular pro
Probab=87.86  E-value=2  Score=50.39  Aligned_cols=74  Identities=19%  Similarity=0.302  Sum_probs=53.5

Q ss_pred             cccHHHHHHHHHHHHhccccceeeecCCCCccccCCCCCCCCcchhHHHHHHHHHccCCCccCHHHHHHHHHHHHhcccC
Q psy5522         916 QLTDAIEAKASRYLETGYQQELTYRRPDGSFSAFGTTDPNGSTWLTAFVAKSFRQAASHTTIDESVILEALAWLSSNQAV  995 (1397)
Q Consensus       916 ~l~~~~~~~a~~~l~~gy~~~l~y~~~dGsfs~~~~~~~~~s~wlTAyvl~~~~~a~~~i~vd~~~~~~~~~WL~~~q~~  995 (1397)
                      .++++..+++++||.+       -|..||+|+..++.  ++..-.|.+++-+|..++..-.+|   .++.++||.++|..
T Consensus       142 ~~~~~~~~~~~~~l~~-------cQ~~dGGF~~~p~~--e~h~~yTfcavasL~llg~~~~~d---~~~l~~wl~~~q~~  209 (299)
T cd02893         142 ILTDELFEGVAEYILS-------CQTYEGGFGGVPGN--EAHGGYTFCALAALAILGKPDKLD---LESLLRWLVARQMR  209 (299)
T ss_pred             CCchhhHHHHHHHHHH-------cCCCCCCcCCCCCC--CCCccHHHHHHHHHHHcCCccccC---HHHHHHHHHhhcCC
Confidence            3456667788888775       46779999875443  556677888888887776643344   56679999998876


Q ss_pred             -CCcccc
Q psy5522         996 -NGSFPE 1001 (1397)
Q Consensus       996 -~G~F~s 1001 (1397)
                       .|||.-
T Consensus       210 ~~GGf~g  216 (299)
T cd02893         210 FEGGFQG  216 (299)
T ss_pred             CCCCcCC
Confidence             899963


No 66 
>smart00557 IG_FLMN Filamin-type immunoglobulin domains. These form a rod-like structure in the actin-binding cytoskeleton protein, filamin. The C-terminal repeats of filamin bind beta1-integrin (CD29).
Probab=87.48  E-value=3.6  Score=39.15  Aligned_cols=64  Identities=17%  Similarity=0.270  Sum_probs=50.6

Q ss_pred             cCCceeEEEEEEEcCCCccCCCcceEEEEECCCCCeEEeeeeeccCCceeeeeeecCCCCCCeeEEEEEEEcC
Q psy5522         110 KPGDLVRFRAIVLNSHLKPSVTGALEIFITDGKGNRIKQWTRALTTRGVFSADLQLSKSPVLGDWNITINVLD  182 (1397)
Q Consensus       110 kPGqtV~fRvv~ld~~l~P~~~~~~~v~I~DP~g~~I~qw~~~~~~~Gi~~~sf~L~~~~~lG~y~I~v~~~~  182 (1397)
                      +.|+...|.+-++|.     ....+.+.|.+|+|+.+.-. .....+|.+..+|. |..  .|.|+|.+..++
T Consensus        15 ~vg~~~~f~v~~~d~-----G~~~~~v~i~~p~g~~~~~~-v~d~~dGty~v~y~-P~~--~G~~~i~V~~~g   78 (93)
T smart00557       15 VVGEPAEFTIDTRGA-----GGGELEVEVTGPSGKKVPVE-VKDNGDGTYTVSYT-PTE--PGDYTVTVKFGG   78 (93)
T ss_pred             ecCCCEEEEEEcCCC-----CCCcEEEEEECCCCCeeEeE-EEeCCCCEEEEEEE-eCC--CEeEEEEEEECC
Confidence            567788888888876     35788999999999766543 35677799999999 444  399999999987


No 67 
>COG5029 CAL1 Prenyltransferase, beta subunit [Posttranslational modification, protein turnover, chaperones]
Probab=87.47  E-value=2.2  Score=48.59  Aligned_cols=94  Identities=17%  Similarity=0.303  Sum_probs=59.6

Q ss_pred             CchhhhhcccchHHHHHHHHHHhccccHHHHHHHHHHHHhccccceeeecCCCCccccCCCCCCCCcchhHHHHHHHHHc
Q psy5522         892 GCGEQNMLNFVPNIVVLEYLKKTYQLTDAIEAKASRYLETGYQQELTYRRPDGSFSAFGTTDPNGSTWLTAFVAKSFRQA  971 (1397)
Q Consensus       892 GCgEQ~m~~~~p~i~~l~yL~~~~~l~~~~~~~a~~~l~~gy~~~l~y~~~dGsfs~~~~~~~~~s~wlTAyvl~~~~~a  971 (1397)
                      ||..+.+...+-+  ++..|   +.++.+..+-+++|+.+       -|+.||+|+.=+..+..+----+|.+.++++  
T Consensus       151 gevDtr~~Y~al~--ilsll---g~~~~~~~e~~vdyl~k-------CqnyeGGFg~~p~aEaHag~tFcalaalalL--  216 (342)
T COG5029         151 GEVDTRFLYIALS--ILSLL---GDLDKELFEGAVDYLKK-------CQNYEGGFGLCPYAEAHAGYTFCALAALALL--  216 (342)
T ss_pred             CcchHHHHHHHHH--HHHHH---hhcchhhhHHHHHHHHH-------hhccCCcccCCCchhhccchHHHHHHHHHHH--
Confidence            6666666333222  23334   35666777788887776       4777999998766433333333455555444  


Q ss_pred             cCCCccCHHHHHHHHHHHHhcccCCCcccc
Q psy5522         972 ASHTTIDESVILEALAWLSSNQAVNGSFPE 1001 (1397)
Q Consensus       972 ~~~i~vd~~~~~~~~~WL~~~q~~~G~F~s 1001 (1397)
                      .+...++.  .++..+||.++|-+.|||..
T Consensus       217 ~~Ld~ls~--~E~l~~Wl~~RQ~ssgGl~G  244 (342)
T COG5029         217 GKLDKLSD--VEKLIRWLAERQLSSGGLNG  244 (342)
T ss_pred             hcccccch--HHHHHHHHHHcccccCCcCC
Confidence            44444443  66779999999999999963


No 68 
>KOG0367|consensus
Probab=87.21  E-value=1.1  Score=50.45  Aligned_cols=105  Identities=19%  Similarity=0.327  Sum_probs=57.9

Q ss_pred             hhccCCCchh------hhhcccchHHHHHHHHHHhccccHHHHHHHHHHHHhccccceeeecCCCCccccCCCCCCCCcc
Q psy5522         886 LIKMPFGCGE------QNMLNFVPNIVVLEYLKKTYQLTDAIEAKASRYLETGYQQELTYRRPDGSFSAFGTTDPNGSTW  959 (1397)
Q Consensus       886 Ll~~P~GCgE------Q~m~~~~p~i~~l~yL~~~~~l~~~~~~~a~~~l~~gy~~~l~y~~~dGsfs~~~~~~~~~s~w  959 (1397)
                      ..|.|-||+-      .+-.+|+-...+.-|+-..-+  ---+++++.||.+-       |+.||+|+.-++.++.+-.-
T Consensus       136 ~~Q~~dGsF~~~~~GSe~DmRFvYcA~aI~ymLd~~s--~iD~ek~~~yI~~~-------q~YdgGfg~~pg~EsHgG~T  206 (347)
T KOG0367|consen  136 ACQRPDGSFVSINVGSESDMRFVYCAVAICYMLDFWS--GIDKEKLIGYIRSS-------QRYDGGFGQHPGGESHGGAT  206 (347)
T ss_pred             HhcCCCCceeecCCCCchhhHHHHHHHHHHHHhcccc--ccCHHHHHHHHHHh-------hccccccccCCCCCCCcchh
Confidence            4566767653      222344433333344432111  11246788888753       66799999876543332222


Q ss_pred             hhHHHHHHHHHccCCC--cc-CHHHHHHHHHHHHhcccCCCcccc
Q psy5522         960 LTAFVAKSFRQAASHT--TI-DESVILEALAWLSSNQAVNGSFPE 1001 (1397)
Q Consensus       960 lTAyvl~~~~~a~~~i--~v-d~~~~~~~~~WL~~~q~~~G~F~s 1001 (1397)
                      --|.+-++|  ..+.+  .+ +...+++.++||.++|..+|||..
T Consensus       207 fCAlAsL~L--~~~l~~e~l~~~~~~erlirWli~RQ~~sgGfqG  249 (347)
T KOG0367|consen  207 FCALASLAL--MGKLIPEELSNTSKVERLIRWLIQRQVSSGGFQG  249 (347)
T ss_pred             HHHHHHHHH--HhhhhhhhhccccCHHHHHHHHHHHhhccCCcCC
Confidence            223333333  33331  12 333488999999999999999964


No 69 
>PF00207 A2M:  Alpha-2-macroglobulin family;  InterPro: IPR001599 This entry contains serum complement C3 and C4 precursors and alpha-macrogrobulins.  The alpha-macroglobulin (aM) family of proteins includes protease inhibitors [], typified by the human tetrameric a2-macroglobulin (a2M); they belong to the MEROPS proteinase inhibitor family I39, clan IL. These protease inhibitors share several defining properties, which include (i) the ability to inhibit proteases from all catalytic classes, (ii) the presence of a 'bait region' and a thiol ester, (iii) a similar protease inhibitory mechanism and (iv) the inactivation of the inhibitory capacity by reaction of the thiol ester with small primary amines. aM protease inhibitors inhibit by steric hindrance []. The mechanism involves protease cleavage of the bait region, a segment of the aM that is particularly susceptible to proteolytic cleavage, which initiates a conformational change such that the aM collapses about the protease. In the resulting aM-protease complex, the active site of the protease is sterically shielded, thus substantially decreasing access to protein substrates. Two additional events occur as a consequence of bait region cleavage, namely (i) the h-cysteinyl-g-glutamyl thiol ester becomes highly reactive and (ii) a major conformational change exposes a conserved COOH-terminal receptor binding domain [] (RBD). RBD exposure allows the aM protease complex to bind to clearance receptors and be removed from circulation []. Tetrameric, dimeric, and, more recently, monomeric aM protease inhibitors have been identified [, ].; GO: 0004866 endopeptidase inhibitor activity; PDB: 3KLS_B 3PRX_C 3KM9_B 3PVM_C 3CU7_A 4E0S_A 4A5W_A 2PN5_A 3FRP_G 3HRZ_B ....
Probab=87.07  E-value=6  Score=37.50  Aligned_cols=75  Identities=17%  Similarity=0.251  Sum_probs=51.2

Q ss_pred             ccCCceeeeeeecCCCCCCeeEEEEEEEcC-----eEEEE-EEEEeeecCCcEEEEEeCCCceEeeCCeEEEEEEEEEec
Q psy5522         153 LTTRGVFSADLQLSKSPVLGDWNITINVLD-----QKFTK-RFTVAEYVLPKFEVNVNVPPHATFKDSKVVISVNAKYTY  226 (1397)
Q Consensus       153 ~~~~Gi~~~sf~L~~~~~lG~y~I~v~~~~-----~~~~~-~F~VeEYvLPkFeV~v~~p~~i~~~~~~~~v~v~AkYty  226 (1397)
                      ++++|-...+|.||+.  +|.|+|.+..-+     ...+. .|.|..    .|-++...|..+. .+|++.+.+...-+-
T Consensus        10 ~d~~G~~~~~~~lPd~--it~w~v~a~a~s~~~~~g~~~~~~~~v~~----p~~i~~~lP~~l~-~GD~~~i~v~v~N~~   82 (92)
T PF00207_consen   10 TDANGKATFSFTLPDS--ITSWRVTAFAVSPTGGFGIAEPPEITVFK----PFFIQLNLPRSLR-RGDQIQIPVTVFNYT   82 (92)
T ss_dssp             CCTTEEEEEEEE-SSS--SSEEEEEEEEEETTTEEEEECCEEEEEB-----SEEEEEE--SEEE-TTSEEEEEEEEEE-S
T ss_pred             cCCCCEEEEEEECCCC--ccEEEEEEEEECCCCcceEecceEEEEEe----eEEEEcCCCcEEe-cCCEEEEEEEEEeCC
Confidence            3578999999999997  689999998743     22333 566533    6888999988875 688999998887666


Q ss_pred             CccccceE
Q psy5522         227 GKPVKGEA  234 (1397)
Q Consensus       227 GkPV~G~v  234 (1397)
                      ++....++
T Consensus        83 ~~~~~v~V   90 (92)
T PF00207_consen   83 DKDQEVTV   90 (92)
T ss_dssp             SS-EEEEE
T ss_pred             CCCEEEEE
Confidence            66554433


No 70 
>smart00634 BID_1 Bacterial Ig-like domain (group 1).
Probab=85.86  E-value=8.5  Score=36.49  Aligned_cols=67  Identities=16%  Similarity=0.161  Sum_probs=44.8

Q ss_pred             CeEEEEEEEEccCCCccCCCCceEEEEEeeeccccccceeeEeeCCCeeEEEEEeCCCCCceeEEEEEEEecC
Q psy5522         328 LKYTAYMKLTHHDGTPVTDNNNMVQVRHGFSYDESKYEANQYKLDRNGMIKLVYYPPANENVTTLGIEAEYLD  400 (1397)
Q Consensus       328 ~p~~~~v~v~~~dG~P~~~~~~~V~v~~~~~~~~~~~~~~~~ttd~~G~~~f~i~~~~~~~~~~l~l~a~~~~  400 (1397)
                      ....+++.++|.+|.|+++.  .|.+.+...- ..........||++|.+.+.+... ..+.  +.|+|+..+
T Consensus        18 d~~~i~v~v~D~~Gnpv~~~--~V~f~~~~~~-~~~~~~~~~~Td~~G~a~~~l~~~-~~G~--~~vta~~~~   84 (92)
T smart00634       18 DAITLTATVTDANGNPVAGQ--EVTFTTPSGG-ALTLSKGTATTDANGIATVTLTST-TAGV--YTVTASLEN   84 (92)
T ss_pred             ccEEEEEEEECCCCCCcCCC--EEEEEECCCc-eeeccCCeeeeCCCCEEEEEEECC-CCcE--EEEEEEECC
Confidence            67889999999999999964  4555542110 000112456899999999999854 4455  666666543


No 71 
>PF00630 Filamin:  Filamin/ABP280 repeat;  InterPro: IPR017868 The many different actin cross-linking proteins share a common architecture, consisting of a globular actin-binding domain and an extended rod. Whereas their actin-binding domains consist of two calponin homology domains (see IPR001715 from INTERPRO), their rods fall into three families. The rod domain of the family including the Dictyostelium discoideum (Slime mould) gelation factor (ABP120) and human filamin (ABP280) is constructed from tandem repeats of a 100-residue motif that is glycine and proline rich []. The gelation factor's rod contains 6 copies of the repeat, whereas filamin has a rod constructed from 24 repeats. The resolution of the 3D structure of rod repeats from the gelation factor has shown that they consist of a beta-sandwich, formed by two beta-sheets arranged in an immunoglobulin-like fold [, ]. Because conserved residues that form the core of the repeats are preserved in filamin, the repeat structure should be common to the members of the gelation factor/filamin family. The head to tail homodimerisation is crucial to the function of the ABP120 and ABP280 proteins. This interaction involves a small portion at the distal end of the rod domains. For the gelation factor it has been shown that the carboxy-terminal repeat 6 dimerises through a double edge-to-edge extension of the beta-sheet and that repeat 5 contributes to dimerisation to some extent [, , ].; PDB: 2DI9_A 2EEC_A 2DIC_A 2EEA_A 2DMC_A 2EE9_A 2D7O_A 2D7N_A 2K7P_A 2NQC_A ....
Probab=84.28  E-value=6.1  Score=37.84  Aligned_cols=70  Identities=13%  Similarity=0.201  Sum_probs=54.6

Q ss_pred             cCCceeEEEEEEEcCCCccC--CCcceEEEEECCCCCe---EEeeeeeccCCceeeeeeecCCCCCCeeEEEEEEEcC
Q psy5522         110 KPGDLVRFRAIVLNSHLKPS--VTGALEIFITDGKGNR---IKQWTRALTTRGVFSADLQLSKSPVLGDWNITINVLD  182 (1397)
Q Consensus       110 kPGqtV~fRvv~ld~~l~P~--~~~~~~v~I~DP~g~~---I~qw~~~~~~~Gi~~~sf~L~~~~~lG~y~I~v~~~~  182 (1397)
                      ..|+...|.|.++|..-.+.  ....+.+.|.+|++..   -..+.......|.+..+|....   .|.|+|.+..++
T Consensus        18 ~~g~~~~F~V~~~d~~g~~~~~~~~~~~v~i~~p~~~~~~~~~~~~v~~~~~G~y~v~y~p~~---~G~y~i~V~~~g   92 (101)
T PF00630_consen   18 VVGEPATFTVDTRDAGGNPVSSGGDEFQVTITSPDGKEEPVPVPVEVIDNGDGTYTVSYTPTE---PGKYKISVKING   92 (101)
T ss_dssp             ETTSEEEEEEEETTTTSSBEESTSSEEEEEEESSSSESS--EEEEEEEEESSSEEEEEEEESS---SEEEEEEEEESS
T ss_pred             ECCCcEEEEEEEccCCCCccccCCceeEEEEeCCCCCccccccceEEEECCCCEEEEEEEeCc---cEeEEEEEEECC
Confidence            68999999999999865543  3467889999999872   2233445678899999999554   399999999976


No 72 
>KOG0366|consensus
Probab=83.44  E-value=3.3  Score=45.80  Aligned_cols=90  Identities=18%  Similarity=0.252  Sum_probs=53.2

Q ss_pred             cccchHHHHHHHHHHhccccHHHHHHHHHHHHhccccceeeecCCCCcc--ccCCCCCCCCcchhHHHHHHHHHccCCCc
Q psy5522         899 LNFVPNIVVLEYLKKTYQLTDAIEAKASRYLETGYQQELTYRRPDGSFS--AFGTTDPNGSTWLTAFVAKSFRQAASHTT  976 (1397)
Q Consensus       899 ~~~~p~i~~l~yL~~~~~l~~~~~~~a~~~l~~gy~~~l~y~~~dGsfs--~~~~~~~~~s~wlTAyvl~~~~~a~~~i~  976 (1397)
                      .++.-++.+.|.|.--..+..--+.+...|+. |      .|++||||+  .||.    -++-.+--++.++...++.  
T Consensus        93 ~hll~TlsAvQiL~~ydsi~~~d~d~v~~yi~-g------Lq~edGsF~gD~wGE----vDTRfs~~av~~L~lLg~l--  159 (329)
T KOG0366|consen   93 PHLLYTLSAVQILALYDSINVLDRDKVASYIK-G------LQQEDGSFSGDIWGE----VDTRFSYCAVACLALLGKL--  159 (329)
T ss_pred             hHHHHHHHHHHHHHHHcccccccHHHHHHHHH-h------hcCcCCcccCCcccc----cchhhhHHHHHHHHHHhhH--
Confidence            34444555666665444443333445556664 4      478899995  4543    2333333333333333332  


Q ss_pred             cCHHHHHHHHHHHHhcccCCCccccc
Q psy5522         977 IDESVILEALAWLSSNQAVNGSFPEV 1002 (1397)
Q Consensus       977 vd~~~~~~~~~WL~~~q~~~G~F~st 1002 (1397)
                       |.-.+++|++|+.+--|-||||...
T Consensus       160 -d~~nve~aVd~~~~CyN~DGGFG~~  184 (329)
T KOG0366|consen  160 -DTINVEKAVDFVLSCYNFDGGFGCR  184 (329)
T ss_pred             -HHhhHHHHHHHHHhhcccCCCcCCC
Confidence             3455788999999999999999864


No 73 
>PF00927 Transglut_C:  Transglutaminase family, C-terminal ig like domain;  InterPro: IPR008958 Synonym(s): Protein-glutamine gamma-glutamyltransferase, Fibrinoligase, TGase  Transglutaminases catalyse the post-translational modification of proteins at glutamine residues, with formation of isopeptide bonds. Members of the transglutaminase family usually have three domains: N-terminal (IPR001102 from INTERPRO), middle (IPR013808 from INTERPRO) and C-terminal. The middle domain is usually well conserved, but family members can display major differences in their N- and C-terminal domains, although their overall structure is conserved []. This entry represents the C-terminal domain found in transglutaminases, which consists of an immunoglobulin-like beta-sandwich consisting of seven strands in two sheets with a Greek key topology. The best known transglutaminase is blood coagulation factor XIII, a plasma tetrameric protein composed of two catalytic A subunits and two non-catalytic B subunits. Factor XIII is responsible for cross-linking fibrin chains, thus stabilising the fibrin clot. Protein-glutamine gamma-glutamyltransferases (2.3.2.13 from EC) are calcium-dependent enzymes that catalyse the cross-linking of proteins by promoting the formation of isopeptide bonds between the gamma-carboxyl group of a glutamine in one polypeptide chain and the epsilon-amino group of a lysine in a second polypeptide chain. TGases also catalyse the conjugation of polyamines to proteins [, ].; GO: 0003810 protein-glutamine gamma-glutamyltransferase activity, 0018149 peptide cross-linking; PDB: 2XZZ_A 1GGY_B 1FIE_B 1GGU_B 1GGT_B 1F13_A 1QRK_B 1EVU_A 1EX0_B 1L9N_B ....
Probab=82.79  E-value=6.7  Score=38.27  Aligned_cols=79  Identities=19%  Similarity=0.195  Sum_probs=53.5

Q ss_pred             EEecCCceeeeCCEEEEEEEEEcCCCcc-EEEEEEEeecC----CeeeeecCCCccCCCCCCcceeeeEEEEEeCCCeeE
Q psy5522         717 ISLDLPYSVMRGEVVAIPVVVFNYLSQD-LVADVTLENVG----QFDFADFSNEVDAAPQPKFEVFRRKKLTIKANSGST  791 (1397)
Q Consensus       717 v~l~LPysv~rGE~~~l~~~VfNyl~~~-~~v~v~L~~s~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~a~~~~t  791 (1397)
                      +++.++-.+..|+.+.+.+++.|-++.. -.|++.|....    +.....             .........|.+++..+
T Consensus         3 ~~i~~~~~~~vG~d~~v~v~~~N~~~~~l~~v~~~l~~~~v~ytG~~~~~-------------~~~~~~~~~l~p~~~~~   69 (107)
T PF00927_consen    3 IKIKLPGDPVVGQDFTVSVSFTNPSSEPLRNVSLNLCAFTVEYTGLTRDQ-------------FKKEKFEVTLKPGETKS   69 (107)
T ss_dssp             EEEEEESEEBTTSEEEEEEEEEE-SSS-EECEEEEEEEEEEECTTTEEEE-------------EEEEEEEEEE-TTEEEE
T ss_pred             EEEEECCCccCCCCEEEEEEEEeCCcCccccceeEEEEEEEEECCccccc-------------EeEEEcceeeCCCCEEE
Confidence            4566777888999999999999999999 55777764321    111000             12335578999999999


Q ss_pred             EEEEEEeceeeeEEEEE
Q psy5522         792 TTFVITPKELGYIGIKV  808 (1397)
Q Consensus       792 v~f~v~p~~lG~v~itV  808 (1397)
                      +.+.|.|...|.-.+-+
T Consensus        70 ~~~~i~p~~yG~~~~l~   86 (107)
T PF00927_consen   70 VEVTITPSQYGPKQLLV   86 (107)
T ss_dssp             EEEEE-HHSHEEECCEE
T ss_pred             EEEEEEceeEecchhcc
Confidence            99999999998744333


No 74 
>PF13243 Prenyltrans_1:  Prenyltransferase-like; PDB: 3SDR_A 3SAE_A 3SDV_A 3SDT_A 3SDQ_A 3SDU_A.
Probab=81.20  E-value=0.58  Score=45.66  Aligned_cols=60  Identities=25%  Similarity=0.333  Sum_probs=31.8

Q ss_pred             HHHHHHHHHhccccceeeecCCCCccccCCCCCCCCcchhHHHHHHHHHccCCCccCHHHHHHHHHHHHhccc
Q psy5522         922 EAKASRYLETGYQQELTYRRPDGSFSAFGTTDPNGSTWLTAFVAKSFRQAASHTTIDESVILEALAWLSSNQA  994 (1397)
Q Consensus       922 ~~~a~~~l~~gy~~~l~y~~~dGsfs~~~~~~~~~s~wlTAyvl~~~~~a~~~i~vd~~~~~~~~~WL~~~q~  994 (1397)
                      .+++.+||.+       .|++||+|..++.    .+...|+-++.++..+...  -+...+.++++||.++|.
T Consensus        48 i~ka~~~l~~-------~Q~~dG~w~~~~~----~~~~~t~~~~~~l~~~~~~--~~~~~~~r~~~wi~~~~~  107 (109)
T PF13243_consen   48 IKKAIDWLLS-------HQNPDGGWGYSGG----EYVSMTAAAIAALALAGVY--PDDEAVERGLEWILSHQL  107 (109)
T ss_dssp             SHHHHHHHHH----------TTS--S-TS------HHHHHHHHHHHHHHHHTT----HHHHHHHHHHHHHH--
T ss_pred             HHHHHHHHHH-------hcCCCCCCCCcCC----CCHHHHHHHHHHHHHhCCC--CCCHHHHHHHHHHHHccC
Confidence            3678887765       5889999987732    2344555444444433221  266779999999998753


No 75 
>PF02369 Big_1:  Bacterial Ig-like domain (group 1);  InterPro: IPR003344 Proteins that contain this domain are found in a variety of bacterial and phage surface proteins such as intimins. Intimin is a bacterial cell-adhesion molecule that mediates the intimate bacterial host-cell interaction. It contains three domains; two immunoglobulin-like domains and a C-type lectin-like module implying that carbohydrate recognition may be important in intimin-mediated cell adhesion [].; PDB: 1CWV_A 4E9L_A 1F02_I 1F00_I.
Probab=79.83  E-value=9.7  Score=36.77  Aligned_cols=69  Identities=16%  Similarity=0.139  Sum_probs=40.6

Q ss_pred             CCCeEEEEEEEEccCCCccCCCCceEEEEEeeecccccccee--eEeeCCCeeEEEEEeCCCCCceeEEEEEEEecCc
Q psy5522         326 PGLKYTAYMKLTHHDGTPVTDNNNMVQVRHGFSYDESKYEAN--QYKLDRNGMIKLVYYPPANENVTTLGIEAEYLDI  401 (1397)
Q Consensus       326 pG~p~~~~v~v~~~dG~P~~~~~~~V~v~~~~~~~~~~~~~~--~~ttd~~G~~~f~i~~~~~~~~~~l~l~a~~~~~  401 (1397)
                      -+....+++.++|.+|+|+++.  .|.+.....  .......  ..+||++|.+.+.+... ..+.  ..|+|+..+.
T Consensus        21 g~~~~tltatV~D~~gnpv~g~--~V~f~~~~~--~~~l~~~~~~~~Td~~G~a~~tltst-~aG~--~~VtA~~~~~   91 (100)
T PF02369_consen   21 GSDTNTLTATVTDANGNPVPGQ--PVTFSSSSS--GGTLSPTNTSATTDSNGIATVTLTST-KAGT--YTVTATVDGG   91 (100)
T ss_dssp             SSS-EEEEEEEEETTSEB-TS---EEEE--EES--SSEES-CEE-EEE-TTSEEEEEEE-S-S-EE--EEEEEEETTE
T ss_pred             CcCcEEEEEEEEcCCCCCCCCC--EEEEEEcCC--CcEEecCccccEECCCEEEEEEEEec-CceE--EEEEEEECCc
Confidence            4667899999999999999964  455511111  0001111  47899999999999754 4455  6677776654


No 76 
>PF06030 DUF916:  Bacterial protein of unknown function (DUF916);  InterPro: IPR010317 This family consists of putative cell surface proteins, from Firmicutes, of unknown function. 
Probab=79.51  E-value=11  Score=37.95  Aligned_cols=75  Identities=23%  Similarity=0.283  Sum_probs=47.8

Q ss_pred             eeeCCEEEEEEEEEcCCCccEEEEEEEeec---C--CeeeeecCCCccCC--CCCCcceeeeEEEEEeCCCeeEEEEEEE
Q psy5522         725 VMRGEVVAIPVVVFNYLSQDLVADVTLENV---G--QFDFADFSNEVDAA--PQPKFEVFRRKKLTIKANSGSTTTFVIT  797 (1397)
Q Consensus       725 v~rGE~~~l~~~VfNyl~~~~~v~v~L~~s---~--~~~~~~~~~~~~~~--~~~~~~~~~~~~~~v~a~~~~tv~f~v~  797 (1397)
                      +.+|++-.|.+.|.|..++.+.+.+.+...   +  .+............  +....-....+.+.|+|++++.|.|.|+
T Consensus        23 ~~P~q~~~l~v~i~N~s~~~~tv~v~~~~A~Tn~nG~I~Y~~~~~~~d~sl~~~~~~~v~~~~~Vtl~~~~sk~V~~~i~  102 (121)
T PF06030_consen   23 VKPGQKQTLEVRITNNSDKEITVKVSANTATTNDNGVIDYSQNNPKKDKSLKYPFSDLVKIPKEVTLPPNESKTVTFTIK  102 (121)
T ss_pred             eCCCCEEEEEEEEEeCCCCCEEEEEEEeeeEecCCEEEEECCCCcccCcccCcchHHhccCCcEEEECCCCEEEEEEEEE
Confidence            678999999999999999999999887642   1  12222221111000  0000001123349999999999999998


Q ss_pred             -ec
Q psy5522         798 -PK  799 (1397)
Q Consensus       798 -p~  799 (1397)
                       |.
T Consensus       103 ~P~  105 (121)
T PF06030_consen  103 MPK  105 (121)
T ss_pred             cCC
Confidence             54


No 77 
>PF09315 DUF1973:  Domain of unknown function (DUF1973);  InterPro: IPR015394 These functionally uncharacterised domains are found in various eukaryotic calcium-dependent chloride channels. 
Probab=78.80  E-value=27  Score=37.56  Aligned_cols=46  Identities=24%  Similarity=0.389  Sum_probs=35.0

Q ss_pred             ceEEEEECCCCCeEEeeeeeccCCceeeeeeecCCCCCCeeEEEEEEEc
Q psy5522         133 ALEIFITDGKGNRIKQWTRALTTRGVFSADLQLSKSPVLGDWNITINVL  181 (1397)
Q Consensus       133 ~~~v~I~DP~g~~I~qw~~~~~~~Gi~~~sf~L~~~~~lG~y~I~v~~~  181 (1397)
                      .+.|.|.||+|+......   .+...-...|.||..+..|.|++.+...
T Consensus        42 ~p~i~L~~P~G~~~~~~~---~d~~~~~~~i~ipg~ae~G~W~y~i~~~   87 (179)
T PF09315_consen   42 PPSITLTDPSGTVYTTFT---TDSNSKTARIQIPGTAEVGTWTYSITNT   87 (179)
T ss_pred             CceEEEECCCCCEEeeeE---EcccccEEEEECCCCcccccEEEEEecC
Confidence            567889999999998832   2234455667788889999999999543


No 78 
>PF14545 DBB:  Dof, BCAP, and BANK (DBB) motif,
Probab=76.01  E-value=18  Score=37.05  Aligned_cols=84  Identities=17%  Similarity=0.199  Sum_probs=54.9

Q ss_pred             EEEeeCeeecCCeEEEEEEeec-CCccEEEEEEEEeecCCCCeeEEeEEEEECCCceeEEEEEcCCC---CCceeEEEEE
Q psy5522           3 TVVAPKVLRPNGEYHVAVSTQA-VSEATQVSVEVGGKQDSGGEFRTKQLLSVDPFSTRIVKLDIGDL---GPGQYNLTVK   78 (1397)
Q Consensus         3 ~v~~P~~lr~g~~~~v~v~l~~-~~~~v~v~v~L~~~~~~~~~~~~~~~~~~~~~~~~~~~f~vP~~---~~~~~~l~v~   78 (1397)
                      +++.|+.++.|.+ +|.|-|.+ ..+..+++|.+...   +  .. .+.+.....+...+.|++|+.   ++|.+.++|.
T Consensus         1 f~V~P~rI~cg~~-~vfIiL~~~l~~~~~~eVef~~~---n--~~-~~~~~~~~~N~yt~~~~aPd~~~~pag~V~v~v~   73 (142)
T PF14545_consen    1 FVVQPSRIRCGQP-EVFIILRDPLDEEDTVEVEFESN---N--KP-IRRVPAKWENPYTLQFKAPDFCLEPAGSVNVRVY   73 (142)
T ss_pred             CeecCceeecCCC-EEEEEEeCCCCCCCeEEEEEEeC---C--Ce-eEeccceEECCEEEEEECchhcCCCCceEEEEEE
Confidence            4678999999999 99997766 33345677777654   1  11 112222333567889999998   8898888888


Q ss_pred             ecCceeeEeeEEEEEEe
Q psy5522          79 GSGSLNFYNSTGLVYVH   95 (1397)
Q Consensus        79 g~~~~~f~~~~~v~~~~   95 (1397)
                      - +|.. .-.++|.+.+
T Consensus        74 ~-~g~~-~~~~~ikyys   88 (142)
T PF14545_consen   74 C-DGVS-LGTRQIKYYS   88 (142)
T ss_pred             E-CCEE-EEEEeEEEEe
Confidence            7 4442 2344555443


No 79 
>PF10633 NPCBM_assoc:  NPCBM-associated, NEW3 domain of alpha-galactosidase;  InterPro: IPR018905 This domain has been named NEW3, but its function is not known. It is found on proteins which are bacterial galactosidases [].; PDB: 1EUT_A 2BZD_A 1WCQ_C 2BER_A 1W8O_A 1EUU_A 1W8N_A.
Probab=75.13  E-value=15  Score=33.54  Aligned_cols=54  Identities=28%  Similarity=0.391  Sum_probs=28.1

Q ss_pred             ceeEEEEEecCCCeeeEEEEEEEEecCCCceeEEEEEecCCCCcCCCCcEEEEEE
Q psy5522         817 DSMEGKLLVKPEGETQYKNKAIFVDLRKNKTFSVNVTLDMPKNIVPGSEHVEVSA  871 (1397)
Q Consensus       817 D~v~k~L~V~peG~~~~~t~s~~l~~~~~~~~~~~i~l~~P~~~v~gs~~~~vsv  871 (1397)
                      ..+.-.|.+ |+|......-...-.+..+...+.++.+..|+++.+|+-.+.+.+
T Consensus        22 ~~v~~~l~~-P~GW~~~~~~~~~~~l~pG~s~~~~~~V~vp~~a~~G~y~v~~~a   75 (78)
T PF10633_consen   22 TNVSLSLSL-PEGWTVSASPASVPSLPPGESVTVTFTVTVPADAAPGTYTVTVTA   75 (78)
T ss_dssp             SS-EEEEE---TTSE---EEEEE--B-TTSEEEEEEEEEE-TT--SEEEEEEEEE
T ss_pred             eeEEEEEeC-CCCccccCCccccccCCCCCEEEEEEEEECCCCCCCceEEEEEEE
Confidence            334444555 899883333223224444457788899999999999987766654


No 80 
>PF14524 Wzt_C:  Wzt C-terminal domain; PDB: 2R5O_B.
Probab=73.58  E-value=19  Score=36.59  Aligned_cols=73  Identities=15%  Similarity=0.284  Sum_probs=48.4

Q ss_pred             CCcccCCceeEEEEEEEcCCCccCCCcceEEEEECCCCCeEEeeeee------ccC-CceeeeeeecCCCCCCeeEEEEE
Q psy5522         106 KAIYKPGDLVRFRAIVLNSHLKPSVTGALEIFITDGKGNRIKQWTRA------LTT-RGVFSADLQLSKSPVLGDWNITI  178 (1397)
Q Consensus       106 KpIYkPGqtV~fRvv~ld~~l~P~~~~~~~v~I~DP~g~~I~qw~~~------~~~-~Gi~~~sf~L~~~~~lG~y~I~v  178 (1397)
                      ......||++.|++-..-.  ++..+-.+.+.|.|.+|..|......      ... .|.....+.+|..-..|.|.|.+
T Consensus        28 ~~~~~~ge~~~i~i~~~~~--~~i~~~~~~~~i~~~~g~~v~~~~t~~~~~~~~~~~~g~~~~~~~i~~~L~~G~Y~i~v  105 (142)
T PF14524_consen   28 TSSFESGEPIRIRIDYEVN--EDIDDPVFGFAIRDSDGQRVFGTNTYDSGFPIPLSEGGTYEVTFTIPKPLNPGEYSISV  105 (142)
T ss_dssp             -SSEETTSEEEEEEEEEES--S-EEEEEEEEEEEETT--EEEEEEHHHHT--EEE-TT-EEEEEEEEE--B-SEEEEEEE
T ss_pred             eeEEeCCCEEEEEEEEEEC--CCCCccEEEEEEEcCCCCEEEEECccccCccccccCCCEEEEEEEEcCccCCCeEEEEE
Confidence            3567899999999876542  23334568899999999998853221      112 78888888888878889999999


Q ss_pred             EE
Q psy5522         179 NV  180 (1397)
Q Consensus       179 ~~  180 (1397)
                      ..
T Consensus       106 ~l  107 (142)
T PF14524_consen  106 GL  107 (142)
T ss_dssp             EE
T ss_pred             EE
Confidence            98


No 81 
>PF12690 BsuPI:  Intracellular proteinase inhibitor;  InterPro: IPR020481 BsuPI is a intracellular proteinase inhibitor that directly regulates the major intracellular proteinase (ISP-1) activity in vivo. It inhibits ISP-1 in the early stages of sporulation and then may be inactivated by a membrane-bound proteinase [].; PDB: 3ISY_A.
Probab=73.30  E-value=15  Score=34.22  Aligned_cols=64  Identities=14%  Similarity=0.312  Sum_probs=31.5

Q ss_pred             eEEEEEEEcCCCccC-----CCcceEEEEECCCCCeEEeeeeec----------c-CCceeeeeeecCCCCC-CeeEEEE
Q psy5522         115 VRFRAIVLNSHLKPS-----VTGALEIFITDGKGNRIKQWTRAL----------T-TRGVFSADLQLSKSPV-LGDWNIT  177 (1397)
Q Consensus       115 V~fRvv~ld~~l~P~-----~~~~~~v~I~DP~g~~I~qw~~~~----------~-~~Gi~~~sf~L~~~~~-lG~y~I~  177 (1397)
                      |.|++...+..-+|+     .+....+.|.|++|+.|-+|..-.          . ..-....++.++..+. .|.|++.
T Consensus         2 v~~~l~v~N~s~~~v~l~f~sgq~~D~~v~d~~g~~vwrwS~~~~FtQal~~~~l~pGe~~~~~~~~~~~~~~~G~Y~~~   81 (82)
T PF12690_consen    2 VEFTLTVTNNSDEPVTLQFPSGQRYDFVVKDKEGKEVWRWSDGKMFTQALQEETLEPGESLTYEETWDLKDLSPGEYTLE   81 (82)
T ss_dssp             EEEEEEEEE-SSS-EEEEESSS--EEEEEE-TT--EEEETTTT-------EEEEE-TT-EEEEEEEESS----SEEEEEE
T ss_pred             EEEEEEEEeCCCCeEEEEeCCCCEEEEEEECCCCCEEEEecCCchhhheeeEEEECCCCEEEEEEEECCCCCCCceEEEe
Confidence            455555555443332     467899999999999999996421          1 1112333344444332 7888876


Q ss_pred             E
Q psy5522         178 I  178 (1397)
Q Consensus       178 v  178 (1397)
                      |
T Consensus        82 a   82 (82)
T PF12690_consen   82 A   82 (82)
T ss_dssp             E
T ss_pred             C
Confidence            5


No 82 
>PF12975 DUF3859:  Domain of unknown function (DUF3859);  InterPro: IPR024331 This entry represents a functionally uncharacterised domain, which is found in bacterial proteins.; PDB: 3N6Y_B.
Probab=73.00  E-value=13  Score=37.48  Aligned_cols=58  Identities=26%  Similarity=0.433  Sum_probs=35.4

Q ss_pred             eEEEEECC------CCCeEEeeeeecc--CCceeeeeeecCCCCCCeeEEEEEEEcC-eEEEEEEEE
Q psy5522         134 LEIFITDG------KGNRIKQWTRALT--TRGVFSADLQLSKSPVLGDWNITINVLD-QKFTKRFTV  191 (1397)
Q Consensus       134 ~~v~I~DP------~g~~I~qw~~~~~--~~Gi~~~sf~L~~~~~lG~y~I~v~~~~-~~~~~~F~V  191 (1397)
                      +++.|.-|      +|..+..|.....  .+-..-..|.-+.++..|.|++.+..++ ...+++|+|
T Consensus        59 lt~~i~HP~ip~~~~g~~~~~~~~~v~~~~~~~~~~~f~~~~e~v~G~Wr~~i~~~g~~i~~ktF~V  125 (125)
T PF12975_consen   59 LTVCITHPGIPDGDKGDTVEPWQGEVYVGSNDWDFYTFDTPWEPVPGEWRFTIELDGKVIAEKTFEV  125 (125)
T ss_dssp             EEEEEEEEEETTS-EEEEEEEEE---TT-S-EEEEEE--SGGG---EEEEEEEEETTEEEEEEEEEE
T ss_pred             EEEEEeCCCCCCCCCCCEECcEEEEecCCCcceEEEEcCcccccCCceEEEEEEECCEEEEEEEEeC
Confidence            55555543      5666677775422  2334555677778888999999999998 667999987


No 83 
>PF04734 Ceramidase_alk:  Neutral/alkaline non-lysosomal ceramidase;  InterPro: IPR006823 This family represents a group of neutral/alkaline ceramidases found in both bacteria and eukaryotes [, , ]. They hydrolyse the sphingolipid ceramide into sphingosine and free fatty acid.; PDB: 2ZXC_A 2ZWS_A.
Probab=72.76  E-value=16  Score=47.36  Aligned_cols=82  Identities=18%  Similarity=0.182  Sum_probs=48.6

Q ss_pred             cEEEEEcCCCcccCCceeEEEEEEEcCC--CccCCCcceEEEEE-----------CCCCCeEEeeeeeccCC--ceeeee
Q psy5522          98 YSVFIQLDKAIYKPGDLVRFRAIVLNSH--LKPSVTGALEIFIT-----------DGKGNRIKQWTRALTTR--GVFSAD  162 (1397)
Q Consensus        98 ~~vfIqTDKpIYkPGqtV~fRvv~ld~~--l~P~~~~~~~v~I~-----------DP~g~~I~qw~~~~~~~--Gi~~~s  162 (1397)
                      +.+..| -++.|+|||+|.+..+.-++.  ++- .+.++.|+-.           |.+-....+|.......  ..+..+
T Consensus       557 Gdv~~~-~~~~y~~g~~v~~~F~~a~Prn~l~~-~~tfl~Ver~~~~~~W~~v~~D~dw~t~f~W~r~~~~~~~S~~ti~  634 (674)
T PF04734_consen  557 GDVLQQ-PNPSYRPGDTVSATFVGANPRNNLRL-EGTFLTVERLESGGSWQTVADDADWSTRFRWKRTGSLLGTSEVTIE  634 (674)
T ss_dssp             T-EEE----SEEETT-EEEEEEEE--GGG---T-TS-SEEEEEEES-S--EEEEETTSTTEEEEEEEETTT--EEEEEEE
T ss_pred             CceEeC-CcCccCCCCeEEEEEeeeCCCCccCC-CCCeEEEEEecCCCCeEEEEeCCCccEEEEEEecCCccccEEEEEE
Confidence            445555 568899999999998887653  333 3456655543           45556677898665553  356778


Q ss_pred             eecCCCCCCeeEEEEEEEc
Q psy5522         163 LQLSKSPVLGDWNITINVL  181 (1397)
Q Consensus       163 f~L~~~~~lG~y~I~v~~~  181 (1397)
                      +.+|.+.+.|.|+|+..-.
T Consensus       635 W~ip~~~~~G~YRi~~~G~  653 (674)
T PF04734_consen  635 WEIPPDTPPGTYRIRHFGD  653 (674)
T ss_dssp             EE--TT--SEEEEEEEEEE
T ss_pred             EECCCCCCCCCEEEEEEee
Confidence            8899999999999998764


No 84 
>KOG0366|consensus
Probab=71.71  E-value=11  Score=41.95  Aligned_cols=80  Identities=18%  Similarity=0.234  Sum_probs=52.1

Q ss_pred             HHHHHHHhccccHHHHHHHHHHHHhccccceeeecCCCCccccCCCCCCCCcchhHHHHHHHHHccCCCccCHHHHHHHH
Q psy5522         907 VLEYLKKTYQLTDAIEAKASRYLETGYQQELTYRRPDGSFSAFGTTDPNGSTWLTAFVAKSFRQAASHTTIDESVILEAL  986 (1397)
Q Consensus       907 ~l~yL~~~~~l~~~~~~~a~~~l~~gy~~~l~y~~~dGsfs~~~~~~~~~s~wlTAyvl~~~~~a~~~i~vd~~~~~~~~  986 (1397)
                      +++.|.   |+..-.+++.+.|+.       +-|+.+|+|+...+.  .+.+--|-+|+..+..-...-.+|   .++.+
T Consensus        56 al~ll~---~le~~~~eeiv~~v~-------~C~~~~GGfa~~~Gh--d~hll~TlsAvQiL~~ydsi~~~d---~d~v~  120 (329)
T KOG0366|consen   56 ALDLLG---QLERMSREEIVSFVL-------SCQHEDGGFAGCPGH--DPHLLYTLSAVQILALYDSINVLD---RDKVA  120 (329)
T ss_pred             HHHHhc---ccchhhHHHHHHHHH-------heecCCCCcCCCCCC--ChHHHHHHHHHHHHHHHccccccc---HHHHH
Confidence            444443   333333556666554       478999999988764  345666777777666544332234   34468


Q ss_pred             HHHHhcccCCCcccc
Q psy5522         987 AWLSSNQAVNGSFPE 1001 (1397)
Q Consensus       987 ~WL~~~q~~~G~F~s 1001 (1397)
                      .|+..-|+++|+|..
T Consensus       121 ~yi~gLq~edGsF~g  135 (329)
T KOG0366|consen  121 SYIKGLQQEDGSFSG  135 (329)
T ss_pred             HHHHhhcCcCCcccC
Confidence            999999999999974


No 85 
>PF10648 Gmad2:  Immunoglobulin-like domain of bacterial spore germination;  InterPro: IPR018911  This domain is found linked to IPR019606 from INTERPRO in some bacterial proteins. It is predicted to contain an immunoglobulin-like all-beta fold. 
Probab=69.66  E-value=39  Score=31.92  Aligned_cols=51  Identities=18%  Similarity=0.320  Sum_probs=40.5

Q ss_pred             CCcceEEEEECCCCCeEEeeeee----ccCCceeeeeeecCCCCCCeeEEEEEEEc
Q psy5522         130 VTGALEIFITDGKGNRIKQWTRA----LTTRGVFSADLQLSKSPVLGDWNITINVL  181 (1397)
Q Consensus       130 ~~~~~~v~I~DP~g~~I~qw~~~----~~~~Gi~~~sf~L~~~~~lG~y~I~v~~~  181 (1397)
                      .+..+.+.|+|.+|+.+.+-...    ....|-|+.++.++..+ .|.++|.+...
T Consensus        27 FEgtv~~rv~D~~g~vl~e~~~~a~~g~~~~g~F~~tv~~~~~~-~~~g~l~v~~~   81 (88)
T PF10648_consen   27 FEGTVNIRVRDGHGEVLAEGFVTATGGAPSWGPFEGTVSFPPPP-PGKGTLEVFED   81 (88)
T ss_pred             eeeEEEEEEEcCCCcEEEEeeEEeccCCCcccceEEEEEeCCCC-CCceEEEEEEe
Confidence            47889999999999998554322    34568899999999887 89999998754


No 86 
>PRK10301 hypothetical protein; Provisional
Probab=68.99  E-value=14  Score=37.31  Aligned_cols=61  Identities=21%  Similarity=0.177  Sum_probs=40.1

Q ss_pred             cceEEEEECCCCCeEEeeeeeccCCceeeeeeecCCCCCCeeEEEEEEEcC-----eEEEEEEEEe
Q psy5522         132 GALEIFITDGKGNRIKQWTRALTTRGVFSADLQLSKSPVLGDWNITINVLD-----QKFTKRFTVA  192 (1397)
Q Consensus       132 ~~~~v~I~DP~g~~I~qw~~~~~~~Gi~~~sf~L~~~~~lG~y~I~v~~~~-----~~~~~~F~Ve  192 (1397)
                      ....|.+.|++|+.+..-........--....+|+.....|.|+|+...-+     ...+..|+|+
T Consensus        59 ~~s~i~v~~~~g~~v~~~~~~~~~~~~~~~~v~l~~~L~~G~YtV~Wrvvs~DGH~~~G~~~F~V~  124 (124)
T PRK10301         59 GFSGATITGPKQENIKTLPAKRNEQDQKQLIVPLADSLKPGTYTVDWHVVSVDGHKTKGHYTFSVK  124 (124)
T ss_pred             cccEEEEEcCCCCEeccCCccccCCCCcEEEEECCCCCCCccEEEEEEEEecCCCccCCeEEEEEC
Confidence            456688899999876533222222333456778877778899999988743     3456677764


No 87 
>PF13620 CarboxypepD_reg:  Carboxypeptidase regulatory-like domain; PDB: 3MN8_D 3P0D_I 3KCP_A 2B59_B 1UWY_A 1H8L_A 1QMU_A 2NSM_A.
Probab=68.48  E-value=30  Score=31.55  Aligned_cols=57  Identities=21%  Similarity=0.372  Sum_probs=36.7

Q ss_pred             EEEEEEcCCCccCCCcceEEEEECCCCCeEEeeeeeccCCceeeeeeecCCCCCCeeEEEEEEEcC
Q psy5522         117 FRAIVLNSHLKPSVTGALEIFITDGKGNRIKQWTRALTTRGVFSADLQLSKSPVLGDWNITINVLD  182 (1397)
Q Consensus       117 fRvv~ld~~l~P~~~~~~~v~I~DP~g~~I~qw~~~~~~~Gi~~~sf~L~~~~~lG~y~I~v~~~~  182 (1397)
                      +++.++|.+-+|+  ....|.|.+..+..+.  ...++..|-|.+.- |   ++ |.|+|++...+
T Consensus         2 I~G~V~d~~g~pv--~~a~V~l~~~~~~~~~--~~~Td~~G~f~~~~-l---~~-g~Y~l~v~~~g   58 (82)
T PF13620_consen    2 ISGTVTDATGQPV--PGATVTLTDQDGGTVY--TTTTDSDGRFSFEG-L---PP-GTYTLRVSAPG   58 (82)
T ss_dssp             EEEEEEETTSCBH--TT-EEEET--TTTECC--EEE--TTSEEEEEE-E----S-EEEEEEEEBTT
T ss_pred             EEEEEEcCCCCCc--CCEEEEEEEeeCCCEE--EEEECCCceEEEEc-c---CC-EeEEEEEEECC
Confidence            4677778888886  4456778887776543  35678888877661 2   22 99999998876


No 88 
>PF11589 DUF3244:  Domain of unknown function (DUF3244);  InterPro: IPR021638  This family of proteins with unknown function appear to be restricted to Bacteroidetes. The protein may have an immunoglobulin-like beta-sandwich fold however this cannot be confirmed. ; PDB: 3D33_B 3SD2_A.
Probab=67.87  E-value=30  Score=33.79  Aligned_cols=59  Identities=22%  Similarity=0.281  Sum_probs=39.9

Q ss_pred             CcceEEEEECCCCCeEEeeeeeccCCceeeeeeecCCCCCCeeEEEEEEEcCe-EEEEEEEEe
Q psy5522         131 TGALEIFITDGKGNRIKQWTRALTTRGVFSADLQLSKSPVLGDWNITINVLDQ-KFTKRFTVA  192 (1397)
Q Consensus       131 ~~~~~v~I~DP~g~~I~qw~~~~~~~Gi~~~sf~L~~~~~lG~y~I~v~~~~~-~~~~~F~Ve  192 (1397)
                      ...++|.|.|..|+.|.+-.......  -..++.|... ..|.|.|.+..+.. .....|.+|
T Consensus        47 ~~~vtI~I~d~~G~vVy~~~~~~~~~--~~~~I~L~~~-~~G~Y~l~i~~~~g~~l~G~F~ie  106 (106)
T PF11589_consen   47 IGDVTITIKDSTGNVVYSETVSNSAG--QSITIDLNGL-PSGEYTLEITNGNGTYLYGEFTIE  106 (106)
T ss_dssp             -SEEEEEEEETT--EEEEEEESCGGT--TEEEEE-TTS--SEEEEEEEEECTC-EEEEEEEE-
T ss_pred             CCCEEEEEEeCCCCEEEEEEccCCCC--cEEEEEeCCC-CCccEEEEEEeCCCCEEEEEEEEC
Confidence            46689999999999998865443332  3457777665 48999999998864 677788775


No 89 
>smart00634 BID_1 Bacterial Ig-like domain (group 1).
Probab=67.80  E-value=41  Score=31.79  Aligned_cols=80  Identities=16%  Similarity=0.235  Sum_probs=57.2

Q ss_pred             EEEEcCCCcccCC--ceeEEEEEEEcCCCccCCCcceEEEEECCCCCeEEeeeeeccCCceeeeeeecCCCCCCeeEEEE
Q psy5522         100 VFIQLDKAIYKPG--DLVRFRAIVLNSHLKPSVTGALEIFITDGKGNRIKQWTRALTTRGVFSADLQLSKSPVLGDWNIT  177 (1397)
Q Consensus       100 vfIqTDKpIYkPG--qtV~fRvv~ld~~l~P~~~~~~~v~I~DP~g~~I~qw~~~~~~~Gi~~~sf~L~~~~~lG~y~I~  177 (1397)
                      +-+..||...+++  +.+.+.+-++|.+-.|+.+..+.+.+..+.-..+..-...++.+|.....+. +.  ..|.++|+
T Consensus         3 i~l~~~~~~~~Adg~d~~~i~v~v~D~~Gnpv~~~~V~f~~~~~~~~~~~~~~~~Td~~G~a~~~l~-~~--~~G~~~vt   79 (92)
T smart00634        3 TTLTADKDTAVANGSDAITLTATVTDANGNPVAGQEVTFTTPSGGALTLSKGTATTDANGIATVTLT-ST--TAGVYTVT   79 (92)
T ss_pred             EEEEeCCCcEEEcCcccEEEEEEEECCCCCCcCCCEEEEEECCCceeeccCCeeeeCCCCEEEEEEE-CC--CCcEEEEE
Confidence            3455677777777  8899999999999999877777777754420111122235677999999988 32  67999999


Q ss_pred             EEEcC
Q psy5522         178 INVLD  182 (1397)
Q Consensus       178 v~~~~  182 (1397)
                      +...+
T Consensus        80 a~~~~   84 (92)
T smart00634       80 ASLEN   84 (92)
T ss_pred             EEECC
Confidence            98875


No 90 
>KOG0497|consensus
Probab=67.66  E-value=5.7  Score=49.94  Aligned_cols=67  Identities=24%  Similarity=0.310  Sum_probs=49.4

Q ss_pred             hccccceeeecCCCCc-cccCCCCCCCCcchhHHHHHHHHHccCCCccCHHHHHHHHHHHHhcccCCCccccc
Q psy5522         931 TGYQQELTYRRPDGSF-SAFGTTDPNGSTWLTAFVAKSFRQAASHTTIDESVILEALAWLSSNQAVNGSFPEV 1002 (1397)
Q Consensus       931 ~gy~~~l~y~~~dGsf-s~~~~~~~~~s~wlTAyvl~~~~~a~~~i~vd~~~~~~~~~WL~~~q~~~G~F~st 1002 (1397)
                      .+.+-+-+.|.+|||| +.|+-    =-+..|=|++..+..|++.+. +-..+.+++.+|.+.|+.+||+.+.
T Consensus       593 ~av~~ie~~Q~~DGSWyGsWgv----CFtY~t~Fa~~gl~aaGkty~-nc~~irka~~Fll~~Q~~~GGWgEs  660 (760)
T KOG0497|consen  593 KAVEFIEKLQLPDGSWYGSWGV----CFTYGTWFALRGLAAAGKTYE-NCEAIRKACDFLLSKQNPDGGWGES  660 (760)
T ss_pred             HHHHHHHHcCCCCCcccchhhH----HHHHHHHHhcchhhhcchhhh-ccHHHHHHHHHHHhhhcccCCCccc
Confidence            3333333469999985 34432    236789999999998876544 3567899999999999999999864


No 91 
>PF15418 DUF4625:  Domain of unknown function (DUF4625)
Probab=67.55  E-value=23  Score=36.11  Aligned_cols=74  Identities=19%  Similarity=0.332  Sum_probs=50.7

Q ss_pred             CCCcccCCceeEEEEEEEcCCCccCCCcceEEEEE---CC--CCCe----EEeeeeec-------cCCceeeeeeecCCC
Q psy5522         105 DKAIYKPGDLVRFRAIVLNSHLKPSVTGALEIFIT---DG--KGNR----IKQWTRAL-------TTRGVFSADLQLSKS  168 (1397)
Q Consensus       105 DKpIYkPGqtV~fRvv~ld~~l~P~~~~~~~v~I~---DP--~g~~----I~qw~~~~-------~~~Gi~~~sf~L~~~  168 (1397)
                      +--+++.|++++|++-..+...    =+.+.|+|-   |-  .+..    ...|....       .....+...+.+|.+
T Consensus        28 ~~~~~~~G~~ihfe~~i~d~~~----i~si~VeIH~nfd~H~h~~~~~~~~~~~~~~~~~~~~~g~~~~~~h~~i~IPa~  103 (132)
T PF15418_consen   28 NCKVATRGDDIHFEADISDNSA----IKSIKVEIHNNFDHHTHSTEAGECEKPWVFEQDYDIYGGKKNYDFHEHIDIPAD  103 (132)
T ss_pred             CCeEEecCCcEEEEEEEEcccc----eeEEEEEEecCcCcccccccccccccCcEEEEEEcccCCcccEeEEEeeeCCCC
Confidence            4567789999999998877532    245677771   21  1111    33444321       224578999999999


Q ss_pred             CCCeeEEEEEEEcC
Q psy5522         169 PVLGDWNITINVLD  182 (1397)
Q Consensus       169 ~~lG~y~I~v~~~~  182 (1397)
                      +..|.|.+.+..-+
T Consensus       104 a~~G~YH~~i~VtD  117 (132)
T PF15418_consen  104 APAGDYHFMITVTD  117 (132)
T ss_pred             CCCcceEEEEEEEE
Confidence            99999999998865


No 92 
>KOG0365|consensus
Probab=66.47  E-value=22  Score=41.22  Aligned_cols=100  Identities=23%  Similarity=0.348  Sum_probs=59.7

Q ss_pred             CCchhhhhcccchHHHHHHHHHHhccccHHHHHHHHHHH--HhccccceeeecCCCCccccCCCCCCCCc-chhHHHHHH
Q psy5522         891 FGCGEQNMLNFVPNIVVLEYLKKTYQLTDAIEAKASRYL--ETGYQQELTYRRPDGSFSAFGTTDPNGST-WLTAFVAKS  967 (1397)
Q Consensus       891 ~GCgEQ~m~~~~p~i~~l~yL~~~~~l~~~~~~~a~~~l--~~gy~~~l~y~~~dGsfs~~~~~~~~~s~-wlTAyvl~~  967 (1397)
                      ||-|--...++||+-.+..-|---+.      ++|.+-|  +.=||-+.+.|++||||..-.+    |.+ --.||...+
T Consensus       139 fgGGPGQl~HLA~TYAAVnaL~~~~~------e~A~~~InR~~l~~fL~slK~~dGgFrmh~~----GE~DvRs~YcA~s  208 (423)
T KOG0365|consen  139 FGGGPGQLPHLAPTYAAVNALCLCGS------EDAYSSINREKLYQFLFSLKDPDGGFRMHVE----GEVDVRSAYCALS  208 (423)
T ss_pred             CCCCCccchhhhHHHHHHHHHHhcCc------HHHHHHhhHHHHHHHHHHhcCCCCCeEeecC----CcchHHHHHHHHH
Confidence            44455556788887544433332221      1122222  1223344557999999977533    233 236888777


Q ss_pred             HHHccCCCccCHHHHHHHHHHHHhcccCCCccccc
Q psy5522         968 FRQAASHTTIDESVILEALAWLSSNQAVNGSFPEV 1002 (1397)
Q Consensus       968 ~~~a~~~i~vd~~~~~~~~~WL~~~q~~~G~F~st 1002 (1397)
                      ....-. |. +++..+...+||.+-|+..|||++-
T Consensus       209 vaslln-i~-~deL~eG~~~wi~~CQtyEGG~GG~  241 (423)
T KOG0365|consen  209 VASLLN-IP-MDELFEGTLDWIASCQTYEGGFGGE  241 (423)
T ss_pred             HHHHHC-CC-cHHHHHHHHHHHHhcccccCCcCCC
Confidence            654432 22 3466788999999999999999864


No 93 
>PF13860 FlgD_ig:  FlgD Ig-like domain; PDB: 3C12_A 3OSV_A.
Probab=66.06  E-value=24  Score=32.59  Aligned_cols=50  Identities=12%  Similarity=0.191  Sum_probs=35.9

Q ss_pred             CcceEEEEECCCCCeEEeeeeeccCCceeeeeeecCC----CCCCeeEEEEEEE
Q psy5522         131 TGALEIFITDGKGNRIKQWTRALTTRGVFSADLQLSK----SPVLGDWNITINV  180 (1397)
Q Consensus       131 ~~~~~v~I~DP~g~~I~qw~~~~~~~Gi~~~sf~L~~----~~~lG~y~I~v~~  180 (1397)
                      -..+.|.|.|.+|+.|+++.......|.....+.=-+    ..+-|.|++++..
T Consensus        24 a~~v~v~I~d~~G~~V~t~~~~~~~~G~~~~~WdG~d~~G~~~~~G~Y~~~v~a   77 (81)
T PF13860_consen   24 ADNVTVTIYDSNGQVVRTISLGSQSAGEHSFTWDGKDDDGNPVPDGTYTFRVTA   77 (81)
T ss_dssp             CEEEEEEEEETTS-EEEEEEEEECSSEEEEEEE-SB-TTS-B--SEEEEEEEEE
T ss_pred             ccEEEEEEEcCCCCEEEEEEcCCcCCceEEEEECCCCCCcCCCCCCCEEEEEEE
Confidence            4568999999999999998776667788777777221    2456999999875


No 94 
>PF01483 P_proprotein:  Proprotein convertase P-domain;  InterPro: IPR002884 This domain, termed the P domain is approximately 150 amino acids in length and C-terminal to a serine endopeptidase domain which belong to MEROPS peptidase family S8 (clan SB), subfamily S8B (kexin). The domain is primarily associated with the calcium-dependent serine endopeptidases, kex2/subtilisin proprotein convertases (PCs), which have been identified in all eukaryotes [] and in the gammaproteobacteria, Nostoc (cyanobacteria) and in Streptomyces avermitilis. The P domain appears necessary for folding and maintaining the endopeptidase catalytic domain and to regulate its calcium and acidic pH dependence. In addition, contained within the middle of the P domain in most PC family members is the cognate integrin binding RGD sequence [], which may be required for intracellular compartmentalization and maintenance of enzyme stability within the ER. The integrity of the RGD sequence of proprotein convertase PC1 is critical for its zymogen and C-terminal processing and for its cellular trafficking [, ]. The carboxy-terminal tail provides uniqueness to each PC family member being the least conserved region of all convertases [].; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis; PDB: 1P8J_E 1OT5_B 2ID4_A 1R64_A 3HJR_A.
Probab=64.60  E-value=20  Score=33.54  Aligned_cols=51  Identities=18%  Similarity=0.270  Sum_probs=34.6

Q ss_pred             CCcceEEEEECCCCCeEEeeeeecc---CCceeeeeeec---CCCCCCeeEEEEEEE
Q psy5522         130 VTGALEIFITDGKGNRIKQWTRALT---TRGVFSADLQL---SKSPVLGDWNITINV  180 (1397)
Q Consensus       130 ~~~~~~v~I~DP~g~~I~qw~~~~~---~~Gi~~~sf~L---~~~~~lG~y~I~v~~  180 (1397)
                      .-..+.|+|..|.|.++.=+.....   ..+.+...|.+   -.+.+.|+|++++.-
T Consensus        14 ~~gdL~i~L~SP~Gt~~~L~~~~~~d~~~~~~~~~~f~~~~f~Ge~~~G~W~L~v~D   70 (87)
T PF01483_consen   14 YRGDLRITLISPSGTRSTLKDRRGSDDSGSGFLNWTFTSVAFWGESANGTWTLRVTD   70 (87)
T ss_dssp             SGGGEEEEEE-TT--EEEEE-SSTTHHHSTSEEEEEEEESTTTT-B--EEEEEEEEE
T ss_pred             CcCCEEEEEECCCCCEEEEECCcCCCcccccccccEEEEEeecCccCCCEEEEEEEE
Confidence            4577899999999998876655444   56778888875   478889999999987


No 95 
>PTZ00487 ceramidase; Provisional
Probab=63.88  E-value=25  Score=45.59  Aligned_cols=73  Identities=14%  Similarity=0.116  Sum_probs=52.7

Q ss_pred             cccCCceeEEEEEEEcC--CCccCCCcceEEEEEC-----------CCCCeEEeeeeeccCCceeeeeeecCCCCCCeeE
Q psy5522         108 IYKPGDLVRFRAIVLNS--HLKPSVTGALEIFITD-----------GKGNRIKQWTRALTTRGVFSADLQLSKSPVLGDW  174 (1397)
Q Consensus       108 IYkPGqtV~fRvv~ld~--~l~P~~~~~~~v~I~D-----------P~g~~I~qw~~~~~~~Gi~~~sf~L~~~~~lG~y  174 (1397)
                      .|++||+|....+.-++  |++. .+.+..|+-++           .+-+...+|.......--+..++.+|++...|.|
T Consensus       607 ~y~~g~~v~~~F~~a~Prn~l~~-~~tf~~Ve~~~~~~~W~~v~~D~dw~t~~~W~r~~~~~S~~ti~W~i~~~~~~G~Y  685 (715)
T PTZ00487        607 VYSNNDTVSAEFYGGNPRNNFMT-ESSFLTVDKLNEKNQWTTILVDGDWDTKWHWKMHDLGFSLITIIWSIGPTTEPGTY  685 (715)
T ss_pred             ccCCCCEEEEEEEecCCCCcccc-CcceEEEEEecCCCceeEeccCCCcceEEEEeccCCCceeEEEEEECCCCCCCeee
Confidence            49999999988888765  4544 46677777643           3445677887652222346778999999999999


Q ss_pred             EEEEEEc
Q psy5522         175 NITINVL  181 (1397)
Q Consensus       175 ~I~v~~~  181 (1397)
                      +|+-.-.
T Consensus       686 Ri~~~G~  692 (715)
T PTZ00487        686 RITHSGY  692 (715)
T ss_pred             EEEEeec
Confidence            9988654


No 96 
>PF00432 Prenyltrans:  Prenyltransferase and squalene oxidase repeat This Prosite family is a subset of the Pfam family.;  InterPro: IPR001330 The beta subunit of the farnesyltransferases is responsible for peptide binding. Squalene-hopene cyclase is a bacterial enzyme that catalyzes the cyclization of squalene into hopene, a key step in hopanoid (triterpenoid) metabolism []. Lanosterol synthase (5.4.99.7 from EC) (oxidosqualene-lanosterol cyclase) catalyzes the cyclization of (S)-2,3-epoxysqualene to lanosterol, the initial precursor of cholesterol, steroid hormones and vitamin D in vertebrates and of ergosterol in fungi []. Cycloartenol synthase () (2,3-epoxysqualene-cycloartenol cyclase) is a plant enzyme that catalyzes the cyclization of (S)-2,3-epoxysqualene to cycloartenol.; GO: 0003824 catalytic activity; PDB: 2IEJ_B 1LD7_B 1LD8_B 2H6G_B 1TN6_B 1S63_B 1MZC_B 2H6I_B 2H6F_B 1JCQ_B ....
Probab=63.02  E-value=6.9  Score=31.56  Aligned_cols=21  Identities=24%  Similarity=0.332  Sum_probs=18.4

Q ss_pred             HHHHHHHHHhcccCCCccccc
Q psy5522         982 ILEALAWLSSNQAVNGSFPEV 1002 (1397)
Q Consensus       982 ~~~~~~WL~~~q~~~G~F~st 1002 (1397)
                      ++++.+||.+.|+++|||...
T Consensus         3 ~~~~~~~l~~~Q~~dGGf~~~   23 (44)
T PF00432_consen    3 VEKLIRFLLSCQNPDGGFGGR   23 (44)
T ss_dssp             HHHHHHHHHHTBBTTSSBBSS
T ss_pred             HHHHHHHHHHHCCCCCCCCCC
Confidence            467899999999999999865


No 97 
>PRK06655 flgD flagellar basal body rod modification protein; Reviewed
Probab=61.50  E-value=47  Score=37.18  Aligned_cols=75  Identities=21%  Similarity=0.320  Sum_probs=46.1

Q ss_pred             EEEEcCCCcccCCceeEEEEEEEcCCCccCCCcceEEEEECCCCCeEEeeeeeccCCceeeeeeecCCC----CCCeeEE
Q psy5522         100 VFIQLDKAIYKPGDLVRFRAIVLNSHLKPSVTGALEIFITDGKGNRIKQWTRALTTRGVFSADLQLSKS----PVLGDWN  175 (1397)
Q Consensus       100 vfIqTDKpIYkPGqtV~fRvv~ld~~l~P~~~~~~~v~I~DP~g~~I~qw~~~~~~~Gi~~~sf~L~~~----~~lG~y~  175 (1397)
                      +.+.+|.-....|..+.+++. +     |.....+.|+|.|.+|+.|+.-.......|...+++.=-+.    .+-|.|+
T Consensus       100 V~~~~~~~~~~~~~~~~~~~~-l-----~~~a~~vti~I~D~~G~~Vrt~~lg~~~aG~~~f~WDG~d~~G~~lp~G~Yt  173 (225)
T PRK06655        100 VLVPGDTVLVGTGGTTPFGVE-L-----PSAADNVTVTITDSAGQVVRTIDLGAQSAGVVSFTWDGTDTDGNALPDGNYT  173 (225)
T ss_pred             EEEecceEEecCCCceEEEEE-c-----CCCCcEEEEEEEcCCCCEEEEEecCCcCCCceeEEECCCCCCCCcCCCeeEE
Confidence            334444433344556666633 2     21245689999999999998765434556776665543222    4579999


Q ss_pred             EEEEE
Q psy5522         176 ITINV  180 (1397)
Q Consensus       176 I~v~~  180 (1397)
                      ++|..
T Consensus       174 ~~V~A  178 (225)
T PRK06655        174 IKASA  178 (225)
T ss_pred             EEEEE
Confidence            88865


No 98 
>PF04234 CopC:  CopC domain;  InterPro: IPR007348 CopC is a bacterial blue copper protein that binds 1 atom of copper per protein molecule. Along with CopA, CopC mediates copper resistance by sequestration of copper in the periplasm [].; GO: 0005507 copper ion binding, 0046688 response to copper ion, 0042597 periplasmic space; PDB: 1IX2_B 1LYQ_A 2C9P_C 2C9R_A 2C9Q_A 1M42_A 1OT4_A 1NM4_A.
Probab=61.16  E-value=9.4  Score=36.63  Aligned_cols=60  Identities=18%  Similarity=0.224  Sum_probs=38.5

Q ss_pred             cceEEEEECCCCCeEEeeeeeccCCceeeeeeecCCCCCCeeEEEEEEEcC-----eEEEEEEEEe
Q psy5522         132 GALEIFITDGKGNRIKQWTRALTTRGVFSADLQLSKSPVLGDWNITINVLD-----QKFTKRFTVA  192 (1397)
Q Consensus       132 ~~~~v~I~DP~g~~I~qw~~~~~~~Gi~~~sf~L~~~~~lG~y~I~v~~~~-----~~~~~~F~Ve  192 (1397)
                      ....|.|.|++|+++..-...... +--.....|+.....|.|+|....-+     ...+..|.|+
T Consensus        33 ~~s~v~v~~~~g~~v~~~~~~~~~-~~~~~~~~l~~~l~~G~YtV~wrvvs~DGH~~~G~~~F~V~   97 (97)
T PF04234_consen   33 GFSSVTVTDPDGKRVDLGEPTVDG-DGKTLTVPLPPPLPPGTYTVSWRVVSADGHPVSGSFSFTVK   97 (97)
T ss_dssp             CC-EEEEEEEEETTSCTCEEEEEE-STTEEEEEESS---SEEEEEEEEEEETTSCEEEEEEEEEE-
T ss_pred             CccEEEEEcCCCceeecCcceecC-CceEEEEECCCCCCCceEEEEEEEEecCCCCcCCEEEEEEC
Confidence            566788999999877533222222 34577888888788899999988742     4567788774


No 99 
>KOG4386|consensus
Probab=60.65  E-value=17  Score=44.06  Aligned_cols=86  Identities=17%  Similarity=0.242  Sum_probs=73.1

Q ss_pred             CeeEEEeeeEEEEecCCceeeeCCEEEEEEEEEcCCCccEEEEEEEeecCCeeeeecCCCccCCCCCCcceeeeEEEEEe
Q psy5522         706 PKKLRVFRPFFISLDLPYSVMRGEVVAIPVVVFNYLSQDLVADVTLENVGQFDFADFSNEVDAAPQPKFEVFRRKKLTIK  785 (1397)
Q Consensus       706 ~~~l~vfkpffv~l~LPysv~rGE~~~l~~~VfNyl~~~~~v~v~L~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~  785 (1397)
                      |.-+.-.=|+-|...||.+-+..|.+.++--+-|-.+..++|++.++.+|.|-|.               +..+-++.|-
T Consensus       685 phviVe~iPlhvnadlpsfgrVReslpvkyhLqnktdlvqdveisvepsDaFMFS---------------GlkqirlriL  749 (809)
T KOG4386|consen  685 PHVIVEAIPLHVNADLPSFGRVRESLPVKYHLQNKTDLVQDVEISVEPSDAFMFS---------------GLKQIRLRIL  749 (809)
T ss_pred             ccceeeeccceeecCCCCcceecccccEEEEeccccceeeeEEeecccchhheec---------------ccceEEEEEc
Confidence            4455567799999999999999999999999999999999999999999999773               2345678888


Q ss_pred             CCCeeEEEEEEEeceeeeEEE
Q psy5522         786 ANSGSTTTFVITPKELGYIGI  806 (1397)
Q Consensus       786 a~~~~tv~f~v~p~~lG~v~i  806 (1397)
                      +|..+.+.+.+-|...|..++
T Consensus       750 PGteqemlynfypLmAGyqql  770 (809)
T KOG4386|consen  750 PGTEQEMLYNFYPLMAGYQQL  770 (809)
T ss_pred             CCCceEEEEEEehhhchhhhC
Confidence            999999999999988775443


No 100
>PF13115 YtkA:  YtkA-like
Probab=59.80  E-value=87  Score=28.98  Aligned_cols=69  Identities=19%  Similarity=0.177  Sum_probs=39.9

Q ss_pred             eeeEEEEEecCCcccCCCeEEEEEEEEccCCCccCCCCceEEEEEeee-cc-cc-ccceeeEeeCCCeeEEEEEeCC
Q psy5522         311 HKYKMDLIKSSEYFKPGLKYTAYMKLTHHDGTPVTDNNNMVQVRHGFS-YD-ES-KYEANQYKLDRNGMIKLVYYPP  384 (1397)
Q Consensus       311 ~~~~i~f~~~~~~~kpG~p~~~~v~v~~~dG~P~~~~~~~V~v~~~~~-~~-~~-~~~~~~~ttd~~G~~~f~i~~~  384 (1397)
                      ..|.|.+...+ ..+.|.+ .+.+. .+.+|+|+.+.  .|.+..... .. .. .......+..+.|.....+..+
T Consensus         5 ~~~~v~l~~~~-~~~~g~~-~i~v~-~~~~g~pv~~a--~V~~~~~m~~~~g~~~~~~~~~~~~~~~G~Y~~~~~f~   76 (86)
T PF13115_consen    5 GGYTVELVSPE-PPKVGEN-TITVT-VDQGGKPVTDA--DVQFEIWMPDMEGMEPMTSKVELEETGPGVYEAEVTFS   76 (86)
T ss_pred             ccEEEEEecCC-CCcCCce-EEEEE-ECCCCCCCCCC--EEEEEEEeCCCCCCCCCceeeeeecCCCCeEEEEeecC
Confidence            34667776554 5688888 78888 89999999965  455543211 00 00 0112223334567666665554


No 101
>PF02369 Big_1:  Bacterial Ig-like domain (group 1);  InterPro: IPR003344 Proteins that contain this domain are found in a variety of bacterial and phage surface proteins such as intimins. Intimin is a bacterial cell-adhesion molecule that mediates the intimate bacterial host-cell interaction. It contains three domains; two immunoglobulin-like domains and a C-type lectin-like module implying that carbohydrate recognition may be important in intimin-mediated cell adhesion [].; PDB: 1CWV_A 4E9L_A 1F02_I 1F00_I.
Probab=59.27  E-value=1.3e+02  Score=29.04  Aligned_cols=78  Identities=19%  Similarity=0.251  Sum_probs=49.8

Q ss_pred             EEEcCCCcccCCceeEEEEEEEcCCCccCCCcceEEEEECCCCCeEEee--eeeccCCceeeeeeecCCCCCCeeEEEEE
Q psy5522         101 FIQLDKAIYKPGDLVRFRAIVLNSHLKPSVTGALEIFITDGKGNRIKQW--TRALTTRGVFSADLQLSKSPVLGDWNITI  178 (1397)
Q Consensus       101 fIqTDKpIYkPGqtV~fRvv~ld~~l~P~~~~~~~v~I~DP~g~~I~qw--~~~~~~~Gi~~~sf~L~~~~~lG~y~I~v  178 (1397)
                      .+..|+.+=--++.+.+.+.+.|.+..|+.+..+.+. .++.+-.+..-  ...++..|.+...|. +.  ..|.++|.+
T Consensus        11 ~~~~~~~~a~g~~~~tltatV~D~~gnpv~g~~V~f~-~~~~~~~l~~~~~~~~Td~~G~a~~tlt-st--~aG~~~VtA   86 (100)
T PF02369_consen   11 SIVADKAVADGSDTNTLTATVTDANGNPVPGQPVTFS-SSSSGGTLSPTNTSATTDSNGIATVTLT-ST--KAGTYTVTA   86 (100)
T ss_dssp             EEEES-EESSSSS-EEEEEEEEETTSEB-TS-EEEE---EESSSEES-CEE-EEE-TTSEEEEEEE--S--S-EEEEEEE
T ss_pred             eeeecceEeCCcCcEEEEEEEEcCCCCCCCCCEEEEE-EcCCCcEEecCccccEECCCEEEEEEEE-ec--CceEEEEEE
Confidence            3444444444788899999999999999977777773 33444444332  357889999988876 22  679999999


Q ss_pred             EEcC
Q psy5522         179 NVLD  182 (1397)
Q Consensus       179 ~~~~  182 (1397)
                      ...+
T Consensus        87 ~~~~   90 (100)
T PF02369_consen   87 TVDG   90 (100)
T ss_dssp             EETT
T ss_pred             EECC
Confidence            9987


No 102
>PRK14081 triple tyrosine motif-containing protein; Provisional
Probab=56.24  E-value=5.8e+02  Score=33.28  Aligned_cols=340  Identities=16%  Similarity=0.154  Sum_probs=159.5

Q ss_pred             CCCceeEEEEEecCc---eeeEeeEEEEEEe--cc-c--EEEE-EcCCCcccCCceeEEEEEEEcCCCccCCCcceEEEE
Q psy5522          68 LGPGQYNLTVKGSGS---LNFYNSTGLVYVH--KS-Y--SVFI-QLDKAIYKPGDLVRFRAIVLNSHLKPSVTGALEIFI  138 (1397)
Q Consensus        68 ~~~~~~~l~v~g~~~---~~f~~~~~v~~~~--~~-~--~vfI-qTDKpIYkPGqtV~fRvv~ld~~l~P~~~~~~~v~I  138 (1397)
                      ..+|.|+|.+...+.   -.|..+..+.+.=  .. .  .-|+ -.++|+ -.|+.|.|++.+-...     +......|
T Consensus       253 ~~~G~Y~i~~~VKD~~S~~eyDD~~~l~y~Vk~~~~vkI~~~~~d~~s~q-l~g~~I~ika~a~GG~-----~llYrf~I  326 (667)
T PRK14081        253 KKSGDYKLLCLVKDMYSNNEFDDRAVLVYTVKPYKDIKIRNFTTDLSSPQ-LTDTDIELKAVAEGGK-----ELLYRFII  326 (667)
T ss_pred             CCCccEEEEEEEeccCcccccccceEEEEEEecCCCcEEEEEEEcCCCCe-ecCceEEEEEEecCCC-----ceEEEEEE
Confidence            356778777666432   2344444444331  11 1  1111 124666 4677777777765432     11233344


Q ss_pred             ECCCCCeEEeeeeeccCCceeeeeeecCCCCCCeeEEEEEEEcC--------eEEEEEEEEeeec-CCcEEEEEeCC-Cc
Q psy5522         139 TDGKGNRIKQWTRALTTRGVFSADLQLSKSPVLGDWNITINVLD--------QKFTKRFTVAEYV-LPKFEVNVNVP-PH  208 (1397)
Q Consensus       139 ~DP~g~~I~qw~~~~~~~Gi~~~sf~L~~~~~lG~y~I~v~~~~--------~~~~~~F~VeEYv-LPkFeV~v~~p-~~  208 (1397)
                      .   |..+.     ....| -..+|.-.++ ..|.|.|.+.+.+        ....-.|.|++|. .|-.=-.|..| +.
T Consensus       327 ~---G~~~e-----~~~Y~-~~n~~~w~P~-~~G~Y~I~v~VKDk~S~~eyD~~~~v~f~i~~~~~~p~~I~~vl~d~~~  396 (667)
T PRK14081        327 K---GKESE-----DSGYI-RNNIYTWKPK-IAGKYSITLWVKDISSKGEYEDKSSIDYTIEEKSKEPIKIEDVILDKGK  396 (667)
T ss_pred             C---CcEEe-----ecccc-ccceEEEeeC-CCceEEEEEEEEcCcCcccccceEEEEEEEcccCCCCeEEEEEEECCCC
Confidence            2   43332     11111 2223333222 4799999999976        2345588898883 34322333444 34


Q ss_pred             eEeeCCeEEEEEEEEEecCccccceEEEEEEeccccCccc--ccc--ccceeEEeecCceEEEEEechhhhccccccceE
Q psy5522         209 ATFKDSKVVISVNAKYTYGKPVKGEATITAYPTIFSGVIQ--PLF--QTPVRKVVPIDGKTVIEFDVVKELQLTDEYERN  284 (1397)
Q Consensus       209 i~~~~~~~~v~v~AkYtyGkPV~G~v~v~v~~~~~~g~~~--~~~--~~~~~~~~d~~G~~~~~~~~~~~~~~~~~~~~~  284 (1397)
                      -.+.|+.+.+++.|.=+  .    .+.++.-.+. .|...  .++  ...+.-+-...|..++.+-.++ -.....|...
T Consensus       397 ~~lvG~~i~i~v~a~gg--~----~~lY~f~ik~-ng~~ve~~~Y~~~~~~~f~P~~~G~Y~IeV~vKd-k~S~~~yD~~  468 (667)
T PRK14081        397 HILKGEEIKIRVIAEGG--T----NLRYSFIIKK-DGKEEEKIDYGKNNWVNFIPEEKGNYELEVRVKD-KYSDKEYDAH  468 (667)
T ss_pred             ceEeCCeEEEEEEecCC--C----eEEEEEEEEE-CCEEEEEeecccccEEEEEECCCeeEEEEEEEec-ccCchhcccc
Confidence            44568888888875432  2    1222211110 01100  000  0111122233444444444433 1111122222


Q ss_pred             EEEEEEEEECCcceEEeeEEEEEEEEeeeEEEEE--ecCCcccCCCeEEEEEEEEccCCCccCCCCceEEEEEeeecccc
Q psy5522         285 IHFDVAVEEALTGRRQNNTGSVVFHKHKYKMDLI--KSSEYFKPGLKYTAYMKLTHHDGTPVTDNNNMVQVRHGFSYDES  362 (1397)
Q Consensus       285 l~V~a~VtE~gTG~~~~~~~~i~i~~~~~~i~f~--~~~~~~kpG~p~~~~v~v~~~dG~P~~~~~~~V~v~~~~~~~~~  362 (1397)
                      -.|...|.|.                .|..+...  +...++.-|.+.++.+...+.++.-.     .-.+..+..|.. 
T Consensus       469 k~v~l~V~e~----------------~P~~i~~il~~~~~~~~vg~~i~~~~~~~~~k~v~y-----~y~~~~NG~~v~-  526 (667)
T PRK14081        469 TIVYIKVHEY----------------IPAEIDYILLPVKEYYLVGDDIEIEVIIQNTKDVLI-----KYILKINGHKVE-  526 (667)
T ss_pred             eEEEEEEecc----------------CceeeeeEEecccccEEeCCEEEEEEEEeCCCeEEE-----EEEEEECCEEEE-
Confidence            2222233221                23333332  33466778998888888877543221     111221112210 


Q ss_pred             ccceeeEeeCCCeeEEEEEeCCCCCceeEEEEEEEecCcce-------EEEEEEeecCCCCCeEEEEecCCCCCCCCeEE
Q psy5522         363 KYEANQYKLDRNGMIKLVYYPPANENVTTLGIEAEYLDIKE-------WFSTISASESPSNSFIQAALLTQNPKVNKDVE  435 (1397)
Q Consensus       363 ~~~~~~~ttd~~G~~~f~i~~~~~~~~~~l~l~a~~~~~~~-------~~~~v~~~~S~s~s~L~i~~~~~~~~~G~~~~  435 (1397)
                         .+.+.  .+.  .++|.| ...|.|.+.+-|+..-+..       -.+.+.  ....-.-..|..++....||++++
T Consensus       527 ---~t~Ys--~~~--~ysf~P-~~~GkY~I~V~aKn~~s~~~~D~~k~v~~~V~--e~~pi~nt~~~~~~~~~~~n~~~t  596 (667)
T PRK14081        527 ---ETDYI--KNK--KYKFIP-KCSGKYTIEVLAKNIKSTEEYDSKKEVKFYVR--EALPITNTKIKTSKKKFKCNEEVT  596 (667)
T ss_pred             ---Eeecc--ccc--eEEEee-cCCceEEEEEEEcccccccccccceEEEEEEc--CCCCceeeEEEeecceEEcCCeEE
Confidence               11111  122  344443 3456766666665443321       112222  122233455667788999999999


Q ss_pred             EEEEecCCcc-eEEEEEEeCCeEEEEEEE
Q psy5522         436 LEINSTAPLK-YISYQVLGRGDVIMADTI  463 (1397)
Q Consensus       436 v~v~s~~~~~-~~~ylV~srG~Iv~~~~~  463 (1397)
                      +.+.+...-. ..-|.||.+|.-.....-
T Consensus       597 ~~~~~~gg~~v~Yef~v~~~g~w~~vq~y  625 (667)
T PRK14081        597 FSVKSEGGKDVCYEFYIMEKGEWKLVQKY  625 (667)
T ss_pred             EEEEccCCCcEEEEEEEEECCcEEEEeec
Confidence            9999763222 355678999987665543


No 103
>PRK13211 N-acetylglucosamine-binding protein A; Reviewed
Probab=55.45  E-value=1.1e+02  Score=38.12  Aligned_cols=86  Identities=23%  Similarity=0.232  Sum_probs=51.9

Q ss_pred             CccEEEEEEEEeecCCCCeeEEeEEEEECCCceeEEEEEcCCCCCceeEEEEEec--Cc-eeeEeeEEEEEEecc-----
Q psy5522          26 SEATQVSVEVGGKQDSGGEFRTKQLLSVDPFSTRIVKLDIGDLGPGQYNLTVKGS--GS-LNFYNSTGLVYVHKS-----   97 (1397)
Q Consensus        26 ~~~v~v~v~L~~~~~~~~~~~~~~~~~~~~~~~~~~~f~vP~~~~~~~~l~v~g~--~~-~~f~~~~~v~~~~~~-----   97 (1397)
                      ++.+.|+++|...   ..+........++.+. +.+++.+..+..|.|.|.|++.  +| .....+..+++....     
T Consensus       338 ~g~~~vta~V~d~---~g~~~~~~~~~v~d~s-~~vtL~Ls~~~AG~y~Lvv~~t~~dG~~~~q~~~~~~v~~~~~~~~~  413 (478)
T PRK13211        338 TGDMNVEATVYNH---DGEALGSKSQTVNDGS-QSVSLDLSKLKAGHHMLVVKAKPKDGELIKQQTLDFMLEAKDPPPSG  413 (478)
T ss_pred             ccceEEEEEEEcC---CCCeeeeeeEEecCCc-eeEEEecccCCCceEEEEEEEEeCCCceeeeeeEEEEEEecCCCCCC
Confidence            4456666666543   1122333334444433 5667777788889998888875  33 445677777774221     


Q ss_pred             cEEEEEcCC-CcccCCcee
Q psy5522          98 YSVFIQLDK-AIYKPGDLV  115 (1397)
Q Consensus        98 ~~vfIqTDK-pIYkPGqtV  115 (1397)
                      ..=|.+-+- ..|+.|++|
T Consensus       414 ~~~~~~~~g~~~Y~aGd~V  432 (478)
T PRK13211        414 DYDFVFPEGLKSYTAGTKV  432 (478)
T ss_pred             CcCccccCCcccccCCCEE
Confidence            112455556 889999987


No 104
>PF00432 Prenyltrans:  Prenyltransferase and squalene oxidase repeat This Prosite family is a subset of the Pfam family.;  InterPro: IPR001330 The beta subunit of the farnesyltransferases is responsible for peptide binding. Squalene-hopene cyclase is a bacterial enzyme that catalyzes the cyclization of squalene into hopene, a key step in hopanoid (triterpenoid) metabolism []. Lanosterol synthase (5.4.99.7 from EC) (oxidosqualene-lanosterol cyclase) catalyzes the cyclization of (S)-2,3-epoxysqualene to lanosterol, the initial precursor of cholesterol, steroid hormones and vitamin D in vertebrates and of ergosterol in fungi []. Cycloartenol synthase () (2,3-epoxysqualene-cycloartenol cyclase) is a plant enzyme that catalyzes the cyclization of (S)-2,3-epoxysqualene to cycloartenol.; GO: 0003824 catalytic activity; PDB: 2IEJ_B 1LD7_B 1LD8_B 2H6G_B 1TN6_B 1S63_B 1MZC_B 2H6I_B 2H6F_B 1JCQ_B ....
Probab=54.64  E-value=8.5  Score=31.00  Aligned_cols=20  Identities=30%  Similarity=0.635  Sum_probs=17.8

Q ss_pred             hhhhhhhhhcccCCCCcccc
Q psy5522        1257 LPILTWLVTQQNDQGGFAST 1276 (1397)
Q Consensus      1257 ~~i~~Wl~~q~n~~Ggf~st 1276 (1397)
                      .++++||.+.|++.|||+..
T Consensus         4 ~~~~~~l~~~Q~~dGGf~~~   23 (44)
T PF00432_consen    4 EKLIRFLLSCQNPDGGFGGR   23 (44)
T ss_dssp             HHHHHHHHHTBBTTSSBBSS
T ss_pred             HHHHHHHHHHCCCCCCCCCC
Confidence            57899999999999999765


No 105
>PF14326 DUF4384:  Domain of unknown function (DUF4384)
Probab=54.52  E-value=65  Score=29.88  Aligned_cols=63  Identities=19%  Similarity=0.382  Sum_probs=37.7

Q ss_pred             CcccCCceeEEEEEEEcCCCccCCCcceEEEEECCCCCeEEeeeeeccCCceee--eeeecCCCCCCeeEEEEE
Q psy5522         107 AIYKPGDLVRFRAIVLNSHLKPSVTGALEIFITDGKGNRIKQWTRALTTRGVFS--ADLQLSKSPVLGDWNITI  178 (1397)
Q Consensus       107 pIYkPGqtV~fRvv~ld~~l~P~~~~~~~v~I~DP~g~~I~qw~~~~~~~Gi~~--~sf~L~~~~~lG~y~I~v  178 (1397)
                      +.|+.||+|+|++=+       ....++.|.-.|++|+...=..+.......+.  ..+.+|+.  -..|.+++
T Consensus         1 ~~~~~Ge~v~~~~~~-------~~~~Yl~l~~~~~~G~v~~L~Pn~~~~~~~v~ag~~~~iP~~--~~~~~~~v   65 (83)
T PF14326_consen    1 TVYRVGERVRFRVTS-------NRDGYLYLFYIDADGKVTLLFPNRYQPDNFVKAGQTYTIPDP--GDRFSFTV   65 (83)
T ss_pred             CcccCCCEEEEEEEe-------CCCeEEEEEEECCCCCEEEEecCccccCceEcCCceEEcCCC--CCceEEEE
Confidence            579999999998765       12457777778999977654444322223222  24566643  23444444


No 106
>PF13199 Glyco_hydro_66:  Glycosyl hydrolase family 66; PDB: 3VMO_A 3VMN_A 3VMP_A.
Probab=53.90  E-value=64  Score=41.01  Aligned_cols=93  Identities=16%  Similarity=0.194  Sum_probs=0.0

Q ss_pred             cCCCcccCCceeEEEEEEEcCCCccCCCcceEEEEECCCCCeEEeeeeeccCCceeeee----eecCCCCCCeeEEEEEE
Q psy5522         104 LDKAIYKPGDLVRFRAIVLNSHLKPSVTGALEIFITDGKGNRIKQWTRALTTRGVFSAD----LQLSKSPVLGDWNITIN  179 (1397)
Q Consensus       104 TDKpIYkPGqtV~fRvv~ld~~l~P~~~~~~~v~I~DP~g~~I~qw~~~~~~~Gi~~~s----f~L~~~~~lG~y~I~v~  179 (1397)
                      |||..|+|||+|.|.+=..+..-..- ...+.|++.-=+ +.|.+-.....-.+.-+.+    +..|.+-- .-|-+.+.
T Consensus         2 tDKA~Y~PGe~V~l~~~~~~~~~~~~-~g~~~v~~~~l~-~~V~~~~~~~~~~~~~~~~~~~~w~~P~~df-~GYlv~i~   78 (559)
T PF13199_consen    2 TDKARYRPGEKVTLTASLKNTTGSDF-SGTVKVRYYHLN-EVVGETKQSVDLTSGASWTLTIEWTAPADDF-TGYLVEIY   78 (559)
T ss_dssp             ES-SSB-TTS-EEEE-EEE--SSS-E-EEEEEEEEEETT-EEEEEEEEEEEE-TT-EEE-TTSEEE-TTSS-EEEEEEEE
T ss_pred             CCcceeCCCCeEEEEEEeccCccccc-cceEEEEEEEcC-eEeeeeeeEEeecCCCcceeeEEEECCcccC-eeEEEEEE


Q ss_pred             EcC-----eEEEEEEEEee--ecCCcE
Q psy5522         180 VLD-----QKFTKRFTVAE--YVLPKF  199 (1397)
Q Consensus       180 ~~~-----~~~~~~F~VeE--YvLPkF  199 (1397)
                      ...     ...+..+.|..  .+-||+
T Consensus        79 ~~~~g~~~~~~t~aiDVSsdW~~fPRY  105 (559)
T PF13199_consen   79 ADSGGKEVDSATIAIDVSSDWTRFPRY  105 (559)
T ss_dssp             E-TT--EEEEEEEEEEE-SSTTSS--E
T ss_pred             EecCCceeeeeeEEEEecCCcccCCcc


No 107
>PF08400 phage_tail_N:  Prophage tail fibre N-terminal;  InterPro: IPR013609 This entry represents the N terminus of phage 933W tail fibre protein. The characteristics of the protein distribution suggest prophage matches.
Probab=51.87  E-value=1e+02  Score=31.43  Aligned_cols=96  Identities=14%  Similarity=0.079  Sum_probs=0.0

Q ss_pred             CeEEEEEEEEccCCCccCCCCceEEEEEeeecccc-ccceeeEeeCCCeeEEEEEeCCCCCceeEEEEEEEecCcce-EE
Q psy5522         328 LKYTAYMKLTHHDGTPVTDNNNMVQVRHGFSYDES-KYEANQYKLDRNGMIKLVYYPPANENVTTLGIEAEYLDIKE-WF  405 (1397)
Q Consensus       328 ~p~~~~v~v~~~dG~P~~~~~~~V~v~~~~~~~~~-~~~~~~~ttd~~G~~~f~i~~~~~~~~~~l~l~a~~~~~~~-~~  405 (1397)
                      +...+.=.+.|+.|+|++  +..|.+....+-... ........|+++|...|.+.+    |.|.+.+...-..... ..
T Consensus         1 MsV~ISGvL~dg~G~pv~--g~~I~L~A~~tS~~Vv~~t~as~~t~~~G~Ys~~~ep----G~Y~V~l~~~g~~~~~vG~   74 (134)
T PF08400_consen    1 MSVKISGVLKDGAGKPVP--GCTITLKARRTSSTVVVGTVASVVTGEAGEYSFDVEP----GVYRVTLKVEGRPPVYVGD   74 (134)
T ss_pred             CeEEEEEEEeCCCCCcCC--CCEEEEEEccCchheEEEEEEEEEcCCCceEEEEecC----CeEEEEEEECCCCceeEEE


Q ss_pred             EEEEeecCCC--CCeEEEEecCCCCCC
Q psy5522         406 STISASESPS--NSFIQAALLTQNPKV  430 (1397)
Q Consensus       406 ~~v~~~~S~s--~s~L~i~~~~~~~~~  430 (1397)
                      .++..-..|.  |.|| ..+....++|
T Consensus        75 I~V~~dS~pGTLN~fL-~~~~e~dl~P  100 (134)
T PF08400_consen   75 ITVYEDSKPGTLNDFL-TAPDEDDLRP  100 (134)
T ss_pred             EEEecCCCCCcHHHHh-hccccccCCH


No 108
>TIGR03437 Soli_cterm Solibacter uncharacterized C-terminal domain. This model describes a protein domain found in 90 proteins of Solibacter usitatus Ellin6076, nearly always as the C-terminal domain of a much larger protein. No homologs to this domain are detected outside of S. usitatus, a member of the Acidobacteria.
Probab=51.61  E-value=3.4e+02  Score=30.19  Aligned_cols=119  Identities=15%  Similarity=0.239  Sum_probs=72.3

Q ss_pred             eEEEEEcC-CCCCceeEEEEEecCceeeEeeEEEEEEecccEEEEE--------------------cCCCcccCCceeEE
Q psy5522          59 RIVKLDIG-DLGPGQYNLTVKGSGSLNFYNSTGLVYVHKSYSVFIQ--------------------LDKAIYKPGDLVRF  117 (1397)
Q Consensus        59 ~~~~f~vP-~~~~~~~~l~v~g~~~~~f~~~~~v~~~~~~~~vfIq--------------------TDKpIYkPGqtV~f  117 (1397)
                      ..++|++| .+..+...+.|+-.++  -.....+.+.....-+|..                    .+.|. ||||.|.+
T Consensus        51 ~QIN~qvP~~~~~g~~~v~V~~~g~--~s~~~~v~va~~aPgiFt~~~~g~g~~a~~~~~d~s~~~~~~PA-~~G~~i~i  127 (215)
T TIGR03437        51 GQINAQVPYEVAPGAATVTVTYNGG--ASAAVTVTVAAAAPGIFTLDGSGTGQAAALNNQDGSVNSAANPA-APGDVVVL  127 (215)
T ss_pred             ceEEEEeCCCcCCCcEEEEEEeCCc--ccceEEEEeeccccEEEEecCCCCcceeEEEcCCCccccCCCCC-CCCCEEEE
Confidence            45889999 4555666666664222  1344455555555555542                    35566 89999888


Q ss_pred             EEEEEcC------CCccC-------CCcceEEEEECCCCCeEEeeeeecc-CCceeeeeeecCCCCCCeeEEEEEEEcC
Q psy5522         118 RAIVLNS------HLKPS-------VTGALEIFITDGKGNRIKQWTRALT-TRGVFSADLQLSKSPVLGDWNITINVLD  182 (1397)
Q Consensus       118 Rvv~ld~------~l~P~-------~~~~~~v~I~DP~g~~I~qw~~~~~-~~Gi~~~sf~L~~~~~lG~y~I~v~~~~  182 (1397)
                      -+--+..      +..+.       ...++++.|=.-.=.  -++.-... .-|++|..+.||.....|.+.|.+..++
T Consensus       128 y~TG~G~~~p~~~~G~~~~~~~~~~~~~~v~vtigG~~a~--V~yaGlaPg~~Gl~QvNv~vP~~~~~G~~~v~itvgg  204 (215)
T TIGR03437       128 YATGLGPTSPAVADGAPAPSSPLAPALAPVTVTIGGVPAT--VLYAGLAPGFVGLYQVNVRVPAGLATGAVPVVITVGG  204 (215)
T ss_pred             EEeCCCCCCCccccCccccCCccccccCCeEEEECCEEee--EEEeccCCCCCceEEEEEEcCCCCCCCcEeEEEEECC
Confidence            7765421      11111       134567776221111  12222222 3499999999999999999999999987


No 109
>PF11614 FixG_C:  IG-like fold at C-terminal of FixG, putative oxidoreductase; PDB: 2R39_A.
Probab=51.40  E-value=1.8e+02  Score=28.84  Aligned_cols=63  Identities=14%  Similarity=0.190  Sum_probs=40.8

Q ss_pred             EEEEEEcCCCccEEEEEEEeecCCeeeeecCCCccCCCCCCcceeeeEEEEEeCCCeeEEEEEEEe-ce---eeeEEEEE
Q psy5522         733 IPVVVFNYLSQDLVADVTLENVGQFDFADFSNEVDAAPQPKFEVFRRKKLTIKANSGSTTTFVITP-KE---LGYIGIKV  808 (1397)
Q Consensus       733 l~~~VfNyl~~~~~v~v~L~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~a~~~~tv~f~v~p-~~---lG~v~itV  808 (1397)
                      -.+.|.|-.++...+++++...+++++..                ....+.|+|++...+.+.|.. ..   .|..+|++
T Consensus        35 Y~lkl~Nkt~~~~~~~i~~~g~~~~~l~~----------------~~~~i~v~~g~~~~~~v~v~~p~~~~~~~~~~i~f   98 (118)
T PF11614_consen   35 YTLKLTNKTNQPRTYTISVEGLPGAELQG----------------PENTITVPPGETREVPVFVTAPPDALKSGSTPITF   98 (118)
T ss_dssp             EEEEEEE-SSS-EEEEEEEES-SS-EE-E----------------S--EEEE-TT-EEEEEEEEEE-GGG-SSSEEEEEE
T ss_pred             EEEEEEECCCCCEEEEEEEecCCCeEEEC----------------CCcceEECCCCEEEEEEEEEECHHHccCCCeeEEE
Confidence            45668899999999999999877777621                145789999999999988874 22   35556666


Q ss_pred             EEE
Q psy5522         809 TAT  811 (1397)
Q Consensus       809 ~A~  811 (1397)
                      .+.
T Consensus        99 ~v~  101 (118)
T PF11614_consen   99 TVT  101 (118)
T ss_dssp             EEE
T ss_pred             EEE
Confidence            665


No 110
>PF13620 CarboxypepD_reg:  Carboxypeptidase regulatory-like domain; PDB: 3MN8_D 3P0D_I 3KCP_A 2B59_B 1UWY_A 1H8L_A 1QMU_A 2NSM_A.
Probab=50.43  E-value=35  Score=31.05  Aligned_cols=57  Identities=16%  Similarity=0.170  Sum_probs=31.6

Q ss_pred             EEEEEEccCCCccCCCCceEEEEEeeeccccccceeeEeeCCCeeEEEEEeCCCCCceeEEEEEEEecC
Q psy5522         332 AYMKLTHHDGTPVTDNNNMVQVRHGFSYDESKYEANQYKLDRNGMIKLVYYPPANENVTTLGIEAEYLD  400 (1397)
Q Consensus       332 ~~v~v~~~dG~P~~~~~~~V~v~~~~~~~~~~~~~~~~ttd~~G~~~f~i~~~~~~~~~~l~l~a~~~~  400 (1397)
                      +.-+|+|.+|+|+++  ..|.+.  ..   .........||++|...|.--++   +.  +.|++....
T Consensus         2 I~G~V~d~~g~pv~~--a~V~l~--~~---~~~~~~~~~Td~~G~f~~~~l~~---g~--Y~l~v~~~g   58 (82)
T PF13620_consen    2 ISGTVTDATGQPVPG--ATVTLT--DQ---DGGTVYTTTTDSDGRFSFEGLPP---GT--YTLRVSAPG   58 (82)
T ss_dssp             EEEEEEETTSCBHTT---EEEET-------TTTECCEEE--TTSEEEEEEE-S---EE--EEEEEEBTT
T ss_pred             EEEEEEcCCCCCcCC--EEEEEE--Ee---eCCCEEEEEECCCceEEEEccCC---Ee--EEEEEEECC
Confidence            344677899999995  345443  11   12345677899999888772222   45  455554443


No 111
>PF00868 Transglut_N:  Transglutaminase family;  InterPro: IPR001102 Synonym(s): Protein-glutamine gamma-glutamyltransferase, Fibrinoligase, TGase Protein-glutamine gamma-glutamyltransferases (2.3.2.13 from EC) (TGase) are calcium-dependent enzymes that catalyse the cross-linking of proteins by promoting the formation of isopeptide bonds between the gamma-carboxyl group of a glutamine in one polypeptide chain and the epsilon-amino group of a lysine in a second polypeptide chain. TGases also catalyse the conjugation of polyamines to proteins [, ]. Transglutaminases are widely distributed in various organs, tissues and body fluids. The best known transglutaminase is blood coagulation factor XIII, a plasma tetrameric protein composed of two catalytic A subunits and two non-catalytic B subunits. Factor XIII is responsible for cross-linking fibrin chains, thus stabilising the fibrin clot. There are commonly three domains: N-terminal, middle (IPR013808 from INTERPRO) and C-terminal (IPR013807 from INTERPRO). This entry represents the N-terminal domain found in transglutaminases.; GO: 0018149 peptide cross-linking; PDB: 1L9N_B 1NUF_A 1NUD_A 1NUG_B 1L9M_A 1KV3_C 3S3S_A 2Q3Z_A 3LY6_A 3S3P_A ....
Probab=49.85  E-value=1.2e+02  Score=30.33  Aligned_cols=82  Identities=5%  Similarity=0.197  Sum_probs=45.1

Q ss_pred             cccEEEEEcCCCcccCCceeEEEEEEEcCCCccCCCcceEEEEECCCCCeEEeeeee--ccCCceeeeeeecCCCCCCee
Q psy5522          96 KSYSVFIQLDKAIYKPGDLVRFRAIVLNSHLKPSVTGALEIFITDGKGNRIKQWTRA--LTTRGVFSADLQLSKSPVLGD  173 (1397)
Q Consensus        96 ~~~~vfIqTDKpIYkPGqtV~fRvv~ld~~l~P~~~~~~~v~I~DP~g~~I~qw~~~--~~~~Gi~~~sf~L~~~~~lG~  173 (1397)
                      +...+-|.-+||.....|.+.|-.- +.++=....+..+.+-+....  .-..|...  ..+.+.+..+...|.+++.|.
T Consensus        33 Q~F~i~l~f~r~~~~~~d~l~l~~~-~G~~P~~~~gT~~~~~~~~~~--~~~~W~a~v~~~~~~~~tv~V~spa~A~VG~  109 (118)
T PF00868_consen   33 QPFTITLRFNRPFDPSKDQLSLEFE-TGPNPSESKGTKVVFPVSSSL--DSSSWSARVESQDGNSVTVSVTSPANAPVGR  109 (118)
T ss_dssp             SEEEEEEEESSS--TTTEEEEEEEE-ESSS--TTTTSEEEEEECSSS---TSSSEEEEEEEETTEEEEEEE--TTS--EE
T ss_pred             CEEEEEEEEcCCcCCCCcEEEEEEE-EecccccCCCcEEEEEEccCC--CCCCEEEEEEecCCCEEEEEEECCCCCceEE
Confidence            4456777788885555666665443 232211123456666662221  12367643  334566899999999999999


Q ss_pred             EEEEEEE
Q psy5522         174 WNITINV  180 (1397)
Q Consensus       174 y~I~v~~  180 (1397)
                      |++.+..
T Consensus       110 y~l~v~~  116 (118)
T PF00868_consen  110 YKLSVET  116 (118)
T ss_dssp             EEEEEEE
T ss_pred             EEEEEEE
Confidence            9999875


No 112
>PRK12812 flgD flagellar basal body rod modification protein; Reviewed
Probab=48.67  E-value=1.8e+02  Score=33.32  Aligned_cols=63  Identities=13%  Similarity=0.222  Sum_probs=42.9

Q ss_pred             ceeEEEEEEEcCCCccCCCcceEEEEECCCCCeEEeeeeeccCCceeeeeeecCCC----CCCeeEEEEEEEc
Q psy5522         113 DLVRFRAIVLNSHLKPSVTGALEIFITDGKGNRIKQWTRALTTRGVFSADLQLSKS----PVLGDWNITINVL  181 (1397)
Q Consensus       113 qtV~fRvv~ld~~l~P~~~~~~~v~I~DP~g~~I~qw~~~~~~~Gi~~~sf~L~~~----~~lG~y~I~v~~~  181 (1397)
                      ..+.|++- +     |..-..+.|.|.|.+|+.|+.........|...+++.=-+.    .+-|.|+|++...
T Consensus       128 ~~~~~~~~-l-----~~~a~~v~v~I~D~~G~~V~t~~lg~~~aG~~~f~WDG~d~~G~~~~~G~Yt~~v~A~  194 (259)
T PRK12812        128 ELIALKLY-F-----PEDSDEGTLEIYDSNNKLVEKIDFKEISQGLFTMEWDGRDNDGVYAGDGEYTIKAVYN  194 (259)
T ss_pred             ceeEEEEe-c-----CCcCceEEEEEEeCCCCEEEEEecCCCCCcceeEEECCCCCCCCcCCCeeeEEEEEEE
Confidence            46666633 2     22235689999999999998776445567877666655332    4569999999854


No 113
>PF14900 DUF4493:  Domain of unknown function (DUF4493)
Probab=46.59  E-value=4.7e+02  Score=29.37  Aligned_cols=44  Identities=16%  Similarity=0.395  Sum_probs=30.0

Q ss_pred             CcceEEEEECCCCCeEEeeeeeccCCceeeeeeecCCCCCCeeEEEEEEEcC
Q psy5522         131 TGALEIFITDGKGNRIKQWTRALTTRGVFSADLQLSKSPVLGDWNITINVLD  182 (1397)
Q Consensus       131 ~~~~~v~I~DP~g~~I~qw~~~~~~~Gi~~~sf~L~~~~~lG~y~I~v~~~~  182 (1397)
                      .....|.|.+..|..+..|......    ...+.||    .|.|+|.|..++
T Consensus        32 ~~~~~v~I~~~~~~~~~~~~~~~~~----~~~i~L~----~G~Ytv~A~~g~   75 (235)
T PF14900_consen   32 VDDFTVEIYNADGTVVKYWKYSEMP----GESIELP----VGSYTVKASYGD   75 (235)
T ss_pred             cCceEEEEEeCCCcEEEecchhccc----cceEeec----CCcEEEEEEcCC
Confidence            5678899999997666666433222    3556655    499999999653


No 114
>PF05751 FixH:  FixH;  InterPro: IPR008620 This family consists of several Rhizobium FixH like proteins. It has been suggested that the four proteins FixG, FixH, FixI, and FixS may participate in a membrane-bound complex coupling the FixI cation pump with a redox process catalysed by FixG [].
Probab=46.08  E-value=90  Score=32.10  Aligned_cols=63  Identities=14%  Similarity=0.262  Sum_probs=44.1

Q ss_pred             ceeEEEEEEEcCCCccCCCcceEEEEECCCCCeEEeeee-eccCCceeeeeeecCCCCCCeeEEEEEEE
Q psy5522         113 DLVRFRAIVLNSHLKPSVTGALEIFITDGKGNRIKQWTR-ALTTRGVFSADLQLSKSPVLGDWNITINV  180 (1397)
Q Consensus       113 qtV~fRvv~ld~~l~P~~~~~~~v~I~DP~g~~I~qw~~-~~~~~Gi~~~sf~L~~~~~lG~y~I~v~~  180 (1397)
                      .++.+++  .|.+-.|.....+++.+.-|....-.+-.. .....|.+....+++   ..|.|.|++..
T Consensus        69 ~~~~i~~--~d~~g~~~~~~~~~l~l~rp~~~~~D~~~~l~~~~~g~y~~~~~~~---~~G~W~l~l~~  132 (146)
T PF05751_consen   69 NSLTIRL--TDPNGAPVSGAKLTLSLYRPTDAKLDFTLTLTESAPGVYRAPVPLL---KKGRWYLRLDW  132 (146)
T ss_pred             CeEEEEE--EcCCCCcCcCceEEEEEECCCCccCCeeEEeeECCCceEEEEcCCC---CCccEEEEEEE
Confidence            4454444  346677777778889888887655544333 346789888888777   67999999944


No 115
>PRK12634 flgD flagellar basal body rod modification protein; Reviewed
Probab=45.20  E-value=1.2e+02  Score=33.89  Aligned_cols=51  Identities=22%  Similarity=0.273  Sum_probs=38.3

Q ss_pred             CcceEEEEECCCCCeEEeeeeeccCCceeeeeeecCCC----CCCeeEEEEEEEc
Q psy5522         131 TGALEIFITDGKGNRIKQWTRALTTRGVFSADLQLSKS----PVLGDWNITINVL  181 (1397)
Q Consensus       131 ~~~~~v~I~DP~g~~I~qw~~~~~~~Gi~~~sf~L~~~----~~lG~y~I~v~~~  181 (1397)
                      ...++|.|.|.+|+.|+.........|...+++.=-+.    .+-|.|+++|...
T Consensus       121 a~~v~i~I~d~~G~~V~t~~lg~~~aG~~~f~WDG~d~~G~~~~~G~Yt~~v~a~  175 (221)
T PRK12634        121 AGFVNFEITDANGAFVKQISVPASAAGEVSFAWDGTDANGNRMAAGKYGVTATQT  175 (221)
T ss_pred             CCeEEEEEEcCCCCEEEEEecCCcCCCceeEEECCCCCCCCcCCCeeeEEEEEEE
Confidence            46789999999999998876545566877777765332    4569999999753


No 116
>COG2372 CopC Uncharacterized protein, homolog of Cu resistance protein CopC [General function prediction only]
Probab=42.51  E-value=79  Score=31.79  Aligned_cols=62  Identities=16%  Similarity=0.199  Sum_probs=45.8

Q ss_pred             CcceEEEEECCCCCeEEeeeeeccCCceeeeeeecCCCCCCeeEEEEEEEcC-----eEEEEEEEEe
Q psy5522         131 TGALEIFITDGKGNRIKQWTRALTTRGVFSADLQLSKSPVLGDWNITINVLD-----QKFTKRFTVA  192 (1397)
Q Consensus       131 ~~~~~v~I~DP~g~~I~qw~~~~~~~Gi~~~sf~L~~~~~lG~y~I~v~~~~-----~~~~~~F~Ve  192 (1397)
                      ..+..+.|.+|+|.++.--.....+.---+...+|+....-|.|++.-..-+     ...+..|.|+
T Consensus        59 ~~fs~~~l~~~d~~~v~t~~~~~~~~~~~~l~v~l~~~L~aG~Y~v~WrvvS~DGH~v~G~~sFsV~  125 (127)
T COG2372          59 PGFSGAKLTGPDGEEVATAGTKLDEQNHTQLEVPLPQPLKAGVYTVDWRVVSSDGHVVKGSISFSVG  125 (127)
T ss_pred             CCcceeEEECCCCCccccCcccccccCCcEEEecCcccCCCCcEEEEEEEEecCCcEeccEEEEEec
Confidence            4557789999999998754444444434678899999999999998877642     4567788875


No 117
>PRK05842 flgD flagellar basal body rod modification protein; Reviewed
Probab=41.75  E-value=1.5e+02  Score=34.57  Aligned_cols=69  Identities=12%  Similarity=0.181  Sum_probs=42.1

Q ss_pred             cCCceeEEEEEEEcCCCccCCCcceEEEEECCCCCeEEeeeeecc--CCceeeeeeecCCC----CCCeeEEEEEEE
Q psy5522         110 KPGDLVRFRAIVLNSHLKPSVTGALEIFITDGKGNRIKQWTRALT--TRGVFSADLQLSKS----PVLGDWNITINV  180 (1397)
Q Consensus       110 kPGqtV~fRvv~ld~~l~P~~~~~~~v~I~DP~g~~I~qw~~~~~--~~Gi~~~sf~L~~~----~~lG~y~I~v~~  180 (1397)
                      ..|..+.|++. ++... |.....+.|.|.|.+|+.|+.......  ..|...+++.=-+.    .+-|.|+++|..
T Consensus       146 ~~g~~~~~~~~-l~~~~-~~~a~~v~I~I~Da~G~vVrTi~l~~~~~~aG~~~f~WDG~d~~G~~~p~G~Yt~~V~a  220 (295)
T PRK05842        146 DGNNKLSFSLF-FDEKI-DASKGVPAIQILNENNELVKTIPLKDYNGQKGYINFEWDGLNEKGEKVPKGNYKIKAEY  220 (295)
T ss_pred             cCCCceEEEEe-ccccc-cccCceEEEEEEcCCCCEEEEEecCcccCCCcceeEEECCCCCCCCcCCCcceEEEEEE
Confidence            34446666632 22211 223457899999999999987653222  34766666544222    456999999864


No 118
>PF15432 Sec-ASP3:  Accessory Sec secretory system ASP3
Probab=41.57  E-value=2.3e+02  Score=28.85  Aligned_cols=80  Identities=15%  Similarity=0.215  Sum_probs=58.3

Q ss_pred             cCCCcccCCceeEEEEEEEcCCCccCCCcceEEEEECCCCCeEEeeeeeccCCceeeeeeecCCCCCCeeEEEEEEEcC-
Q psy5522         104 LDKAIYKPGDLVRFRAIVLNSHLKPSVTGALEIFITDGKGNRIKQWTRALTTRGVFSADLQLSKSPVLGDWNITINVLD-  182 (1397)
Q Consensus       104 TDKpIYkPGqtV~fRvv~ld~~l~P~~~~~~~v~I~DP~g~~I~qw~~~~~~~Gi~~~sf~L~~~~~lG~y~I~v~~~~-  182 (1397)
                      -+=|+-|+|++=++.   ++.+..|.+...+.|...|-+|..|..-.....     +.+|..|+++  =.|+|..-..| 
T Consensus        45 P~LPlLk~G~~Y~l~---~~~~~~P~~svylki~F~dr~~e~i~~~i~k~~-----~~~F~yP~~a--ysY~I~LinaG~  114 (128)
T PF15432_consen   45 PSLPLLKRGHTYQLK---FNIDVVPENSVYLKIIFFDRQGEEIEEQIIKND-----SFEFTYPEEA--YSYTISLINAGC  114 (128)
T ss_pred             CCCCEecCCCEEEEE---EEEEEccCCeEEEEEEEEccCCCEeeEEEEecC-----ceEEeCCCCc--eEEEEEEeeCCC
Confidence            466999999996653   455667877788999999999999987653322     3688889886  67888887766 


Q ss_pred             -eEEEEEEEEee
Q psy5522         183 -QKFTKRFTVAE  193 (1397)
Q Consensus       183 -~~~~~~F~VeE  193 (1397)
                       .-.-+.|.++|
T Consensus       115 ~~l~F~~i~I~e  126 (128)
T PF15432_consen  115 QSLTFHSIEISE  126 (128)
T ss_pred             CeeEEeEEEEEE
Confidence             34455555544


No 119
>PF13715 DUF4480:  Domain of unknown function (DUF4480)
Probab=41.24  E-value=2.5e+02  Score=25.90  Aligned_cols=52  Identities=21%  Similarity=0.413  Sum_probs=37.1

Q ss_pred             EEEEEEcCC-CccCCCcceEEEEECCCCCeEEeeeeeccCCceeeeeeecCCCCCCeeEEEEEEEcC
Q psy5522         117 FRAIVLNSH-LKPSVTGALEIFITDGKGNRIKQWTRALTTRGVFSADLQLSKSPVLGDWNITINVLD  182 (1397)
Q Consensus       117 fRvv~ld~~-l~P~~~~~~~v~I~DP~g~~I~qw~~~~~~~Gi~~~sf~L~~~~~lG~y~I~v~~~~  182 (1397)
                      +++.++|.+ -.|.  ..+.|.+.+.+     + ...+++.|-|...  ++    .|.|+|.+..-|
T Consensus         2 i~G~V~d~~t~~pl--~~a~V~~~~~~-----~-~~~Td~~G~F~i~--~~----~g~~~l~is~~G   54 (88)
T PF13715_consen    2 ISGKVVDSDTGEPL--PGATVYLKNTK-----K-GTVTDENGRFSIK--LP----EGDYTLKISYIG   54 (88)
T ss_pred             EEEEEEECCCCCCc--cCeEEEEeCCc-----c-eEEECCCeEEEEE--Ec----CCCeEEEEEEeC
Confidence            567788876 6776  45667777665     1 2456788887777  43    588999999877


No 120
>PF00775 Dioxygenase_C:  Dioxygenase;  InterPro: IPR000627 This entry represents the C-terminal domain common to several intradiol ring-cleavage dioxygenases. Dioxygenases catalyse the incorporation of both atoms of molecular oxygen into substrates using a variety of reaction mechanisms. Cleavage of aromatic rings is one of the most important functions of dioxygenases, which play key roles in the degradation of aromatic compounds. The substrates of ring-cleavage dioxygenases can be classified into two groups according to the mode of scission of the aromatic ring. Intradiol enzymes use a non-haem Fe(III) to cleave the aromatic ring between two hydroxyl groups (ortho-cleavage), whereas extradiol enzymes (IPR000486 from INTERPRO) use a non-haem Fe(II) to cleave the aromatic ring between a hydroxylated carbon and an adjacent non-hydroxylated carbon (meta-cleavage) []. These two subfamilies differ in sequence, structural fold, iron ligands, and the orientation of second sphere active site amino acid residues. Enzymes that belong to the intradiol family include catechol 1,2-dioxygenase (1,2-CTD) (1.13.11.1 from EC); protocatechuate 3,4-dioxygenase (3,4-PCD) (1.13.11.3 from EC); and chlorocatechol 1,2-dioxygenase (1.13.11.1 from EC) [].; GO: 0003824 catalytic activity, 0008199 ferric iron binding, 0006725 cellular aromatic compound metabolic process, 0055114 oxidation-reduction process; PDB: 2BUV_A 2BUX_A 2BUU_A 2BUR_A 1EO9_A 2BUZ_A 2BV0_A 1EO2_A 1EOC_A 1EOA_A ....
Probab=41.13  E-value=66  Score=34.82  Aligned_cols=28  Identities=25%  Similarity=0.433  Sum_probs=21.6

Q ss_pred             ccCCCeEEEEEEEEccCCCccCCCCceEEE
Q psy5522         324 FKPGLKYTAYMKLTHHDGTPVTDNNNMVQV  353 (1397)
Q Consensus       324 ~kpG~p~~~~v~v~~~dG~P~~~~~~~V~v  353 (1397)
                      -.+|.++.+..+|+|.+|+|+++  ..|.+
T Consensus        24 ~~~G~~l~l~G~V~D~~g~Pv~~--A~vei   51 (183)
T PF00775_consen   24 DAPGEPLVLHGRVIDTDGKPVPG--ALVEI   51 (183)
T ss_dssp             TSSS-EEEEEEEEEETTSSB-TT--EEEEE
T ss_pred             CCCCCEEEEEEEEECCCCCCCCC--cEEEE
Confidence            56999999999999999999995  34544


No 121
>PRK12633 flgD flagellar basal body rod modification protein; Provisional
Probab=40.40  E-value=96  Score=34.89  Aligned_cols=51  Identities=14%  Similarity=0.205  Sum_probs=38.7

Q ss_pred             CcceEEEEECCCCCeEEeeeeeccCCceeeeeeecCCC----CCCeeEEEEEEEc
Q psy5522         131 TGALEIFITDGKGNRIKQWTRALTTRGVFSADLQLSKS----PVLGDWNITINVL  181 (1397)
Q Consensus       131 ~~~~~v~I~DP~g~~I~qw~~~~~~~Gi~~~sf~L~~~----~~lG~y~I~v~~~  181 (1397)
                      -..+.|.|.|.+|+.|+.........|...+.+.--+.    .+-|.|+++|...
T Consensus       128 a~~v~v~I~D~~G~vV~t~~lg~~~aG~~~f~WDG~d~~G~~~~~G~Y~~~V~a~  182 (230)
T PRK12633        128 ATKVTVKVLDPSGAVVRTMELGDLKTGVHTLQWDGNNDGGQPLADGKYSITVSAS  182 (230)
T ss_pred             CcEEEEEEEeCCCCEEEEEecCCCCCCceeEEECCCCCCCCcCCCcceEEEEEEE
Confidence            45689999999999999776545567887777766432    4679999999763


No 122
>KOG0367|consensus
Probab=39.64  E-value=91  Score=35.77  Aligned_cols=55  Identities=25%  Similarity=0.466  Sum_probs=31.8

Q ss_pred             eecCCCCccccCCCCCCCCcchhHHHHHH--HHHc-cCCCccCHHHHHHHHHHHHhcccCCCccc
Q psy5522         939 YRRPDGSFSAFGTTDPNGSTWLTAFVAKS--FRQA-ASHTTIDESVILEALAWLSSNQAVNGSFP 1000 (1397)
Q Consensus       939 y~~~dGsfs~~~~~~~~~s~wlTAyvl~~--~~~a-~~~i~vd~~~~~~~~~WL~~~q~~~G~F~ 1000 (1397)
                      -|++||||..-... ++.+  | -||.-+  .+-. ...-.+|.+   +++.|+.++|+++|||+
T Consensus       137 ~Q~~dGsF~~~~~G-Se~D--m-RFvYcA~aI~ymLd~~s~iD~e---k~~~yI~~~q~YdgGfg  194 (347)
T KOG0367|consen  137 CQRPDGSFVSINVG-SESD--M-RFVYCAVAICYMLDFWSGIDKE---KLIGYIRSSQRYDGGFG  194 (347)
T ss_pred             hcCCCCceeecCCC-Cchh--h-HHHHHHHHHHHHhccccccCHH---HHHHHHHHhhccccccc
Confidence            68999999654322 1222  2 122111  1111 112346654   45899999999999997


No 123
>cd00421 intradiol_dioxygenase Intradiol dioxygenases catalyze the critical ring-cleavage step in the conversion of catecholate derivatives to citric acid cycle intermediates. This family contains catechol 1,2-dioxygenases and protocatechuate 3,4-dioxygenases which are mononuclear non-heme iron enzymes that catalyze the oxygenation of catecholates to aliphatic acids via the cleavage of aromatic rings. The members are intradiol-cleaving enzymes which break the catechol C1-C2 bond and utilize Fe3+, as opposed to the extradiol-cleaving enzymes which break the C2-C3 or C1-C6 bond and utilize Fe2+ and Mn+. Catechol 1,2-dioxygenases are mostly homodimers with one catalytic ferric ion per monomer. Protocatechuate 3,4-dioxygenases form more diverse oligomers.
Probab=38.81  E-value=1.1e+02  Score=31.88  Aligned_cols=22  Identities=36%  Similarity=0.493  Sum_probs=20.5

Q ss_pred             cCCCeEEEEEEEEccCCCccCC
Q psy5522         325 KPGLKYTAYMKLTHHDGTPVTD  346 (1397)
Q Consensus       325 kpG~p~~~~v~v~~~dG~P~~~  346 (1397)
                      .+|.|..+..+|+|.+|.|+++
T Consensus         7 ~~G~~l~l~G~V~D~~g~pv~~   28 (146)
T cd00421           7 APGEPLTLTGTVLDGDGCPVPD   28 (146)
T ss_pred             CCCCEEEEEEEEECCCCCCCCC
Confidence            5899999999999999999985


No 124
>COG1689 Uncharacterized protein conserved in archaea [Function unknown]
Probab=37.28  E-value=44  Score=36.44  Aligned_cols=41  Identities=17%  Similarity=0.194  Sum_probs=32.6

Q ss_pred             CCCcchhHHHHHHHHHccCCCccCHHHHHHHHHHHHhcccCCCccc
Q psy5522         955 NGSTWLTAFVAKSFRQAASHTTIDESVILEALAWLSSNQAVNGSFP 1000 (1397)
Q Consensus       955 ~~s~wlTAyvl~~~~~a~~~i~vd~~~~~~~~~WL~~~q~~~G~F~ 1000 (1397)
                      ++-++-|=|+++.+...++-+.     +.+.++++.+-||.||||+
T Consensus       208 PPYiE~t~ya~r~lelL~~k~~-----i~~~~rFI~slqN~nGGFR  248 (274)
T COG1689         208 PPYIEPTFYALRGLELLGGKYC-----ISDHIRFIRSLQNQNGGFR  248 (274)
T ss_pred             CCccchHHHHHhHHHHHccCcC-----chHHHHHHHHhhcCCCCee
Confidence            4568889999999887765433     4556789999999999997


No 125
>PF12584 TRAPPC10:  Trafficking protein particle complex subunit 10, TRAPPC10;  InterPro: IPR022233 The trafficking protein particle complex TRAPP is a multi-protein complex needed in the early stages of the secretory pathway. To date, two kinds of TRAPP complexes have been studied, TRAPPI and TRAPP II. These complexes differ in subunit composition []. TRAPP I binds vesicles derived from the endoplasmic reticulum bringing them closer to the acceptor membrane.  This entry represents a domain which forms part of the TRAPP complex for mediating vesicle docking and fusion in the Golgi apparatus. The fungal version is referred to as Trs130, and an alternative vertebrate alias is TMEM1 [, ].
Probab=36.51  E-value=2.8e+02  Score=28.77  Aligned_cols=70  Identities=14%  Similarity=0.190  Sum_probs=47.2

Q ss_pred             CceeeeCCEEEEEEEEEcCC-----------CccEEEEEEEeec-CCeeeeecCCCccCCCCCCcceeeeEEEEEeCCCe
Q psy5522         722 PYSVMRGEVVAIPVVVFNYL-----------SQDLVADVTLENV-GQFDFADFSNEVDAAPQPKFEVFRRKKLTIKANSG  789 (1397)
Q Consensus       722 Pysv~rGE~~~l~~~VfNyl-----------~~~~~v~v~L~~s-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~a~~~  789 (1397)
                      |-.++.|+-+.+...|.|..           ++..+....+..+ +... +              .+.++..+.+..++.
T Consensus        24 ~~~~~vGqpi~~~l~I~~~~~W~~~~~~~~~~~~~~~~yei~a~~~~Wl-V--------------~Grrrg~f~~~~~~~   88 (147)
T PF12584_consen   24 PPPCRVGQPIPAELRIKNSRKWSSEDQEESSNEDTEFMYEIVADSDNWL-V--------------SGRRRGVFSLSDGSE   88 (147)
T ss_pred             CcceEeCCeEEEEEEEEEcccCCccccccccCCCccEEEEEecCCCcEE-E--------------eccCcceEEecCCCe
Confidence            44678999999999999952           2223333344322 2221 1              233455778888889


Q ss_pred             eEEEEEEEeceeeeEEE
Q psy5522         790 STTTFVITPKELGYIGI  806 (1397)
Q Consensus       790 ~tv~f~v~p~~lG~v~i  806 (1397)
                      ..+.+.+.|++.|.+.+
T Consensus        89 ~~~~l~LIPL~~G~L~l  105 (147)
T PF12584_consen   89 HEIPLTLIPLRAGYLPL  105 (147)
T ss_pred             EEEEEEEEecccceecC
Confidence            99999999999998764


No 126
>PF04151 PPC:  Bacterial pre-peptidase C-terminal domain;  InterPro: IPR007280 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  This domain is normally found at the C terminus of secreted archaeal and bacterial peptidases, the majority of which belong to MEROPS peptidase families M4 (vibriolysin, IPR001570 from INTERPRO), M9A amd M9B (microbial collangenase, IPR002169 from INTERPRO), M28 (aminopeptidase Ap1, IPR007484 from INTERPRO) and S8 (subtilisin family peptidases, IPR000209 from INTERPRO).; GO: 0008233 peptidase activity, 0006508 proteolysis; PDB: 4DY5_B 4DXZ_A 4DY3_B 3JQW_A 3JQX_C 1NQJ_B 1NQD_A 2O8O_A 1WMF_A 1WME_A ....
Probab=35.49  E-value=1e+02  Score=27.27  Aligned_cols=44  Identities=18%  Similarity=0.425  Sum_probs=29.3

Q ss_pred             ceEEEEECCCCCeEEeeeeeccCCce-eeeeeecCCCCCCeeEEEEEE
Q psy5522         133 ALEIFITDGKGNRIKQWTRALTTRGV-FSADLQLSKSPVLGDWNITIN  179 (1397)
Q Consensus       133 ~~~v~I~DP~g~~I~qw~~~~~~~Gi-~~~sf~L~~~~~lG~y~I~v~  179 (1397)
                      -..+.|.|++|+.+..........|. -...|..   +..|+|-|+|.
T Consensus        25 d~dl~l~~~~g~~~~~~d~~~~~~~~~~~i~~~~---~~~GtYyi~V~   69 (70)
T PF04151_consen   25 DADLYLYDSNGNSLASYDDSSQSGGNDESITFTA---PAAGTYYIRVY   69 (70)
T ss_dssp             SEEEEEEETTSSSCEECCCCTCETTSEEEEEEEE---SSSEEEEEEEE
T ss_pred             CeEEEEEcCCCCchhhheecCCCCCCccEEEEEc---CCCEEEEEEEE
Confidence            45688999999877765433322333 3445555   56799999985


No 127
>KOG0365|consensus
Probab=35.38  E-value=85  Score=36.68  Aligned_cols=73  Identities=19%  Similarity=0.338  Sum_probs=56.2

Q ss_pred             ccHHHHHHHHHHHHhccccceeeecCCCCccccCCCCCCCCcchhHHHHHHHHHccCCCccCHHHHHHHHHHHHhcccCC
Q psy5522         917 LTDAIEAKASRYLETGYQQELTYRRPDGSFSAFGTTDPNGSTWLTAFVAKSFRQAASHTTIDESVILEALAWLSSNQAVN  996 (1397)
Q Consensus       917 l~~~~~~~a~~~l~~gy~~~l~y~~~dGsfs~~~~~~~~~s~wlTAyvl~~~~~a~~~i~vd~~~~~~~~~WL~~~q~~~  996 (1397)
                      +.+.+++|-..||..++.++-.      .|..-    +.+..||-=|.+.++.....  .+|.+..++++.+|.+-|.+.
T Consensus        69 ~~~~~r~kH~~YL~~~Lr~Lp~------~~~~L----DASR~Wm~YWil~sl~lL~~--~~dd~v~~~~i~fL~~c~~Pe  136 (423)
T KOG0365|consen   69 LEPIFRQKHLMYLDKMLRQLPS------QYTCL----DASRPWMCYWILNSLALLDE--WLDDDVKENAIDFLFTCQGPE  136 (423)
T ss_pred             HHHHHHHHHHHHHHHHHHhCCC------ccccc----ccCcchhHHHHHHHHHHhcC--cCCHHHHHHHHHHHHhcCCCC
Confidence            4566688888899888755321      23322    24568999999999888775  589999999999999999999


Q ss_pred             Ccccc
Q psy5522         997 GSFPE 1001 (1397)
Q Consensus       997 G~F~s 1001 (1397)
                      |||..
T Consensus       137 GGfgG  141 (423)
T KOG0365|consen  137 GGFGG  141 (423)
T ss_pred             CCCCC
Confidence            99975


No 128
>PF05688 DUF824:  Salmonella repeat of unknown function (DUF824);  InterPro: IPR008542 This family consists of a series of repeated sequences (of around 180 residues) which are found in Salmonella typhimurium, Salmonella typhi and Escherichia coli. These repeats are almost always found with this entry. The repeats are associated with RatA and RatB, the coding sequences of which are found in the pathogeneicity island of Salmonella. The sequences may be determinants of pathogenicity [, ].
Probab=34.36  E-value=62  Score=26.75  Aligned_cols=31  Identities=16%  Similarity=0.245  Sum_probs=25.1

Q ss_pred             cccCCCeEEEEEEEEccCCCccCCCCceEEEEE
Q psy5522         323 YFKPGLKYTAYMKLTHHDGTPVTDNNNMVQVRH  355 (1397)
Q Consensus       323 ~~kpG~p~~~~v~v~~~dG~P~~~~~~~V~v~~  355 (1397)
                      ..|.|....+.|.+.|.+|+|+++.  .+.|..
T Consensus         8 kaK~Ge~I~ltVt~kda~G~pv~n~--~f~l~r   38 (47)
T PF05688_consen    8 KAKVGETIPLTVTVKDANGNPVPNA--PFTLTR   38 (47)
T ss_pred             heecCCeEEEEEEEECCCCCCcCCc--eEEEEe
Confidence            4578999999999999999999954  455543


No 129
>COG1657 SqhC Squalene cyclase [Lipid metabolism]
Probab=32.67  E-value=1e+02  Score=38.51  Aligned_cols=111  Identities=21%  Similarity=0.223  Sum_probs=66.5

Q ss_pred             hhHHHHHHHHHhhcC---------CCCchHHHHHHHHHHhhCCCCc----h-HHHHHHHhhcccccCC-cccccccCCCC
Q psy5522        1158 NTINKAVDYIVKNLA---------GTEDAYAIAICSYALHLAQHPV----K-DVAFNLLESKAHNEDG-KKWWKRAERPE 1222 (1397)
Q Consensus      1158 ~~~~~a~~~l~~~~~---------~~~~~y~~al~aYal~la~~~~----~-~~~~~~L~~~a~~~~~-~~~W~~~~~~~ 1222 (1397)
                      ..+++|+.||-+...         .+...|.-+++.+|+++.|...    . +++..-|..+-..++| .-+|.....  
T Consensus       352 ~~i~~a~e~LL~~Q~~~GsW~g~w~v~~iY~~s~a~~~l~~~g~~~~~~~~v~~~~~~l~~~~~~~~Gw~e~~~~~~~--  429 (517)
T COG1657         352 QPIERALEWLLSDQEPDGSWYGRWGVCYIYGTSGALSALALVGETDENEVLVRKLISWLVSKQMPDGGWGEAKEAISD--  429 (517)
T ss_pred             CcccHHHhhhhhhccccCceeeEEEEEEEEehhhhhhhhhccCccccchHHHHHHHHHhhhccccCCCcccccccccc--
Confidence            347788888865432         2346899999999999999887    2 4455555555444443 233433221  


Q ss_pred             CCCCCCCCCCCcccchhhHHHHHHhhccC--CcccchhhhhhhhhcccCCCCcc
Q psy5522        1223 DKKNPWAQVPNSVDVEMTSYALLSYLDRG--LVEDTLPILTWLVTQQNDQGGFA 1274 (1397)
Q Consensus      1223 ~~~~~~~~~~~s~~vE~t~Y~lL~~~~~~--~~~~a~~i~~Wl~~q~n~~Ggf~ 1274 (1397)
                          +.+....+--+=-|+.||..++...  +-+.-.++++-|..+|-+.|-..
T Consensus       430 ----~~~~~t~~sl~~~~~wal~~~~~a~~~~~~~i~~~~~~~~~~~~~~g~~~  479 (517)
T COG1657         430 ----PVYTGTESSLLVQTNWALIALLTALEPNQEAIKPGINLLVSDQEPDGSWR  479 (517)
T ss_pred             ----cccccccchhhcchhHHHHHHHHhcccchhhhcccccccccCcCCCCccc
Confidence                1111112222235778888887532  33446778888888888777543


No 130
>PF01122 Cobalamin_bind:  Eukaryotic cobalamin-binding protein;  InterPro: IPR002157 Cobalamin (Cbl or vitamin B12) is only accessible through diet in mammals. Absorption, plasma transport and cellular uptake of Cbl in mammals involves three Cbl-transporting proteins, which are listed below in order of increasing Cbl-specificity:  Haptocorrin (cobalophilin), which binds Cbl and Cbl-derivatives such as cobinamide; it may play a role in preventing the absorption of cobalamin analogues produced by bacteria. Transcobalamin (TC), which transport Cbl from blood to cells. Intrinsic factor (IF), which promotes Cbl absorption in the ileum by specific receptor-mediated endocytosis.  The structure of TC reveals a two-domain structure, an N-terminal alpha(6)-alpha(6) barrel, and a smaller C-terminal domain []. Many interactions between Cbl and its binding site in the interface of the two domains are conserved among the other Cbl transporters. Specificity for Cbl between the different transporters may reside in a beta-hairpin motif found in the smaller C-terminal domain []. ; GO: 0031419 cobalamin binding, 0015889 cobalamin transport; PDB: 3KQ4_A 2PMV_A 2BB5_A 2V3N_A 2BBC_A 2BB6_D 2V3P_A.
Probab=31.46  E-value=96  Score=36.57  Aligned_cols=87  Identities=14%  Similarity=0.178  Sum_probs=57.8

Q ss_pred             CCchHHHHHHHHHHhhCCCCchHHHHHHHhhcccccCCcccccccCCCCCCCCCCCCCCCcccchhhHHHHHHhh--ccC
Q psy5522        1174 TEDAYAIAICSYALHLAQHPVKDVAFNLLESKAHNEDGKKWWKRAERPEDKKNPWAQVPNSVDVEMTSYALLSYL--DRG 1251 (1397)
Q Consensus      1174 ~~~~y~~al~aYal~la~~~~~~~~~~~L~~~a~~~~~~~~W~~~~~~~~~~~~~~~~~~s~~vE~t~Y~lL~~~--~~~ 1251 (1397)
                      .++-|-.+|..-||.+.+.........+|-..+..+                   |+.....+|-+.|-|.||+-  .+.
T Consensus       142 ~TnyYq~sL~vLALCv~~~~~~~~~v~kL~~~~~~~-------------------~~~~~~~sVDT~AmA~LALtCv~~~  202 (326)
T PF01122_consen  142 LTNYYQYSLGVLALCVHNKRVSLSVVAKLLKAENHN-------------------FYHGSQFSVDTGAMAVLALTCVKNS  202 (326)
T ss_dssp             SSGHHHHHHHHHHHHHTTHHHHHHHHHHHHHHHHSS-------------------TSS-STCHHHHHHHHHHHHHHHHTT
T ss_pred             CCcccchHHHHHHHHccCCCcCHHHHHHHHHHHHhh-------------------cccCCCCCccHHHHHHHHHHHHhcc
Confidence            456799999999999999877655556655443311                   12223467889999999965  221


Q ss_pred             ---------Ccccc-hhhhhhhhhcccCCCCccccccC
Q psy5522        1252 ---------LVEDT-LPILTWLVTQQNDQGGFASTQMS 1279 (1397)
Q Consensus      1252 ---------~~~~a-~~i~~Wl~~q~n~~Ggf~stQdT 1279 (1397)
                               .+..| ..++++|.+||.+.|-|...=-|
T Consensus       203 ~~~~~~~~~~i~~~i~~~~~kIl~~q~~~G~~GNiyST  240 (326)
T PF01122_consen  203 NPNGPELRRRIQQAIRSLVEKILSQQKPNGLFGNIYST  240 (326)
T ss_dssp             TSTTGGGHHHHHHHHHHHHHHHHHTB-TTS-BSSTTTH
T ss_pred             CcCcHhHHHHHHHHHHHHHHHHHHhcCCCCcccchhhH
Confidence                     23333 34889999999999999876666


No 131
>PF06280 DUF1034:  Fn3-like domain (DUF1034);  InterPro: IPR010435 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain of unknown function is present in bacterial and plant peptidases belonging to MEROPS peptidase family S8 (subfamily S8A subtilisin, clan SB). It is C-terminal to and adjacent to the S8 peptidase domain and can be found in conjunction with the PA (Protease associated) domain (IPR003137 from INTERPRO) and additionally in Gram-positive bacteria with the surface protein anchor domain (IPR001899 from INTERPRO).; GO: 0004252 serine-type endopeptidase activity, 0005618 cell wall, 0016020 membrane; PDB: 3EIF_A 1XF1_B.
Probab=31.14  E-value=2.4e+02  Score=27.66  Aligned_cols=66  Identities=18%  Similarity=0.240  Sum_probs=34.9

Q ss_pred             CCEEEEEEEEEcCCCccEEEEEEEeec--------CCeeeeecCCCccCCCCCCcceeeeEEEEEeCCCeeEEEEEEEe
Q psy5522         728 GEVVAIPVVVFNYLSQDLVADVTLENV--------GQFDFADFSNEVDAAPQPKFEVFRRKKLTIKANSGSTTTFVITP  798 (1397)
Q Consensus       728 GE~~~l~~~VfNyl~~~~~v~v~L~~s--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~a~~~~tv~f~v~p  798 (1397)
                      |......++|.|+.++.+..++.-...        ++.........  .   .........+++|+||+++.+.+.|.+
T Consensus         7 ~~~~~~~itl~N~~~~~~ty~~~~~~~~t~~~~~~~~~~~~~~~~~--~---~~~~~~~~~~vTV~ag~s~~v~vti~~   80 (112)
T PF06280_consen    7 GNKFSFTITLHNYGDKPVTYTLSHVPVLTDKTDTEEGYSILVPPVP--S---ISTVSFSPDTVTVPAGQSKTVTVTITP   80 (112)
T ss_dssp             -SEEEEEEEEEE-SSS-EEEEEEEE-EEEEEE--ETTEEEEEEEE---------EEE---EEEEE-TTEEEEEEEEEE-
T ss_pred             CCceEEEEEEEECCCCCEEEEEeeEEEEeeEeeccCCccccccccc--c---eeeEEeCCCeEEECCCCEEEEEEEEEe
Confidence            566889999999999998877765510        11111000000  0   000223456889999999998888886


No 132
>PF07177 Neuralized:  Neuralized;  InterPro: IPR006573 NEUZ is a domain of unknown function found in neuralized proteins, i.e. proteins involved in the specification of the neuroblast during cellular differentiation. ; PDB: 2YUE_A 2E63_A.
Probab=30.84  E-value=1e+02  Score=27.67  Aligned_cols=28  Identities=7%  Similarity=0.124  Sum_probs=19.2

Q ss_pred             cEEEEEcCCCcccCCceeEEEEEEEcCCC
Q psy5522          98 YSVFIQLDKAIYKPGDLVRFRAIVLNSHL  126 (1397)
Q Consensus        98 ~~vfIqTDKpIYkPGqtV~fRvv~ld~~l  126 (1397)
                      ....++|+||+ ++||++++|+.-++...
T Consensus        29 ~~giVFS~rPl-~~~E~~~v~I~~~~~~w   56 (69)
T PF07177_consen   29 NNGIVFSSRPL-RIGEKFEVRIDEVEPSW   56 (69)
T ss_dssp             SS-EEEESS-B--TT-EEEEEEEEE-SSS
T ss_pred             CceEEEecCCc-cCCCEEEEEEEecCCCc
Confidence            34789999998 69999999998887643


No 133
>TIGR01451 B_ant_repeat conserved repeat domain. This model represents the conserved region of about 53 amino acids shared between regions, usually repeated, of proteins from a small number of phylogenetically distant prokaryotes. Examples include a 132-residue region found repeated in three of the five longest proteins of Bacillus anthracis, a 131-residue repeat in a cell wall-anchored protein of Enterococcus faecalis, and a 120-residue repeat in Methanobacterium thermoautotrophicum. A similar region is found in some Chlamydial outer membrane proteins.
Probab=30.76  E-value=83  Score=26.57  Aligned_cols=37  Identities=27%  Similarity=0.480  Sum_probs=27.4

Q ss_pred             cCCCcccCCceeEEEEEEEcCCCccCCCcceEEEEEC--CCCC
Q psy5522         104 LDKAIYKPGDLVRFRAIVLNSHLKPSVTGALEIFITD--GKGN  144 (1397)
Q Consensus       104 TDKpIYkPGqtV~fRvv~ld~~l~P~~~~~~~v~I~D--P~g~  144 (1397)
                      .||...+|||+|.|++...+..-.|+    -.|.|.|  |.|-
T Consensus         3 ~d~~~~~~Gd~v~Yti~v~N~g~~~a----~~v~v~D~lP~g~   41 (53)
T TIGR01451         3 VDKTVATIGDTITYTITVTNNGNVPA----TNVVVTDILPSGT   41 (53)
T ss_pred             cCccccCCCCEEEEEEEEEECCCCce----EeEEEEEcCCCCC
Confidence            58999999999999999988765554    1355555  4443


No 134
>KOG0497|consensus
Probab=30.38  E-value=54  Score=41.75  Aligned_cols=134  Identities=20%  Similarity=0.241  Sum_probs=76.7

Q ss_pred             ecCCCCcCCCCcEEEEEEEcccccCcccCcchhhccCCCchhhhh-c-ccchHHHHHHHHHHhccc---cHHHHHHHHHH
Q psy5522         854 LDMPKNIVPGSEHVEVSAVGDLLGPSIPNLANLIKMPFGCGEQNM-L-NFVPNIVVLEYLKKTYQL---TDAIEAKASRY  928 (1397)
Q Consensus       854 l~~P~~~v~gs~~~~vsv~gd~l~~~~~~l~~Ll~~P~GCgEQ~m-~-~~~p~i~~l~yL~~~~~l---~~~~~~~a~~~  928 (1397)
                      |..|....||+.+..|.-   .+-.+++-+.+ .|+|+|...=+- + -+...-++++=|.+.++-   -+++ ++|..|
T Consensus       572 L~~F~k~~p~~r~~Ei~~---~i~~av~~ie~-~Q~~DGSWyGsWgvCFtY~t~Fa~~gl~aaGkty~nc~~i-rka~~F  646 (760)
T KOG0497|consen  572 LVYFHKLFPGHRKKEIEK---SIEKAVEFIEK-LQLPDGSWYGSWGVCFTYGTWFALRGLAAAGKTYENCEAI-RKACDF  646 (760)
T ss_pred             HHhhcccCccHHHHHHHH---HHHHHHHHHHH-cCCCCCcccchhhHHHHHHHHHhcchhhhcchhhhccHHH-HHHHHH
Confidence            455666666665544331   11122333333 477888655332 1 112222333333333321   2223 355554


Q ss_pred             HHhccccceeeecCCCCccccCCC----------CCCCCcchhHHHHHHHHHccCCCccCHHHHHHHHHHHHhcccCCCc
Q psy5522         929 LETGYQQELTYRRPDGSFSAFGTT----------DPNGSTWLTAFVAKSFRQAASHTTIDESVILEALAWLSSNQAVNGS  998 (1397)
Q Consensus       929 l~~gy~~~l~y~~~dGsfs~~~~~----------~~~~s~wlTAyvl~~~~~a~~~i~vd~~~~~~~~~WL~~~q~~~G~  998 (1397)
                             ||+.|.+||+|+--..+          ..+.++--|++++.++..+++. ..|+-.+.++++-|.+.|.++|.
T Consensus       647 -------ll~~Q~~~GGWgEs~lscp~~~Yi~~~gn~s~vv~T~wAlm~Li~~~q~-~rd~~P~hr~ak~linsQ~~nGd  718 (760)
T KOG0497|consen  647 -------LLSKQNPDGGWGESYLSCPEKRYIPLEGNKSNVVQTAWALMALIMAGQA-ERDPLPLHRAAKVLINSQLENGD  718 (760)
T ss_pred             -------HHhhhcccCCCccccccCccccccccccccccchhHHHHHHHHHhcCCc-ccccchHHHHHHHHHhcccccCC
Confidence                   56689999986522111          0134566799999999987653 45666789999999999999999


Q ss_pred             cc
Q psy5522         999 FP 1000 (1397)
Q Consensus       999 F~ 1000 (1397)
                      |.
T Consensus       719 fp  720 (760)
T KOG0497|consen  719 FP  720 (760)
T ss_pred             cc
Confidence            96


No 135
>PF07233 DUF1425:  Protein of unknown function (DUF1425);  InterPro: IPR010824 This family consists of several hypothetical bacterial proteins of around 125 residues in length. Several members of this family are described as putative lipoproteins and are often known as YcfL. The function of this family is unknown.; PDB: 3O0L_A.
Probab=30.15  E-value=3e+02  Score=26.27  Aligned_cols=62  Identities=11%  Similarity=0.048  Sum_probs=37.9

Q ss_pred             cCCceeeeCCEEEEEEEEEcCCCccEEEEEEEe--ecCCeeeeecCCCccCCCCCCcceeeeEEEEEeCCCeeEEEE
Q psy5522         720 DLPYSVMRGEVVAIPVVVFNYLSQDLVADVTLE--NVGQFDFADFSNEVDAAPQPKFEVFRRKKLTIKANSGSTTTF  794 (1397)
Q Consensus       720 ~LPysv~rGE~~~l~~~VfNyl~~~~~v~v~L~--~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~a~~~~tv~f  794 (1397)
                      .=|.....++.....+.+.|-.+..+.+.-.+.  ..++++..+.             ...-+.+.|.+++..++.-
T Consensus        15 ~~~~~~~~~g~~~~~~~l~N~~~~~~~l~Yrf~WyD~~G~~v~~~-------------~~~w~~~~l~~~~~~~l~~   78 (94)
T PF07233_consen   15 ENPSTSRVNGLLRAQATLSNKSSKPLTLQYRFYWYDKQGLEVDPE-------------QSPWQSLTLPGGQTVTLSA   78 (94)
T ss_dssp             EEEEEEECCCEEEEEEEEEE-SSS-EEEEEEEEEE-TTS-EE--T-------------T---EEEEE-TT-EEEEEE
T ss_pred             eccEEEeeCCeEEEEEEEEECCCCcEEEEEEEEEECCCCCCcCCC-------------CCCCEEEEEcCCCEEEEEE
Confidence            345566779999999999999999877776665  6668875322             1235688999888765543


No 136
>PF09608 Alph_Pro_TM:  Putative transmembrane protein (Alph_Pro_TM);  InterPro: IPR019088  This entry consists of predicted transmembrane proteins of about 270 amino acids. They are found predominantly, though not exclusively, in alphaproteobacteria, generally only once in each genome. 
Probab=29.90  E-value=1.1e+02  Score=34.45  Aligned_cols=102  Identities=13%  Similarity=0.232  Sum_probs=60.7

Q ss_pred             eEeeEEEEEEecccEEEEEcCCCccc-----------CCce-eEEEEEEEcCCCc-cC-----------CCcceEEEEEC
Q psy5522          85 FYNSTGLVYVHKSYSVFIQLDKAIYK-----------PGDL-VRFRAIVLNSHLK-PS-----------VTGALEIFITD  140 (1397)
Q Consensus        85 f~~~~~v~~~~~~~~vfIqTDKpIYk-----------PGqt-V~fRvv~ld~~l~-P~-----------~~~~~~v~I~D  140 (1397)
                      +.|..++.|..-...--|-|.||+-.           .|=. ++.++-....... +.           ....-.++-.+
T Consensus        66 WiN~~~~~f~~vPsfY~vassrPl~~i~~~~~~~~~~ig~~~l~~~i~~~~~~~~~~~~~~~f~~alirlk~~~gLY~~~  145 (236)
T PF09608_consen   66 WINTDSVEFDNVPSFYAVASSRPLDEIASPEELLRLGIGLDNLRLRIRRVGSDQSDPDEQDDFREALIRLKERAGLYQEN  145 (236)
T ss_pred             EEecCceEEccCCchheeeecCCHHHhCCHHHHHhhCcCcchhhhcccccCcccCChhhHHHHHHHHHHHHHhCCCceec
Confidence            46777888888888888999999543           3322 1111000001000 00           01122334445


Q ss_pred             CCCCeEEeeeeeccCCceeeeeeecCCCCCCeeEEEEEEEc--C---eEEEEEEEEee
Q psy5522         141 GKGNRIKQWTRALTTRGVFSADLQLSKSPVLGDWNITINVL--D---QKFTKRFTVAE  193 (1397)
Q Consensus       141 P~g~~I~qw~~~~~~~Gi~~~sf~L~~~~~lG~y~I~v~~~--~---~~~~~~F~VeE  193 (1397)
                      +.+..+       .+.+.|..+++||.+.+.|.|++++..-  |   ...+..++|+.
T Consensus       146 ~~~V~~-------~~~~lFra~i~LPanvp~G~Y~v~v~l~rdG~vv~~~~~~l~V~K  196 (236)
T PF09608_consen  146 EGGVQF-------LEGTLFRARIPLPANVPPGDYTVRVYLFRDGQVVASQETPLRVRK  196 (236)
T ss_pred             CCeEEE-------cCCCeEEEEeEcCCCCCcceEEEEEEEEECCEEEEEEeeEEEEEE
Confidence            554333       4568999999999999999999999873  3   34566677654


No 137
>TIGR02186 alph_Pro_TM conserved hypothetical protein. This family consists of predicted transmembrane proteins of about 270 amino acids. Members are found, so far, only among the Alphaproteobacteria and only once in each genome.
Probab=28.74  E-value=1.2e+02  Score=34.54  Aligned_cols=51  Identities=14%  Similarity=0.292  Sum_probs=37.0

Q ss_pred             EEEECCCCCeEEeeeeeccCCceeeeeeecCCCCCCeeEEEEEEEc--C---eEEEEEEEEee
Q psy5522         136 IFITDGKGNRIKQWTRALTTRGVFSADLQLSKSPVLGDWNITINVL--D---QKFTKRFTVAE  193 (1397)
Q Consensus       136 v~I~DP~g~~I~qw~~~~~~~Gi~~~sf~L~~~~~lG~y~I~v~~~--~---~~~~~~F~VeE  193 (1397)
                      ++-.|+.|..       -.+.++|..++.||.+.+.|.|++++..-  |   ...+..|+|++
T Consensus       166 LY~~~~~gV~-------~~~~~LFra~i~LPAnvp~G~Y~v~v~L~r~G~vv~~~~t~l~V~K  221 (261)
T TIGR02186       166 LYREDAGGVT-------FISPTLFRATLRLPANVPNGTHEVRAYLFRGGVFIARTELALEIVK  221 (261)
T ss_pred             CeeecCCeEE-------EcCCceEEEeeecCCCCCCceEEEEEEEEeCCEEEEEEEeEEEEEE
Confidence            4445555532       24568899999999999999999999873  3   34566777754


No 138
>PF05751 FixH:  FixH;  InterPro: IPR008620 This family consists of several Rhizobium FixH like proteins. It has been suggested that the four proteins FixG, FixH, FixI, and FixS may participate in a membrane-bound complex coupling the FixI cation pump with a redox process catalysed by FixG [].
Probab=28.57  E-value=3e+02  Score=28.16  Aligned_cols=65  Identities=14%  Similarity=0.060  Sum_probs=36.6

Q ss_pred             EEEEEEEccCCCccCCCCceEEEEEeeeccccccceeeEeeCCCeeEEEEEeCCCCCceeEEEEEEEe
Q psy5522         331 TAYMKLTHHDGTPVTDNNNMVQVRHGFSYDESKYEANQYKLDRNGMIKLVYYPPANENVTTLGIEAEY  398 (1397)
Q Consensus       331 ~~~v~v~~~dG~P~~~~~~~V~v~~~~~~~~~~~~~~~~ttd~~G~~~f~i~~~~~~~~~~l~l~a~~  398 (1397)
                      .+.++++|++|.|+.+.  .+++.+...-+........++..++|.....+..+ ..|.+.++|....
T Consensus        70 ~~~i~~~d~~g~~~~~~--~~~l~l~rp~~~~~D~~~~l~~~~~g~y~~~~~~~-~~G~W~l~l~~~~  134 (146)
T PF05751_consen   70 SLTIRLTDPNGAPVSGA--KLTLSLYRPTDAKLDFTLTLTESAPGVYRAPVPLL-KKGRWYLRLDWEP  134 (146)
T ss_pred             eEEEEEEcCCCCcCcCc--eEEEEEECCCCccCCeeEEeeECCCceEEEEcCCC-CCccEEEEEEEec
Confidence            55777888999998743  45554422111112334455567777777666654 3456566664433


No 139
>cd05822 TLP_HIUase HIUase (5-hydroxyisourate hydrolase) catalyzes the second step in a three-step ureide pathway in which 5-hydroxyisourate (HIU), a product of the uricase (urate oxidase) reaction, is hydrolyzed to 2-oxo-4-hydroxy-4-carboxy-5-ureidoimidazoline (OHCU). HIUase has high sequence similarity with transthyretins and is a member of the transthyretin-like protein (TLP) family.   HIUase is distinguished from transthyretins by a conserved signature motif at its C-terminus that forms part of the active site.  In HIUase, this motif is YRGS, while transthyretins have a conserved TAVV sequence in the same location.  Most HIUases are cytosolic but in plants and slime molds, they are peroxisomal based on the presence of N-terminal periplasmic localization sequences.  HIUase forms a homotetramer with each subunit consisting of eight beta-strands arranged in two sheets and a short alpha-helix.  The central channel of the tetramer contains two independent binding sites, each located betw
Probab=27.94  E-value=1.9e+02  Score=28.65  Aligned_cols=60  Identities=20%  Similarity=0.301  Sum_probs=39.5

Q ss_pred             EEcC-CCccCCCcceEEEEECCCC-CeEEeeeeeccCCceeeeeeecCCCCCCeeEEEEEEEcC
Q psy5522         121 VLNS-HLKPSVTGALEIFITDGKG-NRIKQWTRALTTRGVFSADLQLSKSPVLGDWNITINVLD  182 (1397)
Q Consensus       121 ~ld~-~l~P~~~~~~~v~I~DP~g-~~I~qw~~~~~~~Gi~~~sf~L~~~~~lG~y~I~v~~~~  182 (1397)
                      +||. ..+|+.+..+.+.-.+.++ ..|.+  .+++..|-+..-+.-.+....|.|+|+...++
T Consensus         7 VLDt~~G~PAagv~V~L~~~~~~~~~~i~~--~~Td~DGR~~~~~~~~~~~~~G~Y~l~F~~~~   68 (112)
T cd05822           7 VLDTATGKPAAGVAVTLYRLDGNGWTLLAT--GVTNADGRCDDLLPPGAQLAAGTYKLTFDTGA   68 (112)
T ss_pred             EEeCCCCcccCCCEEEEEEecCCCeEEEEE--EEECCCCCccCcccccccCCCeeEEEEEEhhh
Confidence            4563 5688877788887776654 23443  46788888754333235566799999988765


No 140
>cd05790 S1_Rrp40 S1_Rrp40: Rrp40 S1-like RNA-binding domain. S1-like RNA-binding domains are found in a wide variety of RNA-associated proteins. Rrp4 protein is a subunit of the exosome complex. The exosome plays a central role in 3' to 5' RNA processing and degradation in eukarytes and archaea. Its functions include the removal of incorrectly processed RNA and the maintenance of proper levels of mRNA, rRNA and a number of small RNA species. In Saccharomyces cerevisiae, the exosome includes nine core components, six of which are homologous to bacterial RNase PH. These form a hexameric ring structure. The other three subunits (RrP4, Rrp40, and Csl4) contain an S1 RNA binding domain and are part of the "S1 pore structure".
Probab=27.89  E-value=84  Score=29.57  Aligned_cols=32  Identities=31%  Similarity=0.508  Sum_probs=27.8

Q ss_pred             CCcccCCceeEEEEEEEcCCCccCCCcceEEEEECCCC
Q psy5522         106 KAIYKPGDLVRFRAIVLNSHLKPSVTGALEIFITDGKG  143 (1397)
Q Consensus       106 KpIYkPGqtV~fRvv~ld~~l~P~~~~~~~v~I~DP~g  143 (1397)
                      ||..++||-|.-||...+.++.|      .++..|+.|
T Consensus        47 rp~L~~GDlV~ArV~~~~~~~~~------eLtc~~~~~   78 (86)
T cd05790          47 RPNLNVGDLVYARVVKANRDMEP------ELSCVDSSG   78 (86)
T ss_pred             cccCCCCCEEEEEEEecCCCCCe------EEEEeCCCC
Confidence            99999999999999998877655      588888876


No 141
>KOG1948|consensus
Probab=27.56  E-value=1.6e+03  Score=30.01  Aligned_cols=57  Identities=12%  Similarity=0.190  Sum_probs=32.6

Q ss_pred             EeeCCCeeEEEEEeCCCCCceeEEEEEEEecCcceEEEEEEeecCCCCCeEEEEecCCCCCCCCeEEEE
Q psy5522         369 YKLDRNGMIKLVYYPPANENVTTLGIEAEYLDIKEWFSTISASESPSNSFIQAALLTQNPKVNKDVELE  437 (1397)
Q Consensus       369 ~ttd~~G~~~f~i~~~~~~~~~~l~l~a~~~~~~~~~~~v~~~~S~s~s~L~i~~~~~~~~~G~~~~v~  437 (1397)
                      ..+|++|..+|.-- |.+  .  +.+.+.|.++... +.+    ||  +-|.+...++.+++-+..+|.
T Consensus       258 ~vsd~~G~fsfksv-PsG--k--Y~l~a~y~ge~~~-fdv----SP--~~l~v~Vehd~lqi~~ef~vt  314 (1165)
T KOG1948|consen  258 GVSDPRGRFSFKSV-PSG--K--YYLAASYVGEPKS-FDV----SP--NPLKVVVEHDHLQIASEFRVT  314 (1165)
T ss_pred             EEEcCCceEEEEEc-CCC--C--EEEEEEecCCceE-EEe----CC--CceeEEEeccceeccceeEEE
Confidence            45799998777643 333  4  6678888876422 222    33  344555555556655555544


No 142
>PF07705 CARDB:  CARDB;  InterPro: IPR011635 The APHP (acidic peptide-dependent hydrolases/peptidase) domain is found in a variety of different proteins.; PDB: 2KUT_A 2L0D_A 3IDU_A 2KL6_A.
Probab=27.20  E-value=5.1e+02  Score=24.04  Aligned_cols=75  Identities=17%  Similarity=0.226  Sum_probs=42.0

Q ss_pred             EecCCCCCCCCeEEEEEEe----cCCcceEEEEEEeCCeEEEEEEE-EecCCceeEEEEEeeCCCCCceeEEEEEEEecC
Q psy5522         422 ALLTQNPKVNKDVELEINS----TAPLKYISYQVLGRGDVIMADTI-TVPGNKMSTVIRFLATYAMAPTAHVIVQYVRED  496 (1397)
Q Consensus       422 ~~~~~~~~~G~~~~v~v~s----~~~~~~~~ylV~srG~Iv~~~~~-~~~~~~~~~~~~l~vt~~maP~a~l~vy~v~~~  496 (1397)
                      .........|++..|.+.=    ......+...+...|..+....+ .+..+ ....+.|.+... .|+.+-+...++++
T Consensus         9 ~~~~~~~~~g~~~~i~~~V~N~G~~~~~~~~v~~~~~~~~~~~~~i~~L~~g-~~~~v~~~~~~~-~~G~~~i~~~iD~~   86 (101)
T PF07705_consen    9 TVSPSNVVPGEPVTITVTVKNNGTADAENVTVRLYLDGNSVSTVTIPSLAPG-ESETVTFTWTPP-SPGSYTIRVVIDPD   86 (101)
T ss_dssp             EEC-SEEETTSEEEEEEEEEE-SSS-BEEEEEEEEETTEEEEEEEESEB-TT-EEEEEEEEEE-S-S-CEEEEEEEESTT
T ss_pred             eeCCCcccCCCEEEEEEEEEECCCCCCCCEEEEEEECCceeccEEECCcCCC-cEEEEEEEEEeC-CCCeEEEEEEEeeC
Confidence            3445566789988887751    12223355666777877755555 45554 455666666666 66655555556654


Q ss_pred             Ce
Q psy5522         497 GE  498 (1397)
Q Consensus       497 ge  498 (1397)
                      +.
T Consensus        87 n~   88 (101)
T PF07705_consen   87 ND   88 (101)
T ss_dssp             TS
T ss_pred             Cc
Confidence            43


No 143
>PF01273 LBP_BPI_CETP:  LBP / BPI / CETP family, N-terminal domain;  InterPro: IPR017942 This entry represents the N-terminal domain found in several lipid-binding serum glycoproteins. The N- and C-terminal domains share a similar two-layer alpha/beta structure, but they show little sequence identity. Proteins containing this N-terminal domain include:   Bactericidal permeability-increasing protein (BPI) Lipopolysaccharide-binding protein (LBP) Cholesteryl ester transfer protein (CETP) Phospholipid transfer protein (PLTP) Palate, lung and nasal epithelium carcinoma-associated protein (PLUNC)    Bactericidal permeability-increasing protein (BPI) is a potent antimicrobial protein of 456 residues that binds to and neutralises lipopolysaccharides from the outer membrane of Gram-negative bacteria []. BPI contains two domains that adopt the same structural fold, even though they have little sequence similarity [].   Lipopolysaccharide-binding protein (LBP) is an endotoxin-binding protein that is closely related to, and functions in a co-ordinated manner with BPI to facilitate an integrated host response to invading Gram-negative bacteria []. Cholesteryl ester transfer protein (CETP) is a glycoprotein that facilitates the transfer of lipids (cholesteryl esters and triglycerides) between the different lipoproteins that transport them through plasma, including HDL, LDL, VLDL and chylomicrons. These lipoproteins shield the lipids from water by encapsulating them within a coating of polar lipids and proteins [].  Phospholipid transfer protein (PLTP) exchanges phospholipids between lipoproteins and remodels high-density lipoproteins (HDLs) []. Palate, lung and nasal epithelium carcinoma-associated protein (PLUNC) is a potential host defensive protein that is secreted from the submucosal gland to the saliva and nasal lavage fluid. PLUNC appears to be a secreted product of neutrophil granules that participates in an aspect of the inflammatory response that contributes to host defence []. Short palate, lung and nasal epithelium clone 1 (SPLUNC1) may bind the lipopolysaccharide of Gram-negative nanobacteria, thereby playing an important role in the host defence of nasopharyngeal epithelium [].; GO: 0008289 lipid binding; PDB: 1EWF_A 1BP1_A 2OBD_A.
Probab=27.15  E-value=4.4e+02  Score=27.46  Aligned_cols=46  Identities=11%  Similarity=0.229  Sum_probs=36.0

Q ss_pred             EEEEEEeeecCCcEEEEEeCCCceEeeCCeEEEEEEEEEecC-cccc
Q psy5522         186 TKRFTVAEYVLPKFEVNVNVPPHATFKDSKVVISVNAKYTYG-KPVK  231 (1397)
Q Consensus       186 ~~~F~VeEYvLPkFeV~v~~p~~i~~~~~~~~v~v~AkYtyG-kPV~  231 (1397)
                      -...+|.++.+|...++..++..+...-..+.+++.++|.+. .+++
T Consensus        41 i~~i~I~~~~~~~~~l~~~p~~~~~~~~~~~~~~~~~~~~~~~~~i~   87 (164)
T PF01273_consen   41 ISNIKISNFRLPQSSLELSPDGGIVLSIIPASASLSGNWPYKGGFIK   87 (164)
T ss_dssp             EEEEEEEEEEEEEEEEEEETTTEEEEEEEEEEEEEEEEEEEEECCEE
T ss_pred             EEeEEEEEeecCCccceECCCCceEEEEeeeEEEEEEEEecCCChhh
Confidence            446889999999999999999887666667788888877776 4444


No 144
>PF12863 DUF3821:  Domain of unknown function (DUF3821);  InterPro: IPR024277 This is a domain largely confined to sequences from Methanomicrobiales. It is found in putative lipases but the function is unknown.
Probab=26.20  E-value=1.9e+02  Score=32.00  Aligned_cols=39  Identities=26%  Similarity=0.355  Sum_probs=28.1

Q ss_pred             ccCCceeEEEEEEEcCCCccC--------CCcceEEEEECCCCCeEEeee
Q psy5522         109 YKPGDLVRFRAIVLNSHLKPS--------VTGALEIFITDGKGNRIKQWT  150 (1397)
Q Consensus       109 YkPGqtV~fRvv~ld~~l~P~--------~~~~~~v~I~DP~g~~I~qw~  150 (1397)
                      -.+|+-|.||+   +.||-..        ....+.|.|++|.|.....-.
T Consensus       102 V~~G~~v~FrI---~tNL~~~~~R~g~~~~~~~v~I~V~~P~G~~~t~L~  148 (209)
T PF12863_consen  102 VPRGDNVNFRI---DTNLYSIFQRGGYTPGDGPVDIKVTTPSGATYTSLF  148 (209)
T ss_pred             eccCCeEEEEE---cccHHHHhhcCCCCCCcceEEEEEeCCCCcEEEEEE
Confidence            45789999998   6666332        123499999999998776544


No 145
>PF11974 MG1:  Alpha-2-macroglobulin MG1 domain;  InterPro: IPR021868  This is the N-terminal MG1 domain from alpha-2-macroglobulin []. 
Probab=25.97  E-value=1.3e+02  Score=28.83  Aligned_cols=45  Identities=18%  Similarity=0.171  Sum_probs=30.6

Q ss_pred             EEEEEEc-cCCCccCCCCceEEEEEeeeccccccceeeEeeCCCeeEEEEEe
Q psy5522         332 AYMKLTH-HDGTPVTDNNNMVQVRHGFSYDESKYEANQYKLDRNGMIKLVYY  382 (1397)
Q Consensus       332 ~~v~v~~-~dG~P~~~~~~~V~v~~~~~~~~~~~~~~~~ttd~~G~~~f~i~  382 (1397)
                      ..|.+++ .+|+|+++.  .|++.-   + ..+....+.+||++|.+.|...
T Consensus        15 ~~v~v~~L~tg~Pv~ga--~V~l~~---~-~~~~~l~~g~TD~~G~a~~~~~   60 (97)
T PF11974_consen   15 LLVWVTSLSTGKPVAGA--EVELYD---S-RNGQVLASGKTDADGFASFDST   60 (97)
T ss_pred             EEEEEeeCCCCCccCCC--EEEEEE---C-CCCcEeeeeeeCCCceEEecCC
Confidence            3555665 489999953  565532   0 1134667889999999998875


No 146
>cd03459 3,4-PCD Protocatechuate 3,4-dioxygenase (3,4-PCD) catalyzes the oxidative ring cleavage of 3,4-dihydroxybenzoate to produce beta-carboxy-cis,cis-muconate. 3,4-PCDs are large aggregates of 12 protomers, each composed of an alpha- and beta-subunit and an Fe3+ ion bound in the beta-subunit at the alpha-beta-subunit interface. 3,4-PCD is a member of the aromatic dioxygenases which are non-heme iron intradiol-cleaving enzymes that break the C1-C2 bond and utilize Fe3+.
Probab=25.65  E-value=2e+02  Score=30.38  Aligned_cols=49  Identities=16%  Similarity=0.273  Sum_probs=0.0

Q ss_pred             cCCCeEEEEEEEEccCCCccCCCCceEEEEEeeecccc-------------------ccceeeEeeCCCeeEEEE
Q psy5522         325 KPGLKYTAYMKLTHHDGTPVTDNNNMVQVRHGFSYDES-------------------KYEANQYKLDRNGMIKLV  380 (1397)
Q Consensus       325 kpG~p~~~~v~v~~~dG~P~~~~~~~V~v~~~~~~~~~-------------------~~~~~~~ttd~~G~~~f~  380 (1397)
                      .+|.+..+..+|+|.+|+|++  +..|.+     |+-.                   -.......||++|...|.
T Consensus        11 ~~G~~l~l~g~V~D~~g~Pv~--~A~vei-----Wqad~~G~Y~~~~~~~~~~~~~~f~~rG~~~Td~~G~~~f~   78 (158)
T cd03459          11 AIGERIILEGRVLDGDGRPVP--DALVEI-----WQADAAGRYRHPRDSHRAPLDPNFTGFGRVLTDADGRYRFR   78 (158)
T ss_pred             CCCcEEEEEEEEECCCCCCCC--CCEEEE-----EccCCCCccCCccCCcccccCCCCCceeEEEECCCCcEEEE


No 147
>TIGR02439 catechol_proteo catechol 1,2-dioxygenase, proteobacterial. Members of this family known so far are catechol 1,2-dioxygenases of the Proteobacteria. They are distinct from catechol 1,2-dioxygenases and chlorocatechol 1,2-dioxygenases of the Actinobacteria, which are quite similar to each other and resolved by separate models. This enzyme catalyzes intradiol cleavage in which catechol + O2 becomes cis,cis-muconate. Catechol is an intermediate in the catabolism of many different aromatic compounds, as is the alternative intermediate protocatechuate. In Acinetobacter lwoffii, two isozymes are present with abilities, differing somewhat, to act on catechol analogs 3-methylcatechol, 4-methylcatechol, 4-methoxycatechol, and 4-chlorocatechol.
Probab=25.44  E-value=1.7e+02  Score=33.94  Aligned_cols=22  Identities=27%  Similarity=0.472  Sum_probs=20.2

Q ss_pred             cCCCeEEEEEEEEccCCCccCC
Q psy5522         325 KPGLKYTAYMKLTHHDGTPVTD  346 (1397)
Q Consensus       325 kpG~p~~~~v~v~~~dG~P~~~  346 (1397)
                      .+|.|..+..+|+|.+|+|+++
T Consensus       124 ~~G~pl~v~G~V~D~~G~PI~g  145 (285)
T TIGR02439       124 DPGETLFLHGQVTDADGKPIAG  145 (285)
T ss_pred             CCCcEEEEEEEEECCCCCCcCC
Confidence            3699999999999999999995


No 148
>PF05688 DUF824:  Salmonella repeat of unknown function (DUF824);  InterPro: IPR008542 This family consists of a series of repeated sequences (of around 180 residues) which are found in Salmonella typhimurium, Salmonella typhi and Escherichia coli. These repeats are almost always found with this entry. The repeats are associated with RatA and RatB, the coding sequences of which are found in the pathogeneicity island of Salmonella. The sequences may be determinants of pathogenicity [, ].
Probab=25.42  E-value=1.4e+02  Score=24.74  Aligned_cols=35  Identities=17%  Similarity=0.233  Sum_probs=29.5

Q ss_pred             cccCCceeEEEEEEEcCCCccCCCcceEEEEECCC
Q psy5522         108 IYKPGDLVRFRAIVLNSHLKPSVTGALEIFITDGK  142 (1397)
Q Consensus       108 IYkPGqtV~fRvv~ld~~l~P~~~~~~~v~I~DP~  142 (1397)
                      .-|-||++.++|-+.|.+..|+.+.++.|..-|..
T Consensus         8 kaK~Ge~I~ltVt~kda~G~pv~n~~f~l~r~~~~   42 (47)
T PF05688_consen    8 KAKVGETIPLTVTVKDANGNPVPNAPFTLTRGDAK   42 (47)
T ss_pred             heecCCeEEEEEEEECCCCCCcCCceEEEEecCcc
Confidence            34789999999999999999998888888776654


No 149
>PF14874 PapD-like:  Flagellar-associated PapD-like
Probab=25.21  E-value=5.9e+02  Score=24.10  Aligned_cols=68  Identities=15%  Similarity=0.062  Sum_probs=46.4

Q ss_pred             eeeeCCEEEEEEEEEcCCCccEEEEEEEee--cCCeeeeecCCCccCCCCCCcceeeeEEEEEeCCCeeEEEEEEE-ece
Q psy5522         724 SVMRGEVVAIPVVVFNYLSQDLVADVTLEN--VGQFDFADFSNEVDAAPQPKFEVFRRKKLTIKANSGSTTTFVIT-PKE  800 (1397)
Q Consensus       724 sv~rGE~~~l~~~VfNyl~~~~~v~v~L~~--s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~a~~~~tv~f~v~-p~~  800 (1397)
                      .+..|+.....++|.|......+.++....  ++.|.+.+.                  .-.|++|....+...+. +..
T Consensus        15 ~v~~g~~~~~~v~l~N~s~~p~~f~v~~~~~~~~~~~v~~~------------------~g~l~PG~~~~~~V~~~~~~~   76 (102)
T PF14874_consen   15 NVFVGQTYSRTVTLTNTSSIPARFRVRQPESLSSFFSVEPP------------------SGFLAPGESVELEVTFSPTKP   76 (102)
T ss_pred             EEccCCEEEEEEEEEECCCCCEEEEEEeCCcCCCCEEEECC------------------CCEECCCCEEEEEEEEEeCCC
Confidence            456799999999999999998888887654  223333211                  12488888888888888 455


Q ss_pred             eeeEEEEEE
Q psy5522         801 LGYIGIKVT  809 (1397)
Q Consensus       801 lG~v~itV~  809 (1397)
                      +|...-++.
T Consensus        77 ~g~~~~~l~   85 (102)
T PF14874_consen   77 LGDYEGSLV   85 (102)
T ss_pred             CceEEEEEE
Confidence            676554443


No 150
>cd03463 3,4-PCD_alpha Protocatechuate 3,4-dioxygenase (3,4-PCD) , alpha subunit. 3,4-PCD catalyzes the oxidative ring cleavage of 3,4-dihydroxybenzoate to produce beta-carboxy-cis,cis-muconate. 3,4-PCDs are large aggregates of 12 protomers, each composed of an alpha- and beta-subunit and an Fe3+ ion bound in the beta-subunit at the alpha-subunit-beta-subunit interface. 3,4-PCD is a member of the aromatic dioxygenases which are non-heme iron intradiol-cleaving enzymes that break the C1-C2 bond and utilize Fe3+.
Probab=25.15  E-value=1.9e+02  Score=31.30  Aligned_cols=48  Identities=19%  Similarity=0.367  Sum_probs=0.0

Q ss_pred             CCCeEEEEEEEEccCCCccCCCCceEEEEEeeecccc----------cc--------ceeeEeeCCCeeEEEE
Q psy5522         326 PGLKYTAYMKLTHHDGTPVTDNNNMVQVRHGFSYDES----------KY--------EANQYKLDRNGMIKLV  380 (1397)
Q Consensus       326 pG~p~~~~v~v~~~dG~P~~~~~~~V~v~~~~~~~~~----------~~--------~~~~~ttd~~G~~~f~  380 (1397)
                      +|.+..+..+|+|.+|+|++  +..|.+     |.-.          ..        ......||++|...|.
T Consensus        33 ~G~~l~l~G~V~D~~g~Pi~--gA~Vei-----Wqad~~G~Y~~~~~~~~~~~~~f~~rGr~~TD~~G~y~F~   98 (185)
T cd03463          33 AGERITLEGRVYDGDGAPVP--DAMLEI-----WQADAAGRYAHPADSRRRLDPGFRGFGRVATDADGRFSFT   98 (185)
T ss_pred             CCCEEEEEEEEECCCCCCCC--CCEEEE-----EcCCCCCccCCcCCcccccCCCCCcEEEEEECCCCCEEEE


No 151
>PF00576 Transthyretin:  HIUase/Transthyretin family;  InterPro: IPR023416 This family includes transthyretin that is a thyroid hormone-binding protein that transports thyroxine from the bloodstream to the brain. However, most of the sequences listed in this family do not bind thyroid hormones. They are actually enzymes of the purine catabolism that catalyse the conversion of 5-hydroxyisourate (HIU) to OHCU [, ]. HIU hydrolysis is the original function of the family and is conserved from bacteria to mammals; transthyretins arose by gene duplications in the vertebrate lineage [, ]. HIUases are distinguished in the alignment from the conserved C-terminal YRGS sequence. Transthyretin (formerly prealbumin) is one of 3 thyroid hormone-binding proteins found in the blood of vertebrates []. It is produced in the liver and circulates in the bloodstream, where it binds retinol and thyroxine (T4) []. It differs from the other 2 hormone-binding proteins (T4-binding globulin and albumin) in 3 distinct ways: (1) the gene is expressed at a high rate in the brain choroid plexus; (2) it is enriched in cerebrospinal fluid; and (3) no genetically caused absence has been observed, suggesting an essential role in brain function, distinct from that played in the bloodstream []. The protein consists of around 130 amino acids, which assemble as a homotetramer that contains an internal channel in which T4 is bound. Within this complex, T4 appears to be transported across the blood-brain barrier, where, in the choroid plexus, the hormone stimulates further synthesis of transthyretin. The protein then diffuses back into the bloodstream, where it binds T4 for transport back to the brain [].; PDB: 1TFP_B 1KGJ_D 1IE4_C 1GKE_C 1KGI_D 2H0J_B 2H0E_B 2H0F_B 1ZD6_A 3DGD_D ....
Probab=25.12  E-value=2.1e+02  Score=28.35  Aligned_cols=62  Identities=18%  Similarity=0.204  Sum_probs=38.4

Q ss_pred             EEcC-CCccCCCcceEEEEECCCCCeEEeeeeeccCCceeeeeeecCCCCCCeeEEEEEEEcC
Q psy5522         121 VLNS-HLKPSVTGALEIFITDGKGNRIKQWTRALTTRGVFSADLQLSKSPVLGDWNITINVLD  182 (1397)
Q Consensus       121 ~ld~-~l~P~~~~~~~v~I~DP~g~~I~qw~~~~~~~Gi~~~sf~L~~~~~lG~y~I~v~~~~  182 (1397)
                      +||. ..+|+.+..+.+.-.++.+....--...++..|-+...+.-......|.|+|+...++
T Consensus         7 VLDt~~G~PA~gv~V~L~~~~~~~~~~~l~~~~Td~DGR~~~~~~~~~~~~~G~Y~l~F~~~~   69 (112)
T PF00576_consen    7 VLDTTTGKPAAGVPVTLYRLDSDGSWTLLAEGVTDADGRIKQPLLEGESLEPGIYKLVFDTGD   69 (112)
T ss_dssp             EEETTTTEE-TT-EEEEEEEETTSCEEEEEEEEBETTSEESSTSSETTTS-SEEEEEEEEHHH
T ss_pred             EeeCCCCCCccCCEEEEEEecCCCCcEEEEEEEECCCCcccccccccccccceEEEEEEEHHH
Confidence            4554 4688877888887777544333323456888888753333356667899999887754


No 152
>PF01122 Cobalamin_bind:  Eukaryotic cobalamin-binding protein;  InterPro: IPR002157 Cobalamin (Cbl or vitamin B12) is only accessible through diet in mammals. Absorption, plasma transport and cellular uptake of Cbl in mammals involves three Cbl-transporting proteins, which are listed below in order of increasing Cbl-specificity:  Haptocorrin (cobalophilin), which binds Cbl and Cbl-derivatives such as cobinamide; it may play a role in preventing the absorption of cobalamin analogues produced by bacteria. Transcobalamin (TC), which transport Cbl from blood to cells. Intrinsic factor (IF), which promotes Cbl absorption in the ileum by specific receptor-mediated endocytosis.  The structure of TC reveals a two-domain structure, an N-terminal alpha(6)-alpha(6) barrel, and a smaller C-terminal domain []. Many interactions between Cbl and its binding site in the interface of the two domains are conserved among the other Cbl transporters. Specificity for Cbl between the different transporters may reside in a beta-hairpin motif found in the smaller C-terminal domain []. ; GO: 0031419 cobalamin binding, 0015889 cobalamin transport; PDB: 3KQ4_A 2PMV_A 2BB5_A 2V3N_A 2BBC_A 2BB6_D 2V3P_A.
Probab=25.10  E-value=88  Score=36.90  Aligned_cols=76  Identities=22%  Similarity=0.254  Sum_probs=48.6

Q ss_pred             HHHHHHHHhccccceeeecCCCCccccCCCCCCCCcchhHHHHHHHHHccCCCccCHHHHHHHHHHHHhcccCCCccccc
Q psy5522         923 AKASRYLETGYQQELTYRRPDGSFSAFGTTDPNGSTWLTAFVAKSFRQAASHTTIDESVILEALAWLSSNQAVNGSFPEV 1002 (1397)
Q Consensus       923 ~~a~~~l~~gy~~~l~y~~~dGsfs~~~~~~~~~s~wlTAyvl~~~~~a~~~i~vd~~~~~~~~~WL~~~q~~~G~F~st 1002 (1397)
                      .++++.+.+   +++.-|.+||.|         ||++-|+-|+.+|..+..++.--...-.++++||.++. .+|.|...
T Consensus       214 ~~~i~~~~~---kIl~~q~~~G~~---------GNiySTglAmQAL~~~~~~~~~~~w~C~k~~~~ll~~i-~~G~F~nP  280 (326)
T PF01122_consen  214 QQAIRSLVE---KILSQQKPNGLF---------GNIYSTGLAMQALSVSPSPPSESEWNCQKALDALLKEI-SQGAFQNP  280 (326)
T ss_dssp             HHHHHHHHH---HHHHTB-TTS-B---------SSTTTHHHHHHHHTT-SS-SSHHHHHHHHHHHHHHHHH-TTTTT-SH
T ss_pred             HHHHHHHHH---HHHHhcCCCCcc---------cchhhHHHHHHHHhcCCCCCcchhhHHHHHHHHHHHHh-hcCCCCCH
Confidence            455554444   566678889976         47888999999999887652211244677899998764 59999864


Q ss_pred             hH---hHHHhhc
Q psy5522        1003 GK---VSHADMQ 1011 (1397)
Q Consensus      1003 qd---vl~al~~ 1011 (1397)
                      -.   ||.+|..
T Consensus       281 ~a~aQiLPaL~g  292 (326)
T PF01122_consen  281 MAIAQILPALNG  292 (326)
T ss_dssp             HHHHHHHHHHTT
T ss_pred             HHHHHHHHHHcC
Confidence            33   4666655


No 153
>COG4935 Regulatory P domain of the subtilisin-like proprotein convertases and other proteases [Posttranslational modification, protein turnover, chaperones]
Probab=24.98  E-value=5.4e+02  Score=27.59  Aligned_cols=53  Identities=21%  Similarity=0.334  Sum_probs=35.9

Q ss_pred             CCcceEEEEECCCCCeEEeeee---ec--cCCceeeeee---ecCCCCCCeeEEEEEEEcC
Q psy5522         130 VTGALEIFITDGKGNRIKQWTR---AL--TTRGVFSADL---QLSKSPVLGDWNITINVLD  182 (1397)
Q Consensus       130 ~~~~~~v~I~DP~g~~I~qw~~---~~--~~~Gi~~~sf---~L~~~~~lG~y~I~v~~~~  182 (1397)
                      +-+-+.|++..|.|..|..-+.   ..  ..+|....+|   ++=.+++.|+|++.+.-..
T Consensus        50 yrGDL~I~L~sP~g~~Il~~~~~~~~~D~~~q~~~~~s~~l~~~~ge~~~G~W~L~V~D~~  110 (177)
T COG4935          50 YRGDLRITLISPSGQGILSNLTLGRRYDDDSQGYNDQSFMLVQLWGERAEGNWRLEVQDLA  110 (177)
T ss_pred             ccccEEEEEeCCCCcceEEeeccCCccccccceeeeeeeeeeeeccCCCCceEEEEEEecC
Confidence            4567788888998655543221   12  2456777777   5567788999999998754


No 154
>PRK12813 flgD flagellar basal body rod modification protein; Reviewed
Probab=23.96  E-value=2.1e+02  Score=32.01  Aligned_cols=48  Identities=13%  Similarity=0.199  Sum_probs=34.0

Q ss_pred             CcceEEEEECCCCCeEEeeeeeccCCceeeeeee-cCCC---CCCeeEEEEEEEc
Q psy5522         131 TGALEIFITDGKGNRIKQWTRALTTRGVFSADLQ-LSKS---PVLGDWNITINVL  181 (1397)
Q Consensus       131 ~~~~~v~I~DP~g~~I~qw~~~~~~~Gi~~~sf~-L~~~---~~lG~y~I~v~~~  181 (1397)
                      -..+.|.|.|.+|+.|+....   ..|....++. ...+   .+-|.|+|.|...
T Consensus       123 a~~v~v~I~D~~G~vV~t~~~---~~G~~~f~WDG~d~~G~~l~~G~Yt~~V~A~  174 (223)
T PRK12813        123 ADKAELVVRDAAGAEVARETV---PVGAGPVEWAGEDADGNPLPNGAYSFVVESY  174 (223)
T ss_pred             CceEEEEEEcCCCCEEEEEee---CCCceeEEeCCcCCCCCcCCCccEEEEEEEE
Confidence            467899999999999987542   4566666665 2222   4469999998764


No 155
>TIGR01451 B_ant_repeat conserved repeat domain. This model represents the conserved region of about 53 amino acids shared between regions, usually repeated, of proteins from a small number of phylogenetically distant prokaryotes. Examples include a 132-residue region found repeated in three of the five longest proteins of Bacillus anthracis, a 131-residue repeat in a cell wall-anchored protein of Enterococcus faecalis, and a 120-residue repeat in Methanobacterium thermoautotrophicum. A similar region is found in some Chlamydial outer membrane proteins.
Probab=23.80  E-value=2.2e+02  Score=24.05  Aligned_cols=40  Identities=15%  Similarity=0.021  Sum_probs=30.9

Q ss_pred             CceeeeCCEEEEEEEEEcCCCccEE-EEEEEeecCCeeeee
Q psy5522         722 PYSVMRGEVVAIPVVVFNYLSQDLV-ADVTLENVGQFDFAD  761 (1397)
Q Consensus       722 Pysv~rGE~~~l~~~VfNyl~~~~~-v~v~L~~s~~~~~~~  761 (1397)
                      +..+.+||.+.-.++|.|..+.... |.|+-....++.+++
T Consensus         5 ~~~~~~Gd~v~Yti~v~N~g~~~a~~v~v~D~lP~g~~~v~   45 (53)
T TIGR01451         5 KTVATIGDTITYTITVTNNGNVPATNVVVTDILPSGTTFVS   45 (53)
T ss_pred             ccccCCCCEEEEEEEEEECCCCceEeEEEEEcCCCCCEEEe
Confidence            4567899999999999999887654 666655666777764


No 156
>TIGR02438 catachol_actin catechol 1,2-dioxygenase, Actinobacterial. Members of this family are catechol 1,2-dioxygenases of the Actinobacteria. They are more closely related to actinobacterial chlorocatechol 1,2-dioxygenases than to proteobacterial catechol 1,2-dioxygenases, and so are built in this separate model. The member from Rhodococcus rhodochrous NCIMB 13259 (GB|AAC33003.1) is described as a homodimer with bound Fe, similarly active on catechol, 3-methylcatechol and 4-methylcatechol.
Probab=23.46  E-value=2e+02  Score=33.37  Aligned_cols=23  Identities=17%  Similarity=0.222  Sum_probs=20.7

Q ss_pred             cCCCeEEEEEEEEccCCCccCCC
Q psy5522         325 KPGLKYTAYMKLTHHDGTPVTDN  347 (1397)
Q Consensus       325 kpG~p~~~~v~v~~~dG~P~~~~  347 (1397)
                      .+|.|..+..+|+|.+|+|+++.
T Consensus       128 ~~G~pl~v~G~V~D~~G~Pv~gA  150 (281)
T TIGR02438       128 EAGTPLVFSGQVTDLDGNGLAGA  150 (281)
T ss_pred             CCCCEEEEEEEEEcCCCCCcCCC
Confidence            47999999999999999999953


No 157
>cd03458 Catechol_intradiol_dioxygenases Catechol intradiol dioxygenases can be divided into several subgroups according to their substrate specificity for catechol, chlorocatechols and hydroxyquinols. Almost all members of this family are homodimers containing one ferric ion (Fe3+) per monomer. They belong to the intradiol dioxygenase family, a family of mononuclear non-heme iron intradiol-cleaving enzymes that catalyze the oxygenation of catecholates to aliphatic acids via the cleavage of aromatic rings.
Probab=23.23  E-value=2e+02  Score=32.78  Aligned_cols=22  Identities=23%  Similarity=0.309  Sum_probs=20.6

Q ss_pred             cCCCeEEEEEEEEccCCCccCC
Q psy5522         325 KPGLKYTAYMKLTHHDGTPVTD  346 (1397)
Q Consensus       325 kpG~p~~~~v~v~~~dG~P~~~  346 (1397)
                      .+|.|..+..+|+|.+|+|+++
T Consensus       100 ~~G~~l~l~G~V~D~~G~Pv~~  121 (256)
T cd03458         100 ADGEPLFVHGTVTDTDGKPLAG  121 (256)
T ss_pred             CCCcEEEEEEEEEcCCCCCCCC
Confidence            6799999999999999999995


No 158
>PRK15036 hydroxyisourate hydrolase; Provisional
Probab=23.21  E-value=3.6e+02  Score=27.73  Aligned_cols=64  Identities=16%  Similarity=0.235  Sum_probs=41.6

Q ss_pred             EEEEEEEcC-CCccCCCcceEEEEECCCC-CeEEeeeeeccCCceeeeeeecCCCCCCeeEEEEEEEcC
Q psy5522         116 RFRAIVLNS-HLKPSVTGALEIFITDGKG-NRIKQWTRALTTRGVFSADLQLSKSPVLGDWNITINVLD  182 (1397)
Q Consensus       116 ~fRvv~ld~-~l~P~~~~~~~v~I~DP~g-~~I~qw~~~~~~~Gi~~~sf~L~~~~~lG~y~I~v~~~~  182 (1397)
                      .++.-++|. .-+|+.+-.+.++-.++++ ..+.+  ..++..|-+..-+. +++...|.|+|+...++
T Consensus        28 ~Is~HVLDt~~G~PA~gV~V~L~~~~~~~w~~l~~--~~Td~dGR~~~l~~-~~~~~~G~Y~L~F~t~~   93 (137)
T PRK15036         28 ILSVHILNQQTGKPAADVTVTLEKKADNGWLQLNT--AKTDKDGRIKALWP-EQTATTGDYRVVFKTGD   93 (137)
T ss_pred             CeEEEEEeCCCCcCCCCCEEEEEEccCCceEEEEE--EEECCCCCCccccC-cccCCCeeEEEEEEcch
Confidence            477777886 4689877777776554432 22322  35788888865222 33356899999998765


No 159
>COG5266 CbiK ABC-type Co2+ transport system, periplasmic component [Inorganic ion transport and metabolism]
Probab=23.20  E-value=2.1e+02  Score=32.34  Aligned_cols=57  Identities=16%  Similarity=0.188  Sum_probs=35.9

Q ss_pred             cCCcccCCCeEEEEEEEEccCCCccCCCCceEEEEEee-----eccc---cccceeeEeeCCCeeEEEEE
Q psy5522         320 SSEYFKPGLKYTAYMKLTHHDGTPVTDNNNMVQVRHGF-----SYDE---SKYEANQYKLDRNGMIKLVY  381 (1397)
Q Consensus       320 ~~~~~kpG~p~~~~v~v~~~dG~P~~~~~~~V~v~~~~-----~~~~---~~~~~~~~ttd~~G~~~f~i  381 (1397)
                      .|...+.|.+|.++|.  + +|+|+++  .+|.+...+     +|-.   .+.......||++|.+.|..
T Consensus       164 nP~~a~~ge~f~~~vl--~-~GkPv~n--A~V~v~~~n~~~~d~~a~~~~~ek~~~~~~TD~kG~~~fip  228 (264)
T COG5266         164 NPANAWVGEVFRGKVL--D-NGKPVPN--ATVEVEFDNIDTKDNRAKTGNTEKTALVQFTDDKGEVSFIP  228 (264)
T ss_pred             CccccccCCeEEEEEE--E-CCccCCC--cEEEEEEecccccccccccCCCCCcceEEEcCCCceEEEEE
Confidence            3556778999987663  3 3999995  466665321     1211   12334567899999877764


No 160
>cd03462 1,2-CCD chlorocatechol 1,2-dioxygenases (1,2-CCDs) (type II enzymes) are homodimeric intradiol dioxygenases that degrade chlorocatechols via the addition of molecular oxygen and the subsequent cleavage between two adjacent hydroxyl groups. This reaction is part of the modified ortho-cleavage pathway which is a central oxidative bacterial pathway that channels chlorocatechols, derived from the degradation of chlorinated benzoic acids, phenoxyacetic acids, phenols, benzenes, and other aromatics into the energy-generating tricarboxylic acid pathway.
Probab=23.15  E-value=2.1e+02  Score=32.48  Aligned_cols=22  Identities=14%  Similarity=0.092  Sum_probs=20.2

Q ss_pred             cCCCeEEEEEEEEccCCCccCC
Q psy5522         325 KPGLKYTAYMKLTHHDGTPVTD  346 (1397)
Q Consensus       325 kpG~p~~~~v~v~~~dG~P~~~  346 (1397)
                      .+|.+..+.-+|+|.+|+|+++
T Consensus        95 ~~G~~l~l~G~V~D~~G~Pv~~  116 (247)
T cd03462          95 DDHKPLLFRGTVKDLAGAPVAG  116 (247)
T ss_pred             CCCCEEEEEEEEEcCCCCCcCC
Confidence            5699999999999999999995


No 161
>PF05506 DUF756:  Domain of unknown function (DUF756);  InterPro: IPR008475 This domain is found, normally as a tandem repeat, at the C terminus of bacterial phospholipase C proteins.; GO: 0004629 phospholipase C activity, 0016042 lipid catabolic process
Probab=23.00  E-value=4e+02  Score=24.87  Aligned_cols=58  Identities=16%  Similarity=0.083  Sum_probs=39.1

Q ss_pred             EEEEEEEcCCCccEEEEEEEeecCCeeeeecCCCccCCCCCCcceeeeEEEEEeCCCeeEEEEEEEeceeeeEEEEEEE
Q psy5522         732 AIPVVVFNYLSQDLVADVTLENVGQFDFADFSNEVDAAPQPKFEVFRRKKLTIKANSGSTTTFVITPKELGYIGIKVTA  810 (1397)
Q Consensus       732 ~l~~~VfNyl~~~~~v~v~L~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~a~~~~tv~f~v~p~~lG~v~itV~A  810 (1397)
                      .|.+++.|-....+.++|.-  . .+                 ......++.|++|+..++.|++ ...-|--.|+|++
T Consensus        21 ~l~l~l~N~g~~~~~~~v~~--~-~y-----------------~~~~~~~~~v~ag~~~~~~w~l-~~s~gwYDl~v~~   78 (89)
T PF05506_consen   21 NLRLTLSNPGSAAVTFTVYD--N-AY-----------------GGGGPWTYTVAAGQTVSLTWPL-AASGGWYDLTVTG   78 (89)
T ss_pred             EEEEEEEeCCCCcEEEEEEe--C-Cc-----------------CCCCCEEEEECCCCEEEEEEee-cCCCCcEEEEEEc
Confidence            67777788766666555542  1 11                 0122568899999999999998 5566777777776


No 162
>cd05469 Transthyretin_like Transthyretin_like.  This domain is present in the transthyretin-like protein (TLP) family which includes transthyretin (TTR) and a transthyretin-related protein called 5-hydroxyisourate hydrolase (HIUase).  TTR and HIUase are homotetrameric proteins with each subunit consisting of eight beta-strands arranged in two sheets and a short alpha-helix. The central channel of the tetramer contains two independent binding sites, each located between a pair of subunits. TTR transports thyroid hormones and retinol in the blood serum of vertebrates while HIUase catalyzes the second step in a three-step ureide pathway. TTRs are highly conserved and found only in vertebrates while the HIUases are found in a wide range of bacterial, plant, fungal, slime mold and vertebrate organisms.
Probab=22.95  E-value=4.2e+02  Score=26.33  Aligned_cols=59  Identities=14%  Similarity=0.265  Sum_probs=37.2

Q ss_pred             EEcC-CCccCCCcceEEEEECCCC--CeEEeeeeeccCCceeeeeeecCCCCCCeeEEEEEEEcC
Q psy5522         121 VLNS-HLKPSVTGALEIFITDGKG--NRIKQWTRALTTRGVFSADLQLSKSPVLGDWNITINVLD  182 (1397)
Q Consensus       121 ~ld~-~l~P~~~~~~~v~I~DP~g--~~I~qw~~~~~~~Gi~~~sf~L~~~~~lG~y~I~v~~~~  182 (1397)
                      +||. ..+|+.+..+.+.-.++.+  ..|.+  ..+++.|-+..-+. .++...|.|+|+...++
T Consensus         7 VLDt~~G~PAagv~V~L~~~~~~~~w~~l~~--~~Tn~DGR~~~~l~-~~~~~~G~Y~l~F~t~~   68 (113)
T cd05469           7 VLDAVRGSPAANVAIKVFRKTADGSWEIFAT--GKTNEDGELHGLIT-EEEFXAGVYRVEFDTKS   68 (113)
T ss_pred             EEeCCCCccCCCCEEEEEEecCCCceEEEEE--EEECCCCCccCccc-cccccceEEEEEEehHH
Confidence            4554 4578777788887776532  23332  45778888743222 45556799999887764


No 163
>cd03460 1,2-CTD Catechol 1,2 dioxygenase (1,2-CTD) catalyzes an intradiol cleavage reaction of catechol to form cis,cis-muconate. 1,2-CTDs is homodimers with one catalytic non-heme ferric ion per monomer. They belong to the aromatic dioxygenase family, a family of mononuclear non-heme iron intradiol-cleaving enzymes that catalyze the oxygenation of catecholates to aliphatic acids via the cleavage of aromatic rings.
Probab=22.95  E-value=2e+02  Score=33.37  Aligned_cols=22  Identities=27%  Similarity=0.371  Sum_probs=20.2

Q ss_pred             cCCCeEEEEEEEEccCCCccCC
Q psy5522         325 KPGLKYTAYMKLTHHDGTPVTD  346 (1397)
Q Consensus       325 kpG~p~~~~v~v~~~dG~P~~~  346 (1397)
                      .+|.|..+..+|+|.+|+|+++
T Consensus       120 ~~Gepl~l~G~V~D~~G~PI~~  141 (282)
T cd03460         120 DDGETLVMHGTVTDTDGKPVPG  141 (282)
T ss_pred             CCCCEEEEEEEEECCCCCCcCC
Confidence            4799999999999999999995


No 164
>PF14782 BBS2_C:  Ciliary BBSome complex subunit 2, C-terminal
Probab=22.81  E-value=2.7e+02  Score=34.38  Aligned_cols=81  Identities=17%  Similarity=0.325  Sum_probs=58.1

Q ss_pred             EEEEEeCCCeeEEEEEEEeceeeeEEEEEEEEeCCCCceeEEEEEecCCCeeeEEEEEEEEecCCCceeEEEEEecCCCC
Q psy5522         780 KKLTIKANSGSTTTFVITPKELGYIGIKVTATSNLAGDSMEGKLLVKPEGETQYKNKAIFVDLRKNKTFSVNVTLDMPKN  859 (1397)
Q Consensus       780 ~~~~v~a~~~~tv~f~v~p~~lG~v~itV~A~~~~~~D~v~k~L~V~peG~~~~~t~s~~l~~~~~~~~~~~i~l~~P~~  859 (1397)
                      ....+|++-.-.+.|.+.+ ..|.+.+.|++.    .|+++|.++|..||+-..+++.  .++.. .+.+-.+.|..|+|
T Consensus        93 ~~~~ipa~T~l~~~l~~n~-~~~~~eL~vst~----ndtiIr~vlIfaEglF~gEs~v--~~~~~-pss~l~ipL~~~kd  164 (431)
T PF14782_consen   93 DTGVIPANTQLQVSLSVNP-ETGCVELVVSTS----NDTIIRAVLIFAEGLFEGESHV--PSPQN-PSSTLRIPLRPPKD  164 (431)
T ss_pred             cCCccCCCCEEEEEEeeCC-cCCcEEEEEEeC----CCcEEEEEEEEeccccCCcccc--ccCCC-CCCeEEEEeeCCCC
Confidence            3557888988889998888 778888888885    3999999999999996665554  44432 23456677777765


Q ss_pred             cCCCCcEEEEEE
Q psy5522         860 IVPGSEHVEVSA  871 (1397)
Q Consensus       860 ~v~gs~~~~vsv  871 (1397)
                      .   ...+.+.+
T Consensus       165 ~---~vdL~iKv  173 (431)
T PF14782_consen  165 V---PVDLHIKV  173 (431)
T ss_pred             C---ceEEEEEE
Confidence            4   23455554


No 165
>TIGR02962 hdxy_isourate hydroxyisourate hydrolase. Members of this family, hydroxyisourate hydrolase, represent a distinct clade of transthyretin-related proteins. Bacterial members typically are encoded next to ureidoglycolate hydrolase and often near either xanthine dehydrogenase or xanthine/uracil permease genes and have been demonstrated to have hydroxyisourate hydrolase activity. In eukaryotes, a clade separate from the transthyretins (a family of thyroid-hormone binding proteins) has also been shown to have HIU hydrolase activity in urate catabolizing organisms. Transthyretin, then, would appear to be the recently diverged paralog of the more ancient HIUH family.
Probab=22.27  E-value=4.2e+02  Score=26.27  Aligned_cols=60  Identities=18%  Similarity=0.297  Sum_probs=37.5

Q ss_pred             EEcC-CCccCCCcceEEEEECCCC-CeEEeeeeeccCCceeeeeeecCCCCCCeeEEEEEEEcC
Q psy5522         121 VLNS-HLKPSVTGALEIFITDGKG-NRIKQWTRALTTRGVFSADLQLSKSPVLGDWNITINVLD  182 (1397)
Q Consensus       121 ~ld~-~l~P~~~~~~~v~I~DP~g-~~I~qw~~~~~~~Gi~~~sf~L~~~~~lG~y~I~v~~~~  182 (1397)
                      +||. ..+|+.+..+.++-.++++ ..|.  ..+++..|-+..-+.-.+....|.|+|+...++
T Consensus         7 VLDt~~G~PAagv~V~L~~~~~~~~~~i~--~~~Tn~DGR~~~~l~~~~~~~~G~Y~l~F~~g~   68 (112)
T TIGR02962         7 VLDTTSGKPAAGVPVTLYRLDGSGWTPLA--EGVTNADGRCPDLLPEGETLAAGIYKLRFDTGD   68 (112)
T ss_pred             EEeCCCCccCCCCEEEEEEecCCCeEEEE--EEEECCCCCCcCcccCcccCCCeeEEEEEEhhh
Confidence            4663 5688777777777666543 2232  346788888764333233345699999987764


No 166
>COG1689 Uncharacterized protein conserved in archaea [Function unknown]
Probab=22.10  E-value=76  Score=34.69  Aligned_cols=66  Identities=12%  Similarity=0.152  Sum_probs=46.2

Q ss_pred             HHHHHHHHHHhccccceeeecCCCCccccCCCCCCCCcchhHHHHHHHHHccCCCccCHHHHHHHHHHHHhc-ccCCCcc
Q psy5522         921 IEAKASRYLETGYQQELTYRRPDGSFSAFGTTDPNGSTWLTAFVAKSFRQAASHTTIDESVILEALAWLSSN-QAVNGSF  999 (1397)
Q Consensus       921 ~~~~a~~~l~~gy~~~l~y~~~dGsfs~~~~~~~~~s~wlTAyvl~~~~~a~~~i~vd~~~~~~~~~WL~~~-q~~~G~F  999 (1397)
                      ..++.++|++.       -||.||+|....+-  +.++-=|=||++.+...+    .|....++.+.||.+| |...++|
T Consensus         7 ~l~rvi~fi~~-------RrhedGGy~f~~~L--pdti~~TyYAi~i~s~lg----~evPr~EktiefL~d~~qt~~~~~   73 (274)
T COG1689           7 NLNRVIEFIEK-------RRHEDGGYCFVSQL--PDTINDTYYAIKIYSLLG----HEVPRKEKTIEFLYDQMQTAGVGV   73 (274)
T ss_pred             hHHHHHHHHHH-------hhcCCCCeEEeccC--cchhhhhhhhhhhhhhcC----CcCchHHHHHHHHHHHHHHhhhHH
Confidence            45677788876       48999999766442  223466889999988765    3556678999999874 3444444


No 167
>PF07718 Coatamer_beta_C:  Coatomer beta C-terminal region;  InterPro: IPR011710 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits.  This entry represents the C-terminal domain of the beta subunit from coatomer proteins (Beta-coat proteins). The C-terminal domain probably adapts the function of the N-terminal IPR002553 from INTERPRO domain. Coatomer protein complex I (COPI)-coated vesicles are involved in transport between the endoplasmic reticulum and the Golgi but also participate in transport from early to late endosomes within the endocytic pathway [].  More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030126 COPI vesicle coat
Probab=21.90  E-value=2.6e+02  Score=28.83  Aligned_cols=67  Identities=15%  Similarity=0.281  Sum_probs=48.2

Q ss_pred             eEEEEecCCceeeeCCEEEEEEEEEcCCCccE-EEEEEEeecCCeeeeecCCCccCCCCCCcceeeeEEEEEeCCCeeEE
Q psy5522         714 PFFISLDLPYSVMRGEVVAIPVVVFNYLSQDL-VADVTLENVGQFDFADFSNEVDAAPQPKFEVFRRKKLTIKANSGSTT  792 (1397)
Q Consensus       714 pffv~l~LPysv~rGE~~~l~~~VfNyl~~~~-~v~v~L~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~a~~~~tv  792 (1397)
                      |.|++..+=   +-.=.+.|.+.+.|..+..+ .+++.|....+++++.                +.+.+++.|++...+
T Consensus        57 PvYaEA~v~---v~q~DIvLDvllvNqT~~tLqNl~vElat~gdLklve----------------~p~~~tL~P~~~~~i  117 (140)
T PF07718_consen   57 PVYAEAYVT---VHQYDIVLDVLLVNQTNETLQNLTVELATLGDLKLVE----------------RPQPITLAPHGFARI  117 (140)
T ss_pred             CeEEEEEEE---EEeeeEEEEEEEEeCChhhhhcEEEEEEecCCcEEcc----------------CCCceeeCCCcEEEE
Confidence            555554432   22335678899999998765 4999999999988752                234678999999988


Q ss_pred             EEEEEec
Q psy5522         793 TFVITPK  799 (1397)
Q Consensus       793 ~f~v~p~  799 (1397)
                      ...|+..
T Consensus       118 ~~~iKVs  124 (140)
T PF07718_consen  118 KATIKVS  124 (140)
T ss_pred             EEEEEEE
Confidence            8887754


No 168
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=21.84  E-value=5.1e+02  Score=24.59  Aligned_cols=47  Identities=17%  Similarity=0.327  Sum_probs=31.0

Q ss_pred             ceEEEEECCCCCeEEeeeeeccCCceeeeeeecCCCCCCeeEEEEEEEcC
Q psy5522         133 ALEIFITDGKGNRIKQWTRALTTRGVFSADLQLSKSPVLGDWNITINVLD  182 (1397)
Q Consensus       133 ~~~v~I~DP~g~~I~qw~~~~~~~Gi~~~sf~L~~~~~lG~y~I~v~~~~  182 (1397)
                      .+.+.+.-.+|..+-.|..-....+.+.+.   |..-..|.|+|++...+
T Consensus        31 ~lty~~~~~~~~~lP~Wl~fd~~~~~~~Gt---P~~~~~g~~~i~v~a~D   77 (97)
T smart00736       31 TLTYSATLSDGSALPSWLSFDSDTGTLSGT---PTNSDVGSLSLKVTATD   77 (97)
T ss_pred             eEEEEEEeCCCCCCCCeEEEeCCCCEEEEE---CCCCCCcEEEEEEEEEE
Confidence            444444433444445688777778888883   44444799999998865


No 169
>cd05703 S1_Rrp5_repeat_hs12_sc9 S1_Rrp5_repeat_hs12_sc9: Rrp5 is a trans-acting factor important for biogenesis of both the 40S and 60S eukaryotic ribosomal subunits. Rrp5 has two distinct regions, an N-terminal region containing tandemly repeated S1 RNA-binding domains (12 S1 repeats in Saccharomyces cerevisiae Rrp5 and 14 S1 repeats in Homo sapiens Rrp5) and a C-terminal region containing tetratricopeptide repeat (TPR) motifs thought to be involved in protein-protein interactions.  Mutational studies have shown that each region represents a specific functional domain. Deletions within the S1-containing region inhibit pre-rRNA processing at either site A3 or A2, whereas deletions within the TPR region confer an inability to support cleavage of A0-A2. This CD includes H. sapiens S1 repeat 12 (hs12) and S. cerevisiae S1 repeat 9 (sc9). Rrp5 is found in eukaryotes but not in prokaryotes or archaea.
Probab=21.71  E-value=1.2e+02  Score=27.31  Aligned_cols=18  Identities=6%  Similarity=0.287  Sum_probs=16.1

Q ss_pred             cccCCceeEEEEEEEcCC
Q psy5522         108 IYKPGDLVRFRAIVLNSH  125 (1397)
Q Consensus       108 IYkPGqtV~fRvv~ld~~  125 (1397)
                      .|++||.|.++++.+|.+
T Consensus        46 ~~~vG~~v~~kV~~id~~   63 (73)
T cd05703          46 KFPIGQALKAKVVGVDKE   63 (73)
T ss_pred             hCCCCCEEEEEEEEEeCC
Confidence            499999999999998864


No 170
>PF01345 DUF11:  Domain of unknown function DUF11;  InterPro: IPR001434 This group of sequences is represented by a conserved region of about 53 amino acids shared between regions, usually repeated, of proteins from a small number of phylogenetically distant prokaryotes. Examples include a 132-residue region found repeated in three of the five longest proteins of Bacillus anthracis, a 131-residue repeat in a cell wall-anchored protein of Enterococcus faecalis (Streptococcus faecalis), and a 120-residue repeat in Methanobacterium thermoautotrophicum. A similar region is found in some Chlamydia trachomatis outer membrane proteins.  In C. trachomatis, three cysteine-rich proteins (also believed to be lipoproteins), MOMP, OMP6 and OMP3, make up the extracellular matrix of the outer membrane []. They are involved in the essential structural integrity of both the elementary body (EB) and recticulate body (RB) phase. They are thought to be involved in porin formation and, as these bacteria lack the peptidoglycan layer common to most Gram-negative microbes, such proteins are highly important in the pathogenicity of the organism.; GO: 0005727 extrachromosomal circular DNA
Probab=20.84  E-value=2.1e+02  Score=25.75  Aligned_cols=40  Identities=20%  Similarity=0.142  Sum_probs=31.8

Q ss_pred             CCceeeeCCEEEEEEEEEcCCCccEE-EEEEEeecCCeeee
Q psy5522         721 LPYSVMRGEVVAIPVVVFNYLSQDLV-ADVTLENVGQFDFA  760 (1397)
Q Consensus       721 LPysv~rGE~~~l~~~VfNyl~~~~~-v~v~L~~s~~~~~~  760 (1397)
                      -+..+..||.+...++|.|..+.... |.|+-....+++++
T Consensus        33 ~~~~~~~Gd~v~ytitvtN~G~~~a~nv~v~D~lp~g~~~v   73 (76)
T PF01345_consen   33 NPSTANPGDTVTYTITVTNTGPAPATNVVVTDTLPAGLTFV   73 (76)
T ss_pred             CCCcccCCCEEEEEEEEEECCCCeeEeEEEEEcCCCCCEEe
Confidence            36778999999999999999988765 66666566677665


No 171
>COG2372 CopC Uncharacterized protein, homolog of Cu resistance protein CopC [General function prediction only]
Probab=20.77  E-value=2.2e+02  Score=28.73  Aligned_cols=58  Identities=12%  Similarity=0.137  Sum_probs=43.0

Q ss_pred             EeCCeEEEEEEEEecCCceeEEEEEeeCCCCCceeEEEEEEEe-cCCeEEEEeEEEEEcc
Q psy5522         452 LGRGDVIMADTITVPGNKMSTVIRFLATYAMAPTAHVIVQYVR-EDGEVVADGLDLELEG  510 (1397)
Q Consensus       452 ~srG~Iv~~~~~~~~~~~~~~~~~l~vt~~maP~a~l~vy~v~-~~gevvads~~~~V~~  510 (1397)
                      ...|.-+..+..+... +....++++....+.+..+.+.+-+. .||.++..++.|.|..
T Consensus        68 ~~d~~~v~t~~~~~~~-~~~~~l~v~l~~~L~aG~Y~v~WrvvS~DGH~v~G~~sFsV~~  126 (127)
T COG2372          68 GPDGEEVATAGTKLDE-QNHTQLEVPLPQPLKAGVYTVDWRVVSSDGHVVKGSISFSVGA  126 (127)
T ss_pred             CCCCCccccCcccccc-cCCcEEEecCcccCCCCcEEEEEEEEecCCcEeccEEEEEecC
Confidence            3445555444444432 23467999999999999999999865 5899999999999875


Done!