Query         psy5669
Match_columns 116
No_of_seqs    119 out of 1066
Neff          7.0 
Searched_HMMs 46136
Date          Fri Aug 16 23:33:17 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy5669.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/5669hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PF00201 UDPGT:  UDP-glucoronos 100.0 2.9E-31 6.4E-36  214.7   3.0  104    2-105   397-500 (500)
  2 PHA03392 egt ecdysteroid UDP-g  99.9 1.8E-22   4E-27  166.0  10.5   78    2-79    420-498 (507)
  3 KOG1192|consensus               99.5 3.7E-13 8.1E-18  108.6  10.0   70    4-75    412-481 (496)
  4 TIGR01426 MGT glycosyltransfer  95.2    0.05 1.1E-06   42.9   5.7   45    2-48    347-391 (392)
  5 PLN02670 transferase, transfer  95.2   0.055 1.2E-06   44.9   5.9   47    2-49    417-466 (472)
  6 COG1819 Glycosyl transferases,  94.2    0.16 3.4E-06   41.2   6.3   48    2-51    356-403 (406)
  7 PLN02554 UDP-glycosyltransfera  90.8    0.92   2E-05   37.5   6.6   30    2-31    428-458 (481)
  8 PF15050 SCIMP:  SCIMP protein   88.7     1.2 2.5E-05   30.9   4.7   47   62-111     1-48  (133)
  9 PLN02562 UDP-glycosyltransfera  87.1     2.1 4.5E-05   35.2   6.1   30    2-31    401-430 (448)
 10 PF07214 DUF1418:  Protein of u  83.9     3.4 7.3E-05   27.4   4.8   34   83-116    60-93  (96)
 11 PLN02207 UDP-glycosyltransfera  83.3     1.4   3E-05   36.6   3.5   30    2-31    414-445 (468)
 12 PLN03004 UDP-glycosyltransfera  81.3     2.2 4.7E-05   35.3   3.9   28    2-29    412-439 (451)
 13 PLN03007 UDP-glucosyltransfera  77.4       3 6.6E-05   34.5   3.6   30    2-31    428-460 (482)
 14 cd03784 GT1_Gtf_like This fami  76.2     5.1 0.00011   31.4   4.4   41    2-45    360-400 (401)
 15 PF07069 PRRSV_2b:  Porcine rep  73.3      11 0.00024   23.1   4.4   34   69-102    26-61  (73)
 16 PF05478 Prominin:  Prominin;    71.2      12 0.00026   33.1   5.9   36   63-98     85-120 (806)
 17 PF02038 ATP1G1_PLM_MAT8:  ATP1  70.8       7 0.00015   22.9   3.0   28   64-91      8-35  (50)
 18 PF13524 Glyco_trans_1_2:  Glyc  70.1      14 0.00031   22.7   4.7   42    2-43     50-91  (92)
 19 PLN02208 glycosyltransferase f  70.0      16 0.00034   30.1   6.0   43    2-45    389-436 (442)
 20 cd07912 Tweety_N N-terminal do  70.0      22 0.00048   29.3   6.9   13   39-51      4-16  (418)
 21 PF07319 DnaI_N:  Primosomal pr  68.0       2 4.4E-05   27.9   0.4   32    1-32      1-32  (94)
 22 PRK00726 murG undecaprenyldiph  64.8      19 0.00041   27.7   5.3   43    2-45    312-354 (357)
 23 PF11628 TCR_zetazeta:  T-cell   63.8      15 0.00033   19.6   3.2   19   67-85      5-23  (33)
 24 PLN02863 UDP-glucoronosyl/UDP-  63.6      22 0.00048   29.5   5.8   48    2-49    421-472 (477)
 25 PLN00414 glycosyltransferase f  63.1      27 0.00059   28.8   6.1   47    2-49    390-441 (446)
 26 PLN02167 UDP-glycosyltransfera  62.4      10 0.00022   31.4   3.5   29    2-30    422-451 (475)
 27 PLN02448 UDP-glycosyltransfera  62.3      10 0.00022   31.1   3.5   30    2-31    403-437 (459)
 28 PF03401 TctC:  Tripartite tric  60.5      11 0.00023   28.8   3.2   46    2-49    220-265 (274)
 29 PF11057 Cortexin:  Cortexin of  59.6      16 0.00034   23.3   3.2   23   73-95     31-53  (81)
 30 PLN02152 indole-3-acetate beta  59.5      12 0.00026   31.0   3.5   30    2-31    405-436 (455)
 31 PF05478 Prominin:  Prominin;    59.0      13 0.00029   32.9   3.9   29   73-102    99-127 (806)
 32 cd03820 GT1_amsD_like This fam  57.5      22 0.00049   25.7   4.4   41    2-43    307-347 (348)
 33 PLN02210 UDP-glucosyl transfer  56.8      15 0.00033   30.3   3.6   28    2-29    403-433 (456)
 34 PF09929 DUF2161:  Uncharacteri  56.7     8.4 0.00018   26.5   1.8   26   16-41     56-82  (118)
 35 PRK00025 lpxB lipid-A-disaccha  56.7      42 0.00092   25.9   6.0   46    2-47    329-375 (380)
 36 PLN02410 UDP-glucoronosyl/UDP-  56.7      15 0.00032   30.4   3.5   29    2-30    398-429 (451)
 37 PF11346 DUF3149:  Protein of u  56.0      35 0.00076   19.1   4.6   35   64-98      3-37  (42)
 38 PF05186 Dpy-30:  Dpy-30 motif;  55.8      34 0.00074   19.0   3.9   22   21-42     15-36  (42)
 39 PLN02764 glycosyltransferase f  53.9      45 0.00098   27.7   6.0   49    2-51    395-448 (453)
 40 PF03213 Pox_P35:  Poxvirus P35  50.8      33 0.00072   27.5   4.5   39   36-78    258-297 (325)
 41 COG0133 TrpB Tryptophan syntha  50.5      43 0.00093   27.3   5.1   44    2-46     26-69  (396)
 42 PF15013 CCSMST1:  CCSMST1 fami  50.4      14  0.0003   23.5   1.9   19   55-73     21-39  (77)
 43 cd03801 GT1_YqgM_like This fam  49.0      57  0.0012   23.5   5.4   44    2-45    329-372 (374)
 44 PF05454 DAG1:  Dystroglycan (D  47.6     6.2 0.00013   31.1   0.0   20   22-41     70-89  (290)
 45 KOG4331|consensus               47.0      34 0.00073   30.8   4.4   38   63-100    98-135 (865)
 46 PHA03105 EEV glycoprotein; Pro  46.2      21 0.00045   26.0   2.5   24   81-105    17-40  (188)
 47 KOG4433|consensus               46.2 1.2E+02  0.0026   25.8   7.3   65   35-104     4-76  (526)
 48 PLN00164 glucosyltransferase;   45.5      24 0.00052   29.3   3.1   30    2-31    419-453 (480)
 49 PTZ00234 variable surface prot  45.4      17 0.00037   30.1   2.3   31   83-113   377-407 (433)
 50 COG2011 AbcD ABC-type metal io  45.2      40 0.00087   25.6   4.0   34   65-98    186-219 (222)
 51 PF07219 HemY_N:  HemY protein   43.9      54  0.0012   21.4   4.2   23   74-96     20-42  (108)
 52 cd08806 CARD_CARD14_CARMA2 Cas  43.6      82  0.0018   20.4   4.7   38    7-44     35-75  (86)
 53 cd03822 GT1_ecORF704_like This  43.4      95  0.0021   22.9   5.9   43    2-45    322-364 (366)
 54 PF05510 Sarcoglycan_2:  Sarcog  42.7      40 0.00087   27.7   3.9   21   55-76    269-289 (386)
 55 PF04328 DUF466:  Protein of un  42.3      79  0.0017   19.2   4.8   44    2-47     10-53  (65)
 56 PF01601 Corona_S2:  Coronaviru  42.3     8.4 0.00018   33.3   0.0   33   60-92    544-576 (610)
 57 PF10661 EssA:  WXG100 protein   42.2      46 0.00099   23.4   3.7   14   81-94    130-143 (145)
 58 cd03794 GT1_wbuB_like This fam  40.8      46   0.001   24.4   3.8   39    2-40    353-391 (394)
 59 cd03821 GT1_Bme6_like This fam  40.1      62  0.0013   23.7   4.4   41    2-42    333-373 (375)
 60 PF01102 Glycophorin_A:  Glycop  40.0      67  0.0014   22.1   4.2   10   28-37     41-50  (122)
 61 PRK08939 primosomal protein Dn  39.8      42 0.00092   26.2   3.6   32    1-32      1-32  (306)
 62 PLN02992 coniferyl-alcohol glu  39.6      91   0.002   26.1   5.7   28    2-29    415-445 (481)
 63 KOG3989|consensus               39.6      45 0.00097   25.3   3.6   11   41-51    162-172 (224)
 64 PRK10907 intramembrane serine   39.6 1.5E+02  0.0033   23.0   6.6   31   12-42     39-69  (276)
 65 PF03579 SHP:  Small hydrophobi  39.4      87  0.0019   18.9   4.6    9   64-72     14-22  (64)
 66 PF14851 FAM176:  FAM176 family  37.7      57  0.0012   23.3   3.7    6   90-95     42-47  (153)
 67 PF06345 Drf_DAD:  DRF Autoregu  37.6      18 0.00038   15.7   0.7   10    1-10      3-12  (15)
 68 PF04906 Tweety:  Tweety;  Inte  37.6 1.6E+02  0.0036   24.0   6.8   18   86-103    34-51  (406)
 69 PHA02688 ORF059 IMV protein VP  37.5      52  0.0011   26.4   3.8   40   36-79    256-296 (323)
 70 TIGR03798 ocin_TIGR03798 bacte  37.4      37  0.0008   20.2   2.4   24    1-24      1-24  (64)
 71 PF14979 TMEM52:  Transmembrane  37.3      86  0.0019   22.4   4.5   15   70-84     20-34  (154)
 72 PHA02754 hypothetical protein;  37.2      55  0.0012   19.9   3.0   27    6-32      5-31  (67)
 73 PF01102 Glycophorin_A:  Glycop  37.1      66  0.0014   22.1   3.8    7   73-79     67-73  (122)
 74 PRK13028 tryptophan synthase s  36.7      45 0.00099   27.2   3.4   38    2-40     32-69  (402)
 75 smart00394 RIIa RIIalpha, Regu  36.4      70  0.0015   16.9   3.6   24   25-48     12-35  (38)
 76 PF07856 Orai-1:  Mediator of C  36.4      42 0.00091   24.4   2.9   56   30-85     93-153 (175)
 77 PF03889 DUF331:  Domain of unk  36.2      26 0.00056   19.4   1.3   19    5-23      5-23  (39)
 78 PF08374 Protocadherin:  Protoc  36.1      38 0.00082   25.7   2.7    7   43-49      8-14  (221)
 79 cd00904 Ferritin Ferritin iron  36.1      52  0.0011   23.0   3.3   38   20-57     35-72  (160)
 80 PF06837 Fijivirus_P9-2:  Fijiv  36.0      94   0.002   23.2   4.6   66   30-101    43-110 (214)
 81 PRK04346 tryptophan synthase s  34.9      53  0.0011   26.8   3.6   37    2-39     28-64  (397)
 82 PRK11024 colicin uptake protei  34.7      33 0.00072   23.5   2.1   14   69-82     20-33  (141)
 83 PLN02173 UDP-glucosyl transfer  34.7      52  0.0011   27.2   3.5   28    2-29    396-426 (449)
 84 cd03814 GT1_like_2 This family  34.7 1.5E+02  0.0033   21.7   5.8   42    2-44    320-361 (364)
 85 PRK14750 kdpF potassium-transp  34.5      72  0.0016   16.5   3.3   22   69-90      2-23  (29)
 86 PF05225 HTH_psq:  helix-turn-h  34.3      63  0.0014   17.9   2.8   23    2-24      3-26  (45)
 87 PF05568 ASFV_J13L:  African sw  34.3      98  0.0021   22.3   4.4   11   63-73     25-35  (189)
 88 PF02197 RIIa:  Regulatory subu  34.1      80  0.0017   16.9   3.2   24   25-48     12-35  (38)
 89 PRK12446 undecaprenyldiphospho  33.7      82  0.0018   24.8   4.4   37    2-44    313-350 (352)
 90 cd03808 GT1_cap1E_like This fa  31.9   1E+02  0.0022   22.3   4.4   39    2-40    317-355 (359)
 91 TIGR02801 tolR TolR protein. T  31.8      44 0.00096   22.3   2.3   13   69-81     10-22  (129)
 92 TIGR02804 ExbD_2 TonB system t  31.7      47   0.001   22.1   2.4   13   69-81      9-21  (121)
 93 PF12626 PolyA_pol_arg_C:  Poly  31.7      54  0.0012   22.4   2.7   38    9-47     55-92  (124)
 94 PF11417 Inhibitor_G39P:  Loade  31.6      56  0.0012   20.2   2.5   31    6-36     34-64  (71)
 95 TIGR03147 cyt_nit_nrfF cytochr  31.5 1.8E+02  0.0039   20.1   5.2   30   33-65     74-103 (126)
 96 PHA02849 putative transmembran  31.4   1E+02  0.0022   19.7   3.7   20   72-91     19-38  (82)
 97 PHA00646 hypothetical protein   31.2 1.2E+02  0.0026   18.5   3.8   25   69-93     39-63  (65)
 98 PF09959 DUF2193:  Uncharacteri  31.1 1.5E+02  0.0032   24.9   5.4   40    7-52     93-134 (499)
 99 PRK11267 biopolymer transport   30.9      45 0.00097   22.9   2.2   14   69-82     23-36  (141)
100 cd04962 GT1_like_5 This family  30.8 1.8E+02   0.004   21.8   5.8   45    2-46    324-368 (371)
101 PF14851 FAM176:  FAM176 family  30.7 1.1E+02  0.0025   21.8   4.3   16   85-100    33-48  (153)
102 PRK13718 conjugal transfer pro  30.6 1.5E+02  0.0032   19.1   4.3   19   89-107    61-79  (84)
103 cd03807 GT1_WbnK_like This fam  30.3 1.5E+02  0.0033   21.4   5.2   42    2-43    320-361 (365)
104 PF05148 Methyltransf_8:  Hypot  30.1 1.6E+02  0.0034   22.4   5.1   38    7-44     30-68  (219)
105 PRK13608 diacylglycerol glucos  29.7 2.8E+02  0.0061   21.9   7.1   44    2-46    326-369 (391)
106 PF00957 Synaptobrevin:  Synapt  29.4      88  0.0019   19.5   3.3   11   64-74     62-72  (89)
107 cd03795 GT1_like_4 This family  29.3      78  0.0017   23.5   3.5   25    2-26    320-344 (357)
108 PRK13803 bifunctional phosphor  28.6      65  0.0014   27.7   3.2   37    2-39    241-277 (610)
109 PHA02657 hypothetical protein;  28.6   1E+02  0.0022   20.1   3.4   23   67-89     24-46  (95)
110 PF10854 DUF2649:  Protein of u  28.6 1.4E+02  0.0031   18.2   4.3   33   60-92     32-64  (67)
111 PRK05749 3-deoxy-D-manno-octul  28.3 1.9E+02   0.004   22.9   5.7   43    2-44    376-418 (425)
112 PF11157 DUF2937:  Protein of u  28.3   1E+02  0.0022   22.1   3.8   26    4-29     58-85  (167)
113 PF04716 ETC_C1_NDUFA5:  ETC co  28.2      68  0.0015   19.0   2.4   23    3-25     13-35  (57)
114 COG3036 Uncharacterized protei  27.8      76  0.0016   19.5   2.5   20    6-25     15-34  (66)
115 PRK13609 diacylglycerol glucos  27.7 1.7E+02  0.0037   22.6   5.2   43    2-45    326-368 (380)
116 TIGR03449 mycothiol_MshA UDP-N  27.5 2.6E+02  0.0057   21.6   6.3   44    2-46    356-399 (405)
117 cd03818 GT1_ExpC_like This fam  27.3 1.2E+02  0.0027   23.6   4.4   25    2-26    354-378 (396)
118 PF04316 FlgM:  Anti-sigma-28 f  26.9      72  0.0016   18.5   2.4   25    5-29     30-54  (57)
119 TIGR02803 ExbD_1 TonB system t  26.8      62  0.0013   21.5   2.3   14   69-82      9-22  (122)
120 PF10577 UPF0560:  Uncharacteri  26.7 1.1E+02  0.0025   27.5   4.4    8   93-100   295-302 (807)
121 COG4455 ImpE Protein of avirul  26.4   1E+02  0.0022   23.9   3.6   35    1-35      1-35  (273)
122 cd03792 GT1_Trehalose_phosphor  26.3 2.3E+02  0.0051   21.7   5.8   45    2-46    325-369 (372)
123 KOG4482|consensus               26.2 1.7E+02  0.0036   24.4   4.9    7  105-111   330-336 (449)
124 PF06917 Pectate_lyase_2:  Peri  26.1      93   0.002   26.7   3.6   38    4-42     77-114 (557)
125 COG0848 ExbD Biopolymer transp  26.0      66  0.0014   22.3   2.3   16   69-84     21-36  (137)
126 PF08388 GIIM:  Group II intron  25.9 1.2E+02  0.0027   18.1   3.4    9   63-71     34-42  (80)
127 TIGR01133 murG undecaprenyldip  25.4 1.1E+02  0.0023   23.1   3.7   26    2-27    309-334 (348)
128 PTZ00443 Thioredoxin domain-co  24.7 1.6E+02  0.0035   22.0   4.4    6  105-110   211-216 (224)
129 PHA02902 putative IMV membrane  24.7 1.6E+02  0.0034   18.2   3.5   11   70-80      3-13  (70)
130 PF12122 DUF3582:  Protein of u  24.7      58  0.0013   21.5   1.8   31   12-42     41-71  (101)
131 PLN02618 tryptophan synthase,   24.6      98  0.0021   25.4   3.5   43    2-45     36-78  (410)
132 PF00558 Vpu:  Vpu protein;  In  24.3 1.3E+02  0.0028   19.3   3.3   16   69-84      3-18  (81)
133 PF06637 PV-1:  PV-1 protein (P  24.0 2.5E+02  0.0055   23.3   5.6   13   51-63      8-20  (442)
134 PF02723 NS3_envE:  Non-structu  23.9   2E+02  0.0043   18.4   4.1   10   89-98     38-47  (82)
135 PF08390 TRAM1:  TRAM1-like pro  23.8 1.7E+02  0.0037   17.5   4.6   21   63-83     20-40  (65)
136 TIGR02149 glgA_Coryne glycogen  23.6 3.2E+02  0.0069   20.8   6.1   44    2-45    340-383 (388)
137 PRK04125 murein hydrolase regu  23.6   2E+02  0.0043   20.2   4.4   11   86-96    108-118 (141)
138 PF00988 CPSase_sm_chain:  Carb  23.5      34 0.00073   23.9   0.5   11    9-19     38-48  (131)
139 PF15179 Myc_target_1:  Myc tar  23.5 1.9E+02   0.004   21.6   4.4   21   61-81      9-31  (197)
140 cd03809 GT1_mtfB_like This fam  23.5 1.8E+02  0.0039   21.3   4.5   37    2-39    324-360 (365)
141 TIGR00263 trpB tryptophan synt  23.2      89  0.0019   25.1   2.9   38    1-39     19-56  (385)
142 PHA02851 EEV glycoprotein; Pro  23.2      79  0.0017   23.1   2.4   15   88-102    20-34  (223)
143 TIGR00215 lpxB lipid-A-disacch  23.1 1.8E+02  0.0038   23.2   4.6   40    2-41    335-380 (385)
144 PLN02555 limonoid glucosyltran  22.9 1.1E+02  0.0024   25.5   3.5   29    2-30    417-448 (480)
145 PF07782 DC_STAMP:  DC-STAMP-li  22.8 2.5E+02  0.0054   20.2   5.0   45   61-105   141-185 (191)
146 PF12877 DUF3827:  Domain of un  22.7      57  0.0012   28.7   1.8    9   73-81    276-284 (684)
147 cd02646 R3H_G-patch R3H domain  22.3      71  0.0015   18.6   1.7   39    1-49      1-39  (58)
148 KOG3814|consensus               21.7 2.1E+02  0.0045   24.0   4.7   37   15-51    406-442 (531)
149 TIGR03088 stp2 sugar transfera  21.5 3.4E+02  0.0073   20.6   5.8   44    2-45    326-369 (374)
150 cd03825 GT1_wcfI_like This fam  21.4 2.9E+02  0.0062   20.5   5.3   19    2-20    318-336 (365)
151 PHA03054 IMV membrane protein;  21.3 2.2E+02  0.0048   17.8   7.2   25   26-50     10-34  (72)
152 cd04951 GT1_WbdM_like This fam  21.1 2.8E+02   0.006   20.5   5.1   42    2-44    314-356 (360)
153 PF02526 GBP_repeat:  Glycophor  21.0 1.6E+02  0.0034   15.9   3.5   24    8-31     10-33  (38)
154 PF04659 Arch_fla_DE:  Archaeal  21.0      77  0.0017   20.9   1.8   19   36-54     18-36  (99)
155 PRK10591 hypothetical protein;  21.0 2.6E+02  0.0056   18.4   4.4   12   84-95     61-72  (92)
156 PF06667 PspB:  Phage shock pro  20.8 2.3E+02   0.005   17.8   4.1   21   71-91      4-24  (75)
157 PF12834 Phage_int_SAM_2:  Phag  20.8 2.4E+02  0.0052   18.0   6.0   49    1-49      1-60  (91)
158 PRK10635 bacterioferritin; Pro  20.7 2.9E+02  0.0063   19.4   4.9   55    3-57      6-73  (158)
159 PRK10307 putative glycosyl tra  20.5 3.2E+02  0.0068   21.3   5.5   43    2-44    361-403 (412)
160 cd01056 Euk_Ferritin eukaryoti  20.4      82  0.0018   21.9   2.0   38   20-57     35-72  (161)
161 PTZ00046 rifin; Provisional     20.2 2.1E+02  0.0045   23.4   4.4    6   92-97    335-340 (358)

No 1  
>PF00201 UDPGT:  UDP-glucoronosyl and UDP-glucosyl transferase;  InterPro: IPR002213 UDP glycosyltransferases (UGT) are a superfamily of enzymes that catalyzes the addition of the glycosyl group from a UTP-sugar to a small hydrophobic molecule. This family currently consist of:  Mammalian UDP-glucuronosyl transferases (2.4.1.17 from EC) (UDPGT) []. A large family of membrane-bound microsomal enzymes which catalyze the transfer of glucuronic acid to a wide variety of exogenous and endogenous lipophilic substrates. These enzymes are of major importance in the detoxification and subsequent elimination of xenobiotics such as drugs and carcinogens. A large number of putative UDPGT from Caenorhabditis elegans. Mammalian 2-hydroxyacylsphingosine 1-beta-galactosyltransferase [] (2.4.1.45 from EC) (also known as UDP-galactose-ceramide galactosyltransferase). This enzyme catalyzes the transfer of galactose to ceramide, a key enzymatic step in the biosynthesis of galactocerebrosides, which are abundant sphingolipids of the myelin membrane of the central nervous system and peripheral nervous system. Plants flavonol O(3)-glucosyltransferase (2.4.1.91 from EC). An enzyme [] that catalyzes the transfer of glucose from UDP-glucose to a flavanol. This reaction is essential and one of the last steps in anthocyanin pigment biosynthesis. Baculoviruses ecdysteroid UDP-glucosyltransferase (2.4.1 from EC) [] (egt). This enzyme catalyzes the transfer of glucose from UDP-glucose to ectysteroids which are insect molting hormones. The expression of egt in the insect host interferes with the normal insect development by blocking the molting process. Prokaryotic zeaxanthin glucosyltransferase (2.4.1 from EC) (gene crtX), an enzyme involved in carotenoid biosynthesis and that catalyses the glycosylation reaction which converts zeaxanthin to zeaxanthin-beta-diglucoside. Streptomyces macrolide glycosyltransferases (2.4.1 from EC) []. These enzymes specifically inactivates macrolide anitibiotics via 2'-O-glycosylation using UDP-glucose.  These enzymes share a conserved domain of about 50 amino acid residues located in their C-terminal section.; GO: 0016758 transferase activity, transferring hexosyl groups, 0008152 metabolic process; PDB: 3HBJ_A 3HBF_A 2PQ6_A 3IA7_B 3RSC_A 3IAA_B 2IYA_A 2IYF_B 2O6L_A 2VCH_A ....
Probab=99.96  E-value=2.9e-31  Score=214.66  Aligned_cols=104  Identities=38%  Similarity=0.596  Sum_probs=27.0

Q ss_pred             chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHhCCCCCCcccCCCCCHHHHHHHHHHHHHHHHHH
Q psy5669           2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSIEYVIAHRGAPHLKTKATSLPWYQTSQLDVIASCTLGLG   81 (116)
Q Consensus         2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wiE~v~r~~g~~~l~~~~~~l~~~q~~~lDV~~~l~~v~~   81 (116)
                      +++.+||++||+|++|++||+++|++++|||++|+|+|+||||||+||||++||++++.+|||+|||+|||+++++++++
T Consensus       397 ~~l~~ai~~vl~~~~y~~~a~~ls~~~~~~p~~p~~~~~~~ie~v~~~~~~~~l~~~~~~l~~~~~~~lDv~~~~~~~~~  476 (500)
T PF00201_consen  397 EELRAAIREVLENPSYKENAKRLSSLFRDRPISPLERAVWWIEYVARHGGAPHLRSPARDLSFYQYYLLDVIAFLLLIIL  476 (500)
T ss_dssp             HHHHHHHHHHHHSHHHHHHHHHHHHTTT----------------------------------------------------
T ss_pred             HHHHHHHHHHHhhhHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHhcCCCcccCChhhcCCHHHHHHHHHHHHHHHHHH
Confidence            57899999999999999999999999999999999999999999999999999999999999999999999999888888


Q ss_pred             HHHHHHHHHHHHHhhhhccccccc
Q psy5669          82 LVLYTLARIGTSFAEFTRGKDDKK  105 (116)
Q Consensus        82 ~~~~~~~~~~~~~~~~~~~~~~~k  105 (116)
                      ++++++++++++++|+++++++||
T Consensus       477 ~~~~~~~~~~~~~~~~~~~~~~k~  500 (500)
T PF00201_consen  477 LIIYIIFKICRFVCRKCVKKKKKK  500 (500)
T ss_dssp             ------------------------
T ss_pred             HHHHHHHHHHHHHHHHHhcccCCC
Confidence            888999999999999998777664


No 2  
>PHA03392 egt ecdysteroid UDP-glucosyltransferase; Provisional
Probab=99.88  E-value=1.8e-22  Score=166.04  Aligned_cols=78  Identities=26%  Similarity=0.382  Sum_probs=73.9

Q ss_pred             chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHhC-CCCCCcccCCCCCHHHHHHHHHHHHHHHH
Q psy5669           2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSIEYVIAHR-GAPHLKTKATSLPWYQTSQLDVIASCTLG   79 (116)
Q Consensus         2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wiE~v~r~~-g~~~l~~~~~~l~~~q~~~lDV~~~l~~v   79 (116)
                      ++|.+||+++++||+|++||+++++.++++|.+|.|.|+||||||+||+ |++||++++.+|||+|||+|||+++++++
T Consensus       420 ~~l~~ai~~vl~~~~y~~~a~~ls~~~~~~p~~~~~~av~~iE~v~r~~~g~~~lr~~~~~l~~~qy~~lDv~~~~~~~  498 (507)
T PHA03392        420 AQLVLAIVDVIENPKYRKNLKELRHLIRHQPMTPLHKAIWYTEHVIRNKHGNTSLKTKAANVSYSDYFMSYILVPLVTF  498 (507)
T ss_pred             HHHHHHHHHHhCCHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHhCCCCcccccccccCCCHHHHHHHHHHHHHHHH
Confidence            5789999999999999999999999999999999999999999999999 99999999999999999999998666444


No 3  
>KOG1192|consensus
Probab=99.46  E-value=3.7e-13  Score=108.64  Aligned_cols=70  Identities=34%  Similarity=0.570  Sum_probs=67.6

Q ss_pred             HHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHhCCCCCCcccCCCCCHHHHHHHHHHHH
Q psy5669           4 SRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSIEYVIAHRGAPHLKTKATSLPWYQTSQLDVIAS   75 (116)
Q Consensus         4 l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wiE~v~r~~g~~~l~~~~~~l~~~q~~~lDV~~~   75 (116)
                      +..|+.+++++++|++++++++.+++|||.+| +.+++|+||+.+++++.+++.. .+++|++|+++|++.+
T Consensus       412 ~~~~~~~il~~~~y~~~~~~l~~~~~~~p~~~-~~~~~~~e~~~~~~~~~~l~~~-~~~~~~~~~~~d~~~~  481 (496)
T KOG1192|consen  412 LLEAIKEILENEEYKEAAKRLSEILRDQPISP-ELAVKWVEFVARHGGAKHLKEA-AHLSFIEYGSLDVIAF  481 (496)
T ss_pred             HHHHHHHHHcChHHHHHHHHHHHHHHcCCCCH-HHHHHHHHHHHhcCCCcccCcc-ccCChhhhhhhHHHHH
Confidence            67899999999999999999999999999999 9999999999999999999998 9999999999999976


No 4  
>TIGR01426 MGT glycosyltransferase, MGT family. This model describes the MGT (macroside glycosyltransferase) subfamily of the UDP-glucuronosyltransferase family. Members include a number of glucosyl transferases for macrolide antibiotic inactivation, but also include transferases of glucose-related sugars for macrolide antibiotic production.
Probab=95.25  E-value=0.05  Score=42.94  Aligned_cols=45  Identities=20%  Similarity=0.314  Sum_probs=39.1

Q ss_pred             chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHH
Q psy5669           2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSIEYVIA   48 (116)
Q Consensus         2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wiE~v~r   48 (116)
                      +.+.++|.++++|++|++++++++..+.+.+  +.+.++.++|.+++
T Consensus       347 ~~l~~ai~~~l~~~~~~~~~~~l~~~~~~~~--~~~~aa~~i~~~~~  391 (392)
T TIGR01426       347 EKLREAVLAVLSDPRYAERLRKMRAEIREAG--GARRAADEIEGFLA  391 (392)
T ss_pred             HHHHHHHHHHhcCHHHHHHHHHHHHHHHHcC--CHHHHHHHHHHhhc
Confidence            4688999999999999999999999998654  77889999987654


No 5  
>PLN02670 transferase, transferring glycosyl groups
Probab=95.18  E-value=0.055  Score=44.90  Aligned_cols=47  Identities=11%  Similarity=0.180  Sum_probs=38.0

Q ss_pred             chHHHHHHHHhcCH---HHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHh
Q psy5669           2 RSSRRKVQPACMSP---QYKTNVQRLSTIFRSEPLHPLQKAIRSIEYVIAH   49 (116)
Q Consensus         2 ~~l~~aI~~vl~n~---~Yk~na~~ls~~~~d~p~~P~~~av~wiE~v~r~   49 (116)
                      +++.+||++++.|+   +|++||++++.+++++| .|.+.+---+++..+.
T Consensus       417 e~i~~av~~vm~~~~g~~~r~~a~~l~~~~~~~~-~~~~~~~~~~~~l~~~  466 (472)
T PLN02670        417 DSVAESVRLAMVDDAGEEIRDKAKEMRNLFGDMD-RNNRYVDELVHYLREN  466 (472)
T ss_pred             HHHHHHHHHHhcCcchHHHHHHHHHHHHHHhCcc-hhHHHHHHHHHHHHHh
Confidence            47899999999886   79999999999999998 5566666666665554


No 6  
>COG1819 Glycosyl transferases, related to UDP-glucuronosyltransferase [Carbohydrate transport and metabolism / Signal transduction mechanisms]
Probab=94.22  E-value=0.16  Score=41.23  Aligned_cols=48  Identities=21%  Similarity=0.243  Sum_probs=41.7

Q ss_pred             chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHhCC
Q psy5669           2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSIEYVIAHRG   51 (116)
Q Consensus         2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wiE~v~r~~g   51 (116)
                      +.+.++|++||+|++|++++++++..++..+-  .+.+..++|-..+.++
T Consensus       356 ~~l~~av~~vL~~~~~~~~~~~~~~~~~~~~g--~~~~a~~le~~~~~~~  403 (406)
T COG1819         356 ERLRAAVNEVLADDSYRRAAERLAEEFKEEDG--PAKAADLLEEFAREKK  403 (406)
T ss_pred             HHHHHHHHHHhcCHHHHHHHHHHHHHhhhccc--HHHHHHHHHHHHhccc
Confidence            46889999999999999999999999998864  6778999998777654


No 7  
>PLN02554 UDP-glycosyltransferase family protein
Probab=90.81  E-value=0.92  Score=37.54  Aligned_cols=30  Identities=10%  Similarity=0.123  Sum_probs=26.8

Q ss_pred             chHHHHHHHHhc-CHHHHHHHHHHHHHHhcC
Q psy5669           2 RSSRRKVQPACM-SPQYKTNVQRLSTIFRSE   31 (116)
Q Consensus         2 ~~l~~aI~~vl~-n~~Yk~na~~ls~~~~d~   31 (116)
                      +++.+||+++++ |++|++||+++++++++.
T Consensus       428 e~l~~av~~vm~~~~~~r~~a~~l~~~~~~a  458 (481)
T PLN02554        428 EEIERGIRCLMEQDSDVRKRVKEMSEKCHVA  458 (481)
T ss_pred             HHHHHHHHHHhcCCHHHHHHHHHHHHHHHHH
Confidence            468899999996 899999999999999854


No 8  
>PF15050 SCIMP:  SCIMP protein
Probab=88.71  E-value=1.2  Score=30.88  Aligned_cols=47  Identities=15%  Similarity=0.311  Sum_probs=20.7

Q ss_pred             CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccc-ccCCCcc
Q psy5669          62 LPWYQTSQLDVIASCTLGLGLVLYTLARIGTSFAEFTRGKDDK-KATKPDR  111 (116)
Q Consensus        62 l~~~q~~~lDV~~~l~~v~~~~~~~~~~~~~~~~~~~~~~~~~-k~~~~~~  111 (116)
                      |+||.-+.+=++++-++++-+   ++..++.++||+..+..+| +..||-+
T Consensus         1 M~WWr~nFWiiLAVaII~vS~---~lglIlyCvcR~~lRqGkkweiakp~k   48 (133)
T PF15050_consen    1 MSWWRDNFWIILAVAIILVSV---VLGLILYCVCRWQLRQGKKWEIAKPLK   48 (133)
T ss_pred             CchHHhchHHHHHHHHHHHHH---HHHHHHHHHHHHHHHccccceeccchh
Confidence            567665544444333222111   2233445556655443333 4445543


No 9  
>PLN02562 UDP-glycosyltransferase
Probab=87.10  E-value=2.1  Score=35.20  Aligned_cols=30  Identities=7%  Similarity=0.059  Sum_probs=26.0

Q ss_pred             chHHHHHHHHhcCHHHHHHHHHHHHHHhcC
Q psy5669           2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSE   31 (116)
Q Consensus         2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~   31 (116)
                      +.+.++|+++++|++|++||++++...+..
T Consensus       401 ~~l~~~v~~~l~~~~~r~~a~~l~~~~~~~  430 (448)
T PLN02562        401 KEVEEGLRKVMEDSGMGERLMKLRERAMGE  430 (448)
T ss_pred             HHHHHHHHHHhCCHHHHHHHHHHHHHHHhc
Confidence            568899999999999999999998866543


No 10 
>PF07214 DUF1418:  Protein of unknown function (DUF1418);  InterPro: IPR010815 This family consists of several hypothetical Enterobacterial proteins of around 100 residues in length. Members of this family are often described as YbjC. In Escherichia coli the ybjC gene is located downstream of nfsA (which encodes the major oxygen-insensitive nitroreductase). It is thought that nfsA and ybjC form an operon an its promoter is a class I SoxS-dependent promoter []. The function of this family is unknown.
Probab=83.95  E-value=3.4  Score=27.40  Aligned_cols=34  Identities=24%  Similarity=0.398  Sum_probs=21.0

Q ss_pred             HHHHHHHHHHHHhhhhccccccccCCCcccccCC
Q psy5669          83 VLYTLARIGTSFAEFTRGKDDKKATKPDRNKKNK  116 (116)
Q Consensus        83 ~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~  116 (116)
                      .+.+++++.+.+.=..+..+.+++.+++|+|||.
T Consensus        60 av~ivWR~a~~lap~l~~~~~~~~~~s~~~k~~d   93 (96)
T PF07214_consen   60 AVNIVWRVAKGLAPLLIDRPPKSSSRSDREKKND   93 (96)
T ss_pred             HHHHHHHHHHHhchHhhcCCCCccccchhhhccc
Confidence            4455667766555545455555555688888873


No 11 
>PLN02207 UDP-glycosyltransferase
Probab=83.34  E-value=1.4  Score=36.60  Aligned_cols=30  Identities=7%  Similarity=0.107  Sum_probs=26.3

Q ss_pred             chHHHHHHHHhc--CHHHHHHHHHHHHHHhcC
Q psy5669           2 RSSRRKVQPACM--SPQYKTNVQRLSTIFRSE   31 (116)
Q Consensus         2 ~~l~~aI~~vl~--n~~Yk~na~~ls~~~~d~   31 (116)
                      ++|.++|+++++  +++||+||++++.+.+.-
T Consensus       414 e~i~~av~~vm~~~~~~~r~~a~~l~~~a~~A  445 (468)
T PLN02207        414 NEIETAIRCVMNKDNNVVRKRVMDISQMIQRA  445 (468)
T ss_pred             HHHHHHHHHHHhcchHHHHHHHHHHHHHHHHH
Confidence            578999999997  789999999999988843


No 12 
>PLN03004 UDP-glycosyltransferase
Probab=81.32  E-value=2.2  Score=35.29  Aligned_cols=28  Identities=7%  Similarity=0.036  Sum_probs=24.6

Q ss_pred             chHHHHHHHHhcCHHHHHHHHHHHHHHh
Q psy5669           2 RSSRRKVQPACMSPQYKTNVQRLSTIFR   29 (116)
Q Consensus         2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~   29 (116)
                      +.+.++|+++++|++|++++++++..-+
T Consensus       412 e~l~~av~~vm~~~~~r~~a~~~~~~a~  439 (451)
T PLN03004        412 TEVEKRVQEIIGECPVRERTMAMKNAAE  439 (451)
T ss_pred             HHHHHHHHHHhcCHHHHHHHHHHHHHHH
Confidence            5788999999999999999999887544


No 13 
>PLN03007 UDP-glucosyltransferase family protein
Probab=77.42  E-value=3  Score=34.47  Aligned_cols=30  Identities=7%  Similarity=0.246  Sum_probs=26.1

Q ss_pred             chHHHHHHHHhcCH---HHHHHHHHHHHHHhcC
Q psy5669           2 RSSRRKVQPACMSP---QYKTNVQRLSTIFRSE   31 (116)
Q Consensus         2 ~~l~~aI~~vl~n~---~Yk~na~~ls~~~~d~   31 (116)
                      +.|.++|++++.|+   +|+++|++++.+.++.
T Consensus       428 ~~l~~av~~~m~~~~~~~~r~~a~~~~~~a~~a  460 (482)
T PLN03007        428 EKVEKAVREVIVGEEAEERRLRAKKLAEMAKAA  460 (482)
T ss_pred             HHHHHHHHHHhcCcHHHHHHHHHHHHHHHHHHH
Confidence            46889999999988   9999999999977754


No 14 
>cd03784 GT1_Gtf_like This family includes the Gtfs, a group of homologous glycosyltransferases involved in the final stages of the biosynthesis of antibiotics vancomycin and related chloroeremomycin. Gtfs transfer sugar moieties from an activated NDP-sugar donor to the oxidatively cross-linked heptapeptide core of vancomycin group antibiotics. The core structure is important for the bioactivity of the antibiotics.
Probab=76.23  E-value=5.1  Score=31.40  Aligned_cols=41  Identities=15%  Similarity=0.081  Sum_probs=27.8

Q ss_pred             chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHH
Q psy5669           2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSIEY   45 (116)
Q Consensus         2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wiE~   45 (116)
                      +.+.++|+++++++.|++ +.+..+.+.  ...+.+.++..||-
T Consensus       360 ~~l~~al~~~l~~~~~~~-~~~~~~~~~--~~~g~~~~~~~ie~  400 (401)
T cd03784         360 ERLAAALRRLLDPPSRRR-AAALLRRIR--EEDGVPSAADVIER  400 (401)
T ss_pred             HHHHHHHHHHhCHHHHHH-HHHHHHHHH--hccCHHHHHHHHhh
Confidence            468899999998765555 555544443  34677888887763


No 15 
>PF07069 PRRSV_2b:  Porcine reproductive and respiratory syndrome virus 2b ;  InterPro: IPR009775 This family consists of several Porcine reproductive and respiratory syndrome virus (PRRSV) ORF2b proteins. The function of this family is unknown however it is known that large amounts of 2b protein are present in the virion and it is thought that this protein may be an integral component of the virion [].
Probab=73.32  E-value=11  Score=23.13  Aligned_cols=34  Identities=9%  Similarity=0.067  Sum_probs=19.3

Q ss_pred             HHHHHHHHHHHHHHHH--HHHHHHHHHHhhhhcccc
Q psy5669          69 QLDVIASCTLGLGLVL--YTLARIGTSFAEFTRGKD  102 (116)
Q Consensus        69 ~lDV~~~l~~v~~~~~--~~~~~~~~~~~~~~~~~~  102 (116)
                      -+|++.++.+.+.+.+  +++..|++.+|.-+.+.+
T Consensus        26 ivdiiiflailfgftiagwlvvfcirlv~sailr~r   61 (73)
T PF07069_consen   26 IVDIIIFLAILFGFTIAGWLVVFCIRLVCSAILRAR   61 (73)
T ss_pred             HHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHhcc
Confidence            4677766655554333  344556677776665544


No 16 
>PF05478 Prominin:  Prominin;  InterPro: IPR008795 The prominins are an emerging family of proteins that, among the multispan membrane proteins, display a novel topology. Mouse and Homo sapiens prominin and (Mus musculus) prominin-like 1 (PROML1) are predicted to contain five membrane spanning domains, with an N-terminal domain exposed to the extracellular space followed by four, alternating small cytoplasmic and large extracellular, loops and a cytoplasmic C-terminal domain []. The exact function of prominin is unknown although in humans defects in PROM1, the gene coding for prominin, cause retinal degeneration [].; GO: 0016021 integral to membrane
Probab=71.22  E-value=12  Score=33.13  Aligned_cols=36  Identities=8%  Similarity=-0.075  Sum_probs=13.8

Q ss_pred             CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh
Q psy5669          63 PWYQTSQLDVIASCTLGLGLVLYTLARIGTSFAEFT   98 (116)
Q Consensus        63 ~~~q~~~lDV~~~l~~v~~~~~~~~~~~~~~~~~~~   98 (116)
                      .++.|+-.=++++++.++++++.-+.-+|.++||||
T Consensus        85 ~~l~~~~g~~v~~~i~ll~~il~P~vg~~fCcCRCc  120 (806)
T PF05478_consen   85 ELLAYEWGFLVCAVIGLLFIILMPLVGLCFCCCRCC  120 (806)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccC
Confidence            344444443333333333322332333444444443


No 17 
>PF02038 ATP1G1_PLM_MAT8:  ATP1G1/PLM/MAT8 family;  InterPro: IPR000272  The FXYD protein family contains at least seven members in mammals []. Two other family members that are not obvious orthologs of any identified mammalian FXYD protein exist in zebrafish. All these proteins share a signature sequence of six conserved amino acids comprising the FXYD motif in the NH2-terminus, and two glycines and one serine residue in the transmembrane domain. FXYD proteins are widely distributed in mammalian tissues with prominent expression in tissues that perform fluid and solute transport or that are electrically excitable.   Initial functional characterisation suggested that FXYD proteins act as channels or as modulators of ion channels however studies have revealed that most FXYD proteins have another specific function and act as tissue-specific regulatory subunits of the Na,K-ATPase. Each of these auxiliary subunits produces a distinct functional effect on the transport characteristics of the Na,K-ATPase that is adjusted to the specific functional demands of the tissue in which the FXYD protein is expressed. FXYD proteins appear to preferentially associate with Na,K-ATPase alpha1-beta isozymes, and affect their function in a way that render them operationally complementary or supplementary to coexisting isozymes.; GO: 0005216 ion channel activity, 0006811 ion transport, 0016020 membrane; PDB: 2JO1_A 2JP3_A 2ZXE_G 3A3Y_G 3N23_E 3B8E_H 3KDP_G 3N2F_E.
Probab=70.80  E-value=7  Score=22.85  Aligned_cols=28  Identities=18%  Similarity=0.237  Sum_probs=18.4

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy5669          64 WYQTSQLDVIASCTLGLGLVLYTLARIG   91 (116)
Q Consensus        64 ~~q~~~lDV~~~l~~v~~~~~~~~~~~~   91 (116)
                      +|.|..|-+-.++++.+++++.++..+.
T Consensus         8 ~YDy~tLrigGLi~A~vlfi~Gi~iils   35 (50)
T PF02038_consen    8 YYDYETLRIGGLIFAGVLFILGILIILS   35 (50)
T ss_dssp             GGCHHHHHHHHHHHHHHHHHHHHHHHCT
T ss_pred             ccchhHhhccchHHHHHHHHHHHHHHHc
Confidence            3778888887777766666666544443


No 18 
>PF13524 Glyco_trans_1_2:  Glycosyl transferases group 1
Probab=70.10  E-value=14  Score=22.73  Aligned_cols=42  Identities=7%  Similarity=-0.008  Sum_probs=27.7

Q ss_pred             chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHH
Q psy5669           2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSI   43 (116)
Q Consensus         2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wi   43 (116)
                      +.+.++|..+++||..++.+.+-+...-.+-.+...++-.++
T Consensus        50 ~el~~~i~~ll~~~~~~~~ia~~a~~~v~~~~t~~~~~~~il   91 (92)
T PF13524_consen   50 EELAEKIEYLLENPEERRRIAKNARERVLKRHTWEHRAEQIL   91 (92)
T ss_pred             HHHHHHHHHHHCCHHHHHHHHHHHHHHHHHhCCHHHHHHHHH
Confidence            567889999999997666655555444444556666655554


No 19 
>PLN02208 glycosyltransferase family protein
Probab=70.03  E-value=16  Score=30.15  Aligned_cols=43  Identities=16%  Similarity=0.108  Sum_probs=30.3

Q ss_pred             chHHHHHHHHhcCHH-----HHHHHHHHHHHHhcCCCCHHHHHHHHHHH
Q psy5669           2 RSSRRKVQPACMSPQ-----YKTNVQRLSTIFRSEPLHPLQKAIRSIEY   45 (116)
Q Consensus         2 ~~l~~aI~~vl~n~~-----Yk~na~~ls~~~~d~p~~P~~~av~wiE~   45 (116)
                      +++.++|+++++|++     ++++++++++...+ +-+.-...-.-||.
T Consensus       389 ~~l~~ai~~~m~~~~e~g~~~r~~~~~~~~~~~~-~gsS~~~l~~~v~~  436 (442)
T PLN02208        389 ESLSNAIKSVMDKDSDLGKLVRSNHTKLKEILVS-PGLLTGYVDKFVEE  436 (442)
T ss_pred             HHHHHHHHHHhcCCchhHHHHHHHHHHHHHHHhc-CCcHHHHHHHHHHH
Confidence            578899999998765     99999999988754 33444433333433


No 20 
>cd07912 Tweety_N N-terminal domain of the protein encoded by the Drosophila tweety gene and related proteins, a family of chloride ion channels. The protein product of the Drosophila tweety (tty) gene is thought to form a trans-membrane protein with five membrane-spanning regions and a cytoplasmic C-terminus. This N-terminal domain contains the putative transmembrane spanning regions. Tweety has been suggested as a candidate for a large conductance chloride channel, both in vertebrate and insect cells. Three human homologs have been identified and designated TTYH1-3. TTYH2 has been associated with the progression of cancer, and Drosophila melanogaster tweety has been assumed to play a role in development. TTYH2, and TTYH3 bind to and are ubiquinated by Nedd4-2, a HECT type E3 ubiquitin ligase, which most likely plays a role in controlling the cellular levels of tweety family proteins.
Probab=69.97  E-value=22  Score=29.33  Aligned_cols=13  Identities=8%  Similarity=-0.076  Sum_probs=8.7

Q ss_pred             HHHHHHHHHHhCC
Q psy5669          39 AIRSIEYVIAHRG   51 (116)
Q Consensus        39 av~wiE~v~r~~g   51 (116)
                      +=||+.+.=+.|-
T Consensus         4 ~~~~~~~~h~lPh   16 (418)
T cd07912           4 APWLVKLLHSLPH   16 (418)
T ss_pred             chHHHHHHHhCCC
Confidence            3478887766664


No 21 
>PF07319 DnaI_N:  Primosomal protein DnaI N-terminus;  InterPro: IPR009928 This entry represents the N terminus (approximately 120 residues) of bacterial primosomal DnaI proteins, although one family member appears to be of viral origin. DnaI is one of the components of the Bacillus subtilis replication restart primosome, and is required for the DnaB75-dependent loading of the DnaC helicase [].; PDB: 2K7R_A.
Probab=67.96  E-value=2  Score=27.89  Aligned_cols=32  Identities=16%  Similarity=0.246  Sum_probs=16.7

Q ss_pred             CchHHHHHHHHhcCHHHHHHHHHHHHHHhcCC
Q psy5669           1 MRSSRRKVQPACMSPQYKTNVQRLSTIFRSEP   32 (116)
Q Consensus         1 ~~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p   32 (116)
                      |++|.++|.++..+++++++.+++...+.+.|
T Consensus         1 Me~I~~~l~~~~~~~~~~~~~~~l~~~vl~dp   32 (94)
T PF07319_consen    1 MESIGDSLKQLMKRRNFEERYEQLKQEVLSDP   32 (94)
T ss_dssp             ---TT-S----S---HHHHHHHHHHHHHTT-H
T ss_pred             CccHHHHHHHHHhcccHHHHHHHHHHHHHcCH
Confidence            89999999999999999999888877776654


No 22 
>PRK00726 murG undecaprenyldiphospho-muramoylpentapeptide beta-N- acetylglucosaminyltransferase; Provisional
Probab=64.84  E-value=19  Score=27.65  Aligned_cols=43  Identities=12%  Similarity=0.007  Sum_probs=32.2

Q ss_pred             chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHH
Q psy5669           2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSIEY   45 (116)
Q Consensus         2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wiE~   45 (116)
                      +.+.++|.++++|+.+++.+.+-+..+. ++.++...+--|.+.
T Consensus       312 ~~l~~~i~~ll~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~  354 (357)
T PRK00726        312 EKLAEKLLELLSDPERLEAMAEAARALG-KPDAAERLADLIEEL  354 (357)
T ss_pred             HHHHHHHHHHHcCHHHHHHHHHHHHhcC-CcCHHHHHHHHHHHH
Confidence            4788999999999999988888776663 565666666555443


No 23 
>PF11628 TCR_zetazeta:  T-cell surface glycoprotein CD3 zeta chain;  InterPro: IPR021663 The TCR complex of T-lymphocytes consists of either a TCR alpha/beta or TCR gamma/delta heterodimer co-expressed at the cell surface with the invariant subunits of CD3 labelled gamma, delta, epsilon, zeta, and eta []. The zeta subunit forms either homodimers or heterodimers with eta [], but eta homodimers have not been observed. The structure of the zetazeta transmembrane dimer consists of a left-handed coiled coil with polar contacts. Two aspartic acids are critical for zetazeta dimerisation and assembly with TCR [].  The high affinity immunoglobulin epsilon receptor (IgE Fc receptor) subunit gamma associates with a variety of FcR alpha chains to form a functional signaling complex. The gamma subunit has a critical role in allowing the IgE Fc receptor to reach the cell surface and regulates several aspects of the immune response []. This family includes both CD3 zeta subunits and IgE Fc receptor gamma subunits. The gamma chain of the high affinity Fc receptor for IgE has significant structural homology to CD3 zeta and the related CD3 eta subunit and can facilitate T cell receptor expression and signaling in the absence of CD3 zeta and CD3 eta [].; PDB: 2HAC_B.
Probab=63.77  E-value=15  Score=19.58  Aligned_cols=19  Identities=26%  Similarity=-0.031  Sum_probs=12.9

Q ss_pred             HHHHHHHHHHHHHHHHHHH
Q psy5669          67 TSQLDVIASCTLGLGLVLY   85 (116)
Q Consensus        67 ~~~lDV~~~l~~v~~~~~~   85 (116)
                      .|.||.++++..++++.++
T Consensus         5 CYiLDgiL~iYgiiiT~L~   23 (33)
T PF11628_consen    5 CYILDGILFIYGIIITALY   23 (33)
T ss_dssp             HHHHHHHHHHHHHHHHHHH
T ss_pred             eeeHHHHHHHHHHHHHHHH
Confidence            4778998777666555544


No 24 
>PLN02863 UDP-glucoronosyl/UDP-glucosyl transferase family protein
Probab=63.64  E-value=22  Score=29.53  Aligned_cols=48  Identities=6%  Similarity=-0.030  Sum_probs=31.6

Q ss_pred             chHHHHHHHHh-cCHHHHHHHHHHHHHHhcCCC---CHHHHHHHHHHHHHHh
Q psy5669           2 RSSRRKVQPAC-MSPQYKTNVQRLSTIFRSEPL---HPLQKAIRSIEYVIAH   49 (116)
Q Consensus         2 ~~l~~aI~~vl-~n~~Yk~na~~ls~~~~d~p~---~P~~~av~wiE~v~r~   49 (116)
                      +.+.+++++++ ++++|++||++++...++.-.   +.......-|+.+.+.
T Consensus       421 ~~v~~~v~~~m~~~~~~r~~a~~l~e~a~~Av~~gGSS~~~l~~~v~~i~~~  472 (477)
T PLN02863        421 DELARVFMESVSENQVERERAKELRRAALDAIKERGSSVKDLDGFVKHVVEL  472 (477)
T ss_pred             HHHHHHHHHHhhccHHHHHHHHHHHHHHHHHhccCCcHHHHHHHHHHHHHHh
Confidence            46778899887 789999999999987553311   2333344444554444


No 25 
>PLN00414 glycosyltransferase family protein
Probab=63.08  E-value=27  Score=28.77  Aligned_cols=47  Identities=21%  Similarity=0.116  Sum_probs=31.7

Q ss_pred             chHHHHHHHHhcCHH-----HHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHh
Q psy5669           2 RSSRRKVQPACMSPQ-----YKTNVQRLSTIFRSEPLHPLQKAIRSIEYVIAH   49 (116)
Q Consensus         2 ~~l~~aI~~vl~n~~-----Yk~na~~ls~~~~d~p~~P~~~av~wiE~v~r~   49 (116)
                      +.+.+++++++.|++     |++++++++..+.+.--+..+ ....|+.+..+
T Consensus       390 ~~i~~~v~~~m~~~~e~g~~~r~~a~~~~~~~~~~gg~ss~-l~~~v~~~~~~  441 (446)
T PLN00414        390 ESLRDTVKSVMDKDSEIGNLVKRNHKKLKETLVSPGLLSGY-ADKFVEALENE  441 (446)
T ss_pred             HHHHHHHHHHhcCChhhHHHHHHHHHHHHHHHHcCCCcHHH-HHHHHHHHHHh
Confidence            478899999998754     999999999987644332222 34444444433


No 26 
>PLN02167 UDP-glycosyltransferase family protein
Probab=62.38  E-value=10  Score=31.40  Aligned_cols=29  Identities=10%  Similarity=0.071  Sum_probs=23.8

Q ss_pred             chHHHHHHHHhcCH-HHHHHHHHHHHHHhc
Q psy5669           2 RSSRRKVQPACMSP-QYKTNVQRLSTIFRS   30 (116)
Q Consensus         2 ~~l~~aI~~vl~n~-~Yk~na~~ls~~~~d   30 (116)
                      +.+.++|++++.++ .|+++|++++...+.
T Consensus       422 ~~l~~av~~~m~~~~~~r~~a~~~~~~~~~  451 (475)
T PLN02167        422 DEIAGAVRSLMDGEDVPRKKVKEIAEAARK  451 (475)
T ss_pred             HHHHHHHHHHhcCCHHHHHHHHHHHHHHHH
Confidence            46889999999755 899999999886664


No 27 
>PLN02448 UDP-glycosyltransferase family protein
Probab=62.25  E-value=10  Score=31.15  Aligned_cols=30  Identities=13%  Similarity=0.118  Sum_probs=25.4

Q ss_pred             chHHHHHHHHhcCH-----HHHHHHHHHHHHHhcC
Q psy5669           2 RSSRRKVQPACMSP-----QYKTNVQRLSTIFRSE   31 (116)
Q Consensus         2 ~~l~~aI~~vl~n~-----~Yk~na~~ls~~~~d~   31 (116)
                      +.|.++++++++|+     +|+++|++++...+..
T Consensus       403 ~~l~~av~~vl~~~~~~~~~~r~~a~~~~~~~~~a  437 (459)
T PLN02448        403 EEIAELVKRFMDLESEEGKEMRRRAKELQEICRGA  437 (459)
T ss_pred             HHHHHHHHHHhcCCchhHHHHHHHHHHHHHHHHHH
Confidence            46899999999875     7999999999887754


No 28 
>PF03401 TctC:  Tripartite tricarboxylate transporter family receptor;  InterPro: IPR005064  Bordetella pertussis, the causative agent of human whooping cough (pertussis), is an obligate human pathogen with diverse high-affinity transport systems for the assimilation of iron, a biometal that is essential for growth []. Periplasmic binding proteins of a new family, particularly well represented in this organism (and more generally in beta-proteobacteria), have been called Bug receptors []. They adopt a characteristic Venus flytrap fold with two globular domains bisected by a ligand-binding cleft. The family is specific for carboxylated solutes, with a characteristic mode of binding involving two highly conserved beta strand-beta turn-alpha helix motifs originating from each domain. These two motifs form hydrogen bonds with a carboxylate group of the ligand, both directly and via conserved water molecules, and have thus been termed the carboxylate pincers. Domain 1 recognises the ligand and the carboxylate group serves as an initial anchoring point. Domain 2 discriminates between productively and non-productively bound ligands as proper interactions with this domain is needed for the of the closed conformation []. BugE has a glutamate bound ligand. No charged residues are involved in glutamate binding by BugE, unlike what has been described for all glutamate receptors reported so far. The Bug architecture is highly conserved despite limited sequence identity [].; GO: 0030288 outer membrane-bounded periplasmic space; PDB: 2QPQ_C 2DVZ_A 2F5X_A.
Probab=60.48  E-value=11  Score=28.84  Aligned_cols=46  Identities=7%  Similarity=0.021  Sum_probs=33.5

Q ss_pred             chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHh
Q psy5669           2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSIEYVIAH   49 (116)
Q Consensus         2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wiE~v~r~   49 (116)
                      +.+.+|++++++||.|++.+++..-  ...+.+|.+...++-|...++
T Consensus       220 ~~l~~a~~~~~~~pe~~~~~~~~g~--~~~~~~~~~~~~~l~~~~~~~  265 (274)
T PF03401_consen  220 DKLADAIKKALEDPEFQEFLEKMGL--EPVYMDGEEFDAFLAEEYARY  265 (274)
T ss_dssp             HHHHHHHHHHHT-HHHHHHHHHHTE--EEECESHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHhCCHHHHHHHHHCCC--cCCCCCHHHHHHHHHHHHHHH
Confidence            5789999999999999999988653  334778877666665554443


No 29 
>PF11057 Cortexin:  Cortexin of kidney;  InterPro: IPR020066 Cortexin is a neuron-specific, 82-residue membrane protein which is found especially in vertebrate brain cortex tissue. It may mediate extracellular or intracellular signalling of cortical neurons during forebrain development. Cortexin is present at significant levels in the foetal brain, suggesting that it may be important to neurons of both the developing and adult cerebral cortex. Cortexin has a conserved single membrane-spanning region in the middle of each sequence []. In humans, there is selective expression of Cortexin 3 (CTXN3) in the kidney as well as the brain []. This entry contains Cortexins 1, 2 and 3.; GO: 0031224 intrinsic to membrane
Probab=59.63  E-value=16  Score=23.31  Aligned_cols=23  Identities=13%  Similarity=-0.122  Sum_probs=13.7

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHh
Q psy5669          73 IASCTLGLGLVLYTLARIGTSFA   95 (116)
Q Consensus        73 ~~~l~~v~~~~~~~~~~~~~~~~   95 (116)
                      +++++++++++..++.+|.+++.
T Consensus        31 faFV~~L~~fL~~liVRCfrIll   53 (81)
T PF11057_consen   31 FAFVGLLCLFLGLLIVRCFRILL   53 (81)
T ss_pred             ehHHHHHHHHHHHHHHHHHHHHc
Confidence            44455555555566677777664


No 30 
>PLN02152 indole-3-acetate beta-glucosyltransferase
Probab=59.45  E-value=12  Score=31.02  Aligned_cols=30  Identities=7%  Similarity=0.117  Sum_probs=24.5

Q ss_pred             chHHHHHHHHhcCHH--HHHHHHHHHHHHhcC
Q psy5669           2 RSSRRKVQPACMSPQ--YKTNVQRLSTIFRSE   31 (116)
Q Consensus         2 ~~l~~aI~~vl~n~~--Yk~na~~ls~~~~d~   31 (116)
                      +++.++|+++++|++  |++++++++...+..
T Consensus       405 e~l~~av~~vm~~~~~~~r~~a~~~~~~~~~a  436 (455)
T PLN02152        405 GEIRRCLEAVMEEKSVELRESAEKWKRLAIEA  436 (455)
T ss_pred             HHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHH
Confidence            578999999998776  799998888876643


No 31 
>PF05478 Prominin:  Prominin;  InterPro: IPR008795 The prominins are an emerging family of proteins that, among the multispan membrane proteins, display a novel topology. Mouse and Homo sapiens prominin and (Mus musculus) prominin-like 1 (PROML1) are predicted to contain five membrane spanning domains, with an N-terminal domain exposed to the extracellular space followed by four, alternating small cytoplasmic and large extracellular, loops and a cytoplasmic C-terminal domain []. The exact function of prominin is unknown although in humans defects in PROM1, the gene coding for prominin, cause retinal degeneration [].; GO: 0016021 integral to membrane
Probab=59.02  E-value=13  Score=32.86  Aligned_cols=29  Identities=10%  Similarity=-0.172  Sum_probs=12.4

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHhhhhcccc
Q psy5669          73 IASCTLGLGLVLYTLARIGTSFAEFTRGKD  102 (116)
Q Consensus        73 ~~~l~~v~~~~~~~~~~~~~~~~~~~~~~~  102 (116)
                      +++++++++-++.+++.||| +|+.|.|+-
T Consensus        99 i~ll~~il~P~vg~~fCcCR-Cc~~CGg~~  127 (806)
T PF05478_consen   99 IGLLFIILMPLVGLCFCCCR-CCGNCGGRM  127 (806)
T ss_pred             HHHHHHHHHHHHHHHHhccc-cCCCcCCcc
Confidence            33444444444444444443 355554433


No 32 
>cd03820 GT1_amsD_like This family is most closely related to the GT1 family of glycosyltransferases. AmSD in Erwinia amylovora has been shown to be involved in the biosynthesis of amylovoran, the acidic exopolysaccharide acting as a virulence factor. This enzyme may be responsible for the formation of  galactose alpha-1,6 linkages in amylovoran.
Probab=57.53  E-value=22  Score=25.68  Aligned_cols=41  Identities=2%  Similarity=-0.152  Sum_probs=29.0

Q ss_pred             chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHH
Q psy5669           2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSI   43 (116)
Q Consensus         2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wi   43 (116)
                      +++.++|.++++||..++++.+-+..+. +.++....+-.|.
T Consensus       307 ~~~~~~i~~ll~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~  347 (348)
T cd03820         307 EALAEALLRLMEDEELRKRMGANARESA-ERFSIENIIKQWE  347 (348)
T ss_pred             HHHHHHHHHHHcCHHHHHHHHHHHHHHH-HHhCHHHHHHHhc
Confidence            5788999999999988877776554443 3466666665553


No 33 
>PLN02210 UDP-glucosyl transferase
Probab=56.77  E-value=15  Score=30.29  Aligned_cols=28  Identities=14%  Similarity=0.289  Sum_probs=23.8

Q ss_pred             chHHHHHHHHhcCHH---HHHHHHHHHHHHh
Q psy5669           2 RSSRRKVQPACMSPQ---YKTNVQRLSTIFR   29 (116)
Q Consensus         2 ~~l~~aI~~vl~n~~---Yk~na~~ls~~~~   29 (116)
                      ++|.+||++++.|++   +|+||++++...+
T Consensus       403 ~~l~~av~~~m~~~~g~~~r~~a~~l~~~a~  433 (456)
T PLN02210        403 EEVERCIEAVTEGPAAADIRRRAAELKHVAR  433 (456)
T ss_pred             HHHHHHHHHHhcCchHHHHHHHHHHHHHHHH
Confidence            478899999998875   9999999988655


No 34 
>PF09929 DUF2161:  Uncharacterized conserved protein (DUF2161);  InterPro: IPR018679 This family of various hypothetical prokaryotic proteins has no known function.
Probab=56.74  E-value=8.4  Score=26.45  Aligned_cols=26  Identities=15%  Similarity=0.303  Sum_probs=22.7

Q ss_pred             HHHHHHHHHHHHHh-cCCCCHHHHHHH
Q psy5669          16 QYKTNVQRLSTIFR-SEPLHPLQKAIR   41 (116)
Q Consensus        16 ~Yk~na~~ls~~~~-d~p~~P~~~av~   41 (116)
                      -|++.|.+++..+. +-|.+|.+.+..
T Consensus        56 AYRQ~Al~~A~~L~~~Gp~~~~~l~~~   82 (118)
T PF09929_consen   56 AYRQDALRCAAALAEHGPSRPADLRKA   82 (118)
T ss_pred             hhHHHHHHHHHHHHHcCCCCHHHHHHh
Confidence            49999999999999 889999987754


No 35 
>PRK00025 lpxB lipid-A-disaccharide synthase; Reviewed
Probab=56.71  E-value=42  Score=25.91  Aligned_cols=46  Identities=9%  Similarity=0.052  Sum_probs=29.3

Q ss_pred             chHHHHHHHHhcCHHHHHHHHHHHHHHhcC-CCCHHHHHHHHHHHHH
Q psy5669           2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSE-PLHPLQKAIRSIEYVI   47 (116)
Q Consensus         2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~-p~~P~~~av~wiE~v~   47 (116)
                      +.+.+++.++++|++.++.+.+-....+.. .-.+.+..+..++..+
T Consensus       329 ~~l~~~i~~ll~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~i~~~~  375 (380)
T PRK00025        329 EKLARALLPLLADGARRQALLEGFTELHQQLRCGADERAAQAVLELL  375 (380)
T ss_pred             HHHHHHHHHHhcCHHHHHHHHHHHHHHHHHhCCCHHHHHHHHHHHHh
Confidence            367889999999998776555543222222 2246777777766554


No 36 
>PLN02410 UDP-glucoronosyl/UDP-glucosyl transferase family protein
Probab=56.69  E-value=15  Score=30.38  Aligned_cols=29  Identities=14%  Similarity=0.226  Sum_probs=24.1

Q ss_pred             chHHHHHHHHhcCH---HHHHHHHHHHHHHhc
Q psy5669           2 RSSRRKVQPACMSP---QYKTNVQRLSTIFRS   30 (116)
Q Consensus         2 ~~l~~aI~~vl~n~---~Yk~na~~ls~~~~d   30 (116)
                      +.+.++|++++.|+   +|++++++++..++.
T Consensus       398 ~~v~~av~~lm~~~~~~~~r~~a~~l~~~~~~  429 (451)
T PLN02410        398 GAVERAVKRLMVEEEGEEMRKRAISLKEQLRA  429 (451)
T ss_pred             HHHHHHHHHHHcCCcHHHHHHHHHHHHHHHHH
Confidence            57889999999876   699999988887764


No 37 
>PF11346 DUF3149:  Protein of unknown function (DUF3149);  InterPro: IPR021494  This bacterial family of proteins has no known function. 
Probab=56.00  E-value=35  Score=19.14  Aligned_cols=35  Identities=11%  Similarity=-0.051  Sum_probs=21.0

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh
Q psy5669          64 WYQTSQLDVIASCTLGLGLVLYTLARIGTSFAEFT   98 (116)
Q Consensus        64 ~~q~~~lDV~~~l~~v~~~~~~~~~~~~~~~~~~~   98 (116)
                      |.+.+.-||-+.-++++++++.+...+.+.+.+..
T Consensus         3 w~~LF~s~vGL~Sl~vI~~~igm~~~~~~~F~~k~   37 (42)
T PF11346_consen    3 WKDLFGSDVGLMSLIVIVFTIGMGVFFIRYFIRKM   37 (42)
T ss_pred             HHHHhcChHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            45556667766666666656665556666555543


No 38 
>PF05186 Dpy-30:  Dpy-30 motif;  InterPro: IPR007858 This motif is about 40 residues long and is probably formed of two alpha-helices. It is found in the Dpy-30 proteins, hence the motifs name. Dpy-30 from Caenorhabditis elegans is an essential component of dosage compensation machinery and loss of dpy-30 activity results in XX-specific lethality; in XO animals, Dpy-30 is required for developmental processes other than dosage compensation []. In yeast, the homologue of DPY-30, Saf19p, functions as part of the Set1 complex that is necessary for the methylation of histone H3 at lysine residue 4; Set1 is a key part of epigenetic developmental control []. There is also a human homologue of Dpy-30 []. This Dpy-30 region may be a dimerisation motif analogous that found in the cAMP-dependent protein kinase regulator, type II PKA, R subunit IPR003117 from INTERPRO.; PDB: 3G36_D.
Probab=55.76  E-value=34  Score=18.96  Aligned_cols=22  Identities=14%  Similarity=0.305  Sum_probs=14.4

Q ss_pred             HHHHHHHHhcCCCCHHHHHHHH
Q psy5669          21 VQRLSTIFRSEPLHPLQKAIRS   42 (116)
Q Consensus        21 a~~ls~~~~d~p~~P~~~av~w   42 (116)
                      ++-+..+.+.||.+|.+-...|
T Consensus        15 ~~gL~~l~~~rP~DPi~~La~~   36 (42)
T PF05186_consen   15 TEGLAELAKERPEDPIEFLAEY   36 (42)
T ss_dssp             HHHHHHHHHH--SSHHHHHHHH
T ss_pred             HHHHHHHHHHCCCChHHHHHHH
Confidence            4556777889999999855544


No 39 
>PLN02764 glycosyltransferase family protein
Probab=53.94  E-value=45  Score=27.69  Aligned_cols=49  Identities=14%  Similarity=-0.002  Sum_probs=35.9

Q ss_pred             chHHHHHHHHhcCHH-----HHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHhCC
Q psy5669           2 RSSRRKVQPACMSPQ-----YKTNVQRLSTIFRSEPLHPLQKAIRSIEYVIAHRG   51 (116)
Q Consensus         2 ~~l~~aI~~vl~n~~-----Yk~na~~ls~~~~d~p~~P~~~av~wiE~v~r~~g   51 (116)
                      +.+.+|++++++|++     +++++++++..+++.- +.-.....-++.+.+..+
T Consensus       395 e~i~~av~~vm~~~~~~g~~~r~~a~~~~~~~~~~G-SS~~~l~~lv~~~~~~~~  448 (453)
T PLN02764        395 ESLRDAINSVMKRDSEIGNLVKKNHTKWRETLASPG-LLTGYVDNFIESLQDLVS  448 (453)
T ss_pred             HHHHHHHHHHhcCCchhHHHHHHHHHHHHHHHHhcC-CHHHHHHHHHHHHHHhcc
Confidence            578899999998753     9999999999997664 334455556666665543


No 40 
>PF03213 Pox_P35:  Poxvirus P35 protein;  InterPro: IPR004900 The Poxvirus P35 protein is an immunodominant envelope protein. It binds to heparan sulphate on the cell surface to provide virion attachment to target cell [].; GO: 0019031 viral envelope
Probab=50.84  E-value=33  Score=27.46  Aligned_cols=39  Identities=13%  Similarity=0.177  Sum_probs=19.6

Q ss_pred             HHHHHHHHHHHHHhCCCCCC-cccCCCCCHHHHHHHHHHHHHHH
Q psy5669          36 LQKAIRSIEYVIAHRGAPHL-KTKATSLPWYQTSQLDVIASCTL   78 (116)
Q Consensus        36 ~~~av~wiE~v~r~~g~~~l-~~~~~~l~~~q~~~lDV~~~l~~   78 (116)
                      -.++..|++.  ++|++.+. ..|  -+||+-.+.++|+.++.+
T Consensus       258 wsrl~~Wla~--~~P~~~y~lttP--LfSfFGlfDInv~g~~ii  297 (325)
T PF03213_consen  258 WSRLGKWLAK--RFPGAYYFLTTP--LFSFFGLFDINVIGVIII  297 (325)
T ss_pred             HHHHHHHHHh--hCCCchhhhhch--HHHHcccchhHHHHHHHH
Confidence            3455566542  56665543 322  345555555565544433


No 41 
>COG0133 TrpB Tryptophan synthase beta chain [Amino acid transport and metabolism]
Probab=50.49  E-value=43  Score=27.29  Aligned_cols=44  Identities=20%  Similarity=0.302  Sum_probs=36.7

Q ss_pred             chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHH
Q psy5669           2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSIEYV   46 (116)
Q Consensus         2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wiE~v   46 (116)
                      +.|..|..+.-.||+|++....+.+-+.-|| +|+-.|-.-+|+.
T Consensus        26 ~eLe~ay~~~~~D~~F~~el~~~l~~Y~GRp-tpLy~a~~Lt~~~   69 (396)
T COG0133          26 EELEKAYEKAKNDPEFQAELDYLLKDYAGRP-TPLYFAERLTEHL   69 (396)
T ss_pred             HHHHHHHHHHhcCHHHHHHHHHHHHHhCCCC-ChhHHHHHHHHhh
Confidence            4677788888899999999999999999999 8888776666655


No 42 
>PF15013 CCSMST1:  CCSMST1 family
Probab=50.42  E-value=14  Score=23.53  Aligned_cols=19  Identities=26%  Similarity=0.609  Sum_probs=12.8

Q ss_pred             CcccCCCCCHHHHHHHHHH
Q psy5669          55 LKTKATSLPWYQTSQLDVI   73 (116)
Q Consensus        55 l~~~~~~l~~~q~~~lDV~   73 (116)
                      +.+....++|||.+.+=+-
T Consensus        21 ~~~~~~~~PWyq~~~is~s   39 (77)
T PF15013_consen   21 RGGKQQRMPWYQVYPISLS   39 (77)
T ss_pred             cCCCCCCCcceeeehhHHH
Confidence            5555566899998765443


No 43 
>cd03801 GT1_YqgM_like This family is most closely related to the GT1 family of glycosyltransferases and named after YqgM in Bacillus licheniformis about which little is known. Glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. 
Probab=49.04  E-value=57  Score=23.54  Aligned_cols=44  Identities=11%  Similarity=0.027  Sum_probs=29.2

Q ss_pred             chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHH
Q psy5669           2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSIEY   45 (116)
Q Consensus         2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wiE~   45 (116)
                      +++.++|.++++|+.+.+.+.+-+.....+..+....+-.+.+.
T Consensus       329 ~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  372 (374)
T cd03801         329 EALAEAILRLLDDPELRRRLGEAARERVAERFSWDRVAARTEEV  372 (374)
T ss_pred             HHHHHHHHHHHcChHHHHHHHHHHHHHHHHhcCHHHHHHHHHHh
Confidence            56889999999999877666555543344555666666555543


No 44 
>PF05454 DAG1:  Dystroglycan (Dystrophin-associated glycoprotein 1);  InterPro: IPR008465 Dystroglycan is one of the dystrophin-associated glycoproteins, which is encoded by a 5.5 kb transcript in Homo sapiens. The protein product is cleaved into two non-covalently associated subunits, [alpha] (N-terminal) and [beta] (C-terminal). In skeletal muscle the dystroglycan complex works as a transmembrane linkage between the extracellular matrix and the cytoskeleton [alpha]-dystroglycan is extracellular and binds to merosin ([alpha]-2 laminin) in the basement membrane, while [beta]-dystroglycan is a transmembrane protein and binds to dystrophin, which is a large rod-like cytoskeletal protein, absent in Duchenne muscular dystrophy patients. Dystrophin binds to intracellular actin cables. In this way, the dystroglycan complex, which links the extracellular matrix to the intracellular actin cables, is thought to provide structural integrity in muscle tissues. The dystroglycan complex is also known to serve as an agrin receptor in muscle, where it may regulate agrin-induced acetylcholine receptor clustering at the neuromuscular junction. There is also evidence which suggests the function of dystroglycan as a part of the signal transduction pathway because it is shown that Grb2, a mediator of the Ras-related signal pathway, can interact with the cytoplasmic domain of dystroglycan. In general, aberrant expression of dystrophin-associated protein complex underlies the pathogenesis of Duchenne muscular dystrophy, Becker muscular dystrophy and severe childhood autosomal recessive muscular dystrophy. Interestingly, no genetic disease has been described for either [alpha]- or [beta]-dystroglycan. Dystroglycan is widely distributed in non-muscle tissues as well as in muscle tissues. During epithelial morphogenesis of kidney, the dystroglycan complex is shown to act as a receptor for the basement membrane. Dystroglycan expression in Mus musculus brain and neural retina has also been reported. However, the physiological role of dystroglycan in non-muscle tissues has remained unclear [].; PDB: 1EG4_P.
Probab=47.62  E-value=6.2  Score=31.06  Aligned_cols=20  Identities=20%  Similarity=0.326  Sum_probs=0.0

Q ss_pred             HHHHHHHhcCCCCHHHHHHH
Q psy5669          22 QRLSTIFRSEPLHPLQKAIR   41 (116)
Q Consensus        22 ~~ls~~~~d~p~~P~~~av~   41 (116)
                      .++++++.+.--+|.+.+..
T Consensus        70 ~~L~~~L~~~~g~~~~~f~~   89 (290)
T PF05454_consen   70 EKLRKRLVDDDGKPSQEFVR   89 (290)
T ss_dssp             --------------------
T ss_pred             HHHHHHHhcCCCCcCHHHHH
Confidence            34444444443344443333


No 45 
>KOG4331|consensus
Probab=47.01  E-value=34  Score=30.80  Aligned_cols=38  Identities=13%  Similarity=-0.009  Sum_probs=17.4

Q ss_pred             CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcc
Q psy5669          63 PWYQTSQLDVIASCTLGLGLVLYTLARIGTSFAEFTRG  100 (116)
Q Consensus        63 ~~~q~~~lDV~~~l~~v~~~~~~~~~~~~~~~~~~~~~  100 (116)
                      -|..|+-.=+...+++++++++......+.++|++|++
T Consensus        98 ~lv~~~~g~lv~sV~~v~~iil~p~~~~~yccc~C~~r  135 (865)
T KOG4331|consen   98 LLVIYEAGMLVCSVILVLYIILKPSEGLCYCCCRCCRR  135 (865)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHhHHhheeeeHhh
Confidence            34444444444444444444444444555555555543


No 46 
>PHA03105 EEV glycoprotein; Provisional
Probab=46.24  E-value=21  Score=25.97  Aligned_cols=24  Identities=25%  Similarity=0.163  Sum_probs=11.0

Q ss_pred             HHHHHHHHHHHHHHhhhhccccccc
Q psy5669          81 GLVLYTLARIGTSFAEFTRGKDDKK  105 (116)
Q Consensus        81 ~~~~~~~~~~~~~~~~~~~~~~~~k  105 (116)
                      ++++| ++.+++--++.|+.++++|
T Consensus        17 iLi~Y-ll~i~K~~iKKflkkkk~K   40 (188)
T PHA03105         17 VLILY-IFFICKNTIKKFLKKKKGK   40 (188)
T ss_pred             HHHHH-HHHHHHHHHHHHHhccCCC
Confidence            33444 3445555555554444433


No 47 
>KOG4433|consensus
Probab=46.19  E-value=1.2e+02  Score=25.82  Aligned_cols=65  Identities=11%  Similarity=-0.017  Sum_probs=31.7

Q ss_pred             HHHHHHHHHHHHHHhCCCC--------CCcccCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccc
Q psy5669          35 PLQKAIRSIEYVIAHRGAP--------HLKTKATSLPWYQTSQLDVIASCTLGLGLVLYTLARIGTSFAEFTRGKDDK  104 (116)
Q Consensus        35 P~~~av~wiE~v~r~~g~~--------~l~~~~~~l~~~q~~~lDV~~~l~~v~~~~~~~~~~~~~~~~~~~~~~~~~  104 (116)
                      |.-.+-||+.+.-..|..+        .++++  +-.+.|-  |=+.+.+.+ ...++.+++.++..+|++|+++..-
T Consensus         4 ~~y~a~~~v~~lh~lP~~~~~~~~tns~F~pe--~~~Y~Qa--L~lla~l~a-a~l~l~Ll~ll~yli~~cC~Rr~~~   76 (526)
T KOG4433|consen    4 VMYFAPWWVNLLHVLPHLNFKLHPTNSVFRPE--DSEYQQA--LLLLAALAA-ACLGLSLLFLLFYLICRCCCRRETT   76 (526)
T ss_pred             ccccchhHHHHhhcCCCCCceEeeccccCCCC--cHHHHHH--HHHHHHHHH-HHHHHHHHHHHHHHHHHHHcCCCcc
Confidence            4456778998887766432        24443  2223332  222221211 1223344455566667777665543


No 48 
>PLN00164 glucosyltransferase; Provisional
Probab=45.51  E-value=24  Score=29.33  Aligned_cols=30  Identities=10%  Similarity=0.120  Sum_probs=23.2

Q ss_pred             chHHHHHHHHhcCHH-----HHHHHHHHHHHHhcC
Q psy5669           2 RSSRRKVQPACMSPQ-----YKTNVQRLSTIFRSE   31 (116)
Q Consensus         2 ~~l~~aI~~vl~n~~-----Yk~na~~ls~~~~d~   31 (116)
                      +.+..+|++++.|++     ++++|++++...+..
T Consensus       419 e~l~~av~~vm~~~~~~~~~~r~~a~~~~~~~~~a  453 (480)
T PLN00164        419 AELERAVRSLMGGGEEEGRKAREKAAEMKAACRKA  453 (480)
T ss_pred             HHHHHHHHHHhcCCchhHHHHHHHHHHHHHHHHHH
Confidence            478899999998765     588888887776654


No 49 
>PTZ00234 variable surface protein Vir12; Provisional
Probab=45.38  E-value=17  Score=30.12  Aligned_cols=31  Identities=19%  Similarity=0.276  Sum_probs=15.2

Q ss_pred             HHHHHHHHHHHHhhhhccccccccCCCcccc
Q psy5669          83 VLYTLARIGTSFAEFTRGKDDKKATKPDRNK  113 (116)
Q Consensus        83 ~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~  113 (116)
                      ++++.+.-.-.-++.-+.+|++|.+..-|||
T Consensus       377 ifFlfyyn~ss~lks~~~krkrkk~~~ehny  407 (433)
T PTZ00234        377 LVFLFFFFKSTPIRSQTNKGEKKKRKPQNNY  407 (433)
T ss_pred             HHHhhhhhcccchhccccchhhcccchhhhh
Confidence            3344444443444444455555555556664


No 50 
>COG2011 AbcD ABC-type metal ion transport system, permease component [Inorganic ion transport and metabolism]
Probab=45.23  E-value=40  Score=25.57  Aligned_cols=34  Identities=18%  Similarity=0.266  Sum_probs=25.7

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh
Q psy5669          65 YQTSQLDVIASCTLGLGLVLYTLARIGTSFAEFT   98 (116)
Q Consensus        65 ~q~~~lDV~~~l~~v~~~~~~~~~~~~~~~~~~~   98 (116)
                      ||.|+-||..+.++++++++.++-.+--++.+++
T Consensus       186 Y~Rf~~~Vm~~~viillilVq~iQ~~Gd~l~~r~  219 (222)
T COG2011         186 YQRFNTDVMVVAVVILLILVQLIQSLGDRLVRRL  219 (222)
T ss_pred             HHhcCCchhHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            7889999998777777777777666666666654


No 51 
>PF07219 HemY_N:  HemY protein N-terminus;  InterPro: IPR010817 This entry represents the N terminus (approximately 150 residues) of bacterial HemY porphyrin biosynthesis proteins. These are membrane protein involved in a late step of protoheme IX synthesis [].
Probab=43.90  E-value=54  Score=21.42  Aligned_cols=23  Identities=30%  Similarity=0.216  Sum_probs=11.3

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHhh
Q psy5669          74 ASCTLGLGLVLYTLARIGTSFAE   96 (116)
Q Consensus        74 ~~l~~v~~~~~~~~~~~~~~~~~   96 (116)
                      +++++++++++++++++++.+++
T Consensus        20 ~~~l~~~~~~l~ll~~ll~~~~~   42 (108)
T PF07219_consen   20 LILLLLLFVVLYLLLRLLRRLLS   42 (108)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHh
Confidence            33444444455555565555543


No 52 
>cd08806 CARD_CARD14_CARMA2 Caspase activation and recruitment domain of CARD14-like proteins. Caspase activation and recruitment domain (CARD) similar to that found in CARD14, also known as BIMP2 or CARMA2 (caspase recruitment domain-containing membrane-associated guanylate kinase protein 2). CARD14 has been identified as a novel member of the MAGUK (membrane-associated guanylate kinase) family that functions as upstream activators of BCL10 (B-cell lymphoma 10) and NF-kB signaling. In general, CARDs are death domains (DDs) found associated with caspases. They are known to be important in the signaling pathways for apoptosis, inflammation, and host-defense mechanisms. DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-association or heterodimers by associating with other members of the DD superfamily including PYRIN and DED (Death Effector Domain). They serve as adaptors in signaling pathways 
Probab=43.64  E-value=82  Score=20.45  Aligned_cols=38  Identities=13%  Similarity=0.211  Sum_probs=30.0

Q ss_pred             HHHHHhcCHHHHHHHHH---HHHHHhcCCCCHHHHHHHHHH
Q psy5669           7 KVQPACMSPQYKTNVQR---LSTIFRSEPLHPLQKAIRSIE   44 (116)
Q Consensus         7 aI~~vl~n~~Yk~na~~---ls~~~~d~p~~P~~~av~wiE   44 (116)
                      -..+|++++.+..++.+   +-.++..+--.+...++.-.|
T Consensus        35 deeeIls~~t~~~r~~k~g~LLDIL~trG~~g~~aFLeSLe   75 (86)
T cd08806          35 DEEEVLHSPRLTNRAMRVGHLLDLLKTRGKNGAIAFLESLK   75 (86)
T ss_pred             hHHHHHccchHHHHHHHHHHHHHHHHhcCchHHHHHHHHHH
Confidence            45789999999999999   777888887676666666655


No 53 
>cd03822 GT1_ecORF704_like This family is most closely related to the GT1 family of glycosyltransferases. ORF704 in E. coli has been shown to be involved in the biosynthesis of O-specific mannose homopolysaccharides.
Probab=43.40  E-value=95  Score=22.88  Aligned_cols=43  Identities=7%  Similarity=0.084  Sum_probs=29.7

Q ss_pred             chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHH
Q psy5669           2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSIEY   45 (116)
Q Consensus         2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wiE~   45 (116)
                      +.+.++|.++++|+...+.+.+-+..+..+ ++....+-.|.+.
T Consensus       322 ~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~-~s~~~~~~~~~~~  364 (366)
T cd03822         322 AALAEAIRRLLADPELAQALRARAREYARA-MSWERVAERYLRL  364 (366)
T ss_pred             HHHHHHHHHHHcChHHHHHHHHHHHHHHhh-CCHHHHHHHHHHH
Confidence            468899999999976555555555445545 7877777766654


No 54 
>PF05510 Sarcoglycan_2:  Sarcoglycan alpha/epsilon;  InterPro: IPR008908 Sarcoglycans are a subcomplex of transmembrane proteins which are part of the dystrophin-glycoprotein complex. They are expressed in the skeletal, cardiac and smooth muscle. Although numerous studies have been conducted on the sarcoglycan subcomplex in skeletal and cardiac muscle, the manner of the distribution and localisation of these proteins along the nonjunctional sarcolemma is not clear []. This family contains alpha and epsilon members.; GO: 0016012 sarcoglycan complex
Probab=42.68  E-value=40  Score=27.65  Aligned_cols=21  Identities=14%  Similarity=0.137  Sum_probs=9.0

Q ss_pred             CcccCCCCCHHHHHHHHHHHHH
Q psy5669          55 LKTKATSLPWYQTSQLDVIASC   76 (116)
Q Consensus        55 l~~~~~~l~~~q~~~lDV~~~l   76 (116)
                      ..++...++-- .|.-|.++.+
T Consensus       269 ~~Pp~~~~p~R-~y~~d~~vtl  289 (386)
T PF05510_consen  269 FNPPKESVPGR-DYFPDFLVTL  289 (386)
T ss_pred             CCCCccccccc-ccHHHHHHHH
Confidence            44444444432 2444544333


No 55 
>PF04328 DUF466:  Protein of unknown function (DUF466);  InterPro: IPR007423 This is a small bacterial protein of unknown function.
Probab=42.28  E-value=79  Score=19.21  Aligned_cols=44  Identities=20%  Similarity=0.217  Sum_probs=34.4

Q ss_pred             chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHH
Q psy5669           2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSIEYVI   47 (116)
Q Consensus         2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wiE~v~   47 (116)
                      +.+...++.++.+|.|..=+.....---|.|  |+..-.+|=+...
T Consensus        10 ~~~~~~~r~l~G~~~Ye~Yv~H~~~~HP~~p--~ms~~eF~r~r~~   53 (65)
T PF04328_consen   10 RRVRWYARLLVGEPDYERYVEHMRRHHPDEP--PMSEREFFRERQD   53 (65)
T ss_pred             HHHHHHHHHhcCcHHHHHHHHHHHHHCcCCC--CCCHHHHHHHHHH
Confidence            3567889999999999999999888877775  6666667766543


No 56 
>PF01601 Corona_S2:  Coronavirus S2 glycoprotein;  InterPro: IPR002552 The type I glycoprotein S of Coronavirus, trimers of which constitute the typical viral spikes, is assembled into virions through noncovalent interactions with the M protein. The spike glycoprotein is translated as a large polypeptide that is subsequently cleaved to S1 IPR002551 from INTERPRO and S2 []. Both chimeric S proteins appeared to cause cell fusion when expressed individually, suggesting that they were biologically fully active []. The spike is a type I membrane glycoprotein that possesses a conserved transmembrane anchor and an unusual cysteine-rich (cys) domain that bridges the putative junction of the anchor and the cytoplasmic tail [].; GO: 0006944 cellular membrane fusion, 0046813 virion attachment, binding of host cell surface receptor, 0016021 integral to membrane, 0019031 viral envelope; PDB: 2BEQ_B 2FXP_A 1ZVB_A 1WNC_D 1ZV8_H 1ZV7_B 1WYY_B 1ZVA_A 2BEZ_F 1WDG_A ....
Probab=42.27  E-value=8.4  Score=33.26  Aligned_cols=33  Identities=12%  Similarity=0.066  Sum_probs=0.4

Q ss_pred             CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy5669          60 TSLPWYQTSQLDVIASCTLGLGLVLYTLARIGT   92 (116)
Q Consensus        60 ~~l~~~q~~~lDV~~~l~~v~~~~~~~~~~~~~   92 (116)
                      ..+|||....+=+.+++++.+++.+++..-||+
T Consensus       544 iKWPWyVWL~i~~~li~~~~~l~~~~~~TGCCG  576 (610)
T PF01601_consen  544 IKWPWYVWLAIILALIAFALILLWCCCMTGCCG  576 (610)
T ss_dssp             H--------------------------------
T ss_pred             eehHHHHHHHHHHHHHHHHHHHHHHHHhcCCcc
Confidence            378998877666555554444444444444554


No 57 
>PF10661 EssA:  WXG100 protein secretion system (Wss), protein EssA;  InterPro: IPR018920  The Wss (WXG100 protein secretion system) in Staphylococcus aureus seems to be encoded by a locus of eight ORFs, called ess (eSAT-6 secretion system) []. This locus encodes, amongst several other proteins, EssA, a protein predicted to possess one transmembrane domain. Due to its predicted membrane location and its absolute requirement for WXG100 protein secretion, it has been speculated that EssA could form a secretion apparatus in conjunction with YukC and YukAB. Proteins homologous to EssA, YukC, EsaA and YukD were absent from mycobacteria [].   Members of this family are associated with type VII secretion of WXG100 family targets in the Firmicutes, but not in the Actinobacteria. This highly divergent protein family consists largely of a central region of highly polar low-complexity sequence containing occasional LF motifs in weak repeats about 17 residues in length, flanked by hydrophobic N- and C-terminal regions. 
Probab=42.20  E-value=46  Score=23.45  Aligned_cols=14  Identities=0%  Similarity=-0.278  Sum_probs=6.6

Q ss_pred             HHHHHHHHHHHHHH
Q psy5669          81 GLVLYTLARIGTSF   94 (116)
Q Consensus        81 ~~~~~~~~~~~~~~   94 (116)
                      +++.+++|.++|-+
T Consensus       130 l~i~~giy~~~r~~  143 (145)
T PF10661_consen  130 LAICGGIYVVLRKV  143 (145)
T ss_pred             HHHHHHHHHHHHHh
Confidence            33444455555544


No 58 
>cd03794 GT1_wbuB_like This family is most closely related to the GT1 family of glycosyltransferases. wbuB in E. coli is involved in the biosynthesis of the O26 O-antigen.  It has been proposed to function as an N-acetyl-L-fucosamine (L-FucNAc) transferase.
Probab=40.80  E-value=46  Score=24.43  Aligned_cols=39  Identities=8%  Similarity=0.034  Sum_probs=25.1

Q ss_pred             chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHH
Q psy5669           2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAI   40 (116)
Q Consensus         2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av   40 (116)
                      +++.++|.++++|+..++++.+-+..+..+..++...+-
T Consensus       353 ~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~  391 (394)
T cd03794         353 EALAAAILELLDDPEERAEMGENGRRYVEEKFSREKLAE  391 (394)
T ss_pred             HHHHHHHHHHHhChHHHHHHHHHHHHHHHHhhcHHHHHH
Confidence            467888999998888776665555544444455554443


No 59 
>cd03821 GT1_Bme6_like This family is most closely related to the GT1 family of glycosyltransferases. Bme6 in Brucella melitensis has been shown to be involved in the biosynthesis of a polysaccharide.
Probab=40.12  E-value=62  Score=23.67  Aligned_cols=41  Identities=12%  Similarity=-0.031  Sum_probs=25.8

Q ss_pred             chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHH
Q psy5669           2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRS   42 (116)
Q Consensus         2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~w   42 (116)
                      +.+.++|.++++|++..+.+.+-+.....+..+....+-.|
T Consensus       333 ~~~~~~i~~l~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~~  373 (375)
T cd03821         333 DALAAALRRALELPQRLKAMGENGRALVEERFSWTAIAQQL  373 (375)
T ss_pred             HHHHHHHHHHHhCHHHHHHHHHHHHHHHHHhcCHHHHHHHh
Confidence            46788899999988766665555554434445555544433


No 60 
>PF01102 Glycophorin_A:  Glycophorin A;  InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Glycophorin A (PAS-2) and glycophorin B (PAS-3) belong to the MNS blood group system and are associated with antigens that include M/N, S/s, U, He, Mi(a), M(c), Vw, Mur, M(g), Vr, M(e), Mt(a), St(a), Ri(a), Cl(a), Ny(a), Hut, Hil, M(v), Far, Mit, Dantu, Hop, Nob, En(a), ENKT, amongst others. Glycophorin A is the major sialoglycoprotein of the erythrocyte membrane []. Structurally, glycophorin A consists of an N-terminal extracellular domain, heavily glycosylated on serine and threonine residues, followed by a transmembrane region and a C-terminal cytoplasmic domain. Other glycophorins in this entry such as Glycophorin B and Glycophorin E represent minor sialoglycoproteins in the erythrocyte membrane.; GO: 0016021 integral to membrane; PDB: 2KPF_B 1AFO_B 2KPE_A.
Probab=39.98  E-value=67  Score=22.07  Aligned_cols=10  Identities=10%  Similarity=0.282  Sum_probs=0.0

Q ss_pred             HhcCCCCHHH
Q psy5669          28 FRSEPLHPLQ   37 (116)
Q Consensus        28 ~~d~p~~P~~   37 (116)
                      +-..+.+|.+
T Consensus        41 is~tt~sP~e   50 (122)
T PF01102_consen   41 ISVTTVSPPE   50 (122)
T ss_dssp             ----------
T ss_pred             eccccccCCC
Confidence            3344444444


No 61 
>PRK08939 primosomal protein DnaI; Reviewed
Probab=39.82  E-value=42  Score=26.23  Aligned_cols=32  Identities=13%  Similarity=0.301  Sum_probs=25.7

Q ss_pred             CchHHHHHHHHhcCHHHHHHHHHHHHHHhcCC
Q psy5669           1 MRSSRRKVQPACMSPQYKTNVQRLSTIFRSEP   32 (116)
Q Consensus         1 ~~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p   32 (116)
                      ||++.+++.++..++.++++.+++...+...|
T Consensus         1 me~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~   32 (306)
T PRK08939          1 MESIGKTLKKLMKRQDFRKRYEKLKQEVLADP   32 (306)
T ss_pred             CccHHHHHHHHHhchhHHHHHHHHHHHHhcCH
Confidence            89999999999988888888887755554444


No 62 
>PLN02992 coniferyl-alcohol glucosyltransferase
Probab=39.61  E-value=91  Score=26.10  Aligned_cols=28  Identities=11%  Similarity=0.172  Sum_probs=21.1

Q ss_pred             chHHHHHHHHhcCH---HHHHHHHHHHHHHh
Q psy5669           2 RSSRRKVQPACMSP---QYKTNVQRLSTIFR   29 (116)
Q Consensus         2 ~~l~~aI~~vl~n~---~Yk~na~~ls~~~~   29 (116)
                      +++.++|+++++|+   .+++++++++...+
T Consensus       415 ~~l~~av~~vm~~~~g~~~r~~a~~~~~~a~  445 (481)
T PLN02992        415 SKIEALVRKVMVEEEGEEMRRKVKKLRDTAE  445 (481)
T ss_pred             HHHHHHHHHHhcCCchHHHHHHHHHHHHHHH
Confidence            57889999999774   57777777766555


No 63 
>KOG3989|consensus
Probab=39.60  E-value=45  Score=25.34  Aligned_cols=11  Identities=36%  Similarity=0.352  Sum_probs=5.5

Q ss_pred             HHHHHHHHhCC
Q psy5669          41 RSIEYVIAHRG   51 (116)
Q Consensus        41 ~wiE~v~r~~g   51 (116)
                      -|+.++...+|
T Consensus       162 ~~i~~v~~~~~  172 (224)
T KOG3989|consen  162 IWIHYVYFYNG  172 (224)
T ss_pred             HHHHHhhccCC
Confidence            35555554444


No 64 
>PRK10907 intramembrane serine protease GlpG; Provisional
Probab=39.60  E-value=1.5e+02  Score=22.96  Aligned_cols=31  Identities=16%  Similarity=0.068  Sum_probs=26.8

Q ss_pred             hcCHHHHHHHHHHHHHHhcCCCCHHHHHHHH
Q psy5669          12 CMSPQYKTNVQRLSTIFRSEPLHPLQKAIRS   42 (116)
Q Consensus        12 l~n~~Yk~na~~ls~~~~d~p~~P~~~av~w   42 (116)
                      +.|++..+.+++.-+.|.+.|.+|.-.+.-|
T Consensus        39 l~d~~~~~~~~~~~~~f~~~p~~~~y~~asw   69 (276)
T PRK10907         39 LADESQAERVRAELARFLENPADPRYLAASW   69 (276)
T ss_pred             ecCHHHHHHHHHHHHHHHhCCCchhHHhccc
Confidence            4699999999999999999999999555555


No 65 
>PF03579 SHP:  Small hydrophobic protein;  InterPro: IPR005327 The small hydrophobic integral membrane protein, SH (previously designated 1A) is found to have a variety of glycosylated forms [, ]. This protein is a component of the mature respiratory syncytial virion [] where it may form complexes and appears to play a structural role.; GO: 0016020 membrane, 0016021 integral to membrane, 0048222 glycoprotein network
Probab=39.43  E-value=87  Score=18.93  Aligned_cols=9  Identities=11%  Similarity=0.124  Sum_probs=5.1

Q ss_pred             HHHHHHHHH
Q psy5669          64 WYQTSQLDV   72 (116)
Q Consensus        64 ~~q~~~lDV   72 (116)
                      ||.|+.|-.
T Consensus        14 FW~YFtLi~   22 (64)
T PF03579_consen   14 FWTYFTLIF   22 (64)
T ss_pred             cchHHHHHH
Confidence            566666533


No 66 
>PF14851 FAM176:  FAM176 family
Probab=37.71  E-value=57  Score=23.32  Aligned_cols=6  Identities=0%  Similarity=-0.313  Sum_probs=2.4

Q ss_pred             HHHHHh
Q psy5669          90 IGTSFA   95 (116)
Q Consensus        90 ~~~~~~   95 (116)
                      +.++.|
T Consensus        42 V~risc   47 (153)
T PF14851_consen   42 VIRISC   47 (153)
T ss_pred             Hhhhee
Confidence            334444


No 67 
>PF06345 Drf_DAD:  DRF Autoregulatory Domain;  InterPro: IPR010465 This domain is found in Diaphanous-related formins (Drfs). It binds the N-terminal GTPase-binding domain; this link is broken when GTP-bound Rho binds to the GBD and activates the protein. The addition of diaphanous activating domains (DAD) to mammalian cells induces actin filament formation, stabilises microtubules, and activates serum-response mediated transcription [].; PDB: 3O4X_H 3OBV_E 2BAP_C 2F31_B.
Probab=37.62  E-value=18  Score=15.74  Aligned_cols=10  Identities=30%  Similarity=0.218  Sum_probs=6.5

Q ss_pred             CchHHHHHHH
Q psy5669           1 MRSSRRKVQP   10 (116)
Q Consensus         1 ~~~l~~aI~~   10 (116)
                      |+++.+|++.
T Consensus         3 mdsllealqt   12 (15)
T PF06345_consen    3 MDSLLEALQT   12 (15)
T ss_dssp             HHHHHHHHHH
T ss_pred             HHHHHHHHHc
Confidence            6677777653


No 68 
>PF04906 Tweety:  Tweety;  InterPro: IPR006990 None of the members of the tweety (tty) family have been functionally characterised. However, they are considered to be transmembrane proteins with five potential membrane-spanning regions. A number of potential functions have been suggested on the basis of homology to the yeast FTR1 and FTH1 iron transporter proteins and the mammalian neurotensin receptors 1 and 2 in that they have a similar hydrophobicity profiles although there is no detectable sequence homology to the tweety-related proteins. It has been proposed that the tweety-related proteins could be involved in transport of iron or other divalent cations or alternatively that they may be membrane-bound receptors [].
Probab=37.60  E-value=1.6e+02  Score=23.99  Aligned_cols=18  Identities=6%  Similarity=-0.153  Sum_probs=9.5

Q ss_pred             HHHHHHHHHhhhhccccc
Q psy5669          86 TLARIGTSFAEFTRGKDD  103 (116)
Q Consensus        86 ~~~~~~~~~~~~~~~~~~  103 (116)
                      ++..++..+|++||++++
T Consensus        34 Ll~ll~yl~~~CC~r~~~   51 (406)
T PF04906_consen   34 LLFLLIYLICRCCCRRPR   51 (406)
T ss_pred             HHHHHHHHHHHhhCCCCC
Confidence            344455556666655543


No 69 
>PHA02688 ORF059 IMV protein VP55; Provisional
Probab=37.53  E-value=52  Score=26.35  Aligned_cols=40  Identities=10%  Similarity=0.056  Sum_probs=21.7

Q ss_pred             HHHHHHHHHHHHHhCCCCCC-cccCCCCCHHHHHHHHHHHHHHHH
Q psy5669          36 LQKAIRSIEYVIAHRGAPHL-KTKATSLPWYQTSQLDVIASCTLG   79 (116)
Q Consensus        36 ~~~av~wiE~v~r~~g~~~l-~~~~~~l~~~q~~~lDV~~~l~~v   79 (116)
                      ..++..|+.  -++||+.+. ..|  -+||+-.+.++|+.++.++
T Consensus       256 wsrl~~Wla--~~~P~~~y~lttP--LfSfFGlfDInv~gviiil  296 (323)
T PHA02688        256 WSRLGTWLA--KRYPGFYYFLTTP--LFSFFGLFDINVIGVIIIL  296 (323)
T ss_pred             HHHHHHHHH--hhCCchheeecch--HHHhhccchhHHHHHHHHH
Confidence            445666754  366776553 332  3466666666665444433


No 70 
>TIGR03798 ocin_TIGR03798 bacteriocin propeptide, TIGR03798 family. This model describes a conserved, fairly long (about 65 residue) propeptide region for a family of putative microcins, that is, bacteriocins of small size. Members of the seed alignment tend to have the Gly-Gly motif as the last two residues of the matched region. This is a cleavage site for a combination processing/export ABC transporter with a peptidase domain.
Probab=37.39  E-value=37  Score=20.17  Aligned_cols=24  Identities=4%  Similarity=0.124  Sum_probs=21.3

Q ss_pred             CchHHHHHHHHhcCHHHHHHHHHH
Q psy5669           1 MRSSRRKVQPACMSPQYKTNVQRL   24 (116)
Q Consensus         1 ~~~l~~aI~~vl~n~~Yk~na~~l   24 (116)
                      |+.+..-+..+.+||..++.++..
T Consensus         1 ~e~l~~Fl~~~~~d~~L~~~l~~~   24 (64)
T TIGR03798         1 EEQLKAFLEKVKTDPDLREKLKAA   24 (64)
T ss_pred             CHHHHHHHHHHHcCHHHHHHHHHc
Confidence            788999999999999999987774


No 71 
>PF14979 TMEM52:  Transmembrane 52
Probab=37.33  E-value=86  Score=22.44  Aligned_cols=15  Identities=20%  Similarity=0.197  Sum_probs=6.8

Q ss_pred             HHHHHHHHHHHHHHH
Q psy5669          70 LDVIASCTLGLGLVL   84 (116)
Q Consensus        70 lDV~~~l~~v~~~~~   84 (116)
                      +-|++++++++++++
T Consensus        20 WyIwLill~~~llLL   34 (154)
T PF14979_consen   20 WYIWLILLIGFLLLL   34 (154)
T ss_pred             hHHHHHHHHHHHHHH
Confidence            345555544444333


No 72 
>PHA02754 hypothetical protein; Provisional
Probab=37.23  E-value=55  Score=19.88  Aligned_cols=27  Identities=19%  Similarity=0.370  Sum_probs=22.2

Q ss_pred             HHHHHHhcCHHHHHHHHHHHHHHhcCC
Q psy5669           6 RKVQPACMSPQYKTNVQRLSTIFRSEP   32 (116)
Q Consensus         6 ~aI~~vl~n~~Yk~na~~ls~~~~d~p   32 (116)
                      +-|.+.+.++.|++.++++..++.++-
T Consensus         5 eEi~k~i~eK~Fke~MRelkD~LSe~G   31 (67)
T PHA02754          5 EEIPKAIMEKDFKEAMRELKDILSEAG   31 (67)
T ss_pred             HHHHHHHHHhHHHHHHHHHHHHHhhCc
Confidence            457788889999999999999887653


No 73 
>PF01102 Glycophorin_A:  Glycophorin A;  InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Glycophorin A (PAS-2) and glycophorin B (PAS-3) belong to the MNS blood group system and are associated with antigens that include M/N, S/s, U, He, Mi(a), M(c), Vw, Mur, M(g), Vr, M(e), Mt(a), St(a), Ri(a), Cl(a), Ny(a), Hut, Hil, M(v), Far, Mit, Dantu, Hop, Nob, En(a), ENKT, amongst others. Glycophorin A is the major sialoglycoprotein of the erythrocyte membrane []. Structurally, glycophorin A consists of an N-terminal extracellular domain, heavily glycosylated on serine and threonine residues, followed by a transmembrane region and a C-terminal cytoplasmic domain. Other glycophorins in this entry such as Glycophorin B and Glycophorin E represent minor sialoglycoproteins in the erythrocyte membrane.; GO: 0016021 integral to membrane; PDB: 2KPF_B 1AFO_B 2KPE_A.
Probab=37.10  E-value=66  Score=22.10  Aligned_cols=7  Identities=14%  Similarity=-0.391  Sum_probs=2.5

Q ss_pred             HHHHHHH
Q psy5669          73 IASCTLG   79 (116)
Q Consensus        73 ~~~l~~v   79 (116)
                      +++++.+
T Consensus        67 ~~Ii~gv   73 (122)
T PF01102_consen   67 IGIIFGV   73 (122)
T ss_dssp             HHHHHHH
T ss_pred             eehhHHH
Confidence            3333333


No 74 
>PRK13028 tryptophan synthase subunit beta; Provisional
Probab=36.75  E-value=45  Score=27.25  Aligned_cols=38  Identities=16%  Similarity=0.257  Sum_probs=30.2

Q ss_pred             chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHH
Q psy5669           2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAI   40 (116)
Q Consensus         2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av   40 (116)
                      +.|.++..++..||+|++....+-.-+..+| +|+..+-
T Consensus        32 ~~~~~~~~~~~~~~~f~~~~~~~~~~~~g~p-TPL~~~~   69 (402)
T PRK13028         32 DELEAAYEEIKKDPDFIAELRYLLKHYVGRP-TPLYHAK   69 (402)
T ss_pred             HHHHHHHHHHhCCHHHHHHHHHHHHHhCCCC-CCeeehH
Confidence            5788888999999999999999888887766 5554443


No 75 
>smart00394 RIIa RIIalpha, Regulatory subunit portion of type II PKA R-subunit. RIIalpha, Regulatory subunit portion of type II PKA R-subunit. Contains dimerisation interface and binding site for A-kinase-anchoring proteins (AKAPs).
Probab=36.44  E-value=70  Score=16.91  Aligned_cols=24  Identities=21%  Similarity=0.242  Sum_probs=19.0

Q ss_pred             HHHHhcCCCCHHHHHHHHHHHHHH
Q psy5669          25 STIFRSEPLHPLQKAIRSIEYVIA   48 (116)
Q Consensus        25 s~~~~d~p~~P~~~av~wiE~v~r   48 (116)
                      ..+++++|-+|.+-++.|.+-..+
T Consensus        12 ~~vl~~qP~d~~~f~~~yF~kL~~   35 (38)
T smart00394       12 VEVLRAQPSDLVQFAADYFEKLEE   35 (38)
T ss_pred             HHHHHHCCCcHHHHHHHHHHHHHH
Confidence            457889999999999888875543


No 76 
>PF07856 Orai-1:  Mediator of CRAC channel activity;  InterPro: IPR012446 This entry includes Drosophila Orai and human Orai1, Orai2 and Orai3. ORAI-1 GFP reporters are co-expressed with STIM-1 (ER CA(2+) sensors) in the gonad and intestine. The protein has four predicted transmembrane domains with a highly conserved region between TM2 ad TM3. This conserved domain is thought to function in channel regulation. ORAI1-related proteins are required for the production of the calcium channel, CRAC, along with STIM1-related proteins [].
Probab=36.37  E-value=42  Score=24.41  Aligned_cols=56  Identities=16%  Similarity=0.189  Sum_probs=32.8

Q ss_pred             cCCCCHHHHHHHHHHHHHHhC---CCCCCcccCCCCCHHHHHHH--HHHHHHHHHHHHHHH
Q psy5669          30 SEPLHPLQKAIRSIEYVIAHR---GAPHLKTKATSLPWYQTSQL--DVIASCTLGLGLVLY   85 (116)
Q Consensus        30 d~p~~P~~~av~wiE~v~r~~---g~~~l~~~~~~l~~~q~~~l--DV~~~l~~v~~~~~~   85 (116)
                      ..|.+|.+..-.|+|-.-++.   |..-+=..-.-++|++++..  ...+..+.++++...
T Consensus        93 ~~~~sP~~~f~~~cE~~W~~s~~lGi~lFL~~l~l~~WIKF~~~~~~~aa~~~t~i~~~~~  153 (175)
T PF07856_consen   93 PVPPSPHRRFHRYCELAWRFSTVLGIPLFLAELALLGWIKFWDSPSPAAAIAITAILVPVL  153 (175)
T ss_pred             CCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHheeehhccchHHHHHHHHHHHHHH
Confidence            377789888888888332221   43333333346789998888  555444444443333


No 77 
>PF03889 DUF331:  Domain of unknown function;  InterPro: IPR005589 The members of this family are uncharacterised proteins from a number of bacterial species. They range in size from 50-100 residues.
Probab=36.19  E-value=26  Score=19.41  Aligned_cols=19  Identities=16%  Similarity=0.448  Sum_probs=15.0

Q ss_pred             HHHHHHHhcCHHHHHHHHH
Q psy5669           5 RRKVQPACMSPQYKTNVQR   23 (116)
Q Consensus         5 ~~aI~~vl~n~~Yk~na~~   23 (116)
                      .+|+..++.+|-|+...++
T Consensus         5 dNalkAlv~~~LFr~rvek   23 (39)
T PF03889_consen    5 DNALKALVTSPLFRQRVEK   23 (39)
T ss_pred             hhHHHHHHhCcchhhhhhc
Confidence            3688888999988887654


No 78 
>PF08374 Protocadherin:  Protocadherin;  InterPro: IPR013585 The structure of protocadherins is similar to that of classic cadherins (IPR002126 from INTERPRO), but they also have some unique features associated with the cytoplasmic domains. They are expressed in a variety of organisms and are found in high concentrations in the brain where they seem to be localised mainly at cell-cell contact sites. Their expression seems to be developmentally regulated []. 
Probab=36.13  E-value=38  Score=25.70  Aligned_cols=7  Identities=29%  Similarity=0.468  Sum_probs=2.5

Q ss_pred             HHHHHHh
Q psy5669          43 IEYVIAH   49 (116)
Q Consensus        43 iE~v~r~   49 (116)
                      ||-.+++
T Consensus         8 i~~lv~r   14 (221)
T PF08374_consen    8 IEELVRR   14 (221)
T ss_pred             HHHHHHh
Confidence            3333333


No 79 
>cd00904 Ferritin Ferritin iron storage proteins. Ferritins are the primary iron storage proteins of most living organisms and members of a broad superfamily of ferritin-like diiron-carboxylate proteins. The iron-free (apoferritin) ferritin molecule is a protein shell composed of 24 protein chains arranged in 432 symmetry. Iron storage involves the uptake of iron (II) at the protein shell, its oxidation by molecular oxygen at the dinuclear ferroxidase centers, and the movement of iron (III) into the cavity for deposition as ferrihydrite; the protein shell can hold up to 4500 iron atoms. In vertebrates, two types of chains (subunits) have been characterized, H or M (fast) and L (slow), which differ in rates of iron uptake and mineralization. Bacterial non-heme ferritins are composed only of H chains. Fe(II) oxidation in the H/M subunits take place initially at the ferroxidase center, a carboxylate-bridged diiron center, located within the subunit four-helix bundle. In a complementary rol
Probab=36.06  E-value=52  Score=22.99  Aligned_cols=38  Identities=13%  Similarity=0.258  Sum_probs=31.1

Q ss_pred             HHHHHHHHHhcCCCCHHHHHHHHHHHHHHhCCCCCCcc
Q psy5669          20 NVQRLSTIFRSEPLHPLQKAIRSIEYVIAHRGAPHLKT   57 (116)
Q Consensus        20 na~~ls~~~~d~p~~P~~~av~wiE~v~r~~g~~~l~~   57 (116)
                      +...+++-|+.+-..-.+.|...++|+...||.+.+..
T Consensus        35 ~l~g~a~~f~~~s~eE~~HA~~l~~yi~~rgg~~~l~~   72 (160)
T cd00904          35 ALKGVAHFFKEQAQEEREHAEKFYKYQNERGGRVELQD   72 (160)
T ss_pred             cchhHHHHHHHHHHHHHHHHHHHHHHHHHCCCccccCc
Confidence            56667777888877888999999999999999776643


No 80 
>PF06837 Fijivirus_P9-2:  Fijivirus P9-2 protein;  InterPro: IPR009650 This family consists of several Fijivirus specific P9-2 proteins from Rice black streaked dwarf virus (RBSDV) and Fiji disease virus. The function of this family is unknown.
Probab=36.02  E-value=94  Score=23.23  Aligned_cols=66  Identities=9%  Similarity=0.077  Sum_probs=35.6

Q ss_pred             cCCC--CHHHHHHHHHHHHHHhCCCCCCcccCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccc
Q psy5669          30 SEPL--HPLQKAIRSIEYVIAHRGAPHLKTKATSLPWYQTSQLDVIASCTLGLGLVLYTLARIGTSFAEFTRGK  101 (116)
Q Consensus        30 d~p~--~P~~~av~wiE~v~r~~g~~~l~~~~~~l~~~q~~~lDV~~~l~~v~~~~~~~~~~~~~~~~~~~~~~  101 (116)
                      ++|.  |.+|.-+.-.|..++-...|-++..-...+-+      +++++..+++.++-++.+++.-++|+.|.+
T Consensus        43 e~~~sdse~Dd~vd~lE~~ve~~~~pl~rr~ygkvg~I------i~mI~sflfFGifkl~lKmfYhLfrcV~CN  110 (214)
T PF06837_consen   43 ERPLSDSELDDKVDKLETDVEDKVDPLVRRKYGKVGHI------IWMIFSFLFFGIFKLTLKMFYHLFRCVCCN  110 (214)
T ss_pred             cccCcchhHHHHHHHHhhhHHhCCCHHHHhhccCccee------ehHHHHHHHHHHHHHHHHHHHHHHhhhhcC
Confidence            4564  55677788888887776555444332233221      122233333445555667776666665543


No 81 
>PRK04346 tryptophan synthase subunit beta; Validated
Probab=34.88  E-value=53  Score=26.82  Aligned_cols=37  Identities=19%  Similarity=0.312  Sum_probs=29.3

Q ss_pred             chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHH
Q psy5669           2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKA   39 (116)
Q Consensus         2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~a   39 (116)
                      +.|.++..++.+||.|++....+-.-+.-+| +|+-.+
T Consensus        28 ~~~~~~~~~~~~~~~f~~~~~~~~~~~~grp-TPL~~~   64 (397)
T PRK04346         28 EELEEAYEKAKNDPEFQAELDYLLKNYVGRP-TPLYFA   64 (397)
T ss_pred             HHHHHHHHHHhcCHHHHHHHHHHHHHhcCCC-CCceEh
Confidence            5688888999999999999988888776665 555444


No 82 
>PRK11024 colicin uptake protein TolR; Provisional
Probab=34.72  E-value=33  Score=23.54  Aligned_cols=14  Identities=29%  Similarity=0.123  Sum_probs=10.2

Q ss_pred             HHHHHHHHHHHHHH
Q psy5669          69 QLDVIASCTLGLGL   82 (116)
Q Consensus        69 ~lDV~~~l~~v~~~   82 (116)
                      ++||.+++++++++
T Consensus        20 lIDVvfvLLiFFmv   33 (141)
T PRK11024         20 LLDVLLVLLLIFMA   33 (141)
T ss_pred             HHHHHHHHHHHHHh
Confidence            78999877766543


No 83 
>PLN02173 UDP-glucosyl transferase family protein
Probab=34.71  E-value=52  Score=27.22  Aligned_cols=28  Identities=7%  Similarity=0.061  Sum_probs=20.9

Q ss_pred             chHHHHHHHHhcCHH---HHHHHHHHHHHHh
Q psy5669           2 RSSRRKVQPACMSPQ---YKTNVQRLSTIFR   29 (116)
Q Consensus         2 ~~l~~aI~~vl~n~~---Yk~na~~ls~~~~   29 (116)
                      +.+.+++++++.|++   .+++|++++...+
T Consensus       396 e~v~~av~~vm~~~~~~~~r~~a~~~~~~a~  426 (449)
T PLN02173        396 EEIEFSIKEVMEGEKSKEMKENAGKWRDLAV  426 (449)
T ss_pred             HHHHHHHHHHhcCChHHHHHHHHHHHHHHHH
Confidence            568999999998865   4667777666555


No 84 
>cd03814 GT1_like_2 This family is most closely related to the GT1 family of glycosyltransferases. Glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. The two domains have high structural homology despite minimal sequence homolog
Probab=34.69  E-value=1.5e+02  Score=21.68  Aligned_cols=42  Identities=7%  Similarity=-0.069  Sum_probs=27.7

Q ss_pred             chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHHH
Q psy5669           2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSIE   44 (116)
Q Consensus         2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wiE   44 (116)
                      +.+.++|.++++|++..+.+.+-+.... +..+..+.+-.+++
T Consensus       320 ~~l~~~i~~l~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~  361 (364)
T cd03814         320 EAFAAALAALLADPELRRRMAARARAEA-ERRSWEAFLDNLLE  361 (364)
T ss_pred             HHHHHHHHHHHcCHHHHHHHHHHHHHHH-hhcCHHHHHHHHHH
Confidence            3478899999999987777666555554 34565555544443


No 85 
>PRK14750 kdpF potassium-transporting ATPase subunit F; Provisional
Probab=34.45  E-value=72  Score=16.47  Aligned_cols=22  Identities=18%  Similarity=0.297  Sum_probs=10.6

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHH
Q psy5669          69 QLDVIASCTLGLGLVLYTLARI   90 (116)
Q Consensus        69 ~lDV~~~l~~v~~~~~~~~~~~   90 (116)
                      +++++.=.+++++++.++++.+
T Consensus         2 s~~vi~g~llv~lLl~YLvYAL   23 (29)
T PRK14750          2 NFSIVCGALLVLLLLGYLVYAL   23 (29)
T ss_pred             cHHHHHHHHHHHHHHHHHHHHH
Confidence            3455544444444445554443


No 86 
>PF05225 HTH_psq:  helix-turn-helix, Psq domain;  InterPro: IPR007889 This DNA-binding motif is found in four copies in the pipsqueak protein of Drosophila melanogaster []. In pipsqueak this domain binds to GAGA sequence []. The pipsqueak family, which includes proteins from fungi, sea urchins, nematodes, insects, and vertebrates appear to be proteins essential for sequence-specific targeting of a polycomb group protein complex [].; GO: 0003677 DNA binding; PDB: 2COB_A.
Probab=34.28  E-value=63  Score=17.92  Aligned_cols=23  Identities=0%  Similarity=0.105  Sum_probs=17.0

Q ss_pred             chHHHHHHHHhcC-HHHHHHHHHH
Q psy5669           2 RSSRRKVQPACMS-PQYKTNVQRL   24 (116)
Q Consensus         2 ~~l~~aI~~vl~n-~~Yk~na~~l   24 (116)
                      +.|..||..|.++ -++++.|+..
T Consensus         3 e~l~~Ai~~v~~g~~S~r~AA~~y   26 (45)
T PF05225_consen    3 EDLQKAIEAVKNGKMSIRKAAKKY   26 (45)
T ss_dssp             HHHHHHHHHHHTTSS-HHHHHHHH
T ss_pred             HHHHHHHHHHHhCCCCHHHHHHHH
Confidence            5688999999877 6777777664


No 87 
>PF05568 ASFV_J13L:  African swine fever virus J13L protein;  InterPro: IPR008385 This family consists of several African swine fever virus (ASFV) j13L proteins [, , ].
Probab=34.25  E-value=98  Score=22.31  Aligned_cols=11  Identities=9%  Similarity=0.501  Sum_probs=6.0

Q ss_pred             CHHHHHHHHHH
Q psy5669          63 PWYQTSQLDVI   73 (116)
Q Consensus        63 ~~~q~~~lDV~   73 (116)
                      +|+.-|++-++
T Consensus        25 sffsthm~tIL   35 (189)
T PF05568_consen   25 SFFSTHMYTIL   35 (189)
T ss_pred             cHHHHHHHHHH
Confidence            56665555444


No 88 
>PF02197 RIIa:  Regulatory subunit of type II PKA R-subunit;  InterPro: IPR003117 Protein phosphorylation, which plays a key role in most cellular activities, is a reversible process mediated by protein kinases and phosphoprotein phosphatases. Protein kinases catalyse the transfer of the gamma phosphate from nucleotide triphosphates (often ATP) to one or more amino acid residues in a protein substrate side chain, resulting in a conformational change affecting protein function. Phosphoprotein phosphatases catalyse the reverse process. Protein kinases fall into three broad classes, characterised with respect to substrate specificity []:   Serine/threonine-protein kinases Tyrosine-protein kinases Dual specific protein kinases (e.g. MEK - phosphorylates both Thr and Tyr on target proteins)   Protein kinase function has been evolutionarily conserved from Escherichia coli to human []. Protein kinases play a role in a multitude of cellular processes, including division, proliferation, apoptosis, and differentiation []. Phosphorylation usually results in a functional change of the target protein by changing enzyme activity, cellular location, or association with other proteins. The catalytic subunits of protein kinases are highly conserved, and several structures have been solved [], leading to large screens to develop kinase-specific inhibitors for the treatments of a number of diseases [].  In the absence of cAMP, Protein Kinase A (PKA) exists as an equimolar tetramer of regulatory (R) and catalytic (C) subunits []. In addition to its role as an inhibitor of the C subunit, the R subunit anchors the holoenzyme to specific intracellular locations and prevents the C subunit from entering the nucleus. All R subunits have a conserved domain structure consisting of the N-terminal dimerization domain, inhibitory region, cAMP-binding domain A and cAMP-binding domain B. R subunits interact with C subunits primarily through the inhibitory site. The cAMP-binding domains show extensive sequence similarity and bind cAMP cooperatively.  Two types of regulatory (R) subunits exist - types I and I - which differ in molecular weight, sequence, autophosphorylation cabaility, cellular location and tissue distribution. Types I and II were further sub-divided into alpha and beta subtypes, based mainly on sequence similarity. This entry represents types I-alpha, I-beta, II-alpha and II-beta regulatory subunits of PKA proteins. These subunits contain the dimerisation interface and binding site for A-kinase-anchoring proteins (AKAPs).; GO: 0008603 cAMP-dependent protein kinase regulator activity, 0007165 signal transduction; PDB: 2IZY_E 1R2A_A 1L6E_A 2IZX_B 2KYG_A 2EZW_B 3IM4_B 3IM3_A 4F9K_C 2HWN_B ....
Probab=34.13  E-value=80  Score=16.95  Aligned_cols=24  Identities=25%  Similarity=0.235  Sum_probs=16.5

Q ss_pred             HHHHhcCCCCHHHHHHHHHHHHHH
Q psy5669          25 STIFRSEPLHPLQKAIRSIEYVIA   48 (116)
Q Consensus        25 s~~~~d~p~~P~~~av~wiE~v~r   48 (116)
                      ..+++++|-++++=+..|.+-..+
T Consensus        12 ~~vl~~qP~Di~~F~a~yF~~L~~   35 (38)
T PF02197_consen   12 REVLREQPDDILQFAADYFEKLEK   35 (38)
T ss_dssp             HHHHHH--S-HHHHHHHHHHHHHH
T ss_pred             HHHHHHCCCcHHHHHHHHHHHHHH
Confidence            677889999999988888865544


No 89 
>PRK12446 undecaprenyldiphospho-muramoylpentapeptide beta-N-acetylglucosaminyltransferase; Reviewed
Probab=33.73  E-value=82  Score=24.80  Aligned_cols=37  Identities=22%  Similarity=0.334  Sum_probs=24.6

Q ss_pred             chHHHHHHHHhcCH-HHHHHHHHHHHHHhcCCCCHHHHHHHHHH
Q psy5669           2 RSSRRKVQPACMSP-QYKTNVQRLSTIFRSEPLHPLQKAIRSIE   44 (116)
Q Consensus         2 ~~l~~aI~~vl~n~-~Yk~na~~ls~~~~d~p~~P~~~av~wiE   44 (116)
                      +.+.++|.++++|+ .|++++++++     .| ++.+..+.+++
T Consensus       313 ~~l~~~l~~ll~~~~~~~~~~~~~~-----~~-~aa~~i~~~i~  350 (352)
T PRK12446        313 NSLIKHVEELSHNNEKYKTALKKYN-----GK-EAIQTIIDHIS  350 (352)
T ss_pred             HHHHHHHHHHHcCHHHHHHHHHHcC-----CC-CHHHHHHHHHH
Confidence            46788999999887 6776665533     33 55666666554


No 90 
>cd03808 GT1_cap1E_like This family is most closely related to the GT1 family of glycosyltransferases. cap1E in Streptococcus pneumoniae is required for the synthesis of type 1 capsular polysaccharides.
Probab=31.88  E-value=1e+02  Score=22.30  Aligned_cols=39  Identities=5%  Similarity=0.035  Sum_probs=23.0

Q ss_pred             chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHH
Q psy5669           2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAI   40 (116)
Q Consensus         2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av   40 (116)
                      +++.++|.++++|+...+.+.+-+.-.-.+..++...+-
T Consensus       317 ~~~~~~i~~l~~~~~~~~~~~~~~~~~~~~~~s~~~~~~  355 (359)
T cd03808         317 EALADAIERLIEDPELRARMGQAARKRAEEEFDEEIVVK  355 (359)
T ss_pred             HHHHHHHHHHHhCHHHHHHHHHHHHHHHHHhcCHHHHHH
Confidence            457788888888887665554444333333445544443


No 91 
>TIGR02801 tolR TolR protein. The model describes the inner membrane protein TolR, part of the TolR/TolQ complex that transduces energy from the proton-motive force, through TolA, to an outer membrane complex made up of TolB and Pal (peptidoglycan-associated lipoprotein). The complex is required to maintain outer membrane integrity, and defects may cause a defect in the import of some organic compounds in addition to the resulting morphologic. While several gene pairs homologous to talR and tolQ may be found in a single genome, but the scope of this model is set to favor finding only bone fide TolR, supported by operon structure as well as by score.
Probab=31.84  E-value=44  Score=22.31  Aligned_cols=13  Identities=15%  Similarity=0.066  Sum_probs=9.2

Q ss_pred             HHHHHHHHHHHHH
Q psy5669          69 QLDVIASCTLGLG   81 (116)
Q Consensus        69 ~lDV~~~l~~v~~   81 (116)
                      ++||.++++++++
T Consensus        10 lIDVvFlLLiFFm   22 (129)
T TIGR02801        10 YVDVMLVLLIIFM   22 (129)
T ss_pred             HHHHHHHHHHHHH
Confidence            6899877765554


No 92 
>TIGR02804 ExbD_2 TonB system transport protein ExbD, group 2. Members of this family are Gram-negative bacterial inner membrane proteins, generally designated ExbD, related to the TolR family modeled by TIGRFAMs TIGR02801. Members always are encoded next to a protein designated ExbB (TIGR02797), related to the TolQ family modeled by TIGRFAMs TIGR02796. ExbD and ExbB together form a proton channel through which they can harness the proton-motive force to energize TonB, which in turn energizes TonB-dependent receptors in the outer membrane. TonB-dependent receptors with known specificity tend to import siderophores or vitamin B12. A TonB system and Tol-Pal system often will co-exist in a single bacterial genome.
Probab=31.74  E-value=47  Score=22.11  Aligned_cols=13  Identities=8%  Similarity=-0.069  Sum_probs=9.1

Q ss_pred             HHHHHHHHHHHHH
Q psy5669          69 QLDVIASCTLGLG   81 (116)
Q Consensus        69 ~lDV~~~l~~v~~   81 (116)
                      ++||.++++++++
T Consensus         9 lIDVvflLLiFFm   21 (121)
T TIGR02804         9 FIDIMLVLLAIVL   21 (121)
T ss_pred             HHHHHHHHHHHHH
Confidence            6899877765554


No 93 
>PF12626 PolyA_pol_arg_C:  Polymerase A arginine-rich C-terminus; PDB: 3AQN_A 3AQK_A 3AQM_B 3AQL_B.
Probab=31.68  E-value=54  Score=22.44  Aligned_cols=38  Identities=11%  Similarity=-0.061  Sum_probs=21.3

Q ss_pred             HHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHH
Q psy5669           9 QPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSIEYVI   47 (116)
Q Consensus         9 ~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wiE~v~   47 (116)
                      ..+++.|.|+..-.-|.---.. --...+.+.||.+|.-
T Consensus        55 ~~ll~hprFRAAyDFL~LR~~~-ge~~~~l~~WW~~fq~   92 (124)
T PF12626_consen   55 FRLLEHPRFRAAYDFLLLRAEA-GEELSELAEWWTEFQE   92 (124)
T ss_dssp             HHHHHSTTHHHHHHHHHHHHHH--HHHHHHHHHHHHHTT
T ss_pred             HHHHhCchHHHHHHHHHHHHHh-CCCcHHHHHHHHHHHh
Confidence            3556677777654333211111 1156788889988863


No 94 
>PF11417 Inhibitor_G39P:  Loader and inhibitor of phage G40P;  InterPro: IPR024424 G39P inhibits the initiation of DNA replication by blocking G40P replicative helicase. G39P has a bipartite stricture consisting of a folded N-terminal domain and an unfolded C-terminal domain. The C-terminal is essential for helicase interaction [].; PDB: 1NO1_B.
Probab=31.64  E-value=56  Score=20.21  Aligned_cols=31  Identities=10%  Similarity=0.073  Sum_probs=20.3

Q ss_pred             HHHHHHhcCHHHHHHHHHHHHHHhcCCCCHH
Q psy5669           6 RKVQPACMSPQYKTNVQRLSTIFRSEPLHPL   36 (116)
Q Consensus         6 ~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~   36 (116)
                      ++=.++|+|-.|....+.+...+...+++|.
T Consensus        34 ~~W~~~L~d~~ye~v~~al~~~i~~~kfPPs   64 (71)
T PF11417_consen   34 DLWYDMLKDYDYEIVMKALKKHIATNKFPPS   64 (71)
T ss_dssp             HHHHHHHTTS-HHHHHHHHHHHHHH-SS---
T ss_pred             HHHHHHHHhCCHHHHHHHHHHHHHhCCCCcC
Confidence            3344778888888888888888888887663


No 95 
>TIGR03147 cyt_nit_nrfF cytochrome c nitrite reductase, accessory protein NrfF.
Probab=31.47  E-value=1.8e+02  Score=20.12  Aligned_cols=30  Identities=3%  Similarity=-0.308  Sum_probs=18.5

Q ss_pred             CCHHHHHHHHHHHHHHhCCCCCCcccCCCCCHH
Q psy5669          33 LHPLQKAIRSIEYVIAHRGAPHLKTKATSLPWY   65 (116)
Q Consensus        33 ~~P~~~av~wiE~v~r~~g~~~l~~~~~~l~~~   65 (116)
                      .|..|..-+   ++.|+|+.-.++|+-...+|+
T Consensus        74 ~Sd~eI~~~---~v~RYG~~Vly~Pp~~~~t~~  103 (126)
T TIGR03147        74 KSNQQIIDF---MTARFGDFVLYNPPFKWQTLL  103 (126)
T ss_pred             CCHHHHHHH---HHHhcCCeEEecCCCCcchHH
Confidence            344444333   466888877788876655553


No 96 
>PHA02849 putative transmembrane protein; Provisional
Probab=31.37  E-value=1e+02  Score=19.73  Aligned_cols=20  Identities=20%  Similarity=0.247  Sum_probs=8.1

Q ss_pred             HHHHHHHHHHHHHHHHHHHH
Q psy5669          72 VIASCTLGLGLVLYTLARIG   91 (116)
Q Consensus        72 V~~~l~~v~~~~~~~~~~~~   91 (116)
                      |+.++++++.++++++..+.
T Consensus        19 vi~v~v~vI~i~~flLlyLv   38 (82)
T PHA02849         19 VILVFVLVISFLAFMLLYLI   38 (82)
T ss_pred             HHHHHHHHHHHHHHHHHHHH
Confidence            33333333334444444433


No 97 
>PHA00646 hypothetical protein
Probab=31.18  E-value=1.2e+02  Score=18.51  Aligned_cols=25  Identities=8%  Similarity=-0.020  Sum_probs=15.5

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHH
Q psy5669          69 QLDVIASCTLGLGLVLYTLARIGTS   93 (116)
Q Consensus        69 ~lDV~~~l~~v~~~~~~~~~~~~~~   93 (116)
                      ++-++++++....+.++.++++|+.
T Consensus        39 MVgIWlvI~Fl~Wf~i~mvfKiv~~   63 (65)
T PHA00646         39 MVGIWLVILFLTWFSLWMVFKIVGY   63 (65)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHhe
Confidence            4455666666666666777777654


No 98 
>PF09959 DUF2193:  Uncharacterized protein conserved in archaea (DUF2193);  InterPro: IPR018694 This family of various hypothetical archaeal proteins has no known function
Probab=31.07  E-value=1.5e+02  Score=24.88  Aligned_cols=40  Identities=13%  Similarity=0.172  Sum_probs=32.1

Q ss_pred             HHHHHh--cCHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHhCCC
Q psy5669           7 KVQPAC--MSPQYKTNVQRLSTIFRSEPLHPLQKAIRSIEYVIAHRGA   52 (116)
Q Consensus         7 aI~~vl--~n~~Yk~na~~ls~~~~d~p~~P~~~av~wiE~v~r~~g~   52 (116)
                      +|.|||  +||+|++.+.++-..+.      ...++---|-+-|+||+
T Consensus        93 ~ilEILy~eD~~F~ks~~kfi~~I~------~sealIg~E~~RrygGF  134 (499)
T PF09959_consen   93 AILEILYEEDPAFRKSVEKFIEAIG------KSEALIGKESARRYGGF  134 (499)
T ss_pred             HHHHHHHhcCHHHHHHHHHHHHHHh------hhHHHhhHHHHHHhcCc
Confidence            677888  89999999999988775      34566677888899884


No 99 
>PRK11267 biopolymer transport protein ExbD; Provisional
Probab=30.89  E-value=45  Score=22.91  Aligned_cols=14  Identities=14%  Similarity=0.107  Sum_probs=10.1

Q ss_pred             HHHHHHHHHHHHHH
Q psy5669          69 QLDVIASCTLGLGL   82 (116)
Q Consensus        69 ~lDV~~~l~~v~~~   82 (116)
                      ++||++++++++++
T Consensus        23 lIDVvf~LLiFFmv   36 (141)
T PRK11267         23 FIDVMLVLLIIFMV   36 (141)
T ss_pred             HHHHHHHHHHHHHh
Confidence            78999877766543


No 100
>cd04962 GT1_like_5 This family is most closely related to the GT1 family of glycosyltransferases. Glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. The two domains have high structural homology despite minimal sequence homolog
Probab=30.85  E-value=1.8e+02  Score=21.78  Aligned_cols=45  Identities=4%  Similarity=-0.104  Sum_probs=28.9

Q ss_pred             chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHH
Q psy5669           2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSIEYV   46 (116)
Q Consensus         2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wiE~v   46 (116)
                      +.+.++|.++++||...+.+..-+...-.+.+++...+-.|.+..
T Consensus       324 ~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~fs~~~~~~~~~~~y  368 (371)
T cd04962         324 EAMAEYALSLLEDDELWQEFSRAARNRAAERFDSERIVPQYEALY  368 (371)
T ss_pred             HHHHHHHHHHHhCHHHHHHHHHHHHHHHHHhCCHHHHHHHHHHHH
Confidence            457788889998887655544433333335577777777776543


No 101
>PF14851 FAM176:  FAM176 family
Probab=30.70  E-value=1.1e+02  Score=21.79  Aligned_cols=16  Identities=6%  Similarity=-0.128  Sum_probs=10.2

Q ss_pred             HHHHHHHHHHhhhhcc
Q psy5669          85 YTLARIGTSFAEFTRG  100 (116)
Q Consensus        85 ~~~~~~~~~~~~~~~~  100 (116)
                      .++..+|-++.+..|+
T Consensus        33 GLlLtLcllV~risc~   48 (153)
T PF14851_consen   33 GLLLTLCLLVIRISCR   48 (153)
T ss_pred             HHHHHHHHHHhhheee
Confidence            3445566777777774


No 102
>PRK13718 conjugal transfer protein TrbE; Provisional
Probab=30.63  E-value=1.5e+02  Score=19.06  Aligned_cols=19  Identities=16%  Similarity=0.023  Sum_probs=7.9

Q ss_pred             HHHHHHhhhhccccccccC
Q psy5669          89 RIGTSFAEFTRGKDDKKAT  107 (116)
Q Consensus        89 ~~~~~~~~~~~~~~~~k~~  107 (116)
                      ++|........+.+++|+.
T Consensus        61 y~ffs~Ltkl~~~d~~ks~   79 (84)
T PRK13718         61 YFFFSALTKLQKHDERKSD   79 (84)
T ss_pred             HHHHHHHHHHHhcccccch
Confidence            3333333334444444443


No 103
>cd03807 GT1_WbnK_like This family is most closely related to the GT1 family of glycosyltransferases. WbnK in Shigella dysenteriae has been shown to be involved in the type 7 O-antigen biosynthesis.
Probab=30.33  E-value=1.5e+02  Score=21.44  Aligned_cols=42  Identities=5%  Similarity=-0.051  Sum_probs=23.4

Q ss_pred             chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHH
Q psy5669           2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSI   43 (116)
Q Consensus         2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wi   43 (116)
                      +.+.++|.++++|+...+...+-+...-.+-.++...+-.+.
T Consensus       320 ~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~~~  361 (365)
T cd03807         320 EALAEAIEALLADPALRQALGEAARERIEENFSIEAMVEAYE  361 (365)
T ss_pred             HHHHHHHHHHHhChHHHHHHHHHHHHHHHHhCCHHHHHHHHH
Confidence            467888999998876554443333333333345555444443


No 104
>PF05148 Methyltransf_8:  Hypothetical methyltransferase;  InterPro: IPR007823 This family consists of uncharacterised eukaryotic proteins which are related to S-adenosyl-L-methionine-dependent methyltransferases.; GO: 0008168 methyltransferase activity; PDB: 2ZFU_B.
Probab=30.13  E-value=1.6e+02  Score=22.39  Aligned_cols=38  Identities=11%  Similarity=0.247  Sum_probs=22.8

Q ss_pred             HHHHHhcCHH-HHHHHHHHHHHHhcCCCCHHHHHHHHHH
Q psy5669           7 KVQPACMSPQ-YKTNVQRLSTIFRSEPLHPLQKAIRSIE   44 (116)
Q Consensus         7 aI~~vl~n~~-Yk~na~~ls~~~~d~p~~P~~~av~wiE   44 (116)
                      |.+-.-+||+ |..==.-+++....-|..|+|..+.|+.
T Consensus        30 A~~lf~~dP~~F~~YH~Gfr~Qv~~WP~nPvd~iI~~l~   68 (219)
T PF05148_consen   30 ALKLFQEDPELFDIYHEGFRQQVKKWPVNPVDVIIEWLK   68 (219)
T ss_dssp             HHHHHHH-HHHHHHHHHHHHHHHCTSSS-HHHHHHHHHC
T ss_pred             HHHHHHhCHHHHHHHHHHHHHHHhcCCCCcHHHHHHHHH
Confidence            3344444553 3333334455667789999999999987


No 105
>PRK13608 diacylglycerol glucosyltransferase; Provisional
Probab=29.75  E-value=2.8e+02  Score=21.87  Aligned_cols=44  Identities=7%  Similarity=-0.071  Sum_probs=31.0

Q ss_pred             chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHH
Q psy5669           2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSIEYV   46 (116)
Q Consensus         2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wiE~v   46 (116)
                      +.+.++|.++++|+.-.+.+.+-+... .+|.+..+.+-.|.+.+
T Consensus       326 ~~l~~~i~~ll~~~~~~~~m~~~~~~~-~~~~s~~~i~~~l~~l~  369 (391)
T PRK13608        326 EEAIKIVASLTNGNEQLTNMISTMEQD-KIKYATQTICRDLLDLI  369 (391)
T ss_pred             HHHHHHHHHHhcCHHHHHHHHHHHHHh-cCCCCHHHHHHHHHHHh
Confidence            457889999999987666555544443 45778877777776655


No 106
>PF00957 Synaptobrevin:  Synaptobrevin;  InterPro: IPR001388 Synaptobrevin is an intrinsic membrane protein of small synaptic vesicles [], specialised secretory organelles of neurons that actively accumulate neurotransmitters and participate in their calcium-dependent release by exocytosis. Vesicle function is mediated by proteins in their membranes, although the precise nature of the protein-protein interactions underlying this are still uncertain []. Synaptobrevin may play a role in the molecular events underlying neurotransmitter release and vesicle recycling and may be involved in the regulation of membrane flow in the nerve terminal, a process mediated by interaction with low molecular weight GTP-binding proteins []. Synaptic vesicle-associated membrane proteins (VAMPs) from Torpedo californica (Pacific electric ray) and SNC1 from yeast are related to synaptobrevin.; GO: 0016192 vesicle-mediated transport, 0016021 integral to membrane; PDB: 3EGX_C 2NUP_C 3EGD_C 2NUT_C 1IOU_A 1H8M_A 3B5N_A 3ZYM_A 2NPS_A 1SFC_E ....
Probab=29.43  E-value=88  Score=19.46  Aligned_cols=11  Identities=27%  Similarity=0.830  Sum_probs=5.9

Q ss_pred             HHHHHHHHHHH
Q psy5669          64 WYQTSQLDVIA   74 (116)
Q Consensus        64 ~~q~~~lDV~~   74 (116)
                      ||+.+-+=+++
T Consensus        62 ~~~~~k~~~i~   72 (89)
T PF00957_consen   62 WWRNYKLYIII   72 (89)
T ss_dssp             HHHHHHHHHHH
T ss_pred             HHHHHHHHHhH
Confidence            56666554443


No 107
>cd03795 GT1_like_4 This family is most closely related to the GT1 family of glycosyltransferases. Glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP-linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. The two domains have high structural homology despite minimal sequence homolog
Probab=29.35  E-value=78  Score=23.47  Aligned_cols=25  Identities=4%  Similarity=0.141  Sum_probs=18.3

Q ss_pred             chHHHHHHHHhcCHHHHHHHHHHHH
Q psy5669           2 RSSRRKVQPACMSPQYKTNVQRLST   26 (116)
Q Consensus         2 ~~l~~aI~~vl~n~~Yk~na~~ls~   26 (116)
                      +++.++|.++++|++.++.+.+-+.
T Consensus       320 ~~~~~~i~~l~~~~~~~~~~~~~~~  344 (357)
T cd03795         320 AALAEAIRRLLEDPELRERLGEAAR  344 (357)
T ss_pred             HHHHHHHHHHHHCHHHHHHHHHHHH
Confidence            4688999999999976665554433


No 108
>PRK13803 bifunctional phosphoribosylanthranilate isomerase/tryptophan synthase subunit beta; Provisional
Probab=28.64  E-value=65  Score=27.72  Aligned_cols=37  Identities=19%  Similarity=0.362  Sum_probs=29.5

Q ss_pred             chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHH
Q psy5669           2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKA   39 (116)
Q Consensus         2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~a   39 (116)
                      +.|.++..++.+||+|++....+-.-+..+| +|+-.+
T Consensus       241 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~grp-TPL~~~  277 (610)
T PRK13803        241 QELQESYTKIIKSNEFQKTFKRLLQNYAGRP-TPLTEA  277 (610)
T ss_pred             HHHHHHHHHHhcCHHHHHHHHHHHHHhCCCC-CcceeH
Confidence            5678888899999999999999988886666 555433


No 109
>PHA02657 hypothetical protein; Provisional
Probab=28.60  E-value=1e+02  Score=20.10  Aligned_cols=23  Identities=13%  Similarity=0.184  Sum_probs=10.6

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHH
Q psy5669          67 TSQLDVIASCTLGLGLVLYTLAR   89 (116)
Q Consensus        67 ~~~lDV~~~l~~v~~~~~~~~~~   89 (116)
                      +.++=|+.++++++.++++++..
T Consensus        24 ~~~imVitvfv~vI~il~flLLY   46 (95)
T PHA02657         24 FESILVFTIFIFVVCILIYLLIY   46 (95)
T ss_pred             chhhhHHHHHHHHHHHHHHHHHH
Confidence            44555554444444444444433


No 110
>PF10854 DUF2649:  Protein of unknown function (DUF2649);  InterPro: IPR021217 This entry is represented by Spiroplasma phage 1-C74, Orf10. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.  Members in this family of proteins are also annotated as Plectrovirus orf 10 transmembrane proteins however currently no function is known. 
Probab=28.59  E-value=1.4e+02  Score=18.15  Aligned_cols=33  Identities=12%  Similarity=0.050  Sum_probs=19.9

Q ss_pred             CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy5669          60 TSLPWYQTSQLDVIASCTLGLGLVLYTLARIGT   92 (116)
Q Consensus        60 ~~l~~~q~~~lDV~~~l~~v~~~~~~~~~~~~~   92 (116)
                      ...+-|--.++-++++++.+..+.+..+++++.
T Consensus        32 ltqneYlt~MiGiWiVilFLtWf~lwm~fKiv~   64 (67)
T PF10854_consen   32 LTQNEYLTIMIGIWIVILFLTWFLLWMVFKIVG   64 (67)
T ss_pred             cccchhHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence            344445555666776666666666666666654


No 111
>PRK05749 3-deoxy-D-manno-octulosonic-acid transferase; Reviewed
Probab=28.33  E-value=1.9e+02  Score=22.90  Aligned_cols=43  Identities=14%  Similarity=0.196  Sum_probs=26.9

Q ss_pred             chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHHH
Q psy5669           2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSIE   44 (116)
Q Consensus         2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wiE   44 (116)
                      +.+.++|.++++||.-++.+.+-+..+-.+.....+..+..++
T Consensus       376 ~~La~~l~~ll~~~~~~~~m~~~a~~~~~~~~~~~~~~~~~l~  418 (425)
T PRK05749        376 EDLAKAVTYLLTDPDARQAYGEAGVAFLKQNQGALQRTLQLLE  418 (425)
T ss_pred             HHHHHHHHHHhcCHHHHHHHHHHHHHHHHhCccHHHHHHHHHH
Confidence            4678899999999876665554444333333455566655554


No 112
>PF11157 DUF2937:  Protein of unknown function (DUF2937);  InterPro: IPR022584  This family of proteins with unknown function appears to be found mainly in Proteobacteria. 
Probab=28.25  E-value=1e+02  Score=22.08  Aligned_cols=26  Identities=4%  Similarity=0.221  Sum_probs=16.4

Q ss_pred             HHHHHHHH--hcCHHHHHHHHHHHHHHh
Q psy5669           4 SRRKVQPA--CMSPQYKTNVQRLSTIFR   29 (116)
Q Consensus         4 l~~aI~~v--l~n~~Yk~na~~ls~~~~   29 (116)
                      +..+|...  -+||..++.+..+..++.
T Consensus        58 ~~~li~~~~~s~dp~~~~~a~~~~~~~~   85 (167)
T PF11157_consen   58 REALIAHYRQSSDPVFRARAESMQATIE   85 (167)
T ss_pred             HHHHHHHHHhCCCHHHHhhHHHHHHHHH
Confidence            34445544  267778888777777665


No 113
>PF04716 ETC_C1_NDUFA5:  ETC complex I subunit conserved region;  InterPro: IPR006806 This is a family of eukaryotic NADH-ubiquinone oxidoreductase subunits (1.6.5.3 from EC) (1.6.99.3 from EC) from complex I of the electron transport chain initially identified in Neurospora crassa as a 29.9 kDa protein. The conserved region is found at the N terminus of the member proteins [].; GO: 0016651 oxidoreductase activity, acting on NADH or NADPH, 0022904 respiratory electron transport chain, 0005743 mitochondrial inner membrane
Probab=28.18  E-value=68  Score=18.97  Aligned_cols=23  Identities=4%  Similarity=0.199  Sum_probs=18.2

Q ss_pred             hHHHHHHHHhcCHHHHHHHHHHH
Q psy5669           3 SSRRKVQPACMSPQYKTNVQRLS   25 (116)
Q Consensus         3 ~l~~aI~~vl~n~~Yk~na~~ls   25 (116)
                      .+.+++.++-+|..||+.+..+-
T Consensus        13 ~~L~~L~~~P~~a~YR~~tE~it   35 (57)
T PF04716_consen   13 KTLKALKKIPEDAAYRQYTEAIT   35 (57)
T ss_pred             HHHHHHHhCCCccHHHHHHHHHH
Confidence            45677788888889999988764


No 114
>COG3036 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=27.84  E-value=76  Score=19.48  Aligned_cols=20  Identities=10%  Similarity=0.358  Sum_probs=16.4

Q ss_pred             HHHHHHhcCHHHHHHHHHHH
Q psy5669           6 RKVQPACMSPQYKTNVQRLS   25 (116)
Q Consensus         6 ~aI~~vl~n~~Yk~na~~ls   25 (116)
                      +|+..++.||-|+...++-+
T Consensus        15 NalkALv~~pLFR~RvEk~k   34 (66)
T COG3036          15 NALKALVHDPLFRQRVEKKK   34 (66)
T ss_pred             hHHHHHHcCHHHHHHHHHhh
Confidence            68888999999998877743


No 115
>PRK13609 diacylglycerol glucosyltransferase; Provisional
Probab=27.66  E-value=1.7e+02  Score=22.65  Aligned_cols=43  Identities=5%  Similarity=-0.142  Sum_probs=27.2

Q ss_pred             chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHH
Q psy5669           2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSIEY   45 (116)
Q Consensus         2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wiE~   45 (116)
                      +.+.++|.++++||..++.+.+-+..+. +|.+..+.+-.+.+.
T Consensus       326 ~~l~~~i~~ll~~~~~~~~m~~~~~~~~-~~~s~~~i~~~i~~~  368 (380)
T PRK13609        326 EEVFAKTEALLQDDMKLLQMKEAMKSLY-LPEPADHIVDDILAE  368 (380)
T ss_pred             HHHHHHHHHHHCCHHHHHHHHHHHHHhC-CCchHHHHHHHHHHh
Confidence            4678899999999988777665444333 444544444444443


No 116
>TIGR03449 mycothiol_MshA UDP-N-acetylglucosamine: 1L-myo-inositol-1-phosphate 1-alpha-D-N-acetylglucosaminyltransferase. Members of this protein family, found exclusively in the Actinobacteria, are MshA, the glycosyltransferase of mycothiol biosynthesis. Mycothiol replaces glutathione in these species.
Probab=27.51  E-value=2.6e+02  Score=21.58  Aligned_cols=44  Identities=5%  Similarity=-0.101  Sum_probs=27.9

Q ss_pred             chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHH
Q psy5669           2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSIEYV   46 (116)
Q Consensus         2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wiE~v   46 (116)
                      +.+.++|.++++|+..++.+..-+.... +.++....+-.+.+..
T Consensus       356 ~~la~~i~~~l~~~~~~~~~~~~~~~~~-~~fsw~~~~~~~~~~y  399 (405)
T TIGR03449       356 ADWADALARLLDDPRTRIRMGAAAVEHA-AGFSWAATADGLLSSY  399 (405)
T ss_pred             HHHHHHHHHHHhCHHHHHHHHHHHHHHH-HhCCHHHHHHHHHHHH
Confidence            4677889999988876655544443333 2367666666665544


No 117
>cd03818 GT1_ExpC_like This family is most closely related to the GT1 family of glycosyltransferases. ExpC in Rhizobium meliloti has been shown to be involved in the biosynthesis of galactoglucan (exopolysaccharide II).
Probab=27.31  E-value=1.2e+02  Score=23.57  Aligned_cols=25  Identities=12%  Similarity=0.133  Sum_probs=17.3

Q ss_pred             chHHHHHHHHhcCHHHHHHHHHHHH
Q psy5669           2 RSSRRKVQPACMSPQYKTNVQRLST   26 (116)
Q Consensus         2 ~~l~~aI~~vl~n~~Yk~na~~ls~   26 (116)
                      +.+.++|.++++|+..++.+.+-+.
T Consensus       354 ~~la~~i~~ll~~~~~~~~l~~~ar  378 (396)
T cd03818         354 DALAAAVIELLDDPARRARLRRAAR  378 (396)
T ss_pred             HHHHHHHHHHHhCHHHHHHHHHHHH
Confidence            4678889999988876555544333


No 118
>PF04316 FlgM:  Anti-sigma-28 factor, FlgM;  InterPro: IPR007412 FlgM binds and inhibits the activity of the transcription factor sigma 28. Inhibition of sigma 28 prevents the expression of genes from flagellar transcriptional class 3, which include genes for the filament and chemotaxis. Correctly assembled basal body-hook structures export FlgM, relieving inhibition of sigma 28 and allowing expression of class 3 genes. NMR studies show that free FlgM is mostly unfolded, which may facilitate its export. The C-terminal half of FlgM adopts a tertiary structure when it binds to sigma 28. All mutations in FlgM that prevent sigma 28 inhibition affect the C-terminal domain and is the region thought to constitute the binding domain. A minimal binding domain has been identified between Glu 64 and Arg 88 in Salmonella typhimurium (P26477 from SWISSPROT).The N-terminal portion remains unstructured and may be necessary for recognition by the export machinery [].; GO: 0045892 negative regulation of transcription, DNA-dependent, 0019861 flagellum; PDB: 1RP3_B 1SC5_B.
Probab=26.95  E-value=72  Score=18.47  Aligned_cols=25  Identities=16%  Similarity=0.315  Sum_probs=15.2

Q ss_pred             HHHHHHHhcCHHHHHHHHHHHHHHh
Q psy5669           5 RRKVQPACMSPQYKTNVQRLSTIFR   29 (116)
Q Consensus         5 ~~aI~~vl~n~~Yk~na~~ls~~~~   29 (116)
                      ...|++-+.+.+|+-++.+++.-+.
T Consensus        30 V~~ik~~I~~G~Y~vd~~~iA~~ml   54 (57)
T PF04316_consen   30 VAEIKAAIASGTYKVDAEKIAEKML   54 (57)
T ss_dssp             HHHHHHHHHTT-----HHHHHHHHH
T ss_pred             HHHHHHHHHcCCCCCCHHHHHHHHH
Confidence            4678888889999999888877554


No 119
>TIGR02803 ExbD_1 TonB system transport protein ExbD, group 1. Members of this family are Gram-negative bacterial inner membrane proteins, generally designated ExbD, related to the TolR family modeled by TIGRFAMs TIGR02801. Members always are encoded next to a protein designated ExbB (TIGR02797), related to the TolQ family modeled by TIGRFAMs TIGR02796. ExbD and ExbB together form a proton channel through which they can harness the proton-motive force to energize TonB, which in turn energizes TonB-dependent receptors in the outer membrane. TonB-dependent receptors with known specificity tend to import siderophores or vitamin B12. A TonB system and Tol-Pal system often will co-exist in a single bacterial genome.
Probab=26.81  E-value=62  Score=21.47  Aligned_cols=14  Identities=14%  Similarity=0.107  Sum_probs=9.8

Q ss_pred             HHHHHHHHHHHHHH
Q psy5669          69 QLDVIASCTLGLGL   82 (116)
Q Consensus        69 ~lDV~~~l~~v~~~   82 (116)
                      ++||.+++++++++
T Consensus         9 lIDVvflLLiFFmv   22 (122)
T TIGR02803         9 FIDVMLVLLIIFMV   22 (122)
T ss_pred             HHHHHHHHHHHHHH
Confidence            68998777666543


No 120
>PF10577 UPF0560:  Uncharacterised protein family UPF0560;  InterPro: IPR018890  This family of proteins has no known function. 
Probab=26.71  E-value=1.1e+02  Score=27.47  Aligned_cols=8  Identities=0%  Similarity=-0.147  Sum_probs=3.7

Q ss_pred             HHhhhhcc
Q psy5669          93 SFAEFTRG  100 (116)
Q Consensus        93 ~~~~~~~~  100 (116)
                      +.||+.|.
T Consensus       295 ~yCrrkc~  302 (807)
T PF10577_consen  295 CYCRRKCL  302 (807)
T ss_pred             HhhhcccC
Confidence            33555444


No 121
>COG4455 ImpE Protein of avirulence locus involved in temperature-dependent protein secretion [General function prediction only]
Probab=26.37  E-value=1e+02  Score=23.95  Aligned_cols=35  Identities=9%  Similarity=0.129  Sum_probs=31.5

Q ss_pred             CchHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCH
Q psy5669           1 MRSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHP   35 (116)
Q Consensus         1 ~~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P   35 (116)
                      |.++...|.++|.+.+-++...-+...++.+|-+.
T Consensus         1 ~~Tl~~t~seLL~~~sL~dai~~a~~qVkakPtda   35 (273)
T COG4455           1 MMTLRDTISELLDDNSLQDAIGLARDQVKAKPTDA   35 (273)
T ss_pred             CCchHHHHHHHHHhccHHHHHHHHHHHHhcCCccc
Confidence            67889999999999999999999999999998653


No 122
>cd03792 GT1_Trehalose_phosphorylase Trehalose phosphorylase (TP) reversibly catalyzes trehalose synthesis and degradation from alpha-glucose-1-phosphate (alpha-Glc-1-P) and glucose. The catalyzing activity includes the phosphorolysis of trehalose, which produce alpha-Glc-1-P and glucose, and the subsequent synthesis of trehalose. This family is most closely related to the GT1 family of glycosyltransferases.
Probab=26.31  E-value=2.3e+02  Score=21.68  Aligned_cols=45  Identities=4%  Similarity=0.006  Sum_probs=29.0

Q ss_pred             chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHH
Q psy5669           2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSIEYV   46 (116)
Q Consensus         2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wiE~v   46 (116)
                      +.+..+|.++++|++.++.+.+-+.....+-++....+..|.+..
T Consensus       325 ~~~a~~i~~ll~~~~~~~~~~~~a~~~~~~~~s~~~~~~~~~~~~  369 (372)
T cd03792         325 EEAAVRILYLLRDPELRRKMGANAREHVRENFLITRHLKDYLYLI  369 (372)
T ss_pred             HHHHHHHHHHHcCHHHHHHHHHHHHHHHHHHcCHHHHHHHHHHHH
Confidence            456678889998988776655544444344566666666666543


No 123
>KOG4482|consensus
Probab=26.20  E-value=1.7e+02  Score=24.35  Aligned_cols=7  Identities=14%  Similarity=0.150  Sum_probs=3.3

Q ss_pred             ccCCCcc
Q psy5669         105 KATKPDR  111 (116)
Q Consensus       105 k~~~~~~  111 (116)
                      -.++|++
T Consensus       330 d~~ts~~  336 (449)
T KOG4482|consen  330 DDKTSDI  336 (449)
T ss_pred             cccchhh
Confidence            3345554


No 124
>PF06917 Pectate_lyase_2:  Periplasmic pectate lyase;  InterPro: IPR010702 This family consists of several Enterobacterial periplasmic pectate lyase proteins. A major virulence determinant of the plant-pathogenic enterobacterium Erwinia chrysanthemi is the production of pectate lyase enzymes that degrade plant cell walls [].; GO: 0016837 carbon-oxygen lyase activity, acting on polysaccharides, 0045490 pectin catabolic process, 0005737 cytoplasm; PDB: 2V8J_A 2V8K_A 2V8I_A.
Probab=26.13  E-value=93  Score=26.68  Aligned_cols=38  Identities=11%  Similarity=0.085  Sum_probs=25.2

Q ss_pred             HHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHH
Q psy5669           4 SRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRS   42 (116)
Q Consensus         4 l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~w   42 (116)
                      ..-++..|.+||+|++.|....+-+-|.-.++. -..+|
T Consensus        77 ~L~~LS~lTgd~~Y~q~A~~~~~yfldh~~D~~-GL~~W  114 (557)
T PF06917_consen   77 TLVGLSNLTGDPKYQQRAQEIIRYFLDHFVDDS-GLFYW  114 (557)
T ss_dssp             HHHHHHHHH--THHHHHHHHHHHHHHHHSB-TT-S-BS-
T ss_pred             HHHHHHhccCCHHHHHHHHHHHHHHHHhccCCC-cceee
Confidence            356788999999999999988888877755544 34444


No 125
>COG0848 ExbD Biopolymer transport protein [Intracellular trafficking and secretion]
Probab=25.97  E-value=66  Score=22.33  Aligned_cols=16  Identities=13%  Similarity=0.090  Sum_probs=11.6

Q ss_pred             HHHHHHHHHHHHHHHH
Q psy5669          69 QLDVIASCTLGLGLVL   84 (116)
Q Consensus        69 ~lDV~~~l~~v~~~~~   84 (116)
                      .+||.+++++++++..
T Consensus        21 lIDVmLVLLiiFmvta   36 (137)
T COG0848          21 LIDVMLVLLIIFMVTA   36 (137)
T ss_pred             HHHHHHHHHHHHHHhh
Confidence            6899988877766443


No 126
>PF08388 GIIM:  Group II intron, maturase-specific domain;  InterPro: IPR013597 This region is found mainly in various bacterial and archaeal species, but a few members of this family are expressed by fungal and chlamydomonal species. It has been implicated in the binding of intron RNA during reverse transcription and splicing []. 
Probab=25.88  E-value=1.2e+02  Score=18.08  Aligned_cols=9  Identities=11%  Similarity=0.434  Sum_probs=5.3

Q ss_pred             CHHHHHHHH
Q psy5669          63 PWYQTSQLD   71 (116)
Q Consensus        63 ~~~q~~~lD   71 (116)
                      +|..||..-
T Consensus        34 GW~nYy~~~   42 (80)
T PF08388_consen   34 GWANYYRIG   42 (80)
T ss_pred             HHHHhhcch
Confidence            366666653


No 127
>TIGR01133 murG undecaprenyldiphospho-muramoylpentapeptide beta-N-acetylglucosaminyltransferase. RL J Bacteriol 1993 Mar;175(6):1841-3
Probab=25.42  E-value=1.1e+02  Score=23.12  Aligned_cols=26  Identities=4%  Similarity=-0.006  Sum_probs=19.9

Q ss_pred             chHHHHHHHHhcCHHHHHHHHHHHHH
Q psy5669           2 RSSRRKVQPACMSPQYKTNVQRLSTI   27 (116)
Q Consensus         2 ~~l~~aI~~vl~n~~Yk~na~~ls~~   27 (116)
                      +.+.++|.++++||++++.+.+-...
T Consensus       309 ~~l~~~i~~ll~~~~~~~~~~~~~~~  334 (348)
T TIGR01133       309 EKLLEALLKLLLDPANLEAMAEAARK  334 (348)
T ss_pred             HHHHHHHHHHHcCHHHHHHHHHHHHh
Confidence            46889999999999887766654443


No 128
>PTZ00443 Thioredoxin domain-containing protein; Provisional
Probab=24.69  E-value=1.6e+02  Score=22.04  Aligned_cols=6  Identities=33%  Similarity=0.595  Sum_probs=2.5

Q ss_pred             ccCCCc
Q psy5669         105 KATKPD  110 (116)
Q Consensus       105 k~~~~~  110 (116)
                      -+++|.
T Consensus       211 ~~~~~~  216 (224)
T PTZ00443        211 ANKKPK  216 (224)
T ss_pred             cccCCc
Confidence            334444


No 129
>PHA02902 putative IMV membrane protein; Provisional
Probab=24.68  E-value=1.6e+02  Score=18.20  Aligned_cols=11  Identities=9%  Similarity=0.153  Sum_probs=5.1

Q ss_pred             HHHHHHHHHHH
Q psy5669          70 LDVIASCTLGL   80 (116)
Q Consensus        70 lDV~~~l~~v~   80 (116)
                      +|.+.++.+++
T Consensus         3 ~dtfvi~~v~v   13 (70)
T PHA02902          3 IDTFVILAVIV   13 (70)
T ss_pred             hhhHHHHHHHH
Confidence            45554444333


No 130
>PF12122 DUF3582:  Protein of unknown function (DUF3582);  InterPro: IPR022732 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ].  This entry represents the N-terminal domain of membrane-bound serine endopeptidases belonging to MEROPS peptidase family S54 (rhomboid-1, clan ST). This domain contains a conserved ASW sequence motif and a single completely conserved residue F that may be functionally important.  The tertiary structure of the GlpG protein from Escherichia coli has been determined []. The GlpG protein has six transmembrane domains (other members of the family are predicted to have seven), with the N- and C-terminal ends anchored in the cytoplasm. One transmembrane domain is shorter than the rest, creating an internal, aqueous cavity just below the membrane surface and it is here were proteolysis occurs. There is also a membrane-embedded loop between the first and second transmembrane domains which is postulated to act as a gate controlling substrate access to the active site. No other family of serine peptidases is known to have active site residues within transmembrane domains (although transmembrane active sites are known for aspartic peptidase and metallopeptidases), and the GlpG protein has the type structure for clan ST.; GO: 0004252 serine-type endopeptidase activity, 0016021 integral to membrane; PDB: 3UBB_A 3B45_A 3B44_A 2NRF_A 3TXT_A 2O7L_A 2XTU_A 2IRV_A 2XOW_A 2XTV_A ....
Probab=24.65  E-value=58  Score=21.48  Aligned_cols=31  Identities=13%  Similarity=0.056  Sum_probs=19.2

Q ss_pred             hcCHHHHHHHHHHHHHHhcCCCCHHHHHHHH
Q psy5669          12 CMSPQYKTNVQRLSTIFRSEPLHPLQKAIRS   42 (116)
Q Consensus        12 l~n~~Yk~na~~ls~~~~d~p~~P~~~av~w   42 (116)
                      |.|+.+.+.|+.--..|...|..|.-.++-|
T Consensus        41 l~de~~~~~a~~el~~Fl~nP~~~rYqaASW   71 (101)
T PF12122_consen   41 LHDEEHLEQAEQELEEFLQNPNDPRYQAASW   71 (101)
T ss_dssp             ES-GGGHHHHHHHHHHHHHS-SS--------
T ss_pred             EeCHHHHHHHHHHHHHHHHCCCCHHHHHHHH
Confidence            4688999999999999999999998888777


No 131
>PLN02618 tryptophan synthase, beta chain
Probab=24.56  E-value=98  Score=25.43  Aligned_cols=43  Identities=14%  Similarity=0.193  Sum_probs=28.6

Q ss_pred             chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHH
Q psy5669           2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSIEY   45 (116)
Q Consensus         2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wiE~   45 (116)
                      +.|.++..++.+||+|++....+-+-+--+| +|+-.+-...+.
T Consensus        36 ~~~~~~~~~~~~~~~f~~~~~~~l~~~vGr~-TPL~~~~~Ls~~   78 (410)
T PLN02618         36 SELEAAFNALATDPEFQEELAGILKDYVGRE-TPLYFAERLTEH   78 (410)
T ss_pred             HHHHHHHHHHhcChhhHHHHHHHHHHhcCCC-CceeEhhhHHHH
Confidence            5788899999999999998777655553233 445444444444


No 132
>PF00558 Vpu:  Vpu protein;  InterPro: IPR008187 The Human immunodeficiency virus 1 (HIV-1) Vpu protein acts in the degradation of CD4 in the endoplasmic reticulum and in the enhancement of virion release from the plasma membrane of infected cells [].; GO: 0019076 release of virus from host; PDB: 2JPX_A 1PI8_A 2GOH_A 2GOF_A 1PI7_A 1PJE_A 1VPU_A 2K7Y_A.
Probab=24.28  E-value=1.3e+02  Score=19.26  Aligned_cols=16  Identities=25%  Similarity=0.513  Sum_probs=4.9

Q ss_pred             HHHHHHHHHHHHHHHH
Q psy5669          69 QLDVIASCTLGLGLVL   84 (116)
Q Consensus        69 ~lDV~~~l~~v~~~~~   84 (116)
                      .+|+++++.+++++++
T Consensus         3 ~l~i~~iialiv~~ii   18 (81)
T PF00558_consen    3 SLEILAIIALIVALII   18 (81)
T ss_dssp             -----HHHHHHHHHHH
T ss_pred             hHHHHHHHHHHHHHHH
Confidence            4555554444444333


No 133
>PF06637 PV-1:  PV-1 protein (PLVAP);  InterPro: IPR009538 This family consists of several PV-1 (PLVAP) proteins, which seem to be specific to mammals. PV-1 is a novel protein component of the endothelial fenestral and stomatal diaphragms []. The function of this family is unknown.
Probab=24.02  E-value=2.5e+02  Score=23.31  Aligned_cols=13  Identities=15%  Similarity=0.256  Sum_probs=6.7

Q ss_pred             CCCCCcccCCCCC
Q psy5669          51 GAPHLKTKATSLP   63 (116)
Q Consensus        51 g~~~l~~~~~~l~   63 (116)
                      |.++.|..+..-+
T Consensus         8 ~~~y~R~G~~~kg   20 (442)
T PF06637_consen    8 GGPYSRAGGKGKG   20 (442)
T ss_pred             CCcccccCCCCCC
Confidence            3456666555444


No 134
>PF02723 NS3_envE:  Non-structural protein NS3/Small envelope protein E;  InterPro: IPR003873 This is a family of small nonstructural proteins, well conserved among Coronavirus strains. This protein is also found in Murine hepatitis virus as small envelope protein E.; GO: 0016020 membrane
Probab=23.85  E-value=2e+02  Score=18.44  Aligned_cols=10  Identities=0%  Similarity=0.066  Sum_probs=4.0

Q ss_pred             HHHHHHhhhh
Q psy5669          89 RIGTSFAEFT   98 (116)
Q Consensus        89 ~~~~~~~~~~   98 (116)
                      ++|..+|+.+
T Consensus        38 qLC~~cc~~~   47 (82)
T PF02723_consen   38 QLCFQCCRLC   47 (82)
T ss_pred             HHHHHHhhhh
Confidence            3444344433


No 135
>PF08390 TRAM1:  TRAM1-like protein;  InterPro: IPR013599 This family comprises sequences that are similar to human TRAM1 (Q15629 from SWISSPROT). This is a transmembrane protein of the endoplasmic reticulum, thought to be involved in the membrane transfer of secretory proteins []. The region featured in this family is found N-terminal to the longevity-assurance protein region (IPR005547 from INTERPRO). 
Probab=23.82  E-value=1.7e+02  Score=17.45  Aligned_cols=21  Identities=10%  Similarity=-0.080  Sum_probs=11.1

Q ss_pred             CHHHHHHHHHHHHHHHHHHHH
Q psy5669          63 PWYQTSQLDVIASCTLGLGLV   83 (116)
Q Consensus        63 ~~~q~~~lDV~~~l~~v~~~~   83 (116)
                      +.|..=.-|++++++-+++++
T Consensus        20 ~~Y~~G~~D~~fV~fy~i~~t   40 (65)
T PF08390_consen   20 GLYGKGWDDLYFVFFYIIVFT   40 (65)
T ss_pred             CccccCcchHHHHHHHHHHHH
Confidence            334444567776665444433


No 136
>TIGR02149 glgA_Coryne glycogen synthase, Corynebacterium family. This model describes Corynebacterium glutamicum GlgA and closely related proteins in several other species. This enzyme is required for glycogen biosynthesis and appears to replace the distantly related TIGR02095 family of ADP-glucose type glycogen synthase in Corynebacterium glutamicum, Mycobacterium tuberculosis, Bifidobacterium longum, and Streptomyces coelicolor.
Probab=23.64  E-value=3.2e+02  Score=20.76  Aligned_cols=44  Identities=9%  Similarity=0.028  Sum_probs=26.8

Q ss_pred             chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHH
Q psy5669           2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSIEY   45 (116)
Q Consensus         2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wiE~   45 (116)
                      +.+.++|.++++|+.-++.+..-+...-.+.++....+..+.+.
T Consensus       340 ~~l~~~i~~l~~~~~~~~~~~~~a~~~~~~~~s~~~~~~~~~~~  383 (388)
T TIGR02149       340 AELAKAINILLADPELAKKMGIAGRKRAEEEFSWGSIAKKTVEM  383 (388)
T ss_pred             HHHHHHHHHHHhCHHHHHHHHHHHHHHHHHhCCHHHHHHHHHHH
Confidence            56888999999888755544443333333446666666555554


No 137
>PRK04125 murein hydrolase regulator LrgA; Provisional
Probab=23.59  E-value=2e+02  Score=20.23  Aligned_cols=11  Identities=0%  Similarity=-0.176  Sum_probs=4.4

Q ss_pred             HHHHHHHHHhh
Q psy5669          86 TLARIGTSFAE   96 (116)
Q Consensus        86 ~~~~~~~~~~~   96 (116)
                      ++..+...+.+
T Consensus       108 vtg~v~~~l~r  118 (141)
T PRK04125        108 CTGLFSQFILG  118 (141)
T ss_pred             HHHHHHHHHHH
Confidence            33344444443


No 138
>PF00988 CPSase_sm_chain:  Carbamoyl-phosphate synthase small chain, CPSase domain;  InterPro: IPR002474 Carbamoyl phosphate synthase (CPSase) is a heterodimeric enzyme composed of a small and a large subunit (with the exception of CPSase III, see below). CPSase catalyses the synthesis of carbamoyl phosphate from biocarbonate, ATP and glutamine (6.3.5.5 from EC) or ammonia (6.3.4.16 from EC), and represents the first committed step in pyrimidine and arginine biosynthesis in prokaryotes and eukaryotes, and in the urea cycle in most terrestrial vertebrates [, ]. CPSase has three active sites, one in the small subunit and two in the large subunit. The small subunit contains the glutamine binding site and catalyses the hydrolysis of glutamine to glutamate and ammonia. The large subunit has two homologous carboxy phosphate domains, both of which have ATP-binding sites; however, the N-terminal carboxy phosphate domain catalyses the phosphorylation of biocarbonate, while the C-terminal domain catalyses the phosphorylation of the carbamate intermediate []. The carboxy phosphate domain found duplicated in the large subunit of CPSase is also present as a single copy in the biotin-dependent enzymes acetyl-CoA carboxylase (6.4.1.2 from EC) (ACC), propionyl-CoA carboxylase (6.4.1.3 from EC) (PCCase), pyruvate carboxylase (6.4.1.1 from EC) (PC) and urea carboxylase (6.3.4.6 from EC). Most prokaryotes carry one form of CPSase that participates in both arginine and pyrimidine biosynthesis, however certain bacteria can have separate forms. The large subunit in bacterial CPSase has four structural domains: the carboxy phosphate domain 1, the oligomerisation domain, the carbamoyl phosphate domain 2 and the allosteric domain []. CPSase heterodimers from Escherichia coli contain two molecular tunnels: an ammonia tunnel and a carbamate tunnel. These inter-domain tunnels connect the three distinct active sites, and function as conduits for the transport of unstable reaction intermediates (ammonia and carbamate) between successive active sites []. The catalytic mechanism of CPSase involves the diffusion of carbamate through the interior of the enzyme from the site of synthesis within the N-terminal domain of the large subunit to the site of phosphorylation within the C-terminal domain. Eukaryotes have two distinct forms of CPSase: a mitochondrial enzyme (CPSase I) that participates in both arginine biosynthesis and the urea cycle; and a cytosolic enzyme (CPSase II) involved in pyrimidine biosynthesis. CPSase II occurs as part of a multi-enzyme complex along with aspartate transcarbamoylase and dihydroorotase; this complex is referred to as the CAD protein []. The hepatic expression of CPSase is transcriptionally regulated by glucocorticoids and/or cAMP []. There is a third form of the enzyme, CPSase III, found in fish, which uses glutamine as a nitrogen source instead of ammonia []. CPSase III is closely related to CPSase I, and is composed of a single polypeptide that may have arisen from gene fusion of the glutaminase and synthetase domains [].  This entry represents the N-terminal domain of the small subunit of carbamoyl phosphate synthase. The small subunit has a 3-layer beta/beta/alpha structure, and is thought to be mobile in most proteins that carry it. The C-terminal domain of the small subunit of CPSase has glutamine amidotransferase activity.; GO: 0006807 nitrogen compound metabolic process; PDB: 1CE8_B 1KEE_B 1CS0_D 1T36_D 1M6V_H 1A9X_F 1JDB_I 1BXR_F 1C3O_B 1C30_F ....
Probab=23.51  E-value=34  Score=23.85  Aligned_cols=11  Identities=27%  Similarity=0.422  Sum_probs=7.4

Q ss_pred             HHHhcCHHHHH
Q psy5669           9 QPACMSPQYKT   19 (116)
Q Consensus         9 ~~vl~n~~Yk~   19 (116)
                      +|+|+||||.-
T Consensus        38 qE~lTDPSY~g   48 (131)
T PF00988_consen   38 QEILTDPSYAG   48 (131)
T ss_dssp             HHHHT-GGGBT
T ss_pred             chhhcCCcCCc
Confidence            57788888864


No 139
>PF15179 Myc_target_1:  Myc target protein 1
Probab=23.48  E-value=1.9e+02  Score=21.56  Aligned_cols=21  Identities=29%  Similarity=0.406  Sum_probs=10.0

Q ss_pred             CCCHHHHHHH-HHH-HHHHHHHH
Q psy5669          61 SLPWYQTSQL-DVI-ASCTLGLG   81 (116)
Q Consensus        61 ~l~~~q~~~l-DV~-~~l~~v~~   81 (116)
                      .+.|.+-+++ |++ +|.+++++
T Consensus         9 ~~~~~~~f~~~~lIlaF~vSm~i   31 (197)
T PF15179_consen    9 LLEWLENFDWEDLILAFCVSMAI   31 (197)
T ss_pred             cccchhhcchhhHHHHHHHHHHH
Confidence            3556666644 444 34444433


No 140
>cd03809 GT1_mtfB_like This family is most closely related to the GT1 family of glycosyltransferases. mtfB (mannosyltransferase B) in E. coli has been shown to direct the growth of the O9-specific polysaccharide chain. It transfers two mannoses into the position 3 of the previously synthesized polysaccharide.
Probab=23.47  E-value=1.8e+02  Score=21.32  Aligned_cols=37  Identities=5%  Similarity=0.027  Sum_probs=24.2

Q ss_pred             chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHH
Q psy5669           2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKA   39 (116)
Q Consensus         2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~a   39 (116)
                      +.+.++|.++++|+..++.+.+-+.... +..+....+
T Consensus       324 ~~~~~~i~~l~~~~~~~~~~~~~~~~~~-~~~sw~~~~  360 (365)
T cd03809         324 EALAAAIERLLEDPALREELRERGLARA-KRFSWEKTA  360 (365)
T ss_pred             HHHHHHHHHHhcCHHHHHHHHHHHHHHH-HhCCHHHHH
Confidence            4678899999999988777666554333 234544443


No 141
>TIGR00263 trpB tryptophan synthase, beta subunit. Tryptophan synthase catalyzes the last step in the biosynthesis of tryptophan. the beta chain contains the functional domain for or the synthesis of tryptophan from indole and serine. The enzyme requires pyridoxal-phosphate as a cofactor. The pyridoxal-P attachment site is contained within the conserved region [LIVM]-x-H-x-G-[STA]-H-K-x-N] [K is the pyridoxal-P attachment site] which is present between residues 90-100 of the model.
Probab=23.16  E-value=89  Score=25.13  Aligned_cols=38  Identities=18%  Similarity=0.324  Sum_probs=28.0

Q ss_pred             CchHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHH
Q psy5669           1 MRSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKA   39 (116)
Q Consensus         1 ~~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~a   39 (116)
                      ++.|..+...++.|++|.+....+..-+..+| +|+..+
T Consensus        19 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~-TPL~~~   56 (385)
T TIGR00263        19 LEELEAAFEDAKADPAFWAELNELLRNYAGRP-TPLTFA   56 (385)
T ss_pred             HHHHHHHHHHHhcCHHHHHHHHHHHHHhCCCC-CCceeh
Confidence            36788888999999999999877777675444 455443


No 142
>PHA02851 EEV glycoprotein; Provisional
Probab=23.15  E-value=79  Score=23.05  Aligned_cols=15  Identities=13%  Similarity=0.020  Sum_probs=8.0

Q ss_pred             HHHHHHHhhhhcccc
Q psy5669          88 ARIGTSFAEFTRGKD  102 (116)
Q Consensus        88 ~~~~~~~~~~~~~~~  102 (116)
                      +.+|+.-|+.++.++
T Consensus        20 llIcKk~cKKFFkKK   34 (223)
T PHA02851         20 FLINKKYCKKFFKKK   34 (223)
T ss_pred             HHHhHHHHHHHHhcC
Confidence            445555555555443


No 143
>TIGR00215 lpxB lipid-A-disaccharide synthase. Lipid-A precursor biosynthesis producing lipid A disaccharide in a condensation reaction. transcribed as part of an operon including lpxA
Probab=23.09  E-value=1.8e+02  Score=23.15  Aligned_cols=40  Identities=5%  Similarity=-0.068  Sum_probs=26.0

Q ss_pred             chHHHHHHHHhcCH----HHHHHHHHHHHHHhcCCC--CHHHHHHH
Q psy5669           2 RSSRRKVQPACMSP----QYKTNVQRLSTIFRSEPL--HPLQKAIR   41 (116)
Q Consensus         2 ~~l~~aI~~vl~n~----~Yk~na~~ls~~~~d~p~--~P~~~av~   41 (116)
                      +.|.+++.++++|+    ++++.+..--..++++--  ++.++++.
T Consensus       335 ~~l~~~~~~ll~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~a~  380 (385)
T TIGR00215       335 HPLAIALLLLLENGLKAYKEMHRERQFFEELRQRIYCNADSERAAQ  380 (385)
T ss_pred             HHHHHHHHHHhcCCcccHHHHHHHHHHHHHHHHHhcCCCHHHHHHH
Confidence            46889999999999    777666555554544332  34455554


No 144
>PLN02555 limonoid glucosyltransferase
Probab=22.90  E-value=1.1e+02  Score=25.53  Aligned_cols=29  Identities=10%  Similarity=0.064  Sum_probs=23.4

Q ss_pred             chHHHHHHHHhcCH---HHHHHHHHHHHHHhc
Q psy5669           2 RSSRRKVQPACMSP---QYKTNVQRLSTIFRS   30 (116)
Q Consensus         2 ~~l~~aI~~vl~n~---~Yk~na~~ls~~~~d   30 (116)
                      +++..+|+++++++   ..|+||++++...+.
T Consensus       417 ~~v~~~v~~vm~~~~g~~~r~ra~~l~~~a~~  448 (480)
T PLN02555        417 EEVAECLLEATVGEKAAELKQNALKWKEEAEA  448 (480)
T ss_pred             HHHHHHHHHHhcCchHHHHHHHHHHHHHHHHH
Confidence            46889999999764   699999999887553


No 145
>PF07782 DC_STAMP:  DC-STAMP-like protein;  InterPro: IPR012858 This group of sequences is similar to a region of the dendritic cell-specific transmembrane protein (DC-STAMP, Q9H295 from SWISSPROT). This is thought to be a novel receptor protein that shares no identity with other multimembrane-spanning proteins []. It is thought to have seven putative transmembrane regions [], two of which are found in the region featured in this family. DC-STAMP is also described as having potential N-linked glycosylation sites and a potential phosphorylation site for PKC [], but these are not conserved. ; GO: 0016021 integral to membrane
Probab=22.76  E-value=2.5e+02  Score=20.18  Aligned_cols=45  Identities=11%  Similarity=-0.034  Sum_probs=23.4

Q ss_pred             CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccc
Q psy5669          61 SLPWYQTSQLDVIASCTLGLGLVLYTLARIGTSFAEFTRGKDDKK  105 (116)
Q Consensus        61 ~l~~~q~~~lDV~~~l~~v~~~~~~~~~~~~~~~~~~~~~~~~~k  105 (116)
                      ..++..|..+=++..++.+++++=..+.++.+.+|..++-.++++
T Consensus       141 ~p~~~~~~~i~~l~~l~~ll~~le~Y~~RLR~~I~a~fyP~re~~  185 (191)
T PF07782_consen  141 PPDYSVYIQIGLLYLLLWLLVLLEPYALRLRRVICASFYPEREKE  185 (191)
T ss_pred             CCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhChHHHHH
Confidence            445555555444333333322222335677777787777666553


No 146
>PF12877 DUF3827:  Domain of unknown function (DUF3827);  InterPro: IPR024606 The function of the proteins in this entry is not currently known, but one of the human proteins (Q9HCM3 from SWISSPROT) has been implicated in pilocytic astrocytomas [, , ]. In the majority of cases of pilocytic astrocytomas a tandem duplication produces an in-frame fusion of the gene encoding this protein and the BRAF oncogene. The resulting fusion protein has constitutive BRAF kinase activity and is capable of transforming cells. 
Probab=22.70  E-value=57  Score=28.67  Aligned_cols=9  Identities=11%  Similarity=0.006  Sum_probs=3.5

Q ss_pred             HHHHHHHHH
Q psy5669          73 IASCTLGLG   81 (116)
Q Consensus        73 ~~~l~~v~~   81 (116)
                      ++.++++++
T Consensus       276 lvPv~vV~~  284 (684)
T PF12877_consen  276 LVPVLVVLL  284 (684)
T ss_pred             hHHHHHHHH
Confidence            333433333


No 147
>cd02646 R3H_G-patch R3H domain of a group of fungal and plant proteins with unknown function, who also contain a G-patch domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the R3H domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=22.26  E-value=71  Score=18.58  Aligned_cols=39  Identities=3%  Similarity=0.087  Sum_probs=27.7

Q ss_pred             CchHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHh
Q psy5669           1 MRSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSIEYVIAH   49 (116)
Q Consensus         1 ~~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wiE~v~r~   49 (116)
                      |+.+.+.|+..+.++         ...+.-.|++|.++++-+ +.+..+
T Consensus         1 ~~~i~~~i~~F~~~~---------~~~~~fppm~~~~R~~vH-~lA~~~   39 (58)
T cd02646           1 IEDIKDEIEAFLLDS---------RDSLSFPPMDKHGRKTIH-KLANCY   39 (58)
T ss_pred             ChHHHHHHHHHHhCC---------CceEecCCCCHHHHHHHH-HHHHHc
Confidence            567778888887666         345677899999988765 455444


No 148
>KOG3814|consensus
Probab=21.70  E-value=2.1e+02  Score=24.01  Aligned_cols=37  Identities=19%  Similarity=0.460  Sum_probs=30.4

Q ss_pred             HHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHhCC
Q psy5669          15 PQYKTNVQRLSTIFRSEPLHPLQKAIRSIEYVIAHRG   51 (116)
Q Consensus        15 ~~Yk~na~~ls~~~~d~p~~P~~~av~wiE~v~r~~g   51 (116)
                      |+....+++--..-+.||..|.+....++|+.+.|+-
T Consensus       406 ~smaRalqKYLrtTrqQ~rHt~esil~Hle~Clt~~l  442 (531)
T KOG3814|consen  406 PSMARALQKYLRTTRQQPRHTMESILQHLEFCLTHDL  442 (531)
T ss_pred             HHHHHHHHHHHHHhhcCCCCcHHHHHHHHHHHHhcCC
Confidence            4555566666677789999999999999999999974


No 149
>TIGR03088 stp2 sugar transferase, PEP-CTERM/EpsH1 system associated. Members of this family include a match to the pfam00534 Glycosyl transferases group 1 domain. Nearly all are found in species that encode the PEP-CTERM/exosortase system predicted to act in protein sorting in a number of Gram-negative bacteria. In particular, these transferases are found proximal to a particular variant of exosortase, EpsH1, which appears to travel with a conserved group of genes summarized by Genome Property GenProp0652. The nature of the sugar transferase reaction catalyzed by members of this clade is unknown and may conceivably be variable with respect to substrate by species, but we hypothesize a conserved substrate.
Probab=21.49  E-value=3.4e+02  Score=20.63  Aligned_cols=44  Identities=9%  Similarity=-0.002  Sum_probs=25.2

Q ss_pred             chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHH
Q psy5669           2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSIEY   45 (116)
Q Consensus         2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wiE~   45 (116)
                      +++.++|.++++|+..++.+.+-+.-.-.+.++....+-.|.+.
T Consensus       326 ~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~~~~~~~~  369 (374)
T TIGR03088       326 VALARALQPYVSDPAARRAHGAAGRARAEQQFSINAMVAAYAGL  369 (374)
T ss_pred             HHHHHHHHHHHhCHHHHHHHHHHHHHHHHHhCCHHHHHHHHHHH
Confidence            46778888888888655444333332333455655555555543


No 150
>cd03825 GT1_wcfI_like This family is most closely related to the GT1 family of glycosyltransferases. wcfI in Bacteroides fragilis has been shown to be involved in the capsular polysaccharide biosynthesis.
Probab=21.38  E-value=2.9e+02  Score=20.45  Aligned_cols=19  Identities=5%  Similarity=0.067  Sum_probs=12.8

Q ss_pred             chHHHHHHHHhcCHHHHHH
Q psy5669           2 RSSRRKVQPACMSPQYKTN   20 (116)
Q Consensus         2 ~~l~~aI~~vl~n~~Yk~n   20 (116)
                      +++.++|.++++|+...+.
T Consensus       318 ~~~~~~l~~l~~~~~~~~~  336 (365)
T cd03825         318 EDLAEGIEWLLADPDEREE  336 (365)
T ss_pred             HHHHHHHHHHHhCHHHHHH
Confidence            3567788888888764433


No 151
>PHA03054 IMV membrane protein; Provisional
Probab=21.31  E-value=2.2e+02  Score=17.79  Aligned_cols=25  Identities=20%  Similarity=0.371  Sum_probs=16.1

Q ss_pred             HHHhcCCCCHHHHHHHHHHHHHHhC
Q psy5669          26 TIFRSEPLHPLQKAIRSIEYVIAHR   50 (116)
Q Consensus        26 ~~~~d~p~~P~~~av~wiE~v~r~~   50 (116)
                      ..|...|-+..+..+.-+.-|+.-.
T Consensus        10 GvF~ss~d~Df~~Fi~vV~sVl~dk   34 (72)
T PHA03054         10 GVFMGSPEDDLTDFIEIVKSVLSDE   34 (72)
T ss_pred             HHhhCCchHHHHHHHHHHHHHHcCC
Confidence            3455566666777777777777664


No 152
>cd04951 GT1_WbdM_like This family is most closely related to the GT1 family of glycosyltransferases and is named after WbdM in Escherichia coli. In general glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. The two domains have
Probab=21.15  E-value=2.8e+02  Score=20.50  Aligned_cols=42  Identities=7%  Similarity=-0.068  Sum_probs=26.1

Q ss_pred             chHHHHHHHHh-cCHHHHHHHHHHHHHHhcCCCCHHHHHHHHHH
Q psy5669           2 RSSRRKVQPAC-MSPQYKTNVQRLSTIFRSEPLHPLQKAIRSIE   44 (116)
Q Consensus         2 ~~l~~aI~~vl-~n~~Yk~na~~ls~~~~d~p~~P~~~av~wiE   44 (116)
                      +++.++|.+++ .++.+++.+... .-...+.++....+-.|.+
T Consensus       314 ~~~~~~i~~ll~~~~~~~~~~~~~-~~~~~~~~s~~~~~~~~~~  356 (360)
T cd04951         314 EALANKIDEILKMSGEERDIIGAR-RERIVKKFSINSIVQQWLT  356 (360)
T ss_pred             HHHHHHHHHHHhCCHHHHHHHHHH-HHHHHHhcCHHHHHHHHHH
Confidence            45778888888 567777766655 3333355666665555544


No 153
>PF02526 GBP_repeat:  Glycophorin-binding protein;  InterPro: IPR003681 The glycophorin-binding protein contains a tandem repeat. The repeated sequence determines the binding domain for an erythrocyte receptor binding protein of Plasmodium falciparum, the malarial parasite []. Erythrocyte invasion by the malarial merozoite is a receptor-mediated process, an obligatory step in the development of the parasite. The P. falciparum protein binds to the erythrocyte receptor glycophorin.
Probab=21.02  E-value=1.6e+02  Score=15.90  Aligned_cols=24  Identities=17%  Similarity=0.443  Sum_probs=19.4

Q ss_pred             HHHHhcCHHHHHHHHHHHHHHhcC
Q psy5669           8 VQPACMSPQYKTNVQRLSTIFRSE   31 (116)
Q Consensus         8 I~~vl~n~~Yk~na~~ls~~~~d~   31 (116)
                      +++-..||.|++.+.-+-+++.+.
T Consensus        10 mk~yaadpeyrkh~~v~yqil~~t   33 (38)
T PF02526_consen   10 MKAYAADPEYRKHLNVLYQILTNT   33 (38)
T ss_pred             HHHHhcCHHHHHHHHHHHHHHccC
Confidence            455668999999999998888754


No 154
>PF04659 Arch_fla_DE:  Archaeal flagella protein ;  InterPro: IPR006752  Archaeal flagella are unique motility structures, and the absence of bacterial structural motility genes in the complete genome sequences of flagellated archaeal species has always suggested that archaeal flagellar biogenesis is likely mediated by novel components. FlaD and FlaE, are present in the cell as membrane-associated proteins but are not major components of isolated flagellar filaments. Interestingly, flaD was found to encode two proteins, each translated from a separate ribosome binding site. This group of sequences contain the archaeal flaD and flaE proteins. The conserved region that defines these sequences is found in the N-teminal region of flaE but towards the C-terminal region of flaD [].; GO: 0001539 ciliary or flagellar motility
Probab=20.99  E-value=77  Score=20.93  Aligned_cols=19  Identities=11%  Similarity=0.228  Sum_probs=15.2

Q ss_pred             HHHHHHHHHHHHHhCCCCC
Q psy5669          36 LQKAIRSIEYVIAHRGAPH   54 (116)
Q Consensus        36 ~~~av~wiE~v~r~~g~~~   54 (116)
                      .-.+..|+||++...|...
T Consensus        18 ~~~~~eWLefLve~~G~~~   36 (99)
T PF04659_consen   18 EIVVFEWLEFLVERVGHNN   36 (99)
T ss_pred             HHHHHHHHHHHHHHccccc
Confidence            4467899999999988654


No 155
>PRK10591 hypothetical protein; Provisional
Probab=20.96  E-value=2.6e+02  Score=18.37  Aligned_cols=12  Identities=17%  Similarity=0.349  Sum_probs=6.1

Q ss_pred             HHHHHHHHHHHh
Q psy5669          84 LYTLARIGTSFA   95 (116)
Q Consensus        84 ~~~~~~~~~~~~   95 (116)
                      +.+++++.+.+.
T Consensus        61 v~ivWR~a~~la   72 (92)
T PRK10591         61 VVIIWRVAKGLA   72 (92)
T ss_pred             HHHHHHHHHHhc
Confidence            345566655443


No 156
>PF06667 PspB:  Phage shock protein B;  InterPro: IPR009554 This family consists of several bacterial phage shock protein B (PspB) sequences. The phage shock protein (psp) operon is induced in response to heat, ethanol, osmotic shock and infection by filamentous bacteriophages []. Expression of the operon requires the alternative sigma factor sigma54 and the transcriptional activator PspF. In addition, PspA plays a negative regulatory role, and the integral-membrane proteins PspB and PspC play a positive one [].; GO: 0006355 regulation of transcription, DNA-dependent, 0009271 phage shock
Probab=20.85  E-value=2.3e+02  Score=17.76  Aligned_cols=21  Identities=5%  Similarity=-0.075  Sum_probs=10.1

Q ss_pred             HHHHHHHHHHHHHHHHHHHHH
Q psy5669          71 DVIASCTLGLGLVLYTLARIG   91 (116)
Q Consensus        71 DV~~~l~~v~~~~~~~~~~~~   91 (116)
                      +++++.++++++++..++.++
T Consensus         4 ~fl~~plivf~ifVap~WL~l   24 (75)
T PF06667_consen    4 EFLFVPLIVFMIFVAPIWLIL   24 (75)
T ss_pred             HHHHHHHHHHHHHHHHHHHHH
Confidence            444555555554544444443


No 157
>PF12834 Phage_int_SAM_2:  Phage integrase, N-terminal;  InterPro: IPR024457 Phage integrase proteins cleave DNA substrates by a series of staggered cuts, during which the protein becomes covalently linked to the DNA through a catalytic tyrosine residue at the carboxy end of the alignment [, ]. This entry represents the N-terminal predicted alpha-helical domain found in a family of putative prophage DNA-binding integrases.
Probab=20.83  E-value=2.4e+02  Score=17.97  Aligned_cols=49  Identities=16%  Similarity=0.215  Sum_probs=33.2

Q ss_pred             CchHHHHHHHHhc--CHHHHHHHHHHH------HHH---hcCCCCHHHHHHHHHHHHHHh
Q psy5669           1 MRSSRRKVQPACM--SPQYKTNVQRLS------TIF---RSEPLHPLQKAIRSIEYVIAH   49 (116)
Q Consensus         1 ~~~l~~aI~~vl~--n~~Yk~na~~ls------~~~---~d~p~~P~~~av~wiE~v~r~   49 (116)
                      |+.|...+.++..  +.||++.+.+..      .-+   .-++.+|...-..+||..+++
T Consensus         1 m~~L~~~l~~lar~~~gS~~T~~~R~~~~~~~~~~L~~~g~~i~~~~~lk~kHI~~lv~~   60 (91)
T PF12834_consen    1 MSKLRYQLKTLARQAGGSFKTQADRMRILKRFARFLRELGFQIRSIRNLKPKHIEALVQH   60 (91)
T ss_pred             ChHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHCCCCcCCHHHhhHHHHHHHHHH
Confidence            7778888877774  346666554432      222   356778888888888888777


No 158
>PRK10635 bacterioferritin; Provisional
Probab=20.70  E-value=2.9e+02  Score=19.42  Aligned_cols=55  Identities=16%  Similarity=0.222  Sum_probs=37.5

Q ss_pred             hHHHHHHHHhcCH-----HHHHHHHHH--------HHHHhcCCCCHHHHHHHHHHHHHHhCCCCCCcc
Q psy5669           3 SSRRKVQPACMSP-----QYKTNVQRL--------STIFRSEPLHPLQKAIRSIEYVIAHRGAPHLKT   57 (116)
Q Consensus         3 ~l~~aI~~vl~n~-----~Yk~na~~l--------s~~~~d~p~~P~~~av~wiE~v~r~~g~~~l~~   57 (116)
                      .+.+.+++.|.++     .|...+.-+        +..+..+-..-++.|..-+|+++..||.|.+.+
T Consensus         6 ~vi~~LN~~L~~El~Ai~QY~~ha~~~~~~G~~~la~~~~~ea~eEm~HA~~l~eRIl~LgG~P~~~~   73 (158)
T PRK10635          6 KIINYLNKLLGNELVAINQYFLHARMFKNWGLMRLNDVEYHESIDEMKHADKYIERILFLEGIPNLQD   73 (158)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCC
Confidence            4566666666555     344444333        445555555678889999999999999988765


No 159
>PRK10307 putative glycosyl transferase; Provisional
Probab=20.47  E-value=3.2e+02  Score=21.35  Aligned_cols=43  Identities=5%  Similarity=0.038  Sum_probs=25.3

Q ss_pred             chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHHH
Q psy5669           2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSIE   44 (116)
Q Consensus         2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wiE   44 (116)
                      +++.++|.++++|+.-++.+.+-+...-.+.++....+-.+.+
T Consensus       361 ~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~~~~~~~  403 (412)
T PRK10307        361 EALVAAIAALARQALLRPKLGTVAREYAERTLDKENVLRQFIA  403 (412)
T ss_pred             HHHHHHHHHHHhCHHHHHHHHHHHHHHHHHHcCHHHHHHHHHH
Confidence            4677888888888866655555444444444554444444433


No 160
>cd01056 Euk_Ferritin eukaryotic ferritins. Eukaryotic Ferritin (Euk_Ferritin) domain. Ferritins are the primary iron storage proteins of most living organisms and members of a broad superfamily of ferritin-like diiron-carboxylate proteins. The iron-free (apoferritin) ferritin molecule is a protein shell composed of 24 protein chains arranged in 432 symmetry. Iron storage involves the uptake of iron (II) at the protein shell, its oxidation by molecular oxygen at the dinuclear ferroxidase centers, and the movement of iron (III) into the cavity for deposition as ferrihydrite; the protein shell can hold up to 4500 iron atoms. In vertebrates, two types of chains (subunits) have been characterized, H or M (fast) and L (slow), which differ in rates of iron uptake and mineralization. Fe(II) oxidation in the H/M subunits take place initially at the ferroxidase center, a carboxylate-bridged diiron center, located within the subunit four-helix bundle. In a complementary role, negatively charged r
Probab=20.42  E-value=82  Score=21.87  Aligned_cols=38  Identities=18%  Similarity=0.228  Sum_probs=31.9

Q ss_pred             HHHHHHHHHhcCCCCHHHHHHHHHHHHHHhCCCCCCcc
Q psy5669          20 NVQRLSTIFRSEPLHPLQKAIRSIEYVIAHRGAPHLKT   57 (116)
Q Consensus        20 na~~ls~~~~d~p~~P~~~av~wiE~v~r~~g~~~l~~   57 (116)
                      +...+++.|+.+-..-.+.|...++|+...||.+.+++
T Consensus        35 ~l~g~a~~f~~~a~eE~~HA~~l~~~i~~rgg~~~~~~   72 (161)
T cd01056          35 ALPGFAKFFRKLSDEEREHAEKLIKYQNKRGGRVVLQD   72 (161)
T ss_pred             cchhHHHHHHHHHHHHHHHHHHHHHHHHHcCCeeecCC
Confidence            45667788888888889999999999999999877655


No 161
>PTZ00046 rifin; Provisional
Probab=20.16  E-value=2.1e+02  Score=23.37  Aligned_cols=6  Identities=0%  Similarity=0.074  Sum_probs=2.5

Q ss_pred             HHHhhh
Q psy5669          92 TSFAEF   97 (116)
Q Consensus        92 ~~~~~~   97 (116)
                      .+++|.
T Consensus       335 YLILRY  340 (358)
T PTZ00046        335 YLILRY  340 (358)
T ss_pred             HHHHHh
Confidence            344443


Done!