Query         psy12933
Match_columns 65
No_of_seqs    159 out of 1015
Neff          6.2 
Searched_HMMs 46136
Date          Fri Aug 16 23:28:27 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy12933.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/12933hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PF13621 Cupin_8:  Cupin-like d  99.8 1.1E-20 2.4E-25  123.5   5.3   56    8-65    187-245 (251)
  2 KOG2132|consensus               99.2 4.5E-13 9.7E-18   95.6  -5.2   54    6-61    302-355 (355)
  3 PF08007 Cupin_4:  Cupin superf  99.0 1.3E-09 2.8E-14   76.3   5.3   35   29-63    173-207 (319)
  4 KOG2130|consensus               98.9 1.3E-09 2.8E-14   78.3   5.0   44   20-63    250-294 (407)
  5 PF02373 JmjC:  JmjC domain, hy  98.9 6.4E-09 1.4E-13   61.6   5.3   37   27-63     76-112 (114)
  6 KOG2131|consensus               98.8 1.6E-09 3.4E-14   78.7   2.0   38   28-65    264-301 (427)
  7 COG2850 Uncharacterized conser  98.3 1.4E-07   3E-12   68.4   1.1   36   28-63    175-212 (383)
  8 KOG2508|consensus               97.5 6.3E-05 1.4E-09   55.1   2.0   47   19-65    256-307 (437)
  9 COG2140 Thermophilic glucose-6  97.2  0.0015 3.3E-08   44.4   5.7   29   29-57    123-151 (209)
 10 KOG3706|consensus               97.0 0.00031 6.6E-09   53.3   1.5   28   29-56    378-405 (629)
 11 PF06560 GPI:  Glucose-6-phosph  97.0  0.0017 3.8E-08   43.2   4.9   36   30-65    108-144 (182)
 12 PF00190 Cupin_1:  Cupin;  Inte  96.9  0.0033 7.3E-08   39.1   5.3   28   35-62     87-116 (144)
 13 PRK04190 glucose-6-phosphate i  96.9  0.0041 8.8E-08   41.5   5.9   34   31-64    119-154 (191)
 14 COG0662 {ManC} Mannose-6-phosp  96.8  0.0019 4.1E-08   39.9   3.6   27   31-57     74-100 (127)
 15 PF07883 Cupin_2:  Cupin domain  96.8  0.0036 7.9E-08   33.7   4.1   26   32-57     37-62  (71)
 16 KOG2508|consensus               96.4  0.0071 1.5E-07   44.6   5.0   61    5-65    333-401 (437)
 17 COG4101 Predicted mannose-6-ph  95.9   0.011 2.3E-07   37.9   3.3   29   28-56     84-112 (142)
 18 KOG1633|consensus               95.8    0.01 2.2E-07   46.8   3.5   31   29-59    193-223 (776)
 19 smart00835 Cupin_1 Cupin. This  95.7   0.035 7.6E-07   34.5   4.9   26   31-56     74-99  (146)
 20 TIGR03404 bicupin_oxalic bicup  95.5   0.032   7E-07   40.3   4.8   26   31-56    288-313 (367)
 21 KOG2132|consensus               95.5   0.024 5.2E-07   41.2   4.0   45   12-56    231-277 (355)
 22 PRK13290 ectC L-ectoine syntha  95.4    0.05 1.1E-06   33.9   5.0   27   31-57     74-100 (125)
 23 PF09313 DUF1971:  Domain of un  95.2   0.057 1.2E-06   31.7   4.4   29   29-57     47-75  (82)
 24 COG1917 Uncharacterized conser  95.1   0.051 1.1E-06   33.1   4.3   28   31-58     81-108 (131)
 25 PF05523 FdtA:  WxcM-like, C-te  95.0   0.067 1.5E-06   33.4   4.6   26   32-57     76-102 (131)
 26 TIGR01221 rmlC dTDP-4-dehydror  95.0   0.081 1.7E-06   34.9   5.2   44   15-59     82-125 (176)
 27 TIGR03404 bicupin_oxalic bicup  94.6    0.11 2.3E-06   37.6   5.4   25   32-56    110-134 (367)
 28 PF13759 2OG-FeII_Oxy_5:  Putat  94.3    0.12 2.6E-06   30.3   4.3   36   28-63     62-100 (101)
 29 PF05899 Cupin_3:  Protein of u  94.2    0.03 6.5E-07   31.7   1.5   19   30-48     42-60  (74)
 30 KOG2107|consensus               94.2   0.092   2E-06   35.0   4.0   30   27-56    111-140 (179)
 31 PRK09943 DNA-binding transcrip  94.1    0.13 2.8E-06   33.2   4.5   27   31-57    145-171 (185)
 32 PF03079 ARD:  ARD/ARD' family;  93.9    0.13 2.7E-06   33.4   4.2   30   27-56    110-139 (157)
 33 TIGR03037 anthran_nbaC 3-hydro  93.5    0.14   3E-06   33.6   3.8   30   32-61     70-99  (159)
 34 COG3450 Predicted enzyme of th  93.5   0.084 1.8E-06   33.0   2.7   28   29-56     79-106 (116)
 35 KOG1356|consensus               93.4   0.011 2.3E-07   47.1  -1.7   32   29-60    796-827 (889)
 36 PF01050 MannoseP_isomer:  Mann  93.4    0.15 3.2E-06   32.8   3.8   26   31-56    101-126 (151)
 37 PF02311 AraC_binding:  AraC-li  92.9    0.43 9.4E-06   27.5   5.0   27   31-57     40-66  (136)
 38 COG3615 TehB Uncharacterized p  92.7    0.21 4.6E-06   30.5   3.6   33   31-64     61-95  (99)
 39 COG1898 RfbC dTDP-4-dehydrorha  92.7    0.52 1.1E-05   31.2   5.6   47   15-64     82-130 (173)
 40 PF00908 dTDP_sugar_isom:  dTDP  92.5    0.44 9.5E-06   31.3   5.2   42   15-59     82-125 (176)
 41 PRK13264 3-hydroxyanthranilate  92.2    0.27 5.8E-06   32.8   3.9   31   32-62     76-106 (177)
 42 TIGR02466 conserved hypothetic  91.9    0.28   6E-06   32.9   3.7   32   33-64    163-199 (201)
 43 PF13640 2OG-FeII_Oxy_3:  2OG-F  91.9    0.19 4.1E-06   28.8   2.6   30   36-65     66-99  (100)
 44 TIGR03214 ura-cupin putative a  91.7    0.29 6.2E-06   33.7   3.7   27   31-57    217-243 (260)
 45 PRK15457 ethanolamine utilizat  91.2    0.26 5.7E-06   34.1   3.1   24   31-54    192-215 (233)
 46 TIGR03214 ura-cupin putative a  90.8    0.67 1.4E-05   31.9   4.8   24   33-56     99-122 (260)
 47 PRK15131 mannose-6-phosphate i  90.8    0.25 5.4E-06   36.1   2.8   24   31-54    236-259 (389)
 48 PF06249 EutQ:  Ethanolamine ut  90.8    0.19 4.2E-06   32.7   2.0   18   31-48    112-129 (152)
 49 TIGR02272 gentisate_1_2 gentis  90.7    0.45 9.7E-06   34.4   4.0   30   28-57    116-145 (335)
 50 PF15138 Syncollin:  Syncollin   90.5    0.35 7.5E-06   30.1   2.9   36   29-64     32-70  (112)
 51 PLN00212 glutelin; Provisional  90.3    0.92   2E-05   34.4   5.5   33   29-61    390-422 (493)
 52 PRK05467 Fe(II)-dependent oxyg  89.6     0.9   2E-05   31.1   4.6   34   31-64    140-175 (226)
 53 TIGR01479 GMP_PMI mannose-1-ph  89.2    0.86 1.9E-05   33.7   4.6   27   31-57    414-440 (468)
 54 TIGR00218 manA mannose-6-phosp  88.8    0.27 5.8E-06   34.3   1.6   22   31-52    150-171 (302)
 55 PRK11171 hypothetical protein;  88.2     1.4   3E-05   30.3   4.9   26   31-56    100-125 (266)
 56 PLN02288 mannose-6-phosphate i  87.9    0.46   1E-05   34.9   2.4   22   32-53    251-272 (394)
 57 PRK13502 transcriptional activ  87.4     1.1 2.5E-05   29.9   3.9   26   31-56     55-80  (282)
 58 TIGR03027 pepcterm_export puta  86.4    0.52 1.1E-05   30.1   1.8   18   32-49    148-165 (165)
 59 PF12852 Cupin_6:  Cupin         86.3    0.99 2.2E-05   28.7   3.1   25   32-56     55-79  (186)
 60 COG4766 EutQ Ethanolamine util  85.9    0.66 1.4E-05   30.8   2.1   16   33-48    137-152 (176)
 61 COG1482 ManA Phosphomannose is  85.8    0.77 1.7E-05   33.0   2.6   24   31-54    157-180 (312)
 62 PRK11171 hypothetical protein;  84.8     1.5 3.3E-05   30.2   3.6   26   31-56    222-247 (266)
 63 PRK13501 transcriptional activ  84.5     1.8 3.9E-05   29.2   3.8   27   30-56     54-80  (290)
 64 PF07385 DUF1498:  Protein of u  84.0     2.5 5.5E-05   29.2   4.4   28   30-57    152-179 (225)
 65 PF05721 PhyH:  Phytanoyl-CoA d  83.9     1.5 3.3E-05   27.0   3.1   26   29-54    177-202 (211)
 66 PRK15460 cpsB mannose-1-phosph  83.8     2.6 5.6E-05   31.7   4.6   26   32-57    424-449 (478)
 67 COG4297 Uncharacterized protei  82.8     1.8   4E-05   28.3   3.1   33   25-57     78-110 (163)
 68 smart00702 P4Hc Prolyl 4-hydro  82.3     2.9 6.3E-05   26.3   3.9   43   23-65    130-177 (178)
 69 PRK13500 transcriptional activ  81.4       3 6.5E-05   28.8   4.0   27   31-57     85-111 (312)
 70 PRK10296 DNA-binding transcrip  79.7     2.5 5.4E-05   28.2   3.1   24   31-54     60-83  (278)
 71 PF12973 Cupin_7:  ChrR Cupin-l  79.6     2.8 6.1E-05   24.0   2.9   25   35-59     61-85  (91)
 72 COG1791 Uncharacterized conser  79.6     3.6 7.8E-05   27.6   3.7   29   28-56    114-142 (181)
 73 TIGR03028 EpsE polysaccharide   79.2     1.3 2.7E-05   29.9   1.5   15   35-49    225-239 (239)
 74 PF13532 2OG-FeII_Oxy_2:  2OG-F  78.9     2.3   5E-05   26.9   2.6   29   28-57    144-176 (194)
 75 COG5285 Protein involved in bi  77.5     3.9 8.5E-05   29.4   3.6   31   26-56    187-217 (299)
 76 PF01238 PMI_typeI:  Phosphoman  74.9     2.4 5.2E-05   30.6   2.1   23   32-54    250-272 (373)
 77 PF06052 3-HAO:  3-hydroxyanthr  74.7      10 0.00022   24.8   4.7   33   29-61     72-104 (151)
 78 PRK15401 alpha-ketoglutarate-d  73.2       8 0.00017   26.3   4.2   36   28-63    163-210 (213)
 79 PRK13503 transcriptional activ  73.0     3.2   7E-05   27.5   2.2   25   32-56     53-77  (278)
 80 TIGR02297 HpaA 4-hydroxyphenyl  73.0     6.7 0.00014   26.1   3.7   26   31-56     61-86  (287)
 81 PLN00212 glutelin; Provisional  71.9     9.5 0.00021   29.0   4.7   32   26-57    142-173 (493)
 82 PRK12335 tellurite resistance   68.1      17 0.00036   24.8   4.9   31   34-64     60-92  (287)
 83 PF05995 CDO_I:  Cysteine dioxy  67.3      27 0.00059   22.5   5.6   33   32-64    127-162 (175)
 84 COG3822 ABC-type sugar transpo  64.2      15 0.00033   25.3   4.0   28   29-56    150-177 (225)
 85 PRK10371 DNA-binding transcrip  62.8     7.1 0.00015   26.9   2.3   27   30-56     62-88  (302)
 86 PF12851 Tet_JBP:  Oxygenase do  62.2      22 0.00048   23.0   4.4   36   30-65    126-169 (171)
 87 PF06251 Caps_synth_GfcC:  Caps  62.2       6 0.00013   26.5   1.8   13   34-46    191-203 (229)
 88 COG3435 Gentisate 1,2-dioxygen  61.4     5.1 0.00011   29.3   1.4   31   26-56    125-155 (351)
 89 cd02786 MopB_CT_3 The MopB_CT_  59.7     6.7 0.00014   22.9   1.5   12   40-51     73-84  (116)
 90 PRK10572 DNA-binding transcrip  57.5      15 0.00032   24.6   3.1   25   31-55     66-90  (290)
 91 TIGR02408 ectoine_ThpD ectoine  56.8     8.3 0.00018   26.4   1.8   25   31-55    210-234 (277)
 92 TIGR01762 chlorin-enz chlorina  55.7      15 0.00032   25.6   2.9   27   29-55    206-232 (288)
 93 COG3128 PiuC Uncharacterized i  54.9      25 0.00055   24.2   3.8   34   31-64    143-178 (229)
 94 PF01476 LysM:  LysM domain;  I  53.5     6.3 0.00014   19.1   0.6   10   36-45     35-44  (44)
 95 TIGR02272 gentisate_1_2 gentis  52.8      36 0.00078   24.7   4.6   27   32-58    288-314 (335)
 96 PRK09685 DNA-binding transcrip  52.7      33 0.00071   23.0   4.2   26   31-56     88-113 (302)
 97 cd02782 MopB_CT_1 The MopB_CT_  52.5     8.5 0.00019   23.0   1.2   13   40-52     75-87  (129)
 98 KOG0162|consensus               52.4      11 0.00023   30.9   1.9   23   29-51   1063-1089(1106)
 99 cd02794 MopB_CT_DmsA-EC The Mo  51.8     9.1  0.0002   22.7   1.2   13   40-52     72-84  (121)
100 PF14604 SH3_9:  Variant SH3 do  51.6     6.6 0.00014   20.2   0.5   19   33-51     12-34  (49)
101 PRK15078 polysaccharide export  51.5     9.9 0.00022   27.6   1.6   26   31-56    236-261 (379)
102 PF04970 LRAT:  Lecithin retino  50.6     6.7 0.00014   23.6   0.5   23   34-56      5-28  (125)
103 KOG2757|consensus               50.2      22 0.00048   26.6   3.2   27   31-57    371-397 (411)
104 PF02080 TrkA_C:  TrkA-C domain  49.9     8.5 0.00018   20.4   0.8   12   33-44     45-56  (71)
105 PF15340 COPR5:  Cooperator of   49.7       7 0.00015   25.4   0.5   31   26-58    106-136 (152)
106 COG3837 Uncharacterized conser  49.6      38 0.00082   22.4   3.9   27   31-57     81-109 (161)
107 TIGR02899 spore_safA spore coa  48.9      11 0.00025   17.7   1.1   12   35-46     33-44  (44)
108 PF10014 2OG-Fe_Oxy_2:  2OG-Fe   48.2      15 0.00032   24.2   1.9   30   27-56    148-178 (195)
109 COG3257 GlxB Uncharacterized p  46.7      42 0.00091   23.7   4.0   26   31-56    100-125 (264)
110 PRK15175 Vi polysaccharide exp  46.4      14  0.0003   26.9   1.6   16   32-47    222-237 (355)
111 KOG2757|consensus               45.1      21 0.00045   26.8   2.4   24   33-56    249-272 (411)
112 PRK13498 chemoreceptor glutami  45.0      76  0.0016   20.7   4.8   34   30-64      6-39  (167)
113 cd02781 MopB_CT_Acetylene-hydr  44.6      15 0.00032   21.9   1.3   13   40-52     75-87  (130)
114 PF01427 Peptidase_M15:  D-ala-  43.7      11 0.00023   25.2   0.7   16   40-55    170-185 (198)
115 PF00877 NLPC_P60:  NlpC/P60 fa  43.2      13 0.00028   21.4   1.0   16   31-46     47-62  (105)
116 PF02041 Auxin_BP:  Auxin bindi  42.8      40 0.00086   22.4   3.2   28   29-56     88-115 (167)
117 TIGR01015 hmgA homogentisate 1  42.1      33 0.00072   25.9   3.1   23   34-56    168-190 (429)
118 PRK05341 homogentisate 1,2-dio  42.1      34 0.00073   25.9   3.1   23   34-56    174-196 (438)
119 PRK00924 5-keto-4-deoxyuronate  41.9      24 0.00052   25.1   2.2   18   32-49     92-109 (276)
120 TIGR02451 anti_sig_ChrR anti-s  41.6      33 0.00071   22.9   2.8   28   35-62    164-193 (215)
121 TIGR02219 phage_NlpC_fam putat  41.6      15 0.00033   22.6   1.1   11   35-45     76-86  (134)
122 PF11543 UN_NPL4:  Nuclear pore  41.5      15 0.00031   21.1   0.9   19   27-45     60-78  (80)
123 TIGR03805 beta_helix_1 paralle  41.3      38 0.00081   23.9   3.2   15   36-50      8-22  (314)
124 PF04209 HgmA:  homogentisate 1  41.2      32  0.0007   25.8   2.9   24   34-57    166-189 (424)
125 cd02792 MopB_CT_Formate-Dh-Na-  40.8      23 0.00049   20.7   1.7   12   40-51     77-88  (122)
126 PF07653 SH3_2:  Variant SH3 do  40.5     8.9 0.00019   19.9  -0.1   21   32-52     14-39  (55)
127 TIGR03505 FimV_core FimV N-ter  39.8      19 0.00041   20.5   1.2   13   35-47     40-52  (74)
128 PF11699 CENP-C_C:  Mif2/CENP-C  39.7      78  0.0017   18.4   4.5   27   31-57     50-76  (85)
129 PF11142 DUF2917:  Protein of u  39.6      32  0.0007   18.8   2.1   18   30-47     34-51  (63)
130 PLN02658 homogentisate 1,2-dio  39.6      39 0.00084   25.6   3.1   23   34-56    167-189 (435)
131 COG1935 Uncharacterized conser  39.6      18 0.00039   22.9   1.2   12   35-46     38-49  (122)
132 smart00306 HintN Hint (Hedgeho  38.7      21 0.00046   19.7   1.3   10   36-45     90-99  (100)
133 PF14578 GTP_EFTU_D4:  Elongati  38.7      18  0.0004   21.1   1.1   10   35-44     72-81  (81)
134 cd02785 MopB_CT_4 The MopB_CT_  38.2      20 0.00043   21.3   1.2   12   40-51     74-85  (124)
135 cd02793 MopB_CT_DMSOR-BSOR-TMA  36.7      24 0.00052   21.2   1.4   12   40-51     75-86  (129)
136 KOG3905|consensus               36.7      18  0.0004   27.1   1.0   11   38-48    302-312 (473)
137 COG2164 Uncharacterized conser  36.2      17 0.00036   22.9   0.6   16   34-49     63-78  (126)
138 TIGR00568 alkb DNA alkylation   35.6      58  0.0013   21.1   3.2   20   27-46    141-160 (169)
139 PF05118 Asp_Arg_Hydrox:  Aspar  35.0 1.1E+02  0.0024   19.3   4.4   27   31-57    123-149 (163)
140 cd02791 MopB_CT_Nitrate-R-NapA  34.8      47   0.001   19.3   2.5   20   32-51     66-88  (122)
141 cd00118 LysM Lysin domain, fou  34.5      27 0.00058   15.6   1.1   12   34-45     35-46  (46)
142 PF02814 UreE_N:  UreE urease a  34.1      27 0.00058   19.0   1.2   12   36-47     51-62  (65)
143 PF05708 DUF830:  Orthopoxvirus  33.9      24 0.00051   21.6   1.1    9   36-44      2-10  (158)
144 cd00250 CAS_like Clavaminic ac  33.7      42  0.0009   22.4   2.3   15   30-44    218-232 (262)
145 PRK00794 flbT flagellar biosyn  33.7      42 0.00091   21.2   2.2   17   31-47      3-19  (132)
146 PF07228 SpoIIE:  Stage II spor  33.2      37 0.00079   21.0   1.9   17   28-44    113-129 (193)
147 cd00508 MopB_CT_Fdh-Nap-like T  33.1      52  0.0011   18.9   2.4   12   40-51     77-88  (120)
148 PF05783 DLIC:  Dynein light in  32.4      19 0.00041   27.2   0.5   14   35-48    273-286 (472)
149 PRK10178 D-alanyl-D-alanine di  32.2      16 0.00035   24.4   0.2   13   41-53    156-168 (184)
150 PRK10838 spr outer membrane li  31.6      31 0.00068   23.0   1.5   14   34-47    127-140 (190)
151 PRK06933 type III secretion sy  31.1      32 0.00069   24.6   1.5   14   35-48    175-188 (308)
152 COG2173 DdpX D-alanyl-D-alanin  30.8      16 0.00035   25.1  -0.1   24   31-54    175-198 (211)
153 cd00174 SH3 Src homology 3 dom  30.3      25 0.00054   16.9   0.6   23   32-55     14-40  (54)
154 PF02668 TauD:  Taurine catabol  29.9      46 0.00099   21.5   2.0   25   29-53    218-243 (258)
155 cd02777 MopB_CT_DMSOR-like The  29.5      34 0.00074   20.3   1.2   12   40-51     76-87  (127)
156 PRK12791 flbT flagellar biosyn  29.4      53  0.0011   20.8   2.1   17   31-47      2-18  (131)
157 PRK13488 chemoreceptor glutami  29.2 1.6E+02  0.0035   18.9   4.8   30   33-64      3-34  (157)
158 PRK13494 chemoreceptor glutami  29.0 1.5E+02  0.0033   19.3   4.3   32   30-64     10-42  (163)
159 PF07378 FlbT:  Flagellar prote  28.7      44 0.00095   20.9   1.7   15   33-47      2-16  (126)
160 PF03171 2OG-FeII_Oxy:  2OG-Fe(  28.5      96  0.0021   17.2   3.0   33   33-65     57-96  (98)
161 smart00326 SH3 Src homology 3   28.2      26 0.00057   16.9   0.5   22   30-51     15-40  (58)
162 PF00122 E1-E2_ATPase:  E1-E2 A  28.1      27 0.00059   22.5   0.7   13   35-47     51-63  (230)
163 COG2255 RuvB Holliday junction  28.0      31 0.00067   25.2   1.0   14   32-45     97-110 (332)
164 PF05726 Pirin_C:  Pirin C-term  28.0 1.2E+02  0.0026   17.6   3.5   30   32-61      2-33  (104)
165 PRK10202 ebgC cryptic beta-D-g  27.7      64  0.0014   20.4   2.3   22   32-53    106-127 (149)
166 PF04322 DUF473:  Protein of un  27.6      37 0.00079   21.3   1.2   12   35-46     38-49  (119)
167 COG0490 Putative regulatory, l  27.5      33 0.00071   22.7   1.0   12   33-44    132-143 (162)
168 PRK13495 chemoreceptor glutami  27.2 1.8E+02  0.0039   18.8   4.7   31   32-64      2-34  (159)
169 COG3508 HmgA Homogentisate 1,2  26.9   1E+02  0.0022   23.3   3.5   25   33-57    165-189 (427)
170 cd02779 MopB_CT_Arsenite-Ox Th  26.8      41 0.00088   19.7   1.2   11   40-50     75-85  (115)
171 PF00018 SH3_1:  SH3 domain;  I  26.5      14  0.0003   18.6  -0.8   29   26-55      6-38  (48)
172 PRK14125 cell division suppres  26.3      40 0.00088   20.2   1.2   13   35-47     79-91  (103)
173 cd02780 MopB_CT_Tetrathionate_  25.9      43 0.00094   20.4   1.3   12   40-51     72-83  (143)
174 cd02789 MopB_CT_FmdC-FwdD The   25.5      42 0.00091   19.7   1.1   10   40-49     73-82  (106)
175 TIGR02410 carnitine_TMLD trime  25.4      68  0.0015   22.9   2.4   15   30-44    311-325 (362)
176 PF10162 G8:  G8 domain;  Inter  25.1      64  0.0014   19.6   1.9   19   37-55     10-28  (125)
177 PF07591 PT-HINT:  Pretoxin HIN  25.0      72  0.0016   19.6   2.1   22   35-56     76-97  (130)
178 cd02790 MopB_CT_Formate-Dh_H F  24.7      46   0.001   19.1   1.2   12   40-51     77-88  (116)
179 KOG0572|consensus               24.5      62  0.0013   24.7   2.0   25   29-56    217-241 (474)
180 PF02938 GAD:  GAD domain;  Int  24.5      39 0.00084   19.6   0.8   17   34-50     73-89  (95)
181 PF00856 SET:  SET domain;  Int  24.2      34 0.00074   19.7   0.6   13   35-47      9-21  (162)
182 PRK11198 LysM domain/BON super  24.1      48   0.001   20.8   1.3   11   36-46    137-147 (147)
183 PF14525 AraC_binding_2:  AraC-  23.9 1.7E+02  0.0036   17.3   4.8   27   31-57     72-98  (172)
184 PF03737 Methyltransf_6:  Demet  23.7      64  0.0014   20.4   1.8   10   34-43    137-146 (154)
185 PRK13491 chemoreceptor glutami  23.6 2.4E+02  0.0053   19.1   5.6   35   28-64     10-46  (199)
186 KOG2414|consensus               23.5      57  0.0012   25.0   1.7   19   28-48    424-442 (488)
187 PF08857 ParBc_2:  Putative Par  23.4   1E+02  0.0022   20.1   2.7   25   22-46     36-60  (163)
188 PF11649 T4_neck-protein:  Viru  23.4      49  0.0011   23.0   1.3   11   37-47    115-125 (230)
189 cd02784 MopB_CT_PHLH The MopB_  23.3      43 0.00094   20.9   0.9   14   40-53     80-93  (137)
190 PF01426 BAH:  BAH domain;  Int  22.9      64  0.0014   18.5   1.6   15   34-48      1-15  (119)
191 PRK11479 hypothetical protein;  22.5      93   0.002   22.0   2.5   13   33-45     62-74  (274)
192 cd02775 MopB_CT Molybdopterin-  22.3 1.3E+02  0.0028   16.5   2.7   12   40-51     65-76  (101)
193 PF07915 PRKCSH:  Glucosidase I  22.1      50  0.0011   18.1   0.9   16   39-54      1-16  (81)
194 KOG1633|consensus               22.0      48   0.001   26.8   1.1   31   25-55    541-571 (776)
195 TIGR00022 uncharacterized prot  21.8 1.4E+02   0.003   18.5   3.0   26   28-53    109-134 (142)
196 PF02563 Poly_export:  Polysacc  21.7      55  0.0012   18.3   1.1   13   32-44      9-21  (82)
197 cd04721 BAH_plant_1 BAH, or Br  21.5      74  0.0016   19.7   1.7   11   34-44      6-16  (130)
198 PF13533 Biotin_lipoyl_2:  Biot  21.4      85  0.0018   16.0   1.7    8   36-43     24-31  (50)
199 PF06525 SoxE:  Sulfocyanin (So  21.4      50  0.0011   22.4   1.0   16   41-56     87-102 (196)
200 COG4340 Uncharacterized protei  21.3      97  0.0021   21.4   2.4   30   26-55    169-199 (226)
201 PF04074 DUF386:  Domain of unk  21.2 1.5E+02  0.0032   18.5   3.1   25   29-53    110-134 (153)
202 TIGR02998 RraA_entero regulato  21.1      79  0.0017   20.6   1.8   11   33-43    135-145 (161)
203 PRK09372 ribonuclease activity  21.0      80  0.0017   20.4   1.8   10   34-43    136-145 (159)
204 PF13550 Phage-tail_3:  Putativ  20.7      57  0.0012   19.7   1.1   14   34-47    138-151 (164)
205 cd02778 MopB_CT_Thiosulfate-R-  20.5      57  0.0012   19.0   1.0   12   40-51     72-83  (123)
206 PF01568 Molydop_binding:  Moly  20.5      30 0.00065   19.7  -0.2   11   40-50     72-82  (110)
207 PF00568 WH1:  WH1 domain;  Int  20.4   2E+02  0.0043   16.9   5.3   37   27-63     58-94  (111)
208 PF08255 Leader_Trp:  Trp-opero  20.1      53  0.0012   13.3   0.6   10   42-51      4-13  (14)

No 1  
>PF13621 Cupin_8:  Cupin-like domain; PDB: 3AL6_C 3AL5_C 2XUM_A 2Y0I_A 1MZE_A 3KCY_A 1MZF_A 1YCI_A 2ILM_A 1H2L_A ....
Probab=99.82  E-value=1.1e-20  Score=123.52  Aligned_cols=56  Identities=34%  Similarity=0.634  Sum_probs=41.1

Q ss_pred             eecccCCccccCccCCCCCCCCceEEEEECCCCEEEeCCCCeEEEEec--CC-eEEEEEeC
Q psy12933          8 ETSQSNSKEEAQSSAPHNQTPGHLIECTLNPGDMLYIPPKVWHHVRSL--ST-SISVSFWF   65 (65)
Q Consensus         8 ~s~~~~~~~d~~~~~~p~~~~~~~~~~~l~pGd~LyiP~~WwH~V~s~--~~-sisvn~w~   65 (65)
                      +|.++.+..|.  +.+|.++++++++++|+|||+||||+||||+|+++  ++ +||||+||
T Consensus       187 ~~~~d~~~~d~--~~~p~~~~~~~~~~~l~pGD~LfiP~gWwH~V~~~~~~~~sisvn~w~  245 (251)
T PF13621_consen  187 FSWVDPDNPDL--ERFPKFRKAPPYEVVLEPGDVLFIPPGWWHQVENLSDDDLSISVNYWF  245 (251)
T ss_dssp             BBSS-TTS--T--TT-CGGGG--EEEEEEETT-EEEE-TT-EEEEEESTTSSCEEEEEEEE
T ss_pred             eeeeeccChhh--hhhhhhccCceeEEEECCCeEEEECCCCeEEEEEcCCCCeEEEEEEEe
Confidence            45555544444  56999999999999999999999999999999999  65 99999997


No 2  
>KOG2132|consensus
Probab=99.16  E-value=4.5e-13  Score=95.60  Aligned_cols=54  Identities=35%  Similarity=0.622  Sum_probs=46.5

Q ss_pred             cceecccCCccccCccCCCCCCCCceEEEEECCCCEEEeCCCCeEEEEecCCeEEE
Q psy12933          6 KCETSQSNSKEEAQSSAPHNQTPGHLIECTLNPGDMLYIPPKVWHHVRSLSTSISV   61 (65)
Q Consensus         6 ~~~s~~~~~~~d~~~~~~p~~~~~~~~~~~l~pGd~LyiP~~WwH~V~s~~~sisv   61 (65)
                      .|+|++++...|  .+.||++++++.++++|+|||+||+|..|||.|++++.++|+
T Consensus       302 ~~tsqvdvenPd--lk~fp~~~k~~~l~~lL~pGe~L~iP~kwwhyvrs~d~s~s~  355 (355)
T KOG2132|consen  302 LETSQVDVENPD--LKAFPKFAKARFLDCLLEPGEALFIPPKWWHYVRSLDKSFSV  355 (355)
T ss_pred             hcccccccCCCC--hhhhhHHHHHHHHHHhcCCchhccccHHHhhhhhhccccccC
Confidence            457777775544  456999999999999999999999999999999999988764


No 3  
>PF08007 Cupin_4:  Cupin superfamily protein;  InterPro: IPR022777  This signature represents primarily the cupin fold found in JmjC transcription factors. The fold is also found in lysine-specific demethylase NO66.; PDB: 2XDV_A 1VRB_B 4DIQ_B.
Probab=98.95  E-value=1.3e-09  Score=76.34  Aligned_cols=35  Identities=31%  Similarity=0.689  Sum_probs=26.6

Q ss_pred             CceEEEEECCCCEEEeCCCCeEEEEecCCeEEEEE
Q psy12933         29 GHLIECTLNPGDMLYIPPKVWHHVRSLSTSISVSF   63 (65)
Q Consensus        29 ~~~~~~~l~pGd~LyiP~~WwH~V~s~~~sisvn~   63 (65)
                      .+..+++|+|||+||||+||||++.+.+.|+++++
T Consensus       173 ~~~~~~~L~pGD~LYlPrG~~H~~~~~~~S~hltv  207 (319)
T PF08007_consen  173 EPVEEVVLEPGDVLYLPRGWWHQAVTTDPSLHLTV  207 (319)
T ss_dssp             STSEEEEE-TT-EEEE-TT-EEEEEESS-EEEEEE
T ss_pred             ceeEEEEECCCCEEEECCCccCCCCCCCCceEEEE
Confidence            56889999999999999999999999997766665


No 4  
>KOG2130|consensus
Probab=98.94  E-value=1.3e-09  Score=78.30  Aligned_cols=44  Identities=25%  Similarity=0.542  Sum_probs=37.5

Q ss_pred             ccCCCCCCC-CceEEEEECCCCEEEeCCCCeEEEEecCCeEEEEE
Q psy12933         20 SSAPHNQTP-GHLIECTLNPGDMLYIPPKVWHHVRSLSTSISVSF   63 (65)
Q Consensus        20 ~~~~p~~~~-~~~~~~~l~pGd~LyiP~~WwH~V~s~~~sisvn~   63 (65)
                      +.+.|.... -+++||+++|||++|+|.||||-|.+++++|+|.-
T Consensus       250 rt~~Pswp~E~kPIEc~q~pGEt~fVP~GWWHvVlNle~TIAiTq  294 (407)
T KOG2130|consen  250 RTQLPSWPDEYKPIECLQKPGETMFVPSGWWHVVLNLEPTIAITQ  294 (407)
T ss_pred             cccCCCCccccCCceeeecCCceEEecCCeEEEEeccCceeeeee
Confidence            345566554 47999999999999999999999999999999863


No 5  
>PF02373 JmjC:  JmjC domain, hydroxylase;  InterPro: IPR013129 Jumonji protein is required for neural tube formation in mice [].There is evidence of domain swapping within the jumonji family of transcription factors []. This domain is often associated with jmjN (see IPR003349 from INTERPRO) and belongs to the Cupin superfamily [].; PDB: 2YU2_A 2YU1_A 3AVR_A 3AVS_A 2OX0_B 2OQ6_B 2WWJ_A 2Q8D_A 3PDQ_A 2YBK_A ....
Probab=98.86  E-value=6.4e-09  Score=61.60  Aligned_cols=37  Identities=27%  Similarity=0.542  Sum_probs=28.1

Q ss_pred             CCCceEEEEECCCCEEEeCCCCeEEEEecCCeEEEEE
Q psy12933         27 TPGHLIECTLNPGDMLYIPPKVWHHVRSLSTSISVSF   63 (65)
Q Consensus        27 ~~~~~~~~~l~pGd~LyiP~~WwH~V~s~~~sisvn~   63 (65)
                      +..+.++++++|||.+++|+|++|+|.+++.+++++.
T Consensus        76 ~gi~~~~~~Q~~Ge~V~i~pg~~H~v~n~g~~i~~a~  112 (114)
T PF02373_consen   76 AGIPVYRFVQKPGEFVFIPPGAYHQVFNLGDNISEAV  112 (114)
T ss_dssp             TTS--EEEEEETT-EEEE-TT-EEEEEESSSEEEEEE
T ss_pred             cCcccccceECCCCEEEECCCceEEEEeCCceEEEEe
Confidence            3568999999999999999999999999998776654


No 6  
>KOG2131|consensus
Probab=98.82  E-value=1.6e-09  Score=78.73  Aligned_cols=38  Identities=29%  Similarity=0.580  Sum_probs=35.1

Q ss_pred             CCceEEEEECCCCEEEeCCCCeEEEEecCCeEEEEEeC
Q psy12933         28 PGHLIECTLNPGDMLYIPPKVWHHVRSLSTSISVSFWF   65 (65)
Q Consensus        28 ~~~~~~~~l~pGd~LyiP~~WwH~V~s~~~sisvn~w~   65 (65)
                      ..+.++++++|||++|+|+||.|||-+++++||||.-|
T Consensus       264 ~~~~lei~Qepge~VFvPsGW~hQV~NL~dTISINHNW  301 (427)
T KOG2131|consen  264 RGPLLEIFQEPGETVFVPSGWHHQVLNLGDTISINHNW  301 (427)
T ss_pred             ccchhhhhccCCceeeccCccccccccccceeeecccc
Confidence            45669999999999999999999999999999999866


No 7  
>COG2850 Uncharacterized conserved protein [Function unknown]
Probab=98.35  E-value=1.4e-07  Score=68.45  Aligned_cols=36  Identities=39%  Similarity=0.523  Sum_probs=30.9

Q ss_pred             CCceEEEEECCCCEEEeCCCCeEEEEecCC--eEEEEE
Q psy12933         28 PGHLIECTLNPGDMLYIPPKVWHHVRSLST--SISVSF   63 (65)
Q Consensus        28 ~~~~~~~~l~pGd~LyiP~~WwH~V~s~~~--sisvn~   63 (65)
                      ....++.+|+|||+||||++|||+..+.+.  ++||.+
T Consensus       175 f~~~~d~vlepGDiLYiPp~~~H~gvae~dc~tySvG~  212 (383)
T COG2850         175 FEPDIDEVLEPGDILYIPPGFPHYGVAEDDCMTYSVGF  212 (383)
T ss_pred             CCchhhhhcCCCceeecCCCCCcCCcccccccceeeec
Confidence            456788999999999999999999999974  566655


No 8  
>KOG2508|consensus
Probab=97.47  E-value=6.3e-05  Score=55.15  Aligned_cols=47  Identities=21%  Similarity=0.388  Sum_probs=41.0

Q ss_pred             CccCCCCCCC-CceEEEEECCCCEEEeCCCCeEEEEecC----CeEEEEEeC
Q psy12933         19 QSSAPHNQTP-GHLIECTLNPGDMLYIPPKVWHHVRSLS----TSISVSFWF   65 (65)
Q Consensus        19 ~~~~~p~~~~-~~~~~~~l~pGd~LyiP~~WwH~V~s~~----~sisvn~w~   65 (65)
                      .++++|.+.. ..+..+.+.+|+++|.|..|.|+|-..+    .+|+||+|+
T Consensus       256 ~~~k~~l~~~~~k~~~~rv~a~~i~~lp~~w~h~~sq~~gd~~~~Iavn~~~  307 (437)
T KOG2508|consen  256 PQEKIPLLEAEYKPEMDRVLAGQILYLPEDWFHHVSQVGGDPESDIAVNEWY  307 (437)
T ss_pred             chhhCccccccccceeeeeeccccccChhhhhhhccccCCCccceeEEeeec
Confidence            4556777765 7899999999999999999999999984    589999996


No 9  
>COG2140 Thermophilic glucose-6-phosphate isomerase and related metalloenzymes [Carbohydrate transport and metabolism / General function prediction only]
Probab=97.15  E-value=0.0015  Score=44.43  Aligned_cols=29  Identities=24%  Similarity=0.465  Sum_probs=26.0

Q ss_pred             CceEEEEECCCCEEEeCCCCeEEEEecCC
Q psy12933         29 GHLIECTLNPGDMLYIPPKVWHHVRSLST   57 (65)
Q Consensus        29 ~~~~~~~l~pGd~LyiP~~WwH~V~s~~~   57 (65)
                      .....+.+++||++|||++|-|.+.|.+.
T Consensus       123 G~~~v~~~~~Gd~iyVPp~~gH~t~N~Gd  151 (209)
T COG2140         123 GEARVIAVRAGDVIYVPPGYGHYTINTGD  151 (209)
T ss_pred             CcEEEEEecCCcEEEeCCCcceEeecCCC
Confidence            34788899999999999999999999973


No 10 
>KOG3706|consensus
Probab=97.01  E-value=0.00031  Score=53.28  Aligned_cols=28  Identities=32%  Similarity=0.608  Sum_probs=25.5

Q ss_pred             CceEEEEECCCCEEEeCCCCeEEEEecC
Q psy12933         29 GHLIECTLNPGDMLYIPPKVWHHVRSLS   56 (65)
Q Consensus        29 ~~~~~~~l~pGd~LyiP~~WwH~V~s~~   56 (65)
                      .+.++.+|+|||+||+|.|.-||.+.-+
T Consensus       378 ePV~e~vle~GDllYfPRG~IHQA~t~~  405 (629)
T KOG3706|consen  378 EPVHEFVLEPGDLLYFPRGTIHQADTPA  405 (629)
T ss_pred             CchHHhhcCCCcEEEecCcceeeccccc
Confidence            5789999999999999999999998765


No 11 
>PF06560 GPI:  Glucose-6-phosphate isomerase (GPI);  InterPro: IPR010551 This entry consists of several bacterial and archaeal glucose-6-phosphate isomerase (GPI) proteins (5.3.1.9 from EC), which are involved in glycolysis and in gluconeogenesis and catalyse the conversion of D-glucose 6-phosphate to D-fructose 6-phosphate. The deduced amino acid sequence of the first archaeal PGI isolated from Pyrococcus furiosus revealed that it is not related to its eukaryotic and many of its bacterial counterparts. In contrast, this archaeal PGI shares similarity with the cupin superfamily that consists of a variety of proteins that are generally involved in sugar metabolism in both prokaryotes and eukaryotes [].; GO: 0004347 glucose-6-phosphate isomerase activity, 0006094 gluconeogenesis, 0006096 glycolysis, 0005737 cytoplasm; PDB: 1J3Q_B 1J3R_B 1J3P_A 2GC0_A 1X8E_A 1X82_A 1QY4_B 2GC2_B 1QXJ_A 1QXR_B ....
Probab=97.01  E-value=0.0017  Score=43.19  Aligned_cols=36  Identities=28%  Similarity=0.637  Sum_probs=24.6

Q ss_pred             ceEEEEECCCCEEEeCCCCeEEEEecCC-eEEEEEeC
Q psy12933         30 HLIECTLNPGDMLYIPPKVWHHVRSLST-SISVSFWF   65 (65)
Q Consensus        30 ~~~~~~l~pGd~LyiP~~WwH~V~s~~~-sisvn~w~   65 (65)
                      ..+-+.++|||+++||++|-|...|.+. .+.+..||
T Consensus       108 ~~~~v~~~~G~~v~IPp~yaH~tIN~g~~~L~~~~~~  144 (182)
T PF06560_consen  108 DVIAVEAKPGDVVYIPPGYAHRTINTGDEPLVFAAWV  144 (182)
T ss_dssp             -EEEEEE-TTEEEEE-TT-EEEEEE-SSS-EEEEEEE
T ss_pred             eEEEEEeCCCCEEEECCCceEEEEECCCCcEEEEEEE
Confidence            4667889999999999999999999873 55555554


No 12 
>PF00190 Cupin_1:  Cupin;  InterPro: IPR006045 This family represents the conserved barrel domain of the 'cupin' superfamily ('cupa' is the Latin term for a small barrel). This family contains 11S and 7S plant seed storage proteins, and germins. Plant seed storage proteins provide the major nitrogen source for the developing plant. ; GO: 0045735 nutrient reservoir activity; PDB: 2E9Q_A 2EVX_A 1OD5_A 1UCX_A 1UD1_C 1FXZ_C 3KGL_C 3KSC_D 1UIJ_F 1IPK_B ....
Probab=96.90  E-value=0.0033  Score=39.11  Aligned_cols=28  Identities=18%  Similarity=0.330  Sum_probs=21.4

Q ss_pred             EECCCCEEEeCCCCeEEEEecC--CeEEEE
Q psy12933         35 TLNPGDMLYIPPKVWHHVRSLS--TSISVS   62 (65)
Q Consensus        35 ~l~pGd~LyiP~~WwH~V~s~~--~sisvn   62 (65)
                      .+++||++++|+|+.|.+.+.+  ....+.
T Consensus        87 ~l~~Gdv~~vP~G~~h~~~n~~~~~~~~~~  116 (144)
T PF00190_consen   87 RLKAGDVFVVPAGHPHWIINDGDDEALVLI  116 (144)
T ss_dssp             EEETTEEEEE-TT-EEEEEECSSSSEEEEE
T ss_pred             eeecccceeeccceeEEEEcCCCCCCEEEE
Confidence            4999999999999999999994  444443


No 13 
>PRK04190 glucose-6-phosphate isomerase; Provisional
Probab=96.88  E-value=0.0041  Score=41.52  Aligned_cols=34  Identities=18%  Similarity=0.375  Sum_probs=27.0

Q ss_pred             eEEEEECCCCEEEeCCCCeEEEEecC--CeEEEEEe
Q psy12933         31 LIECTLNPGDMLYIPPKVWHHVRSLS--TSISVSFW   64 (65)
Q Consensus        31 ~~~~~l~pGd~LyiP~~WwH~V~s~~--~sisvn~w   64 (65)
                      .....++|||+++||++|-|++.+.+  +-..++.|
T Consensus       119 ~~~~~v~pGd~v~IPpg~~H~~iN~G~epl~fl~v~  154 (191)
T PRK04190        119 ARWIEMEPGTVVYVPPYWAHRSVNTGDEPLVFLACY  154 (191)
T ss_pred             EEEEEECCCCEEEECCCCcEEeEECCCCCEEEEEEE
Confidence            44678999999999999999999976  34444444


No 14 
>COG0662 {ManC} Mannose-6-phosphate isomerase [Carbohydrate transport and metabolism]
Probab=96.81  E-value=0.0019  Score=39.86  Aligned_cols=27  Identities=33%  Similarity=0.487  Sum_probs=24.4

Q ss_pred             eEEEEECCCCEEEeCCCCeEEEEecCC
Q psy12933         31 LIECTLNPGDMLYIPPKVWHHVRSLST   57 (65)
Q Consensus        31 ~~~~~l~pGd~LyiP~~WwH~V~s~~~   57 (65)
                      .-++.|++||++|||+|-.|++++.+.
T Consensus        74 ~~~~~v~~gd~~~iP~g~~H~~~N~G~  100 (127)
T COG0662          74 GEEVEVKAGDSVYIPAGTPHRVRNTGK  100 (127)
T ss_pred             CEEEEecCCCEEEECCCCcEEEEcCCC
Confidence            457889999999999999999999874


No 15 
>PF07883 Cupin_2:  Cupin domain;  InterPro: IPR013096 This family represents the conserved barrel domain of the cupin superfamily [] (cupa is the Latin term for a small barrel). ; PDB: 2OPK_C 3BU7_B 2PHD_D 3NVC_A 3NKT_A 3NJZ_A 3NW4_A 3NST_A 3NL1_A 2H0V_A ....
Probab=96.76  E-value=0.0036  Score=33.73  Aligned_cols=26  Identities=46%  Similarity=0.795  Sum_probs=23.9

Q ss_pred             EEEEECCCCEEEeCCCCeEEEEecCC
Q psy12933         32 IECTLNPGDMLYIPPKVWHHVRSLST   57 (65)
Q Consensus        32 ~~~~l~pGd~LyiP~~WwH~V~s~~~   57 (65)
                      -...+++||++++|++=+|++.+.+.
T Consensus        37 ~~~~l~~Gd~~~i~~~~~H~~~n~~~   62 (71)
T PF07883_consen   37 ERVELKPGDAIYIPPGVPHQVRNPGD   62 (71)
T ss_dssp             EEEEEETTEEEEEETTSEEEEEEESS
T ss_pred             EEeEccCCEEEEECCCCeEEEEECCC
Confidence            47899999999999999999999874


No 16 
>KOG2508|consensus
Probab=96.44  E-value=0.0071  Score=44.62  Aligned_cols=61  Identities=28%  Similarity=0.498  Sum_probs=45.3

Q ss_pred             CcceecccCCc-cccCccCCCCCCCCceEEEEECCCCEEEeCCCCeEEEEecCC-------eEEEEEeC
Q psy12933          5 LKCETSQSNSK-EEAQSSAPHNQTPGHLIECTLNPGDMLYIPPKVWHHVRSLST-------SISVSFWF   65 (65)
Q Consensus         5 ~~~~s~~~~~~-~d~~~~~~p~~~~~~~~~~~l~pGd~LyiP~~WwH~V~s~~~-------sisvn~w~   65 (65)
                      ++.|+..+..+ .|.+...+-..+..-.+-+++++|+++|+|+-|.|.+++-..       .|.+|+|+
T Consensus       333 ~~sf~~~~d~e~~d~~~~~l~~lA~~~~l~v~~~~~~~~~l~~~~~~~~~~~~~ssg~~~v~l~~~y~~  401 (437)
T KOG2508|consen  333 LHKFMGFDDAKNVDVDKDELSALAGTVPLVVDLEPGDMLYLPASWFHEVTSSSASSGGSDVHIAFNYWF  401 (437)
T ss_pred             hhhhccCCcccccccchhhccccccceeEEEecccCceeeechhheeeeeccccccCCcceeEEeeccc
Confidence            45555543333 455555565666778899999999999999999999998652       58888885


No 17 
>COG4101 Predicted mannose-6-phosphate isomerase [Carbohydrate transport and metabolism]
Probab=95.94  E-value=0.011  Score=37.89  Aligned_cols=29  Identities=38%  Similarity=0.603  Sum_probs=24.8

Q ss_pred             CCceEEEEECCCCEEEeCCCCeEEEEecC
Q psy12933         28 PGHLIECTLNPGDMLYIPPKVWHHVRSLS   56 (65)
Q Consensus        28 ~~~~~~~~l~pGd~LyiP~~WwH~V~s~~   56 (65)
                      +--...++..|||++|||+|--|+-.++.
T Consensus        84 ~rLE~ha~~~pGDf~YiPpgVPHqp~N~S  112 (142)
T COG4101          84 NRLEEHAEVGPGDFFYIPPGVPHQPANLS  112 (142)
T ss_pred             cceeeeEEecCCCeEEcCCCCCCcccccC
Confidence            33456789999999999999999998876


No 18 
>KOG1633|consensus
Probab=95.83  E-value=0.01  Score=46.78  Aligned_cols=31  Identities=26%  Similarity=0.514  Sum_probs=27.2

Q ss_pred             CceEEEEECCCCEEEeCCCCeEEEEecCCeE
Q psy12933         29 GHLIECTLNPGDMLYIPPKVWHHVRSLSTSI   59 (65)
Q Consensus        29 ~~~~~~~l~pGd~LyiP~~WwH~V~s~~~si   59 (65)
                      ...+.|+|++|+.+|||.||-|.|..-..++
T Consensus       193 dkC~~~~l~~g~T~~iPsGwIhAV~Tp~d~l  223 (776)
T KOG1633|consen  193 DKCYKCILKQGQTLFIPSGWIHAVLTPTDCL  223 (776)
T ss_pred             ceeEEEEeccCceEecccceeEeeecCcchh
Confidence            4688999999999999999999999876543


No 19 
>smart00835 Cupin_1 Cupin. This family represents the conserved barrel domain of the 'cupin' superfamily ('cupa' is the Latin term for a small barrel). This family contains 11S and 7S plant seed storage proteins, and germins. Plant seed storage proteins provide the major nitrogen source for the developing plant.
Probab=95.65  E-value=0.035  Score=34.54  Aligned_cols=26  Identities=19%  Similarity=0.331  Sum_probs=23.1

Q ss_pred             eEEEEECCCCEEEeCCCCeEEEEecC
Q psy12933         31 LIECTLNPGDMLYIPPKVWHHVRSLS   56 (65)
Q Consensus        31 ~~~~~l~pGd~LyiP~~WwH~V~s~~   56 (65)
                      .....+++||+++||++..|...+.+
T Consensus        74 ~~~~~l~~GD~~~ip~g~~H~~~n~~   99 (146)
T smart00835       74 VYDARLREGDVFVVPQGHPHFQVNSG   99 (146)
T ss_pred             EEEEEecCCCEEEECCCCEEEEEcCC
Confidence            34778999999999999999999875


No 20 
>TIGR03404 bicupin_oxalic bicupin, oxalate decarboxylase family. Members of this protein family are defined as bicupins as they have two copies of the cupin domain (pfam00190). Two different known activities for members of this family are oxalate decarboxylase (EC 4.1.1.2) and oxalate oxidase (EC 1.2.3.4), although the latter activity has more often been found in distantly related monocupin (germin) proteins.
Probab=95.49  E-value=0.032  Score=40.28  Aligned_cols=26  Identities=23%  Similarity=0.368  Sum_probs=23.3

Q ss_pred             eEEEEECCCCEEEeCCCCeEEEEecC
Q psy12933         31 LIECTLNPGDMLYIPPKVWHHVRSLS   56 (65)
Q Consensus        31 ~~~~~l~pGd~LyiP~~WwH~V~s~~   56 (65)
                      ..+..+++||++|+|+|..|++++.+
T Consensus       288 ~~~~~l~~GD~~~iP~g~~H~i~N~G  313 (367)
T TIGR03404       288 ARTFDYQAGDVGYVPRNMGHYVENTG  313 (367)
T ss_pred             EEEEEECCCCEEEECCCCeEEEEECC
Confidence            44567999999999999999999986


No 21 
>KOG2132|consensus
Probab=95.47  E-value=0.024  Score=41.22  Aligned_cols=45  Identities=16%  Similarity=0.072  Sum_probs=34.0

Q ss_pred             cCCccccCccCCCCCCC--CceEEEEECCCCEEEeCCCCeEEEEecC
Q psy12933         12 SNSKEEAQSSAPHNQTP--GHLIECTLNPGDMLYIPPKVWHHVRSLS   56 (65)
Q Consensus        12 ~~~~~d~~~~~~p~~~~--~~~~~~~l~pGd~LyiP~~WwH~V~s~~   56 (65)
                      ..|...++..++|.|..  +....++..+|++++||..|||.+.+.-
T Consensus       231 k~Di~iPDyc~~~~f~~~~v~~~~w~GpaGtV~pih~dp~hNi~~qv  277 (355)
T KOG2132|consen  231 KFDISIPDYCSFPNFENEVVDINAWIGPAGTVLPIHMDPWHNILSQV  277 (355)
T ss_pred             hhccCCCceeecCCCCccccceeEEeccCCceeccccccccceeeee
Confidence            33444456667899988  5555566666999999999999999875


No 22 
>PRK13290 ectC L-ectoine synthase; Reviewed
Probab=95.42  E-value=0.05  Score=33.88  Aligned_cols=27  Identities=19%  Similarity=0.175  Sum_probs=23.6

Q ss_pred             eEEEEECCCCEEEeCCCCeEEEEecCC
Q psy12933         31 LIECTLNPGDMLYIPPKVWHHVRSLST   57 (65)
Q Consensus        31 ~~~~~l~pGd~LyiP~~WwH~V~s~~~   57 (65)
                      ..+..|+|||++|+|++=.|+.++.++
T Consensus        74 g~~~~L~aGD~i~~~~~~~H~~~N~e~  100 (125)
T PRK13290         74 GEVHPIRPGTMYALDKHDRHYLRAGED  100 (125)
T ss_pred             CEEEEeCCCeEEEECCCCcEEEEcCCC
Confidence            455789999999999999999999754


No 23 
>PF09313 DUF1971:  Domain of unknown function (DUF1971);  InterPro: IPR015392 This uncharacterised domain is predominantly found in bacterial Tellurite resistance proteins. ; PDB: 3BB6_C 3M70_A 3DL3_I.
Probab=95.17  E-value=0.057  Score=31.72  Aligned_cols=29  Identities=31%  Similarity=0.463  Sum_probs=24.6

Q ss_pred             CceEEEEECCCCEEEeCCCCeEEEEecCC
Q psy12933         29 GHLIECTLNPGDMLYIPPKVWHHVRSLST   57 (65)
Q Consensus        29 ~~~~~~~l~pGd~LyiP~~WwH~V~s~~~   57 (65)
                      ....++.+.+|+.-+|||..||+|+.+++
T Consensus        47 ~~~~~~~~~~~~~~~i~Pq~wH~V~p~s~   75 (82)
T PF09313_consen   47 EPEEEVFIPAGQPPVIEPQQWHRVEPLSD   75 (82)
T ss_dssp             SESEEEEEETTEEEEE-TT-EEEEEESST
T ss_pred             ceeEEEEeCCCCCceeCCCceEEEEECCC
Confidence            56789999999999999999999999875


No 24 
>COG1917 Uncharacterized conserved protein, contains double-stranded beta-helix domain [Function unknown]
Probab=95.13  E-value=0.051  Score=33.06  Aligned_cols=28  Identities=29%  Similarity=0.508  Sum_probs=24.2

Q ss_pred             eEEEEECCCCEEEeCCCCeEEEEecCCe
Q psy12933         31 LIECTLNPGDMLYIPPKVWHHVRSLSTS   58 (65)
Q Consensus        31 ~~~~~l~pGd~LyiP~~WwH~V~s~~~s   58 (65)
                      .-..++++||++++|++=.|.+.+.+.+
T Consensus        81 g~~~~l~~Gd~i~ip~g~~H~~~a~~~~  108 (131)
T COG1917          81 GEKKELKAGDVIIIPPGVVHGLKAVEDE  108 (131)
T ss_pred             CCceEecCCCEEEECCCCeeeeccCCCC
Confidence            4456899999999999999999998754


No 25 
>PF05523 FdtA:  WxcM-like, C-terminal ;  InterPro: IPR008894  This entry includes FdtA (Q6T1W8 from SWISSPROT) from Aneurinibacillus thermoaerophilus, which has been characterised as a dTDP-6-deoxy-3,4-keto-hexulose isomerase []. It also includes WxcM (Q93S92 from SWISSPROT) from Xanthomonas campestris pv campestris) []. ; PDB: 2PAK_A 2PAE_A 2PA7_B 2PAM_A.
Probab=95.00  E-value=0.067  Score=33.39  Aligned_cols=26  Identities=42%  Similarity=0.814  Sum_probs=15.6

Q ss_pred             EEEEECCC-CEEEeCCCCeEEEEecCC
Q psy12933         32 IECTLNPG-DMLYIPPKVWHHVRSLST   57 (65)
Q Consensus        32 ~~~~l~pG-d~LyiP~~WwH~V~s~~~   57 (65)
                      .+++|... ..|+||+|+||...+.+.
T Consensus        76 ~~~~L~~~~~~L~Ippg~w~~~~~~s~  102 (131)
T PF05523_consen   76 EEFILDEPNKGLYIPPGVWHGIKNFSE  102 (131)
T ss_dssp             EEEEE--TTEEEEE-TT-EEEEE---T
T ss_pred             EEEEECCCCeEEEECCchhhHhhccCC
Confidence            56666665 499999999999998874


No 26 
>TIGR01221 rmlC dTDP-4-dehydrorhamnose 3,5-epimerase. This enzyme participates in the biosynthesis of dTDP-L-rhamnose, often as a precursor to LPS O-antigen
Probab=95.00  E-value=0.081  Score=34.86  Aligned_cols=44  Identities=14%  Similarity=0.046  Sum_probs=30.0

Q ss_pred             ccccCccCCCCCCCCceEEEEECCCCEEEeCCCCeEEEEecCCeE
Q psy12933         15 KEEAQSSAPHNQTPGHLIECTLNPGDMLYIPPKVWHHVRSLSTSI   59 (65)
Q Consensus        15 ~~d~~~~~~p~~~~~~~~~~~l~pGd~LyiP~~WwH~V~s~~~si   59 (65)
                      .+|.+... |.|.+-..++..-+.+.+||||+|.+|-..+++...
T Consensus        82 ~VDlR~~S-pTfG~~~~~~L~~~~~~~l~IP~G~aHGF~~L~d~a  125 (176)
T TIGR01221        82 AVDLRRNS-PTFGKWVGVLLSAENKRQLWIPEGFAHGFVVLSDEA  125 (176)
T ss_pred             EEECCCCc-CCCCeEEEEEECCCCCCEEEeCCcceeEEEEcCCCe
Confidence            45666544 455554444444444779999999999999987643


No 27 
>TIGR03404 bicupin_oxalic bicupin, oxalate decarboxylase family. Members of this protein family are defined as bicupins as they have two copies of the cupin domain (pfam00190). Two different known activities for members of this family are oxalate decarboxylase (EC 4.1.1.2) and oxalate oxidase (EC 1.2.3.4), although the latter activity has more often been found in distantly related monocupin (germin) proteins.
Probab=94.60  E-value=0.11  Score=37.62  Aligned_cols=25  Identities=28%  Similarity=0.591  Sum_probs=21.9

Q ss_pred             EEEEECCCCEEEeCCCCeEEEEecC
Q psy12933         32 IECTLNPGDMLYIPPKVWHHVRSLS   56 (65)
Q Consensus        32 ~~~~l~pGd~LyiP~~WwH~V~s~~   56 (65)
                      +...|++||++|+|++..|..++.+
T Consensus       110 ~~~~L~~GD~~~fP~g~~H~~~n~~  134 (367)
T TIGR03404       110 YIDDVGAGDLWYFPPGIPHSLQGLD  134 (367)
T ss_pred             EEeEECCCCEEEECCCCeEEEEECC
Confidence            3346999999999999999999985


No 28 
>PF13759 2OG-FeII_Oxy_5:  Putative 2OG-Fe(II) oxygenase; PDB: 3BVC_B 2RG4_A.
Probab=94.26  E-value=0.12  Score=30.31  Aligned_cols=36  Identities=28%  Similarity=0.501  Sum_probs=23.2

Q ss_pred             CCceEEEEECCCCEEEeCCCCeEEEEecC---CeEEEEE
Q psy12933         28 PGHLIECTLNPGDMLYIPPKVWHHVRSLS---TSISVSF   63 (65)
Q Consensus        28 ~~~~~~~~l~pGd~LyiP~~WwH~V~s~~---~sisvn~   63 (65)
                      ......+..++||++..|+.-+|.|....   .-|+|+|
T Consensus        62 ~~~~~~~~p~~G~lvlFPs~l~H~v~p~~~~~~Risisf  100 (101)
T PF13759_consen   62 NSPYYIVEPEEGDLVLFPSWLWHGVPPNNSDEERISISF  100 (101)
T ss_dssp             C-SEEEE---TTEEEEEETTSEEEE----SSS-EEEEEE
T ss_pred             cCceEEeCCCCCEEEEeCCCCEEeccCcCCCCCEEEEEc
Confidence            45678889999999999999999998765   3566665


No 29 
>PF05899 Cupin_3:  Protein of unknown function (DUF861);  InterPro: IPR008579 The function of the proteins in this entry are unknown. They contain the conserved barrel domain of the 'cupin' superfamily and members are specific to plants and bacteria.; PDB: 1RC6_A 3MYX_A 1O5U_A 2K9Z_A 1LKN_A 3ES4_A 1SFN_B 3BCW_A.
Probab=94.20  E-value=0.03  Score=31.68  Aligned_cols=19  Identities=21%  Similarity=0.398  Sum_probs=15.8

Q ss_pred             ceEEEEECCCCEEEeCCCC
Q psy12933         30 HLIECTLNPGDMLYIPPKV   48 (65)
Q Consensus        30 ~~~~~~l~pGd~LyiP~~W   48 (65)
                      ..-..+++|||++++|+||
T Consensus        42 ~G~~~~~~aGD~~~~p~G~   60 (74)
T PF05899_consen   42 DGETVTFKAGDAFFLPKGW   60 (74)
T ss_dssp             TTEEEEEETTEEEEE-TTE
T ss_pred             CCCEEEEcCCcEEEECCCC
Confidence            4567899999999999998


No 30 
>KOG2107|consensus
Probab=94.20  E-value=0.092  Score=35.04  Aligned_cols=30  Identities=17%  Similarity=0.478  Sum_probs=26.3

Q ss_pred             CCCceEEEEECCCCEEEeCCCCeEEEEecC
Q psy12933         27 TPGHLIECTLNPGDMLYIPPKVWHHVRSLS   56 (65)
Q Consensus        27 ~~~~~~~~~l~pGd~LyiP~~WwH~V~s~~   56 (65)
                      ++..++.+-++.||++++|+|-.|+.....
T Consensus       111 ~dd~WIRi~vekGDlivlPaGiyHRFTtt~  140 (179)
T KOG2107|consen  111 KDDQWIRIFVEKGDLIVLPAGIYHRFTTTP  140 (179)
T ss_pred             CCCCEEEEEEecCCEEEecCcceeeeecCc
Confidence            356889999999999999999999987654


No 31 
>PRK09943 DNA-binding transcriptional repressor PuuR; Provisional
Probab=94.07  E-value=0.13  Score=33.15  Aligned_cols=27  Identities=19%  Similarity=0.124  Sum_probs=23.3

Q ss_pred             eEEEEECCCCEEEeCCCCeEEEEecCC
Q psy12933         31 LIECTLNPGDMLYIPPKVWHHVRSLST   57 (65)
Q Consensus        31 ~~~~~l~pGd~LyiP~~WwH~V~s~~~   57 (65)
                      .-+..|++||++|+|++-.|...+.+.
T Consensus       145 ~~~~~l~~Gd~~~~~~~~~H~~~n~~~  171 (185)
T PRK09943        145 GQDYHLVAGQSYAINTGIPHSFSNTSA  171 (185)
T ss_pred             CEEEEecCCCEEEEcCCCCeeeeCCCC
Confidence            346789999999999999999999764


No 32 
>PF03079 ARD:  ARD/ARD' family;  InterPro: IPR004313 The two acireductone dioxygenase enzymes (ARD and ARD', previously known as E-2 and E-2') from Klebsiella pneumoniae share the same amino acid sequence Q9ZFE7 from SWISSPROT, but bind different metal ions: ARD binds Ni2+, ARD' binds Fe2+ []. ARD and ARD' can be experimentally interconverted by removal of the bound metal ion and reconstitution with the appropriate metal ion. The two enzymes share the same substrate, 1,2-dihydroxy-3-keto-5-(methylthio)pentene, but yield different products. ARD' yields the alpha-keto precursor of methionine (and formate), thus forming part of the ubiquitous methionine salvage pathway that converts 5'-methylthioadenosine (MTA) to methionine. This pathway is responsible for the tight control of the concentration of MTA, which is a powerful inhibitor of polyamine biosynthesis and transmethylation reactions []. ARD yields methylthiopropanoate, carbon monoxide and formate, and thus prevents the conversion of MTA to methionine. The role of the ARD catalysed reaction is unclear: methylthiopropanoate is cytotoxic, and carbon monoxide can activate guanylyl cyclase, leading to increased intracellular cGMP levels [, ].  This family also contains other proteins, whose functions are not well characterised.; GO: 0010309 acireductone dioxygenase [iron(II)-requiring] activity, 0055114 oxidation-reduction process; PDB: 1VR3_A 1ZRR_A 2HJI_A.
Probab=93.95  E-value=0.13  Score=33.44  Aligned_cols=30  Identities=17%  Similarity=0.295  Sum_probs=22.5

Q ss_pred             CCCceEEEEECCCCEEEeCCCCeEEEEecC
Q psy12933         27 TPGHLIECTLNPGDMLYIPPKVWHHVRSLS   56 (65)
Q Consensus        27 ~~~~~~~~~l~pGd~LyiP~~WwH~V~s~~   56 (65)
                      ++...+++.+++||.|.||+|-.|.....+
T Consensus       110 ~~~~wiri~~e~GDli~vP~g~~HrF~~~~  139 (157)
T PF03079_consen  110 GDDVWIRILCEKGDLIVVPAGTYHRFTLGE  139 (157)
T ss_dssp             TTCEEEEEEEETTCEEEE-TT--EEEEEST
T ss_pred             CCCEEEEEEEcCCCEEecCCCCceeEEcCC
Confidence            345677899999999999999999998544


No 33 
>TIGR03037 anthran_nbaC 3-hydroxyanthranilate 3,4-dioxygenase. Members of this protein family, from both bacteria and eukaryotes, are the enzyme 3-hydroxyanthranilate 3,4-dioxygenase. This enzyme acts on the tryptophan metabolite 3-hydroxyanthranilate and produces 2-amino-3-carboxymuconate semialdehyde, which can rearrange spontaneously to quinolinic acid and feed into nicotinamide biosynthesis, or undergo further enzymatic degradation.
Probab=93.53  E-value=0.14  Score=33.62  Aligned_cols=30  Identities=27%  Similarity=0.522  Sum_probs=26.0

Q ss_pred             EEEEECCCCEEEeCCCCeEEEEecCCeEEE
Q psy12933         32 IECTLNPGDMLYIPPKVWHHVRSLSTSISV   61 (65)
Q Consensus        32 ~~~~l~pGd~LyiP~~WwH~V~s~~~sisv   61 (65)
                      -.+.|++||++++|+|--|.......++.+
T Consensus        70 ~~v~L~eGd~flvP~gvpHsP~r~~~t~~L   99 (159)
T TIGR03037        70 EDVPIREGDIFLLPPHVPHSPQRPAGSIGL   99 (159)
T ss_pred             EEEEECCCCEEEeCCCCCcccccCCCcEEE
Confidence            478999999999999999999887766655


No 34 
>COG3450 Predicted enzyme of the cupin superfamily [General function prediction only]
Probab=93.48  E-value=0.084  Score=32.99  Aligned_cols=28  Identities=14%  Similarity=0.096  Sum_probs=21.0

Q ss_pred             CceEEEEECCCCEEEeCCCCeEEEEecC
Q psy12933         29 GHLIECTLNPGDMLYIPPKVWHHVRSLS   56 (65)
Q Consensus        29 ~~~~~~~l~pGd~LyiP~~WwH~V~s~~   56 (65)
                      ...-.+.++|||.+|+|+||.=-=+-++
T Consensus        79 d~Ge~v~~~aGD~~~~~~G~~g~W~V~E  106 (116)
T COG3450          79 DGGEPVEVRAGDSFVFPAGFKGTWEVLE  106 (116)
T ss_pred             CCCeEEEEcCCCEEEECCCCeEEEEEee
Confidence            3456688999999999999975444433


No 35 
>KOG1356|consensus
Probab=93.44  E-value=0.011  Score=47.09  Aligned_cols=32  Identities=28%  Similarity=0.361  Sum_probs=28.4

Q ss_pred             CceEEEEECCCCEEEeCCCCeEEEEecCCeEE
Q psy12933         29 GHLIECTLNPGDMLYIPPKVWHHVRSLSTSIS   60 (65)
Q Consensus        29 ~~~~~~~l~pGd~LyiP~~WwH~V~s~~~sis   60 (65)
                      +.+..+++..||.+|||+|--|||+++..++.
T Consensus       796 Ve~WtfvQ~LGdAVfIPAGaPHQVrNLkSCik  827 (889)
T KOG1356|consen  796 VEPWTFVQFLGDAVFIPAGAPHQVRNLKSCIK  827 (889)
T ss_pred             CCccchhhcccceEEecCCCcHHhhhhhhHHH
Confidence            67889999999999999999999999976543


No 36 
>PF01050 MannoseP_isomer:  Mannose-6-phosphate isomerase;  InterPro: IPR001538 Mannose-6-phosphate isomerase or phosphomannose isomerase (5.3.1.8 from EC) (PMI) is the enzyme that catalyses the interconversion of mannose-6-phosphate and fructose-6-phosphate. In eukaryotes PMI is involved in the synthesis of GDP-mannose, a constituent of N- and O-linked glycans and GPI anchors and in prokaryotes it participates in a variety of pathways, including capsular polysaccharide biosynthesis and D-mannose metabolism. PMI's belong to the cupin superfamily whose functions range from isomerase and epimerase activities involved in the modification of cell wall carbohydrates in bacteria and plants, to non-enzymatic storage proteins in plant seeds, and transcription factors linked to congenital baldness in mammals []. Three classes of PMI have been defined []. The type II phosphomannose isomerases are bifunctional enzymes 5.3.1.8 from EC. This entry covers the isomerase region of the protein []. The guanosine diphospho-D-mannose pyrophosphorylase region is described in another InterPro entry (see IPR005836 from INTERPRO).; GO: 0016779 nucleotidyltransferase activity, 0005976 polysaccharide metabolic process
Probab=93.39  E-value=0.15  Score=32.85  Aligned_cols=26  Identities=35%  Similarity=0.484  Sum_probs=22.7

Q ss_pred             eEEEEECCCCEEEeCCCCeEEEEecC
Q psy12933         31 LIECTLNPGDMLYIPPKVWHHVRSLS   56 (65)
Q Consensus        31 ~~~~~l~pGd~LyiP~~WwH~V~s~~   56 (65)
                      .-...+.+||.+|||+|-.|++++.+
T Consensus       101 ~~~~~~~~g~sv~Ip~g~~H~i~n~g  126 (151)
T PF01050_consen  101 DEEFTLKEGDSVYIPRGAKHRIENPG  126 (151)
T ss_pred             CEEEEEcCCCEEEECCCCEEEEECCC
Confidence            34567999999999999999999875


No 37 
>PF02311 AraC_binding:  AraC-like ligand binding domain;  InterPro: IPR003313 This entry defines the arabinose-binding and dimerisation domain of the bacterial gene regulatory protein AraC. The crystal structure of the arabinose-binding and dimerization domain of the Escherichia coli gene regulatory protein AraC was determined in the presence and absence of L-arabinose. The arabinose-bound molecule shows that the protein adopts an unusual fold, binding sugar within a beta barrel and completely burying the arabinose with the amino-terminal arm of the protein. Dimer contacts in the presence of arabinose are mediated by an antiparallel coiled-coil. In the uncomplexed protein, the amino-terminal arm is disordered, uncovering the sugar-binding pocket and allowing it to serve as an oligomerization interface [].; GO: 0006355 regulation of transcription, DNA-dependent; PDB: 1XJA_B 2ARA_A 2AAC_B 2ARC_A.
Probab=92.87  E-value=0.43  Score=27.48  Aligned_cols=27  Identities=37%  Similarity=0.482  Sum_probs=19.0

Q ss_pred             eEEEEECCCCEEEeCCCCeEEEEecCC
Q psy12933         31 LIECTLNPGDMLYIPPKVWHHVRSLST   57 (65)
Q Consensus        31 ~~~~~l~pGd~LyiP~~WwH~V~s~~~   57 (65)
                      .-+..++|||++++|++-.|.....+.
T Consensus        40 ~~~~~l~~g~~~li~p~~~H~~~~~~~   66 (136)
T PF02311_consen   40 GQEYPLKPGDLFLIPPGQPHSYYPDSN   66 (136)
T ss_dssp             TEEEEE-TT-EEEE-TTS-EEEEE-TT
T ss_pred             CEEEEEECCEEEEecCCccEEEecCCC
Confidence            456789999999999999999998873


No 38 
>COG3615 TehB Uncharacterized protein/domain, possibly involved in tellurite resistance [Inorganic ion transport and metabolism]
Probab=92.73  E-value=0.21  Score=30.54  Aligned_cols=33  Identities=21%  Similarity=0.632  Sum_probs=24.4

Q ss_pred             eEEEEECCCCEEEeCCCCeEEEEecC--CeEEEEEe
Q psy12933         31 LIECTLNPGDMLYIPPKVWHHVRSLS--TSISVSFW   64 (65)
Q Consensus        31 ~~~~~l~pGd~LyiP~~WwH~V~s~~--~sisvn~w   64 (65)
                      ...++.++ +.-++|+.-||+|+.++  .-+.++||
T Consensus        61 e~~~~~ea-~~~~~~PQ~WHrVea~tDD~e~~l~Fy   95 (99)
T COG3615          61 EHVFSIEA-QFPVFPPQAWHRVEAMTDDAEFNLSFY   95 (99)
T ss_pred             eEEEeecC-CCCccChhHeeeeeecccccEEEEEEE
Confidence            44455555 55778888999999876  47888886


No 39 
>COG1898 RfbC dTDP-4-dehydrorhamnose 3,5-epimerase and related enzymes [Cell envelope biogenesis, outer membrane]
Probab=92.66  E-value=0.52  Score=31.22  Aligned_cols=47  Identities=23%  Similarity=0.382  Sum_probs=32.3

Q ss_pred             ccccCccCCCCCCCCceEEEEECCC--CEEEeCCCCeEEEEecCCeEEEEEe
Q psy12933         15 KEEAQSSAPHNQTPGHLIECTLNPG--DMLYIPPKVWHHVRSLSTSISVSFW   64 (65)
Q Consensus        15 ~~d~~~~~~p~~~~~~~~~~~l~pG--d~LyiP~~WwH~V~s~~~sisvn~w   64 (65)
                      .+|.+... |.|  .+...++|.+-  .+|+||+|+.|-..+++.+..+.++
T Consensus        82 ~vDlR~~S-pTy--g~~~~~~ls~~N~~~l~IP~G~AHGf~~L~d~~~~~y~  130 (173)
T COG1898          82 AVDLRKDS-PTY--GKWVGVVLSAENKRQLYIPPGFAHGFQVLSDDAEVVYK  130 (173)
T ss_pred             EEEccCCC-CCc--ceEEEEEecCCCceEEEeCCcccceeEEccCceEEEEE
Confidence            34555533 333  44555555554  9999999999999999976665553


No 40 
>PF00908 dTDP_sugar_isom:  dTDP-4-dehydrorhamnose 3,5-epimerase;  InterPro: IPR000888 Deoxythymidine diphosphate (dTDP)-4-keto-6-deoxy-d-hexulose 3, 5-epimerase (RmlC, 5.1.3.13 from EC) is involved in the biosynthesis of dTDP-l-rhamnose, which is an essential component of the bacterial cell wall, converting dTDP-4-keto-6-deoxy-D-glucose to dTDP-4-keto-L-rhamnose. The crystal structure of RmlC from Methanobacterium thermoautotrophicum was determined in the presence and absence of a substrate analogue. RmlC is a homodimer comprising a central jelly roll motif, which extends in two directions into longer beta-sheets. Binding of dTDP is stabilised by ionic interactions to the phosphate group and by a combination of ionic and hydrophobic interactions with the base. The active site, which is located in the centre of the jelly roll, is formed by residues that are conserved in all known RmlC sequence homologues. The active site is lined with a number of charged residues and a number of residues with hydrogen-bonding potentials, which together comprise a potential network for substrate binding and catalysis. The active site is also lined with aromatic residues which provide favorable environments for the base moiety of dTDP and potentially for the sugar moiety of the substrate [].; GO: 0008830 dTDP-4-dehydrorhamnose 3,5-epimerase activity, 0009103 lipopolysaccharide biosynthetic process; PDB: 1EPZ_A 1EP0_A 1NXM_A 1NZC_D 2IXL_C 1NYW_B 2IXC_D 1PM7_B 1UPI_A 3RYK_B ....
Probab=92.51  E-value=0.44  Score=31.31  Aligned_cols=42  Identities=24%  Similarity=0.423  Sum_probs=26.7

Q ss_pred             ccccCccCCCCCCCCceEEEEECCC--CEEEeCCCCeEEEEecCCeE
Q psy12933         15 KEEAQSSAPHNQTPGHLIECTLNPG--DMLYIPPKVWHHVRSLSTSI   59 (65)
Q Consensus        15 ~~d~~~~~~p~~~~~~~~~~~l~pG--d~LyiP~~WwH~V~s~~~si   59 (65)
                      .+|.++.. |.|  .+...+.|.++  -.||||+|.+|-..+++..-
T Consensus        82 ~vDlR~~S-pTf--g~~~~~~Ls~~n~~~l~IP~G~aHGf~~l~d~a  125 (176)
T PF00908_consen   82 AVDLRKGS-PTF--GKWVSVELSAENPRQLYIPPGVAHGFQTLEDDA  125 (176)
T ss_dssp             EEE-BTTS-TTT--T-EEEEEEETTT--EEEE-TTEEEEEEESSSEE
T ss_pred             EEECCCCC-CCC--CEEEEEEeCccccCEEEeCCcceeeEEeccCce
Confidence            34666533 444  45555666555  58999999999999998653


No 41 
>PRK13264 3-hydroxyanthranilate 3,4-dioxygenase; Provisional
Probab=92.22  E-value=0.27  Score=32.80  Aligned_cols=31  Identities=26%  Similarity=0.490  Sum_probs=26.4

Q ss_pred             EEEEECCCCEEEeCCCCeEEEEecCCeEEEE
Q psy12933         32 IECTLNPGDMLYIPPKVWHHVRSLSTSISVS   62 (65)
Q Consensus        32 ~~~~l~pGd~LyiP~~WwH~V~s~~~sisvn   62 (65)
                      ..+.|++||++++|++--|..+....++.+-
T Consensus        76 ~~v~L~eGd~fllP~gvpHsP~r~~~tv~Lv  106 (177)
T PRK13264         76 RDVPIREGEMFLLPPHVPHSPQREAGSIGLV  106 (177)
T ss_pred             eeEEECCCCEEEeCCCCCcCCccCCCeEEEE
Confidence            4788999999999999999998877666553


No 42 
>TIGR02466 conserved hypothetical protein. This family consists of uncharacterized proteins in Caulobacter crescentus CB15, Bdellovibrio bacteriovorus HD100, Synechococcus sp. WH 8102 (2), Silicibacter pomeroyi DSS-3 (2), and Hyphomonas neptunium ATCC 15444. The context of nearby genes differs substantially between members and does point to any specific biological role.
Probab=91.92  E-value=0.28  Score=32.91  Aligned_cols=32  Identities=19%  Similarity=0.223  Sum_probs=24.7

Q ss_pred             EEEECCCCEEEeCCCCeEEEEecC---C--eEEEEEe
Q psy12933         33 ECTLNPGDMLYIPPKVWHHVRSLS---T--SISVSFW   64 (65)
Q Consensus        33 ~~~l~pGd~LyiP~~WwH~V~s~~---~--sisvn~w   64 (65)
                      .+.-++|++|..|+.-||.|....   .  |||.|+-
T Consensus       163 ~v~P~~G~lvlFPS~L~H~v~p~~~~~~RISiSFNl~  199 (201)
T TIGR02466       163 YVPPQEGRVLLFESWLRHEVPPNESEEERISVSFNYA  199 (201)
T ss_pred             EECCCCCeEEEECCCCceecCCCCCCCCEEEEEEeeE
Confidence            455599999999999999998865   2  5555553


No 43 
>PF13640 2OG-FeII_Oxy_3:  2OG-Fe(II) oxygenase superfamily; PDB: 3DKQ_B 3GZE_D 3HQR_A 2Y34_A 2G1M_A 2G19_A 3OUI_A 3OUJ_A 2HBU_A 2Y33_A ....
Probab=91.88  E-value=0.19  Score=28.80  Aligned_cols=30  Identities=30%  Similarity=0.747  Sum_probs=25.5

Q ss_pred             ECCCCEEEeCC-CCeEEEEec-C--CeEEEEEeC
Q psy12933         36 LNPGDMLYIPP-KVWHHVRSL-S--TSISVSFWF   65 (65)
Q Consensus        36 l~pGd~LyiP~-~WwH~V~s~-~--~sisvn~w~   65 (65)
                      -++|+++++|+ ..||.|... .  .-++|++||
T Consensus        66 p~~g~~v~F~~~~~~H~v~~v~~~~~R~~l~~~~   99 (100)
T PF13640_consen   66 PKPGRLVIFPSDNSLHGVTPVGEGGRRYSLTFWF   99 (100)
T ss_dssp             -BTTEEEEEESCTCEEEEEEE-EESEEEEEEEEE
T ss_pred             CCCCEEEEEeCCCCeecCcccCCCCCEEEEEEEE
Confidence            89999999999 999999998 3  368888885


No 44 
>TIGR03214 ura-cupin putative allantoin catabolism protein. This model represents a protein containing a tandem arrangement of cupin domains (N-terminal part of pfam07883 and C-terminal more distantly related to pfam00190). This protein is found in the vicinity of genes involved in the catabolism of allantoin, a breakdown product of urate and sometimes of urate iteslf. The distribution of pathway components in the genomes in which this family is observed suggests that the function is linked to the allantoate catabolism to glyoxylate pathway (GenProp0686) since it is sometimes found in genomes lacking any elements of the xanthine-to-allantoin pathways (e.g. in Enterococcus faecalis).
Probab=91.71  E-value=0.29  Score=33.67  Aligned_cols=27  Identities=7%  Similarity=0.010  Sum_probs=23.6

Q ss_pred             eEEEEECCCCEEEeCCCCeEEEEecCC
Q psy12933         31 LIECTLNPGDMLYIPPKVWHHVRSLST   57 (65)
Q Consensus        31 ~~~~~l~pGd~LyiP~~WwH~V~s~~~   57 (65)
                      .-...+++||++|||++--|+..+.+.
T Consensus       217 g~~~~V~~GD~i~i~~~~~h~~~~~G~  243 (260)
T TIGR03214       217 NNWVPVEAGDYIWMGAYCPQACYAGGR  243 (260)
T ss_pred             CEEEEecCCCEEEECCCCCEEEEecCC
Confidence            446789999999999999999999863


No 45 
>PRK15457 ethanolamine utilization protein EutQ; Provisional
Probab=91.17  E-value=0.26  Score=34.14  Aligned_cols=24  Identities=17%  Similarity=0.229  Sum_probs=19.7

Q ss_pred             eEEEEECCCCEEEeCCCCeEEEEe
Q psy12933         31 LIECTLNPGDMLYIPPKVWHHVRS   54 (65)
Q Consensus        31 ~~~~~l~pGd~LyiP~~WwH~V~s   54 (65)
                      .-+.+++|||++|||+|-.|+-.+
T Consensus       192 G~t~~l~pGDvlfIPkGs~~hf~t  215 (233)
T PRK15457        192 GETMIAKAGDVMFIPKGSSIEFGT  215 (233)
T ss_pred             CEEEEeCCCcEEEECCCCeEEecC
Confidence            467789999999999999955543


No 46 
>TIGR03214 ura-cupin putative allantoin catabolism protein. This model represents a protein containing a tandem arrangement of cupin domains (N-terminal part of pfam07883 and C-terminal more distantly related to pfam00190). This protein is found in the vicinity of genes involved in the catabolism of allantoin, a breakdown product of urate and sometimes of urate iteslf. The distribution of pathway components in the genomes in which this family is observed suggests that the function is linked to the allantoate catabolism to glyoxylate pathway (GenProp0686) since it is sometimes found in genomes lacking any elements of the xanthine-to-allantoin pathways (e.g. in Enterococcus faecalis).
Probab=90.84  E-value=0.67  Score=31.87  Aligned_cols=24  Identities=21%  Similarity=0.387  Sum_probs=21.6

Q ss_pred             EEEECCCCEEEeCCCCeEEEEecC
Q psy12933         33 ECTLNPGDMLYIPPKVWHHVRSLS   56 (65)
Q Consensus        33 ~~~l~pGd~LyiP~~WwH~V~s~~   56 (65)
                      ..+|++||.+|+|++=-|+.++.+
T Consensus        99 ~~~L~~Gd~~y~pa~~~H~~~N~~  122 (260)
T TIGR03214        99 THELREGGYAYLPPGSKWTLANAQ  122 (260)
T ss_pred             EEEECCCCEEEECCCCCEEEEECC
Confidence            458999999999999999999876


No 47 
>PRK15131 mannose-6-phosphate isomerase; Provisional
Probab=90.80  E-value=0.25  Score=36.06  Aligned_cols=24  Identities=25%  Similarity=0.402  Sum_probs=20.9

Q ss_pred             eEEEEECCCCEEEeCCCCeEEEEe
Q psy12933         31 LIECTLNPGDMLYIPPKVWHHVRS   54 (65)
Q Consensus        31 ~~~~~l~pGd~LyiP~~WwH~V~s   54 (65)
                      --.+.|+|||++|+|+|--|..-+
T Consensus       236 LN~v~l~pGeaifipAg~~HAyl~  259 (389)
T PRK15131        236 LNVVKLNPGEAMFLFAETPHAYLQ  259 (389)
T ss_pred             eeEEEeCCCCEEEeCCCCCeEEcC
Confidence            457899999999999999998654


No 48 
>PF06249 EutQ:  Ethanolamine utilisation protein EutQ;  InterPro: IPR010424 The eut operon of Salmonella typhimurium encodes proteins involved in the cobalamin-dependent degradation of ethanolamine. The role of EutQ in this process is unclear [].; PDB: 2PYT_B 3LWC_A.
Probab=90.79  E-value=0.19  Score=32.67  Aligned_cols=18  Identities=33%  Similarity=0.612  Sum_probs=12.6

Q ss_pred             eEEEEECCCCEEEeCCCC
Q psy12933         31 LIECTLNPGDMLYIPPKV   48 (65)
Q Consensus        31 ~~~~~l~pGd~LyiP~~W   48 (65)
                      .-..+-+|||+||||.|-
T Consensus       112 G~~~~A~~GDvi~iPkGs  129 (152)
T PF06249_consen  112 GQTVTAKPGDVIFIPKGS  129 (152)
T ss_dssp             TEEEEEETT-EEEE-TT-
T ss_pred             CEEEEEcCCcEEEECCCC
Confidence            456778999999999993


No 49 
>TIGR02272 gentisate_1_2 gentisate 1,2-dioxygenase. This family consists of gentisate 1,2-dioxygenases. This ring-opening enzyme acts in salicylate degradation that goes via gentisate rather than via catechol. It converts gentisate to maleylpyruvate. Some putative gentisate 1,2-dioxygenases are excluded by a relatively high trusted cutoff score because they are too closely related to known examples of 1-hydroxy-2-naphthoate dioxygenase. Therefore some homologs may be bona fide gentisate 1,2-dioxygenases even if they score below the given cutoffs.
Probab=90.70  E-value=0.45  Score=34.36  Aligned_cols=30  Identities=23%  Similarity=0.443  Sum_probs=25.3

Q ss_pred             CCceEEEEECCCCEEEeCCCCeEEEEecCC
Q psy12933         28 PGHLIECTLNPGDMLYIPPKVWHHVRSLST   57 (65)
Q Consensus        28 ~~~~~~~~l~pGd~LyiP~~WwH~V~s~~~   57 (65)
                      .+...++.+++||++.+|++.||.-.+.+.
T Consensus       116 ~V~g~~~~~~~gD~~~tP~w~wH~H~n~~d  145 (335)
T TIGR02272       116 AVDGERTTMHPGDFIITPSWTWHDHGNPGD  145 (335)
T ss_pred             EECCEEEeeeCCCEEEeCCCeeEecccCCC
Confidence            356778899999999999999999877653


No 50 
>PF15138 Syncollin:  Syncollin
Probab=90.50  E-value=0.35  Score=30.12  Aligned_cols=36  Identities=25%  Similarity=0.509  Sum_probs=28.6

Q ss_pred             CceEEEEECCC-CEEEeCCCCeEEEEecC--CeEEEEEe
Q psy12933         29 GHLIECTLNPG-DMLYIPPKVWHHVRSLS--TSISVSFW   64 (65)
Q Consensus        29 ~~~~~~~l~pG-d~LyiP~~WwH~V~s~~--~sisvn~w   64 (65)
                      ...-+..++|| |.-|+|.+|=+.+.|+-  .-..+..|
T Consensus        32 C~G~~l~v~pg~DlPylPs~w~n~iSSlvV~~RCeLtVW   70 (112)
T PF15138_consen   32 CGGAELSVEPGDDLPYLPSGWANKISSLVVAPRCELTVW   70 (112)
T ss_pred             cCCcEEeecCCCCCCcCCccccCeeeeEEEcCceEEEEE
Confidence            35678899999 78899999999999874  45666655


No 51 
>PLN00212 glutelin; Provisional
Probab=90.33  E-value=0.92  Score=34.37  Aligned_cols=33  Identities=18%  Similarity=0.270  Sum_probs=26.5

Q ss_pred             CceEEEEECCCCEEEeCCCCeEEEEecCCeEEE
Q psy12933         29 GHLIECTLNPGDMLYIPPKVWHHVRSLSTSISV   61 (65)
Q Consensus        29 ~~~~~~~l~pGd~LyiP~~WwH~V~s~~~sisv   61 (65)
                      -..+...|++||+++||.+..|..++....+.+
T Consensus       390 ~~vf~~~L~~GdvfVVPqg~~v~~~A~~egfe~  422 (493)
T PLN00212        390 KTVFNGVLRPGQLLIIPQHYAVLKKAEREGCQY  422 (493)
T ss_pred             CEEEEEEEcCCCEEEECCCCeEEEeecCCceEE
Confidence            356888999999999999999988776544433


No 52 
>PRK05467 Fe(II)-dependent oxygenase superfamily protein; Provisional
Probab=89.62  E-value=0.9  Score=31.05  Aligned_cols=34  Identities=24%  Similarity=0.544  Sum_probs=29.1

Q ss_pred             eEEEEECCCCEEEeCCCCeEEEEecC--CeEEEEEe
Q psy12933         31 LIECTLNPGDMLYIPPKVWHHVRSLS--TSISVSFW   64 (65)
Q Consensus        31 ~~~~~l~pGd~LyiP~~WwH~V~s~~--~sisvn~w   64 (65)
                      ...+.+++|++|..|+.-+|+|...+  .-+++.+|
T Consensus       140 ~~~Vkp~aG~~vlfps~~lH~v~pVt~G~R~~~~~W  175 (226)
T PRK05467        140 EHRVKLPAGDLVLYPSTSLHRVTPVTRGVRVASFFW  175 (226)
T ss_pred             cEEEecCCCeEEEECCCCceeeeeccCccEEEEEec
Confidence            36788999999999999999999865  46777777


No 53 
>TIGR01479 GMP_PMI mannose-1-phosphate guanylyltransferase/mannose-6-phosphate isomerase. This enzyme is known to be bifunctional, as both mannose-6-phosphate isomerase (EC 5.3.1.8) (PMI) and mannose-1-phosphate guanylyltransferase (EC 2.7.7.22) in Pseudomonas aeruginosa, Xanthomonas campestris, and Gluconacetobacter xylinus. The literature on the enzyme from E. coli attributes mannose-6-phosphate isomerase activity to an adjacent gene, but the present sequence has not been shown to lack the activity. The PMI domain is C-terminal.
Probab=89.25  E-value=0.86  Score=33.66  Aligned_cols=27  Identities=22%  Similarity=0.252  Sum_probs=23.6

Q ss_pred             eEEEEECCCCEEEeCCCCeEEEEecCC
Q psy12933         31 LIECTLNPGDMLYIPPKVWHHVRSLST   57 (65)
Q Consensus        31 ~~~~~l~pGd~LyiP~~WwH~V~s~~~   57 (65)
                      .-+..|++||++|+|++--|+.++.+.
T Consensus       414 g~~~~l~~GDsi~ip~~~~H~~~N~g~  440 (468)
T TIGR01479       414 DETLLLTENESTYIPLGVIHRLENPGK  440 (468)
T ss_pred             CEEEEecCCCEEEECCCCcEEEEcCCC
Confidence            346789999999999999999999763


No 54 
>TIGR00218 manA mannose-6-phosphate isomerase, class I. The names phosphomannose isomerase and mannose-6-phosphate isomerase are synonomous. This family contains two rather deeply branched groups. One group contains an experimentally determined phosphomannose isomerase of Streptococcus mutans as well as three uncharacterized paralogous proteins of Bacillus subtilis, all at more than 50 % identity to each other, plus a more distant homolog from Archaeoglobus fulgidus. The other group contains members from E. coli, budding yeast, Borrelia burgdorferi, etc.
Probab=88.76  E-value=0.27  Score=34.28  Aligned_cols=22  Identities=36%  Similarity=0.679  Sum_probs=19.3

Q ss_pred             eEEEEECCCCEEEeCCCCeEEE
Q psy12933         31 LIECTLNPGDMLYIPPKVWHHV   52 (65)
Q Consensus        31 ~~~~~l~pGd~LyiP~~WwH~V   52 (65)
                      --.+.++|||++|||+|--|..
T Consensus       150 ln~v~v~~Gd~i~ipaGt~HA~  171 (302)
T TIGR00218       150 LNRIKLKPGDFFYVPSGTPHAY  171 (302)
T ss_pred             hcccccCCCCEEEeCCCCcccc
Confidence            4567899999999999999984


No 55 
>PRK11171 hypothetical protein; Provisional
Probab=88.22  E-value=1.4  Score=30.33  Aligned_cols=26  Identities=23%  Similarity=0.340  Sum_probs=22.9

Q ss_pred             eEEEEECCCCEEEeCCCCeEEEEecC
Q psy12933         31 LIECTLNPGDMLYIPPKVWHHVRSLS   56 (65)
Q Consensus        31 ~~~~~l~pGd~LyiP~~WwH~V~s~~   56 (65)
                      .-+.+|++||++|+|++=-|+.++.+
T Consensus       100 g~~~~L~~GDsi~~p~~~~H~~~N~g  125 (266)
T PRK11171        100 GKTHALSEGGYAYLPPGSDWTLRNAG  125 (266)
T ss_pred             CEEEEECCCCEEEECCCCCEEEEECC
Confidence            34668999999999999999999876


No 56 
>PLN02288 mannose-6-phosphate isomerase
Probab=87.91  E-value=0.46  Score=34.89  Aligned_cols=22  Identities=32%  Similarity=0.421  Sum_probs=19.2

Q ss_pred             EEEEECCCCEEEeCCCCeEEEE
Q psy12933         32 IECTLNPGDMLYIPPKVWHHVR   53 (65)
Q Consensus        32 ~~~~l~pGd~LyiP~~WwH~V~   53 (65)
                      -.+.|+|||.+|+|+|--|.--
T Consensus       251 N~v~L~PGeaifl~ag~~HAYl  272 (394)
T PLN02288        251 NYVKLNPGEALYLGANEPHAYL  272 (394)
T ss_pred             ceEecCCCCEEEecCCCCceec
Confidence            3688999999999999999743


No 57 
>PRK13502 transcriptional activator RhaR; Provisional
Probab=87.40  E-value=1.1  Score=29.92  Aligned_cols=26  Identities=23%  Similarity=0.354  Sum_probs=22.0

Q ss_pred             eEEEEECCCCEEEeCCCCeEEEEecC
Q psy12933         31 LIECTLNPGDMLYIPPKVWHHVRSLS   56 (65)
Q Consensus        31 ~~~~~l~pGd~LyiP~~WwH~V~s~~   56 (65)
                      .-+..++|||+++||++=.|.....+
T Consensus        55 ~~~~~l~~g~l~li~~~~~H~~~~~~   80 (282)
T PRK13502         55 ERPYRITRGDLFYIRAEDKHSYTSVN   80 (282)
T ss_pred             CEEEeecCCcEEEECCCCcccccccC
Confidence            45678999999999999999887644


No 58 
>TIGR03027 pepcterm_export putative polysaccharide export protein, PEP-CTERM sytem-associated. This protein family belongs to the larger set of polysaccharide biosynthesis/export proteins described by Pfam model pfam02563. Members of this family are variable in either containing of lacking a 78-residue insert, but appear to fall within a single clade, nevertheless, where the regions in which the gene is found encode components of the PEP-CTERM/EpsH proposed exosortase protein sorting system.
Probab=86.40  E-value=0.52  Score=30.05  Aligned_cols=18  Identities=39%  Similarity=0.756  Sum_probs=15.2

Q ss_pred             EEEEECCCCEEEeCCCCe
Q psy12933         32 IECTLNPGDMLYIPPKVW   49 (65)
Q Consensus        32 ~~~~l~pGd~LyiP~~Ww   49 (65)
                      ....|+|||.+|+|..++
T Consensus       148 ~n~~L~~gD~I~Vp~~~f  165 (165)
T TIGR03027       148 ANVELKPGDVLIIPESWF  165 (165)
T ss_pred             CCceeCCCCEEEEecccC
Confidence            557899999999998753


No 59 
>PF12852 Cupin_6:  Cupin
Probab=86.29  E-value=0.99  Score=28.74  Aligned_cols=25  Identities=24%  Similarity=0.339  Sum_probs=21.9

Q ss_pred             EEEEECCCCEEEeCCCCeEEEEecC
Q psy12933         32 IECTLNPGDMLYIPPKVWHHVRSLS   56 (65)
Q Consensus        32 ~~~~l~pGd~LyiP~~WwH~V~s~~   56 (65)
                      -.+.|++||++++|.+=-|...+..
T Consensus        55 ~~~~L~~GDivllp~g~~H~l~~~~   79 (186)
T PF12852_consen   55 EPIRLEAGDIVLLPRGTAHVLSSDP   79 (186)
T ss_pred             CeEEecCCCEEEEcCCCCeEeCCCC
Confidence            5688999999999999999997654


No 60 
>COG4766 EutQ Ethanolamine utilization protein [Amino acid transport and metabolism]
Probab=85.88  E-value=0.66  Score=30.81  Aligned_cols=16  Identities=25%  Similarity=0.594  Sum_probs=13.2

Q ss_pred             EEEECCCCEEEeCCCC
Q psy12933         33 ECTLNPGDMLYIPPKV   48 (65)
Q Consensus        33 ~~~l~pGd~LyiP~~W   48 (65)
                      .++-+|||++|||.|-
T Consensus       137 tv~a~aGDvifiPKgs  152 (176)
T COG4766         137 TVIAGAGDVIFIPKGS  152 (176)
T ss_pred             eEecCCCcEEEecCCC
Confidence            4556899999999984


No 61 
>COG1482 ManA Phosphomannose isomerase [Carbohydrate transport and metabolism]
Probab=85.82  E-value=0.77  Score=32.98  Aligned_cols=24  Identities=25%  Similarity=0.506  Sum_probs=20.4

Q ss_pred             eEEEEECCCCEEEeCCCCeEEEEe
Q psy12933         31 LIECTLNPGDMLYIPPKVWHHVRS   54 (65)
Q Consensus        31 ~~~~~l~pGd~LyiP~~WwH~V~s   54 (65)
                      --.+.|+|||.+|+|+|--|....
T Consensus       157 Ln~v~lkpGe~~fl~Agt~HA~~~  180 (312)
T COG1482         157 LNRVKLKPGEAFFLPAGTPHAYLK  180 (312)
T ss_pred             hcEEecCCCCEEEecCCCceeecc
Confidence            346889999999999999998654


No 62 
>PRK11171 hypothetical protein; Provisional
Probab=84.83  E-value=1.5  Score=30.15  Aligned_cols=26  Identities=8%  Similarity=0.030  Sum_probs=22.5

Q ss_pred             eEEEEECCCCEEEeCCCCeEEEEecC
Q psy12933         31 LIECTLNPGDMLYIPPKVWHHVRSLS   56 (65)
Q Consensus        31 ~~~~~l~pGd~LyiP~~WwH~V~s~~   56 (65)
                      .-+..|++||+|++|+.--|..++.+
T Consensus       222 ~~~~~l~~GD~i~~~~~~~h~~~N~g  247 (266)
T PRK11171        222 NDWVEVEAGDFIWMRAYCPQACYAGG  247 (266)
T ss_pred             CEEEEeCCCCEEEECCCCCEEEECCC
Confidence            34567999999999999999999875


No 63 
>PRK13501 transcriptional activator RhaR; Provisional
Probab=84.55  E-value=1.8  Score=29.24  Aligned_cols=27  Identities=22%  Similarity=0.317  Sum_probs=22.9

Q ss_pred             ceEEEEECCCCEEEeCCCCeEEEEecC
Q psy12933         30 HLIECTLNPGDMLYIPPKVWHHVRSLS   56 (65)
Q Consensus        30 ~~~~~~l~pGd~LyiP~~WwH~V~s~~   56 (65)
                      ......+++||+++||++=.|..++.+
T Consensus        54 ~~~~~~l~~g~~~~I~p~~~H~~~~~~   80 (290)
T PRK13501         54 NDHPYRITCGDVFYIQAADHHSYESVH   80 (290)
T ss_pred             CCeeeeecCCeEEEEcCCCcccccccC
Confidence            355678999999999999999988654


No 64 
>PF07385 DUF1498:  Protein of unknown function (DUF1498);  InterPro: IPR010864 This family consists of several hypothetical bacterial proteins of around 225 residues in length. The function of this family is unknown.; PDB: 3MPB_B 3KMH_A.
Probab=84.02  E-value=2.5  Score=29.22  Aligned_cols=28  Identities=25%  Similarity=0.514  Sum_probs=20.0

Q ss_pred             ceEEEEECCCCEEEeCCCCeEEEEecCC
Q psy12933         30 HLIECTLNPGDMLYIPPKVWHHVRSLST   57 (65)
Q Consensus        30 ~~~~~~l~pGd~LyiP~~WwH~V~s~~~   57 (65)
                      ..-+++|+||+.+-+|++-||+-.....
T Consensus       152 aG~~l~L~PGESiTL~Pg~yH~Fw~e~g  179 (225)
T PF07385_consen  152 AGTQLRLNPGESITLPPGIYHWFWGEGG  179 (225)
T ss_dssp             TT-EEEE-TT-EEEE-TTEEEEEEE-TT
T ss_pred             CCceEEeCCCCeEeeCCCCeeeEEecCC
Confidence            4567899999999999999999887664


No 65 
>PF05721 PhyH:  Phytanoyl-CoA dioxygenase (PhyH);  InterPro: IPR008775 This family is made up of several eukaryotic phytanoyl-CoA dioxygenase (PhyH) proteins as well as a number of bacterial deoxygenases. PhyH is a peroxisomal enzyme catalysing the first step of phytanic acid alpha-oxidation. PhyH deficiency causes Refsum's disease (RD) which is an inherited neurological syndrome biochemically characterised by the accumulation of phytanic acid in plasma and tissues [].; PDB: 3GJA_A 3EMR_A 3OBZ_A 2OPW_A 3NNL_B 3NNF_A 3NNM_B 3NNJ_A 2FCV_B 2FCU_A ....
Probab=83.94  E-value=1.5  Score=26.96  Aligned_cols=26  Identities=15%  Similarity=0.442  Sum_probs=22.0

Q ss_pred             CceEEEEECCCCEEEeCCCCeEEEEe
Q psy12933         29 GHLIECTLNPGDMLYIPPKVWHHVRS   54 (65)
Q Consensus        29 ~~~~~~~l~pGd~LyiP~~WwH~V~s   54 (65)
                      .....+.+++||+|+.-+.-||....
T Consensus       177 ~~~~~~~~~~Gdvl~~~~~~~H~s~~  202 (211)
T PF05721_consen  177 DEWVPVPMKAGDVLFFHSRLIHGSGP  202 (211)
T ss_dssp             SGCEEE-BSTTEEEEEETTSEEEEE-
T ss_pred             CceEEeecCCCeEEEEcCCccccCCC
Confidence            56789999999999999999999875


No 66 
>PRK15460 cpsB mannose-1-phosphate guanyltransferase; Provisional
Probab=83.79  E-value=2.6  Score=31.67  Aligned_cols=26  Identities=19%  Similarity=0.212  Sum_probs=23.0

Q ss_pred             EEEEECCCCEEEeCCCCeEEEEecCC
Q psy12933         32 IECTLNPGDMLYIPPKVWHHVRSLST   57 (65)
Q Consensus        32 ~~~~l~pGd~LyiP~~WwH~V~s~~~   57 (65)
                      .+..|+|||.+|+|++--|+.++.+.
T Consensus       424 ~~~~L~~GDSi~ip~g~~H~~~N~g~  449 (478)
T PRK15460        424 DIKLLGENESIYIPLGATHCLENPGK  449 (478)
T ss_pred             EEEEecCCCEEEECCCCcEEEEcCCC
Confidence            45789999999999999999999863


No 67 
>COG4297 Uncharacterized protein containing double-stranded beta helix domain [Function unknown]
Probab=82.81  E-value=1.8  Score=28.35  Aligned_cols=33  Identities=27%  Similarity=0.229  Sum_probs=26.5

Q ss_pred             CCCCCceEEEEECCCCEEEeCCCCeEEEEecCC
Q psy12933         25 NQTPGHLIECTLNPGDMLYIPPKVWHHVRSLST   57 (65)
Q Consensus        25 ~~~~~~~~~~~l~pGd~LyiP~~WwH~V~s~~~   57 (65)
                      .+..+...++.+..||+|.||+|-=|.-.....
T Consensus        78 ~iGG~~G~el~v~~GDvlliPAGvGH~rl~sS~  110 (163)
T COG4297          78 QIGGADGQELEVGEGDVLLIPAGVGHCRLHSSA  110 (163)
T ss_pred             EecCCCCceeeecCCCEEEEecCcccccccCCC
Confidence            355677888999999999999999887655543


No 68 
>smart00702 P4Hc Prolyl 4-hydroxylase alpha subunit homologues. Mammalian enzymes catalyse hydroxylation of collagen, for example. Prokaryotic enzymes might catalyse hydroxylation of antibiotic peptides. These are 2-oxoglutarate-dependent dioxygenases, requiring 2-oxoglutarate and dioxygen as cosubstrates and ferrous iron as a cofactor.
Probab=82.32  E-value=2.9  Score=26.33  Aligned_cols=43  Identities=16%  Similarity=0.271  Sum_probs=32.5

Q ss_pred             CCCCCCCceEEEEECCCCEEEeCCC---CeEEEEecC--CeEEEEEeC
Q psy12933         23 PHNQTPGHLIECTLNPGDMLYIPPK---VWHHVRSLS--TSISVSFWF   65 (65)
Q Consensus        23 ~p~~~~~~~~~~~l~pGd~LyiP~~---WwH~V~s~~--~sisvn~w~   65 (65)
                      ||.........+.-++|++|+.|.+   -+|.|....  .-++++.|+
T Consensus       130 f~~~~~~~~~~v~P~~G~~v~f~~~~~~~~H~v~pv~~G~r~~~~~W~  177 (178)
T smart00702      130 FPGLGLMVCATVKPKKGDLLFFPSGRGRSLHGVCPVTRGSRWAITGWI  177 (178)
T ss_pred             ecCCCCccceEEeCCCCcEEEEeCCCCCccccCCcceeCCEEEEEEEE
Confidence            3333323456788899999999985   899999876  578899885


No 69 
>PRK13500 transcriptional activator RhaR; Provisional
Probab=81.40  E-value=3  Score=28.77  Aligned_cols=27  Identities=22%  Similarity=0.310  Sum_probs=22.5

Q ss_pred             eEEEEECCCCEEEeCCCCeEEEEecCC
Q psy12933         31 LIECTLNPGDMLYIPPKVWHHVRSLST   57 (65)
Q Consensus        31 ~~~~~l~pGd~LyiP~~WwH~V~s~~~   57 (65)
                      ..+..+++||+++||++=-|...+.+.
T Consensus        85 ~~~~~l~~Gdl~~I~~~~~H~~~~~~~  111 (312)
T PRK13500         85 DRPYRITRGDLFYIHADDKHSYASVND  111 (312)
T ss_pred             CEEEeecCCeEEEECCCCeecccccCC
Confidence            456788999999999999999876553


No 70 
>PRK10296 DNA-binding transcriptional regulator ChbR; Provisional
Probab=79.68  E-value=2.5  Score=28.25  Aligned_cols=24  Identities=25%  Similarity=0.333  Sum_probs=20.6

Q ss_pred             eEEEEECCCCEEEeCCCCeEEEEe
Q psy12933         31 LIECTLNPGDMLYIPPKVWHHVRS   54 (65)
Q Consensus        31 ~~~~~l~pGd~LyiP~~WwH~V~s   54 (65)
                      .....++|||+++||+|-.|+...
T Consensus        60 ~~~~~l~~g~l~~i~p~~~H~~~~   83 (278)
T PRK10296         60 GKRVLLERGDFVFIPLGSHHQSFY   83 (278)
T ss_pred             CEEEEECCCcEEEeCCCCccceee
Confidence            456799999999999999997753


No 71 
>PF12973 Cupin_7:  ChrR Cupin-like domain; PDB: 3O14_B 2Z2S_F 2Q1Z_B 3EBR_A.
Probab=79.62  E-value=2.8  Score=23.97  Aligned_cols=25  Identities=32%  Similarity=0.460  Sum_probs=20.1

Q ss_pred             EECCCCEEEeCCCCeEEEEecCCeE
Q psy12933         35 TLNPGDMLYIPPKVWHHVRSLSTSI   59 (65)
Q Consensus        35 ~l~pGd~LyiP~~WwH~V~s~~~si   59 (65)
                      ...+||.++.|++--|...+.+.++
T Consensus        61 ~~~~G~~~~~p~g~~h~~~s~~gc~   85 (91)
T PF12973_consen   61 RYGAGDWLRLPPGSSHTPRSDEGCL   85 (91)
T ss_dssp             EEETTEEEEE-TTEEEEEEESSCEE
T ss_pred             cCCCCeEEEeCCCCccccCcCCCEE
Confidence            3479999999999999999876643


No 72 
>COG1791 Uncharacterized conserved protein, contains double-stranded beta-helix domain [Function unknown]
Probab=79.57  E-value=3.6  Score=27.63  Aligned_cols=29  Identities=21%  Similarity=0.554  Sum_probs=24.7

Q ss_pred             CCceEEEEECCCCEEEeCCCCeEEEEecC
Q psy12933         28 PGHLIECTLNPGDMLYIPPKVWHHVRSLS   56 (65)
Q Consensus        28 ~~~~~~~~l~pGd~LyiP~~WwH~V~s~~   56 (65)
                      ..+.+.+.+++||.|-+|+|-.|.-.-.+
T Consensus       114 d~~~~~i~c~~gDLI~vP~gi~HwFtlt~  142 (181)
T COG1791         114 DGKVYQIRCEKGDLISVPPGIYHWFTLTE  142 (181)
T ss_pred             CCcEEEEEEccCCEEecCCCceEEEEccC
Confidence            34788999999999999999999887444


No 73 
>TIGR03028 EpsE polysaccharide export protein EpsE. Sequences in this family of proteins are members of a polysaccharide export protein family (pfam02563) which includes the wza protein from E.coli. This family of proteins are homologous to the EpsE protein of the methanolan biosynthesis operon of Methylobacillus species strain 12S. The distribution of this protein appears to be restricted to a subset of exopolysaccharide operons containing a syntenic grouping of genes including a variant of the EpsH exosortase protein. Exosortase has been proposed to be involved in the targetting and processing of proteins containing the PEP-CTERM domain to the exopolysaccharide layer.
Probab=79.24  E-value=1.3  Score=29.93  Aligned_cols=15  Identities=20%  Similarity=0.733  Sum_probs=13.2

Q ss_pred             EECCCCEEEeCCCCe
Q psy12933         35 TLNPGDMLYIPPKVW   49 (65)
Q Consensus        35 ~l~pGd~LyiP~~Ww   49 (65)
                      .|+|||++|+|..|+
T Consensus       225 ~l~~gDii~V~~s~~  239 (239)
T TIGR03028       225 LVQPDDVIYVRESLF  239 (239)
T ss_pred             ccCCCCEEEEeCccC
Confidence            499999999998874


No 74 
>PF13532 2OG-FeII_Oxy_2:  2OG-Fe(II) oxygenase superfamily; PDB: 2IUW_A 3BTZ_A 3RZL_A 3RZH_A 3S5A_A 3RZG_A 3RZJ_A 3BUC_A 3H8X_A 3H8R_A ....
Probab=78.92  E-value=2.3  Score=26.93  Aligned_cols=29  Identities=24%  Similarity=0.410  Sum_probs=20.7

Q ss_pred             CCceEEEEECCCCEEEeC----CCCeEEEEecCC
Q psy12933         28 PGHLIECTLNPGDMLYIP----PKVWHHVRSLST   57 (65)
Q Consensus        28 ~~~~~~~~l~pGd~LyiP----~~WwH~V~s~~~   57 (65)
                      ....+++.|.+||++++-    ..| |.|.....
T Consensus       144 ~~~~~~~~L~~gsl~vm~g~~r~~~-H~I~~~~~  176 (194)
T PF13532_consen  144 DDEPIEVPLPPGSLLVMSGEARYDW-HGIPPVKK  176 (194)
T ss_dssp             TS-EEEEEE-TTEEEEEETTHHHHE-EEE-S-SC
T ss_pred             CCccEEEEcCCCCEEEeChHHhhhe-eEcccccC
Confidence            347899999999999998    457 99988764


No 75 
>COG5285 Protein involved in biosynthesis of mitomycin antibiotics/polyketide fumonisin [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=77.53  E-value=3.9  Score=29.39  Aligned_cols=31  Identities=19%  Similarity=0.364  Sum_probs=25.9

Q ss_pred             CCCCceEEEEECCCCEEEeCCCCeEEEEecC
Q psy12933         26 QTPGHLIECTLNPGDMLYIPPKVWHHVRSLS   56 (65)
Q Consensus        26 ~~~~~~~~~~l~pGd~LyiP~~WwH~V~s~~   56 (65)
                      |-+.....+.|++||+|+.-+.-||...+-.
T Consensus       187 y~~~~~~pv~lekGDallF~~~L~HaA~aNr  217 (299)
T COG5285         187 YLERNAVPVELEKGDALLFNGSLWHAAGANR  217 (299)
T ss_pred             hhhhcceeeeecCCCEEEEcchhhhhhhcCC
Confidence            3344588999999999999999999988765


No 76 
>PF01238 PMI_typeI:  Phosphomannose isomerase type I;  InterPro: IPR001250 Mannose-6-phosphate isomerase or phosphomannose isomerase (5.3.1.8 from EC) (PMI) is the enzyme that catalyses the interconversion of mannose-6-phosphate and fructose-6-phosphate. In eukaryotes PMI is involved in the synthesis of GDP-mannose, a constituent of N- and O-linked glycans and GPI anchors and in prokaryotes it participates in a variety of pathways, including capsular polysaccharide biosynthesis and D-mannose metabolism. PMI's belong to the cupin superfamily whose functions range from isomerase and epimerase activities involved in the modification of cell wall carbohydrates in bacteria and plants, to non-enzymatic storage proteins in plant seeds, and transcription factors linked to congenital baldness in mammals []. Three classes of PMI have been defined []. Type I includes eukaryotic PMI and the enzyme encoded by the manA gene in enterobacteria. PMI has a bound zinc ion, which is essential for activity. A crystal structure of PMI from Candida albicans shows that the enzyme has three distinct domains []. The active site lies in the central domain, contains a single essential zinc atom, and forms a deep, open cavity of suitable dimensions to contain M6P or F6P The central domain is flanked by a helical domain on one side and a jelly-roll like domain on the other.; GO: 0004476 mannose-6-phosphate isomerase activity, 0008270 zinc ion binding, 0005975 carbohydrate metabolic process; PDB: 1PMI_A 1QWR_B 1ZX5_A 3H1Y_A 2WFP_A 3H1M_A 3H1W_A.
Probab=74.95  E-value=2.4  Score=30.62  Aligned_cols=23  Identities=26%  Similarity=0.436  Sum_probs=18.0

Q ss_pred             EEEEECCCCEEEeCCCCeEEEEe
Q psy12933         32 IECTLNPGDMLYIPPKVWHHVRS   54 (65)
Q Consensus        32 ~~~~l~pGd~LyiP~~WwH~V~s   54 (65)
                      -.+.|+|||.+|+|++--|.-.+
T Consensus       250 N~v~L~pGeaifl~a~~~HAYl~  272 (373)
T PF01238_consen  250 NYVELQPGEAIFLPAGEPHAYLS  272 (373)
T ss_dssp             EEEEE-TT-EEEEHTTHHEEEEE
T ss_pred             EEEEecCCceEEecCCCcccccc
Confidence            45699999999999999998665


No 77 
>PF06052 3-HAO:  3-hydroxyanthranilic acid dioxygenase;  InterPro: IPR010329 Members of this protein family, from both bacteria and eukaryotes, are the enzyme 3-hydroxyanthranilate 3,4-dioxygenase (1.13.11.6 from EC). It is part of the kynurenine pathway for the degradation of tryptophan and the biosynthesis of nicotinic acid [].The prokaryotic homologue is involved in the 2-nitrobenzoate degradation pathway []. The enzyme acts on the tryptophan metabolite 3-hydroxyanthranilate and produces 2-amino-3-carboxymuconate semialdehyde, which can rearrange spontaneously to quinolinic acid and feed into nicotinamide biosynthesis, or undergo further enzymatic degradation.; GO: 0000334 3-hydroxyanthranilate 3,4-dioxygenase activity, 0005506 iron ion binding, 0008152 metabolic process, 0055114 oxidation-reduction process; PDB: 1ZVF_A 1YFX_A 1YFW_A 1YFY_A 1YFU_A 2QNK_A 3FE5_A.
Probab=74.70  E-value=10  Score=24.81  Aligned_cols=33  Identities=21%  Similarity=0.493  Sum_probs=22.4

Q ss_pred             CceEEEEECCCCEEEeCCCCeEEEEecCCeEEE
Q psy12933         29 GHLIECTLNPGDMLYIPPKVWHHVRSLSTSISV   61 (65)
Q Consensus        29 ~~~~~~~l~pGd~LyiP~~WwH~V~s~~~sisv   61 (65)
                      ...-++.+++||+-++|++--|.-+-...||.+
T Consensus        72 g~~kdi~I~EGe~fLLP~~vpHsP~R~~~tiGL  104 (151)
T PF06052_consen   72 GKFKDIPIREGEMFLLPANVPHSPQRPADTIGL  104 (151)
T ss_dssp             TEEEEEEE-TTEEEEE-TT--EEEEE-TT-EEE
T ss_pred             CceEEEEeCCCcEEecCCCCCCCCcCCCCcEEE
Confidence            455689999999999999999998877777654


No 78 
>PRK15401 alpha-ketoglutarate-dependent dioxygenase AlkB; Provisional
Probab=73.16  E-value=8  Score=26.28  Aligned_cols=36  Identities=22%  Similarity=0.345  Sum_probs=27.1

Q ss_pred             CCceEEEEECCCCEEEeC---CCCeEEEEecC---------CeEEEEE
Q psy12933         28 PGHLIECTLNPGDMLYIP---PKVWHHVRSLS---------TSISVSF   63 (65)
Q Consensus        28 ~~~~~~~~l~pGd~LyiP---~~WwH~V~s~~---------~sisvn~   63 (65)
                      ..+...+.|+.||+|++-   ..|||.|....         .-|+++|
T Consensus       163 ~~~~~~l~L~~Gdllvm~G~sr~~~HgVp~~~~~~~p~~g~~RINLTF  210 (213)
T PRK15401        163 SDPLQRILLEHGDVVVWGGPSRLRYHGILPLKAGEHPLTGECRINLTF  210 (213)
T ss_pred             CCceEEEEeCCCCEEEECchHhheeccCCcCCCCcCCCCCCCeEEEEe
Confidence            455789999999999975   35899996553         3577776


No 79 
>PRK13503 transcriptional activator RhaS; Provisional
Probab=73.04  E-value=3.2  Score=27.46  Aligned_cols=25  Identities=16%  Similarity=0.198  Sum_probs=20.9

Q ss_pred             EEEEECCCCEEEeCCCCeEEEEecC
Q psy12933         32 IECTLNPGDMLYIPPKVWHHVRSLS   56 (65)
Q Consensus        32 ~~~~l~pGd~LyiP~~WwH~V~s~~   56 (65)
                      ....+++||+++||++=.|..+..+
T Consensus        53 ~~~~l~~g~~~~i~~~~~h~~~~~~   77 (278)
T PRK13503         53 QPYTLSGGTVCFVRDHDRHLYEHTD   77 (278)
T ss_pred             CcccccCCcEEEECCCccchhhhcc
Confidence            3567899999999999999876654


No 80 
>TIGR02297 HpaA 4-hydroxyphenylacetate catabolism regulatory protein HpaA. This putative transcriptional regulator, which contains both the substrate-binding, dimerization domain (pfam02311) and the helix-turn-helix DNA-binding domain (pfam00165) of the AraC famil, is located proximal to genes of the 4-hydroxyphenylacetate catabolism pathway.
Probab=72.96  E-value=6.7  Score=26.14  Aligned_cols=26  Identities=19%  Similarity=0.075  Sum_probs=21.8

Q ss_pred             eEEEEECCCCEEEeCCCCeEEEEecC
Q psy12933         31 LIECTLNPGDMLYIPPKVWHHVRSLS   56 (65)
Q Consensus        31 ~~~~~l~pGd~LyiP~~WwH~V~s~~   56 (65)
                      .....+++||++++|++--|......
T Consensus        61 ~~~~~l~~g~~~ii~~~~~H~~~~~~   86 (287)
T TIGR02297        61 EHEYSEYAPCFFLTPPSVPHGFVTDL   86 (287)
T ss_pred             CEEEEecCCeEEEeCCCCccccccCC
Confidence            34678999999999999999987654


No 81 
>PLN00212 glutelin; Provisional
Probab=71.88  E-value=9.5  Score=29.02  Aligned_cols=32  Identities=19%  Similarity=0.198  Sum_probs=25.6

Q ss_pred             CCCCceEEEEECCCCEEEeCCCCeEEEEecCC
Q psy12933         26 QTPGHLIECTLNPGDMLYIPPKVWHHVRSLST   57 (65)
Q Consensus        26 ~~~~~~~~~~l~pGd~LyiP~~WwH~V~s~~~   57 (65)
                      +++.+--.-.++.||++.||+|--|...+.+.
T Consensus       142 ~~d~hqkv~~lr~GDViaiPaG~~hw~yN~Gd  173 (493)
T PLN00212        142 FRDEHQKIHQFRQGDVVALPAGVAHWFYNDGD  173 (493)
T ss_pred             cccccccceEeccCCEEEECCCCeEEEEeCCC
Confidence            33444444689999999999999999999874


No 82 
>PRK12335 tellurite resistance protein TehB; Provisional
Probab=68.10  E-value=17  Score=24.85  Aligned_cols=31  Identities=23%  Similarity=0.493  Sum_probs=22.5

Q ss_pred             EEECCCCEEEeCCCCeEEEEecC--CeEEEEEe
Q psy12933         34 CTLNPGDMLYIPPKVWHHVRSLS--TSISVSFW   64 (65)
Q Consensus        34 ~~l~pGd~LyiP~~WwH~V~s~~--~sisvn~w   64 (65)
                      .....++..++|++-||++...+  ..+.+.|+
T Consensus        60 ~l~~~~~~~~i~p~~wh~v~~~s~d~~~~l~fy   92 (287)
T PRK12335         60 IFDAENQPPFIEPQAWHRIEAASDDLECQLSFY   92 (287)
T ss_pred             EEecCCCCceeCCcceEEEEEcCCCcEEEEEEE
Confidence            33444566679999999999985  35666664


No 83 
>PF05995 CDO_I:  Cysteine dioxygenase type I;  InterPro: IPR010300 Cysteine dioxygenase type I (1.13.11.20 from EC) converts cysteine to cysteinesulphinic acid and is the rate-limiting step in sulphate production.; GO: 0005506 iron ion binding, 0017172 cysteine dioxygenase activity, 0046439 L-cysteine metabolic process, 0055114 oxidation-reduction process; PDB: 2IC1_A 3EQE_B 3ELN_A 2B5H_A 2GH2_A 2Q4S_A 2ATF_A 2GM6_A 3USS_B.
Probab=67.28  E-value=27  Score=22.52  Aligned_cols=33  Identities=24%  Similarity=0.289  Sum_probs=23.8

Q ss_pred             EEEEECCCCEEEeCCCCeEEEEecC---CeEEEEEe
Q psy12933         32 IECTLNPGDMLYIPPKVWHHVRSLS---TSISVSFW   64 (65)
Q Consensus        32 ~~~~l~pGd~LyiP~~WwH~V~s~~---~sisvn~w   64 (65)
                      ....+.+|.+.+.++.-.|+|++..   .++|+..+
T Consensus       127 ~~~~~~~g~~~~~~~~~iH~v~n~s~~~~avSLHvY  162 (175)
T PF05995_consen  127 RERLLPGGVTYIFDPHGIHRVENPSGDEPAVSLHVY  162 (175)
T ss_dssp             EEEEETTTEEEEBTTTBEEEEEES-SSS-EEEEEEE
T ss_pred             ceEecCCCeEEecCCCCeEEeccCCCCCCEEEEEEc
Confidence            4456788888888999999997653   57777654


No 84 
>COG3822 ABC-type sugar transport system, auxiliary component [General function prediction only]
Probab=64.22  E-value=15  Score=25.26  Aligned_cols=28  Identities=21%  Similarity=0.467  Sum_probs=23.9

Q ss_pred             CceEEEEECCCCEEEeCCCCeEEEEecC
Q psy12933         29 GHLIECTLNPGDMLYIPPKVWHHVRSLS   56 (65)
Q Consensus        29 ~~~~~~~l~pGd~LyiP~~WwH~V~s~~   56 (65)
                      ...-+..|+||+.+-+|||-||..-+.+
T Consensus       150 ~ag~~lkL~PGesitL~Pg~~HsFwae~  177 (225)
T COG3822         150 TAGSQLKLSPGESITLPPGLYHSFWAEE  177 (225)
T ss_pred             ccceeEEECCCCcEecCCCceeeeeecC
Confidence            3467789999999999999999887665


No 85 
>PRK10371 DNA-binding transcriptional regulator MelR; Provisional
Probab=62.79  E-value=7.1  Score=26.92  Aligned_cols=27  Identities=11%  Similarity=0.091  Sum_probs=22.9

Q ss_pred             ceEEEEECCCCEEEeCCCCeEEEEecC
Q psy12933         30 HLIECTLNPGDMLYIPPKVWHHVRSLS   56 (65)
Q Consensus        30 ~~~~~~l~pGd~LyiP~~WwH~V~s~~   56 (65)
                      ..-+.+++|||+++||++--|.....+
T Consensus        62 ~g~~~~l~~Gd~ili~s~~~H~~~~~~   88 (302)
T PRK10371         62 NNEKVQINQGHITLFWACTPHQLTDPG   88 (302)
T ss_pred             CCEEEEEcCCcEEEEecCCcccccccC
Confidence            356788999999999999999887654


No 86 
>PF12851 Tet_JBP:  Oxygenase domain of the 2OGFeDO superfamily ;  InterPro: IPR024779 TETs are 2OG- and Fe(II)-dependent oxygenases that catalyse the conversion of 5 methyl-Cytosine (5-MC) to 5-hydroxymethyl-cytosine (hmC) in cultured cells and in vitro []. Interestingly TET2 is considered as an oncogene, as it is found mutated in some types of cancer []. This entry represents the double-stranded beta helix (DSBH) fold of the 2-oxoglutarate (2OG) - Fe(II) oxygenases. DSBH comprises a part of the catalytic domain in TETS. It is found in many organisms including fruit fly, African malaria mosquito, zebrafish, mouse and human.
Probab=62.23  E-value=22  Score=23.01  Aligned_cols=36  Identities=28%  Similarity=0.401  Sum_probs=29.7

Q ss_pred             ceEEEEECCCCEEEeCCC-CeEEEEecC-------CeEEEEEeC
Q psy12933         30 HLIECTLNPGDMLYIPPK-VWHHVRSLS-------TSISVSFWF   65 (65)
Q Consensus        30 ~~~~~~l~pGd~LyiP~~-WwH~V~s~~-------~sisvn~w~   65 (65)
                      ..+.+.+.+|++|++-+. .-|.|..++       .-+++.|++
T Consensus       126 ~g~~~~~~~GtVl~~~~~~~~Hgvtpv~~~~~~~~~R~slvfy~  169 (171)
T PF12851_consen  126 LGVAFAYQPGTVLIFCAKRELHGVTPVESPNRNHGTRISLVFYQ  169 (171)
T ss_pred             CCEEEecCCCcEEEEcccceeeecCcccCCCCCCCeEEEEEEEe
Confidence            577889999999999999 888888776       258887764


No 87 
>PF06251 Caps_synth_GfcC:  Capsule biosynthesis GfcC;  InterPro: IPR010425 This entry represents uncharacterised bacterial proteins that contain a central beta-grasp like domain related to the SLBB domain [].; PDB: 3P42_B.
Probab=62.22  E-value=6  Score=26.45  Aligned_cols=13  Identities=38%  Similarity=0.820  Sum_probs=7.8

Q ss_pred             EEECCCCEEEeCC
Q psy12933         34 CTLNPGDMLYIPP   46 (65)
Q Consensus        34 ~~l~pGd~LyiP~   46 (65)
                      ..+.||++||||-
T Consensus       191 ~~l~PG~~I~Vp~  203 (229)
T PF06251_consen  191 QELAPGATIYVPF  203 (229)
T ss_dssp             EE--TT-EEEE-B
T ss_pred             CCCCCCCEEEEcC
Confidence            4699999999997


No 88 
>COG3435 Gentisate 1,2-dioxygenase [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=61.44  E-value=5.1  Score=29.29  Aligned_cols=31  Identities=19%  Similarity=0.309  Sum_probs=24.8

Q ss_pred             CCCCceEEEEECCCCEEEeCCCCeEEEEecC
Q psy12933         26 QTPGHLIECTLNPGDMLYIPPKVWHHVRSLS   56 (65)
Q Consensus        26 ~~~~~~~~~~l~pGd~LyiP~~WwH~V~s~~   56 (65)
                      +.-+..-...+++||.|..|.+-||.--+.+
T Consensus       125 ~T~VdGer~~M~~GDfilTP~w~wHdHgn~g  155 (351)
T COG3435         125 YTVVDGERTPMEAGDFILTPAWTWHDHGNEG  155 (351)
T ss_pred             eEeecCceeeccCCCEEEccCceeccCCCCC
Confidence            4455667788999999999999999765543


No 89 
>cd02786 MopB_CT_3 The MopB_CT_3 CD includes a group of related uncharacterized bacterial molybdopterin-binding oxidoreductase-like domains with a putative N-terminal iron-sulfur [4Fe-4S] cluster binding site and molybdopterin cofactor binding site. This CD is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs.
Probab=59.74  E-value=6.7  Score=22.92  Aligned_cols=12  Identities=8%  Similarity=0.307  Sum_probs=10.6

Q ss_pred             CEEEeCCCCeEE
Q psy12933         40 DMLYIPPKVWHH   51 (65)
Q Consensus        40 d~LyiP~~WwH~   51 (65)
                      +++++|.||||.
T Consensus        73 g~v~~~~g~~~~   84 (116)
T cd02786          73 GVVVAEGGWWRE   84 (116)
T ss_pred             CEEEeecccccc
Confidence            678999999995


No 90 
>PRK10572 DNA-binding transcriptional regulator AraC; Provisional
Probab=57.50  E-value=15  Score=24.64  Aligned_cols=25  Identities=32%  Similarity=0.482  Sum_probs=20.7

Q ss_pred             eEEEEECCCCEEEeCCCCeEEEEec
Q psy12933         31 LIECTLNPGDMLYIPPKVWHHVRSL   55 (65)
Q Consensus        31 ~~~~~l~pGd~LyiP~~WwH~V~s~   55 (65)
                      .-+..++|||++++|++--|.....
T Consensus        66 ~~~~~~~~g~~i~i~p~~~h~~~~~   90 (290)
T PRK10572         66 GRAFVCRPGDLLLFPPGEIHHYGRH   90 (290)
T ss_pred             CeeEecCCCCEEEECCCCceeeccC
Confidence            4467899999999999999986544


No 91 
>TIGR02408 ectoine_ThpD ectoine hydroxylase. Both ectoine and hydroxyectoine are compatible solvents that serve as protectants against osmotic and thermal stresses. A number of genomes synthesize ectoine. This enzyme allows conversion of ectoine to hydroxyectoine, which may be more effective for some purposes, and is found in a subset of ectoine-producing organisms.
Probab=56.83  E-value=8.3  Score=26.41  Aligned_cols=25  Identities=16%  Similarity=0.141  Sum_probs=21.1

Q ss_pred             eEEEEECCCCEEEeCCCCeEEEEec
Q psy12933         31 LIECTLNPGDMLYIPPKVWHHVRSL   55 (65)
Q Consensus        31 ~~~~~l~pGd~LyiP~~WwH~V~s~   55 (65)
                      ...+.++|||+|++-+.-||....-
T Consensus       210 ~v~~~~~aGDvl~f~~~~~H~S~~N  234 (277)
T TIGR02408       210 ISTFTGKAGSAVWFDCNTMHGSGSN  234 (277)
T ss_pred             ceeeccCCceEEEEccccccCCCCC
Confidence            4567789999999999999987654


No 92 
>TIGR01762 chlorin-enz chlorinating enzymes. This model represents a a group of highly homologous enzymes related to dioxygenases which chlorinate amino acid methyl groups. BarB1 and BarB2 are proposed to trichlorinate one of the methyl groups of a leucine residue in the biosynthesis of barbamide in the cyanobacterium Lyngbya majuscula. SyrB2 is proposed to chlorinate the methyl group of threonine in the biosynthesis of syringomycin in Pseudomonas syringae. CmaB is proposed to chlorinate the beta-methyl group of alloisoleucine in the process of ring closure in the biosynthesis of coronamic acid, a component of coronatine also in Pseudomonas syringae.
Probab=55.73  E-value=15  Score=25.60  Aligned_cols=27  Identities=7%  Similarity=0.162  Sum_probs=22.9

Q ss_pred             CceEEEEECCCCEEEeCCCCeEEEEec
Q psy12933         29 GHLIECTLNPGDMLYIPPKVWHHVRSL   55 (65)
Q Consensus        29 ~~~~~~~l~pGd~LyiP~~WwH~V~s~   55 (65)
                      .....++++|||+++.=..-||....-
T Consensus       206 ~~~v~~~lkaGd~~~f~~~t~HgS~~N  232 (288)
T TIGR01762       206 ASAVPMQMKAGQFIIFWSTLMHASYPN  232 (288)
T ss_pred             cceeeeeeCCceEEEECCCceecCCCC
Confidence            356899999999999999999986543


No 93 
>COG3128 PiuC Uncharacterized iron-regulated protein [Function unknown]
Probab=54.86  E-value=25  Score=24.25  Aligned_cols=34  Identities=24%  Similarity=0.517  Sum_probs=28.0

Q ss_pred             eEEEEECCCCEEEeCCCCeEEEEecCC--eEEEEEe
Q psy12933         31 LIECTLNPGDMLYIPPKVWHHVRSLST--SISVSFW   64 (65)
Q Consensus        31 ~~~~~l~pGd~LyiP~~WwH~V~s~~~--sisvn~w   64 (65)
                      -.++.|-+||.+..|+.--|.|...+.  -++.=||
T Consensus       143 ~h~VklPAGdLVlypStSlH~VtPVTRg~R~asffW  178 (229)
T COG3128         143 NHRVKLPAGDLVLYPSTSLHEVTPVTRGERFASFFW  178 (229)
T ss_pred             ceEEeccCCCEEEcccccceeccccccCceEEEeee
Confidence            578899999999999999999999874  4554444


No 94 
>PF01476 LysM:  LysM domain;  InterPro: IPR018392 This domain is about 40 residues long and is found in a variety of enzymes involved in bacterial cell wall degradation []. This domain may have a general peptidoglycan binding function.; GO: 0016998 cell wall macromolecule catabolic process; PDB: 2DJP_A 3ZQD_A 1Y7M_B 4A52_A 2L9Y_A 1E0G_A.
Probab=53.54  E-value=6.3  Score=19.13  Aligned_cols=10  Identities=60%  Similarity=1.115  Sum_probs=9.0

Q ss_pred             ECCCCEEEeC
Q psy12933         36 LNPGDMLYIP   45 (65)
Q Consensus        36 l~pGd~LyiP   45 (65)
                      |.+|+.|.||
T Consensus        35 l~~G~~l~iP   44 (44)
T PF01476_consen   35 LQPGQKLCIP   44 (44)
T ss_dssp             GGTTEEEEEC
T ss_pred             CCCCCEEEeC
Confidence            8899999988


No 95 
>TIGR02272 gentisate_1_2 gentisate 1,2-dioxygenase. This family consists of gentisate 1,2-dioxygenases. This ring-opening enzyme acts in salicylate degradation that goes via gentisate rather than via catechol. It converts gentisate to maleylpyruvate. Some putative gentisate 1,2-dioxygenases are excluded by a relatively high trusted cutoff score because they are too closely related to known examples of 1-hydroxy-2-naphthoate dioxygenase. Therefore some homologs may be bona fide gentisate 1,2-dioxygenases even if they score below the given cutoffs.
Probab=52.78  E-value=36  Score=24.66  Aligned_cols=27  Identities=11%  Similarity=0.165  Sum_probs=20.2

Q ss_pred             EEEEECCCCEEEeCCCCeEEEEecCCe
Q psy12933         32 IECTLNPGDMLYIPPKVWHHVRSLSTS   58 (65)
Q Consensus        32 ~~~~l~pGd~LyiP~~WwH~V~s~~~s   58 (65)
                      ..+..++||++.+|+.-+|+-.+.+..
T Consensus       288 ~~~~W~~gD~f~vPsW~~~~h~a~~da  314 (335)
T TIGR02272       288 AVFRFSPKDVFVVPSWHPVRFEASDDA  314 (335)
T ss_pred             EEEEecCCCEEEECCCCcEecccCCCe
Confidence            355789999999999767766655543


No 96 
>PRK09685 DNA-binding transcriptional activator FeaR; Provisional
Probab=52.73  E-value=33  Score=23.02  Aligned_cols=26  Identities=15%  Similarity=0.098  Sum_probs=21.1

Q ss_pred             eEEEEECCCCEEEeCCCCeEEEEecC
Q psy12933         31 LIECTLNPGDMLYIPPKVWHHVRSLS   56 (65)
Q Consensus        31 ~~~~~l~pGd~LyiP~~WwH~V~s~~   56 (65)
                      .-+..++|||++++|++--|.....+
T Consensus        88 g~~~~l~~G~~~l~~~~~p~~~~~~~  113 (302)
T PRK09685         88 DRQVQLAAGDITLIDASRPCSIYPQG  113 (302)
T ss_pred             CeEEEEcCCCEEEEECCCCcEeecCC
Confidence            45678999999999999888776544


No 97 
>cd02782 MopB_CT_1 The MopB_CT_1 CD includes a group of related uncharacterized bacterial molybdopterin-binding oxidoreductase-like domains with a putative N-terminal iron-sulfur [4Fe-4S] cluster binding site and molybdopterin cofactor binding site. This CD is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs.
Probab=52.50  E-value=8.5  Score=22.98  Aligned_cols=13  Identities=15%  Similarity=0.220  Sum_probs=11.3

Q ss_pred             CEEEeCCCCeEEE
Q psy12933         40 DMLYIPPKVWHHV   52 (65)
Q Consensus        40 d~LyiP~~WwH~V   52 (65)
                      +++++|.+|||..
T Consensus        75 g~v~~~~g~~~~~   87 (129)
T cd02782          75 GVVSLPHGWGHDY   87 (129)
T ss_pred             CeEEeecCcCCCc
Confidence            6899999999964


No 98 
>KOG0162|consensus
Probab=52.39  E-value=11  Score=30.93  Aligned_cols=23  Identities=26%  Similarity=0.527  Sum_probs=19.3

Q ss_pred             CceEEEEECCCCEEEe----CCCCeEE
Q psy12933         29 GHLIECTLNPGDMLYI----PPKVWHH   51 (65)
Q Consensus        29 ~~~~~~~l~pGd~Lyi----P~~WwH~   51 (65)
                      ...-|..++.||+++|    |+|||--
T Consensus      1063 q~~dEls~~~~diIei~~edpSGWw~g 1089 (1106)
T KOG0162|consen 1063 QDVDELSFKKGDIIEIMREDPSGWWLG 1089 (1106)
T ss_pred             CCcccccccCCCEEEEeccCCCcchhh
Confidence            3566889999999998    9999964


No 99 
>cd02794 MopB_CT_DmsA-EC The MopB_CT_DmsA-EC CD includes the DmsA enzyme of the dmsABC operon encoding the anaerobic dimethylsulfoxide reductase (DMSOR) of Escherichia coli and other related DMSOR-like enzymes. Unlike other DMSOR-like enzymes, this group has a  predicted N-terminal iron-sulfur [4Fe-4S] cluster binding site. This CD is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs.
Probab=51.82  E-value=9.1  Score=22.71  Aligned_cols=13  Identities=15%  Similarity=0.542  Sum_probs=11.2

Q ss_pred             CEEEeCCCCeEEE
Q psy12933         40 DMLYIPPKVWHHV   52 (65)
Q Consensus        40 d~LyiP~~WwH~V   52 (65)
                      +++|+|.|||+..
T Consensus        72 g~v~~~~g~~~~~   84 (121)
T cd02794          72 GVVALPQGAWYEP   84 (121)
T ss_pred             CEEEecCccccCC
Confidence            6799999999864


No 100
>PF14604 SH3_9:  Variant SH3 domain; PDB: 2CRE_A 2E5K_A 2CT3_A 2DE0_X 2D8H_A 2DA9_A 2X3X_E 2X3W_D 2KRN_A 2ED0_A ....
Probab=51.61  E-value=6.6  Score=20.24  Aligned_cols=19  Identities=21%  Similarity=0.496  Sum_probs=13.0

Q ss_pred             EEEECCCCEEEe----CCCCeEE
Q psy12933         33 ECTLNPGDMLYI----PPKVWHH   51 (65)
Q Consensus        33 ~~~l~pGd~Lyi----P~~WwH~   51 (65)
                      +..+++||+|.+    ..+||.-
T Consensus        12 ELs~~~Gd~i~v~~~~~~~W~~g   34 (49)
T PF14604_consen   12 ELSFKKGDVITVLEKSDDGWWYG   34 (49)
T ss_dssp             B-EB-TTEEEEEEEESSTSEEEE
T ss_pred             EeeEcCCCEEEEEEeCCCCEEEE
Confidence            788889988885    6777764


No 101
>PRK15078 polysaccharide export protein Wza; Provisional
Probab=51.47  E-value=9.9  Score=27.62  Aligned_cols=26  Identities=31%  Similarity=0.422  Sum_probs=18.3

Q ss_pred             eEEEEECCCCEEEeCCCCeEEEEecC
Q psy12933         31 LIECTLNPGDMLYIPPKVWHHVRSLS   56 (65)
Q Consensus        31 ~~~~~l~pGd~LyiP~~WwH~V~s~~   56 (65)
                      .....|+|||++|+|+.==..|.-++
T Consensus       236 ~~ni~L~~GDvI~Vp~~~~~~v~V~G  261 (379)
T PRK15078        236 SQNRLLYPGDILYVPRNDDLKVFVMG  261 (379)
T ss_pred             ccCceeCCCCEEEECCCCCcEEEEee
Confidence            35677999999999985444454443


No 102
>PF04970 LRAT:  Lecithin retinol acyltransferase;  InterPro: IPR007053 This entry represents a conserved sequence region found in proteins from viruses, bacteria and eukaryotes. It contains a well-conserved NCEHF motif, though its function in these proteins is unknown.; PDB: 2KYT_A 4DOT_A 4FA0_A.
Probab=50.62  E-value=6.7  Score=23.64  Aligned_cols=23  Identities=26%  Similarity=0.315  Sum_probs=14.0

Q ss_pred             EEECCCCEEEeCC-CCeEEEEecC
Q psy12933         34 CTLNPGDMLYIPP-KVWHHVRSLS   56 (65)
Q Consensus        34 ~~l~pGd~LyiP~-~WwH~V~s~~   56 (65)
                      ..++|||+|+++. .++|..-..+
T Consensus         5 ~~~~~GD~I~~~r~~y~H~gIYvG   28 (125)
T PF04970_consen    5 KRLKPGDHIEVPRGLYEHWGIYVG   28 (125)
T ss_dssp             -S--TT-EEEEEETTEEEEEEEEE
T ss_pred             cCCCCCCEEEEecCCccEEEEEec
Confidence            4589999999775 5677776655


No 103
>KOG2757|consensus
Probab=50.19  E-value=22  Score=26.61  Aligned_cols=27  Identities=33%  Similarity=0.410  Sum_probs=19.5

Q ss_pred             eEEEEECCCCEEEeCCCCeEEEEecCC
Q psy12933         31 LIECTLNPGDMLYIPPKVWHHVRSLST   57 (65)
Q Consensus        31 ~~~~~l~pGd~LyiP~~WwH~V~s~~~   57 (65)
                      .-...+++||++|||+.--=+..+.+.
T Consensus       371 ~~~~~v~rG~V~fI~a~~~i~~~~~sd  397 (411)
T KOG2757|consen  371 DSKILVNRGDVLFIPANHPIHLSSSSD  397 (411)
T ss_pred             CCceeeccCcEEEEcCCCCceeeccCc
Confidence            445678999999999986555555443


No 104
>PF02080 TrkA_C:  TrkA-C domain;  InterPro: IPR006037 The regulator of K+ conductance (RCK) domain is found in many ligand-gated K+ channels, most often attached to the intracellular carboxy terminus. The domain is prevalent among prokaryotic K+ channels, and also found in eukaryotic, high-conductance Ca2+-activated K+ channels (BK channels) [, , ]. Largely involved in redox-linked regulation of potassium channels, the N-terminal part of the RCK domain is predicted to be an active dehydrogenase at least in some cases []. Some have a conserved sequence motif (G-x-G-x-x-G-x(n)-[DE]) for NAD+ binding [], but others do not, reflecting the diversity of ligands for RCK domains. The C-terminal part is less conserved, being absent in some channels, such as the kefC antiporter from Escherichia coli. It is predicted to bind unidentified ligands and to regulate sulphate, sodium and other transporters. The X-ray structure of several RCK domains has been solved [, , ]. It reveals an alpha-beta fold similar to dehydrogenase enzymes. The domain forms a homodimer, producing a cleft between two lobes. It has a composite structure, with an N-terminal (RCK-N), and a C-terminal (RCK-C) subdomain. The RCK-N subdomain forms a Rossmann fold with two alpha helices on one side of a six stranded parallel beta sheet and three alpha helices on the other side. The RCK-C subdomain is an all-beta-strand fold. It forms an extention of the dimer interface and further stabilises the RCK homodimer [, , ]. Ca2+ is a ligand that opens the channel in a concentration-dependent manner. Two Ca2+ ions are located at the base of a cleft between two RCK domains, coordinated by the carboxylate groups of two glutamate residues, and by an aspartate residue [, , ]. RCK domains occur in at least five different contexts:   As a single domain on the C terminus of some K+ channels (for example, many prokaryotic K+ channels).  As two tandem RCK domains on the C terminus of some transporters that form gating rings (for example, eukaryotic BK channels). The gating ring has an arrangement of eight identical RCK domains, one from each of the four pore-forming subunits and four from the intracellular solution. As two domains, one at the N terminus and another at the C terminus of transporter (for example, the prokaryotic trk system potassium uptake protein A). As a soluble protein (not part of a K+ channel) consisting of two tandem RCK domains. As a soluble protein consisting of a single RCK domain.   This entry represents the C-terminal subdomain of RCK.; GO: 0008324 cation transmembrane transporter activity, 0006813 potassium ion transport; PDB: 2BKP_A 1VCT_A 2BKO_A 2BKN_A 3L4B_C 2FY8_D 2AEF_A 1LNQ_E 3RBX_C 3KXD_A ....
Probab=49.89  E-value=8.5  Score=20.38  Aligned_cols=12  Identities=33%  Similarity=0.761  Sum_probs=8.7

Q ss_pred             EEEECCCCEEEe
Q psy12933         33 ECTLNPGDMLYI   44 (65)
Q Consensus        33 ~~~l~pGd~Lyi   44 (65)
                      +.+|++||.|++
T Consensus        45 ~~~l~~gD~l~v   56 (71)
T PF02080_consen   45 DTVLQAGDILIV   56 (71)
T ss_dssp             T-BE-TTEEEEE
T ss_pred             CCEECCCCEEEE
Confidence            678999999986


No 105
>PF15340 COPR5:  Cooperator of PRMT5 family
Probab=49.70  E-value=7  Score=25.38  Aligned_cols=31  Identities=32%  Similarity=0.604  Sum_probs=23.9

Q ss_pred             CCCCceEEEEECCCCEEEeCCCCeEEEEecCCe
Q psy12933         26 QTPGHLIECTLNPGDMLYIPPKVWHHVRSLSTS   58 (65)
Q Consensus        26 ~~~~~~~~~~l~pGd~LyiP~~WwH~V~s~~~s   58 (65)
                      ....++..|.---|||+|=|+  ||+-..|.+.
T Consensus       106 sqE~kpWvccapqGDmIYDPS--WHHPpPLiPH  136 (152)
T PF15340_consen  106 SQEVKPWVCCAPQGDMIYDPS--WHHPPPLIPH  136 (152)
T ss_pred             hhhcccceeecCCCCcccCCc--cCCCCCCccc
Confidence            445667888888999999997  8887666543


No 106
>COG3837 Uncharacterized conserved protein, contains double-stranded beta-helix domain [Function unknown]
Probab=49.56  E-value=38  Score=22.41  Aligned_cols=27  Identities=33%  Similarity=0.309  Sum_probs=23.0

Q ss_pred             eEEEEECCCCEEEeCCC--CeEEEEecCC
Q psy12933         31 LIECTLNPGDMLYIPPK--VWHHVRSLST   57 (65)
Q Consensus        31 ~~~~~l~pGd~LyiP~~--WwH~V~s~~~   57 (65)
                      .-+..|+|||++-+|+|  --|+-.+-+.
T Consensus        81 ~~e~~lrpGD~~gFpAG~~~aHhliN~s~  109 (161)
T COG3837          81 GGETRLRPGDSAGFPAGVGNAHHLINRSD  109 (161)
T ss_pred             CeeEEecCCceeeccCCCcceeEEeecCC
Confidence            45788999999999999  8898888764


No 107
>TIGR02899 spore_safA spore coat assembly protein SafA. in which one of which is found in most examples of endospore-forming bacteria. Lysin motifs are repeated in many proteins.
Probab=48.90  E-value=11  Score=17.67  Aligned_cols=12  Identities=33%  Similarity=0.628  Sum_probs=10.2

Q ss_pred             EECCCCEEEeCC
Q psy12933         35 TLNPGDMLYIPP   46 (65)
Q Consensus        35 ~l~pGd~LyiP~   46 (65)
                      .+.+|+.|.+|.
T Consensus        33 ~~~~g~~l~ip~   44 (44)
T TIGR02899        33 LIYPGMKIKIPS   44 (44)
T ss_pred             CcCCCCEEecCC
Confidence            588999999984


No 108
>PF10014 2OG-Fe_Oxy_2:  2OG-Fe dioxygenase;  InterPro: IPR018724  Members of this family of hypothetical bacterial proteins have no known function. Some are described as putative biofilm formation or putative agglutination proteins. ; PDB: 3PL0_B.
Probab=48.16  E-value=15  Score=24.23  Aligned_cols=30  Identities=30%  Similarity=0.551  Sum_probs=21.2

Q ss_pred             CCCceEEEEECCCCEEEeC-CCCeEEEEecC
Q psy12933         27 TPGHLIECTLNPGDMLYIP-PKVWHHVRSLS   56 (65)
Q Consensus        27 ~~~~~~~~~l~pGd~LyiP-~~WwH~V~s~~   56 (65)
                      +........++|||+|++- ..-||.|....
T Consensus       148 ~~~~~~~~l~~p~d~l~~~D~~~~H~vtpI~  178 (195)
T PF10014_consen  148 KEILFFFTLLEPGDTLLVDDRRVWHYVTPIR  178 (195)
T ss_dssp             SSEEEEE---STTEEEEEETTTEEEEE--EE
T ss_pred             CCcceEEEecCCCCEEEEeCCcceECCCcee
Confidence            4556778899999999999 99999998765


No 109
>COG3257 GlxB Uncharacterized protein, possibly involved in glyoxylate utilization [General function prediction only]
Probab=46.68  E-value=42  Score=23.68  Aligned_cols=26  Identities=23%  Similarity=0.345  Sum_probs=22.4

Q ss_pred             eEEEEECCCCEEEeCCCCeEEEEecC
Q psy12933         31 LIECTLNPGDMLYIPPKVWHHVRSLS   56 (65)
Q Consensus        31 ~~~~~l~pGd~LyiP~~WwH~V~s~~   56 (65)
                      .-+..|.+|+-+|+|+|-=|.+++..
T Consensus       100 G~th~l~eggyaylPpgs~~~~~N~~  125 (264)
T COG3257         100 GKTHALREGGYAYLPPGSGWTLRNAQ  125 (264)
T ss_pred             CeEEEeccCCeEEeCCCCcceEeecc
Confidence            44567899999999999999999765


No 110
>PRK15175 Vi polysaccharide export protein VexA; Provisional
Probab=46.35  E-value=14  Score=26.91  Aligned_cols=16  Identities=19%  Similarity=0.445  Sum_probs=14.1

Q ss_pred             EEEEECCCCEEEeCCC
Q psy12933         32 IECTLNPGDMLYIPPK   47 (65)
Q Consensus        32 ~~~~l~pGd~LyiP~~   47 (65)
                      ..+.|+|||++++|..
T Consensus       222 ~ni~L~~GDvI~V~~~  237 (355)
T PRK15175        222 LDIALQPDDRITLRQV  237 (355)
T ss_pred             CCcEeCCCCEEEEccC
Confidence            4789999999999974


No 111
>KOG2757|consensus
Probab=45.07  E-value=21  Score=26.76  Aligned_cols=24  Identities=29%  Similarity=0.396  Sum_probs=19.8

Q ss_pred             EEEECCCCEEEeCCCCeEEEEecC
Q psy12933         33 ECTLNPGDMLYIPPKVWHHVRSLS   56 (65)
Q Consensus        33 ~~~l~pGd~LyiP~~WwH~V~s~~   56 (65)
                      .+.|+||+++|+-+.--|.--+-+
T Consensus       249 ~~~L~PGEA~yL~AnepHAYlsGd  272 (411)
T KOG2757|consen  249 YVRLNPGEAIYLEANEPHAYLSGD  272 (411)
T ss_pred             heecCCCceeeecCCCcceeecCc
Confidence            578999999999999999755433


No 112
>PRK13498 chemoreceptor glutamine deamidase CheD; Provisional
Probab=45.04  E-value=76  Score=20.71  Aligned_cols=34  Identities=21%  Similarity=0.550  Sum_probs=20.8

Q ss_pred             ceEEEEECCCCEEEeCCCCeEEEEecCCeEEEEEe
Q psy12933         30 HLIECTLNPGDMLYIPPKVWHHVRSLSTSISVSFW   64 (65)
Q Consensus        30 ~~~~~~l~pGd~LyiP~~WwH~V~s~~~sisvn~w   64 (65)
                      ...++.|.|||+-+ ...==-=...|+.|++|-+|
T Consensus         6 ~~~~v~v~~Ge~~v-~~~p~~i~T~LGSCVav~l~   39 (167)
T PRK13498          6 GTKEIVLQPGEVVF-ATRPTRLRTLLGSCVAITFW   39 (167)
T ss_pred             CceEEEEeCCcEEE-ECCCCEEEEECcccEEEEEE
Confidence            33588899999544 33311112226789999887


No 113
>cd02781 MopB_CT_Acetylene-hydratase The MopB_CT_Acetylene-hydratase CD contains acetylene hydratase (Ahy) and other related proteins. The acetylene hydratase of Pelobacter acetylenicus is a tungsten iron-sulfur protein involved in the fermentation of acetylene to ethanol and acetate. This CD is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs.
Probab=44.58  E-value=15  Score=21.85  Aligned_cols=13  Identities=8%  Similarity=0.120  Sum_probs=11.4

Q ss_pred             CEEEeCCCCeEEE
Q psy12933         40 DMLYIPPKVWHHV   52 (65)
Q Consensus        40 d~LyiP~~WwH~V   52 (65)
                      +++++|.|||+..
T Consensus        75 g~v~~~~g~~~~~   87 (130)
T cd02781          75 GVVRAEHGWWYPE   87 (130)
T ss_pred             CEEEEeccccccc
Confidence            7799999999874


No 114
>PF01427 Peptidase_M15:  D-ala-D-ala dipeptidase This is family M15 in the peptidase classification. ;  InterPro: IPR000755 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M15 (clan MD), subfamily M15D (vanX D-Ala-D-Ala dipeptidase). The D-alanyl-D-alanine dipeptidase enzyme from Enterococcus faecalis is also known as the vancomycin resistance protein VanX, and hydrolyses D-ala-D-ala. It has a 250-fold differential in catalytic efficiency for hydrolysis of D-ala-D-ala versus D-ala-D-lactate. The latter therefore remains intact for subsequent incorporation into peptidoglycan precursors that terminate in the dipeptide D-ala-D-lactate rather than the dipeptide D-ala-D-ala, thereby preventing vancomycin from binding. The enzyme requires a metal cofactor, and is induced by vancomycin through regulation by VanS and VanR.; GO: 0008237 metallopeptidase activity, 0016805 dipeptidase activity, 0006508 proteolysis, 0005618 cell wall; PDB: 1R44_B.
Probab=43.73  E-value=11  Score=25.17  Aligned_cols=16  Identities=19%  Similarity=0.447  Sum_probs=10.4

Q ss_pred             CEEEeCCCCeEEEEec
Q psy12933         40 DMLYIPPKVWHHVRSL   55 (65)
Q Consensus        40 d~LyiP~~WwH~V~s~   55 (65)
                      -..=+|..|||.--..
T Consensus       170 GF~~~p~EWWHf~~~d  185 (198)
T PF01427_consen  170 GFVNYPTEWWHFSYGD  185 (198)
T ss_dssp             TEEE-TT-TTEEEESS
T ss_pred             CCccCCchhcCcCCCC
Confidence            3667999999987654


No 115
>PF00877 NLPC_P60:  NlpC/P60 family;  InterPro: IPR000064 The Escherichia coli NLPC/Listeria P60 domain occurs at the C terminus of a number of different bacterial and viral proteins. The viral proteins are either described as tail assembly proteins or Gp19. In bacteria, the proteins are variously described as being putative tail component of prophage, invasin, invasion associated protein, putative lipoprotein, cell wall hydrolase, or putative endopeptidase.  The E. coli NLPC/Listeria P60 domain is contained within the boundaries of the cysteine peptidase domain that defines the MEROPS peptidase family C40 (clan C-). A type example being dipeptidyl-peptidase VI from Bacillus sphaericus and gamma-glutamyl-diamino acid-endopeptidase precursor from Lactococcus lactis 3.4.19.11 from EC. This group also contains proteins classified as non-peptidase homologues in that they either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity of peptidases in the C40 family. ; PDB: 3PVQ_B 3GT2_A 3NPF_B 2K1G_A 3I86_A 3S0Q_A 2XIV_A 3PBC_A 3NE0_A 3M1U_B ....
Probab=43.23  E-value=13  Score=21.39  Aligned_cols=16  Identities=25%  Similarity=0.617  Sum_probs=12.1

Q ss_pred             eEEEEECCCCEEEeCC
Q psy12933         31 LIECTLNPGDMLYIPP   46 (65)
Q Consensus        31 ~~~~~l~pGd~LyiP~   46 (65)
                      .-...++|||+||+..
T Consensus        47 ~~~~~~~pGDlif~~~   62 (105)
T PF00877_consen   47 VPISELQPGDLIFFKG   62 (105)
T ss_dssp             EEGGG-TTTEEEEEEG
T ss_pred             cchhcCCcccEEEEeC
Confidence            4456789999999887


No 116
>PF02041 Auxin_BP:  Auxin binding protein;  InterPro: IPR000526 Auxin binding protein is located in the lumen of the endoplasmic reticulum (ER). The primary structure contains an N-terminal hydrophobic leader sequence of 30-40 amino acids, which could represent a signal for translocation of the protein to the ER [, ]. The mature protein comprises around 165 residues, and contains a number of potential N-glycosylation sites. In vitro transport studies have demonstrated co-translational glycosylation []. Retention within the lumen of the ER correlates with an additional signal located at the C terminus, represented by the sequence Lys-Asp-Glu-Leu, known to be responsible for preventing secretion of proteins from the lumen of the ER in eukaryotic cells [, ].; GO: 0004872 receptor activity, 0005788 endoplasmic reticulum lumen; PDB: 1LR5_D 1LRH_D.
Probab=42.76  E-value=40  Score=22.38  Aligned_cols=28  Identities=25%  Similarity=0.322  Sum_probs=18.1

Q ss_pred             CceEEEEECCCCEEEeCCCCeEEEEecC
Q psy12933         29 GHLIECTLNPGDMLYIPPKVWHHVRSLS   56 (65)
Q Consensus        29 ~~~~~~~l~pGd~LyiP~~WwH~V~s~~   56 (65)
                      ..+.+....|++.++||.+=-||+-+.+
T Consensus        88 G~pqef~~~pnSTf~IPvn~~HQv~NT~  115 (167)
T PF02041_consen   88 GKPQEFPIFPNSTFHIPVNDAHQVWNTN  115 (167)
T ss_dssp             -S-EEEEE-TTEEEEE-TT--EEEE---
T ss_pred             CCceEEEecCCCeEEeCCCCcceeecCC
Confidence            4678999999999999999999999987


No 117
>TIGR01015 hmgA homogentisate 1,2-dioxygenase. Missing in human disease alkaptonuria.
Probab=42.13  E-value=33  Score=25.87  Aligned_cols=23  Identities=26%  Similarity=0.419  Sum_probs=19.8

Q ss_pred             EEECCCCEEEeCCCCeEEEEecC
Q psy12933         34 CTLNPGDMLYIPPKVWHHVRSLS   56 (65)
Q Consensus        34 ~~l~pGd~LyiP~~WwH~V~s~~   56 (65)
                      ..++|||+++||.|-=++|+-.+
T Consensus       168 L~v~pgei~VIPRG~~frv~l~g  190 (429)
T TIGR01015       168 LLVEPNEICVIPRGVRFRVTVLE  190 (429)
T ss_pred             eEecCCCEEEecCccEEEEeeCC
Confidence            37899999999999999998544


No 118
>PRK05341 homogentisate 1,2-dioxygenase; Provisional
Probab=42.06  E-value=34  Score=25.92  Aligned_cols=23  Identities=26%  Similarity=0.410  Sum_probs=19.4

Q ss_pred             EEECCCCEEEeCCCCeEEEEecC
Q psy12933         34 CTLNPGDMLYIPPKVWHHVRSLS   56 (65)
Q Consensus        34 ~~l~pGd~LyiP~~WwH~V~s~~   56 (65)
                      ..++|||+++||.|-=++|+-.+
T Consensus       174 L~v~pgei~VIPRG~~frv~l~~  196 (438)
T PRK05341        174 LDVEPGEIAVIPRGVKFRVELPD  196 (438)
T ss_pred             eEecCCCEEEEcCccEEEEecCC
Confidence            46789999999999999998433


No 119
>PRK00924 5-keto-4-deoxyuronate isomerase; Provisional
Probab=41.88  E-value=24  Score=25.05  Aligned_cols=18  Identities=22%  Similarity=0.208  Sum_probs=14.2

Q ss_pred             EEEEECCCCEEEeCCCCe
Q psy12933         32 IECTLNPGDMLYIPPKVW   49 (65)
Q Consensus        32 ~~~~l~pGd~LyiP~~Ww   49 (65)
                      .+..|++.|+||+|.|==
T Consensus        92 ~~~~l~~~d~LYVp~G~~  109 (276)
T PRK00924         92 ETYELGHRDALYVGKGAK  109 (276)
T ss_pred             EEEecCCCcEEEECCCCc
Confidence            455688999999999943


No 120
>TIGR02451 anti_sig_ChrR anti-sigma factor, putative, ChrR family. The member of this family from Rhodobacter sphaeroides has been shown both to form a complex with sigma(E) and to negatively regulate tetrapyrrole biosynthesis. This protein likely contains (at least) two distinct functional domains; several smaller homologs (excluded by the model) show homology only to the C-terminal, including a motif PxHxHxGxE.
Probab=41.64  E-value=33  Score=22.93  Aligned_cols=28  Identities=21%  Similarity=0.288  Sum_probs=23.0

Q ss_pred             EECCCCEEEeCCCCeEEEEecCC--eEEEE
Q psy12933         35 TLNPGDMLYIPPKVWHHVRSLST--SISVS   62 (65)
Q Consensus        35 ~l~pGd~LyiP~~WwH~V~s~~~--sisvn   62 (65)
                      ...+||.+.+|++--|+..+.+.  ++-+.
T Consensus       164 ~y~~Gd~i~~p~~~~H~p~a~~~~~Cicl~  193 (215)
T TIGR02451       164 VYGVGDFEEADGSVQHQPRTVSGGDCLCLA  193 (215)
T ss_pred             ccCCCeEEECCCCCCcCcccCCCCCeEEEE
Confidence            45799999999999999999964  65543


No 121
>TIGR02219 phage_NlpC_fam putative phage cell wall peptidase, NlpC/P60 family. Members of this family show sequence similarity to members of the NlpC/P60 family described by Pfam model pfam00877 and by Anantharaman and Aravind (PubMed:12620121). The NlpC/P60 family includes a number of characterized bacterial cell wall hydrolases. Members of this related family are all found in prophage regions of bacterial genomes.
Probab=41.60  E-value=15  Score=22.58  Aligned_cols=11  Identities=36%  Similarity=0.715  Sum_probs=9.5

Q ss_pred             EECCCCEEEeC
Q psy12933         35 TLNPGDMLYIP   45 (65)
Q Consensus        35 ~l~pGd~LyiP   45 (65)
                      .++|||++|+-
T Consensus        76 ~~qpGDlvff~   86 (134)
T TIGR02219        76 AAQPGDVLVFR   86 (134)
T ss_pred             cCCCCCEEEEe
Confidence            58999999983


No 122
>PF11543 UN_NPL4:  Nuclear pore localisation protein NPL4;  InterPro: IPR024682 Npl4, along with Ufd1, forms the heterodimer adaptor complex UN, which is involved in the recruitment of p97, an AAA ATPase, for tasks involving the ubiquitin pathway.  Npl4 has a N-terminal ubiquitin-like domain which has within its structure a beta-grasp fold with a helical insert []. This entry represents the ubiquitin-like domain.; PDB: 2PJH_A 1WF9_A.
Probab=41.55  E-value=15  Score=21.12  Aligned_cols=19  Identities=37%  Similarity=0.564  Sum_probs=8.9

Q ss_pred             CCCceEEEEECCCCEEEeC
Q psy12933         27 TPGHLIECTLNPGDMLYIP   45 (65)
Q Consensus        27 ~~~~~~~~~l~pGd~LyiP   45 (65)
                      .....-+.-|+-||+||+=
T Consensus        60 ~~~tl~~lglkHGdmlyL~   78 (80)
T PF11543_consen   60 DSKTLSSLGLKHGDMLYLK   78 (80)
T ss_dssp             TT-CCCCT---TT-EEE--
T ss_pred             CcCCHHHcCCCCccEEEEe
Confidence            3455666789999999974


No 123
>TIGR03805 beta_helix_1 parallel beta-helix repeat-containing protein. Members of this protein family contain a tandem pair of beta-helix repeats (see TIGR03804). Each repeat is expected to consist of three beta strands that form a single turn as they form a right-handed helix of stacked beta-structure. Member proteinsa occur regularly in two-gene pairs along with another uncharacterized protein family; both protein families exhibit either lipoprotein or regular signal peptides, suggesting transit through the plasma membrane, and the two may be fused. The function of the pair is unknown.
Probab=41.29  E-value=38  Score=23.88  Aligned_cols=15  Identities=33%  Similarity=0.811  Sum_probs=13.8

Q ss_pred             ECCCCEEEeCCCCeE
Q psy12933         36 LNPGDMLYIPPKVWH   50 (65)
Q Consensus        36 l~pGd~LyiP~~WwH   50 (65)
                      .+|||+|.+|+|-++
T Consensus         8 A~~GDtI~l~~G~Y~   22 (314)
T TIGR03805         8 AQPGDTIVLPEGVFQ   22 (314)
T ss_pred             CCCCCEEEECCCEEE
Confidence            479999999999998


No 124
>PF04209 HgmA:  homogentisate 1,2-dioxygenase;  InterPro: IPR005708  Alkaptonuria (AKU), a rare hereditary disorder, was the first disease to be interpreted as an inborn error of metabolism. The deficiency causes homogentisic aciduria, ochronosis, and arthritis. AKU patients are deficient for homogentisate 1,2 dioxygenase (1.13.11.5 from EC), the enzyme that mediates the conversion of homogentisate to maleylacetoacetate; a step in the catabolism of both tyrosine and phenylalanine.  Homogentisate + O(2) = 4-maleylacetoacetate.   ; GO: 0004411 homogentisate 1,2-dioxygenase activity, 0006559 L-phenylalanine catabolic process, 0006570 tyrosine metabolic process, 0055114 oxidation-reduction process; PDB: 1EY2_A 1EYB_A.
Probab=41.15  E-value=32  Score=25.82  Aligned_cols=24  Identities=25%  Similarity=0.396  Sum_probs=16.1

Q ss_pred             EEECCCCEEEeCCCCeEEEEecCC
Q psy12933         34 CTLNPGDMLYIPPKVWHHVRSLST   57 (65)
Q Consensus        34 ~~l~pGd~LyiP~~WwH~V~s~~~   57 (65)
                      ..++|||+++||.|-=++|+-.++
T Consensus       166 L~v~pGd~~VIPRG~~~rv~l~~p  189 (424)
T PF04209_consen  166 LDVRPGDYVVIPRGTRFRVELPGP  189 (424)
T ss_dssp             EEE-TTEEEEE-TT--EEEE-SSS
T ss_pred             EEEcCCeEEEECCeeEEEEEeCCC
Confidence            458999999999999999987654


No 125
>cd02792 MopB_CT_Formate-Dh-Na-like Formate dehydrogenase N, alpha subunit (Formate-Dh-Na) is a major component of nitrate respiration in bacteria such as in the E. coli formate dehydrogenase N (Fdh-N). Fdh-N is a membrane protein that is a complex of three different subunits and is the major electron donor to the nitrate respiratory chain. Also included in this CD is the Desulfovibrio gigas tungsten formate dehydrogenase, DgW-FDH. In contrast to Fdh-N, which is a  functional heterotrimer, DgW-FDH is a heterodimer. The DgW-FDH complex is composed of a large subunit carrying the W active site and one [4Fe-4S] center, and a small subunit that harbors a series of three [4Fe-4S] clusters as well as a putative vacant binding site for a fourth cluster. The smaller subunit is not included in this alignment. This CD (MopB_CT_Formate-Dh-Na-like) is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs.
Probab=40.76  E-value=23  Score=20.67  Aligned_cols=12  Identities=17%  Similarity=-0.108  Sum_probs=10.7

Q ss_pred             CEEEeCCCCeEE
Q psy12933         40 DMLYIPPKVWHH   51 (65)
Q Consensus        40 d~LyiP~~WwH~   51 (65)
                      +++++|.+||+.
T Consensus        77 g~v~~~~g~~~~   88 (122)
T cd02792          77 HEVGIPYHWGGM   88 (122)
T ss_pred             CEEEEecccCcc
Confidence            689999999985


No 126
>PF07653 SH3_2:  Variant SH3 domain;  InterPro: IPR011511 SH3 (src Homology-3) domains are small protein modules containing approximately 50 amino acid residues [, ]. They are found in a great variety of intracellular or membrane-associated proteins [, , ] for example, in a variety of proteins with enzymatic activity, in adaptor proteins that lack catalytic sequences and in cytoskeletal proteins, such as fodrin and yeast actin binding protein ABP-1. The SH3 domain has a characteristic fold which consists of five or six beta-strands arranged as two tightly packed anti-parallel beta sheets. The linker regions may contain short helices []. The surface of the SH3-domain bears a flat, hydrophobic ligand-binding pocket which consists of three shallow grooves defined by conservative aromatic residues in which the ligand adopts an extended left-handed helical arrangement. The ligand binds with low affinity but this may be enhanced by multiple interactions. The region bound by the SH3 domain is in all cases proline-rich and contains PXXP as a core-conserved binding motif. The function of the SH3 domain is not well understood but they may mediate many diverse processes such as increasing local concentration of proteins, altering their subcellular location and mediating the assembly of large multiprotein complexes []. This entry represents a variant of the SH3 domain.; PDB: 1I1J_B 1K0X_A 1HJD_A 2KEA_A 1KJW_A 1JXM_A 1JXO_B 2EBP_A 2DL3_A 2EYX_A ....
Probab=40.55  E-value=8.9  Score=19.86  Aligned_cols=21  Identities=19%  Similarity=0.351  Sum_probs=12.5

Q ss_pred             EEEEECCCCEEEe-----CCCCeEEE
Q psy12933         32 IECTLNPGDMLYI-----PPKVWHHV   52 (65)
Q Consensus        32 ~~~~l~pGd~Lyi-----P~~WwH~V   52 (65)
                      -+..++.||++.+     +.+||--.
T Consensus        14 ~~Ls~~~Gd~i~v~~~~~~~~ww~~~   39 (55)
T PF07653_consen   14 DELSFKKGDVIEVLGEKDDDGWWLGE   39 (55)
T ss_dssp             TB-EB-TTEEEEEEEEECSTSEEEEE
T ss_pred             CceEEecCCEEEEEEeecCCCEEEEE
Confidence            3477888887765     57776543


No 127
>TIGR03505 FimV_core FimV N-terminal domain. This region is found at, or about 200 amino acids from, the N-terminus of FimV from Pseudomonas aeruginosa, TspA of Neisseria meningitidis, and related proteins. Disruption of FimV blocks twitching motility from type IV pili; Semmler, et al. suggest a role for this family in peptidoglycan layer remodelling required by type IV fimbrial systems. Most but not all members of this protein family have a C-terminal region recognized by TIGR03504. In between is a highly variable, often repeat-filled region rich in the negatively charged amino acids Asp and Glu.
Probab=39.84  E-value=19  Score=20.52  Aligned_cols=13  Identities=38%  Similarity=0.723  Sum_probs=11.3

Q ss_pred             EECCCCEEEeCCC
Q psy12933         35 TLNPGDMLYIPPK   47 (65)
Q Consensus        35 ~l~pGd~LyiP~~   47 (65)
                      -|++|..|.||..
T Consensus        40 ~L~~G~~L~iP~~   52 (74)
T TIGR03505        40 RLKVGQILRIPSE   52 (74)
T ss_pred             hcCCCCEEeCCCH
Confidence            5799999999974


No 128
>PF11699 CENP-C_C:  Mif2/CENP-C like; PDB: 2VPV_B.
Probab=39.67  E-value=78  Score=18.42  Aligned_cols=27  Identities=7%  Similarity=0.272  Sum_probs=18.6

Q ss_pred             eEEEEECCCCEEEeCCCCeEEEEecCC
Q psy12933         31 LIECTLNPGDMLYIPPKVWHHVRSLST   57 (65)
Q Consensus        31 ~~~~~l~pGd~LyiP~~WwH~V~s~~~   57 (65)
                      .-+.++.+|++.+||+|=--..+++..
T Consensus        50 ~~~f~v~~G~~F~VP~gN~Y~i~N~~~   76 (85)
T PF11699_consen   50 ETSFVVTKGGSFQVPRGNYYSIKNIGN   76 (85)
T ss_dssp             TEEEEEETT-EEEE-TT-EEEEEE-SS
T ss_pred             CcEEEEeCCCEEEECCCCEEEEEECCC
Confidence            346678899999999998888888764


No 129
>PF11142 DUF2917:  Protein of unknown function (DUF2917);  InterPro: IPR021317  This bacterial family of proteins appears to be restricted to Proteobacteria. 
Probab=39.62  E-value=32  Score=18.82  Aligned_cols=18  Identities=22%  Similarity=0.187  Sum_probs=14.7

Q ss_pred             ceEEEEECCCCEEEeCCC
Q psy12933         30 HLIECTLNPGDMLYIPPK   47 (65)
Q Consensus        30 ~~~~~~l~pGd~LyiP~~   47 (65)
                      ..-+..|+|||.+-++++
T Consensus        34 ~~~D~~L~~G~~l~l~~g   51 (63)
T PF11142_consen   34 DPDDYWLQAGDSLRLRRG   51 (63)
T ss_pred             CCCCEEECCCCEEEeCCC
Confidence            456778999999988887


No 130
>PLN02658 homogentisate 1,2-dioxygenase
Probab=39.61  E-value=39  Score=25.57  Aligned_cols=23  Identities=22%  Similarity=0.403  Sum_probs=19.2

Q ss_pred             EEECCCCEEEeCCCCeEEEEecC
Q psy12933         34 CTLNPGDMLYIPPKVWHHVRSLS   56 (65)
Q Consensus        34 ~~l~pGd~LyiP~~WwH~V~s~~   56 (65)
                      ..++|||+++||.|-=++|+-.+
T Consensus       167 L~v~pgei~VIPRG~~frv~l~~  189 (435)
T PLN02658        167 LQVSPGEIVVIPRGFRFAVDLPD  189 (435)
T ss_pred             eEecCCCEEEecCccEEEEecCC
Confidence            45789999999999999998433


No 131
>COG1935 Uncharacterized conserved protein [Function unknown]
Probab=39.60  E-value=18  Score=22.90  Aligned_cols=12  Identities=25%  Similarity=0.800  Sum_probs=9.8

Q ss_pred             EECCCCEEEeCC
Q psy12933         35 TLNPGDMLYIPP   46 (65)
Q Consensus        35 ~l~pGd~LyiP~   46 (65)
                      -|+|||++|+-+
T Consensus        38 rl~~GDlVFlT~   49 (122)
T COG1935          38 RLHEGDLVFLTS   49 (122)
T ss_pred             cCCCCCEEEEeh
Confidence            478999999754


No 132
>smart00306 HintN Hint (Hedgehog/Intein) domain N-terminal region. Hedgehog/Intein domain, N-terminal region. Domain has been split to accommodate large insertions of endonucleases.
Probab=38.71  E-value=21  Score=19.70  Aligned_cols=10  Identities=50%  Similarity=1.198  Sum_probs=5.4

Q ss_pred             ECCCCEEEeC
Q psy12933         36 LNPGDMLYIP   45 (65)
Q Consensus        36 l~pGd~LyiP   45 (65)
                      |++||.|++|
T Consensus        90 l~~gd~v~~~   99 (100)
T smart00306       90 LKPGDYVLVP   99 (100)
T ss_pred             CCCCCEEEec
Confidence            4455555554


No 133
>PF14578 GTP_EFTU_D4:  Elongation factor Tu domain 4; PDB: 1G7R_A 1G7S_A 1G7T_A 1XE1_A.
Probab=38.65  E-value=18  Score=21.11  Aligned_cols=10  Identities=40%  Similarity=1.059  Sum_probs=6.5

Q ss_pred             EECCCCEEEe
Q psy12933         35 TLNPGDMLYI   44 (65)
Q Consensus        35 ~l~pGd~Lyi   44 (65)
                      .+++||+||+
T Consensus        72 ~i~eGDiLyV   81 (81)
T PF14578_consen   72 QIKEGDILYV   81 (81)
T ss_dssp             TB-TT-EEEE
T ss_pred             cCCCCCEEeC
Confidence            5789999986


No 134
>cd02785 MopB_CT_4 The MopB_CT_4 CD includes a group of related uncharacterized bacterial and archaeal molybdopterin-binding oxidoreductase-like domains with a putative N-terminal iron-sulfur [4Fe-4S] cluster binding site and molybdopterin cofactor binding site. This CD is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs.
Probab=38.16  E-value=20  Score=21.28  Aligned_cols=12  Identities=8%  Similarity=0.307  Sum_probs=10.5

Q ss_pred             CEEEeCCCCeEE
Q psy12933         40 DMLYIPPKVWHH   51 (65)
Q Consensus        40 d~LyiP~~WwH~   51 (65)
                      +++++|.|||..
T Consensus        74 g~v~~~~g~~~~   85 (124)
T cd02785          74 GVVTAEQGWWSR   85 (124)
T ss_pred             CEEEeecccCcc
Confidence            679999999975


No 135
>cd02793 MopB_CT_DMSOR-BSOR-TMAOR The MopB_DMSOR-BSOR-TMAOR CD contains dimethylsulfoxide reductase (DMSOR), biotin sulfoxide reductase (BSOR),  trimethylamine N-oxide reductase (TMAOR) and other related proteins. DMSOR always catalyzes the reduction of DMSO to dimethylsulfide, but its cellular location and oligomerization state are organism-dependent. For example, in Rhodobacter sphaeriodes and Rhodobacter capsulatus, it is an 82-kDa monomeric soluble protein found in the periplasmic space; in E. coli, it is membrane-bound and exists as a heterotrimer. BSOR catalyzes the reduction of biotin sulfixode to biotin, and is unique among Mo enzymes because no additional auxiliary proteins or cofactors are required. TMAOR is similar to DMSOR, but its only natural substrate is TMAO.This CD is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs.
Probab=36.70  E-value=24  Score=21.23  Aligned_cols=12  Identities=17%  Similarity=0.659  Sum_probs=10.3

Q ss_pred             CEEEeCCCCeEE
Q psy12933         40 DMLYIPPKVWHH   51 (65)
Q Consensus        40 d~LyiP~~WwH~   51 (65)
                      +++++|.|||..
T Consensus        75 g~v~~~~g~w~~   86 (129)
T cd02793          75 GVVQLPTGAWYD   86 (129)
T ss_pred             CEEEEccccCcC
Confidence            578999999974


No 136
>KOG3905|consensus
Probab=36.66  E-value=18  Score=27.13  Aligned_cols=11  Identities=27%  Similarity=0.531  Sum_probs=9.7

Q ss_pred             CCCEEEeCCCC
Q psy12933         38 PGDMLYIPPKV   48 (65)
Q Consensus        38 pGd~LyiP~~W   48 (65)
                      .-|.+|||+||
T Consensus       302 EkdaVfIPAGW  312 (473)
T KOG3905|consen  302 EKDAVFIPAGW  312 (473)
T ss_pred             ecceeEeccCC
Confidence            46899999998


No 137
>COG2164 Uncharacterized conserved protein [Function unknown]
Probab=36.19  E-value=17  Score=22.86  Aligned_cols=16  Identities=38%  Similarity=0.681  Sum_probs=13.3

Q ss_pred             EEECCCCEEEeCCCCe
Q psy12933         34 CTLNPGDMLYIPPKVW   49 (65)
Q Consensus        34 ~~l~pGd~LyiP~~Ww   49 (65)
                      -++++||+-|-|+|.-
T Consensus        63 dvve~GDv~YWpPGkA   78 (126)
T COG2164          63 DVVEPGDVSYWPPGKA   78 (126)
T ss_pred             cccCcccccccCCCcE
Confidence            4679999999999853


No 138
>TIGR00568 alkb DNA alkylation damage repair protein AlkB. Proteins in this family have an as of yet undetermined function in the repair of alkylation damage to DNA. Alignment and family designation based on phylogenomic analysis of Jonathan A. Eisen (PhD Thesis, Stanford University, 1999).
Probab=35.61  E-value=58  Score=21.14  Aligned_cols=20  Identities=15%  Similarity=0.187  Sum_probs=15.9

Q ss_pred             CCCceEEEEECCCCEEEeCC
Q psy12933         27 TPGHLIECTLNPGDMLYIPP   46 (65)
Q Consensus        27 ~~~~~~~~~l~pGd~LyiP~   46 (65)
                      +..+...+.|++||+|++-.
T Consensus       141 ~~~~~~~l~L~sGsllvM~G  160 (169)
T TIGR00568       141 RNDPPKRLRLHSGDVVIMGG  160 (169)
T ss_pred             CCCceEEEEeCCCCEEEECC
Confidence            34557899999999999853


No 139
>PF05118 Asp_Arg_Hydrox:  Aspartyl/Asparaginyl beta-hydroxylase;  InterPro: IPR007803 The alpha-ketoglutarate-dependent dioxygenase aspartyl (asparaginyl) beta-hydroxylase (1.14.11.16 from EC) specifically hydroxylates one aspartic or asparagine residue in certain epidermal growth factor-like domains of a number of proteins. Its action may be due to histidine-675, which, when mutated to an alanine residue, causes the loss of enzymatic activity in the protein [].  An invertebrate alpha-ketoglutarate-dependent aspartyl/asparaginyl beta-hydroxylase, which posttranslationally hydroxylates specific aspartyl or asparaginyl residues within epidermal growth factor-like modules [], activity was found to be similar to that of the purified mammalian aspartyl/asparaginyl beta-hydroxylase with respect to cofactor requirements, stereochemistry and substrate sequence specificity []. This enzyme requires Fe2+ as a cofactor. Some vitamin K-dependent coagulation factors, as well as synthetic peptides based on the structure of the first epidermal growth factor domain of human coagulation factor IX or X, can act as acceptors.; GO: 0018193 peptidyl-amino acid modification, 0030176 integral to endoplasmic reticulum membrane; PDB: 3RCQ_A 1E5S_A 1E5R_B.
Probab=35.04  E-value=1.1e+02  Score=19.31  Aligned_cols=27  Identities=11%  Similarity=-0.022  Sum_probs=19.6

Q ss_pred             eEEEEECCCCEEEeCCCCeEEEEecCC
Q psy12933         31 LIECTLNPGDMLYIPPKVWHHVRSLST   57 (65)
Q Consensus        31 ~~~~~l~pGd~LyiP~~WwH~V~s~~~   57 (65)
                      ......++|+++++=...-|.|.+.+.
T Consensus       123 ~~~~~w~~G~~~~fD~s~~H~~~N~~~  149 (163)
T PF05118_consen  123 GETRHWREGECWVFDDSFEHEVWNNGD  149 (163)
T ss_dssp             TEEEB--CTEEEEE-TTS-EEEEESSS
T ss_pred             CeEEEeccCcEEEEeCCEEEEEEeCCC
Confidence            345677999999999999999999875


No 140
>cd02791 MopB_CT_Nitrate-R-NapA-like Nitrate reductases, NapA (Nitrate-R-NapA), NasA, and NarB catalyze the reduction of nitrate to nitrite. Monomeric Nas is located in the cytoplasm and participates in nitrogen assimilation. Dimeric Nap is located in the periplasm and is coupled to quinol oxidation via a membrane-anchored tetraheme cytochrome. This CD (MopB_CT_Nitrate-R-Nap) is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs
Probab=34.84  E-value=47  Score=19.27  Aligned_cols=20  Identities=5%  Similarity=0.210  Sum_probs=14.7

Q ss_pred             EEEEECCC---CEEEeCCCCeEE
Q psy12933         32 IECTLNPG---DMLYIPPKVWHH   51 (65)
Q Consensus        32 ~~~~l~pG---d~LyiP~~WwH~   51 (65)
                      ..+.+.++   +++++|.+||+.
T Consensus        66 ~~v~~~~~i~~g~v~~~~g~~~~   88 (122)
T cd02791          66 LRVRVTDRVRPGEVFVPMHWGDQ   88 (122)
T ss_pred             EEEEECCCcCCCeEEEecccCcc
Confidence            45555554   789999999975


No 141
>cd00118 LysM Lysin domain, found in a variety of enzymes involved in bacterial cell wall degradation. This domain may have a general peptidoglycan binding function.
Probab=34.47  E-value=27  Score=15.55  Aligned_cols=12  Identities=42%  Similarity=0.551  Sum_probs=9.4

Q ss_pred             EEECCCCEEEeC
Q psy12933         34 CTLNPGDMLYIP   45 (65)
Q Consensus        34 ~~l~pGd~LyiP   45 (65)
                      ..+.+|+.|.+|
T Consensus        35 ~~~~~g~~l~ip   46 (46)
T cd00118          35 DNLQVGQKLKIP   46 (46)
T ss_pred             cccCCCCEEecC
Confidence            357889988887


No 142
>PF02814 UreE_N:  UreE urease accessory protein, N-terminal domain;  InterPro: IPR004029 Urease and other nickel metalloenzymes are synthesised as precursors devoid of the metalloenzyme active site. These precursors then undergo a complex post-translational maturation process that requires a number of accessory proteins. Members of this group are nickel-binding proteins required for urease metallocentre assembly []. They are believed to function as metallochaperones to deliver nickel to urease apoprotein [, ]. It has been shown by yeast two-hybrid analysis that UreE forms a dimeric complex with UreG in Helicobacter pylori []. The UreDFG-apoenzyme complex has also been shown to exist [, ] and is believed to be, with the addition of UreE, the assembly system for active urease []. The complexes, rather than the individual proteins, presumably bind to UreB via UreE/H recognition sites. The structure of Klebsiella aerogenes UreE reveals a unique two-domain architecture.The N-terminal domain is structurally related to a heat shock protein, while the C-terminal domain shows homology to the Atx1 copper metallochaperone [, ]. Significantly, the metal-binding sites in UreE and Atx1 are distinct in location and types of residues despite the relationship between these proteins and the mechanism for UreE activation of urease is proposed to be different from the thiol ligand exchange mechanism used by the copper metallochaperones. The N-terminal domain is termed the peptide-binding domain. Deletion of this domain does not eliminate enzymatic activity, and the truncated protein can still activate urease [].; GO: 0016151 nickel ion binding, 0006461 protein complex assembly, 0019627 urea metabolic process; PDB: 3NXZ_B 3TJA_B 3LA0_B 3TJ9_B 3NY0_A 3L9Z_A 3TJ8_A 1EAR_A 1EB0_A 1GMU_B ....
Probab=34.05  E-value=27  Score=18.95  Aligned_cols=12  Identities=42%  Similarity=0.794  Sum_probs=7.5

Q ss_pred             ECCCCEEEeCCC
Q psy12933         36 LNPGDMLYIPPK   47 (65)
Q Consensus        36 l~pGd~LyiP~~   47 (65)
                      |+.||+|+.-.+
T Consensus        51 L~~GDvL~~d~g   62 (65)
T PF02814_consen   51 LRDGDVLYLDDG   62 (65)
T ss_dssp             --TTEEEEECTS
T ss_pred             cCCCCEEEeCCC
Confidence            788899987544


No 143
>PF05708 DUF830:  Orthopoxvirus protein of unknown function (DUF830); PDB: 2IF6_B 3KW0_C.
Probab=33.92  E-value=24  Score=21.62  Aligned_cols=9  Identities=33%  Similarity=0.770  Sum_probs=5.2

Q ss_pred             ECCCCEEEe
Q psy12933         36 LNPGDMLYI   44 (65)
Q Consensus        36 l~pGd~Lyi   44 (65)
                      |++||+|+.
T Consensus         2 l~~GDIil~   10 (158)
T PF05708_consen    2 LQTGDIILT   10 (158)
T ss_dssp             --TT-EEEE
T ss_pred             CCCeeEEEE
Confidence            689999986


No 144
>cd00250 CAS_like Clavaminic acid synthetase (CAS) -like;  CAS is a trifunctional Fe(II)/ 2-oxoglutarate (2OG) oxygenase carrying out three reactions in the biosynthesis of clavulanic acid, an inhibitor of class A serine beta-lactamases. In general, Fe(II)-2OG oxygenases catalyze a hydroxylation reaction, which leads to the incorporation of an oxygen atom from dioxygen into a hydroxyl group and conversion of 2OG to succinate and CO2
Probab=33.66  E-value=42  Score=22.37  Aligned_cols=15  Identities=33%  Similarity=0.740  Sum_probs=13.1

Q ss_pred             ceEEEEECCCCEEEe
Q psy12933         30 HLIECTLNPGDMLYI   44 (65)
Q Consensus        30 ~~~~~~l~pGd~Lyi   44 (65)
                      ..+...++|||+|++
T Consensus       218 ~~~~~~l~~Gdivi~  232 (262)
T cd00250         218 NQLTVKLEPGDLLIF  232 (262)
T ss_pred             hEEEEEcCCCCEEEE
Confidence            577899999999986


No 145
>PRK00794 flbT flagellar biosynthesis repressor FlbT; Reviewed
Probab=33.65  E-value=42  Score=21.23  Aligned_cols=17  Identities=24%  Similarity=0.561  Sum_probs=14.4

Q ss_pred             eEEEEECCCCEEEeCCC
Q psy12933         31 LIECTLNPGDMLYIPPK   47 (65)
Q Consensus        31 ~~~~~l~pGd~LyiP~~   47 (65)
                      ++...|+|||-++|-..
T Consensus         3 ~Lkl~LKp~ERi~INGA   19 (132)
T PRK00794          3 GLKLSLKPGERIFINGA   19 (132)
T ss_pred             CeeEEecCCCEEEEcCe
Confidence            57899999999998754


No 146
>PF07228 SpoIIE:  Stage II sporulation protein E (SpoIIE);  InterPro: IPR001932 This domain is found in protein phosphatase 2C, as well as other proteins eg. pyruvate dehydrogenase (lipoamide)-phosphatase (3.1.3.43 from EC), adenylate cyclase (4.6.1.1 from EC) and some bacterial stage II sporulation E proteins (3.1.3.16 from EC).  Protein phosphatase 2C (PP2C) is one of the four major classes of mammalian serine/threonine specific protein phosphatases (3.1.3.16 from EC). PP2C [] is a monomeric enzyme of about 42 Kd which shows broad substrate specificity and is dependent on divalent cations (mainly manganese and magnesium) for its activity. Its exact physiological role is still unclear. Three isozymes are currently known in mammals: PP2C-alpha, -beta and -gamma. In yeast, there are at least four PP2C homologs: phosphatase PTC1 [], which has weak tyrosine phosphatase activity in addition to its activity on serines, phosphatases PTC2 and PTC3, and hypothetical protein YBR125c. Isozymes of PP2C are also known from Arabidopsis thaliana (ABI1, PPH1), Caenorhabditis elegans (FEM-2, F42G9.1, T23F11.1), Leishmania chagasi and Paramecium tetraurelia. In A. thaliana, the kinase associated protein phosphatase (KAPP) [] is an enzyme that dephosphorylates the Ser/Thr receptor-like kinase RLK5 and which contains a C-terminal PP2C domain. PP2C does not seem to be evolutionary related to the main family of serine/ threonine phosphatases: PP1, PP2A and PP2B. However, it is significantly similar to the catalytic subunit of pyruvate dehydrogenase phosphatase 3.1.3.43 from EC (PDPC) [], which catalyzes dephosphorylation and concomitant reactivation of the alpha subunit of the E1 component of the pyruvate dehydrogenase complex. PDPC is a mitochondrial enzyme and, like PP2C, is magnesium-dependent.; GO: 0003824 catalytic activity; PDB: 3KE6_B 3ZT9_A 3RNR_A 3EQ2_A 3F7A_B 3F79_A 3ES2_B 3PU9_B 3T91_B 3T9Q_B ....
Probab=33.18  E-value=37  Score=20.99  Aligned_cols=17  Identities=35%  Similarity=0.456  Sum_probs=12.4

Q ss_pred             CCceEEEEECCCCEEEe
Q psy12933         28 PGHLIECTLNPGDMLYI   44 (65)
Q Consensus        28 ~~~~~~~~l~pGd~Lyi   44 (65)
                      ..+..++.+++||.|++
T Consensus       113 ~~~~~~~~l~~gd~l~l  129 (193)
T PF07228_consen  113 DYQEQEIQLEPGDRLLL  129 (193)
T ss_dssp             CEEEEEEE--TTEEEEE
T ss_pred             cccceEEEeccccEEEE
Confidence            45678999999999986


No 147
>cd00508 MopB_CT_Fdh-Nap-like This CD includes formate dehydrogenases (Fdh) H and N; nitrate reductases, Nap and Nas; and other related proteins. Formate dehydrogenase H is a component of the anaerobic formate hydrogen lyase complex  and catalyzes the reversible oxidation of formate to CO2 with the release of a proton and two electrons. Formate dehydrogenase N (alpha subunit) is the major electron donor to the bacterial nitrate respiratory chain and nitrate reductases, Nap and Nas, catalyze the reduction of nitrate to nitrite. This CD (MopB_CT_Fdh-Nap-like) is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs.
Probab=33.14  E-value=52  Score=18.88  Aligned_cols=12  Identities=8%  Similarity=0.160  Sum_probs=10.5

Q ss_pred             CEEEeCCCCeEE
Q psy12933         40 DMLYIPPKVWHH   51 (65)
Q Consensus        40 d~LyiP~~WwH~   51 (65)
                      +++++|.+|||.
T Consensus        77 g~v~~~~~~~~~   88 (120)
T cd00508          77 GTVFMPFHWGGE   88 (120)
T ss_pred             CEEEEecccCCc
Confidence            689999999975


No 148
>PF05783 DLIC:  Dynein light intermediate chain (DLIC);  InterPro: IPR022780  This entry consists of several eukaryotic dynein light intermediate chain proteins. The light intermediate chains (LICs) of cytoplasmic dynein consist of multiple isoforms, which undergo post-translational modification to produce a large number of species. DLIC1 is known to be involved in assembly, organisation, and function of centrosomes and mitotic spindles when bound to pericentrin [, ]. DLIC2 is a subunit of cytoplasmic dynein 2 that may play a role in maintaining Golgi organisation by binding cytoplasmic dynein 2 to its Golgi-associated cargo []. 
Probab=32.39  E-value=19  Score=27.16  Aligned_cols=14  Identities=21%  Similarity=0.434  Sum_probs=11.2

Q ss_pred             EECCCCEEEeCCCC
Q psy12933         35 TLNPGDMLYIPPKV   48 (65)
Q Consensus        35 ~l~pGd~LyiP~~W   48 (65)
                      .+-.-|.+|||+||
T Consensus       273 ~vv~~d~ifIP~Gw  286 (472)
T PF05783_consen  273 QVVERDAIFIPAGW  286 (472)
T ss_pred             eeecccccccCCCC
Confidence            34556899999998


No 149
>PRK10178 D-alanyl-D-alanine dipeptidase; Provisional
Probab=32.23  E-value=16  Score=24.37  Aligned_cols=13  Identities=23%  Similarity=0.473  Sum_probs=10.1

Q ss_pred             EEEeCCCCeEEEE
Q psy12933         41 MLYIPPKVWHHVR   53 (65)
Q Consensus        41 ~LyiP~~WwH~V~   53 (65)
                      ..-+|..|||.--
T Consensus       156 F~~~~~EWWHf~~  168 (184)
T PRK10178        156 FVHIASEWWHYEL  168 (184)
T ss_pred             CCCCCCcccCcCC
Confidence            4558999999854


No 150
>PRK10838 spr outer membrane lipoprotein; Provisional
Probab=31.63  E-value=31  Score=22.96  Aligned_cols=14  Identities=21%  Similarity=0.472  Sum_probs=11.0

Q ss_pred             EEECCCCEEEeCCC
Q psy12933         34 CTLNPGDMLYIPPK   47 (65)
Q Consensus        34 ~~l~pGd~LyiP~~   47 (65)
                      -.++|||++|+..+
T Consensus       127 ~~lqpGDLVfF~~~  140 (190)
T PRK10838        127 SKLRTGDLVLFRAG  140 (190)
T ss_pred             CCCCCCcEEEECCC
Confidence            45899999998743


No 151
>PRK06933 type III secretion system protein; Validated
Probab=31.09  E-value=32  Score=24.64  Aligned_cols=14  Identities=36%  Similarity=0.600  Sum_probs=11.6

Q ss_pred             EECCCCEEEeCCCC
Q psy12933         35 TLNPGDMLYIPPKV   48 (65)
Q Consensus        35 ~l~pGd~LyiP~~W   48 (65)
                      .|++||+|.+|+.+
T Consensus       175 ~L~~GDvLli~~~~  188 (308)
T PRK06933        175 SLELGDVLLAPEGS  188 (308)
T ss_pred             ccCCCCEEEecccc
Confidence            46899999998764


No 152
>COG2173 DdpX D-alanyl-D-alanine dipeptidase [Cell envelope biogenesis, outer membrane]
Probab=30.77  E-value=16  Score=25.11  Aligned_cols=24  Identities=17%  Similarity=0.429  Sum_probs=19.3

Q ss_pred             eEEEEECCCCEEEeCCCCeEEEEe
Q psy12933         31 LIECTLNPGDMLYIPPKVWHHVRS   54 (65)
Q Consensus        31 ~~~~~l~pGd~LyiP~~WwH~V~s   54 (65)
                      -+...++.|-..-.|..|||.-..
T Consensus       175 lL~~~M~a~GF~~~~~EWWHF~~~  198 (211)
T COG2173         175 LLNAAMEAGGFTVYSGEWWHFDLP  198 (211)
T ss_pred             HHHHHHHhCCCccCCccceecccc
Confidence            345677889999999999998654


No 153
>cd00174 SH3 Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
Probab=30.33  E-value=25  Score=16.86  Aligned_cols=23  Identities=26%  Similarity=0.439  Sum_probs=14.0

Q ss_pred             EEEEECCCCEEEeC----CCCeEEEEec
Q psy12933         32 IECTLNPGDMLYIP----PKVWHHVRSL   55 (65)
Q Consensus        32 ~~~~l~pGd~LyiP----~~WwH~V~s~   55 (65)
                      -+..+.+||++.+-    .+|| .++..
T Consensus        14 ~~l~~~~Gd~v~v~~~~~~~w~-~~~~~   40 (54)
T cd00174          14 DELSFKKGDIIEVLEKSDDGWW-EGRLL   40 (54)
T ss_pred             CCCCCCCCCEEEEEEcCCCCeE-EEEEC
Confidence            46677788877753    5555 44433


No 154
>PF02668 TauD:  Taurine catabolism dioxygenase TauD, TfdA family;  InterPro: IPR003819 This family consists of TauD/TfdA taurine catabolism dioxygenases. The Escherichia coli tauD gene is required for the utilization of taurine (2-aminoethanesulphonic acid) as a sulphur source and is expressed only under conditions of sulphate starvation. TauD is an alpha-ketoglutarate-dependent dioxygenase catalyzing the oxygenolytic release of sulphite from taurine []. The 2,4-dichlorophenoxyacetic acid/alpha-ketoglutarate dioxygenase from Burkholderia sp. (strain RASC) also belongs to this family []. TfdA from Ralstonia eutropha (Alcaligenes eutrophus) is a 2,4-D monooxygenase [].; GO: 0016491 oxidoreductase activity, 0055114 oxidation-reduction process; PDB: 3SWT_B 3R1J_A 1GVG_A 1DRT_A 1DS1_A 1DS0_A 1DRY_A 3V15_A 3PVJ_D 3V17_A ....
Probab=29.85  E-value=46  Score=21.45  Aligned_cols=25  Identities=24%  Similarity=0.284  Sum_probs=15.7

Q ss_pred             CceEEEEECCCCEEEeCCC-CeEEEE
Q psy12933         29 GHLIECTLNPGDMLYIPPK-VWHHVR   53 (65)
Q Consensus        29 ~~~~~~~l~pGd~LyiP~~-WwH~V~   53 (65)
                      ...+...+++||+|++=-. ..|.-+
T Consensus       218 ~~~~~~~~~~GDlli~dN~~~lHgR~  243 (258)
T PF02668_consen  218 EYTYRHRWQPGDLLIWDNHRVLHGRT  243 (258)
T ss_dssp             GGEEEEE--TTEEEEEETTTEEEEE-
T ss_pred             hhcccccCCCceEEEEcCCeeEecCC
Confidence            5688999999999998543 334333


No 155
>cd02777 MopB_CT_DMSOR-like The MopB_CT_DMSOR-like CD contains dimethylsulfoxide reductase (DMSOR), biotin sulfoxide reductase (BSOR),  trimethylamine N-oxide reductase (TMAOR) and other related proteins. DMSOR always catalyzes the reduction of DMSO to dimethylsulfide, but its cellular location and oligomerization state are organism-dependent. For example, in Rhodobacter sphaeriodes and Rhodobacter capsulatus, it is an 82-kDa monomeric soluble protein found in the periplasmic space; in E. coli, it is membrane-bound and exists as a heterotrimer. BSOR catalyzes the reduction of biotin sulfixode to biotin, and is unique among Mo enzymes because no additional auxiliary proteins or cofactors are required. TMAOR is similar to DMSOR, but its only natural substrate is TMAO. Also included in this group is the pyrogallol-phloroglucinol transhydroxylase from Pelobacter acidigallici. This CD is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB hom
Probab=29.46  E-value=34  Score=20.33  Aligned_cols=12  Identities=17%  Similarity=0.628  Sum_probs=9.7

Q ss_pred             CEEEeCCCCeEE
Q psy12933         40 DMLYIPPKVWHH   51 (65)
Q Consensus        40 d~LyiP~~WwH~   51 (65)
                      +++++|.|||..
T Consensus        76 g~v~~~~g~~~~   87 (127)
T cd02777          76 GVVALPEGAWYD   87 (127)
T ss_pred             CEEEeCcccccC
Confidence            568999999953


No 156
>PRK12791 flbT flagellar biosynthesis repressor FlbT; Reviewed
Probab=29.41  E-value=53  Score=20.81  Aligned_cols=17  Identities=18%  Similarity=0.257  Sum_probs=14.0

Q ss_pred             eEEEEECCCCEEEeCCC
Q psy12933         31 LIECTLNPGDMLYIPPK   47 (65)
Q Consensus        31 ~~~~~l~pGd~LyiP~~   47 (65)
                      ++.++|+|||-++|-..
T Consensus         2 ~Lkl~LKP~ERi~INGa   18 (131)
T PRK12791          2 PLRVELKPFERIVIGQS   18 (131)
T ss_pred             CeeEEeCCCCEEEEcCE
Confidence            46789999999998654


No 157
>PRK13488 chemoreceptor glutamine deamidase CheD; Provisional
Probab=29.24  E-value=1.6e+02  Score=18.93  Aligned_cols=30  Identities=7%  Similarity=0.207  Sum_probs=20.3

Q ss_pred             EEEECCCCEEEeCCCCeEEEEe--cCCeEEEEEe
Q psy12933         33 ECTLNPGDMLYIPPKVWHHVRS--LSTSISVSFW   64 (65)
Q Consensus        33 ~~~l~pGd~LyiP~~WwH~V~s--~~~sisvn~w   64 (65)
                      ++.|.+||.-+--..  ..+.+  |+.|++|.+|
T Consensus         3 ~v~v~~Ge~~v~~~p--~~i~T~~LGSCVav~l~   34 (157)
T PRK13488          3 VIKVGIGEYAVGKSP--EVIETSGLGSCVGIILY   34 (157)
T ss_pred             cEEEecCcEEEECCC--CEEEEEcCcCeEEEEEE
Confidence            467899996655543  23433  5679999887


No 158
>PRK13494 chemoreceptor glutamine deamidase CheD; Provisional
Probab=29.03  E-value=1.5e+02  Score=19.32  Aligned_cols=32  Identities=13%  Similarity=0.395  Sum_probs=20.9

Q ss_pred             ceEEEEECCCCEEEeCCCCeEEEEe-cCCeEEEEEe
Q psy12933         30 HLIECTLNPGDMLYIPPKVWHHVRS-LSTSISVSFW   64 (65)
Q Consensus        30 ~~~~~~l~pGd~LyiP~~WwH~V~s-~~~sisvn~w   64 (65)
                      ....+.|.||| +++...  ..+.. ++.|++|.+|
T Consensus        10 ~~~~~~l~~Ge-~~v~~~--~~i~T~LGSCVaV~l~   42 (163)
T PRK13494         10 KRDVTIIVPGE-AFVSND--RVISTILGSCVAVVLY   42 (163)
T ss_pred             CccEEEEecCe-EEEeCC--cEEEEeCcccEEEEEE
Confidence            34556799999 555554  33332 5678999887


No 159
>PF07378 FlbT:  Flagellar protein FlbT;  InterPro: IPR009967 This family consists of several FlbT proteins. FlbT is a post-transcriptional repressor function in flagellum biogenesis. FlbT is associated with the 5' untranslated region (UTR) of fljK (25 kDa flagellin) mRNA and that this association requires a predicted loop structure in the transcript. Mutations within this loop abolish FlbT association and result in increased mRNA stability. It is therefore thought that FlbT promotes the degradation of flagellin mRNA by associating with the 5' UTR [].; GO: 0048027 mRNA 5'-UTR binding, 0006402 mRNA catabolic process, 0045718 negative regulation of flagellum assembly
Probab=28.74  E-value=44  Score=20.95  Aligned_cols=15  Identities=27%  Similarity=0.543  Sum_probs=12.6

Q ss_pred             EEEECCCCEEEeCCC
Q psy12933         33 ECTLNPGDMLYIPPK   47 (65)
Q Consensus        33 ~~~l~pGd~LyiP~~   47 (65)
                      ..+|+|||-++|-..
T Consensus         2 kl~LkpgERi~INGA   16 (126)
T PF07378_consen    2 KLSLKPGERIIINGA   16 (126)
T ss_pred             eeeecCCCEEEEcCe
Confidence            578999999998764


No 160
>PF03171 2OG-FeII_Oxy:  2OG-Fe(II) oxygenase superfamily Entry for Lysyl hydrolases This Prosite entry is a sub-family of the Pfam entry;  InterPro: IPR005123 This domain is found in members of the 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily [], as well as the C-terminal of prolyl 4-hydroxylase alpha subunit. The holoenzyme has the activity (1.14.11.2 from EC) catalysing the reaction:   Procollagen L-proline + 2-oxoglutarate + O2 = procollagen trans-4-hydroxy-L-proline + succinate + CO2.   The full enzyme consists of a alpha2 beta2 complex with the alpha subunit contributing most of the parts of the active site []. The family also includes lysyl hydrolases, isopenicillin synthases and AlkB. ; GO: 0016491 oxidoreductase activity, 0016706 oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen, 2-oxoglutarate as one donor, and incorporation of one atom each of oxygen into both donors, 0055114 oxidation-reduction process; PDB: 3ON7_D 1BK0_A 1IPS_B 1QIQ_A 1OC1_A 2Y86_A 2Y60_A 1W03_A 2VE1_A 1QJF_A ....
Probab=28.50  E-value=96  Score=17.22  Aligned_cols=33  Identities=27%  Similarity=0.462  Sum_probs=16.7

Q ss_pred             EEEECCCCEEE-eC----CCCeEEEEecC--CeEEEEEeC
Q psy12933         33 ECTLNPGDMLY-IP----PKVWHHVRSLS--TSISVSFWF   65 (65)
Q Consensus        33 ~~~l~pGd~Ly-iP----~~WwH~V~s~~--~sisvn~w~   65 (65)
                      .+++..||+|- +.    ..+.|+|....  .-+|+.+|+
T Consensus        57 ~~~v~~G~~l~~~t~g~~~~~~HrV~~~~~~~R~s~~~f~   96 (98)
T PF03171_consen   57 GFIVNFGDALEILTNGRYPATLHRVVPPTEGERYSLTFFL   96 (98)
T ss_dssp             CEEEEEBHHHHHHTTTSS----EEEE--STS-EEEEEEEE
T ss_pred             eeeeeceeeeecccCCccCCceeeeEcCCCCCEEEEEEEE
Confidence            45566666442 11    23689999885  578888874


No 161
>smart00326 SH3 Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
Probab=28.21  E-value=26  Score=16.92  Aligned_cols=22  Identities=18%  Similarity=0.398  Sum_probs=14.3

Q ss_pred             ceEEEEECCCCEEEe----CCCCeEE
Q psy12933         30 HLIECTLNPGDMLYI----PPKVWHH   51 (65)
Q Consensus        30 ~~~~~~l~pGd~Lyi----P~~WwH~   51 (65)
                      ..-+..+++||.+.+    ..+||.-
T Consensus        15 ~~~~l~~~~Gd~v~v~~~~~~~w~~~   40 (58)
T smart00326       15 DPDELSFKKGDIITVLEKSDDGWWKG   40 (58)
T ss_pred             CCCCCCCCCCCEEEEEEcCCCCeEEE
Confidence            344566778887765    5777764


No 162
>PF00122 E1-E2_ATPase:  E1-E2 ATPase p-type cation-transporting ATPase superfamily signature H+-transporting ATPase (proton pump) signature sodium/potassium-transporting ATPase signature;  InterPro: IPR008250 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (e.g., F-, V- and A-ATPases, which contain rotary motors) and in the type of ions they transport [, ]. The different types include:   F-ATPases (F1F0-ATPases), which are found in mitochondria, chloroplasts and bacterial plasma membranes where they are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases), which are primarily found in eukaryotic vacuoles and catalyse ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases), which are found in Archaea and function like F-ATPases (though with respect to their structure and some inhibitor responses, A-ATPases are more closely related to the V-ATPases). P-ATPases (E1E2-ATPases), which are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases, which are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP.   P-ATPases (sometime known as E1-E2 ATPases) (3.6.3.- from EC) are found in bacteria and in a number of eukaryotic plasma membranes and organelles []. P-ATPases function to transport a variety of different compounds, including ions and phospholipids, across a membrane using ATP hydrolysis for energy. There are many different classes of P-ATPases, each of which transports a specific type of ion: H+, Na+, K+, Mg2+, Ca2+, Ag+ and Ag2+, Zn2+, Co2+, Pb2+, Ni2+, Cd2+, Cu+ and Cu2+. P-ATPases can be composed of one or two polypeptides, and can usually assume two main conformations called E1 and E2. This entry represents the actuator (A) domain, and some transmembrane helices found in P-type ATPases []. It contains the TGES-loop which is essential for the metal ion binding which results in tight association between the A and P (phosphorylation) domains []. It does not contain the phosphorylation site. It is thought that the large movement of the actuator domain, which is transmitted to the transmembrane helices, is essential to the long distance coupling between formation/decomposition of the acyl phosphate in the cytoplasmic P-domain and the changes in the ion-binding sites buried deep in the membranous region []. This domain has a modulatory effect on the phosphoenzyme processing steps through its nucleotide binding [],[].  P-type (or E1-E2-type) ATPases that form an aspartyl phosphate intermediate in the course of ATP hydrolysis, can be divided into 4 major groups []: (1) Ca2+-transporting ATPases; (2) Na+/K+- and gastric H+/K+-transporting ATPases; (3) plasma membrane H+-transporting ATPases (proton pumps) of plants, fungi and lower eukaryotes; and (4) all bacterial P-type ATPases, except the g2+-ATPase of Salmonella typhimurium, which is more similar to the eukaryotic sequences. However, great variety of sequence analysis methods results in diversity of classification. More information about this protein can be found at Protein of the Month: ATP Synthases [].; GO: 0000166 nucleotide binding, 0046872 metal ion binding; PDB: 2XZB_A 1MHS_B 3TLM_A 3A3Y_A 2ZXE_A 3NAL_A 3NAM_A 3NAN_A 2YJ6_B 2IYE_A ....
Probab=28.12  E-value=27  Score=22.47  Aligned_cols=13  Identities=31%  Similarity=0.682  Sum_probs=9.7

Q ss_pred             EECCCCEEEeCCC
Q psy12933         35 TLNPGDMLYIPPK   47 (65)
Q Consensus        35 ~l~pGd~LyiP~~   47 (65)
                      .|.|||++.+.+|
T Consensus        51 ~L~~GDiI~l~~g   63 (230)
T PF00122_consen   51 ELVPGDIIILKAG   63 (230)
T ss_dssp             GT-TTSEEEEETT
T ss_pred             hccceeeeecccc
Confidence            4789999988776


No 163
>COG2255 RuvB Holliday junction resolvasome, helicase subunit [DNA replication, recombination, and repair]
Probab=28.02  E-value=31  Score=25.21  Aligned_cols=14  Identities=43%  Similarity=0.736  Sum_probs=10.8

Q ss_pred             EEEEECCCCEEEeC
Q psy12933         32 IECTLNPGDMLYIP   45 (65)
Q Consensus        32 ~~~~l~pGd~LyiP   45 (65)
                      +-..|++||+|||=
T Consensus        97 iLt~Le~~DVLFID  110 (332)
T COG2255          97 ILTNLEEGDVLFID  110 (332)
T ss_pred             HHhcCCcCCeEEEe
Confidence            34568999999973


No 164
>PF05726 Pirin_C:  Pirin C-terminal cupin domain;  InterPro: IPR008778 This entry represents C-terminal domain of Pirin proteins from both eukaryotes and prokaryotes. The function of Pirin is unknown but the gene coding for this protein is known to be expressed in all tissues in the human body although it is expressed most strongly in the liver and heart. Pirin is known to be a nuclear protein, exclusively localised within the nucleoplasma and predominantly concentrated within dot-like subnuclear structures []. Pirin is composed of two structurally similar domains arranged face to face. The N-terminal domain additionally features four beta-strands, and the C-terminal domain also includes four additional -strands and a short alpha-helix. Although the two domains are similar, the C-terminal domain of Pirin differs from the N-terminal domain as it does not contain a metal binding site and its sequence does not contain the conserved metal-coordinating residues [].  Pirin is confirmed to be a member of the cupin superfamily on the basis of primary sequence and structural similarity. The presence of a metal binding site in the N-terminal beta-barrel of Pirin, may be significant in its role in regulating NFI DNA replication and NF-kappaB transcription factor activity []. Pirin structure has been found to closely resemble members of the cupin superfamily. Pirin contains the two characteristic sequences of the cupin superfamily, namely PG-(X)5-HXH-(X)4-E-(X)6-G and G-(X)5-PXG-(X)2-H-(X)3-N separated by a variable stretch of 15-50 amino acids. These motifs are best conserved in the N-terminal where the conserved histidine and glutamic acid residues correspond to the metal-coordinating residues. The C-terminal domain motifs lack the metal binding residues normally associated with the cupin fold [].  Pirin was identified to be a metal-binding protein [], and was found that the metal-binding residues of Pirins are highly conserved across mammals, plants, fungi, and prokaryotic organisms. Pirin acts as a cofactor for the transcription factor NFI, the regulatory mechanism of which is generally believed to require the assistance of a metal ion []. Structural data supports the hypothesis that the bound iron of Pirin may participate in this transcriptional regulation by enhancing and stabilising the formation of the p50,Bcl3,DNA complex []. Metals have been implicated directly or indirectly in the NF-kappaB family of transcription factors that control expression of a number of early response genes associated with inflammatory responses, cell growth, cell cycle progression, and neoplastic transformation []. However, most metal-dependent transcription factors are DNA-binding proteins that bind to specific sequences when the metal binds to the protein. Pirin, on the other hand, appears to function differently and bind to the transcription factor DNA complex [].; PDB: 1J1L_A 3ACL_A 2P17_A.
Probab=28.02  E-value=1.2e+02  Score=17.59  Aligned_cols=30  Identities=27%  Similarity=0.498  Sum_probs=19.2

Q ss_pred             EEEEECCCCEEE--eCCCCeEEEEecCCeEEE
Q psy12933         32 IECTLNPGDMLY--IPPKVWHHVRSLSTSISV   61 (65)
Q Consensus        32 ~~~~l~pGd~Ly--iP~~WwH~V~s~~~sisv   61 (65)
                      +++.|+||-.+-  +|++|-+.+..++.++.|
T Consensus         2 ~di~l~~g~~~~~~~~~~~~~~iyv~~G~~~v   33 (104)
T PF05726_consen    2 LDIKLEPGASFTLPLPPGHNAFIYVLEGSVEV   33 (104)
T ss_dssp             EEEEE-TT-EEEEEEETT-EEEEEEEESEEEE
T ss_pred             EEEEECCCCEEEeecCCCCEEEEEEEECcEEE
Confidence            578888887766  458887777777666554


No 165
>PRK10202 ebgC cryptic beta-D-galactosidase subunit beta; Reviewed
Probab=27.66  E-value=64  Score=20.41  Aligned_cols=22  Identities=5%  Similarity=-0.035  Sum_probs=19.5

Q ss_pred             EEEEECCCCEEEeCCCCeEEEE
Q psy12933         32 IECTLNPGDMLYIPPKVWHHVR   53 (65)
Q Consensus        32 ~~~~l~pGd~LyiP~~WwH~V~   53 (65)
                      ..++|+||+.+.+-|+=.|+..
T Consensus       106 ~~v~l~~G~F~iffP~daH~P~  127 (149)
T PRK10202        106 ETVEVHEGQIVICDIHEAYRFI  127 (149)
T ss_pred             cEEEeCCCeEEEECCcccccCC
Confidence            3899999999999999899875


No 166
>PF04322 DUF473:  Protein of unknown function (DUF473);  InterPro: IPR007417 This is a family of uncharacterised archaeal proteins.
Probab=27.63  E-value=37  Score=21.29  Aligned_cols=12  Identities=33%  Similarity=0.974  Sum_probs=9.3

Q ss_pred             EECCCCEEEeCC
Q psy12933         35 TLNPGDMLYIPP   46 (65)
Q Consensus        35 ~l~pGd~LyiP~   46 (65)
                      .++|||++|+=.
T Consensus        38 ~~~~Gd~VFlT~   49 (119)
T PF04322_consen   38 SLDPGDRVFLTS   49 (119)
T ss_pred             cCCCCCEEEEec
Confidence            458999999743


No 167
>COG0490 Putative regulatory, ligand-binding protein related to C-terminal domains of K+ channels [Inorganic ion transport and metabolism]
Probab=27.52  E-value=33  Score=22.66  Aligned_cols=12  Identities=33%  Similarity=0.520  Sum_probs=10.2

Q ss_pred             EEEECCCCEEEe
Q psy12933         33 ECTLNPGDMLYI   44 (65)
Q Consensus        33 ~~~l~pGd~Lyi   44 (65)
                      +.+|++||.||+
T Consensus       132 y~vle~gDtlvv  143 (162)
T COG0490         132 YTVLEAGDTLVV  143 (162)
T ss_pred             hhhhcCCCEEEE
Confidence            378999999986


No 168
>PRK13495 chemoreceptor glutamine deamidase CheD; Provisional
Probab=27.18  E-value=1.8e+02  Score=18.81  Aligned_cols=31  Identities=10%  Similarity=0.227  Sum_probs=20.6

Q ss_pred             EEEEECCCCEEEeCCCCeEEEEe--cCCeEEEEEe
Q psy12933         32 IECTLNPGDMLYIPPKVWHHVRS--LSTSISVSFW   64 (65)
Q Consensus        32 ~~~~l~pGd~LyiP~~WwH~V~s--~~~sisvn~w   64 (65)
                      +.++|.+||.-+--..  ..+.+  |+.|++|.+|
T Consensus         2 ~~v~v~~Ge~~v~~~p--~~l~T~~LGSCVaV~l~   34 (159)
T PRK13495          2 MKKVIGIGEYAVMKNP--GVIVTLGLGSCVGVCMR   34 (159)
T ss_pred             ccEEEecCcEEEECCC--CEEEEecCcCeEEEEEE
Confidence            4678999996554332  23443  6789999887


No 169
>COG3508 HmgA Homogentisate 1,2-dioxygenase [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=26.91  E-value=1e+02  Score=23.27  Aligned_cols=25  Identities=20%  Similarity=0.294  Sum_probs=20.8

Q ss_pred             EEEECCCCEEEeCCCCeEEEEecCC
Q psy12933         33 ECTLNPGDMLYIPPKVWHHVRSLST   57 (65)
Q Consensus        33 ~~~l~pGd~LyiP~~WwH~V~s~~~   57 (65)
                      +..++|||++.||.|-=-+|+-.+.
T Consensus       165 ~l~v~pgeiavIPRG~~frve~~~~  189 (427)
T COG3508         165 VLEVEPGEIAVIPRGTTFRVELKDG  189 (427)
T ss_pred             eEEecCCcEEEeeCCceEEEEecCC
Confidence            4678999999999998888877653


No 170
>cd02779 MopB_CT_Arsenite-Ox This CD contains the molybdopterin_binding C-terminal (MopB_CT) region of Arsenite oxidase (Arsenite-Ox) and related proteins. Arsenite oxidase oxidizes arsenite to the less toxic arsenate; it transfers the electrons obtained from the oxidation of arsenite towards the soluble periplasmic electron carriers cytochrome c and/or amicyanin.
Probab=26.81  E-value=41  Score=19.74  Aligned_cols=11  Identities=0%  Similarity=-0.182  Sum_probs=9.4

Q ss_pred             CEEEeCCCCeE
Q psy12933         40 DMLYIPPKVWH   50 (65)
Q Consensus        40 d~LyiP~~WwH   50 (65)
                      +++++|.|||.
T Consensus        75 g~v~~p~g~~~   85 (115)
T cd02779          75 GQTFMLMAHPR   85 (115)
T ss_pred             CeEEEEccCCc
Confidence            67999999994


No 171
>PF00018 SH3_1:  SH3 domain;  InterPro: IPR001452 SH3 (src Homology-3) domains are small protein modules containing approximately 50 amino acid residues [, ]. They are found in a great variety of intracellular or membrane-associated proteins [, , ] for example, in a variety of proteins with enzymatic activity, in adaptor proteins that lack catalytic sequences and in cytoskeletal proteins, such as fodrin and yeast actin binding protein ABP-1. The SH3 domain has a characteristic fold which consists of five or six beta-strands arranged as two tightly packed anti-parallel beta sheets. The linker regions may contain short helices []. The surface of the SH3-domain bears a flat, hydrophobic ligand-binding pocket which consists of three shallow grooves defined by conservative aromatic residues in which the ligand adopts an extended left-handed helical arrangement. The ligand binds with low affinity but this may be enhanced by multiple interactions. The region bound by the SH3 domain is in all cases proline-rich and contains PXXP as a core-conserved binding motif. The function of the SH3 domain is not well understood but they may mediate many diverse processes such as increasing local concentration of proteins, altering their subcellular location and mediating the assembly of large multiprotein complexes []. The crystal structure of the SH3 domain of the cytoskeletal protein spectrin, and the solution structures of SH3 domains of phospholipase C (PLC-y) and phosphatidylinositol 3-kinase p85 alpha-subunit, have been determined [, , ]. In spite of relatively limited sequence similarity, their overall structures are similar. The domains belong to the alpha+beta structural class, with 5 to 8 beta-strands forming 2 tightly-packed, anti-parallel beta-sheets arranged in a barrel-like structure, and intervening loops sometimes forming helices. Conserved aliphatic and aromatic residues form a hydrophobic core (A11, L23, A29, V34, W42, L52 and V59 in PLC-y []) and a hydrophobic pocket on the molecular surface (L12, F13, W53 and P55 in PLC-y). The conserved core is believed to stabilise the fold, while the pocket is thought to serve as a binding site for target proteins. Conserved carboxylic amino acids located in the loops, on the periphery of the pocket (D14 and E22), may be involved in protein-protein interactions via proline-rich regions. The N- and C-termini are packed in close proximity, indicating that they are independent structural modules.; GO: 0005515 protein binding; PDB: 1UHF_A 1W1F_A 1WA7_A 1SEM_A 1KFZ_A 2SEM_B 1K76_A 3SEM_B 1X2Q_A 2J06_B ....
Probab=26.51  E-value=14  Score=18.56  Aligned_cols=29  Identities=21%  Similarity=0.345  Sum_probs=16.7

Q ss_pred             CCCCceEEEEECCCCEEEe----CCCCeEEEEec
Q psy12933         26 QTPGHLIECTLNPGDMLYI----PPKVWHHVRSL   55 (65)
Q Consensus        26 ~~~~~~~~~~l~pGd~Lyi----P~~WwH~V~s~   55 (65)
                      |.....-+..++.||.|+|    ..+||. ++..
T Consensus         6 f~~~~~~eLs~~~Gd~i~v~~~~~~~Ww~-~~~~   38 (48)
T PF00018_consen    6 FDAEDPDELSFKKGDIIEVLEKSDDGWWK-VRNE   38 (48)
T ss_dssp             BETSSTTBSEB-TTEEEEEEEESSSSEEE-EEET
T ss_pred             eCCCCCCEEeEECCCEEEEEEecCCCEEE-EEEC
Confidence            3344455677889988874    345554 4444


No 172
>PRK14125 cell division suppressor protein YneA; Provisional
Probab=26.26  E-value=40  Score=20.24  Aligned_cols=13  Identities=38%  Similarity=0.618  Sum_probs=10.4

Q ss_pred             EECCCCEEEeCCC
Q psy12933         35 TLNPGDMLYIPPK   47 (65)
Q Consensus        35 ~l~pGd~LyiP~~   47 (65)
                      .+.||+.|.||..
T Consensus        79 ~I~~Gq~L~IP~~   91 (103)
T PRK14125         79 HIKAGDKLVIPVL   91 (103)
T ss_pred             cCCCCCEEEEecC
Confidence            3889999999853


No 173
>cd02780 MopB_CT_Tetrathionate_Arsenate-R This CD contains the molybdopterin_binding C-terminal (MopB_CT) region of tetrathionate reductase, subunit A, (TtrA); respiratory arsenate As(V) reductase, catalytic subunit (ArrA); and other related proteins.
Probab=25.93  E-value=43  Score=20.39  Aligned_cols=12  Identities=17%  Similarity=0.077  Sum_probs=10.4

Q ss_pred             CEEEeCCCCeEE
Q psy12933         40 DMLYIPPKVWHH   51 (65)
Q Consensus        40 d~LyiP~~WwH~   51 (65)
                      +++++|.+|+|.
T Consensus        72 g~V~~p~g~g~~   83 (143)
T cd02780          72 GVVAIEHGYGHW   83 (143)
T ss_pred             CEEEEecccCcc
Confidence            689999999775


No 174
>cd02789 MopB_CT_FmdC-FwdD The MopB_FmdC-FwdD CD includes the  C-terminus of subunit C of molybdenum formylmethanofuran dehydrogenase (FmdC) and subunit D of tungsten formylmethanofuran dehydrogenase (FwdD), and other related proteins. Formylmethanofuran dehydrogenase catalyzes the first step in methane formation from CO2 in methanogenic archaea and some eubacteria. Members of this CD belong to the molybdopterin_binding superfamily of proteins. This CD is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs.
Probab=25.46  E-value=42  Score=19.69  Aligned_cols=10  Identities=40%  Similarity=1.059  Sum_probs=9.0

Q ss_pred             CEEEeCCCCe
Q psy12933         40 DMLYIPPKVW   49 (65)
Q Consensus        40 d~LyiP~~Ww   49 (65)
                      +++++|.+||
T Consensus        73 g~v~~~~g~~   82 (106)
T cd02789          73 GMVFIPMGPW   82 (106)
T ss_pred             CEEEEecccc
Confidence            6789999998


No 175
>TIGR02410 carnitine_TMLD trimethyllysine dioxygenase. Members of this family with known function act as trimethyllysine dioxygenase, an enzyme in the pathway for carnitine biosynthesis from lysine. This enzyme is homologous to gamma-butyrobetaine,2-oxoglutarate dioxygenase, which catalyzes the last step in carnitine biosynthesis. Members of this family appear to be eukaryotic only.
Probab=25.44  E-value=68  Score=22.86  Aligned_cols=15  Identities=40%  Similarity=0.660  Sum_probs=12.9

Q ss_pred             ceEEEEECCCCEEEe
Q psy12933         30 HLIECTLNPGDMLYI   44 (65)
Q Consensus        30 ~~~~~~l~pGd~Lyi   44 (65)
                      -.++..|+|||++++
T Consensus       311 ~~~~~~l~pGd~vi~  325 (362)
T TIGR02410       311 NEIEFKLRPGTVLIF  325 (362)
T ss_pred             cEEEEEcCCccEEEE
Confidence            478899999999985


No 176
>PF10162 G8:  G8 domain;  InterPro: IPR019316  This entry represents a domain found in disease proteins PKHD1 and KIAA1199 and is named G8 after its 8 conserved glycines. It is predicted to contain 10 beta strands and an alpha helix []. 
Probab=25.10  E-value=64  Score=19.60  Aligned_cols=19  Identities=21%  Similarity=0.192  Sum_probs=13.9

Q ss_pred             CCCCEEEeCCCCeEEEEec
Q psy12933         37 NPGDMLYIPPKVWHHVRSL   55 (65)
Q Consensus        37 ~pGd~LyiP~~WwH~V~s~   55 (65)
                      .+||.++||++++=-+...
T Consensus        10 ~~g~~V~I~~g~~v~lD~~   28 (125)
T PF10162_consen   10 GAGDNVVIPAGQTVLLDVS   28 (125)
T ss_pred             CCCCEEEECCCCEEEEcCC
Confidence            4789999999976554443


No 177
>PF07591 PT-HINT:  Pretoxin HINT domain;  InterPro: IPR011451 This entry represents a cluster of homologous proteins identified in Leptospira interrogans. One member (Q8EZX6 from SWISSPROT) has been predicted to be a phenazine biosynthesis family protein.; PDB: 2JNQ_A 2JMZ_A.
Probab=24.99  E-value=72  Score=19.60  Aligned_cols=22  Identities=32%  Similarity=0.507  Sum_probs=14.5

Q ss_pred             EECCCCEEEeCCCCeEEEEecC
Q psy12933         35 TLNPGDMLYIPPKVWHHVRSLS   56 (65)
Q Consensus        35 ~l~pGd~LyiP~~WwH~V~s~~   56 (65)
                      .|++||.|.-+.|-+-.+.+..
T Consensus        76 ~L~~GD~L~~~~G~~~~v~~i~   97 (130)
T PF07591_consen   76 DLKVGDRLLTADGSWVTVTSIR   97 (130)
T ss_dssp             G--TTSEEEEE-SSEEEEE---
T ss_pred             hCCCCCEEEcCCCCEEEEEEEE
Confidence            5899999999999998887754


No 178
>cd02790 MopB_CT_Formate-Dh_H Formate dehydrogenase H (Formate-Dh-H) catalyzes the reversible oxidation of formate to CO2 with the release of a proton and two electrons. It is a component of the anaerobic formate hydrogen lyase complex. The E. coli formate dehydrogenase H (Fdh-H) is a monomer composed of a single polypeptide chain with a  Mo active site region and a [4Fe-4S] center. This CD (MopB_CT_Formate-Dh_H) is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs.
Probab=24.67  E-value=46  Score=19.07  Aligned_cols=12  Identities=8%  Similarity=0.324  Sum_probs=10.3

Q ss_pred             CEEEeCCCCeEE
Q psy12933         40 DMLYIPPKVWHH   51 (65)
Q Consensus        40 d~LyiP~~WwH~   51 (65)
                      +++++|.+||+.
T Consensus        77 g~v~~~~g~~~~   88 (116)
T cd02790          77 GVVFMPFHFAEA   88 (116)
T ss_pred             CEEEEecCcCCC
Confidence            679999999964


No 179
>KOG0572|consensus
Probab=24.48  E-value=62  Score=24.67  Aligned_cols=25  Identities=28%  Similarity=0.517  Sum_probs=19.6

Q ss_pred             CceEEEEECCCCEEEeCCCCeEEEEecC
Q psy12933         29 GHLIECTLNPGDMLYIPPKVWHHVRSLS   56 (65)
Q Consensus        29 ~~~~~~~l~pGd~LyiP~~WwH~V~s~~   56 (65)
                      .-.++..++|||++.|+.   |.++++.
T Consensus       217 ~a~y~Rev~PGEiV~i~r---~g~~s~~  241 (474)
T KOG0572|consen  217 GARYEREVRPGEIVEISR---NGVKSVD  241 (474)
T ss_pred             ccEEEEeecCceEEEEec---CCceeee
Confidence            356788999999999998   6666553


No 180
>PF02938 GAD:  GAD domain;  InterPro: IPR004115 This entry represetns an 2 layer alpha/beta insertion domain found in some glutamyl-tRNA amidotransferases and aspartyl tRNA synthetases [, ]. The function of this domain is not yet known.; GO: 0004812 aminoacyl-tRNA ligase activity, 0005524 ATP binding, 0005737 cytoplasm; PDB: 1ZQ1_D 1EQR_B 1IL2_B 1C0A_A 1L0W_A 1G51_B 1EFW_B 2D6F_D.
Probab=24.45  E-value=39  Score=19.55  Aligned_cols=17  Identities=24%  Similarity=0.321  Sum_probs=10.2

Q ss_pred             EEECCCCEEEeCCCCeE
Q psy12933         34 CTLNPGDMLYIPPKVWH   50 (65)
Q Consensus        34 ~~l~pGd~LyiP~~WwH   50 (65)
                      +-+++||+|++=++=|+
T Consensus        73 ~~a~~GD~ll~~Ag~~~   89 (95)
T PF02938_consen   73 LGAKPGDLLLFVAGKKE   89 (95)
T ss_dssp             TT--TTEEEEEEEESHH
T ss_pred             hCCCCCCEEEEECCCHH
Confidence            34788999887665443


No 181
>PF00856 SET:  SET domain;  InterPro: IPR001214 The SET domain appears generally as one part of a larger multidomain protein, and recently there were described three structures of very different proteins with distinct domain compositions: Neurospora crassa DIM-5, a member of the Su(var) family of HKMTs which methylate histone H3 on lysine 9,human SET7 (also called SET9), which methylates H3 on lysine 4 and garden pea Rubisco LSMT, an enzyme that does not modify histones, but instead methylates lysine 14 in the flexible tail of the large subunit of the enzyme Rubisco. The SET domain itself turned out to be an uncommon structure. Although in all three studies, electron density maps revealed the location of the AdoMet or AdoHcy cofactor, the SET domain bears no similarity at all to the canonical/AdoMet-dependent methyltransferase fold. Strictly conserved in the C-terminal motif of the SET domain tyrosine could be involved in abstracting a proton from the protonated amino group of the substrate lysine, promoting its nucleophilic attack on the sulphonium methyl group of the AdoMet cofactor. In contrast to the AdoMet-dependent protein methyltranferases of the classical type, which tend to bind their polypeptide substrates on top of the cofactor, it is noted from the Rubisco LSMT structure that the AdoMet seems to bind in a separate cleft, suggesting how a polypeptide substrate could be subjected to multiple rounds of methylation without having to be released from the enzyme. In contrast, SET7/9 is able to add only a single methyl group to its substrate. It has been demonstrated that association of SET domain and myotubularin-related proteins modulates growth control []. The SET domain-containing Drosophila melanogaster (Fruit fly) protein, enhancer of zeste, has a function in segment determination and the mammalian homologue may be involved in the regulation of gene transcription and chromatin structure. Histone lysine methylation is part of the histone code that regulated chromatin function and epigenetic control of gene function. Histone lysine methyltransferases (HMTase) differ both in their substrate specificity for the various acceptor lysines as well as in their product specificity for the number of methyl groups (one, two, or three) they transfer. With just one exception [], the HMTases belong to SET family that can be classified according to the sequences surrounding the SET domain [, ]. Structural studies on the human SET7/9, a mono-methylase, have revealed the molecular basis for the specificity of the enzyme for the histone-target and the roles of the invariant residues in the SET domain in determining the methylation specificities [].  The pre-SET domain, as found in the SUV39 SET family, contains nine invariant cysteine residues that are grouped into two segments separated by a region of variable length. These 9 cysteines coordinate 3 zinc ions to form to form a triangular cluster, where each of the zinc ions is coordinated by 4 four cysteines to give a tetrahedral configuration. The function of this domain is structural, holding together 2 long segments of random coils. The C-terminal region including the post-SET domain is disordered when not interacting with a histone tail and in the absence of zinc. The three conserved cysteines in the post-SET domain form a zinc-binding site when coupled to a fourth conserved cysteine in the knot-like structure close to the SET domain active site []. The structured post-SET region brings in the C-terminal residues that participate in S-adenosylmethine-binding and histone tail interactions. The three conserved cysteine residues are essential for HMTase activity, as replacement with serine abolishes HMTase activity [], []. ; GO: 0005515 protein binding; PDB: 3TG5_A 3S7F_A 3RIB_B 3TG4_A 3S7J_A 3S7D_A 3S7B_A 3H6L_A 3SMT_A 3K5K_A ....
Probab=24.18  E-value=34  Score=19.72  Aligned_cols=13  Identities=23%  Similarity=0.710  Sum_probs=8.5

Q ss_pred             EECCCCEEEeCCC
Q psy12933         35 TLNPGDMLYIPPK   47 (65)
Q Consensus        35 ~l~pGd~LyiP~~   47 (65)
                      .+++||+|++|..
T Consensus         9 dI~~Ge~I~~p~~   21 (162)
T PF00856_consen    9 DIKAGEVILIPRP   21 (162)
T ss_dssp             -B-TTEEEEEESE
T ss_pred             cCCCCCEEEEECc
Confidence            5788998887764


No 182
>PRK11198 LysM domain/BON superfamily protein; Provisional
Probab=24.10  E-value=48  Score=20.81  Aligned_cols=11  Identities=45%  Similarity=0.951  Sum_probs=7.7

Q ss_pred             ECCCCEEEeCC
Q psy12933         36 LNPGDMLYIPP   46 (65)
Q Consensus        36 l~pGd~LyiP~   46 (65)
                      +.||+.|.||+
T Consensus       137 I~pGq~L~IP~  147 (147)
T PRK11198        137 IYPGQVLRIPE  147 (147)
T ss_pred             cCcCCEEecCC
Confidence            66777777774


No 183
>PF14525 AraC_binding_2:  AraC-binding-like domain
Probab=23.94  E-value=1.7e+02  Score=17.32  Aligned_cols=27  Identities=22%  Similarity=0.255  Sum_probs=22.3

Q ss_pred             eEEEEECCCCEEEeCCCCeEEEEecCC
Q psy12933         31 LIECTLNPGDMLYIPPKVWHHVRSLST   57 (65)
Q Consensus        31 ~~~~~l~pGd~LyiP~~WwH~V~s~~~   57 (65)
                      .-++.+.|||+++++++=-...+..+.
T Consensus        72 g~~~~~~pg~~~l~d~~~~~~~~~~~~   98 (172)
T PF14525_consen   72 GREVELAPGDVVLLDPGQPYRLEFSAG   98 (172)
T ss_pred             CEEEEEcCCeEEEEcCCCCEEEEECCC
Confidence            568899999999999998877776653


No 184
>PF03737 Methyltransf_6:  Demethylmenaquinone methyltransferase;  InterPro: IPR005493 This entry represents a structural motif found in demethylmenaquinone methyltransferases and in the regulator of ribonuclease E activity A (RraA). These proteins contain a swivelling 3-layer beta/beta/alpha domain that appears to be mobile in most multi-domain proteins known to contain it. These proteins are structurally similar, and may have distant homology, to the phosphohistidine domain of pyruvate phosphate dikinase. The RraA fold is an ancient platform that has been adapted for a wide range of functions. RraA had been identified as a putative demethylmenaquinone methyltransferase and was annotated as MenG, but further analysis showed that RraA lacked the structural motifs usually required for methylases.  The Escherichia coli protein regulator RraA acts as a trans-acting modulator of RNA turnover, binding essential endonuclease RNase E and inhibiting RNA processing []. RNase E forms the core of a large RNA-catalysis machine termed the degradosomes. RraA (and RraB) causes remodelling of degradosome composition, which is associated with alterations in RNA decay and global transcript abundance and as such is a bacterial mechanism for the regulation of RNA cleavage. Demethylmenaquinone methyltransferases convert dimethylmenaquinone (DMK) to menaquinone (MK) in the final step of menaquinone biosynthesis.  This region is also found at the C terminus of the DlpA protein Q48806 from SWISSPROT.; PDB: 3C8O_A 2PCN_A 3K4I_B 2YJV_A 1Q5X_B 2C5Q_D 2YJT_B 1NXJ_B 3NOJ_A 1J3L_A ....
Probab=23.68  E-value=64  Score=20.37  Aligned_cols=10  Identities=50%  Similarity=1.039  Sum_probs=6.6

Q ss_pred             EEECCCCEEE
Q psy12933         34 CTLNPGDMLY   43 (65)
Q Consensus        34 ~~l~pGd~Ly   43 (65)
                      ++++|||++|
T Consensus       137 v~V~pGD~I~  146 (154)
T PF03737_consen  137 VTVNPGDIIV  146 (154)
T ss_dssp             EEE-TTSEEE
T ss_pred             EEECCCCEEE
Confidence            4678888776


No 185
>PRK13491 chemoreceptor glutamine deamidase CheD; Provisional
Probab=23.57  E-value=2.4e+02  Score=19.08  Aligned_cols=35  Identities=11%  Similarity=0.177  Sum_probs=23.7

Q ss_pred             CCceEEEEECCCCEEEeCCCCeEEE--EecCCeEEEEEe
Q psy12933         28 PGHLIECTLNPGDMLYIPPKVWHHV--RSLSTSISVSFW   64 (65)
Q Consensus        28 ~~~~~~~~l~pGd~LyiP~~WwH~V--~s~~~sisvn~w   64 (65)
                      +.+..++.|.+||+.+--..  ..+  ..|+.|++|-+|
T Consensus        10 ~~~~~~v~V~~Ge~~vs~~p--~~litT~LGSCVaV~Ly   46 (199)
T PRK13491         10 ASQISITHVTQGSCVASDSP--NEVYATILGSCICTCMC   46 (199)
T ss_pred             cceEEEEEEecCcEEEECCC--CEEEEEeccccEEEEEE
Confidence            45566888999996665442  132  336789999887


No 186
>KOG2414|consensus
Probab=23.48  E-value=57  Score=25.00  Aligned_cols=19  Identities=32%  Similarity=0.517  Sum_probs=16.1

Q ss_pred             CCceEEEEECCCCEEEeCCCC
Q psy12933         28 PGHLIECTLNPGDMLYIPPKV   48 (65)
Q Consensus        28 ~~~~~~~~l~pGd~LyiP~~W   48 (65)
                      -.+.+.+++|||  ||||..|
T Consensus       424 L~pg~ViTIEPG--vYIP~d~  442 (488)
T KOG2414|consen  424 LQPGMVITIEPG--VYIPEDD  442 (488)
T ss_pred             CCCCceEEecCc--eecCccC
Confidence            356788999999  8999987


No 187
>PF08857 ParBc_2:  Putative ParB-like nuclease;  InterPro: IPR014956 These proteins are probably distantly related to IPR003115 from INTERPRO. Suggesting these, uncharacterised proteins have a nuclease function. ; PDB: 2HWJ_E.
Probab=23.42  E-value=1e+02  Score=20.14  Aligned_cols=25  Identities=20%  Similarity=0.409  Sum_probs=14.3

Q ss_pred             CCCCCCCCceEEEEECCCCEEEeCC
Q psy12933         22 APHNQTPGHLIECTLNPGDMLYIPP   46 (65)
Q Consensus        22 ~~p~~~~~~~~~~~l~pGd~LyiP~   46 (65)
                      .+-.+-+.+++.++++||.-+|+=-
T Consensus        36 ~~~~~l~~~~vpvViGP~g~lyl~D   60 (163)
T PF08857_consen   36 KFDDYLEKHPVPVVIGPGGQLYLTD   60 (163)
T ss_dssp             -----TT-BEEEEEE-STT-EEE-S
T ss_pred             HHHHHHhcCCCCEEECCCCCeEEEC
Confidence            4455778899999999999999743


No 188
>PF11649 T4_neck-protein:  Virus neck protein;  InterPro: IPR021674  This family of protein is represented by bacteriophage T4 Gp14, a major component of the neck along with Gp13. When added to completed heads, Gp13 and Gp14 structural proteins activate the heads for tail joining in vitro complementation mixtures. Gp14 is rich is beta-sheets. The formation of the neck to the head of the bacteriophage is crucial for the tail attachment []. 
Probab=23.37  E-value=49  Score=23.00  Aligned_cols=11  Identities=36%  Similarity=0.948  Sum_probs=9.5

Q ss_pred             CCCCEEEeCCC
Q psy12933         37 NPGDMLYIPPK   47 (65)
Q Consensus        37 ~pGd~LyiP~~   47 (65)
                      +.||+||+|.+
T Consensus       115 ~eGDLIYfP~~  125 (230)
T PF11649_consen  115 KEGDLIYFPMT  125 (230)
T ss_pred             CcCCEEEEEcC
Confidence            67999999975


No 189
>cd02784 MopB_CT_PHLH The MopB_CT_PHLH CD includes a group of related uncharacterized putative hydrogenase-like homologs (PHLH) of molybdopterin binding proteins. This CD is of the PHLH region homologous to the conserved molybdopterin-binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs.
Probab=23.31  E-value=43  Score=20.93  Aligned_cols=14  Identities=0%  Similarity=-0.097  Sum_probs=12.0

Q ss_pred             CEEEeCCCCeEEEE
Q psy12933         40 DMLYIPPKVWHHVR   53 (65)
Q Consensus        40 d~LyiP~~WwH~V~   53 (65)
                      +++++|.||.|...
T Consensus        80 gvV~i~~G~G~~~~   93 (137)
T cd02784          80 GVVLLALGYGRTHA   93 (137)
T ss_pred             CEEEEecccCcccC
Confidence            78999999999644


No 190
>PF01426 BAH:  BAH domain;  InterPro: IPR001025 The BAH (bromo-adjacent homology) family contains proteins such as eukaryotic DNA (cytosine-5) methyltransferases IPR001525 from INTERPRO, the origin recognition complex 1 (Orc1) proteins, as well as several proteins involved in transcriptional regulation. The BAH domain appears to act as a protein-protein interaction module specialised in gene silencing, as suggested for example by its interaction within yeast Orc1p with the silent information regulator Sir1p. The BAH module might therefore play an important role by linking DNA methylation, replication and transcriptional regulation [].; GO: 0003677 DNA binding; PDB: 4DA4_A 3PT6_B 3AV6_A 3AV5_A 3AV4_A 3PT9_A 3SWR_A 3PTA_A 1M4Z_A 1ZBX_A ....
Probab=22.92  E-value=64  Score=18.49  Aligned_cols=15  Identities=33%  Similarity=0.705  Sum_probs=11.8

Q ss_pred             EEECCCCEEEeCCCC
Q psy12933         34 CTLNPGDMLYIPPKV   48 (65)
Q Consensus        34 ~~l~pGd~LyiP~~W   48 (65)
                      ++++.||++|+-+.-
T Consensus         1 ~~~~vGD~V~v~~~~   15 (119)
T PF01426_consen    1 VTYKVGDFVYVKPDD   15 (119)
T ss_dssp             EEEETTSEEEEECTS
T ss_pred             CEEeCCCEEEEeCCC
Confidence            467889999987765


No 191
>PRK11479 hypothetical protein; Provisional
Probab=22.49  E-value=93  Score=22.05  Aligned_cols=13  Identities=38%  Similarity=0.797  Sum_probs=10.4

Q ss_pred             EEEECCCCEEEeC
Q psy12933         33 ECTLNPGDMLYIP   45 (65)
Q Consensus        33 ~~~l~pGd~LyiP   45 (65)
                      .-.|+|||+||.-
T Consensus        62 ~~~LqpGDLVFfs   74 (274)
T PRK11479         62 APDLKPGDLLFSS   74 (274)
T ss_pred             hhhCCCCCEEEEe
Confidence            3578999999964


No 192
>cd02775 MopB_CT Molybdopterin-Binding, C-terminal (MopB_CT) domain of the MopB superfamily of proteins, a  large, diverse, heterogeneous superfamily of enzymes that, in general, bind molybdopterin as a cofactor. The MopB domain is found in a wide variety of molybdenum- and tungsten-containing enzymes, including formate dehydrogenase-H (Fdh-H) and -N (Fdh-N), several forms of nitrate reductase (Nap, Nas, NarG), dimethylsulfoxide reductase (DMSOR), thiosulfate reductase, formylmethanofuran dehydrogenase, and arsenite oxidase. Molybdenum is present in most of these enzymes in the form of molybdopterin, a modified pterin ring with a dithiolene side chain, which is responsible for ligating the Mo. In many bacterial and archaeal species, molybdopterin is in the form of a dinucleotide, with two molybdopterin dinucleotide units per molybdenum. These proteins can function as monomers, heterodimers, or heterotrimers, depending on the protein and organism. Also included in the MopB superfamily is
Probab=22.28  E-value=1.3e+02  Score=16.54  Aligned_cols=12  Identities=17%  Similarity=0.468  Sum_probs=10.6

Q ss_pred             CEEEeCCCCeEE
Q psy12933         40 DMLYIPPKVWHH   51 (65)
Q Consensus        40 d~LyiP~~WwH~   51 (65)
                      +++++|.+|||.
T Consensus        65 g~v~~~~~~~~~   76 (101)
T cd02775          65 GVVFLPHGWGHR   76 (101)
T ss_pred             CEEEeeCccCcc
Confidence            589999999985


No 193
>PF07915 PRKCSH:  Glucosidase II beta subunit-like protein;  InterPro: IPR012913 The sequences found in this family are similar to a region found in the beta-subunit of glucosidase II (P14314 from SWISSPROT), which is also known as protein kinase C substrate 80K-H (PRKCSH). The enzyme catalyses the sequential removal of two alpha-1,3-linked glucose residues in the second step of N-linked oligosaccharide processing []. The beta subunit is required for the solubility and stability of the heterodimeric enzyme, and is involved in retaining the enzyme within the endoplasmic reticulum []. Mutations in the gene coding for PRKCSH have been found to be involved in the development of autosomal dominant polycystic liver disease (ADPLD), but the precise role the protein has in the pathogenesis of this disease is unknown []. ; PDB: 3AIH_A.
Probab=22.12  E-value=50  Score=18.08  Aligned_cols=16  Identities=25%  Similarity=0.395  Sum_probs=10.7

Q ss_pred             CCEEEeCCCCeEEEEe
Q psy12933         39 GDMLYIPPKVWHHVRS   54 (65)
Q Consensus        39 Gd~LyiP~~WwH~V~s   54 (65)
                      |.|++...|||..--.
T Consensus         1 ~~C~~~~~g~WtYe~C   16 (81)
T PF07915_consen    1 GQCLYRSEGWWTYEFC   16 (81)
T ss_dssp             --EEEEEETTEEEEEE
T ss_pred             CcceecCCCEEEEEEC
Confidence            6788888999975433


No 194
>KOG1633|consensus
Probab=22.00  E-value=48  Score=26.79  Aligned_cols=31  Identities=19%  Similarity=0.142  Sum_probs=25.7

Q ss_pred             CCCCCceEEEEECCCCEEEeCCCCeEEEEec
Q psy12933         25 NQTPGHLIECTLNPGDMLYIPPKVWHHVRSL   55 (65)
Q Consensus        25 ~~~~~~~~~~~l~pGd~LyiP~~WwH~V~s~   55 (65)
                      ...........+++|+.+.+|++|-|++..-
T Consensus       541 ~~~~~~~~~~~~~~g~~~~~P~~~i~~~~~p  571 (776)
T KOG1633|consen  541 DSANCDDKRDVLKEGETGLIPAGPIHAVLTP  571 (776)
T ss_pred             ccccCCCccccccCCCcccCCCCcccccccc
Confidence            3344567788999999999999999999865


No 195
>TIGR00022 uncharacterized protein, YhcH/YjgK/YiaL family. This family consists of conserved hypothetical proteins, about 150 amino acids in length. Members with limited information include YhcH, a possible sugar isomerase of sialic acid catabolism, and YjgK.
Probab=21.75  E-value=1.4e+02  Score=18.48  Aligned_cols=26  Identities=19%  Similarity=0.119  Sum_probs=21.9

Q ss_pred             CCceEEEEECCCCEEEeCCCCeEEEE
Q psy12933         28 PGHLIECTLNPGDMLYIPPKVWHHVR   53 (65)
Q Consensus        28 ~~~~~~~~l~pGd~LyiP~~WwH~V~   53 (65)
                      ......+.|.||+.+.+-|+=.|+..
T Consensus       109 ~~~~~~i~l~~G~faiffP~D~H~P~  134 (142)
T TIGR00022       109 IDDEQTVILKPGMFAVFYPGEPHKPG  134 (142)
T ss_pred             CCCceEEEeCCCcEEEECCCCccccc
Confidence            35678999999999998888899864


No 196
>PF02563 Poly_export:  Polysaccharide biosynthesis/export protein;  InterPro: IPR003715 The extracellular polysaccharide colanic acid (CA) is produced by species of the family Enterobacteriaceae. In Escherichia coli (strain K12) the CA cluster comprises 19 genes. The wzx gene encodes a protein with multiple transmembrane segments that may function in export of the CA repeat unit from the cytoplasm into the periplasm in a process analogous to O-unit export. The CA gene clusters may be involved in the export of polysaccharide from the cell [].; GO: 0015159 polysaccharide transmembrane transporter activity, 0015774 polysaccharide transport, 0016020 membrane; PDB: 2W8I_E 2W8H_E 2J58_D.
Probab=21.71  E-value=55  Score=18.29  Aligned_cols=13  Identities=54%  Similarity=0.805  Sum_probs=4.8

Q ss_pred             EEEEECCCCEEEe
Q psy12933         32 IECTLNPGDMLYI   44 (65)
Q Consensus        32 ~~~~l~pGd~Lyi   44 (65)
                      -+-.|.|||.|-|
T Consensus         9 ~~y~l~pGD~l~i   21 (82)
T PF02563_consen    9 PEYRLGPGDVLRI   21 (82)
T ss_dssp             ------TT-EEEE
T ss_pred             CCCEECCCCEEEE
Confidence            3457889998875


No 197
>cd04721 BAH_plant_1 BAH, or Bromo Adjacent Homology domain, plant-specific sub-family with unknown function. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.
Probab=21.45  E-value=74  Score=19.70  Aligned_cols=11  Identities=27%  Similarity=0.667  Sum_probs=9.5

Q ss_pred             EEECCCCEEEe
Q psy12933         34 CTLNPGDMLYI   44 (65)
Q Consensus        34 ~~l~pGd~Lyi   44 (65)
                      .++++||++||
T Consensus         6 ~~i~vGD~V~v   16 (130)
T cd04721           6 VTISVHDFVYV   16 (130)
T ss_pred             EEEECCCEEEE
Confidence            45899999999


No 198
>PF13533 Biotin_lipoyl_2:  Biotin-lipoyl like
Probab=21.44  E-value=85  Score=15.99  Aligned_cols=8  Identities=38%  Similarity=0.787  Sum_probs=5.0

Q ss_pred             ECCCCEEE
Q psy12933         36 LNPGDMLY   43 (65)
Q Consensus        36 l~pGd~Ly   43 (65)
                      ++.||.|+
T Consensus        24 VkkGd~L~   31 (50)
T PF13533_consen   24 VKKGDVLL   31 (50)
T ss_pred             EcCCCEEE
Confidence            45667665


No 199
>PF06525 SoxE:  Sulfocyanin (SoxE);  InterPro: IPR010532 Members of this family are blue-copper redox proteins designated sulfocyanin, from the archaeal genera Sulfolobus, Ferroplasma, and Picrophilus. The most closely related proteins characterised as functionally different are the rusticyanins.
Probab=21.41  E-value=50  Score=22.40  Aligned_cols=16  Identities=13%  Similarity=0.264  Sum_probs=12.0

Q ss_pred             EEEeCCCCeEEEEecC
Q psy12933         41 MLYIPPKVWHHVRSLS   56 (65)
Q Consensus        41 ~LyiP~~WwH~V~s~~   56 (65)
                      .||+|+||==+++-..
T Consensus        87 ~i~VPAGw~V~i~f~N  102 (196)
T PF06525_consen   87 TIYVPAGWNVQITFTN  102 (196)
T ss_pred             EEEEcCCCEEEEEEEc
Confidence            6999999976665443


No 200
>COG4340 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=21.34  E-value=97  Score=21.40  Aligned_cols=30  Identities=20%  Similarity=0.374  Sum_probs=23.2

Q ss_pred             CCCCceEEEEECCCCEEEeC-CCCeEEEEec
Q psy12933         26 QTPGHLIECTLNPGDMLYIP-PKVWHHVRSL   55 (65)
Q Consensus        26 ~~~~~~~~~~l~pGd~LyiP-~~WwH~V~s~   55 (65)
                      -..++...-+|+|||++++- ..-||.+...
T Consensus       169 ~~~~p~f~kvl~pGe~~~l~Dh~~~H~~tpi  199 (226)
T COG4340         169 DGASPGFFKVLAPGEAVFLDDHRVLHGVTPI  199 (226)
T ss_pred             CCCCcceEEeccCCcEEEeccchhcccccce
Confidence            45677888999999999985 4678877654


No 201
>PF04074 DUF386:  Domain of unknown function (DUF386);  InterPro: IPR004375 This family consists of conserved hypothetical proteins, about 150 amino acids in length, with no known function. The family is restricted to the bacteria. It includes three members in Escherichia coli (strain K12) and three in Streptococcus pneumoniae.; PDB: 1S4C_B 1JOP_B.
Probab=21.24  E-value=1.5e+02  Score=18.46  Aligned_cols=25  Identities=28%  Similarity=0.264  Sum_probs=16.9

Q ss_pred             CceEEEEECCCCEEEeCCCCeEEEE
Q psy12933         29 GHLIECTLNPGDMLYIPPKVWHHVR   53 (65)
Q Consensus        29 ~~~~~~~l~pGd~LyiP~~WwH~V~   53 (65)
                      .....+.|+||+.+.+=|+=.|+..
T Consensus       110 ~~~~~i~l~~g~f~iffP~d~H~p~  134 (153)
T PF04074_consen  110 KNESFITLKPGDFAIFFPEDAHRPG  134 (153)
T ss_dssp             TTEEEEEE-TTEEEEE-TT--EEEE
T ss_pred             CCceEEEEcCCEEEEECCCcccccc
Confidence            4456788999999998888899965


No 202
>TIGR02998 RraA_entero regulator of ribonuclease activity A. THIS PROTEIN IS _NOT_ MenG, AKA S-adenosylmethionine: 2-demethylmenaquinone methyltransferase (EC 2.1.-.-). See the references characterizing this as a case of transitive annotation error .
Probab=21.10  E-value=79  Score=20.55  Aligned_cols=11  Identities=36%  Similarity=0.709  Sum_probs=8.4

Q ss_pred             EEEECCCCEEE
Q psy12933         33 ECTLNPGDMLY   43 (65)
Q Consensus        33 ~~~l~pGd~Ly   43 (65)
                      -++++|||.++
T Consensus       135 gv~V~PGD~Iv  145 (161)
T TIGR02998       135 GVTFFPDDYIY  145 (161)
T ss_pred             CEEECCCCEEE
Confidence            46788998776


No 203
>PRK09372 ribonuclease activity regulator protein RraA; Provisional
Probab=21.00  E-value=80  Score=20.37  Aligned_cols=10  Identities=60%  Similarity=1.062  Sum_probs=8.1

Q ss_pred             EEECCCCEEE
Q psy12933         34 CTLNPGDMLY   43 (65)
Q Consensus        34 ~~l~pGd~Ly   43 (65)
                      ++++|||+++
T Consensus       136 v~V~PGD~I~  145 (159)
T PRK09372        136 VTFFPGDYLY  145 (159)
T ss_pred             EEECCCCEEE
Confidence            6888998876


No 204
>PF13550 Phage-tail_3:  Putative phage tail protein
Probab=20.66  E-value=57  Score=19.71  Aligned_cols=14  Identities=29%  Similarity=0.669  Sum_probs=11.2

Q ss_pred             EEECCCCEEEeCCC
Q psy12933         34 CTLNPGDMLYIPPK   47 (65)
Q Consensus        34 ~~l~pGd~LyiP~~   47 (65)
                      ..|+|||++-++..
T Consensus       138 ~~l~pGDvi~l~~~  151 (164)
T PF13550_consen  138 LALEPGDVIALSDD  151 (164)
T ss_pred             ccCCCCCEEEEEeC
Confidence            55899999988754


No 205
>cd02778 MopB_CT_Thiosulfate-R-like The MopB_CT_Thiosulfate-R-like CD contains thiosulfate-, sulfur-, and polysulfide-reductases, and other related proteins. Thiosulfate reductase catalyzes the cleavage of sulfur-sulfur bonds in thiosulfate. Polysulfide reductase is a membrane-bound enzyme that catalyzes the reduction of polysulfide using either hydrogen or formate as the electron donor. Also included in this CD is the phenylacetyl-CoA:acceptor oxidoreductase, large subunit (PadB2), which has been characterized as a membrane-bound molybdenum-iron-sulfur enzyme involved in anaerobic metabolism of phenylalanine in the denitrifying bacterium Thauera aromatica. The MopB_CT_Thiosulfate-R-like CD is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs.
Probab=20.52  E-value=57  Score=18.97  Aligned_cols=12  Identities=25%  Similarity=0.562  Sum_probs=9.8

Q ss_pred             CEEEeCCCCeEE
Q psy12933         40 DMLYIPPKVWHH   51 (65)
Q Consensus        40 d~LyiP~~WwH~   51 (65)
                      .++++|.+|+|.
T Consensus        72 g~v~~~~g~g~~   83 (123)
T cd02778          72 DTVFMPHGFGHW   83 (123)
T ss_pred             CEEEEeccCCcC
Confidence            579999999764


No 206
>PF01568 Molydop_binding:  Molydopterin dinucleotide binding domain;  InterPro: IPR006657 A domain in this entry corresponds to the C-terminal domain IV in dimethyl sulphoxide (DMSO)reductase which interacts with the 2-amino pyrimidone ring of both molybdopterin guanine dinucleotide molecules [].; GO: 0016491 oxidoreductase activity, 0030151 molybdenum ion binding, 0055114 oxidation-reduction process; PDB: 2IVF_A 1OGY_G 3ML1_A 3O5A_A 1TI2_G 1VLE_M 1VLD_U 1VLF_O 1TI4_I 1TI6_E ....
Probab=20.47  E-value=30  Score=19.74  Aligned_cols=11  Identities=9%  Similarity=0.259  Sum_probs=10.2

Q ss_pred             CEEEeCCCCeE
Q psy12933         40 DMLYIPPKVWH   50 (65)
Q Consensus        40 d~LyiP~~WwH   50 (65)
                      +++++|.+||+
T Consensus        72 g~v~~~~~~~~   82 (110)
T PF01568_consen   72 GVVFMPHGWGG   82 (110)
T ss_dssp             TEEEEESTHTT
T ss_pred             CEEEEeccccc
Confidence            78999999998


No 207
>PF00568 WH1:  WH1 domain;  InterPro: IPR000697 The EVH1 (WH1, RanBP1-WASP) domain is found in multi-domain proteins implicated in a diverse range of signalling, nuclear transport and cytoskeletal events. This domain of around 115 amino acids is present in species ranging from yeast to mammals. Many EVH1-containing proteins associate with actin-based structures and play a role in cytoskeletal organisation. EVH1 domains recognise and bind the proline-rich motif FPPPP with low-affinity, further interactions then form between flanking residues [][].  WASP family proteins contain a EVH1 (WH1) in their N-terminals which bind proline-rich sequences in the WASP interacting protein. Proteins of the RanBP1 family contain a WH1 domain in their N-terminal region, which seems to bind a different sequence motif present in the C-terminal part of RanGTP protein [,].  Tertiary structure of the WH1 domain of the Mena protein revealed structure similarities with the pleckstrin homology (PH) domain. The overall fold consists of a compact parallel beta-sandwich, closed along one edge by a long alpha-helix. A highly conserved cluster of three surface-exposed aromatic side-chains forms the recognition site for the molecules target ligands. [].; GO: 0005515 protein binding; PDB: 1I2H_A 1DDV_A 1DDW_A 1EGX_A 3SYX_A 1TJ6_B 1XOD_B 1EVH_A 1I7A_B 2JP2_A ....
Probab=20.41  E-value=2e+02  Score=16.92  Aligned_cols=37  Identities=16%  Similarity=0.195  Sum_probs=30.2

Q ss_pred             CCCceEEEEECCCCEEEeCCCCeEEEEecCCeEEEEE
Q psy12933         27 TPGHLIECTLNPGDMLYIPPKVWHHVRSLSTSISVSF   63 (65)
Q Consensus        27 ~~~~~~~~~l~pGd~LyiP~~WwH~V~s~~~sisvn~   63 (65)
                      .+....++.|.+|=.-.-+...+|+-+..+..+.+||
T Consensus        58 ~~~~v~e~~l~~~~~Y~~~~~~Fh~f~~~~~~~GLnF   94 (111)
T PF00568_consen   58 DGKVVWEQELYPGFVYTKARPFFHQFEDDDCVYGLNF   94 (111)
T ss_dssp             TTEEEEEEEESTT-EEEEESSSEEEEEETTCEEEEEE
T ss_pred             ccEEEEEeEecCCCEEEeCCCcEEEEEeCCeEEEEec
Confidence            4556888899998766678999999999888888887


No 208
>PF08255 Leader_Trp:  Trp-operon Leader Peptide;  InterPro: IPR013205 The tryptophan operon regulatory region of Citrobacter freundii (leader transcript) encodes a 14-residue peptide containing characteristic tandem tryptophan residues. It is about 10 nucleotides shorter than those of Escherichia coli and Salmonella typhimurium [].
Probab=20.10  E-value=53  Score=13.28  Aligned_cols=10  Identities=10%  Similarity=0.315  Sum_probs=6.4

Q ss_pred             EEeCCCCeEE
Q psy12933         42 LYIPPKVWHH   51 (65)
Q Consensus        42 LyiP~~WwH~   51 (65)
                      +|.=.+|||-
T Consensus         4 ~~~L~~WWrt   13 (14)
T PF08255_consen    4 TFSLHGWWRT   13 (14)
T ss_pred             EEEEeeEEEc
Confidence            4555678873


Done!