Query         537021.9.peg.393_1
Match_columns 150
No_of_seqs    118 out of 1509
Neff          5.8 
Searched_HMMs 39220
Date          Wed May 25 05:33:15 2011
Command       /home/congqian_1/programs/hhpred/hhsearch -i peg_393.hhm -d /home/congqian_1/database/cdd/Cdd.hhm 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 CHL00202 argB acetylglutamate  100.0       0       0  322.1  15.0  143    7-149     5-148 (284)
  2 cd04250 AAK_NAGK-C AAK_NAGK-C: 100.0 6.8E-43       0  296.1  15.9  138   12-149     1-145 (279)
  3 PRK05279 N-acetylglutamate syn 100.0   8E-42       0  289.2  14.5  142    6-149     6-155 (441)
  4 cd04237 AAK_NAGS-ABP AAK_NAGS- 100.0 4.9E-41 1.4E-45  284.1  15.5  138   10-149     3-148 (280)
  5 COG0548 ArgB Acetylglutamate k 100.0 3.2E-39 8.1E-44  272.5  13.7  126   24-149     1-130 (265)
  6 TIGR00761 argB acetylglutamate 100.0 1.3E-37 3.3E-42  262.2  12.0  123   27-149     1-141 (254)
  7 TIGR01890 N-Ac-Glu-synth amino 100.0 7.4E-37 1.9E-41  257.3  11.2  139   10-150     2-148 (439)
  8 cd04249 AAK_NAGK-NC AAK_NAGK-N 100.0 2.4E-34 6.2E-39  241.2  11.5  122   28-149     1-123 (252)
  9 PRK00942 acetylglutamate kinas 100.0 2.3E-33   6E-38  234.9  13.1  123   27-149     1-130 (262)
 10 PRK04531 acetylglutamate kinas 100.0 6.1E-33 1.6E-37  232.2  13.7  131   13-149     4-139 (421)
 11 cd04238 AAK_NAGK-like AAK_NAGK 100.0 7.9E-32   2E-36  225.1  13.3  122   28-149     1-125 (256)
 12 cd04252 AAK_NAGK-fArgBP AAK_NA 100.0 4.9E-30 1.2E-34  213.6  13.1  115   29-149     2-117 (248)
 13 cd04251 AAK_NAGK-UC AAK_NAGK-U 100.0 2.6E-28 6.8E-33  202.5  11.4  115   28-149     1-132 (257)
 14 KOG2436 consensus               99.9 2.6E-24 6.6E-29  176.9  10.9  141    8-149    77-223 (520)
 15 cd04236 AAK_NAGS-Urea AAK_NAGS  99.9   3E-21 7.6E-26  157.3  11.8  128    3-149     6-140 (271)
 16 cd02115 AAK Amino Acid Kinases  99.8 4.5E-18 1.2E-22  136.9  11.6  114   29-148     1-114 (248)
 17 pfam00696 AA_kinase Amino acid  99.6 1.6E-14 4.1E-19  114.2  11.3  109   26-137     1-109 (230)
 18 PRK12353 carbamate kinase; Rev  99.3 2.2E-11 5.5E-16   94.1  11.7   94   25-120     2-104 (312)
 19 cd04241 AAK_FomA-like AAK_FomA  99.0 5.7E-09 1.5E-13   78.5  10.9   99   27-132     1-107 (252)
 20 PRK12686 carbamate kinase; Rev  98.7 3.1E-07 7.8E-12   67.4  11.6   96   24-121     2-106 (313)
 21 cd04235 AAK_CK AAK_CK: Carbama  98.7 4.9E-07 1.2E-11   66.2  11.4   93   27-125     1-107 (308)
 22 PRK12454 carbamate kinase-like  98.3 4.6E-06 1.2E-10   59.9   7.7   92   25-121     2-107 (313)
 23 cd04254 AAK_UMPK-PyrH-Ec UMP k  98.1   4E-05   1E-09   53.9  10.0   92   26-129     1-101 (231)
 24 PRK00358 pyrH uridylate kinase  98.1 7.1E-05 1.8E-09   52.3  11.1   90   26-127     1-99  (231)
 25 cd04256 AAK_P5CS_ProBA AAK_P5C  98.0   3E-05 7.7E-10   54.6   8.1   47   24-70      7-60  (284)
 26 cd04239 AAK_UMPK-like AAK_UMPK  98.0 0.00011 2.8E-09   51.1  10.3   88   28-126     2-96  (229)
 27 PRK12314 gamma-glutamyl kinase  97.8 0.00017 4.4E-09   49.8   9.1   46   24-69      8-59  (265)
 28 COG1608 Predicted archaeal kin  97.8 7.8E-05   2E-09   52.0   7.0  100   28-134     3-115 (252)
 29 PRK12352 putative carbamate ki  97.7 0.00018 4.5E-09   49.7   7.6   87   25-116     2-99  (316)
 30 cd04242 AAK_G5K_ProB AAK_G5K_P  97.7 0.00021 5.3E-09   49.3   7.8   51   28-80      2-58  (251)
 31 PRK09411 carbamate kinase; Rev  97.7 0.00075 1.9E-08   45.7  10.1   87   26-121     2-99  (297)
 32 TIGR00746 arcC carbamate kinas  97.6 0.00017 4.3E-09   49.9   6.2   94   26-125     1-110 (321)
 33 COG0549 ArcC Carbamate kinase   97.5  0.0012   3E-08   44.5   9.3   86   26-116     1-97  (312)
 34 cd04261 AAK_AKii-LysC-BS AAK_A  97.5  0.0019 4.9E-08   43.1   9.8   98   27-132     1-101 (239)
 35 PRK12354 carbamate kinase; Rev  97.2  0.0021 5.2E-08   42.9   7.9   82   27-117     2-94  (302)
 36 cd04246 AAK_AK-DapG-like AAK_A  97.2  0.0084 2.1E-07   39.0  10.3   96   27-128     1-97  (239)
 37 cd04253 AAK_UMPK-PyrH-Pf AAK_U  97.2  0.0015 3.9E-08   43.7   6.5   44   28-72      2-48  (221)
 38 COG0528 PyrH Uridylate kinase   97.0   0.014 3.5E-07   37.6  10.2   97   23-130     3-107 (238)
 39 PRK08841 aspartate kinase; Val  96.7   0.024 6.1E-07   36.1   9.6   99   26-131     2-102 (392)
 40 PRK07431 aspartate kinase; Pro  96.4    0.05 1.3E-06   34.0   9.6  102   26-133     2-104 (594)
 41 PRK08210 aspartate kinase I; R  96.4    0.08 2.1E-06   32.7  10.7  100   26-132     2-108 (405)
 42 cd04240 AAK_UC AAK_UC: Unchara  96.4  0.0038 9.8E-08   41.2   3.6   41   29-74      1-41  (203)
 43 cd04260 AAK_AKi-DapG-BS AAK_AK  96.4   0.068 1.7E-06   33.2   9.9   99   27-132     1-106 (244)
 44 TIGR02076 pyrH_arch uridylate   96.2  0.0059 1.5E-07   40.0   3.7   75   28-114     1-86  (232)
 45 PRK05429 gamma-glutamyl kinase  96.1    0.11 2.7E-06   31.9  10.0   56   24-81      7-68  (372)
 46 PRK13402 gamma-glutamyl kinase  96.1   0.019 4.8E-07   36.7   6.0   55   25-81      5-65  (363)
 47 COG0263 ProB Glutamate 5-kinas  96.0   0.064 1.6E-06   33.3   8.5   89   25-123     6-102 (369)
 48 PRK12443 uridylate kinase; Rev  95.9   0.024 6.1E-07   36.1   5.9   91   26-128     5-105 (247)
 49 TIGR01027 proB glutamate 5-kin  95.9    0.07 1.8E-06   33.1   8.2   86   26-121     1-96  (379)
 50 PRK06635 aspartate kinase; Rev  95.9     0.1 2.6E-06   32.0   8.9  100   26-132     2-103 (402)
 51 TIGR02075 pyrH_bact uridylate   95.8     0.2 5.2E-06   30.1  11.1  100   26-137     2-112 (236)
 52 COG0527 LysC Aspartokinases [A  95.6    0.18 4.7E-06   30.4   9.5  107   27-134     3-150 (447)
 53 COG5630 ARG2 Acetylglutamate s  95.3   0.013 3.2E-07   37.9   2.7  124   14-148   125-252 (495)
 54 cd04234 AAK_AK AAK_AK: Amino A  95.3    0.22 5.5E-06   29.9   9.0   39   27-66      1-39  (227)
 55 cd07041 STAS_RsbR_RsbS_like Su  95.0    0.18 4.5E-06   30.5   7.7   77    5-82     20-97  (109)
 56 KOG1154 consensus               94.2   0.078   2E-06   32.8   4.3   98   24-124     8-118 (285)
 57 PRK06291 aspartate kinase; Pro  92.6       1 2.5E-05   25.7   8.9   50   27-76      2-51  (466)
 58 cd04259 AAK_AK-DapDC AAK_AK-Da  89.7     1.1 2.7E-05   25.5   5.6   47   28-74      2-48  (295)
 59 cd06844 STAS Sulphate Transpor  87.9     2.5 6.5E-05   23.1   7.5   74    7-81     20-94  (100)
 60 cd04244 AAK_AK-LysC-like AAK_A  87.6     1.6   4E-05   24.4   5.3   48   28-76      2-49  (298)
 61 pfam00752 XPG_N XPG N-terminal  87.3     1.7 4.2E-05   24.3   5.3   52   20-71     20-85  (100)
 62 PRK08961 bifunctional aspartat  86.7     1.9   5E-05   23.8   5.4   54   23-76      5-59  (865)
 63 COG1366 SpoIIAA Anti-anti-sigm  84.0       4  0.0001   21.8   8.2   76    7-83     25-101 (117)
 64 PRK00192 mannosyl-3-phosphogly  82.5     3.4 8.7E-05   22.2   5.1   67   24-90      5-74  (275)
 65 smart00485 XPGN Xeroderma pigm  81.9     4.2 0.00011   21.7   5.4   52   20-71     19-84  (99)
 66 TIGR00657 asp_kinases aspartat  79.8     4.1  0.0001   21.8   4.7   52   26-77      2-55  (504)
 67 cd04243 AAK_AK-HSDH-like AAK_A  77.9     5.9 0.00015   20.7   5.1   47   28-76      2-48  (293)
 68 cd04257 AAK_AK-HSDH AAK_AK-HSD  77.7     6.2 0.00016   20.6   5.1   48   28-76      2-49  (294)
 69 TIGR01283 nifE nitrogenase MoF  77.5     6.6 0.00017   20.4   5.7   72    9-91    321-392 (470)
 70 PRK08373 aspartate kinase; Val  75.5     7.4 0.00019   20.1   5.1  100   24-127     2-132 (359)
 71 pfam01740 STAS STAS domain. Th  75.3     7.5 0.00019   20.1   7.3   73   10-83     24-97  (106)
 72 cd04258 AAK_AKiii-LysC-EC AAK_  75.0     6.7 0.00017   20.4   4.7   45   27-74      1-45  (292)
 73 cd04247 AAK_AK-Hom3 AAK_AK-Hom  72.3     8.9 0.00023   19.6   4.9   38   27-66      2-39  (306)
 74 COG0474 MgtA Cation transport   71.8     9.1 0.00023   19.5   7.5  100   29-130   536-649 (917)
 75 pfam10686 DUF2493 Protein of u  70.5     9.7 0.00025   19.4   5.5   52   26-80      3-59  (71)
 76 COG0561 Cof Predicted hydrolas  68.8      11 0.00027   19.1   5.6  102   25-128     3-109 (264)
 77 TIGR00268 TIGR00268 conserved   68.7     9.6 0.00025   19.4   4.3  109   14-136     3-131 (263)
 78 PRK05925 aspartate kinase; Pro  67.5      11 0.00028   19.0   9.5  104   26-136     2-148 (440)
 79 cd04255 AAK_UMPK-MosAB AAK_UMP  67.2     7.4 0.00019   20.1   3.5   85   26-123    31-121 (262)
 80 PRK04596 minC septum formation  67.0      11 0.00029   18.9   6.5   59   21-81     46-104 (248)
 81 TIGR01092 P5CS delta l-pyrroli  64.6      10 0.00027   19.2   3.9   45   25-69      7-58  (738)
 82 PTZ00217 flap endonuclease-1;   64.5      13 0.00033   18.6   5.2   50   20-69     23-92  (394)
 83 TIGR01116 ATPase-IIA1_Ca calci  63.5     5.9 0.00015   20.8   2.4   95    4-99    418-538 (800)
 84 cd04245 AAK_AKiii-YclM-BS AAK_  63.1      13 0.00032   18.6   4.1   35   28-64      2-36  (288)
 85 PRK00009 phosphoenolpyruvate c  63.1      14 0.00035   18.4   4.8   65   26-93    568-647 (916)
 86 KOG2518 consensus               60.0     9.7 0.00025   19.3   3.0   49   19-69     18-82  (556)
 87 PRK03868 glucose-6-phosphate i  60.0      15 0.00039   18.1   8.9   89   10-102    42-135 (409)
 88 PRK10444 UMP phosphatase; Prov  58.4      15 0.00038   18.1   3.8   55   26-80      2-60  (248)
 89 cd01979 Pchlide_reductase_N Pc  58.3      16 0.00042   17.9   5.0   77    7-95    258-334 (396)
 90 TIGR00600 rad2 DNA excision re  57.8      11 0.00027   19.1   2.9   62   33-98     46-113 (1127)
 91 pfam06953 ArsD Arsenical resis  57.4      15 0.00038   18.2   3.6   67   38-137    22-88  (119)
 92 PRK10530 phosphotransferase; P  57.4      17 0.00043   17.8   4.3   97   26-122     4-105 (272)
 93 cd00128 XPG Xeroderma pigmento  57.1      17 0.00044   17.8   5.4   50   20-69     19-82  (316)
 94 PRK00973 glucose-6-phosphate i  57.0      17 0.00044   17.7   6.8   87   12-99     61-154 (454)
 95 TIGR01292 TRX_reduct thioredox  56.3      18 0.00045   17.7   4.1   36   21-65    147-182 (321)
 96 cd03141 GATase1_Hsp31_like Typ  56.2      18 0.00045   17.7   6.8   54   82-135   146-205 (221)
 97 TIGR00014 arsC arsenate reduct  55.5     6.5 0.00017   20.4   1.5   10   59-68     92-101 (114)
 98 TIGR01465 cobM_cbiF precorrin-  55.1      18 0.00047   17.6  11.7  126    5-135    49-203 (252)
 99 pfam08282 Hydrolase_3 haloacid  54.5      19 0.00048   17.5   4.1   26   10-35     39-64  (254)
100 PRK13385 2-C-methyl-D-erythrit  53.9      18 0.00046   17.6   3.6   59    9-72     54-119 (238)
101 cd01976 Nitrogenase_MoFe_alpha  53.7      19  0.0005   17.4   7.2   67    5-80    280-349 (421)
102 PRK09444 pntB pyridine nucleot  50.2      22 0.00056   17.1   4.2   39   45-83    291-339 (459)
103 COG2054 Uncharacterized archae  49.7      21 0.00053   17.2   3.3   73   29-113     3-77  (212)
104 TIGR00453 ispD 2-C-methyl-D-er  48.6      19 0.00049   17.5   3.0   34   27-65     70-104 (226)
105 PTZ00114 Heat shock protein 60  47.9      24 0.00061   16.8   7.8  113   18-134   397-519 (580)
106 PRK00694 4-hydroxy-3-methylbut  47.6      23 0.00059   16.9   3.3   53   44-96     45-104 (606)
107 pfam02233 PNTB NAD(P) transhyd  47.3      24 0.00062   16.8   4.2   43   45-87    296-352 (464)
108 pfam01973 MAF_flag10 Protein o  46.6      25 0.00064   16.7   3.8   27   10-36     33-59  (169)
109 PTZ00063 histone deacetylase;   46.4      25 0.00064   16.7   6.1  118   11-133   237-390 (439)
110 TIGR00656 asp_kin_monofn aspar  45.4      26 0.00067   16.6   4.5   54   25-78      1-54  (480)
111 cd01974 Nitrogenase_MoFe_beta   45.2      26 0.00067   16.6   7.9   65    3-76    281-347 (435)
112 TIGR00705 SppA_67K signal pept  45.2      15 0.00038   18.2   2.0   75   33-126   371-454 (614)
113 PRK08605 D-lactate dehydrogena  45.0      27 0.00068   16.5   3.5   36   51-86     63-100 (332)
114 TIGR02078 AspKin_pair Pyrococc  44.0      27  0.0007   16.4   3.8   46   27-77      1-48  (341)
115 PRK10637 cysG siroheme synthas  42.7      29 0.00073   16.3   5.4   69   15-86    254-329 (457)
116 TIGR02472 sucr_P_syn_N sucrose  42.4      15 0.00039   18.0   1.8   78   35-122   348-427 (445)
117 COG1211 IspD 4-diphosphocytidy  42.1      29 0.00075   16.3   3.4   20   55-74     70-89  (230)
118 cd03134 GATase1_PfpI_like A ty  41.9      30 0.00075   16.2   5.7   67   47-136    84-151 (165)
119 COG2378 Predicted transcriptio  41.0      25 0.00065   16.7   2.7   80    8-100     5-87  (311)
120 PRK10017 putative pyruvyl tran  40.9      30 0.00077   16.2   3.0   51   19-71    111-163 (426)
121 pfam00850 Hist_deacetyl Histon  40.4      31  0.0008   16.1   7.4   62   12-73    218-290 (302)
122 PRK10513 sugar phosphatase; Pr  40.0      32 0.00081   16.1   5.2   65   26-90      4-75  (270)
123 PRK09034 aspartate kinase; Rev  39.9      32 0.00081   16.0   7.6   34   28-63      2-35  (450)
124 cd03147 GATase1_Ydr533c_like T  37.9      34 0.00087   15.9   5.6  102   29-135    98-215 (231)
125 cd01427 HAD_like Haloacid deha  36.9      33 0.00083   16.0   2.7   43   41-83     24-67  (139)
126 cd05015 SIS_PGI_1 Phosphogluco  36.1      36 0.00093   15.7   9.4  111   13-131     4-122 (158)
127 PRK11440 hypothetical protein;  35.6      17 0.00043   17.8   1.1   93   38-131    29-133 (188)
128 TIGR03674 fen_arch flap struct  35.5      37 0.00095   15.6   5.7   28   41-69     57-84  (338)
129 KOG0339 consensus               34.8      38 0.00097   15.5   6.2   73   21-101   292-364 (731)
130 PRK00155 ispD 2-C-methyl-D-ery  34.6      38 0.00098   15.5   3.6   14   59-72     98-112 (228)
131 TIGR01508 rib_reduct_arch diam  34.4      39 0.00099   15.5   4.0   63   18-87    116-186 (224)
132 PRK05301 pyrroloquinoline quin  34.0      39   0.001   15.5   7.8   67   14-80    105-185 (375)
133 TIGR01245 trpD anthranilate ph  33.9      39   0.001   15.4   3.4   63   39-106   192-279 (331)
134 PRK05991 precorrin-3B C17-meth  33.9      39   0.001   15.4   5.0   71    9-86     28-113 (254)
135 PRK00339 minC septum formation  33.3      40   0.001   15.4   6.2   61   20-82     47-107 (249)
136 PRK13639 cbiO cobalt transport  33.2      40   0.001   15.4   4.9   13   46-58    224-236 (275)
137 TIGR02090 LEU1_arch isopropylm  32.6      17 0.00043   17.8   0.7   14   66-79    321-334 (371)
138 PRK06136 uroporphyrin-III C-me  32.3      42  0.0011   15.3   6.5   92   24-120    79-190 (253)
139 PRK09084 aspartate kinase III;  32.3      42  0.0011   15.3   5.2   37   27-66      1-37  (447)
140 KOG4701 consensus               32.2      42  0.0011   15.3   3.1   51   17-68     58-114 (568)
141 PRK10517 magnesium-transportin  31.1      44  0.0011   15.2   7.9   89   37-127   545-643 (900)
142 COG0547 TrpD Anthranilate phos  30.9      44  0.0011   15.1   3.6   65   38-105   197-284 (338)
143 cd01987 USP_OKCHK USP domain i  30.8      44  0.0011   15.1   8.4  103    6-114     9-115 (124)
144 PRK09436 thrA bifunctional asp  30.1      45  0.0012   15.0   5.1   47   28-75      2-48  (817)
145 PRK10976 putative sugar phosph  29.9      46  0.0012   15.0   5.4   66   26-91      3-72  (266)
146 pfam11869 DUF3389 Protein of u  29.9      46  0.0012   15.0   4.6   35   28-68     17-51  (75)
147 KOG0456 consensus               29.7      46  0.0012   15.0   5.7   47   25-73     78-124 (559)
148 TIGR01369 CPSaseII_lrg carbamo  29.6      46  0.0012   15.0   3.7   63   19-86    948-1013(1089)
149 pfam01872 RibD_C RibD C-termin  29.3      47  0.0012   15.0   3.2   31   46-76    119-149 (196)
150 TIGR00329 gcp metalloendopepti  28.8      48  0.0012   14.9   3.0   62   34-99    159-225 (337)
151 PRK06859 consensus              28.2      49  0.0012   14.8   6.4   47   70-119   106-152 (253)
152 KOG3517 consensus               27.8      50  0.0013   14.8   2.4   71   31-101    10-90  (334)
153 PRK06278 cobyrinic acid a,c-di  27.8      34 0.00087   15.9   1.5   17   20-36    266-282 (482)
154 pfam07521 RMMBL RNA-metabolisi  27.6      50  0.0013   14.8   3.1   33   27-66      9-41  (43)
155 cd02871 GH18_chitinase_D-like   27.4      51  0.0013   14.8   3.4   24   44-67     60-83  (312)
156 PRK12852 groEL chaperonin GroE  27.1      51  0.0013   14.7   6.7  109   23-133   372-488 (542)
157 KOG0202 consensus               27.1      51  0.0013   14.7   4.1  112   10-126   557-684 (972)
158 PRK09451 glmU bifunctional N-a  26.8      52  0.0013   14.7   2.9   38    9-48     82-119 (456)
159 PRK11391 etp phosphotyrosine-p  26.7      52  0.0013   14.7   2.9   69   26-95      3-74  (144)
160 TIGR01737 FGAM_synth_I phospho  26.6      52  0.0013   14.7   3.0   84   26-125     2-100 (264)
161 TIGR00099 Cof-subfamily Cof-li  26.4      53  0.0013   14.6   7.4  112   28-139     2-126 (270)
162 TIGR03590 PseG pseudaminic aci  26.3      53  0.0013   14.6   6.2   99   25-130   171-273 (280)
163 COG3956 Protein containing tet  25.7      54  0.0014   14.5   6.3   72   20-97     77-148 (488)
164 TIGR01249 pro_imino_pep_1 prol  25.6      54  0.0014   14.5   2.6   55   45-101   242-305 (310)
165 cd07022 S49_Sppa_36K_type Sign  25.6      55  0.0014   14.5   5.0   78   67-146   115-202 (214)
166 KOG1533 consensus               25.1      56  0.0014   14.5   4.3  107   11-123    86-231 (290)
167 TIGR02824 quinone_pig3 putativ  24.9      43  0.0011   15.2   1.6   28   51-78    163-190 (334)
168 TIGR01326 OAH_OAS_sulfhy O-ace  24.9      42  0.0011   15.3   1.6   51   44-97     83-138 (434)
169 cd01013 isochorismatase Isocho  24.8      56  0.0014   14.4   3.9   91   40-131    53-154 (203)
170 PRK12851 groEL chaperonin GroE  24.5      57  0.0015   14.4   6.9  109   21-133   370-487 (541)
171 cd02516 CDP-ME_synthetase CDP-  24.3      58  0.0015   14.4   5.5   48   18-70     61-109 (218)
172 PRK05802 hypothetical protein;  24.2      58  0.0015   14.4   6.1   73    9-87    155-233 (328)
173 PRK09181 aspartate kinase; Val  23.9      59  0.0015   14.3   3.8   39   26-67      3-45  (476)
174 pfam02579 Nitro_FeMo-Co Dinitr  23.8      59  0.0015   14.3   3.5   32   52-83     48-79  (94)
175 KOG0356 consensus               23.7      59  0.0015   14.3   5.7   82   18-103   381-463 (550)
176 PRK13561 putative phosphodiest  23.4      60  0.0015   14.3   7.6   63   10-74    500-565 (651)
177 TIGR03343 biphenyl_bphD 2-hydr  23.3      30 0.00078   16.2   0.6   65   11-76     48-120 (282)
178 PTZ00286 6-phospho-1-fructokin  23.2      61  0.0015   14.2   3.4   82   50-135   180-280 (486)
179 pfam01075 Glyco_transf_9 Glyco  23.0      61  0.0016   14.2   7.8  114   12-128    94-214 (249)
180 PRK05569 flavodoxin; Provision  22.9      61  0.0016   14.2   5.5   70   26-100    50-134 (141)
181 TIGR01250 pro_imino_pep_2 prol  22.7      44  0.0011   15.1   1.4   30   57-87     25-63  (302)
182 TIGR00963 secA preprotein tran  22.6      62  0.0016   14.2   4.6   45   40-85    418-466 (904)
183 cd03466 Nitrogenase_NifN_2 Nit  22.5      62  0.0016   14.2   7.8  105    3-122   278-394 (429)
184 KOG1342 consensus               22.5      62  0.0016   14.2   5.2   57   16-72    244-306 (425)
185 cd03148 GATase1_EcHsp31_like T  22.3      63  0.0016   14.1   5.2  100   29-135   100-215 (232)
186 pfam02593 DUF166 Uncharacteriz  22.3      63  0.0016   14.1   5.1  106   11-135    35-145 (215)
187 cd01425 RPS2 Ribosomal protein  22.1      63  0.0016   14.1   5.4  116   10-130    13-162 (193)
188 cd04248 AAK_AK-Ectoine AAK_AK-  22.1      64  0.0016   14.1   3.5   15   28-42      2-16  (304)
189 COG2217 ZntA Cation transport   22.1      64  0.0016   14.1   6.4   80   18-100   507-594 (713)
190 PRK01158 phosphoglycolate phos  22.1      64  0.0016   14.1   5.7   67   26-92      4-74  (226)
191 cd07042 STAS_SulP_like_sulfate  21.1      67  0.0017   14.0   7.5   74    9-83     23-98  (107)
192 COG0648 Nfo Endonuclease IV [D  21.0      67  0.0017   14.0   3.7   28   41-68     84-111 (280)
193 cd04895 ACT_ACR_1 ACT domain-c  20.9      67  0.0017   14.0   2.0   19   85-103    51-69  (72)
194 PRK01096 deoxyguanosinetriphos  20.6      68  0.0017   13.9   2.7   35   44-79    100-134 (442)
195 COG4148 ModC ABC-type molybdat  20.5      61  0.0015   14.2   1.7   50    1-54    152-203 (352)
196 PRK10528 multifunctional acyl-  20.1      70  0.0018   13.9   6.0   54   10-63     57-114 (191)

No 1  
>CHL00202 argB acetylglutamate kinase; Provisional
Probab=100.00  E-value=0  Score=322.08  Aligned_cols=143  Identities=40%  Similarity=0.724  Sum_probs=139.4

Q ss_pred             HHCCCHHHHHHHHHHHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECC
Q ss_conf             22062289999999987598899986756724746899999999998518971799845724578899970898115578
Q 537021.9.peg.3    7 QFQAEILEQVLPFVQFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENG   86 (150)
Q Consensus         7 ~~~a~~l~ea~pYi~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G   86 (150)
                      .+++.|||||+|||++||||||||||||++|.|++++.+|++||++|+++|+|+||||||||||+++|+++|++|+|++|
T Consensus         5 ~e~v~~lrea~pYi~~~rgktfVIk~gG~~~~d~~l~~~~~~DialL~~lGik~VlVHGgg~qI~~~l~~~gi~~~f~~G   84 (284)
T CHL00202          5 DERVQVLSEALPYIQKFRGRIMVIKYGGAAMKNLILKADIIKDILFLSCIGLKIVVVHGGGPEINFWLKQLNISPKFWNG   84 (284)
T ss_pred             HHHHHHHHHHHHHHHHHCCCEEEEEECCEEEECHHHHHHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHCCCCCCCCCC
T ss_conf             89999999977999996899899998966663754899999999999988997999889966899999976998436288


Q ss_pred             CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEEECCEEEEEECC-CCCCCCCCCCC
Q ss_conf             3466124899999997416689999999828997687786558989999747-52111799510
Q 537021.9.peg.3   87 LRITDQQTAEVVEMVLAGSINKKIVSLINQTGTQAIGICGKDGNMVFAEKAR-HSLRLSPNTKK  149 (150)
Q Consensus        87 ~RVTd~~tl~iv~~vL~G~vn~~lv~~L~~~g~~avglsg~dg~li~a~~~~-~~lg~vg~~~k  149 (150)
                      +||||+++|++++|+|+|++|++|++.|+++|.+|+|++|.|||+++|+|.+ .++|++|++++
T Consensus        85 ~RvTd~~~l~vv~~vl~g~vn~~lv~~l~~~g~~a~gl~g~dg~~i~A~~~~~~D~g~vG~V~~  148 (284)
T CHL00202         85 IRVTDKVTMEIVEMVLAGKVNKDLVGSINANGGKAVGLCGKDANLIVARASDKKDLGLVGEIQQ  148 (284)
T ss_pred             CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEECCCCCEEEEECCCCCCCCCCCCCEE
T ss_conf             3218878999999998778999999999856995055312557669983589877444444123


No 2  
>cd04250 AAK_NAGK-C AAK_NAGK-C: N-Acetyl-L-glutamate kinase - cyclic (NAGK-C) catalyzes the phosphorylation of the gamma-COOH group of N-acetyl-L-glutamate (NAG) by ATP in the second step of arginine biosynthesis found in some bacteria and photosynthetic organisms using the non-acetylated, cyclic route of ornithine biosynthesis. In this pathway, glutamate is first N-acetylated and then phosphorylated by NAGK to give phosphoryl NAG, which is converted to NAG-ornithine. There are two variants of this pathway. In one, typified by the pathway in Thermotoga maritima and Pseudomonas aeruginosa, the acetyl group is recycled by reversible transacetylation from acetylornithine to glutamate. The phosphorylation of NAG by NAGK is feedback inhibited by arginine. In photosynthetic organisms, NAGK is the target of the nitrogen-signaling protein PII. Hexameric formation of NAGK domains appears to be essential to both arginine inhibition and NAGK-PII complex formation. NAGK-C are members of the Amino A
Probab=100.00  E-value=6.8e-43  Score=296.07  Aligned_cols=138  Identities=49%  Similarity=0.869  Sum_probs=133.6

Q ss_pred             HHHHHHHHHHHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCC
Q ss_conf             28999999998759889998675672474689999999999851897179984572457889997089811557834661
Q 537021.9.peg.3   12 ILEQVLPFVQFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITD   91 (150)
Q Consensus        12 ~l~ea~pYi~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd   91 (150)
                      +||||+|||++|||||||||+||+++.|++++.+|++||++|+.+|+++||||||||||+++|+++|++|+|++|+||||
T Consensus         1 ~l~ea~PYi~~~rgktiVIklGG~~l~d~~~~~~~~~dI~~L~~~G~~vVlVHGggpqi~~~l~~~gi~~~~~~G~RvT~   80 (279)
T cd04250           1 VLIEALPYIQKFRGKTVVIKYGGNAMKDEELKESFARDIVLLKYVGINPVVVHGGGPEINEMLKKLGIESEFVNGLRVTD   80 (279)
T ss_pred             CHHHHHHHHHHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCEEEEEECCCHHHHHHHHHCCCCCCCCCCCCCCC
T ss_conf             90466678998589989999890784697799999999999998899299996996799999998699842237743573


Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEEECCEEEEEECC-------CCCCCCCCCCC
Q ss_conf             24899999997416689999999828997687786558989999747-------52111799510
Q 537021.9.peg.3   92 QQTAEVVEMVLAGSINKKIVSLINQTGTQAIGICGKDGNMVFAEKAR-------HSLRLSPNTKK  149 (150)
Q Consensus        92 ~~tl~iv~~vL~G~vn~~lv~~L~~~g~~avglsg~dg~li~a~~~~-------~~lg~vg~~~k  149 (150)
                      +++|++++|+|+|++|++|++.|+++|++|+|+||.|+|+++|+|+.       .++|++|++++
T Consensus        81 ~~~l~iv~~~l~G~vn~~lv~~l~~~g~~avglsg~dg~~i~a~~~~~~~~~~~~D~g~vG~v~~  145 (279)
T cd04250          81 EETMEIVEMVLVGKVNKEIVSLINRAGGKAVGLSGKDGNLIKAKKKDATVIEEIIDLGFVGEVTE  145 (279)
T ss_pred             HHHHHHHHHHHHCCHHHHHHHHHHHCCCCEEECCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCC
T ss_conf             88899999998561469999999973998546226668669999755555555665664566552


No 3  
>PRK05279 N-acetylglutamate synthase; Validated
Probab=100.00  E-value=8e-42  Score=289.21  Aligned_cols=142  Identities=27%  Similarity=0.406  Sum_probs=129.2

Q ss_pred             HHHCCCHHHHHHHHHHHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEEC
Q ss_conf             32206228999999998759889998675672474689999999999851897179984572457889997089811557
Q 537021.9.peg.3    6 YQFQAEILEQVLPFVQFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFEN   85 (150)
Q Consensus         6 ~~~~a~~l~ea~pYi~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~   85 (150)
                      ..+=..|||||+|||++||||||||||||++|.|+++ .+|++||++|+++|+++||||||||||+++|++.|++|+|++
T Consensus         6 ~~~fv~~fRea~PYI~~frgktfVI~~gG~a~~~~~~-~~~~~DIaLL~slGIr~VlVHGggpqI~~~L~~~gi~s~f~~   84 (441)
T PRK05279          6 TTEFVDWFRHSAPYINAHRGKTFVIMLGGEAIAHGNF-SNIVNDIALLHSLGIRLVLVHGARPQIEEQLAAHGIEPRYHK   84 (441)
T ss_pred             HHHHHHHHHHHHHHHHHHCCCEEEEEECCEECCCCCH-HHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHCCCCCEEEC
T ss_conf             1689999985137789858998999989720368358-999999999986897699987997788999997699855139


Q ss_pred             CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCC----CEEEEEEEECCEEEEEECC----CCCCCCCCCCC
Q ss_conf             8346612489999999741668999999982899----7687786558989999747----52111799510
Q 537021.9.peg.3   86 GLRITDQQTAEVVEMVLAGSINKKIVSLINQTGT----QAIGICGKDGNMVFAEKAR----HSLRLSPNTKK  149 (150)
Q Consensus        86 G~RVTd~~tl~iv~~vL~G~vn~~lv~~L~~~g~----~avglsg~dg~li~a~~~~----~~lg~vg~~~k  149 (150)
                      |+||||++||++++|+ +|++|.+|++.|++...    .+.++++++||+++|+|++    .++|++|++++
T Consensus        85 G~RVTD~~tl~~v~~v-~G~v~~~I~a~Ls~gl~nspm~g~~i~v~sGn~i~A~p~Gv~dGvD~g~vG~V~~  155 (441)
T PRK05279         85 GLRITDAAALECVKQA-AGELRLDIEARLSMGLPNTPMAGAHIRVVSGNFVTARPLGVDDGVDYCHTGEVRR  155 (441)
T ss_pred             CCCCCCHHHHHHHHHH-HHHHHHHHHHHHHCCCCCCCCCCCCEEEECCCEEEEEECCCCCCCCCCCCCEEEE
T ss_conf             9136999999999999-9999999999983436787555772467548668987635313756675415879


No 4  
>cd04237 AAK_NAGS-ABP AAK_NAGS-ABP: N-acetylglutamate (NAG) kinase-like domain of the NAG Synthase (NAGS) of the arginine-biosynthesis pathway (ABP) found in gamma- and beta-proteobacteria and higher plant chloroplasts. Domain architecture of these NAGS consisted of an N-terminal NAG kinase-like (ArgB) domain (this CD) and a C-terminal NAG synthase, acetyltransferase (ArgA) domain. Both bacterial and plant sequences in this CD have a conserved N-terminal extension; a similar sequence in the NAG kinases of the cyclic arginine-biosynthesis pathway has been implicated in feedback inhibition sensing. Plant sequences also have an N-terminal chloroplast transit peptide and an insert (approx. 70 residues) in the C-terminal region of ArgB. Members of this CD belong to the Amino Acid Kinase Superfamily (AAK).
Probab=100.00  E-value=4.9e-41  Score=284.15  Aligned_cols=138  Identities=28%  Similarity=0.471  Sum_probs=126.0

Q ss_pred             CCHHHHHHHHHHHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCC
Q ss_conf             62289999999987598899986756724746899999999998518971799845724578899970898115578346
Q 537021.9.peg.3   10 AEILEQVLPFVQFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRI   89 (150)
Q Consensus        10 a~~l~ea~pYi~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RV   89 (150)
                      ..|||||+|||++||||||||||||++|+|++ +++|++||++|+++|++|||||||||||+++|+++|++|+|++|+||
T Consensus         3 v~~~rea~pYi~~f~~kt~VIK~GG~~~~~~~-l~~~~~di~lL~~~Gi~~VlVHGGG~~I~~~l~~~gi~~~fv~G~RV   81 (280)
T cd04237           3 VDWFREAAPYINAHRGKTFVIAFGGEAVAHPN-FDNIVHDIALLHSLGIRLVLVHGARPQIDQRLAERGLEPRYHRGLRI   81 (280)
T ss_pred             HHHHHHHHHHHHHHCCCEEEEEECCHHHCCHH-HHHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHCCCCEEECCCCCC
T ss_conf             78888867999986899899998963307744-99999999999988996999928986779999986997462178777


Q ss_pred             CCHHHHHHHHHHHHHHHHHHHHHHHHHC----CCCEEEEEEEECCEEEEEECC----CCCCCCCCCCC
Q ss_conf             6124899999997416689999999828----997687786558989999747----52111799510
Q 537021.9.peg.3   90 TDQQTAEVVEMVLAGSINKKIVSLINQT----GTQAIGICGKDGNMVFAEKAR----HSLRLSPNTKK  149 (150)
Q Consensus        90 Td~~tl~iv~~vL~G~vn~~lv~~L~~~----g~~avglsg~dg~li~a~~~~----~~lg~vg~~~k  149 (150)
                      ||+++|++++++ .|.+|++|++.|++.    |..+.++++.+||++.|+|++    .++|+||++++
T Consensus        82 Td~~tl~vv~~~-~G~vn~~i~a~ls~g~~~~g~~g~~~~~~~g~~i~a~p~g~~dgiD~G~vG~v~~  148 (280)
T cd04237          82 TDAAALECVKEA-AGAVRLEIEALLSMGLPNSPMAGARIRVVSGNFVTARPLGVVDGVDFGHTGEVRR  148 (280)
T ss_pred             CCHHHHHHHHHH-HHHHHHHHHHHHHHCCCCCCCCCCCCEEECCCEEEEEECCCCCCCCCCCCCCCCE
T ss_conf             999999999999-9999999999997053435653557234058658988865314744467983026


No 5  
>COG0548 ArgB Acetylglutamate kinase [Amino acid transport and metabolism]
Probab=100.00  E-value=3.2e-39  Score=272.53  Aligned_cols=126  Identities=49%  Similarity=0.802  Sum_probs=123.4

Q ss_pred             CCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHHHHHHHHHHH
Q ss_conf             59889998675672474689999999999851897179984572457889997089811557834661248999999974
Q 537021.9.peg.3   24 ENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQTAEVVEMVLA  103 (150)
Q Consensus        24 r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~tl~iv~~vL~  103 (150)
                      ++||+||||||++|.|++++++|++||++|+..|+++|+|||||||||++|+++|++|+|++|+||||++||++++|||+
T Consensus         1 ~~k~~VIK~GG~~~~~~~l~~~~~~di~lL~~~G~~~VvVHGggp~I~~~l~~~gie~~f~~glRvTd~~tlevv~mvl~   80 (265)
T COG0548           1 RGKTIVIKLGGSAMEDENLLEAFASDIALLKSVGIRPVVVHGGGPQIDEMLAKLGIEPEFVKGLRVTDAETLEVVEMVLG   80 (265)
T ss_pred             CCCEEEEEECCEEECCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCEECCCHHHHHHHHHHHH
T ss_conf             99669999894644170278999999999997799479992895688899997699976349777679899999999998


Q ss_pred             HHHHHHHHHHHHHCCCCEEEEEEEECCEEEEEECCCC----CCCCCCCCC
Q ss_conf             1668999999982899768778655898999974752----111799510
Q 537021.9.peg.3  104 GSINKKIVSLINQTGTQAIGICGKDGNMVFAEKARHS----LRLSPNTKK  149 (150)
Q Consensus       104 G~vn~~lv~~L~~~g~~avglsg~dg~li~a~~~~~~----lg~vg~~~k  149 (150)
                      |++|++|++.|+++|++|+|+||.|||+++|+|++++    +|+||++++
T Consensus        81 G~vNk~iva~l~~~g~~avGlsg~Dg~li~A~~~~~~~~id~g~vG~i~~  130 (265)
T COG0548          81 GTVNKEIVARLSKHGGQAVGLSGVDGNLVTAKKLDVDDGVDLGYVGEIRK  130 (265)
T ss_pred             HHHHHHHHHHHHHHCCCCEEEEECCCCEEEEEECCCCCCCCCCEEEEEEE
T ss_conf             88899999999971874145553378789999866413433640445898


No 6  
>TIGR00761 argB acetylglutamate kinase; InterPro: IPR004662   N -Acetylglutamate (NAG) fulfils distinct biological roles in lower and higher organisms. In prokaryotes, lower eukaryotes and plants it is the first intermediate in the biosynthesis of arginine, whereas in ureotelic (excreting nitrogen mostly in the form of urea) vertebrates, it is an essential allosteric cofactor for carbamyl phosphate synthetase I (CPSI), the first enzyme of the urea cycle. The pathway that leads from glutamate to arginine in lower organisms employs eight steps, starting with the acetylation of glutamate to form NAG. In these species, NAG can be produced by two enzymatic reactions: one catalysed by NAG synthase (NAGS) and the other by ornithine acetyltransferase (OAT). In ureotelic species, NAG is produced exclusively by NAGS. In lower organisms, NAGS is feedback-inhibited by L-arginine, whereas mammalian NAGS activity is significantly enhanced by this amino acid. The NAGS genes of bacteria, fungi and mammals are more diverse than other arginine-biosynthesis and urea-cycle genes. The evolutionary relationship between the distinctly different roles of NAG and its metabolism in lower and higher organisms remains to be determined .   The pathway from glutamate to arginine is:   NAGS; N-acetylglutamate synthase (2.3.1.1 from EC) (glutamate to N-acetylglutamate) NAGK; N-acetylglutamate kinase (2.7.2.8 from EC) (N-acetylglutamate to N-acetylglutamate-5P) NAGSA; N-acetyl-gamma-glutamyl-phosphate reductase (1.2.1.38 from EC) (N-acetylglutamate-5P to N-acetylglumate semialdehyde) Acetylornithine aminotransferase (2.6.1.11 from EC) (N-acetylglumate semialdehyde to N-acetylornithine) Acetylornithine deacetylase (3.5.1.16 from EC) (N-acetylornithine to ornithine) Arginase (3.5.3.1 from EC) (ornithine to arginine) This entry describes N-acetylglutamate kinases (ArgB) of many prokaryotes and the N-acetylglutamate kinase domains of multifunctional proteins from yeasts as well as some, as yet, uncharacterised proteins.; GO: 0003991 acetylglutamate kinase activity, 0006526 arginine biosynthetic process, 0005737 cytoplasm.
Probab=100.00  E-value=1.3e-37  Score=262.17  Aligned_cols=123  Identities=48%  Similarity=0.834  Sum_probs=118.9

Q ss_pred             EEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCC-EEEEECCCHHHHHHHHHCCCCCEEECCC-----CCCCHHHHHHHHH
Q ss_conf             899986756724746899999999998518971-7998457245788999708981155783-----4661248999999
Q 537021.9.peg.3   27 TIVVKYGGHVMNCTDLSKDFVNDIALLKKSNIT-PVIVHGGGPQIGAVLEKMGIKSKFENGL-----RITDQQTAEVVEM  100 (150)
Q Consensus        27 tiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k-~ViVHGgg~qI~~~l~~~gi~~~~v~G~-----RVTd~~tl~iv~~  100 (150)
                      ++|||+||+++.|+++.++|++|++.|+..|++ |||||||||+||++|+++||||+|++|+     ||||++||+|++|
T Consensus         1 ~~ViK~GG~~~~~~~~~~~~~~~~~~l~~~g~~PpviVHGGGp~in~~~~~lgi~~~f~~Gl~G~~~RvTd~~tl~v~~~   80 (254)
T TIGR00761         1 TIVIKIGGSAISNDELLEEFFSDIAFLRAVGIKPPVIVHGGGPEINELLEALGIEPEFKNGLSGVVFRVTDKETLEVVEM   80 (254)
T ss_pred             CEEEEECCEEECCCHHHHHHHHHHHHHHHCCCCCCEEEECCCHHHHHHHHHCCCCCEEECCCCCEEEECCCHHHHHHHHH
T ss_conf             97898447341041246999999999997298553898378526888998328983676688720375278899999899


Q ss_pred             HHHHHHHHHHHHHHHHCCCCEEEEEEEECCEEEEE----ECCCC--------CCCCCCCCC
Q ss_conf             97416689999999828997687786558989999----74752--------111799510
Q 537021.9.peg.3  101 VLAGSINKKIVSLINQTGTQAIGICGKDGNMVFAE----KARHS--------LRLSPNTKK  149 (150)
Q Consensus       101 vL~G~vn~~lv~~L~~~g~~avglsg~dg~li~a~----~~~~~--------lg~vg~~~k  149 (150)
                      ||+|++|+.|++.|+++|++|+||||.|+.+++|+    .++.+        +||||++++
T Consensus        81 vL~g~vN~~LV~~l~~~g~~A~GL~g~Dg~L~~a~rk~~~~~~~~~~vv~~~~G~VG~~~~  141 (254)
T TIGR00761        81 VLIGDVNKELVALLNKHGINAIGLTGLDGQLFTADRKTASLDKEVIKVVAIDLGYVGEIKK  141 (254)
T ss_pred             HHHHHHHHHHHHHHHHCCCCEEEECCCCCCEEEEEEEEEEEEECCCEEEEEECCCEEEECC
T ss_conf             9836466999999996799286311357735799987898851576899997484123132


No 7  
>TIGR01890 N-Ac-Glu-synth amino-acid N-acetyltransferase; InterPro: IPR010167   This entry represents a clade of amino-acid N-acetyltransferases, or N-acetylglutamate synthase, which is the product of the argA gene and the first enzyme in arginine biosynthesis. This enzyme displays more diversity between bacteria, fungi and mammals than other enzymes in arginine metabolism, and N-acetylglutamate itself can have different roles in different taxonomic groups: in prokaryotes, lower eukaryotes and plants it is the first intermediate in arginine biosynthesis, while in vertebrates it is an allosteric cofactor for the first enzyme in the urea cycle . In bacteria, arginine can regulate ornithine biosynthesis via a feedback inhibition mechanism . This enzyme may also act on aspartate.; GO: 0004042 amino-acid N-acetyltransferase activity, 0006526 arginine biosynthetic process, 0005737 cytoplasm.
Probab=100.00  E-value=7.4e-37  Score=257.33  Aligned_cols=139  Identities=27%  Similarity=0.413  Sum_probs=125.3

Q ss_pred             CCHHHHHHHHHHHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCC
Q ss_conf             62289999999987598899986756724746899999999998518971799845724578899970898115578346
Q 537021.9.peg.3   10 AEILEQVLPFVQFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRI   89 (150)
Q Consensus        10 a~~l~ea~pYi~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RV   89 (150)
                      ..|||+|+|||++||||||||.+||.++.+++ +.+|+.||+|||++|+|+|||||.+|||++.|++++..|+|+.|+||
T Consensus         2 V~~lR~aAPYi~a~RdKT~Vvgl~g~~v~~~~-l~~~v~Di~LLh~LGv~lVLvHGaRPq~~~~la~~~~~p~Y~~G~Rv   80 (439)
T TIGR01890         2 VAWLREAAPYINAHRDKTFVVGLGGELVEDDN-LGNIVADIALLHSLGVRLVLVHGARPQIEERLAARGRTPHYHRGLRV   80 (439)
T ss_pred             CCCHHHCCCHHHHHCCCEEEEECCCCCCCCCC-HHHHHHHHHHHHHCCCEEEEEECCCHHHHHHHHHCCCCCCCCCCEEE
T ss_conf             10033214035541586699842840014444-24599999999758918999708852689999962897870322571


Q ss_pred             CCHHHHHHHHHHHHHHHHHHHHHHHHHC---CCCEE-EEEEEECCEEEEEECCC----CCCCCCCCCCC
Q ss_conf             6124899999997416689999999828---99768-77865589899997475----21117995109
Q 537021.9.peg.3   90 TDQQTAEVVEMVLAGSINKKIVSLINQT---GTQAI-GICGKDGNMVFAEKARH----SLRLSPNTKKT  150 (150)
Q Consensus        90 Td~~tl~iv~~vL~G~vn~~lv~~L~~~---g~~av-glsg~dg~li~a~~~~~----~lg~vg~~~k~  150 (150)
                      ||+.+|+.++++ +|+++..|.+.|+..   -..|- -++.+|||||+|+|.++    |+-+.|.++|+
T Consensus        81 TD~asLe~~~~a-aG~lr~~ieA~Ls~~~~~~pmaGsr~~vvSGNfvtArPiGv~~GvDy~h~G~vRk~  148 (439)
T TIGR01890        81 TDEASLELVKQA-AGELRLAIEARLSMSLSNTPMAGSRLSVVSGNFVTARPIGVIEGVDYEHTGVVRKI  148 (439)
T ss_pred             CCHHHHHHHHHH-HHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCEEEECCCEEECCCCCEECCCEEHH
T ss_conf             356567789998-76688888888752124678888531356243332033101058662032745110


No 8  
>cd04249 AAK_NAGK-NC AAK_NAGK-NC: N-Acetyl-L-glutamate kinase - noncyclic (NAGK-NC) catalyzes the phosphorylation of the gamma-COOH group of N-acetyl-L-glutamate (NAG) by ATP in the second step of microbial arginine biosynthesis using the acetylated, noncyclic route of ornithine biosynthesis. There are two variants of this pathway. In one, typified by the pathway in Escherichia coli, glutamate is acetylated by acetyl-CoA and acetylornithine is deacylated hydrolytically. In this pathway, feedback inhibition by arginine occurs at the initial acetylation of glutamate and not at the phosphorylation of NAG by NAGK. Homodimeric NAGK-NC are members of the Amino Acid Kinase Superfamily (AAK).
Probab=100.00  E-value=2.4e-34  Score=241.21  Aligned_cols=122  Identities=25%  Similarity=0.442  Sum_probs=116.6

Q ss_pred             EEEEECHHHHCCHHHHHHHHHHHHHHH-HCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHHHHHHHHHHHHHH
Q ss_conf             999867567247468999999999985-1897179984572457889997089811557834661248999999974166
Q 537021.9.peg.3   28 IVVKYGGHVMNCTDLSKDFVNDIALLK-KSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQTAEVVEMVLAGSI  106 (150)
Q Consensus        28 iVIK~GG~~l~d~~l~~~~~~dI~ll~-~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~tl~iv~~vL~G~v  106 (150)
                      |||||||++|.|++++++|++|+++|+ ..|.++||||||||||+++|+++|++++|++|+||||+++|++++|+|+|++
T Consensus         1 iVIK~GG~~l~~~~~~~~~~~~i~~l~~~~~~~vVlVHGggpqI~~~l~~~gi~~~f~~G~RvTd~~tl~vv~~vl~g~v   80 (252)
T cd04249           1 LVIKLGGALLETEAALEQLFSALSEYQQQHNRQLVIVHGGGCVVDELLKKLNFPSEKKNGLRVTPKEQIPYITGALAGTA   80 (252)
T ss_pred             CEEEECCHHCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCCHHHHHHHHHCCCCCCCCCCEEECCHHHHHHHHHHHCCCH
T ss_conf             98997824617888999999999999981898769987996899999997699972379656589999999999984615


Q ss_pred             HHHHHHHHHHCCCCEEEEEEEECCEEEEEECCCCCCCCCCCCC
Q ss_conf             8999999982899768778655898999974752111799510
Q 537021.9.peg.3  107 NKKIVSLINQTGTQAIGICGKDGNMVFAEKARHSLRLSPNTKK  149 (150)
Q Consensus       107 n~~lv~~L~~~g~~avglsg~dg~li~a~~~~~~lg~vg~~~k  149 (150)
                      |+.|++.++++|.+|+|+|+.|++++.|++.++++|+||++++
T Consensus        81 n~~lv~~l~~~g~~a~gl~~~dg~~~~~~~~~~dlG~vG~v~~  123 (252)
T cd04249          81 NKQLMAQAIKAGLKPVGLSLADGGMTAVTQLDPELGAVGKATA  123 (252)
T ss_pred             HHHHHHHHHHCCCCCCCCCCCCCCEEEECCCCCCCCCCCCEEE
T ss_conf             9999999973589862044567746873023666786555323


No 9  
>PRK00942 acetylglutamate kinase; Provisional
Probab=100.00  E-value=2.3e-33  Score=234.91  Aligned_cols=123  Identities=54%  Similarity=0.887  Sum_probs=118.9

Q ss_pred             EEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHHHHHHHHHHHHHH
Q ss_conf             89998675672474689999999999851897179984572457889997089811557834661248999999974166
Q 537021.9.peg.3   27 TIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQTAEVVEMVLAGSI  106 (150)
Q Consensus        27 tiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~tl~iv~~vL~G~v  106 (150)
                      |||||+||+++.|++++++|++||++|+.+|+++||||||||||+++|+++|++|+|++|+||||+++|++++|+|+|++
T Consensus         1 TiVIkiGG~~l~~~~~~~~~~~dIa~L~~~G~~vVlVHGgg~qi~~~l~~~gi~~~~~~g~RvT~~~~l~~~~~~l~g~i   80 (262)
T PRK00942          1 TIVIKYGGNAMTDEELKEAFARDIVLLKQVGINPVVVHGGGPQIDELLKKLGIESEFVNGLRVTDKETMEVVEMVLAGKV   80 (262)
T ss_pred             CEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHCHH
T ss_conf             99999885884498799999999999998899199993882999999998699723047754661899999999981719


Q ss_pred             HHHHHHHHHHCCCCEEEEEEEECCEEEEEECC-------CCCCCCCCCCC
Q ss_conf             89999999828997687786558989999747-------52111799510
Q 537021.9.peg.3  107 NKKIVSLINQTGTQAIGICGKDGNMVFAEKAR-------HSLRLSPNTKK  149 (150)
Q Consensus       107 n~~lv~~L~~~g~~avglsg~dg~li~a~~~~-------~~lg~vg~~~k  149 (150)
                      |+.|++.|+++|++++|+||.|+|+++|+|+.       .++|++|++++
T Consensus        81 ~~~l~~~l~~~g~~avglsg~d~~~v~a~~~~~~~~~~~~d~g~vG~v~~  130 (262)
T PRK00942         81 NKELVSLINKHGGKAVGLSGKDGGLVTARKLDENGKEKIIDLGYVGEVEE  130 (262)
T ss_pred             HHHHHHHHHHCCCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCEEE
T ss_conf             99999999855874231456678758999864456654566762046166


No 10 
>PRK04531 acetylglutamate kinase; Provisional
Probab=100.00  E-value=6.1e-33  Score=232.24  Aligned_cols=131  Identities=29%  Similarity=0.501  Sum_probs=121.6

Q ss_pred             HHHHHHHHHHHCC----CEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCC
Q ss_conf             8999999998759----889998675672474689999999999851897179984572457889997089811557834
Q 537021.9.peg.3   13 LEQVLPFVQFYEN----ETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLR   88 (150)
Q Consensus        13 l~ea~pYi~~~r~----ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~R   88 (150)
                      =||+..|+++|++    ++.|||+||++|.+.  +++|++|+++|+.+|++|||||||||||+++|+++|++++|++|+|
T Consensus         4 ~kev~~yL~~Fss~~~~~fAVIKvGGavi~e~--l~~la~dLafL~~vGl~PVVVHGgGpqI~~~L~~~gIe~~fi~G~R   81 (421)
T PRK04531          4 AKEISQYLKRFSSLDPERFAVIKVGGAVLRED--LEALTSSLKFLQEVGLTPVVVHGAGPQLDAELSAAGIEKDTVNGLR   81 (421)
T ss_pred             HHHHHHHHHHHCCCCCCEEEEEEECHHHHHHH--HHHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHCCCCCCCCCCCC
T ss_conf             79999999973588876189999887999979--9999999999998799799993898778799997599976779630


Q ss_pred             CCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEEECCEEEEEECC-CCCCCCCCCCC
Q ss_conf             66124899999997416689999999828997687786558989999747-52111799510
Q 537021.9.peg.3   89 ITDQQTAEVVEMVLAGSINKKIVSLINQTGTQAIGICGKDGNMVFAEKAR-HSLRLSPNTKK  149 (150)
Q Consensus        89 VTd~~tl~iv~~vL~G~vn~~lv~~L~~~g~~avglsg~dg~li~a~~~~-~~lg~vg~~~k  149 (150)
                      |||+++|++++|+|. ++|.+|++.|+++|.+|++++|   +++.|+..+ +++|+||++++
T Consensus        82 VTd~etl~vv~~vl~-~~n~~lV~~L~~~G~~A~~i~g---~~~~a~~~d~~~~G~VG~V~~  139 (421)
T PRK04531         82 VTSPEALAIVRKVFQ-ASNLKLVEALQQNGARATSITG---GVFEAEYLDRDTYGLVGEVKA  139 (421)
T ss_pred             CCCHHHHHHHHHHHH-HHHHHHHHHHHHCCCCEEECCC---CCEECCCCCCCCCCCCCCEEE
T ss_conf             588889999999997-6529999999967997573466---543503366666764333477


No 11 
>cd04238 AAK_NAGK-like AAK_NAGK-like: N-Acetyl-L-glutamate kinase (NAGK)-like . Included in this CD are the Escherichia coli and Pseudomonas aeruginosa type NAGKs which catalyze the phosphorylation of N-acetyl-L-glutamate (NAG) by ATP in the second step of arginine biosynthesis found in bacteria and photosynthetic organisms using either the acetylated, noncyclic (NC), or non-acetylated, cyclic (C) route of ornithine biosynthesis. Also included in this CD is a distinct group of uncharacterized (UC) bacterial and archeal NAGKs. Members of this CD belong to the Amino Acid Kinase Superfamily (AAK).
Probab=99.98  E-value=7.9e-32  Score=225.12  Aligned_cols=122  Identities=50%  Similarity=0.894  Sum_probs=117.8

Q ss_pred             EEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHHHHHHHHHHHHHHH
Q ss_conf             99986756724746899999999998518971799845724578899970898115578346612489999999741668
Q 537021.9.peg.3   28 IVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQTAEVVEMVLAGSIN  107 (150)
Q Consensus        28 iVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~tl~iv~~vL~G~vn  107 (150)
                      +|||+||+++.|++++++|++||++|+++|+++||||||||||++.|+++|++++|++|+||||+++|++++|+++|++|
T Consensus         1 IVIKlGG~~l~d~~~~~~~~~dIa~L~~~G~~vVlVHGgg~qi~~~l~~~gi~~~~~~g~RvTd~~~l~~~~~~l~g~in   80 (256)
T cd04238           1 VVIKYGGSAMKDEELKEAFADDIVLLKQVGINPVIVHGGGPEINELLKRLGIESEFVNGLRVTDKETMEIVEMVLAGKVN   80 (256)
T ss_pred             CEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEECCCHHHHHHHHHHCCCCCEECCCCCCCCHHHHHHHHHHHHHHHH
T ss_conf             98997868945987999999999999988992999918829999999986998350267314618789999999853389


Q ss_pred             HHHHHHHHHCCCCEEEEEEEECCEEEEEEC---CCCCCCCCCCCC
Q ss_conf             999999982899768778655898999974---752111799510
Q 537021.9.peg.3  108 KKIVSLINQTGTQAIGICGKDGNMVFAEKA---RHSLRLSPNTKK  149 (150)
Q Consensus       108 ~~lv~~L~~~g~~avglsg~dg~li~a~~~---~~~lg~vg~~~k  149 (150)
                      ++|++.|+++|++|+|+|+.|+|+++|+|.   ..++|++|++++
T Consensus        81 ~~l~~~l~~~g~~aiglsg~d~~~i~a~~~~~~~~d~g~vG~v~~  125 (256)
T cd04238          81 KELVSLLNRAGGKAVGLSGKDGGLIKAEKKEEKDIDLGFVGEVTE  125 (256)
T ss_pred             HHHHHHHHHCCCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCC
T ss_conf             999999997799721214655877999975564434342455443


No 12 
>cd04252 AAK_NAGK-fArgBP AAK_NAGK-fArgBP: N-Acetyl-L-glutamate kinase (NAGK) of the fungal arginine-biosynthetic pathway (fArgBP). The nuclear-encoded, mitochondrial polyprotein precursor with an N-terminal NAGK (ArgB) domain (this CD), a central DUF619 domain, and a C-terminal reductase domain (ArgC, N-Acetylglutamate Phosphate Reductase, NAGPR). The precursor is cleaved in the mitochondria into two distinct enzymes (NAGK-DUF619 and NAGPR). Native molecular weights of these proteins indicate that the kinase is an octamer whereas the reductase is a dimer. This CD also includes some gamma-proteobacteria (Xanthomonas and Xylella) NAG kinases with an N-terminal NAGK (ArgB) domain (this CD) and a C-terminal DUF619 domain. The DUF619 domain is described as a putative distant homolog of the acetyltransferase, ArgA, predicted to function in NAG synthase association in fungi. Eukaryotic sequences have an N-terminal mitochondrial transit peptide. Members of this NAG kinase domain CD belong to th
Probab=99.97  E-value=4.9e-30  Score=213.63  Aligned_cols=115  Identities=28%  Similarity=0.503  Sum_probs=107.7

Q ss_pred             EEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHHHHHHHHHHHHHHHH
Q ss_conf             99867567247468999999999985189717998457245788999708981155783466124899999997416689
Q 537021.9.peg.3   29 VVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQTAEVVEMVLAGSINK  108 (150)
Q Consensus        29 VIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~tl~iv~~vL~G~vn~  108 (150)
                      ||||||+++.|+  +++|++||++|+++|+++||||||||||+++|+++|++|+|++|+||||+++|++++|+++ ++|+
T Consensus         2 VIk~GG~~~~d~--~~~l~~di~lL~~lGi~~VlVHGgg~qI~~~l~~~gi~~~~~~G~RvTd~~~l~~v~~vl~-~vn~   78 (248)
T cd04252           2 VIKVGGAIIEDD--LDELAASLSFLQHVGLYPIVVHGAGPQLNEELEAAGVEPEYVDGLRVTDPETLAVARKVFL-EENL   78 (248)
T ss_pred             EEEECCHHHHHH--HHHHHHHHHHHHHCCCEEEEEECCCHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHH-HHHH
T ss_conf             799991776521--9999999999998899599991898788899998699853138844689899999999999-9999


Q ss_pred             HHHHHHHHCCCCEEEEEEEECCEEEEEECCC-CCCCCCCCCC
Q ss_conf             9999998289976877865589899997475-2111799510
Q 537021.9.peg.3  109 KIVSLINQTGTQAIGICGKDGNMVFAEKARH-SLRLSPNTKK  149 (150)
Q Consensus       109 ~lv~~L~~~g~~avglsg~dg~li~a~~~~~-~lg~vg~~~k  149 (150)
                      +|++.|+++|.++.+++|   +++.|++.+. ++|++|++++
T Consensus        79 ~lv~~l~~~g~~a~~i~g---~~~~a~~~~~~~~g~vG~v~~  117 (248)
T cd04252          79 KLVEALERNGARARPITS---GVFEAEYLDKDKYGLVGKITG  117 (248)
T ss_pred             HHHHHHHHCCCCEEEECC---CCEEEEECCCCCCCCCCCEEE
T ss_conf             999999965997386037---636888667656784322589


No 13 
>cd04251 AAK_NAGK-UC AAK_NAGK-UC: N-Acetyl-L-glutamate kinase - uncharacterized (NAGK-UC). This domain is similar to Escherichia coli and Pseudomonas aeruginosa NAGKs which catalyze the phosphorylation of the gamma-COOH group of N-acetyl-L-glutamate (NAG) by ATP in the second step of microbial arginine biosynthesis. These uncharacterized domain sequences are found in some bacteria (Deinococci and Chloroflexi) and archea and belong to the Amino Acid Kinase Superfamily (AAK).
Probab=99.96  E-value=2.6e-28  Score=202.50  Aligned_cols=115  Identities=37%  Similarity=0.632  Sum_probs=103.3

Q ss_pred             EEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEE---CCC--CCCCHHHHHHHHHHH
Q ss_conf             999867567247468999999999985189717998457245788999708981155---783--466124899999997
Q 537021.9.peg.3   28 IVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFE---NGL--RITDQQTAEVVEMVL  102 (150)
Q Consensus        28 iVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v---~G~--RVTd~~tl~iv~~vL  102 (150)
                      +||||||+++.|++   +++.|+   .++|.++||||||||||+++|+++|++|+|+   +|+  ||||+++|++++|++
T Consensus         1 iVIK~GG~~l~d~~---~~~~~l---~~lG~~~VlVHGgg~qI~~~l~~~gi~~~fv~~~~Gl~~RvTd~~~l~~v~~~~   74 (257)
T cd04251           1 IVVKIGGSVVSDLD---KVIDDI---ANFGERLIVVHGGGNYVNEYLKRLGVEPKFVTSPSGIRSRYTDKETLEVFVMVM   74 (257)
T ss_pred             CEEEECHHHHHCHH---HHHHHH---HHCCCCEEEECCCCHHHHHHHHHCCCCCEEECCCCCCCEEECCHHHHHHHHHHH
T ss_conf             98998958876989---999999---975999899979977999999985999677417898620218989999999999


Q ss_pred             HHHHHHHHHHHHHHCCCCEEEEEEEECCEEEEEECCC------------CCCCCCCCCC
Q ss_conf             4166899999998289976877865589899997475------------2111799510
Q 537021.9.peg.3  103 AGSINKKIVSLINQTGTQAIGICGKDGNMVFAEKARH------------SLRLSPNTKK  149 (150)
Q Consensus       103 ~G~vn~~lv~~L~~~g~~avglsg~dg~li~a~~~~~------------~lg~vg~~~k  149 (150)
                       |++|++|++.|+++|++|+|+||.|++++.|++++.            ++|++|++++
T Consensus        75 -g~vn~~lv~~l~~~g~~a~glsg~d~~~~~~~~~~~~~~~~~g~~~~id~g~vG~v~~  132 (257)
T cd04251          75 -GLINKKIVARLHSLGVKAVGLTGLDGRLLEAKRKEIVRVNERGRKMIIRGGYTGKVEK  132 (257)
T ss_pred             -HHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEEECCCEEECCCCCCCCCCCCCEEEEEE
T ss_conf             -9999999999984899865222666736898512524424678601045797677989


No 14 
>KOG2436 consensus
Probab=99.91  E-value=2.6e-24  Score=176.91  Aligned_cols=141  Identities=27%  Similarity=0.455  Sum_probs=129.9

Q ss_pred             HCCCHHHHHHHHHHHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCC
Q ss_conf             20622899999999875988999867567247468999999999985189717998457245788999708981155783
Q 537021.9.peg.3    8 FQAEILEQVLPFVQFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGL   87 (150)
Q Consensus         8 ~~a~~l~ea~pYi~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~   87 (150)
                      ..-.+++|++|||..|++++||+..+|.++ ...++..+++|+++|+.+|++||+|||++||||+.|...|++++|.+|+
T Consensus        77 ~~v~~~~e~l~yi~~~~~q~fvV~~~g~~~-~t~~~~sl~s~lafl~h~gl~pIvv~g~~~qin~~l~~~~ie~~y~~~~  155 (520)
T KOG2436          77 TTVRILRESLPYITSFRDQKFVVIKSGEAI-STSLLHSLASDLAFLHHVGLRPIVVPGTQPQINRLLAERGIEPEYVDGY  155 (520)
T ss_pred             HHHHHHHHHHHHHHHHCCCEEEEEECCCCC-CCCHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHCCCCCCCCCCE
T ss_conf             089999988789987337507999435435-5635889987778875077368884386277778999728883003631


Q ss_pred             CCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEE--ECCEEEEEEC----CCCCCCCCCCCC
Q ss_conf             4661248999999974166899999998289976877865--5898999974----752111799510
Q 537021.9.peg.3   88 RITDQQTAEVVEMVLAGSINKKIVSLINQTGTQAIGICGK--DGNMVFAEKA----RHSLRLSPNTKK  149 (150)
Q Consensus        88 RVTd~~tl~iv~~vL~G~vn~~lv~~L~~~g~~avglsg~--dg~li~a~~~----~~~lg~vg~~~k  149 (150)
                      |+||+.+|+++.+...++.|..++..|+++|..+.+.++.  .+|++.|++.    +..+|++|+..+
T Consensus       156 RvTda~t~q~~~~~~~~E~n~~lv~nL~~~g~~ar~~s~g~~v~~~f~a~~~~v~d~~~y~~~gei~~  223 (520)
T KOG2436         156 RVTDAHTLQAAKESVSLEANLNLVINLSQLGTRARPSSSGVRVGNFFPADRNGVLDGEDYGLVGEIKK  223 (520)
T ss_pred             ECCCHHHHHHHHHCCHHHHHHHHHHHHHHHHCEECCCCCCCCCCCEEECCCCCCCCCCEEEEECCCCE
T ss_conf             02618888776531300215679999997540103312566443044235234554000220033003


No 15 
>cd04236 AAK_NAGS-Urea AAK_NAGS-Urea: N-acetylglutamate (NAG) kinase-like domain of the NAG Synthase (NAGS) of the urea cycle found in animals. Ureogenic NAGS is a mitochondrial enzyme catalyzing the formation of NAG from acetylcoenzyme A and L-glutamate; NAG is an essential allosteric activator of carbamylphosphate synthase I, the first and rate limiting enzyme of the urea cycle. Ureogenic NAGS activity is dependent on the concentration of glutamate (substrate) and arginine (activator). Domain architecture of ureogenic NAGS consists of an N-terminal NAG kinase-like (ArgB) domain (this CD) and a C-terminal DUF619 domain. Members of this CD belong to the protein superfamily, the Amino Acid Kinase Family (AAKF).
Probab=99.86  E-value=3e-21  Score=157.28  Aligned_cols=128  Identities=10%  Similarity=0.131  Sum_probs=110.0

Q ss_pred             CHHHHHCCCHHHHHHHHHHHHC-------CCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHH
Q ss_conf             0023220622899999999875-------988999867567247468999999999985189717998457245788999
Q 537021.9.peg.3    3 EKIYQFQAEILEQVLPFVQFYE-------NETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLE   75 (150)
Q Consensus         3 ~~~~~~~a~~l~ea~pYi~~~r-------~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~   75 (150)
                      ++++|++.+..+|+..|+++|+       .+.+|||+||+++.|.+.+++|++|+++|+.+|+.||+||||||++...++
T Consensus         6 k~~l~~~G~s~kEvr~YL~~F~~~~~~~~~~faVIKvGGaVl~d~~~l~~l~~~lsfL~~vGl~pIVVHG~g~~i~~~~~   85 (271)
T cd04236           6 KAFLHQKGGDPREARYWLTQFQIAMPNDWPAFAVLEVDHSVFRSLEMVQSLSFGLAFLQRMDMKLLVVMGLSAPDGTNMS   85 (271)
T ss_pred             HHHHHHCCCCHHHHHHHHHHHCCCCCCCCCCEEEEEECHHHEECHHHHHHHHHHHHHHHHCCCEEEEEECCCHHCCCCCC
T ss_conf             99866509998999999997335677777876999988131505889999999999999889948999389701155746


Q ss_pred             HCCCCCEEECCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEEECCEEEEEECCCCCCCCCCCCC
Q ss_conf             70898115578346612489999999741668999999982899768778655898999974752111799510
Q 537021.9.peg.3   76 KMGIKSKFENGLRITDQQTAEVVEMVLAGSINKKIVSLINQTGTQAIGICGKDGNMVFAEKARHSLRLSPNTKK  149 (150)
Q Consensus        76 ~~gi~~~~v~G~RVTd~~tl~iv~~vL~G~vn~~lv~~L~~~g~~avglsg~dg~li~a~~~~~~lg~vg~~~k  149 (150)
                                      +.+++.+++++. +-|+.|+++|++.|++|.++++.+. .+.+. ...++++||++.+
T Consensus        86 ----------------~~~~~~~~~v~~-~~n~~lV~aL~~~G~~A~~i~s~~~-~~~~~-~~e~~~~vG~~~~  140 (271)
T cd04236          86 ----------------DLELQAARSRLV-KDCKTLVEALQANSAAAHPLFSGES-VLQAE-EPEPGASKGPSVS  140 (271)
T ss_pred             ----------------CHHHHHHHHHHH-HHHHHHHHHHHHCCCCEEEEECCCC-EEEEC-CCCCCCCCCCCEE
T ss_conf             ----------------012256766637-9999999999976996245416873-37722-5301376687138


No 16 
>cd02115 AAK Amino Acid Kinases (AAK) superfamily, catalytic domain; present in such enzymes like N-acetylglutamate kinase (NAGK), carbamate kinase (CK), aspartokinase (AK), glutamate-5-kinase (G5K) and UMP kinase (UMPK). The AAK superfamily includes kinases that phosphorylate a variety of amino acid substrates. These kinases catalyze the formation of phosphoric anhydrides, generally with a carboxylate, and use ATP as the source of the phosphoryl group; are involved in amino acid biosynthesis. Some of these kinases control the process via allosteric feed-back inhibition.
Probab=99.77  E-value=4.5e-18  Score=136.90  Aligned_cols=114  Identities=25%  Similarity=0.355  Sum_probs=104.3

Q ss_pred             EEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHHHHHHHHHHHHHHHH
Q ss_conf             99867567247468999999999985189717998457245788999708981155783466124899999997416689
Q 537021.9.peg.3   29 VVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQTAEVVEMVLAGSINK  108 (150)
Q Consensus        29 VIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~tl~iv~~vL~G~vn~  108 (150)
                      ||||||+.+.|++.+.+++++|+.++..|.++|+||||+++++..|.+++++++|.+|+|+|++++++++.+.. +..+.
T Consensus         1 ViK~GGssl~~~~~~~~~~~~i~~l~~~g~~~vvV~sa~~~it~~l~~~~~~~~~~~~~~~t~~~~~~i~~~g~-~~~~~   79 (248)
T cd02115           1 VIKFGGSSVSSEERLRNLARILVKLASEGGRVVVVHGAGPQITDELLAHGELLGYARGLRITDRETDALAAMGE-GMSNL   79 (248)
T ss_pred             CEEECHHHCCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHH-HHHHH
T ss_conf             99889136069999999999999999789998999698445329999865354331367899899999999999-99999


Q ss_pred             HHHHHHHHCCCCEEEEEEEECCEEEEEECCCCCCCCCCCC
Q ss_conf             9999998289976877865589899997475211179951
Q 537021.9.peg.3  109 KIVSLINQTGTQAIGICGKDGNMVFAEKARHSLRLSPNTK  148 (150)
Q Consensus       109 ~lv~~L~~~g~~avglsg~dg~li~a~~~~~~lg~vg~~~  148 (150)
                      .+++.|++.|+++.++++.+..++..     +++++|+..
T Consensus        80 ~l~~~L~~~g~~a~~l~~~~~~~~~~-----~~~~~~~v~  114 (248)
T cd02115          80 LIAAALEQHGIKAVPLDLTQAGFASP-----NQGHVGKIT  114 (248)
T ss_pred             HHHHHHHHCCCCEEEEEEEECCEECC-----CCCCCCCCC
T ss_conf             99999985699816986553233136-----644432102


No 17 
>pfam00696 AA_kinase Amino acid kinase family. This family includes kinases that phosphorylate a variety of amino acid substrates, as well as uridylate kinase and carbamate kinase. This family includes: Aspartokinase EC:2.7.2.4. Acetylglutamate kinase EC:2.7.2.8. Glutamate 5-kinase EC:2.7.2.11. Uridylate kinase EC:2.7.4.-. Carbamate kinase EC:2.7.2.2.
Probab=99.60  E-value=1.6e-14  Score=114.15  Aligned_cols=109  Identities=32%  Similarity=0.457  Sum_probs=101.6

Q ss_pred             CEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHHHHHHHHHHHHH
Q ss_conf             88999867567247468999999999985189717998457245788999708981155783466124899999997416
Q 537021.9.peg.3   26 ETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQTAEVVEMVLAGS  105 (150)
Q Consensus        26 ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~tl~iv~~vL~G~  105 (150)
                      |++|||+||+++.|++.+..++++|+.++..|.++|||||||++.+.++++.+.+++  .+.|+|+.+++.+.++++. .
T Consensus         1 k~iViK~GGs~l~~~~~i~~~~~~I~~l~~~g~~vvvV~ggg~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~-~   77 (230)
T pfam00696         1 KRIVIKLGGSSLTDEDAIKRIAEEIALLSELGIKVVVVSGGGGFTDKLLAAYGIAEK--IGLRVTAGATGLIIEAALA-G   77 (230)
T ss_pred             CEEEEEECHHHCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCHHHHHHHHHHCCCCC--CCHHHHHHHHHHHHHHHHH-H
T ss_conf             909999585677898999999999999996899499997985888799998177765--7515669999999999999-9


Q ss_pred             HHHHHHHHHHHCCCCEEEEEEEECCEEEEEEC
Q ss_conf             68999999982899768778655898999974
Q 537021.9.peg.3  106 INKKIVSLINQTGTQAIGICGKDGNMVFAEKA  137 (150)
Q Consensus       106 vn~~lv~~L~~~g~~avglsg~dg~li~a~~~  137 (150)
                      ++..+++.+...+.+++++...+.++..++..
T Consensus        78 l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~  109 (230)
T pfam00696        78 LLDIVVSAGERLGARAVALLLSDGGIGAVRLD  109 (230)
T ss_pred             HHHHHHHHHHHCCCHHHHCCCCCCCCCEEEEE
T ss_conf             99999999998297301325651345516884


No 18 
>PRK12353 carbamate kinase; Reviewed
Probab=99.34  E-value=2.2e-11  Score=94.05  Aligned_cols=94  Identities=19%  Similarity=0.300  Sum_probs=65.4

Q ss_pred             CCEEEEEECHHHHCCH--------HHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHH-HHHCCCCCEEECCCCCCCHHHH
Q ss_conf             9889998675672474--------689999999999851897179984572457889-9970898115578346612489
Q 537021.9.peg.3   25 NETIVVKYGGHVMNCT--------DLSKDFVNDIALLKKSNITPVIVHGGGPQIGAV-LEKMGIKSKFENGLRITDQQTA   95 (150)
Q Consensus        25 ~ktiVIK~GG~~l~d~--------~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~-l~~~gi~~~~v~G~RVTd~~tl   95 (150)
                      .|++||+|||++|.++        +....++++|+.|...|.++|||||+||||... |++....+.+..+++.+...++
T Consensus         2 ~k~IVIalGGNAl~~~~~t~~~Q~~~~~~~a~~ia~l~~~G~~~VitHGnGPQVG~lll~~~~~~~~~~~~~pld~~~a~   81 (312)
T PRK12353          2 MKKIVVALGGNALGSSDGSAEEQLEAVKKTAKSLVDLIEEGHEVVITHGNGPQVGNLLLQQAAADSEKNPAMPLDVCVAM   81 (312)
T ss_pred             CCEEEEEECHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHHHHCCCCCCCCEECCHHHH
T ss_conf             84499983725318999998999999999999999999889969998388448899999866542688898211014343


Q ss_pred             HHHHHHHHHHHHHHHHHHHHHCCCC
Q ss_conf             9999997416689999999828997
Q 537021.9.peg.3   96 EVVEMVLAGSINKKIVSLINQTGTQ  120 (150)
Q Consensus        96 ~iv~~vL~G~vn~~lv~~L~~~g~~  120 (150)
                      .-  .-+.-.+...|...|...|.+
T Consensus        82 tq--G~IGy~l~q~l~~~l~~~g~~  104 (312)
T PRK12353         82 SQ--GYIGYWLQNALDNELLKRGIN  104 (312)
T ss_pred             CC--CHHHHHHHHHHHHHHHHCCCC
T ss_conf             03--069999999999999845899


No 19 
>cd04241 AAK_FomA-like AAK_FomA-like: This CD includes a fosfomycin biosynthetic gene product, FomA, and similar proteins found in a wide range of organisms. Together, the fomA and fomB genes in the fosfomycin biosynthetic gene cluster of Streptomyces wedmorensis confer high-level fosfomycin resistance. FomA and FomB proteins converted fosfomycin to fosfomycin monophosphate and fosfomycin diphosphate in the presence of ATP and a magnesium ion, indicating that FomA and FomB catalyzed phosphorylations of fosfomycin and fosfomycin monophosphate, respectively. FomA and related  sequences in this CD are members of the Amino Acid Kinase Superfamily (AAK).
Probab=99.01  E-value=5.7e-09  Score=78.53  Aligned_cols=99  Identities=19%  Similarity=0.277  Sum_probs=74.3

Q ss_pred             EEEEEECHHHHCCHH--------HHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHHHHHH
Q ss_conf             899986756724746--------899999999998518971799845724578899970898115578346612489999
Q 537021.9.peg.3   27 TIVVKYGGHVMNCTD--------LSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQTAEVV   98 (150)
Q Consensus        27 tiVIK~GG~~l~d~~--------l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~tl~iv   98 (150)
                      .||||+||+++.+++        ...+++++|+.  ..|.++|||||||.....++++.+..    .+.+.++++.+...
T Consensus         1 mIVIKiGgs~lt~~~~~~~~~~~~i~~~~~~i~~--~~~~~iiiV~GgG~~g~~~~~~~~~~----~~~~~~~~~~~~~~   74 (252)
T cd04241           1 MIILKLGGSVITDKDRPETIREENLERIARELAE--AIDEKLVLVHGGGSFGHPKAKEYGLP----DGDGSFSAEGVAET   74 (252)
T ss_pred             CEEEEECHHHEECCCCCCCCCHHHHHHHHHHHHH--HCCCCEEEEECCHHHHHHHHHHHCCC----CCCCCCCHHHHHHH
T ss_conf             9999965787048976565119999999999998--67998899989548769999871344----68741228889999


Q ss_pred             HHHHHHHHHHHHHHHHHHCCCCEEEEEEEECCEE
Q ss_conf             9997416689999999828997687786558989
Q 537021.9.peg.3   99 EMVLAGSINKKIVSLINQTGTQAIGICGKDGNMV  132 (150)
Q Consensus        99 ~~vL~G~vn~~lv~~L~~~g~~avglsg~dg~li  132 (150)
                      .+.+. ++|..+++.|.++|.+++++++.+..+.
T Consensus        75 ~~~~~-~l~~~~~~~l~~~g~~a~~l~~~~~~~~  107 (252)
T cd04241          75 HEAML-ELNSIVVDALLEAGVPAVSVPPSSFFVT  107 (252)
T ss_pred             HHHHH-HHHHHHHHHHHHCCCCEEEECHHHHHHH
T ss_conf             99999-9999999999976996352040444443


No 20 
>PRK12686 carbamate kinase; Reviewed
Probab=98.73  E-value=3.1e-07  Score=67.43  Aligned_cols=96  Identities=18%  Similarity=0.288  Sum_probs=65.0

Q ss_pred             CCCEEEEEECHHHHCCH-----HH---HHHHHHHHHHHHHCCCCEEEEECCCHHHHH-HHHHCCCCCEEECCCCCCCHHH
Q ss_conf             59889998675672474-----68---999999999985189717998457245788-9997089811557834661248
Q 537021.9.peg.3   24 ENETIVVKYGGHVMNCT-----DL---SKDFVNDIALLKKSNITPVIVHGGGPQIGA-VLEKMGIKSKFENGLRITDQQT   94 (150)
Q Consensus        24 r~ktiVIK~GG~~l~d~-----~l---~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~-~l~~~gi~~~~v~G~RVTd~~t   94 (150)
                      -.|++||.+||++|..+     +.   ....+++|+.|...|.++||+||.|||+.. .|++....+..+..++...-.+
T Consensus         2 m~k~iVIAlGGNAl~~~~~t~~~Q~~~~~~~a~~ia~l~~~g~~vVitHGNGPQVG~l~l~~~~~~~~~~p~~Pld~~~A   81 (313)
T PRK12686          2 MKEKIVIALGGNAILQTEASAEAQQTAVREAMQHLVDLIEAGADIVITHGNGPQVGSLLLQQAESNSNKVPAMPLDTCVA   81 (313)
T ss_pred             CCCEEEEEECCHHCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEEEECCCHHHHHHHHHHHHHCCCCCCCCCCCHHHH
T ss_conf             97779998280312899999999999999999999999986997999818807989999998764368889876404543


Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHCCCCE
Q ss_conf             999999974166899999998289976
Q 537021.9.peg.3   95 AEVVEMVLAGSINKKIVSLINQTGTQA  121 (150)
Q Consensus        95 l~iv~~vL~G~vn~~lv~~L~~~g~~a  121 (150)
                      +.--  -+.-.+...|-+.|.+++.+.
T Consensus        82 ~sQG--~IGy~lqqal~n~l~~~~~~~  106 (313)
T PRK12686         82 MSQG--MIGYWLQTALNNELTERNIDK  106 (313)
T ss_pred             CCCH--HHHHHHHHHHHHHHHHCCCCC
T ss_conf             0233--799999999999998638998


No 21 
>cd04235 AAK_CK AAK_CK: Carbamate kinase (CK) catalyzes both the ATP-phosphorylation of carbamate and carbamoyl phosphate (CP) utilization with the production of ATP from ADP and CP. Both CK (this CD) and nonhomologous CP synthetase synthesize carbamoyl phosphate, an essential precursor of arginine and pyrimidine bases, in the presence of ATP, bicarbonate, and ammonia. CK is a homodimer of 33 kDa subunits and is a member of the Amino Acid Kinase Superfamily (AAK).
Probab=98.68  E-value=4.9e-07  Score=66.16  Aligned_cols=93  Identities=23%  Similarity=0.280  Sum_probs=64.3

Q ss_pred             EEEEEECHHHHCCH-------H---HHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHHHH
Q ss_conf             89998675672474-------6---8999999999985189717998457245788999708981155783466124899
Q 537021.9.peg.3   27 TIVVKYGGHVMNCT-------D---LSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQTAE   96 (150)
Q Consensus        27 tiVIK~GG~~l~d~-------~---l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~tl~   96 (150)
                      .+||.+||+++..+       +   .....+++|+.|...|.++||+||.|||+...+.++...+++...+      .|+
T Consensus         1 riViALGGNAl~~~g~~~t~~~Q~~~v~~~a~~ia~l~~~G~~vVitHGNGPQVG~ll~~~~~~~~~~p~~------Pld   74 (308)
T cd04235           1 RIVVALGGNALLRRGEPGTAEEQRENVKIAAKALADLIKNGHEVVITHGNGPQVGNLLLQNEAAAEKVPAY------PLD   74 (308)
T ss_pred             CEEEEECHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHHHHCCCCCCC------CCH
T ss_conf             98999573650899999999999999999999999999869979998699518999999888624678998------620


Q ss_pred             H-HHHH---HHHHHHHHHHHHHHHCCCCEEEEE
Q ss_conf             9-9999---741668999999982899768778
Q 537021.9.peg.3   97 V-VEMV---LAGSINKKIVSLINQTGTQAIGIC  125 (150)
Q Consensus        97 i-v~~v---L~G~vn~~lv~~L~~~g~~avgls  125 (150)
                      + +.|.   +.-.++..|...|.+++.+..-.+
T Consensus        75 ~~~A~sQG~IGy~lqqal~n~l~~~~~~~~v~t  107 (308)
T cd04235          75 VCGAMSQGMIGYMLQQALDNELPKRGIDKPVVT  107 (308)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEE
T ss_conf             464653059999999999999985489984599


No 22 
>PRK12454 carbamate kinase-like carbamoyl phosphate synthetase; Reviewed
Probab=98.27  E-value=4.6e-06  Score=59.92  Aligned_cols=92  Identities=24%  Similarity=0.367  Sum_probs=62.9

Q ss_pred             CCEEEEEECHHHHCCH----------HHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHH
Q ss_conf             9889998675672474----------689999999999851897179984572457889997089811557834661248
Q 537021.9.peg.3   25 NETIVVKYGGHVMNCT----------DLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQT   94 (150)
Q Consensus        25 ~ktiVIK~GG~~l~d~----------~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~t   94 (150)
                      .|++||.+||+++..+          +.....+++|+.|...|.++||+||.|||+...+.+...-...    . +++--
T Consensus         2 ~k~iVvALGGNAl~~~g~~~t~~~Q~~~v~~~a~~ia~li~~g~~vVitHGNGPQVG~i~l~~~~~~~~----~-~p~~P   76 (313)
T PRK12454          2 KKRIVIALGGNALQQRGEKGTAENMMKNVRRTAEQIARLIEEGYEVVITHGNGPQVGNLLLQMDAAKDV----G-IPPFP   76 (313)
T ss_pred             CCEEEEECCHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHHHHCCC----C-CCCCC
T ss_conf             955999808365089999999999999999999999999987997999818813999999999873246----8-99866


Q ss_pred             HHHHHHHHHHHHHHHHHHH----HHHCCCCE
Q ss_conf             9999999741668999999----98289976
Q 537021.9.peg.3   95 AEVVEMVLAGSINKKIVSL----INQTGTQA  121 (150)
Q Consensus        95 l~iv~~vL~G~vn~~lv~~----L~~~g~~a  121 (150)
                      |++.-..--|.+...|..+    |.++|...
T Consensus        77 Ld~~~A~sQG~IGy~lqqal~n~L~~~~~~~  107 (313)
T PRK12454         77 LDVAGAMTQGWIGYMIQQALRNVLPKRGIER  107 (313)
T ss_pred             CHHHHHCCCHHHHHHHHHHHHHHHHHCCCCC
T ss_conf             0355230134899999999999998539998


No 23 
>cd04254 AAK_UMPK-PyrH-Ec UMP kinase (UMPK)-Ec, the microbial/chloroplast uridine monophosphate kinase (uridylate kinase) enzyme that catalyzes UMP phosphorylation and plays a key role in pyrimidine nucleotide biosynthesis; regulation of this process is via feed-back control and via gene repression of carbamoyl phosphate synthetase (the first enzyme of the pyrimidine biosynthesis pathway). The UMP kinase of E. coli (Ec) is known to function as a homohexamer, with GTP and UTP being allosteric effectors. Like other related enzymes (carbamate kinase, aspartokinase, and N-acetylglutamate kinase) the E. coli and most bacterial and chloroplast UMPKs (this CD) have a conserved, N-terminal, lysine residue proposed to function in the catalysis of the phosphoryl group transfer, whereas most archaeal UMPKs appear to lack this residue and the Pyrococcus furiosus structure has an additional Mg ion bound to the ATP molecule which is proposed to function as the catalysis instead. Members of this CD be
Probab=98.12  E-value=4e-05  Score=53.89  Aligned_cols=92  Identities=18%  Similarity=0.284  Sum_probs=61.2

Q ss_pred             CEEEEEECHHHHC-------CHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHH-HHHHHCCCCCEEECCCCCCCHHHHHH
Q ss_conf             8899986756724-------746899999999998518971799845724578-89997089811557834661248999
Q 537021.9.peg.3   26 ETIVVKYGGHVMN-------CTDLSKDFVNDIALLKKSNITPVIVHGGGPQIG-AVLEKMGIKSKFENGLRITDQQTAEV   97 (150)
Q Consensus        26 ktiVIK~GG~~l~-------d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~-~~l~~~gi~~~~v~G~RVTd~~tl~i   97 (150)
                      |.+|||+||+++.       |.+.+.+++++|..++..|.+++||+|||...- ......++.       | ...+.+. 
T Consensus         1 kRIViKiGGs~L~~~~~~~ld~~~I~~la~~I~~l~~~G~eVviVvggG~i~rg~~~~~~~~~-------~-~~~d~~g-   71 (231)
T cd04254           1 KRVLLKLSGEALAGENGFGIDPEVLNRIAREIKEVVDLGVEVAIVVGGGNIFRGASAAEAGMD-------R-ATADYMG-   71 (231)
T ss_pred             CEEEEEECHHHCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCHHCCHHHHHCCCC-------C-CHHHHHH-
T ss_conf             959999516660789998859999999999999999779869999817730124224534666-------3-5188999-


Q ss_pred             HHHHHHHHHHHH-HHHHHHHCCCCEEEEEEEEC
Q ss_conf             999974166899-99999828997687786558
Q 537021.9.peg.3   98 VEMVLAGSINKK-IVSLINQTGTQAIGICGKDG  129 (150)
Q Consensus        98 v~~vL~G~vn~~-lv~~L~~~g~~avglsg~dg  129 (150)
                        | ++-.+|.. +..+|...|.++.-++..+-
T Consensus        72 --~-lat~~n~~ll~~al~~~g~~~~~~t~~~~  101 (231)
T cd04254          72 --M-LATVINALALQDALESLGVKTRVMSAIPM  101 (231)
T ss_pred             --H-HHHHHHHHHHHHHHHHCCCCEEEEECCCC
T ss_conf             --9-99999999999999867998799742234


No 24 
>PRK00358 pyrH uridylate kinase; Provisional
Probab=98.11  E-value=7.1e-05  Score=52.27  Aligned_cols=90  Identities=22%  Similarity=0.331  Sum_probs=56.5

Q ss_pred             CEEEEEECHHHHC-------CHHHHHHHHHHHHHHHHCCCCEEEEECCCHH-HHHHHHHCCCCCEEECCCCCCCHHHHHH
Q ss_conf             8899986756724-------7468999999999985189717998457245-7889997089811557834661248999
Q 537021.9.peg.3   26 ETIVVKYGGHVMN-------CTDLSKDFVNDIALLKKSNITPVIVHGGGPQ-IGAVLEKMGIKSKFENGLRITDQQTAEV   97 (150)
Q Consensus        26 ktiVIK~GG~~l~-------d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~q-I~~~l~~~gi~~~~v~G~RVTd~~tl~i   97 (150)
                      |-+|||+||+++.       |.+...+++++|+.++..|.++|||+|||.. ....+.+.+++..           .++.
T Consensus         1 kRIViKiG~s~L~~~~~~~l~~~~i~~la~~I~~l~~~G~evvlVsSGg~~a~~~~~~~~~~~~~-----------~~d~   69 (231)
T PRK00358          1 KRVLLKLSGEALAGDEGFGIDPDVLDRIAQEIKELVELGVEVAVVIGGGNIFRGAAAAKAGMDRV-----------TADY   69 (231)
T ss_pred             CEEEEEECHHHCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCHHHCCHHHHHCCCCCC-----------HHHH
T ss_conf             95999965565038999983999999999999999977996999983873315264774588654-----------6999


Q ss_pred             HHHHHHHHHHHHH-HHHHHHCCCCEEEEEEE
Q ss_conf             9999741668999-99998289976877865
Q 537021.9.peg.3   98 VEMVLAGSINKKI-VSLINQTGTQAIGICGK  127 (150)
Q Consensus        98 v~~vL~G~vn~~l-v~~L~~~g~~avglsg~  127 (150)
                      ..+ ++-..|..+ ...|...+..+...+..
T Consensus        70 ~g~-~at~~n~~ll~~~l~~~~~~~~~~~~~   99 (231)
T PRK00358         70 MGM-LATVMNALALQDALERAGVNTRVQSAI   99 (231)
T ss_pred             HHH-HHHHHHHHHHHHHHHHCCCCEEEEECC
T ss_conf             999-999999999999997369732675122


No 25 
>cd04256 AAK_P5CS_ProBA AAK_P5CS_ProBA: Glutamate-5-kinase (G5K) domain of the bifunctional delta 1-pyrroline-5-carboxylate synthetase (P5CS), composed of an N-terminal G5K (ProB) and a C-terminal glutamyl 5- phosphate reductase (G5PR, ProA), the first and second enzyme catalyzing proline (and, in mammals, ornithine) biosynthesis. G5K transfers the terminal phosphoryl group of ATP to the gamma-carboxyl group of glutamate, and is subject to feedback allosteric inhibition by proline or ornithine. In plants, proline plays an important role as an osmoprotectant and, in mammals, ornithine biosynthesis is crucial for proper ammonia detoxification, since a G5K mutation has been shown to cause human hyperammonaemia.
Probab=98.04  E-value=3e-05  Score=54.64  Aligned_cols=47  Identities=23%  Similarity=0.356  Sum_probs=39.1

Q ss_pred             CCCEEEEEECHHHHCCHH-------HHHHHHHHHHHHHHCCCCEEEEECCCHHH
Q ss_conf             598899986756724746-------89999999999851897179984572457
Q 537021.9.peg.3   24 ENETIVVKYGGHVMNCTD-------LSKDFVNDIALLKKSNITPVIVHGGGPQI   70 (150)
Q Consensus        24 r~ktiVIK~GG~~l~d~~-------l~~~~~~dI~ll~~~G~k~ViVHGgg~qI   70 (150)
                      +-|.+|||+|++++.+++       .+.+++++|+.|+..|..+||||+|+-..
T Consensus         7 ~~krIVIKiGss~lt~~d~~~~~~~~i~~l~~~I~~L~~~G~evvlVsSGAv~~   60 (284)
T cd04256           7 HAKRIVVKLGSAVVTREDECGLALGRLASIVEQVSELQSQGREVILVTSGAVAF   60 (284)
T ss_pred             CCCEEEEEECHHHEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCHHHC
T ss_conf             487899995702237999997589999999999999997899699990684653


No 26 
>cd04239 AAK_UMPK-like AAK_UMPK-like: UMP kinase (UMPK)-like, the microbial/chloroplast uridine monophosphate kinase (uridylate kinase) enzyme that catalyzes UMP phosphorylation and plays a key role in pyrimidine nucleotide biosynthesis. Regulation of this process is via feed-back control and via gene repression of carbamoyl phosphate synthetase (the first enzyme of the pyrimidine biosynthesis pathway). The UMP kinases of E. coli (Ec) and Pyrococcus furiosus (Pf) are known to function as homohexamers, with GTP and UTP being allosteric effectors. Like other related enzymes (carbamate kinase, aspartokinase, and N-acetylglutamate kinase) the E. coli and most bacterial UMPKs have a conserved, N-terminal, lysine residue proposed to function in the catalysis of the phosphoryl group transfer, whereas most archaeal UMPKs appear to lack this residue and the Pyrococcus furiosus structure has an additional Mg ion bound to the ATP molecule which is proposed to function as the catalysis instead. Als
Probab=98.00  E-value=0.00011  Score=51.06  Aligned_cols=88  Identities=19%  Similarity=0.287  Sum_probs=56.4

Q ss_pred             EEEEECHHHHC------CHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHHHHHHHHH
Q ss_conf             99986756724------746899999999998518971799845724578899970898115578346612489999999
Q 537021.9.peg.3   28 IVVKYGGHVMN------CTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQTAEVVEMV  101 (150)
Q Consensus        28 iVIK~GG~~l~------d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~tl~iv~~v  101 (150)
                      +|||+||+.+.      |.+.+.+++++|+.|+..|.++|||+|||...... ...+..      .   ++..++...+.
T Consensus         2 IViKiG~s~lt~~~~~l~~~~i~~la~~i~~l~~~g~evilV~s~G~~a~g~-~~~~~~------~---~~~~~~~~~~~   71 (229)
T cd04239           2 IVLKLSGEALAGEGGGIDPEVLKEIAREIKEVVDLGVEVAIVVGGGNIARGY-IAAARG------M---PRATADYIGML   71 (229)
T ss_pred             EEEEECHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCH-HHHHCC------C---CCCHHHHHHHH
T ss_conf             8999765770189999199999999999999997799799998997344664-344156------6---71359999999


Q ss_pred             HHHHHHHHH-HHHHHHCCCCEEEEEE
Q ss_conf             741668999-9999828997687786
Q 537021.9.peg.3  102 LAGSINKKI-VSLINQTGTQAIGICG  126 (150)
Q Consensus       102 L~G~vn~~l-v~~L~~~g~~avglsg  126 (150)
                       +-..|..+ ...|.+++..+.-.+.
T Consensus        72 -a~~~~~~ll~~~l~~~~~~~~~~~~   96 (229)
T cd04239          72 -ATVMNALALQDALEKLGVKTRVMSA   96 (229)
T ss_pred             -HHHHHHHHHHHHHHHCCCCEEEECH
T ss_conf             -9999999999999867997898317


No 27 
>PRK12314 gamma-glutamyl kinase; Provisional
Probab=97.85  E-value=0.00017  Score=49.80  Aligned_cols=46  Identities=20%  Similarity=0.249  Sum_probs=39.8

Q ss_pred             CCCEEEEEECHHHHC------CHHHHHHHHHHHHHHHHCCCCEEEEECCCHH
Q ss_conf             598899986756724------7468999999999985189717998457245
Q 537021.9.peg.3   24 ENETIVVKYGGHVMN------CTDLSKDFVNDIALLKKSNITPVIVHGGGPQ   69 (150)
Q Consensus        24 r~ktiVIK~GG~~l~------d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~q   69 (150)
                      .-|.+|||+||+++.      |.+...+++++|+.++..|.++|||++|.-.
T Consensus         8 ~~krIViKiG~s~lt~~~g~~~~~~i~~la~~I~~l~~~G~~vvlVsSGav~   59 (265)
T PRK12314          8 SAKRIVIKVGSSSLSYENGKINLERIEQLVFVISDLMNKGKEVILVSSGAIG   59 (265)
T ss_pred             CCCEEEEEECHHHEECCCCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCHHH
T ss_conf             0878999966112678999859999999999999999789989998637443


No 28 
>COG1608 Predicted archaeal kinase [General function prediction only]
Probab=97.82  E-value=7.8e-05  Score=52.00  Aligned_cols=100  Identities=20%  Similarity=0.407  Sum_probs=67.5

Q ss_pred             EEEEECHHHHCCHHH--------HHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHHHHHHH
Q ss_conf             999867567247468--------999999999985189717998457245788999708981155783466124899999
Q 537021.9.peg.3   28 IVVKYGGHVMNCTDL--------SKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQTAEVVE   99 (150)
Q Consensus        28 iVIK~GG~~l~d~~l--------~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~tl~iv~   99 (150)
                      +|+|+|||++.|.+.        ...++..|+-  .--.+.|+|||||..=--..++.++ +.-.+   +-++.-...+.
T Consensus         3 ~IlKlGGSvITdK~~p~t~r~~~l~ria~eI~~--~~~~~livVHGgGSFGHp~Ak~~~~-~~~~~---~~s~~G~~~~~   76 (252)
T COG1608           3 IILKLGGSVITDKDKPRTVREDRLRRIAREISN--GKPEKLIVVHGGGSFGHPAAKEFGL-EGLKN---YLSPLGFSLTH   76 (252)
T ss_pred             EEEEECCEEEECCCCCCHHHHHHHHHHHHHHHC--CCCCCEEEEECCCCCCCHHHHHHCC-CCCCC---CCCCCCHHHHH
T ss_conf             899955616525887321118999999999845--9954389994586656788997076-54446---66743439999


Q ss_pred             HHHHHHHHHHHHHHHHHCCCCEEE---EE--EEECCEEEE
Q ss_conf             997416689999999828997687---78--655898999
Q 537021.9.peg.3  100 MVLAGSINKKIVSLINQTGTQAIG---IC--GKDGNMVFA  134 (150)
Q Consensus       100 ~vL~G~vn~~lv~~L~~~g~~avg---ls--g~dg~li~a  134 (150)
                      .++. +.|..++..|.+.|+.|+.   +|  -.+|++..+
T Consensus        77 ~am~-~L~~~V~~~l~~~Gv~av~~~P~s~~~~~gr~~~~  115 (252)
T COG1608          77 LAML-ELNSIVVDALLDAGVRAVSVVPISFSTFNGRILYT  115 (252)
T ss_pred             HHHH-HHHHHHHHHHHHCCCCCCCCCCCCEEECCCCEEEC
T ss_conf             9999-99999999998669853222686403317703220


No 29 
>PRK12352 putative carbamate kinase; Reviewed
Probab=97.73  E-value=0.00018  Score=49.74  Aligned_cols=87  Identities=24%  Similarity=0.316  Sum_probs=56.8

Q ss_pred             CCEEEEEECHHHHCC-------HHHH---HHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHH
Q ss_conf             988999867567247-------4689---999999999851897179984572457889997089811557834661248
Q 537021.9.peg.3   25 NETIVVKYGGHVMNC-------TDLS---KDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQT   94 (150)
Q Consensus        25 ~ktiVIK~GG~~l~d-------~~l~---~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~t   94 (150)
                      .+++||-+||+++..       ++.+   ..-++.|+-|-..|.++||+||-|||+...+.+...-... +   .+++--
T Consensus         2 ~k~iViALGGNAll~~g~~~t~~~Q~~~v~~~a~~ia~li~~g~~vvitHGNGPQVG~l~lq~e~a~~~-~---~~p~~P   77 (316)
T PRK12352          2 KELVVVAIGGNSIIKDNASQSIEHQAEAVKAVADTVLEMLASDYDIVLTHGNGPQVGLDLRRAEIAHER-E---GLPLTP   77 (316)
T ss_pred             CCEEEEECCHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEEEECCCHHHHHHHHHHHHHHCC-C---CCCCCC
T ss_conf             966999767343289999999999999999999999999976996999817816988999999875336-7---899875


Q ss_pred             HHH-HHHHHHHHHHHHHHHHHHH
Q ss_conf             999-9999741668999999982
Q 537021.9.peg.3   95 AEV-VEMVLAGSINKKIVSLINQ  116 (150)
Q Consensus        95 l~i-v~~vL~G~vn~~lv~~L~~  116 (150)
                      |++ +.|. -|.+...|..+|..
T Consensus        78 Ld~~~A~s-QG~IGY~lqqal~n   99 (316)
T PRK12352         78 LANCVADT-QGGIGYLIQQALNN   99 (316)
T ss_pred             HHHHHHCC-CHHHHHHHHHHHHH
T ss_conf             45541212-20899999999999


No 30 
>cd04242 AAK_G5K_ProB AAK_G5K_ProB: Glutamate-5-kinase (G5K) catalyzes glutamate-dependent ATP cleavage; G5K transfers the terminal phosphoryl group of ATP to the gamma-carboxyl group of glutamate, in the first and controlling step of proline (and, in mammals, ornithine) biosynthesis. G5K is subject to feedback allosteric inhibition by proline or ornithine. In microorganisms and plants, proline plays an important role as an osmoprotectant and, in mammals, ornithine biosynthesis is crucial for proper ammonia detoxification, since a G5K mutation has been shown to cause human hyperammonaemia. Microbial G5K generally consists of two domains: a catalytic G5K domain and one PUA (pseudo uridine synthases and archaeosine-specific transglycosylases) domain, and some lack the PUA domain. G5K requires free Mg for activity, it is tetrameric, and it aggregates to higher forms in a proline-dependent way. G5K lacking the PUA domain remains tetrameric, active, and proline-inhibitable, but the Mg requir
Probab=97.72  E-value=0.00021  Score=49.30  Aligned_cols=51  Identities=25%  Similarity=0.440  Sum_probs=41.1

Q ss_pred             EEEEECHHHHCCH------HHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCC
Q ss_conf             9998675672474------6899999999998518971799845724578899970898
Q 537021.9.peg.3   28 IVVKYGGHVMNCT------DLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIK   80 (150)
Q Consensus        28 iVIK~GG~~l~d~------~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~   80 (150)
                      +|||+|++++.++      +...+++++|+.|+..|.++||||+|.  |..-...+|+.
T Consensus         2 iViKiGss~lt~~~~~~~~~~i~~la~~I~~l~~~G~evvlVsSGA--va~G~~~l~~~   58 (251)
T cd04242           2 IVVKVGSSLLTDEDGGLDLGRLASLVEQIAELRNQGKEVILVSSGA--VAAGRQRLGLE   58 (251)
T ss_pred             EEEEECCCEEECCCCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCH--HHHCHHHCCCC
T ss_conf             8999670116899998599999999999999997899799982475--66187653998


No 31 
>PRK09411 carbamate kinase; Reviewed
Probab=97.67  E-value=0.00075  Score=45.71  Aligned_cols=87  Identities=24%  Similarity=0.336  Sum_probs=59.3

Q ss_pred             CEEEEEECHHHHCC-------HHHH---HHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHHH
Q ss_conf             88999867567247-------4689---9999999998518971799845724578899970898115578346612489
Q 537021.9.peg.3   26 ETIVVKYGGHVMNC-------TDLS---KDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQTA   95 (150)
Q Consensus        26 ktiVIK~GG~~l~d-------~~l~---~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~tl   95 (150)
                      |++||-+||+++..       ++.+   ...++.|+.|.. +.++||.||-|||+...+-+. ...+      -+++-.|
T Consensus         2 k~iViALGGNAl~~~g~~~t~e~Q~~~v~~aa~~ia~l~~-~~~vVITHGNGPQVG~l~lq~-~a~~------~~p~~PL   73 (297)
T PRK09411          2 KTLVVALGGNALLQRGEALTAENQYRNIASAVPALARLAR-SYRLAIVHGNGPQVGLLALQN-LAWK------EVEPYPL   73 (297)
T ss_pred             CEEEEECCHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHH-CCCEEEEECCCHHHHHHHHHH-HHHC------CCCCCCC
T ss_conf             7599976814217999998999999999999999999980-893999847812989999998-8632------6999883


Q ss_pred             HH-HHHHHHHHHHHHHHHHHHHCCCCE
Q ss_conf             99-999974166899999998289976
Q 537021.9.peg.3   96 EV-VEMVLAGSINKKIVSLINQTGTQA  121 (150)
Q Consensus        96 ~i-v~~vL~G~vn~~lv~~L~~~g~~a  121 (150)
                      ++ ..|. -|.+...|...|...+...
T Consensus        74 d~~~A~s-QG~IGY~l~q~L~~~~~~~   99 (297)
T PRK09411         74 DVLVAES-QGMIGYMLAQSLSAQPQMP   99 (297)
T ss_pred             CHHHHHH-HHHHHHHHHHHHHHCCCCC
T ss_conf             0342422-0699999999998437899


No 32 
>TIGR00746 arcC carbamate kinase; InterPro: IPR003964   The arginine dihydrolase (AD) pathway is found in many prokaryotes and some primitive eukaryotes. The three- enzyme anaerobic pathway breaks down L-arginine to form 1 mol of ATP, carbon dioxide and ammonia. In simpler bacteria, the first enzyme, arginine deiminase, can account for up to 100f total cell protein .   Carbamate kinase is involved in the last step of the AD pathway, converting carbamoyl phosphate and ADP into ammonia, carbon dioxide and ATP . The second step of the pathway involves the degradation of L-citrulline to carbamoyl phosphate and L-ornithine, using ornithine carbamoyltransferase .    The crystal structure of Enterococcus faecium carbamate kinase has been determined to 2.8A resolution . The enzyme exists as a homodimer of two 33kDa subunits. The hallmark of the dimer is a 16-stranded beta-sheet, surrounded by alpha-helices. Each subunit contains an active site within a large crevice.; GO: 0008804 carbamate kinase activity, 0006525 arginine metabolic process.
Probab=97.63  E-value=0.00017  Score=49.90  Aligned_cols=94  Identities=23%  Similarity=0.357  Sum_probs=63.3

Q ss_pred             CEEEEEECHHHHC--CH-----HHH---HHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHC-CCCCEEECCCCCCCHHH
Q ss_conf             8899986756724--74-----689---9999999998518971799845724578899970-89811557834661248
Q 537021.9.peg.3   26 ETIVVKYGGHVMN--CT-----DLS---KDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKM-GIKSKFENGLRITDQQT   94 (150)
Q Consensus        26 ktiVIK~GG~~l~--d~-----~l~---~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~-gi~~~~v~G~RVTd~~t   94 (150)
                      |++||-+||+||.  .+     +.+   +.-+..|+.+...|.++||.||=|||+..++=+. -.+++     .++.+.=
T Consensus         1 k~~VvALGGNAlLqRg~~~s~~~Q~~nv~~t~~~l~~~~~~G~~lvi~HGNGPQVG~LlLQ~~A~~~~-----~~~~~~P   75 (321)
T TIGR00746         1 KRVVVALGGNALLQRGEKGSAENQRKNVEQTAPQLVKLIKRGYELVITHGNGPQVGLLLLQNQAADSE-----KVVPAMP   75 (321)
T ss_pred             CEEEEEECCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHHHHHC-----CCCCCCC
T ss_conf             94889737358863788379889999999999999999861897999847803688899999897603-----7889887


Q ss_pred             HHH-HHHHHHHHHHHHHHHHHH----HCCCCEEEEE
Q ss_conf             999-999974166899999998----2899768778
Q 537021.9.peg.3   95 AEV-VEMVLAGSINKKIVSLIN----QTGTQAIGIC  125 (150)
Q Consensus        95 l~i-v~~vL~G~vn~~lv~~L~----~~g~~avgls  125 (150)
                      |++ .-|. =|.|...|...|+    +.|...--.|
T Consensus        76 LDV~GAmS-QG~IGYml~q~l~~~L~~~g~~~~vaT  110 (321)
T TIGR00746        76 LDVLGAMS-QGMIGYMLQQALNNELPKEGLEKPVAT  110 (321)
T ss_pred             CCEECCCC-CCHHHHHHHHHHHHHHHHCCCCCCCEE
T ss_conf             43202033-530789999999999986689998538


No 33 
>COG0549 ArcC Carbamate kinase [Amino acid transport and metabolism]
Probab=97.52  E-value=0.0012  Score=44.47  Aligned_cols=86  Identities=29%  Similarity=0.382  Sum_probs=58.4

Q ss_pred             CEEEEEECHHHHCCH----------HHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHHH
Q ss_conf             889998675672474----------6899999999998518971799845724578899970898115578346612489
Q 537021.9.peg.3   26 ETIVVKYGGHVMNCT----------DLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQTA   95 (150)
Q Consensus        26 ktiVIK~GG~~l~d~----------~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~tl   95 (150)
                      +.+||-+||+++...          +....-++.|+.|..-|.++||.||-|||+...+.+.-..-+.. +.   .+--|
T Consensus         1 ~~iVvALGGNAll~~g~~~tae~Q~~~v~~ta~~i~~l~~~g~e~VitHGNGPQVG~l~lq~~aa~~~~-~~---p~~PL   76 (312)
T COG0549           1 KRIVVALGGNALLQRGEPLTAEAQYEAVKITAEQIADLIASGYEVVITHGNGPQVGLLLLQNEAADSEK-GV---PAYPL   76 (312)
T ss_pred             CEEEEEECCHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHHHCCCC-CC---CCCCH
T ss_conf             918999353453479999898999999999999999998558859997189756779998766532545-78---88646


Q ss_pred             HH-HHHHHHHHHHHHHHHHHHH
Q ss_conf             99-9999741668999999982
Q 537021.9.peg.3   96 EV-VEMVLAGSINKKIVSLINQ  116 (150)
Q Consensus        96 ~i-v~~vL~G~vn~~lv~~L~~  116 (150)
                      ++ +.|. -|.+...|..+|..
T Consensus        77 d~~~Ams-QG~IGy~l~qal~n   97 (312)
T COG0549          77 DVLVAMS-QGMIGYMLQQALRN   97 (312)
T ss_pred             HHHHHHH-HHHHHHHHHHHHHH
T ss_conf             8886730-12899999999997


No 34 
>cd04261 AAK_AKii-LysC-BS AAK_AKii-LysC-BS: Amino Acid Kinase Superfamily (AAK), AKii; this CD includes the N-terminal catalytic aspartokinase (AK) domain of the lysine-sensitive aspartokinase isoenzyme AKII of Bacillus subtilis 168, and the lysine plus threonine-sensitive aspartokinase of Corynebacterium glutamicum, and related sequences. In B. subtilis 168, the regulation of the diaminopimelate (Dap)-lysine biosynthetic pathway involves dual control by Dap and lysine, effected through separate Dap- and lysine-sensitive aspartokinase isoenzymes. The B. subtilis 168 AKII is induced by methionine, and repressed and inhibited by lysine. Although Corynebacterium glutamicum is known to contain a single aspartokinase isoenzyme type, both the succinylase and dehydrogenase variant pathways of DAP-lysine synthesis operate simultaneously in this organism. In this organism and other various Gram-positive bacteria, the DAP-lysine pathway is feedback regulated by the concerted action of lysine and 
Probab=97.46  E-value=0.0019  Score=43.10  Aligned_cols=98  Identities=20%  Similarity=0.299  Sum_probs=63.9

Q ss_pred             EEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECC-CHHHHHHHHHCCCCCEEECCCCCCCHHHHHHHHHHHHHH
Q ss_conf             8999867567247468999999999985189717998457-245788999708981155783466124899999997416
Q 537021.9.peg.3   27 TIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGG-GPQIGAVLEKMGIKSKFENGLRITDQQTAEVVEMVLAGS  105 (150)
Q Consensus        27 tiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGg-g~qI~~~l~~~gi~~~~v~G~RVTd~~tl~iv~~vL~G~  105 (150)
                      .+|.||||+-+.|++.+..+++-|......|.++++|.++ |.--|++++...-   ..+   --++..++.+..  .|+
T Consensus         1 ~iV~KFGGtSv~~~e~i~~va~iI~~~~~~~~~vvVVvSA~gg~Td~Li~la~~---~~~---~~~~~e~d~ils--~GE   72 (239)
T cd04261           1 LIVQKFGGTSVASIERIKRVAERIKKRKKKGNQVVVVVSAMGGTTDELIELAKE---ISP---RPPARELDVLLS--TGE   72 (239)
T ss_pred             CEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHH---HCC---CCCHHHHHHHHH--HHH
T ss_conf             989997820139999999999999989866992899977987536889999998---706---999899988877--779


Q ss_pred             -HHHH-HHHHHHHCCCCEEEEEEEECCEE
Q ss_conf             -6899-99999828997687786558989
Q 537021.9.peg.3  106 -INKK-IVSLINQTGTQAIGICGKDGNMV  132 (150)
Q Consensus       106 -vn~~-lv~~L~~~g~~avglsg~dg~li  132 (150)
                       .+.. +...|++.|.++..+++.+-.++
T Consensus        73 ~lsa~ll~~~L~~~G~~a~~~~~~~~~i~  101 (239)
T cd04261          73 QVSIALLAMALNRLGIKAISLTGWQAGIL  101 (239)
T ss_pred             HHHHHHHHHHHHHCCCCCEEEEHHHCCEE
T ss_conf             99999999999976997579686667543


No 35 
>PRK12354 carbamate kinase; Reviewed
Probab=97.25  E-value=0.0021  Score=42.91  Aligned_cols=82  Identities=23%  Similarity=0.291  Sum_probs=54.6

Q ss_pred             EEEEEECHHHHCC-------HH---HHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHHHH
Q ss_conf             8999867567247-------46---8999999999985189717998457245788999708981155783466124899
Q 537021.9.peg.3   27 TIVVKYGGHVMNC-------TD---LSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQTAE   96 (150)
Q Consensus        27 tiVIK~GG~~l~d-------~~---l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~tl~   96 (150)
                      .+||-+||+++..       ++   ....-++.|+.|.. |.++||+||-|||+...+.+.. ...  +    +++--|+
T Consensus         2 riViALGGNAll~~g~~~t~~~Q~~~v~~aa~~ia~l~~-g~~vvitHGNGPQVG~i~l~~~-~~~--~----~p~~PLd   73 (302)
T PRK12354          2 RIVIALGGNALLRRGEPMTAENQRANIRIAAEQIAKIAP-GNEVVIVHGNGPQVGLLALQNA-AYK--A----VPPYPLD   73 (302)
T ss_pred             EEEEECCHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHC-CCEEEEEECCCHHHHHHHHHHH-HCC--C----CCCCCHH
T ss_conf             499982735317999999999999999999999999975-9959998278168899999877-416--8----9998826


Q ss_pred             H-HHHHHHHHHHHHHHHHHHHC
Q ss_conf             9-99997416689999999828
Q 537021.9.peg.3   97 V-VEMVLAGSINKKIVSLINQT  117 (150)
Q Consensus        97 i-v~~vL~G~vn~~lv~~L~~~  117 (150)
                      + +.|. -|.+...|..+|...
T Consensus        74 ~~~A~S-QG~IGY~lqqaL~n~   94 (302)
T PRK12354         74 VLGAET-EGMIGYMLEQELGNL   94 (302)
T ss_pred             HHHHHC-CHHHHHHHHHHHHHH
T ss_conf             764521-169999999999987


No 36 
>cd04246 AAK_AK-DapG-like AAK_AK-DapG-like: Amino Acid Kinase Superfamily (AAK), AK-DapG-like; this CD includes the N-terminal catalytic aspartokinase (AK) domain of the diaminopimelate-sensitive aspartokinase isoenzyme AKI (DapG), a monofunctional enzymes found in Bacilli (Bacillus subtilis 168), Clostridia, and Actinobacteria bacterial species, as well as, the catalytic AK domain of the lysine-sensitive aspartokinase isoenzyme AKII of Bacillus subtilis 168, the lysine plus threonine-sensitive aspartokinase of Corynebacterium glutamicum, and related isoenzymes. In Bacillus subtilis, the regulation of the diaminopimelate-lysine biosynthetic pathway involves dual control by diaminopimelate and lysine, effected through separate diaminopimelate- and lysine-sensitive aspartokinase isoenzymes. The role of the AKI isoenzyme is most likely to provide a constant level of aspartyl-beta-phosphate for the biosynthesis of diaminopimelate for peptidoglycan synthesis and dipicolinate during sporulati
Probab=97.16  E-value=0.0084  Score=38.99  Aligned_cols=96  Identities=18%  Similarity=0.199  Sum_probs=61.3

Q ss_pred             EEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHHHHHHHHHHHHHH
Q ss_conf             89998675672474689999999999851897179984572457889997089811557834661248999999974166
Q 537021.9.peg.3   27 TIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQTAEVVEMVLAGSI  106 (150)
Q Consensus        27 tiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~tl~iv~~vL~G~v  106 (150)
                      .+|.||||+-+.|++.++++++-|......|.++++|.++=--++..|-++.-+.  .   .-.++..++.+... .-..
T Consensus         1 ~iV~KFGGtSv~~~e~i~~va~iI~~~~~~g~~viVVvSA~gg~Td~L~~l~~~~--~---~~~~~~~~d~ils~-GE~l   74 (239)
T cd04246           1 IIVQKFGGTSVADIERIKRVAERIKKAVKKGYQVVVVVSAMGGTTDELIGLAKEV--S---PRPSPRELDMLLST-GEQI   74 (239)
T ss_pred             CEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCHHHHHHHHHHH--C---CCCCHHHHHHHHHH-HHHH
T ss_conf             9899878100499999999999999998679829999848986567799988876--4---77258899999998-8999


Q ss_pred             HHH-HHHHHHHCCCCEEEEEEEE
Q ss_conf             899-9999982899768778655
Q 537021.9.peg.3  107 NKK-IVSLINQTGTQAIGICGKD  128 (150)
Q Consensus       107 n~~-lv~~L~~~g~~avglsg~d  128 (150)
                      +.. +...|++.|+++.-+.+.+
T Consensus        75 s~~lla~~L~~~G~~a~~~~~~~   97 (239)
T cd04246          75 SAALLAMALNRLGIKAISLTGWQ   97 (239)
T ss_pred             HHHHHHHHHHHCCCCCEEEEHHH
T ss_conf             99999999997599808857323


No 37 
>cd04253 AAK_UMPK-PyrH-Pf AAK_UMPK-PyrH-Pf: UMP kinase (UMPK)-Pf, the mostly archaeal uridine monophosphate kinase (uridylate kinase) enzymes that catalyze UMP phosphorylation and play a key role in pyrimidine nucleotide biosynthesis; regulation of this process is via feed-back control and via gene repression of carbamoyl phosphate synthetase (the first enzyme of the pyrimidine biosynthesis pathway). The UMP kinase of Pyrococcus furiosus (Pf) is known to function as a homohexamer, with GTP and UTP being allosteric effectors. Like other related enzymes (carbamate kinase, aspartokinase, and N-acetylglutamate kinase) the E. coli and most bacterial UMPKs have a conserved, N-terminal, lysine residue proposed to function in the catalysis of the phosphoryl group transfer, whereas most archaeal UMPKs (this CD) appear to lack this residue and the Pyrococcus furiosus structure has an additional Mg ion bound to the ATP molecule which is proposed to function as the catalysis instead. Members of thi
Probab=97.16  E-value=0.0015  Score=43.73  Aligned_cols=44  Identities=27%  Similarity=0.486  Sum_probs=36.0

Q ss_pred             EEEEECHHHHCC---HHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHH
Q ss_conf             999867567247---468999999999985189717998457245788
Q 537021.9.peg.3   28 IVVKYGGHVMNC---TDLSKDFVNDIALLKKSNITPVIVHGGGPQIGA   72 (150)
Q Consensus        28 iVIK~GG~~l~d---~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~   72 (150)
                      +|||+||+++.+   .+...++++.|+.++. |.++|+|+|||.....
T Consensus         2 IViKiGgs~l~~~~~~~~i~~la~~i~~l~~-~~~~iiVvggG~~ar~   48 (221)
T cd04253           2 IVISLGGSVLAPEKDADFIKEYANVLRKISD-GHKVAVVVGGGRLARE   48 (221)
T ss_pred             EEEEEEHHHCCCCCCHHHHHHHHHHHHHHHC-CCEEEEEECCCHHHHH
T ss_conf             7999607770799887999999999999975-9969999877778876


No 38 
>COG0528 PyrH Uridylate kinase [Nucleotide transport and metabolism]
Probab=97.00  E-value=0.014  Score=37.60  Aligned_cols=97  Identities=19%  Similarity=0.257  Sum_probs=69.9

Q ss_pred             HCCCEEEEEECHHHHCC-------HHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHHH
Q ss_conf             75988999867567247-------46899999999998518971799845724578899970898115578346612489
Q 537021.9.peg.3   23 YENETIVVKYGGHVMNC-------TDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQTA   95 (150)
Q Consensus        23 ~r~ktiVIK~GG~~l~d-------~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~tl   95 (150)
                      .+-+-+|+|+||+++..       ++....++++|..+...|.++-+|-|||.-.-.......       |.--+   +.
T Consensus         3 ~~~~rillkLsGe~l~g~~~~gid~~~i~~~a~~i~~~~~~g~eV~iVvGGGni~Rg~~~~~~-------g~~r~---~~   72 (238)
T COG0528           3 PKYMRILLKLSGEALAGEQGFGIDPEVLDRIANEIKELVDLGVEVAVVVGGGNIARGYIGAAA-------GMDRV---TA   72 (238)
T ss_pred             CCEEEEEEEEECCEECCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCHHHHHHHHHHC-------CCCHH---HH
T ss_conf             633799999421036478888879899999999999998669689999789789976789876-------98512---24


Q ss_pred             HHHHHHHHHHHHH-HHHHHHHHCCCCEEEEEEEECC
Q ss_conf             9999997416689-9999998289976877865589
Q 537021.9.peg.3   96 EVVEMVLAGSINK-KIVSLINQTGTQAIGICGKDGN  130 (150)
Q Consensus        96 ~iv~~vL~G~vn~-~lv~~L~~~g~~avglsg~dg~  130 (150)
                      +..=| |+-.+|. -|-..|.+.|+++.-+|.....
T Consensus        73 D~mGm-laTvmNal~L~~aL~~~~~~~~v~sai~~~  107 (238)
T COG0528          73 DYMGM-LATVMNALALQDALERLGVDTRVQSAIAMP  107 (238)
T ss_pred             HHHHH-HHHHHHHHHHHHHHHHCCCCCEECCCCCCC
T ss_conf             68889-999999999999986358761221311176


No 39 
>PRK08841 aspartate kinase; Validated
Probab=96.73  E-value=0.024  Score=36.09  Aligned_cols=99  Identities=16%  Similarity=0.253  Sum_probs=66.3

Q ss_pred             CEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECC-CHHHHHHHHHCCCCCEEECCCCCCCHHHHHHHHHHHHH
Q ss_conf             88999867567247468999999999985189717998457-24578899970898115578346612489999999741
Q 537021.9.peg.3   26 ETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGG-GPQIGAVLEKMGIKSKFENGLRITDQQTAEVVEMVLAG  104 (150)
Q Consensus        26 ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGg-g~qI~~~l~~~gi~~~~v~G~RVTd~~tl~iv~~vL~G  104 (150)
                      +.+|.||||+-+.+++.++++++-|..-...|.++|+|.-+ +..-|++++....    ++  .+-+++.|+.+... .-
T Consensus         2 ~~iV~KFGGTSv~~~e~i~~va~~i~~~~~~~~~vvVVvSAm~g~Td~Ll~la~~----i~--~~~~~relD~lls~-GE   74 (392)
T PRK08841          2 PLIVQKFGGTSVGSIERIQTVAEHIIKAKNDGNQVVVVVSAMAGETNRLLGLAKQ----VD--SVPTARELDVLLSA-GE   74 (392)
T ss_pred             CEEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHH----HC--CCCCHHHHHHHHHH-HH
T ss_conf             8799915863579999999999999988756998999962899968999999986----32--59998999999976-59


Q ss_pred             HHHHHH-HHHHHHCCCCEEEEEEEECCE
Q ss_conf             668999-999982899768778655898
Q 537021.9.peg.3  105 SINKKI-VSLINQTGTQAIGICGKDGNM  131 (150)
Q Consensus       105 ~vn~~l-v~~L~~~g~~avglsg~dg~l  131 (150)
                      ++...| ...|++.|.+|..+++.+..+
T Consensus        75 ~lS~~lla~~L~~~G~~a~~l~~~~~~i  102 (392)
T PRK08841         75 QVSMALLAMTLNKLGYKARSLTGAQANI  102 (392)
T ss_pred             HHHHHHHHHHHHHCCCCCEEEECHHCCC
T ss_conf             9999999999986799757987123351


No 40 
>PRK07431 aspartate kinase; Provisional
Probab=96.43  E-value=0.05  Score=34.01  Aligned_cols=102  Identities=17%  Similarity=0.246  Sum_probs=69.5

Q ss_pred             CEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHHHHHHHHHHHHH
Q ss_conf             88999867567247468999999999985189717998457245788999708981155783466124899999997416
Q 537021.9.peg.3   26 ETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQTAEVVEMVLAGS  105 (150)
Q Consensus        26 ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~tl~iv~~vL~G~  105 (150)
                      +.+|-||||+-+.|.+.++++++-|...+..|.++|+|--+--..+..|-++.-+  ..+   -.+++.|+.+... .-+
T Consensus         2 ~lIV~KFGGTSVa~~e~i~~va~~I~~~~~~g~~vvVVVSAmg~~Td~L~~la~~--i~~---~~~~re~D~lls~-GE~   75 (594)
T PRK07431          2 ALIVQKFGGTSVGSVERIQAVAQRIARTKEAGNDVVVVVSAMGKTTDELVKLAEE--ISD---NPPRREMDMLLST-GEQ   75 (594)
T ss_pred             CEEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHH--HCC---CCCHHHHHHHHHH-HHH
T ss_conf             7499842876579899999999999999875899999974998729999999998--747---9998999999977-599


Q ss_pred             HHHHHH-HHHHHCCCCEEEEEEEECCEEE
Q ss_conf             689999-9998289976877865589899
Q 537021.9.peg.3  106 INKKIV-SLINQTGTQAIGICGKDGNMVF  133 (150)
Q Consensus       106 vn~~lv-~~L~~~g~~avglsg~dg~li~  133 (150)
                      +...|. ..|+..|.+|+.++|...++++
T Consensus        76 ~S~~Lla~~L~~~G~~a~sl~~~q~gi~T  104 (594)
T PRK07431         76 VSIALLSMALQELGQPAISLTGAQVGIVT  104 (594)
T ss_pred             HHHHHHHHHHHHCCCCCEEEEHHHCCEEC
T ss_conf             99999999999689971992000161776


No 41 
>PRK08210 aspartate kinase I; Reviewed
Probab=96.43  E-value=0.08  Score=32.69  Aligned_cols=100  Identities=19%  Similarity=0.326  Sum_probs=63.0

Q ss_pred             CEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCH------HHHHHHHHCCCCCEEECCCCCCCHHHHHHHH
Q ss_conf             8899986756724746899999999998518971799845724------5788999708981155783466124899999
Q 537021.9.peg.3   26 ETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGP------QIGAVLEKMGIKSKFENGLRITDQQTAEVVE   99 (150)
Q Consensus        26 ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~------qI~~~l~~~gi~~~~v~G~RVTd~~tl~iv~   99 (150)
                      |++|.||||+-+.|++.+..+++-|..-...|.++|+|.-+=-      ..|.+++...-.      .+--++..++.+.
T Consensus         2 k~iV~KFGGtSv~~~~~i~~v~~ii~~~~~~~~~~vVVvSAmg~~~~~~~td~L~~l~~~~------~~~~~~~~~D~l~   75 (405)
T PRK08210          2 KIIVQKFGGTSVSTEERRKMVVNKIKKALKEGYKVVVVVSAMGRKGDPYATDTLLSLVGEE------FSDISKREQDLLM   75 (405)
T ss_pred             CEEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCHHHHHHHHHHHH------CCCCCHHHHHHHH
T ss_conf             8798816850269999999999999878866998899979989899811499999999865------0259878899998


Q ss_pred             HHHHHHHHHHH-HHHHHHCCCCEEEEEEEECCEE
Q ss_conf             99741668999-9999828997687786558989
Q 537021.9.peg.3  100 MVLAGSINKKI-VSLINQTGTQAIGICGKDGNMV  132 (150)
Q Consensus       100 ~vL~G~vn~~l-v~~L~~~g~~avglsg~dg~li  132 (150)
                      . ..-.+...| ...|++.|++|.-+.+.+..++
T Consensus        76 s-~GE~~S~~l~a~~L~~~g~~a~~~~~~~~~i~  108 (405)
T PRK08210         76 S-CGEIISSVVFSNMLNEAGIKAVALTGGQAGII  108 (405)
T ss_pred             H-HHHHHHHHHHHHHHHHCCCCCEEEEHHHCCEE
T ss_conf             7-75899999999999747998679774158478


No 42 
>cd04240 AAK_UC AAK_UC: Uncharacterized (UC) amino acid kinase-like proteins found mainly in archaea and a few bacteria. Sequences in this CD are members of the Amino Acid Kinase (AAK) superfamily.
Probab=96.36  E-value=0.0038  Score=41.17  Aligned_cols=41  Identities=37%  Similarity=0.463  Sum_probs=29.9

Q ss_pred             EEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHH
Q ss_conf             9986756724746899999999998518971799845724578899
Q 537021.9.peg.3   29 VVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVL   74 (150)
Q Consensus        29 VIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l   74 (150)
                      |||+|||.+.+.   .++++.+..+  .|.+++||.|||+..|..=
T Consensus         1 VvKiGGSL~~~~---~~ll~~l~~~--~~~~v~IVpGGG~fAd~vR   41 (203)
T cd04240           1 VVKIGGSLIREA---VRLLRWLKTL--SGGGVVIVPGGGPFADVVR   41 (203)
T ss_pred             CEEEEEECCCCH---HHHHHHHHHH--CCCCEEEEECCHHHHHHHH
T ss_conf             979820564668---9999999983--5997899979808999999


No 43 
>cd04260 AAK_AKi-DapG-BS AAK_AKi-DapG-BS: Amino Acid Kinase Superfamily (AAK), AKi-DapG; this CD includes the N-terminal catalytic aspartokinase (AK) domain of  the diaminopimelate-sensitive aspartokinase isoenzyme AKI (DapG), a monofunctional class enzyme found in Bacilli (Bacillus subtilis 168), Clostridia, and Actinobacteria bacterial species.  In Bacillus subtilis, the regulation of the diaminopimelate-lysine biosynthetic pathway involves dual control by diaminopimelate and lysine, effected through separate diaminopimelate- and lysine-sensitive aspartokinase isoenzymes. AKI activity is invariant during the exponential and stationary phases of growth and is not altered by addition of amino acids to the growth medium. The role of this isoenzyme is most likely to provide a constant level of aspartyl-beta-phosphate for the biosynthesis of diaminopimelate for peptidoglycan synthesis and dipicolinate during sporulation. The B. subtilis AKI is tetrameric consisting of two alpha and two bet
Probab=96.35  E-value=0.068  Score=33.18  Aligned_cols=99  Identities=13%  Similarity=0.260  Sum_probs=62.2

Q ss_pred             EEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCC-H-----HHHHHHHHCCCCCEEECCCCCCCHHHHHHHHH
Q ss_conf             89998675672474689999999999851897179984572-4-----57889997089811557834661248999999
Q 537021.9.peg.3   27 TIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGG-P-----QIGAVLEKMGIKSKFENGLRITDQQTAEVVEM  100 (150)
Q Consensus        27 tiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg-~-----qI~~~l~~~gi~~~~v~G~RVTd~~tl~iv~~  100 (150)
                      .+|.||||+-+.|++-+.++++-|..-...|.++++|..+= .     ..|++++...-..      +--+++.++.+-.
T Consensus         1 iiV~KFGGtSv~~~~~i~~va~iI~~~~~~~~~~vVVvSAmg~~~~~~~td~L~~l~~~~~------~~~~~~~~D~l~s   74 (244)
T cd04260           1 IIVQKFGGTSVSTKERREQVAKKVKQAVDEGYKPVVVVSAMGRKGDPYATDTLINLVYAEN------SDISPRELDLLMS   74 (244)
T ss_pred             CEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHC------CCCCHHHHHHHHH
T ss_conf             9898118523799999999999998898779988999799999997589999999999753------0389899999997


Q ss_pred             HHHHHHHHHH-HHHHHHCCCCEEEEEEEECCEE
Q ss_conf             9741668999-9999828997687786558989
Q 537021.9.peg.3  101 VLAGSINKKI-VSLINQTGTQAIGICGKDGNMV  132 (150)
Q Consensus       101 vL~G~vn~~l-v~~L~~~g~~avglsg~dg~li  132 (150)
                       +.-.+...| ...|++.|+++.-+.+.+-.++
T Consensus        75 -~GE~lSa~lla~~L~~~Gi~a~~~~~~~~~i~  106 (244)
T cd04260          75 -CGEIISAVVLTSTLRAQGLKAVALTGAQAGIL  106 (244)
T ss_pred             -HCCHHHHHHHHHHHHHCCCCCEEEEHHHCCEE
T ss_conf             -43099999999999964998748658888353


No 44 
>TIGR02076 pyrH_arch uridylate kinase, putative; InterPro: IPR011818   Uridylate kinases (also known as UMP kinases) are key enzymes in the synthesis of nucleoside triphosphates. They catalyse the reversible transfer of the gamma-phosphoryl group from an ATP donor to UMP, yielding UDP, which is the starting point for the synthesis of all other pyrimidine nucleotides. The eukaryotic enzyme has a dual specificity, phosphorylating both UMP and CMP, while the bacterial enzyme is specific to UMP. The bacterial enzyme shows no sequence similarity to the eukaryotic enzyme or other nucleoside monophosphate kinases, but rather appears to be part of the amino acid kinase family. It is dependent on magnesium for activity and is activated by GTP and repressed by UTP , . In many bacterial genomes, the gene tends to be located immediately downstream of elongation factor T and upstream of ribosome recycling factor. A related protein family, believed to be equivalent in function is found in the archaea and in spirochetes.   Structurally, the bacterial and archaeal proteins are homohexamers centred around a hollow nucleus and organised as a trimer of dimers , . Each monomer within the protein forms the amino acid kinase fold and can be divided into an N-terminal region which binds UMP and mediates intersubunit interactions within the dimer, and a C-terminal region which binds ATP and contains a mobile loop covering the active site. Inhibition of enzyme activity by UTP appears to be due to competition for the binding site for UMP, not allosteric inhibition as was previously suspected.   This entry represents the archaeal and spirochete proteins.; GO: 0009041 uridylate kinase activity, 0006221 pyrimidine nucleotide biosynthetic process.
Probab=96.17  E-value=0.0059  Score=39.98  Aligned_cols=75  Identities=21%  Similarity=0.325  Sum_probs=55.7

Q ss_pred             EEEEECHHHHCCH----HHHHHHHHHHHHHHHCC--CCEEEEECCCHHHHHH---HHHC--CCCCEEECCCCCCCHHHHH
Q ss_conf             9998675672474----68999999999985189--7179984572457889---9970--8981155783466124899
Q 537021.9.peg.3   28 IVVKYGGHVMNCT----DLSKDFVNDIALLKKSN--ITPVIVHGGGPQIGAV---LEKM--GIKSKFENGLRITDQQTAE   96 (150)
Q Consensus        28 iVIK~GG~~l~d~----~l~~~~~~dI~ll~~~G--~k~ViVHGgg~qI~~~---l~~~--gi~~~~v~G~RVTd~~tl~   96 (150)
                      +||++||+++.++    +.+.+|++-|..+..-|  .++.+|-|||+-.-++   .+++  |..-.|.|=        | 
T Consensus         1 iV~~LGGsV~~~~~~d~e~~~~~A~~l~~~~~~Gek~~v~vVvGGG~~AR~YI~~aR~lnPG~~E~~lD~--------i-   71 (232)
T TIGR02076         1 IVISLGGSVLSPENIDAEKIKEYANILRKLSDEGEKHKVGVVVGGGKTAREYIGVARELNPGASETFLDE--------I-   71 (232)
T ss_pred             CEEECCCEEECCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEEECCCHHHHHHHHHHHHHCCCCCCHHHHH--------H-
T ss_conf             9783345161688888789999999999998449946899887887658899999986088888024567--------7-


Q ss_pred             HHHHHHHHHHHHHHHHHH
Q ss_conf             999997416689999999
Q 537021.9.peg.3   97 VVEMVLAGSINKKIVSLI  114 (150)
Q Consensus        97 iv~~vL~G~vn~~lv~~L  114 (150)
                         -+.+-++|..|...+
T Consensus        72 ---GI~~TRLNA~LLi~a   86 (232)
T TIGR02076        72 ---GIDATRLNAMLLIAA   86 (232)
T ss_pred             ---HHHHHHHHHHHHHHH
T ss_conf             ---799999999999987


No 45 
>PRK05429 gamma-glutamyl kinase; Provisional
Probab=96.15  E-value=0.11  Score=31.92  Aligned_cols=56  Identities=23%  Similarity=0.396  Sum_probs=45.8

Q ss_pred             CCCEEEEEECHHHHCCH------HHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCC
Q ss_conf             59889998675672474------68999999999985189717998457245788999708981
Q 537021.9.peg.3   24 ENETIVVKYGGHVMNCT------DLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKS   81 (150)
Q Consensus        24 r~ktiVIK~GG~~l~d~------~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~   81 (150)
                      .-|.+|||+|-+.+.++      ....+++++|+.|+..|.++|||--|-  |..-+..++++.
T Consensus         7 ~~krIVIKiGSs~Lt~~~g~l~~~~i~~l~~~I~~L~~~G~evilVSSGA--Ia~G~~~L~~~~   68 (372)
T PRK05429          7 DARRIVVKVGSSLLTDDGGGLDRARIAELARQIAALRAAGHEVVLVSSGA--VAAGRSRLGLPK   68 (372)
T ss_pred             CCCEEEEECCCCEEECCCCCCCHHHHHHHHHHHHHHHHCCCEEEEECCHH--HHHHHHHCCCCC
T ss_conf             28789999174336898888889999999999999997899999988407--986277609998


No 46 
>PRK13402 gamma-glutamyl kinase; Provisional
Probab=96.10  E-value=0.019  Score=36.72  Aligned_cols=55  Identities=22%  Similarity=0.265  Sum_probs=44.2

Q ss_pred             CCEEEEEECHHHHCC------HHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCC
Q ss_conf             988999867567247------468999999999985189717998457245788999708981
Q 537021.9.peg.3   25 NETIVVKYGGHVMNC------TDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKS   81 (150)
Q Consensus        25 ~ktiVIK~GG~~l~d------~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~   81 (150)
                      -|.+|||+|-+.+.+      ...+.+++++|+.|+..|.++|||-.|.-..  -+..++.+.
T Consensus         5 ~kRIVIKiGSs~lt~~~g~l~~~~i~~l~~~I~~L~~~G~evilVSSGAia~--G~~~L~~~~   65 (363)
T PRK13402          5 WKRIVVKVGSALITPHKQGCSSHYLLGIAQFIVYLRAQGHQVVLVSSGSVAA--GYHKLGFID   65 (363)
T ss_pred             CCEEEEEECCCEEECCCCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH--HHHHCCCCC
T ss_conf             8669999670106899998789999999999999997899899988787996--477609976


No 47 
>COG0263 ProB Glutamate 5-kinase [Amino acid transport and metabolism]
Probab=96.05  E-value=0.064  Score=33.34  Aligned_cols=89  Identities=22%  Similarity=0.356  Sum_probs=59.3

Q ss_pred             CCEEEEEECHHHHCCH------HHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHHHHHH
Q ss_conf             9889998675672474------6899999999998518971799845724578899970898115578346612489999
Q 537021.9.peg.3   25 NETIVVKYGGHVMNCT------DLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQTAEVV   98 (150)
Q Consensus        25 ~ktiVIK~GG~~l~d~------~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~tl~iv   98 (150)
                      -+++|||+|-+.+.++      ..+..++++++.|+..|.++|||--|.  |-.=+..+|++.+-.+   +..++.+.. 
T Consensus         6 ~~riVvKiGSs~Lt~~~g~l~~~~l~~l~~~ia~L~~~G~eVilVSSGA--iaaG~~~Lg~~~rp~~---l~~kQA~AA-   79 (369)
T COG0263           6 ARRIVVKIGSSSLTDGTGGLDRSKLEELVRQVAALHKAGHEVVLVSSGA--IAAGRTRLGLPKRPKT---LAEKQAAAA-   79 (369)
T ss_pred             CEEEEEEECCCEEECCCCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCH--HHHCHHHCCCCCCCCC---HHHHHHHHH-
T ss_conf             5379999775023579998298999999999999986898899983446--6517444199988761---288899987-


Q ss_pred             HHHHHHHHH--HHHHHHHHHCCCCEEE
Q ss_conf             999741668--9999999828997687
Q 537021.9.peg.3   99 EMVLAGSIN--KKIVSLINQTGTQAIG  123 (150)
Q Consensus        99 ~~vL~G~vn--~~lv~~L~~~g~~avg  123 (150)
                          .||+.  ..-.+.+.++|...--
T Consensus        80 ----VGQ~~Lm~~y~~~f~~~g~~v~Q  102 (369)
T COG0263          80 ----VGQVRLMQLYEELFARYGIKVGQ  102 (369)
T ss_pred             ----HCHHHHHHHHHHHHHHCCCEEEE
T ss_conf             ----37999999999999860974568


No 48 
>PRK12443 uridylate kinase; Reviewed
Probab=95.94  E-value=0.024  Score=36.07  Aligned_cols=91  Identities=20%  Similarity=0.311  Sum_probs=57.8

Q ss_pred             CEEEEEECHHHHC-------CHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHH-HHHCCCCCEEECCCCCCCHHHHHH
Q ss_conf             8899986756724-------74689999999999851897179984572457889-997089811557834661248999
Q 537021.9.peg.3   26 ETIVVKYGGHVMN-------CTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAV-LEKMGIKSKFENGLRITDQQTAEV   97 (150)
Q Consensus        26 ktiVIK~GG~~l~-------d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~-l~~~gi~~~~v~G~RVTd~~tl~i   97 (150)
                      |.+++|++|+++.       |++....++++|..++..|.++-||-|||.-.--. .+..|++       |+| .+.|  
T Consensus         5 kRVllKlSGEaL~g~~~~g~d~~~l~~ia~eI~~~~~~G~evaiVvGGGNi~RG~~~~~~gid-------r~~-aD~i--   74 (247)
T PRK12443          5 KRVLIKLSGGALADQTGNSFNSKRLEHIANEILSIVDLGIEVSIVIGGGNIFRGHLAEEWGID-------RVE-ADNI--   74 (247)
T ss_pred             CEEEEEEEHHHHCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHCCCC-------CCH-HHHH--
T ss_conf             789999738882899988879999999999999999779879999768850232236553987-------310-3688--


Q ss_pred             HHHHHHHHHHHHHH-HHHHH-CCCCEEEEEEEE
Q ss_conf             99997416689999-99982-899768778655
Q 537021.9.peg.3   98 VEMVLAGSINKKIV-SLINQ-TGTQAIGICGKD  128 (150)
Q Consensus        98 v~~vL~G~vn~~lv-~~L~~-~g~~avglsg~d  128 (150)
                       =| |+-.+|..+. +.|.+ .|+.+.-+|...
T Consensus        75 -GM-LaTvmNal~L~~~l~~~~g~~~rv~sai~  105 (247)
T PRK12443         75 -GT-LGTIINSLMLRGVLTSKTNKEVRVMTSIP  105 (247)
T ss_pred             -HH-HHHHHHHHHHHHHHHHHCCCCEEEEECCC
T ss_conf             -89-99999999999999975599469973233


No 49 
>TIGR01027 proB glutamate 5-kinase; InterPro: IPR005715    L-glutamate 5-phosphotransferase, (gamma-glutamyl kinase, proB, 2.7.2.11 from EC), catalyzes the first step in proline biosynthesis  ATP + L-glutamate = ADP + L-glutamate 5-phosphate. the product of which rapidly cyclises to 5-oxoproline and phosphate.    Bacterial ProB proteins hits the full length of this model, but the ProB-like domain of delta 1-pyrroline-5-carboxylate synthetase does not hit the C-terminal 100 residues. ; GO: 0004349 glutamate 5-kinase activity, 0006561 proline biosynthetic process, 0005737 cytoplasm.
Probab=95.92  E-value=0.07  Score=33.10  Aligned_cols=86  Identities=22%  Similarity=0.382  Sum_probs=59.8

Q ss_pred             CEEEEEECHHHHCCH--------HHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHHHHH
Q ss_conf             889998675672474--------689999999999851897179984572457889997089811557834661248999
Q 537021.9.peg.3   26 ETIVVKYGGHVMNCT--------DLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQTAEV   97 (150)
Q Consensus        26 ktiVIK~GG~~l~d~--------~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~tl~i   97 (150)
                      +++|||+|=|.|.++        .-+..++++|+.|+..|.++|||=-|.  |-.=++.+|++.+=.+   .+.++.+..
T Consensus         1 ~riVvK~GsS~Lt~~~g~h~l~~~~i~~lv~~v~~L~~~Gh~vviVSSGA--~AaG~~~LG~~~rP~~---la~KQAlAA   75 (379)
T TIGR01027         1 KRIVVKVGSSSLTGSSGSHELDRSRIAELVEQVAALHAAGHEVVIVSSGA--VAAGFEALGLPERPKT---LAEKQALAA   75 (379)
T ss_pred             CEEEEEEECCEEECCCCCCEECHHHHHHHHHHHHHHHHCCCEEEEEECCH--HHHHHHHCCCCCCCCC---HHHHHHHHH
T ss_conf             96899970321117888741377999999999999986599899981675--8862344556999862---567887873


Q ss_pred             HHHHHHHHHHH--HHHHHHHHCCCCE
Q ss_conf             99997416689--9999998289976
Q 537021.9.peg.3   98 VEMVLAGSINK--KIVSLINQTGTQA  121 (150)
Q Consensus        98 v~~vL~G~vn~--~lv~~L~~~g~~a  121 (150)
                           .||+.-  .=...++.+|.+.
T Consensus        76 -----VGQ~~Lm~~y~~~F~~Yg~~~   96 (379)
T TIGR01027        76 -----VGQVRLMQLYENLFSSYGLKV   96 (379)
T ss_pred             -----CCHHHHHHHHHHHHHHCCCCH
T ss_conf             -----145689999999997548822


No 50 
>PRK06635 aspartate kinase; Reviewed
Probab=95.87  E-value=0.1  Score=32.04  Aligned_cols=100  Identities=14%  Similarity=0.211  Sum_probs=61.6

Q ss_pred             CEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHHHHHHHHHHHHH
Q ss_conf             88999867567247468999999999985189717998457245788999708981155783466124899999997416
Q 537021.9.peg.3   26 ETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQTAEVVEMVLAGS  105 (150)
Q Consensus        26 ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~tl~iv~~vL~G~  105 (150)
                      +.+|.||||+-+.|++.++.+++-|..-...|.++++|.-+---++..|-.+.-+  +.+   --+++.++.+..  .|+
T Consensus         2 ~~iV~KFGGtSv~~~~~~~~v~~ii~~~~~~~~~~vvVvSA~~~~Td~Li~la~~--~~~---~~~~~e~d~ils--~GE   74 (402)
T PRK06635          2 ALIVQKFGGTSVGDVERIKNVAERVKAEVEAGHQVVVVVSAMGGTTDELLDLAKE--VSP---APDPRELDMLLS--TGE   74 (402)
T ss_pred             CEEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHH--HCC---CCCHHHHHHHHH--HHH
T ss_conf             7799957850369999999999999878872999899964999878999999998--607---998799999998--717


Q ss_pred             -HHHHH-HHHHHHCCCCEEEEEEEECCEE
Q ss_conf             -68999-9999828997687786558989
Q 537021.9.peg.3  106 -INKKI-VSLINQTGTQAIGICGKDGNMV  132 (150)
Q Consensus       106 -vn~~l-v~~L~~~g~~avglsg~dg~li  132 (150)
                       +...| ...|++.|++|.-+.+.+..++
T Consensus        75 ~~S~~l~a~~L~~~g~~a~~~~~~~~~i~  103 (402)
T PRK06635         75 QVSVALLAMALQSLGVKARSFTGWQAPII  103 (402)
T ss_pred             HHHHHHHHHHHHHCCCCCEEECHHHCCEE
T ss_conf             99999999999857998589147878553


No 51 
>TIGR02075 pyrH_bact uridylate kinase; InterPro: IPR015963   Uridylate kinases (also known as UMP kinases) are key enzymes in the synthesis of nucleoside triphosphates. They catalyse the reversible transfer of the gamma-phosphoryl group from an ATP donor to UMP, yielding UDP, which is the starting point for the synthesis of all other pyrimidine nucleotides. The eukaryotic enzyme has a dual specificity, phosphorylating both UMP and CMP, while the bacterial enzyme is specific to UMP. The bacterial enzyme shows no sequence similarity to the eukaryotic enzyme or other nucleoside monophosphate kinases, but rather appears to be part of the amino acid kinase family. It is dependent on magnesium for activity and is activated by GTP and repressed by UTP , . In many bacterial genomes, the gene tends to be located immediately downstream of elongation factor T and upstream of ribosome recycling factor. A related protein family, believed to be equivalent in function is found in the archaea and in spirochetes.   Structurally, the bacterial and archaeal proteins are homohexamers centred around a hollow nucleus and organised as a trimer of dimers , . Each monomer within the protein forms the amino acid kinase fold and can be divided into an N-terminal region which binds UMP and mediates intersubunit interactions within the dimer, and a C-terminal region which binds ATP and contains a mobile loop covering the active site. Inhibition of enzyme activity by UTP appears to be due to competition for the binding site for UMP, not allosteric inhibition as was previously suspected.   This entry represents the bacterial proteins..
Probab=95.76  E-value=0.2  Score=30.13  Aligned_cols=100  Identities=20%  Similarity=0.285  Sum_probs=73.6

Q ss_pred             CEEEEEECHHHHCC-H-------HHHHHHHHHHHHHHHCCCCEEEEECCCHHH-HHHHHHCCCCCEEECCCCCCCHHHHH
Q ss_conf             88999867567247-4-------689999999999851897179984572457-88999708981155783466124899
Q 537021.9.peg.3   26 ETIVVKYGGHVMNC-T-------DLSKDFVNDIALLKKSNITPVIVHGGGPQI-GAVLEKMGIKSKFENGLRITDQQTAE   96 (150)
Q Consensus        26 ktiVIK~GG~~l~d-~-------~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI-~~~l~~~gi~~~~v~G~RVTd~~tl~   96 (150)
                      |-+++|++|++|.+ +       ..++.++++|..++..|+.+=||=|||-=. ...+++.|++       | ...+-|-
T Consensus         2 kRvLlKLSGEaL~g~~~~f~ID~~~l~~iA~eik~~~~~G~evgIViGGGNIfRG~~~~~~Gi~-------R-~~aDymG   73 (236)
T TIGR02075         2 KRVLLKLSGEALAGKDSQFGIDPKRLNRIANEIKEVVKKGIEVGIVIGGGNIFRGKSAAELGID-------R-VSADYMG   73 (236)
T ss_pred             CEEEEEECCHHHCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHCCCC-------E-EECCCCC
T ss_conf             6489996104324788886426899999999999998679489999778278877889874896-------1-1143230


Q ss_pred             HHHHHHHHHHHH-HHHHHHHHCCCCEEEEE-EEECCEEEEEEC
Q ss_conf             999997416689-99999982899768778-655898999974
Q 537021.9.peg.3   97 VVEMVLAGSINK-KIVSLINQTGTQAIGIC-GKDGNMVFAEKA  137 (150)
Q Consensus        97 iv~~vL~G~vn~-~lv~~L~~~g~~avgls-g~dg~li~a~~~  137 (150)
                      ..    +--+|- -|.++|.+.|+++.=+| ...-+--.|++.
T Consensus        74 ML----ATviNglaL~~~L~~~g~~~~V~Sfai~~~~~i~E~Y  112 (236)
T TIGR02075        74 ML----ATVINGLALKDALEKLGLKTRVLSFAISMPKQICESY  112 (236)
T ss_pred             HH----HHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCCCCCC
T ss_conf             58----9999999999888736995689877753586413232


No 52 
>COG0527 LysC Aspartokinases [Amino acid transport and metabolism]
Probab=95.64  E-value=0.18  Score=30.41  Aligned_cols=107  Identities=12%  Similarity=0.153  Sum_probs=69.6

Q ss_pred             EEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHC-CC-CC---------------EEECCC--
Q ss_conf             899986756724746899999999998518971799845724578899970-89-81---------------155783--
Q 537021.9.peg.3   27 TIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKM-GI-KS---------------KFENGL--   87 (150)
Q Consensus        27 tiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~-gi-~~---------------~~v~G~--   87 (150)
                      .+|.||||+-+.|.+-+...++=+.....-|.++|+|--+..-++..|-++ .- .+               ++....  
T Consensus         3 ~iV~KFGGTSva~~e~i~~va~iv~~~~~~g~~vVVVvSA~~~vTd~Lv~~a~~~~~~~~~~~~~~~~~~~~el~~~~~~   82 (447)
T COG0527           3 LIVQKFGGTSVADAERILRVADIVKEDSEEGVKVVVVVSAMGGVTDLLVALAEGAESGRDAVAEQRHRDIASELILDPFI   82 (447)
T ss_pred             EEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCHH
T ss_conf             08998687553888999999999985463397189997898774589999986331445205788899999987415104


Q ss_pred             ---------------------CCCCHHHHHHHHHHHHHHHHHHH-HHHHHHCCCCEEEEEEEECCEEEE
Q ss_conf             ---------------------46612489999999741668999-999982899768778655898999
Q 537021.9.peg.3   88 ---------------------RITDQQTAEVVEMVLAGSINKKI-VSLINQTGTQAIGICGKDGNMVFA  134 (150)
Q Consensus        88 ---------------------RVTd~~tl~iv~~vL~G~vn~~l-v~~L~~~g~~avglsg~dg~li~a  134 (150)
                                           +-.++..++.+.-. .-+++..| .+.|+..|++|.-+++.+..+++.
T Consensus        83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~D~lls~-GE~~Sa~lla~~L~~~Gv~A~~~~~~~~~i~t~  150 (447)
T COG0527          83 AARLAEVIAEFKKVLLGIALLGEVSPRERDELLSL-GERLSAALLAAALNALGVDARSLDGRQAGIATD  150 (447)
T ss_pred             HHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHH-CHHHHHHHHHHHHHHCCCCEEEECHHHCEEEEC
T ss_conf             56676666666676666664147987899999850-548899999999996799659975588425535


No 53 
>COG5630 ARG2 Acetylglutamate synthase [Amino acid transport and metabolism]
Probab=95.35  E-value=0.013  Score=37.85  Aligned_cols=124  Identities=9%  Similarity=-0.069  Sum_probs=91.1

Q ss_pred             HHHHHHHHHHCC----CEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCC
Q ss_conf             999999998759----8899986756724746899999999998518971799845724578899970898115578346
Q 537021.9.peg.3   14 EQVLPFVQFYEN----ETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRI   89 (150)
Q Consensus        14 ~ea~pYi~~~r~----ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RV   89 (150)
                      |+.+-|-..++.    ..-+.++++-+..++-  ....+.+..+...+.-+++-|||+++-+..+-..|.+  ..+| |+
T Consensus       125 r~i~~~~vfs~~~~~~r~~~s~v~~~v~~~~~--~e~t~s~f~~~~v~~t~ivl~~~~~p~i~e~~i~gn~--i~~G-r~  199 (495)
T COG5630         125 RTGLYHSVFSQESRFFDIGNSNFIPIVKPYVY--NEETASEFMTKDVVKFMDCLCQGNIPHIDKFFILNNA--IPSG-RN  199 (495)
T ss_pred             HHHHHHHHHHHHCCCCCCCCCCEECCCCCCCC--HHHHHHHHHHHHHHHEEEEECCCCCCCCEEEEECCCC--CCCC-CC
T ss_conf             32233110232200024432312035588750--3566778999764001354247888653025424650--3456-55


Q ss_pred             CCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEEECCEEEEEECCCCCCCCCCCC
Q ss_conf             61248999999974166899999998289976877865589899997475211179951
Q 537021.9.peg.3   90 TDQQTAEVVEMVLAGSINKKIVSLINQTGTQAIGICGKDGNMVFAEKARHSLRLSPNTK  148 (150)
Q Consensus        90 Td~~tl~iv~~vL~G~vn~~lv~~L~~~g~~avglsg~dg~li~a~~~~~~lg~vg~~~  148 (150)
                      |.++++. +..++ ..++..+...++.++..+-+.++   .++.+.- +.+.+.+++++
T Consensus       200 ~~~h~~~-~~~~~-~~~~~~l~~~~~~~~kR~t~s~~---~l~~~~~-~~de~~~~e~~  252 (495)
T COG5630         200 DNAHVFI-LSQEL-EHLSSSLSHNISTLTKREPRSQN---LLHRMEV-KKDEISSLECE  252 (495)
T ss_pred             CCCCHHH-HHHHH-HHHHHHHHHCCCCCCCCCCCCCC---HHHHHHH-HHCCCCCCCCC
T ss_conf             6621012-78888-88788876246635455766433---0233232-21100004665


No 54 
>cd04234 AAK_AK AAK_AK: Amino Acid Kinase Superfamily (AAK), Aspartokinase (AK); this CD includes the N-terminal catalytic domain of aspartokinase (4-L-aspartate-4-phosphotransferase;). AK is the first enzyme in the biosynthetic pathway of the aspartate family of amino acids (lysine, threonine, methionine, and isoleucine) and the bacterial cell wall component, meso-diaminopimelate. It also catalyzes the conversion of aspartate and ATP to aspartylphosphate and ADP. One mechanism for the regulation of this pathway is by the production of several isoenzymes of aspartokinase with different repressors and allosteric inhibitors. Pairs of ACT domains are proposed to specifically bind amino acids leading to allosteric regulation of the enzyme. In Escherichia coli, three different aspartokinase isoenzymes are regulated specifically by lysine, methionine, and threonine. AK-HSDHI (ThrA) and AK-HSDHII (MetL) are bifunctional enzymes that consist of an N-terminal AK and a C-terminal homoserine dehyd
Probab=95.34  E-value=0.22  Score=29.93  Aligned_cols=39  Identities=21%  Similarity=0.300  Sum_probs=29.5

Q ss_pred             EEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECC
Q ss_conf             8999867567247468999999999985189717998457
Q 537021.9.peg.3   27 TIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGG   66 (150)
Q Consensus        27 tiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGg   66 (150)
                      .+|.||||+-+.|++.++.+++-|... .-+.++|+|.-+
T Consensus         1 ~~V~KFGGtSv~~~~~i~~v~~ii~~~-~~~~~~vVVvSA   39 (227)
T cd04234           1 MVVQKFGGTSVASAERIKRVADIIKAY-EKGNRVVVVVSA   39 (227)
T ss_pred             CEEEEECCCCCCCHHHHHHHHHHHHHH-HCCCCEEEEECC
T ss_conf             989967830179999999999999865-438988999738


No 55 
>cd07041 STAS_RsbR_RsbS_like Sulphate Transporter and Anti-Sigma factor antagonist domain of the "stressosome" complex proteins RsbS and RsbR, regulators of the bacterial stress activated alternative sigma factor sigma-B by phosphorylation. The STAS (Sulphate Transporter and Anti-Sigma factor antagonist) domain of proteins related to RsbS and RsbR which are part of the "stressosome" complex that plays an important role in the regulation of the bacterial stress activated alternative sigma factor sigma-B. During stress conditions RsbS and RsbR are phosphorylated which leads to the release of RsbT, an activator of of the RsbU phosphatase, which in turn activates RsbV which leads to the release and activation of sigma factor B. RsbS is a single domain protein (STAS domain), while RsbR-like proteins have a well-conserved C-terminal STATS domain and a variable N-terminal domain. The STAS domain is also found in the C- terminal region of sulphate transporters and anti-anti-sigma factors.
Probab=95.01  E-value=0.18  Score=30.52  Aligned_cols=77  Identities=14%  Similarity=0.259  Sum_probs=65.7

Q ss_pred             HHHHCCCHHHH-HHHHHHHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCE
Q ss_conf             23220622899-9999998759889998675672474689999999999851897179984572457889997089811
Q 537021.9.peg.3    5 IYQFQAEILEQ-VLPFVQFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSK   82 (150)
Q Consensus         5 ~~~~~a~~l~e-a~pYi~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~   82 (150)
                      +...+++.+.| .+.++.+.+-+++++-+.|-...|+.....+.+-...++.+|.+++++ |-+|++-+.+-+.|++..
T Consensus        20 ld~~r~~~l~e~ll~~i~~~~~~~viiDlsGv~~iD~~~~~~L~~l~~~~~LlG~~~il~-Gi~P~vA~~lv~~g~~l~   97 (109)
T cd07041          20 LDDERAEQLQERLLEAISRRRARGVIIDLTGVPVIDSAVARHLLRLARALRLLGARTILT-GIRPEVAQTLVELGIDLS   97 (109)
T ss_pred             ECHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEE-ECCHHHHHHHHHHCCCCC
T ss_conf             859999999999999999729889999745886377899999999999999639869999-439999999999299803


No 56 
>KOG1154 consensus
Probab=94.20  E-value=0.078  Score=32.79  Aligned_cols=98  Identities=20%  Similarity=0.260  Sum_probs=60.0

Q ss_pred             CCCEEEEEECHHHHCCHH-------HHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCH---H
Q ss_conf             598899986756724746-------899999999998518971799845724578899970898115578346612---4
Q 537021.9.peg.3   24 ENETIVVKYGGHVMNCTD-------LSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQ---Q   93 (150)
Q Consensus        24 r~ktiVIK~GG~~l~d~~-------l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~---~   93 (150)
                      +-+++|||+|-+++.++.       -+..++++++.|+..|.+++||--|+-...  ++++.++-..-.-.|-|-+   +
T Consensus         8 ka~rIVVKLGSavit~e~~~~laLgrla~IVEqV~~L~~~G~evilVSSGaVA~G--~qrLr~~~~~s~s~r~~l~~~~~   85 (285)
T KOG1154           8 KAYRIVVKLGSAVITREDTCGLALGRLASIVEQVSELQRMGREVILVSSGAVAFG--RQRLRQELLPSSSMRQTLKPQSE   85 (285)
T ss_pred             CCEEEEEEECCEEEECCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEEECCHHHHH--HHHHHHHHCCCHHHHHHHCCCCC
T ss_conf             0049999814468877987540478999999999999856855999923525550--77751020343567875267541


Q ss_pred             HHHH-HHHHHHHH--HHHHHHHHHHHCCCCEEEE
Q ss_conf             8999-99997416--6899999998289976877
Q 537021.9.peg.3   94 TAEV-VEMVLAGS--INKKIVSLINQTGTQAIGI  124 (150)
Q Consensus        94 tl~i-v~~vL~G~--vn~~lv~~L~~~g~~avgl  124 (150)
                      +.+. +..+ .|+  +-..-...+.+.|+.+..+
T Consensus        86 l~e~rA~AA-vGQ~~Lmalye~lF~Qy~~~~AQv  118 (285)
T KOG1154          86 LAEKRACAA-VGQSGLMALYETLFTQYGITIAQV  118 (285)
T ss_pred             HHHHHHHHH-HCCCHHHHHHHHHHHHHCCEEEEE
T ss_conf             146788887-275218999999999837530035


No 57 
>PRK06291 aspartate kinase; Provisional
Probab=92.59  E-value=1  Score=25.68  Aligned_cols=50  Identities=14%  Similarity=0.210  Sum_probs=36.0

Q ss_pred             EEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHH
Q ss_conf             89998675672474689999999999851897179984572457889997
Q 537021.9.peg.3   27 TIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEK   76 (150)
Q Consensus        27 tiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~   76 (150)
                      .+|.||||+-+.|.+.+..+++=|..-..-|.++|+|.-+=.-++..|-+
T Consensus         2 ~iV~KFGGTSva~~~~i~~v~~iI~~~~~~~~~~vVVvSA~~gvTd~L~~   51 (466)
T PRK06291          2 RLVMKFGGTSVGDGERIRHVAKLVKKYREEGNEVVVVVSAMTGVTDRLLE   51 (466)
T ss_pred             CEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHH
T ss_conf             78997384237989999999999876776589899996799997899999


No 58 
>cd04259 AAK_AK-DapDC AAK_AK-DapDC: Amino Acid Kinase Superfamily (AAK), AK-DapDC; this CD includes the N-terminal catalytic aspartokinase (AK) domain of the bifunctional enzyme AK - DAP decarboxylase (DapDC) found in some bacteria. Aspartokinase is the first enzyme in the aspartate metabolic pathway, catalyzes the conversion of aspartate and ATP to aspartylphosphate and ADP. DapDC, which is the lysA gene product, catalyzes the decarboxylation of DAP to lysine.
Probab=89.65  E-value=1.1  Score=25.54  Aligned_cols=47  Identities=19%  Similarity=0.232  Sum_probs=32.5

Q ss_pred             EEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHH
Q ss_conf             99986756724746899999999998518971799845724578899
Q 537021.9.peg.3   28 IVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVL   74 (150)
Q Consensus        28 iVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l   74 (150)
                      +|.||||+-+.|.+.++.+++=|..-...|.++|+|--+-.-++..|
T Consensus         2 iV~KFGGTSv~~~~~~~~v~~ii~~~~~~~~~~vvVVSA~~gvTd~L   48 (295)
T cd04259           2 VVLKFGGTSVSSRARWDTIAKLAQKHLNTGGQPLIVCSALSGISNKL   48 (295)
T ss_pred             EEEEECHHCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCHHH
T ss_conf             89976801058799999999999988747999899979999983689


No 59 
>cd06844 STAS Sulphate Transporter and Anti-Sigma factor antagonist domain found in the C-terminal region of sulphate transporters as well as in bacterial and archaeal proteins involved in the regulation of sigma factors. The STAS (Sulphate Transporter and Anti-Sigma factor antagonist) domain is found in the C-terminal region of sulphate transporters as well as in bacterial and archaeal proteins involved in the regulation of sigma factors, like anti-anti-sigma factors and "stressosome" components. The sigma factor regulators are involved in protein-protein interaction which is regulated by phosphorylation.
Probab=87.85  E-value=2.5  Score=23.07  Aligned_cols=74  Identities=15%  Similarity=0.160  Sum_probs=62.0

Q ss_pred             HHCCCHHHHHH-HHHHHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCC
Q ss_conf             22062289999-9999875988999867567247468999999999985189717998457245788999708981
Q 537021.9.peg.3    7 QFQAEILEQVL-PFVQFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKS   81 (150)
Q Consensus         7 ~~~a~~l~ea~-pYi~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~   81 (150)
                      -..|+-|++.+ +++.+...+++|+-+.|-...|..-...++.-...++..|.++++ .|-.|++.+.++..|+.-
T Consensus        20 ~~ta~~lr~~l~~~i~~~~~~~iVvDls~v~f~DSsGl~~Ll~~~k~~~~~Gg~l~l-~~~~p~V~~vf~~tGldk   94 (100)
T cd06844          20 HHSVEQFKEELLHNITNVAGKTIVIDISALEFMDSSGTGVLLERSRLAEAVGGQFVL-TGISPAVRITLTESGLDK   94 (100)
T ss_pred             HHHHHHHHHHHHHHHHHCCCCEEEEECCCCEEECHHHHHHHHHHHHHHHHCCCEEEE-ECCCHHHHHHHHHHCCCC
T ss_conf             888999999999999738997899987688188478999999999999966998999-839989999999948685


No 60 
>cd04244 AAK_AK-LysC-like AAK_AK-LysC-like: Amino Acid Kinase Superfamily (AAK), AK-LysC-like; this CD includes the N-terminal catalytic aspartokinase (AK) domain of the lysine-sensitive AK isoenzyme found in higher plants. The lysine-sensitive AK isoenzyme is a monofunctional protein. It is involved in the overall regulation of the aspartate pathway and can be synergistically inhibited by S-adenosylmethionine. Also included in this CD is an uncharacterized LysC-like AK found in Euryarchaeota and some bacteria. AK catalyzes the conversion of aspartate and ATP to aspartylphosphate and ADP.
Probab=87.61  E-value=1.6  Score=24.44  Aligned_cols=48  Identities=15%  Similarity=0.194  Sum_probs=32.1

Q ss_pred             EEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHH
Q ss_conf             9998675672474689999999999851897179984572457889997
Q 537021.9.peg.3   28 IVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEK   76 (150)
Q Consensus        28 iVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~   76 (150)
                      +|.||||+-+.|.+.+.++++ |..-..-+.++|+|.-+=--++..|-+
T Consensus         2 iV~KFGGtSva~~~~i~~v~~-II~~~~~~~~~vVVVSA~~gvTn~L~~   49 (298)
T cd04244           2 LVMKFGGTSVGSAERIRHVAD-LVGTYAEGHEVVVVVSAMGGVTDRLLL   49 (298)
T ss_pred             EEEEECCCCCCCHHHHHHHHH-HHHHHCCCCCEEEEECCCCCCHHHHHH
T ss_conf             579849101189999999999-985546699989997799998899999


No 61 
>pfam00752 XPG_N XPG N-terminal domain.
Probab=87.30  E-value=1.7  Score=24.26  Aligned_cols=52  Identities=23%  Similarity=0.373  Sum_probs=36.4

Q ss_pred             HHHHCCCEEEEEECH--------------HHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHH
Q ss_conf             998759889998675--------------6724746899999999998518971799845724578
Q 537021.9.peg.3   20 VQFYENETIVVKYGG--------------HVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIG   71 (150)
Q Consensus        20 i~~~r~ktiVIK~GG--------------~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~   71 (150)
                      +..|+||++-|-..-              ..+.+......|+..+..|...|++||.|+-|.|--.
T Consensus        20 l~~~~Gk~vAID~s~wl~q~~~~~~~~~~~~~~~~~~l~~~~~ri~~l~~~~I~PvfVFDG~~p~~   85 (100)
T pfam00752        20 IEALEGKTLAIDASIWLYQFLKAVRDQLGNALQNTSHLMGFFSRLCRLKDFGIKPIFVFDGGPPPL   85 (100)
T ss_pred             HHHHCCCEEEECCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCEEEEEECCCCCHH
T ss_conf             999689879977199999999997023137888828999999999999988998999878999663


No 62 
>PRK08961 bifunctional aspartate kinase/diaminopimelate decarboxylase protein; Provisional
Probab=86.72  E-value=1.9  Score=23.82  Aligned_cols=54  Identities=13%  Similarity=0.140  Sum_probs=40.1

Q ss_pred             HCCCEEEEEECHHHHCCHHHHHHHHHHHH-HHHHCCCCEEEEECCCHHHHHHHHH
Q ss_conf             75988999867567247468999999999-9851897179984572457889997
Q 537021.9.peg.3   23 YENETIVVKYGGHVMNCTDLSKDFVNDIA-LLKKSNITPVIVHGGGPQIGAVLEK   76 (150)
Q Consensus        23 ~r~ktiVIK~GG~~l~d~~l~~~~~~dI~-ll~~~G~k~ViVHGgg~qI~~~l~~   76 (150)
                      +-.+.+|.||||+-+.+.+-++++++-|. .+...|.++|+|.-+---++..|.+
T Consensus         5 ~~~~wVV~KFGGTSVas~~~~~~Va~iI~~~~~~~g~rvvVVvSAm~gvTn~L~~   59 (865)
T PRK08961          5 SADRWVVLKFGGTSVSRRHRWDTIAKLARKRLNETGGRVLVVVSALSGVTNELTA   59 (865)
T ss_pred             CCCCEEEECCCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHH
T ss_conf             8898799862846689899999999999999985099889998599986389999


No 63 
>COG1366 SpoIIAA Anti-anti-sigma regulatory factor (antagonist of anti-sigma factor) [Signal transduction mechanisms]
Probab=83.95  E-value=4  Score=21.84  Aligned_cols=76  Identities=18%  Similarity=0.226  Sum_probs=63.5

Q ss_pred             HHCCCHHHHHHH-HHHHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEE
Q ss_conf             220622899999-99987598899986756724746899999999998518971799845724578899970898115
Q 537021.9.peg.3    7 QFQAEILEQVLP-FVQFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKF   83 (150)
Q Consensus         7 ~~~a~~l~ea~p-Yi~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~   83 (150)
                      -..|+.|+|... ++..-..+.+||-+.+-...|..-...++.-...++..|.+++++ |-.|++.+.++..|+...|
T Consensus        25 ~~~a~~~~e~~~~~~~~~~~~~ivIDls~v~~~dS~gl~~L~~~~~~~~~~g~~~~l~-~i~p~v~~~~~~~gl~~~~  101 (117)
T COG1366          25 AARAPALKETLLEVIAASGARGLVIDLSGVDFMDSAGLGVLVALLKSARLRGVELVLV-GIQPEVARTLELTGLDKSF  101 (117)
T ss_pred             HHHHHHHHHHHHHHHHHCCCCEEEEECCCCCEECCHHHHHHHHHHHHHHHCCCCEEEE-ECCHHHHHHHHHHCCCCEE
T ss_conf             8889999999999997469977999888887745458999999999998669948999-2999999999993974125


No 64 
>PRK00192 mannosyl-3-phosphoglycerate phosphatase; Reviewed
Probab=82.52  E-value=3.4  Score=22.24  Aligned_cols=67  Identities=15%  Similarity=0.215  Sum_probs=45.8

Q ss_pred             CCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHH-HHHHHHHCCCCCEEE--CCCCCC
Q ss_conf             5988999867567247468999999999985189717998457245-788999708981155--783466
Q 537021.9.peg.3   24 ENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQ-IGAVLEKMGIKSKFE--NGLRIT   90 (150)
Q Consensus        24 r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~q-I~~~l~~~gi~~~~v--~G~RVT   90 (150)
                      .-+.|+.=+.|..+.+........+.|..|+..|+++|+.-|=.+. +...++++|+...|+  ||-+|-
T Consensus         5 ~~~LI~tDlDGTLL~~~~~~~~~~~ai~~l~~~Gi~vviaTGR~~~~~~~~~~~l~l~~~~I~~NGa~I~   74 (275)
T PRK00192          5 SKLLVFTDLDGTLLDHTYDWEPAKPTLKALKEKGIPVIPCTSKTAAEVEVLRAELGLEDPFIVENGAAIY   74 (275)
T ss_pred             CCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHHCCCCEEEEECCCEEE
T ss_conf             8469999485778489998368999999999789999996898899999999995899749996795899


No 65 
>smart00485 XPGN Xeroderma pigmentosum G N-region. domain in nucleases
Probab=81.90  E-value=4.2  Score=21.68  Aligned_cols=52  Identities=13%  Similarity=0.179  Sum_probs=35.7

Q ss_pred             HHHHCCCEEEEEECHHH--------------HCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHH
Q ss_conf             99875988999867567--------------24746899999999998518971799845724578
Q 537021.9.peg.3   20 VQFYENETIVVKYGGHV--------------MNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIG   71 (150)
Q Consensus        20 i~~~r~ktiVIK~GG~~--------------l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~   71 (150)
                      +..|+|+++.|=...=.              ..+......|+.-+..|...|+.||.|+.|.|--.
T Consensus        19 l~~l~gk~vAIDas~wl~k~~~~~~~~~~~~~~~~~~l~~~~~ri~~l~~~~I~PIfVFDG~~~~~   84 (99)
T smart00485       19 LEALRGKTLAIDASIWLYQFLTACREKLGTPLPNSKHLMGLFYRTCRLLEFGIKPIFVFDGKPPPL   84 (99)
T ss_pred             HHHHCCCEEEEECHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEEECCCCCHH
T ss_conf             999699879991589999999997100157777719999999999999987998999888999765


No 66 
>TIGR00657 asp_kinases aspartate kinase; InterPro: IPR001341   Bacteria, plants and fungi metabolise aspartic acid to produce four amino acids - lysine, threonine, methionine and isoleucine - in a series of reactions known as the aspartate pathway. Additionally, several important metabolic intermediates are produced by these reactions, such as diaminopimelic acid, an essential component of bacterial cell wall biosynthesis, and dipicolinic acid, which is involved in sporulation in Gram-positive bacteria. Members of the animal kingdom do not posses this pathway and must therefore acquire these essential amino acids through their diet. Research into improving the metabolic flux through this pathway has the potential to increase the yield of the essential amino acids in important crops, thus improving their nutritional value. Additionally, since the enzymes are not present in animals, inhibitors of them are promising targets for the development of novel antibiotics and herbicides. For more information see .   Aspartate kinase (2.7.2.4 from EC) (AK) catalyzes the first reaction in the aspartate pathway; the phosphorylation of aspartate. The product of this reaction can then be used in the biosynthesis of lysine or in the pathway leading to homoserine, which participates in the biosynthesis of threonine, isoleucine and methionine .   In bacteria there are three different aspartate kinase isozymes which differ in sensitivity to repression and inhibition by Lys, Met and Thr. AK1 and AK2 are bifunctional enzymes which both consist of an N-terminal AK domain and a C-terminal homoserine dehydrogenase domain. AK1 is involved in threonine biosynthesis and AK2, in that of methionine. The third isozyme, AK3 is monofunctional and involved in lysine synthesis. In archaea and plants there may be a single isozyme of AK which in plants is multifunctional.   This entry represents a region encoding aspartate kinase activity found in both the monofunctional and bifunctional enzymes.   Synonym(s): Aspartokinase; GO: 0004072 aspartate kinase activity, 0008652 amino acid biosynthetic process.
Probab=79.81  E-value=4.1  Score=21.75  Aligned_cols=52  Identities=17%  Similarity=0.205  Sum_probs=45.4

Q ss_pred             CEEEEEECHHHHCCH--HHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHC
Q ss_conf             889998675672474--6899999999998518971799845724578899970
Q 537021.9.peg.3   26 ETIVVKYGGHVMNCT--DLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKM   77 (150)
Q Consensus        26 ktiVIK~GG~~l~d~--~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~   77 (150)
                      ..+|.||||.-+.+.  +-+.+.++=|..-...|.++|+|-=+-.-++.+|-++
T Consensus         2 ~~iV~KFGGTSVg~~PverI~~~A~~v~~~~~~g~~vvVVVSA~~g~Td~Lv~~   55 (504)
T TIGR00657         2 MLIVQKFGGTSVGNAPVERIRRVAKIVLKEKEKGNQVVVVVSAMAGVTDALVEL   55 (504)
T ss_pred             CEEEEEECCEEECCCCHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCHHHHHH
T ss_conf             747997367576673237899999988887535897389997377533689999


No 67 
>cd04243 AAK_AK-HSDH-like AAK_AK-HSDH-like: Amino Acid Kinase Superfamily (AAK), AK-HSDH-like; this family includes the N-terminal catalytic domain of aspartokinase (AK) of the bifunctional enzyme AK- homoserine dehydrogenase (HSDH). These aspartokinases are found in such bacteria as E. coli (AKI-HSDHI, ThrA  and  AKII-HSDHII, MetL) and in higher plants (Z. mays AK-HSDH). AK and HSDH are the first and third enzymes in the biosynthetic pathway of the aspartate family of amino acids. AK catalyzes the phosphorylation of Asp to P-aspartyl phosphate. HSDH catalyzes the NADPH-dependent conversion of Asp 3-semialdehyde to homoserine. ThrA and MetL are involved in threonine and methionine biosynthesis, respectively. In E. coli, ThrA is subject to allosteric regulation by the end product L-threonine and the native enzyme is reported to be tetrameric. As with bacteria, plant AK and HSDH are feedback inhibited by pathway end products. Maize AK-HSDH is a Thr-sensitive 180-kD enzyme. Arabidopsis AK-
Probab=77.88  E-value=5.9  Score=20.72  Aligned_cols=47  Identities=15%  Similarity=0.234  Sum_probs=30.4

Q ss_pred             EEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHH
Q ss_conf             9998675672474689999999999851897179984572457889997
Q 537021.9.peg.3   28 IVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEK   76 (150)
Q Consensus        28 iVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~   76 (150)
                      .|.||||+-+.|++.+.++++ |.. +..+.++|+|.-+=--++..|.+
T Consensus         2 kV~KFGGtSv~~~~~i~~v~~-iI~-~~~~~~~vVVVSA~~gvTd~L~~   48 (293)
T cd04243           2 KVLKFGGTSVASAERIRRVAD-IIK-SRASSPVLVVVSALGGVTNRLVA   48 (293)
T ss_pred             EEEEECCCCCCCHHHHHHHHH-HHH-HCCCCCEEEEECCCCCCHHHHHH
T ss_conf             899878123699999999999-998-37798959998199886899999


No 68 
>cd04257 AAK_AK-HSDH AAK_AK-HSDH: Amino Acid Kinase Superfamily (AAK), AK-HSDH; this CD includes the N-terminal catalytic domain of aspartokinase (AK) of the bifunctional enzyme AK - homoserine dehydrogenase (HSDH). These aspartokinases are found in bacteria (E. coli AKI-HSDHI, ThrA  and E. coli AKII-HSDHII, MetL) and higher plants (Z. mays AK-HSDH). AK and HSDH are the first and third enzymes in the biosynthetic pathway of the aspartate family of amino acids. AK catalyzes the phosphorylation of Asp to P-aspartyl phosphate. HSDH catalyzes the NADPH-dependent conversion of Asp 3-semialdehyde to homoserine. ThrA and MetL are involved in threonine and methionine biosynthesis, respectively. In E. coli, ThrA is subject to allosteric regulation by the end product L-threonine and the native enzyme is reported to be tetrameric. As with bacteria, plant AK and HSDH are feedback inhibited by pathway end products. Maize AK-HSDH is a Thr-sensitive 180-kD enzyme. Arabidopsis AK-HSDH is an alanine-act
Probab=77.66  E-value=6.2  Score=20.58  Aligned_cols=48  Identities=17%  Similarity=0.262  Sum_probs=28.9

Q ss_pred             EEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHH
Q ss_conf             9998675672474689999999999851897179984572457889997
Q 537021.9.peg.3   28 IVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEK   76 (150)
Q Consensus        28 iVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~   76 (150)
                      -|.||||+-+.|.+.+.++++ |..-+.-+-++|+|--+=--++..|.+
T Consensus         2 kV~KFGGtSv~~~~~i~~v~~-iI~~~~~~~~~vVVVSA~ggvTd~L~~   49 (294)
T cd04257           2 KVLKFGGTSLANAERIRRVAD-IILNAAKQEQVAVVVSAPGKVTDLLLE   49 (294)
T ss_pred             EEEEECCCCCCCHHHHHHHHH-HHHHHCCCCCEEEEECCCCCCHHHHHH
T ss_conf             899878111589999999999-997532489989998099986799999


No 69 
>TIGR01283 nifE nitrogenase MoFe cofactor biosynthesis protein NifE; InterPro: IPR005973    The enzyme responsible for nitrogen fixation, the nitrogenase, shows a high degree of conservation of structure, function, and amino acid sequence across wide phylogenetic ranges. All known Mo-nitrogenases consist of two components, component I (also called dinitrogenase, or Fe-Mo protein), an alpha2beta2 tetramer encoded by the nifD and nifK genes, and component II (dinitrogenase reductase, or Fe protein) a homodimer encoded by the nifH gene. Two operons, nifDK and nifEN, encode a tetrameric (alpha2beta2 and N2E2) enzymatic complex. Nitrogenase contains two unusual rare metal clusters; one of them is the iron molybdenum cofactor (FeMo-co), which is considered to be the site of dinitrogen reduction and whose biosynthesis requires the products of nifNE and of some other nif genes. It has been proposed that NifNE might serve as a scaffold upon which FeMo-co is built and then inserted into component I. ; GO: 0005515 protein binding, 0006461 protein complex assembly, 0009399 nitrogen fixation.
Probab=77.47  E-value=6.6  Score=20.44  Aligned_cols=72  Identities=19%  Similarity=0.302  Sum_probs=50.5

Q ss_pred             CCCHHHHHHHHHHHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCC
Q ss_conf             06228999999998759889998675672474689999999999851897179984572457889997089811557834
Q 537021.9.peg.3    9 QAEILEQVLPFVQFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLR   88 (150)
Q Consensus         9 ~a~~l~ea~pYi~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~R   88 (150)
                      -+.+=.+-.||=.+.+||..+|-.||.+    +.+ ++   |+.|+.+|+.+|++   |.|=+..-++..|....-+|..
T Consensus       321 ~~~~~~~l~~yR~~L~GKkaaiY~GGa~----Ksw-Sl---v~Al~dLGMeVV~~---GTqkg~~EDy~~I~e~~~~~~~  389 (470)
T TIGR01283       321 EAKIRPELEPYRERLKGKKAAIYTGGAV----KSW-SL---VSALQDLGMEVVAT---GTQKGTEEDYERIRELMGEGTV  389 (470)
T ss_pred             HHHHHHHHHHHHHHHCCCEEEEECCCHH----HHH-HH---HHHHHHCCCEEEEE---EECCCCHHHHHHHHHHHCCCCE
T ss_conf             9999999999899734988998658647----899-99---98884579179998---3007988899999997079967


Q ss_pred             CCC
Q ss_conf             661
Q 537021.9.peg.3   89 ITD   91 (150)
Q Consensus        89 VTd   91 (150)
                      +=|
T Consensus       390 m~D  392 (470)
T TIGR01283       390 MLD  392 (470)
T ss_pred             EEE
T ss_conf             862


No 70 
>PRK08373 aspartate kinase; Validated
Probab=75.53  E-value=7.4  Score=20.10  Aligned_cols=100  Identities=14%  Similarity=0.176  Sum_probs=51.0

Q ss_pred             CCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCC----------------------CCC
Q ss_conf             5988999867567247468999999999985189717998457245788999708----------------------981
Q 537021.9.peg.3   24 ENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMG----------------------IKS   81 (150)
Q Consensus        24 r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~g----------------------i~~   81 (150)
                      ..|.+|.||||+-+.+.  ++..+ +|..-..-+.++|+|.-+-.-++..|-++.                      +++
T Consensus         2 ~~km~V~KFGGTSV~~a--~~~v~-~iv~~~~~~~~vvVVVSAm~gvTd~Li~~a~~~~~~~~~I~~~h~~~a~~l~~~~   78 (359)
T PRK08373          2 VGKMIVVKFGGSSVRYD--FEEAV-ELVKSLSEEKEVIVVVSALKGVTDCLLKYANTDKSAALEIEKIHEEFAKEHGIDT   78 (359)
T ss_pred             CCCCEEEEECHHCCHHH--HHHHH-HHHHHHHCCCCEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCH
T ss_conf             88879984281013328--99999-9999875289889996799997799999997657778999999999999836667


Q ss_pred             EE--------ECCCCCCCHHHHHHHHHHHHHHHHHHH-HHHHHHCCCCEEEEEEE
Q ss_conf             15--------578346612489999999741668999-99998289976877865
Q 537021.9.peg.3   82 KF--------ENGLRITDQQTAEVVEMVLAGSINKKI-VSLINQTGTQAIGICGK  127 (150)
Q Consensus        82 ~~--------v~G~RVTd~~tl~iv~~vL~G~vn~~l-v~~L~~~g~~avglsg~  127 (150)
                      +.        .+.....++...+.+ ..+.-.+...| .+.|++.|++|.-+.+.
T Consensus        79 ~~l~~~l~~l~~~~~~~~~~~~D~I-ls~GE~lSa~Lla~~L~~~Gi~a~~lda~  132 (359)
T PRK08373         79 EILSPYLKELFNRPDFPSEALRDYI-LSIGERLSAVIFAEGLKILGINGKVVDPW  132 (359)
T ss_pred             HHHHHHHHHHHHHCCCCCHHHHHHH-HHHHHHHHHHHHHHHHHHCCCCCEEECHH
T ss_conf             7889999998731138845766667-76739999999999999779980997548


No 71 
>pfam01740 STAS STAS domain. The STAS (after Sulphate Transporter and AntiSigma factor antagonist) domain is found in the C terminal region of Sulphate transporters and bacterial antisigma factor antagonists. It has been suggested that this domain may have a general NTP binding function.
Probab=75.27  E-value=7.5  Score=20.06  Aligned_cols=73  Identities=19%  Similarity=0.274  Sum_probs=54.4

Q ss_pred             CCHHHHHH-HHHHHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEE
Q ss_conf             62289999-999987598899986756724746899999999998518971799845724578899970898115
Q 537021.9.peg.3   10 AEILEQVL-PFVQFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKF   83 (150)
Q Consensus        10 a~~l~ea~-pYi~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~   83 (150)
                      |+-|++.. ..+..-.-+.+|+-+.+--..|......+..-...++..|.++.++ |-.|++.+.++..|+...|
T Consensus        24 a~~~~~~l~~~~~~~~~~~vvlD~~~v~~iDssgl~~l~~~~~~~~~~g~~l~l~-~~~~~v~~~l~~~gl~~~~   97 (106)
T pfam01740        24 AEAFRERLLRAIEEGEIRHVILDLSGVPFIDSSGLGALLELYKELRRRGVELVLV-GPSPEVRRTLEKTGLDDII   97 (106)
T ss_pred             HHHHHHHHHHHHHCCCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCEEEEE-CCCHHHHHHHHHHCCCCEE
T ss_conf             9999999999986589978999970898889029999999999999769909997-6998999999980998664


No 72 
>cd04258 AAK_AKiii-LysC-EC AAK_AKiii-LysC-EC: Amino Acid Kinase Superfamily (AAK), AKiii-LysC-EC: this CD includes the N-terminal catalytic aspartokinase (AK) domain of the lysine-sensitive aspartokinase isoenzyme AKIII. AKIII is a monofunctional class enzyme (LysC) found in some bacteria such as E. coli. Aspartokinase is the first enzyme in the aspartate metabolic pathway and catalyzes the conversion of aspartate and ATP to aspartylphosphate and ADP. In E. coli, LysC is reported to be a homodimer of 50 kD subunits.
Probab=75.04  E-value=6.7  Score=20.36  Aligned_cols=45  Identities=16%  Similarity=0.228  Sum_probs=28.3

Q ss_pred             EEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHH
Q ss_conf             899986756724746899999999998518971799845724578899
Q 537021.9.peg.3   27 TIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVL   74 (150)
Q Consensus        27 tiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l   74 (150)
                      .+|.||||+-+.|.+.+.++++=|   ..-+.++|+|--+-.-++..|
T Consensus         1 m~V~KFGGtSv~~~~~i~~v~~ii---~~~~~~~vVVVSA~~gvTd~L   45 (292)
T cd04258           1 MVVAKFGGTSVADYAAMLRCAAIV---KSDASVRLVVVSASAGVTNLL   45 (292)
T ss_pred             CEEEEECCCCCCCHHHHHHHHHHH---HCCCCCEEEEEECCCCCHHHH
T ss_conf             989957865448899999999999---758998099994899974999


No 73 
>cd04247 AAK_AK-Hom3 AAK_AK-Hom3: Amino Acid Kinase Superfamily (AAK), AK-Hom3; this CD includes the N-terminal catalytic domain of the aspartokinase HOM3, a monofunctional class enzyme found in Saccharomyces cerevisiae and other related AK domains. Aspartokinase, the first enzyme in the aspartate metabolic pathway, catalyzes the conversion of aspartate and ATP to aspartylphosphate and ADP, and in fungi, is responsible for the production of threonine, isoleucine and methionine. S. cerevisiae has a single aspartokinase isoenzyme type, which is regulated by feedback, allosteric inhibition by L-threonine. Recent studies show that the allosteric transition triggered by binding of threonine to AK involves a large change in the conformation of the native hexameric enzyme that is converted to an inactive one of different shape and substantially smaller hydrodynamic size.
Probab=72.30  E-value=8.9  Score=19.60  Aligned_cols=38  Identities=18%  Similarity=0.253  Sum_probs=26.7

Q ss_pred             EEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECC
Q ss_conf             8999867567247468999999999985189717998457
Q 537021.9.peg.3   27 TIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGG   66 (150)
Q Consensus        27 tiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGg   66 (150)
                      -+|.||||+-+.+.  ....+.+|..-...+.++|+|.-+
T Consensus         2 ~iV~KFGGTSva~~--~~~~~~~I~~~~~~~~~vvVVvSA   39 (306)
T cd04247           2 WVVQKFGGTSVGKF--PDNIADDIVKAYLKGNKVAVVCSA   39 (306)
T ss_pred             CEEEEECHHHHHHH--HHHHHHHHHHHHHCCCCEEEEECC
T ss_conf             89997791408879--999999998776279988999889


No 74 
>COG0474 MgtA Cation transport ATPase [Inorganic ion transport and metabolism]
Probab=71.82  E-value=9.1  Score=19.53  Aligned_cols=100  Identities=19%  Similarity=0.340  Sum_probs=69.6

Q ss_pred             EEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHH-HHHCCCCCE-----EECCC---CCCCHHHHHHHH
Q ss_conf             998675672474689999999999851897179984572457889-997089811-----55783---466124899999
Q 537021.9.peg.3   29 VVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAV-LEKMGIKSK-----FENGL---RITDQQTAEVVE   99 (150)
Q Consensus        29 VIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~-l~~~gi~~~-----~v~G~---RVTd~~tl~iv~   99 (150)
                      .+-+|.-.|.||- ++...+.|..++..|++++++-|=.+..-.. .+++|+..+     .++|.   ..+|++-.+.++
T Consensus       536 l~~lGl~g~~Dpp-r~~v~~aI~~l~~AGI~v~MiTGD~~~TA~aIa~~~Gi~~~~~~~~vi~G~el~~l~~~el~~~~~  614 (917)
T COG0474         536 LVFLGLTGIEDPP-REDVKEAIEELREAGIKVWMITGDHVETAIAIAKECGIEAEAESALVIDGAELDALSDEELAELVE  614 (917)
T ss_pred             CEEEEEEEECCCC-CCCHHHHHHHHHHCCCCEEEECCCCHHHHHHHHHHHCCCCCCCCCEEEEHHHHHHCCHHHHHHHHH
T ss_conf             0402001212899-711899999999889947998799989999999981998676642565159987368899999887


Q ss_pred             HH-HHHHH----HHHHHHHHHHCCCCEEEEEEEECC
Q ss_conf             99-74166----899999998289976877865589
Q 537021.9.peg.3  100 MV-LAGSI----NKKIVSLINQTGTQAIGICGKDGN  130 (150)
Q Consensus       100 ~v-L~G~v----n~~lv~~L~~~g~~avglsg~dg~  130 (150)
                      .. ...++    -..|+.+|+++ .+-++++|--.|
T Consensus       615 ~~~VfARvsP~qK~~IV~~lq~~-g~vVamtGDGvN  649 (917)
T COG0474         615 ELSVFARVSPEQKARIVEALQKS-GHVVAMTGDGVN  649 (917)
T ss_pred             HCCEEEEECHHHHHHHHHHHHHC-CCEEEEECCCHH
T ss_conf             58989992789999999999977-999999679716


No 75 
>pfam10686 DUF2493 Protein of unknown function (DUF2493). Members of this family are all Proteobacteria. The function is not known.
Probab=70.54  E-value=9.7  Score=19.35  Aligned_cols=52  Identities=25%  Similarity=0.355  Sum_probs=34.0

Q ss_pred             CEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCH-H----HHHHHHHCCCC
Q ss_conf             8899986756724746899999999998518971799845724-5----78899970898
Q 537021.9.peg.3   26 ETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGP-Q----IGAVLEKMGIK   80 (150)
Q Consensus        26 ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~-q----I~~~l~~~gi~   80 (150)
                      -+-|+-.||--..|-.+....+..+   +..--..||||||.+ -    ..++.+++|++
T Consensus         3 g~~V~~~Ggrd~~D~~~i~~~Ld~~---~~~~pd~vlihGG~~kGad~lA~~WA~~~gv~   59 (71)
T pfam10686         3 GTRVAFSGGRDFNDHRLIWDALDKV---HARHPDMVLLHGGAPKGAERIAARWARRRGVP   59 (71)
T ss_pred             CCEEEEEECCCCCHHHHHHHHHHHH---HHHCCCEEEEECCCCCCHHHHHHHHHHHCCCC
T ss_conf             9989998089865099999999999---98689879997798633799999999986997


No 76 
>COG0561 Cof Predicted hydrolases of the HAD superfamily [General function prediction only]
Probab=68.79  E-value=11  Score=19.12  Aligned_cols=102  Identities=17%  Similarity=0.311  Sum_probs=66.6

Q ss_pred             CCEEEEEECHHHHCCHH-HHHHHHHHHHHHHHCCCCEEEEECCCH-HHHHHHHHCCCCCEEE--CCCCCCCHHHHH-HHH
Q ss_conf             98899986756724746-899999999998518971799845724-5788999708981155--783466124899-999
Q 537021.9.peg.3   25 NETIVVKYGGHVMNCTD-LSKDFVNDIALLKKSNITPVIVHGGGP-QIGAVLEKMGIKSKFE--NGLRITDQQTAE-VVE   99 (150)
Q Consensus        25 ~ktiVIK~GG~~l~d~~-l~~~~~~dI~ll~~~G~k~ViVHGgg~-qI~~~l~~~gi~~~~v--~G~RVTd~~tl~-iv~   99 (150)
                      -|.++.=+.|..+.+.. ..+...+-|..++..|++++|+-|-.. .+-..+++++++..++  ||-.+-++  -+ +..
T Consensus         3 ~kli~~DlDGTLl~~~~~i~~~~~~al~~~~~~g~~v~iaTGR~~~~~~~~~~~l~~~~~~I~~NGa~i~~~--~~~i~~   80 (264)
T COG0561           3 IKLLAFDLDGTLLDSNKTISPETKEALARLREKGVKVVLATGRPLPDVLSILEELGLDGPLITFNGALIYNG--GELLFQ   80 (264)
T ss_pred             CEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHCCCCCCEEECCCCEEEEC--CEEEEE
T ss_conf             569999476667489998899999999999978298999889987999999997299984998689599989--988882


Q ss_pred             HHHHHHHHHHHHHHHHHCCCCEEEEEEEE
Q ss_conf             99741668999999982899768778655
Q 537021.9.peg.3  100 MVLAGSINKKIVSLINQTGTQAIGICGKD  128 (150)
Q Consensus       100 ~vL~G~vn~~lv~~L~~~g~~avglsg~d  128 (150)
                      ..+....-..+...+........-+...+
T Consensus        81 ~~l~~~~~~~i~~~~~~~~~~~~~~~~~~  109 (264)
T COG0561          81 KPLSREDVEELLELLEDFQGIALVLYTDD  109 (264)
T ss_pred             CCCCHHHHHHHHHHHHHCCCCCEEEEECC
T ss_conf             46999999999999986678628999326


No 77 
>TIGR00268 TIGR00268 conserved hypothetical protein TIGR00268; InterPro: IPR005232   This family of conserved hypothetical proteins has no known function but is predicted to be part of the wider PP-loop superfamily ..
Probab=68.69  E-value=9.6  Score=19.37  Aligned_cols=109  Identities=17%  Similarity=0.298  Sum_probs=64.8

Q ss_pred             HHHHHHHHH-HCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHH--H-------HHHCCCCCEE
Q ss_conf             999999998-75988999867567247468999999999985189717998457245788--9-------9970898115
Q 537021.9.peg.3   14 EQVLPFVQF-YENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGA--V-------LEKMGIKSKF   83 (150)
Q Consensus        14 ~ea~pYi~~-~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~--~-------l~~~gi~~~~   83 (150)
                      ..--.+++. |.+|-++|-|+|.+  |..++..++.|++-...     +=|.=-.|+++.  .       .++.|+..+|
T Consensus         3 ~~L~~fl~~kf~nk~~~IAySGGv--DS~lla~v~~~v~g~~~-----lait~~sP~~sp~~el~~A~~~A~~~g~~he~   75 (263)
T TIGR00268         3 ENLRNFLKEKFKNKKVLIAYSGGV--DSSLLAAVCSDVAGTEV-----LAITVVSPSISPRRELEDAKAIAKEIGVKHEL   75 (263)
T ss_pred             HHHHHHHHHHHHCCCEEEEEECCH--HHHHHHHHHHHHHHHCE-----EEEEEECCCCCCHHHHHHHHHHHHHHCCCEEE
T ss_conf             688889876764081699951746--58999999987531011-----35787627757354489999999880830121


Q ss_pred             ECCCCCC--------CHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEE--EEEEEECCEEEEEE
Q ss_conf             5783466--------12489999999741668999999982899768--77865589899997
Q 537021.9.peg.3   84 ENGLRIT--------DQQTAEVVEMVLAGSINKKIVSLINQTGTQAI--GICGKDGNMVFAEK  136 (150)
Q Consensus        84 v~G~RVT--------d~~tl~iv~~vL~G~vn~~lv~~L~~~g~~av--glsg~dg~li~a~~  136 (150)
                      ++==..+        +..+-=+=+     ++-..|+..-.+.|.+.|  |.++.|  |-.-+|
T Consensus        76 ~~~d~~~n~~f~~N~~~RCY~CK~-----~~~~~L~~~a~~~gy~~V~dGtN~dD--L~~~RP  131 (263)
T TIGR00268        76 VKIDKMANPNFRANVEERCYFCKK-----KVLSILVKLAEKRGYDVVVDGTNADD--LEDHRP  131 (263)
T ss_pred             ECCCCCCCCCCCCCCCCCCHHHHH-----HHHHHHHHHHHHCCCCEEEECCCCCC--CCCCCC
T ss_conf             100123685001688544415488-----89999899998639957982346200--023675


No 78 
>PRK05925 aspartate kinase; Provisional
Probab=67.54  E-value=11  Score=18.96  Aligned_cols=104  Identities=18%  Similarity=0.302  Sum_probs=54.8

Q ss_pred             CEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHH--------------------------CCC
Q ss_conf             889998675672474689999999999851897179984572457889997--------------------------089
Q 537021.9.peg.3   26 ETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEK--------------------------MGI   79 (150)
Q Consensus        26 ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~--------------------------~gi   79 (150)
                      +.+|-||||+-+.+++.++++++=|.   . +.+.++|.-+=.-++..|.+                          +++
T Consensus         2 ~~iV~KFGGtSv~~~e~i~~va~ii~---~-~~~~~vVVSA~~gvTd~L~~~~~~~~~~~~~~~~~l~~~h~~i~~~l~~   77 (440)
T PRK05925          2 APLVYKFGGTSLGTAESIRRVCDIIC---K-EKPSFVVVSAVAGVTDLLEEFCRVSKGKREQITEKIREKHEEIAKELGI   77 (440)
T ss_pred             CEEEEEECCCCCCCHHHHHHHHHHHH---H-CCCCEEEEECCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCC
T ss_conf             86999379623389999999999997---1-5997899969999708999998762010789999999999999987254


Q ss_pred             CCE----------EECCCCCCCHHHHHHHHHHHHHH-HHHHH-HHHHHHCCCCEEEEEEE-----ECCEEEEEE
Q ss_conf             811----------55783466124899999997416-68999-99998289976877865-----589899997
Q 537021.9.peg.3   80 KSK----------FENGLRITDQQTAEVVEMVLAGS-INKKI-VSLINQTGTQAIGICGK-----DGNMVFAEK  136 (150)
Q Consensus        80 ~~~----------~v~G~RVTd~~tl~iv~~vL~G~-vn~~l-v~~L~~~g~~avglsg~-----dg~li~a~~  136 (150)
                      +..          +.+..+.|+.+.-.++  . .|+ +...| ...|+..|.++..+.+.     |+++-.|+|
T Consensus        78 ~~~~~~~~~~l~~~~~~~~~~~~~~~~il--s-~GE~~Sa~lla~~l~~~g~~a~~~da~~~i~td~~~~~a~~  148 (440)
T PRK05925         78 EFSLSPWWERLEHFEDKEEISSEDQARIL--A-IGEDISASLICAYCCTNVLPLEFLEARQVILTDDQYLRATP  148 (440)
T ss_pred             CHHHHHHHHHHHHHHHCCCCCHHHHHHHH--H-HHHHHHHHHHHHHHHHCCCCCEEECCEEEEEECCCCCCCCC
T ss_conf             21246899999987640448847788898--8-72999999999999967998288430328982684433033


No 79 
>cd04255 AAK_UMPK-MosAB AAK_UMPK-MosAB: This CD includes the alpha and beta subunits of the Mo storage protein (MosA and MosB) which are related to uridine monophosphate kinase (UMPK) enzymes that catalyze the phosphorylation of UMP by ATP, yielding UDP, and playing a key role in pyrimidine nucleotide biosynthesis. The Mo storage protein from the nitrogen-fixing bacterium, Azotobacter vinelandii, is characterized as an alpha4-beta4 octamer containing a polynuclear molybdenum-oxide cluster which is ATP-dependent to bind Mo and pH-dependent to release Mo. These and related bacterial sequences in this CD are members of the Amino Acid Kinase Superfamily (AAK).
Probab=67.15  E-value=7.4  Score=20.09  Aligned_cols=85  Identities=22%  Similarity=0.408  Sum_probs=50.9

Q ss_pred             CEEEEEECHHHHCCH--HHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHH---HCCCCCEEECCCCCCCHHHHHHHHH
Q ss_conf             889998675672474--68999999999985189717998457245788999---7089811557834661248999999
Q 537021.9.peg.3   26 ETIVVKYGGHVMNCT--DLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLE---KMGIKSKFENGLRITDQQTAEVVEM  100 (150)
Q Consensus        26 ktiVIK~GG~~l~d~--~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~---~~gi~~~~v~G~RVTd~~tl~iv~~  100 (150)
                      ..-|||+||.-+.|-  +..-.+++.|..+..-. |.+|+-|||-..-..++   .+|+++--.          -+....
T Consensus        31 ~~~VvKIGGqsi~DrG~~av~Plv~Ei~~~~~~h-k~li~TGgG~RaRHiy~igldLgmPtGvL----------a~L~~~   99 (262)
T cd04255          31 DLNVVKIGGQSIIDRGAEAVLPLVEEIVALRPEH-KLLILTGGGTRARHVYSIGLDLGMPTGVL----------AKLGAS   99 (262)
T ss_pred             CCEEEEECCEEEECCCHHHHHHHHHHHHHHHHHC-CEEEEECCCHHHHHHHHHHHCCCCCHHHH----------HHHHHH
T ss_conf             7228998780300373342432899999751315-28998367500334312341479961156----------776422


Q ss_pred             HHHHHHHHHHHHH-HHHCCCCEEE
Q ss_conf             9741668999999-9828997687
Q 537021.9.peg.3  101 VLAGSINKKIVSL-INQTGTQAIG  123 (150)
Q Consensus       101 vL~G~vn~~lv~~-L~~~g~~avg  123 (150)
                      + ++| |..+.+. |+.+|+..++
T Consensus       100 v-s~Q-Na~ml~~LLa~~G~~~i~  121 (262)
T cd04255         100 V-SEQ-NAEMLATLLAKHGGSKVG  121 (262)
T ss_pred             H-HHH-HHHHHHHHHCCCCCEEEC
T ss_conf             4-464-799999984313870257


No 80 
>PRK04596 minC septum formation inhibitor; Reviewed
Probab=67.00  E-value=11  Score=18.89  Aligned_cols=59  Identities=17%  Similarity=0.406  Sum_probs=44.4

Q ss_pred             HHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCC
Q ss_conf             9875988999867567247468999999999985189717998457245788999708981
Q 537021.9.peg.3   21 QFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKS   81 (150)
Q Consensus        21 ~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~   81 (150)
                      ..|++.-+||-+..-. .++++ ..+-.=+..|+..|+.+|=|.|+.++...+.+.+|++.
T Consensus        46 ~FF~naPVVIDls~L~-~~~~~-~~~~aLv~~Lr~~gl~pVgVrg~~~e~~~~A~~~GL~i  104 (248)
T PRK04596         46 KLFGRAAVILDFGGLS-QVPDL-ATAKALLDGLRSAGVLPVALAYGTSEIDLLSQQLGLPL  104 (248)
T ss_pred             HHHCCCCEEEECHHHC-CCCCH-HHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHCCCCC
T ss_conf             8666994999815715-77567-89999999999879889998279989999999769864


No 81 
>TIGR01092 P5CS delta l-pyrroline-5-carboxylate synthetase; InterPro: IPR005766    delta l-pyrroline-5-carboxylate synthetase contains a glutamate 5-kinase (ProB, 2.7.2.11 from EC) region followed by a gamma-glutamyl phosphate reductase (ProA, 1.2.1.41 from EC) region and catalyses the first and second steps in proline biosynthesis.; GO: 0003824 catalytic activity, 0006561 proline biosynthetic process.
Probab=64.65  E-value=10  Score=19.15  Aligned_cols=45  Identities=24%  Similarity=0.323  Sum_probs=36.1

Q ss_pred             CCEEEEEECHHHHCCHH-------HHHHHHHHHHHHHHCCCCEEEEECCCHH
Q ss_conf             98899986756724746-------8999999999985189717998457245
Q 537021.9.peg.3   25 NETIVVKYGGHVMNCTD-------LSKDFVNDIALLKKSNITPVIVHGGGPQ   69 (150)
Q Consensus        25 ~ktiVIK~GG~~l~d~~-------l~~~~~~dI~ll~~~G~k~ViVHGgg~q   69 (150)
                      -|-||||+|-.++.-.+       -+.+++++++.|++-|..++||--|--.
T Consensus         7 ~KRiVVK~G~AVVTr~d~~GLALGRL~si~EQl~~L~~~G~E~~LVtSGAV~   58 (738)
T TIGR01092         7 VKRIVVKVGTAVVTRGDGRGLALGRLGSIVEQLSELNSDGYEVILVTSGAVA   58 (738)
T ss_pred             CCEEEEEECCEEEECCCCCEEHHHHHHHHHHHHHHHCCCCCEEEEECCCHHH
T ss_conf             5478998254578778984014567899999999860289289998144123


No 82 
>PTZ00217 flap endonuclease-1; Provisional
Probab=64.50  E-value=13  Score=18.58  Aligned_cols=50  Identities=16%  Similarity=0.236  Sum_probs=31.4

Q ss_pred             HHHHCCCEEEEEECHHH---------------HCCH-----HHHHHHHHHHHHHHHCCCCEEEEECCCHH
Q ss_conf             99875988999867567---------------2474-----68999999999985189717998457245
Q 537021.9.peg.3   20 VQFYENETIVVKYGGHV---------------MNCT-----DLSKDFVNDIALLKKSNITPVIVHGGGPQ   69 (150)
Q Consensus        20 i~~~r~ktiVIK~GG~~---------------l~d~-----~l~~~~~~dI~ll~~~G~k~ViVHGgg~q   69 (150)
                      |..|+|+++.|=..-.+               +.+.     .-...++..+..|...|++||.|+.|.|-
T Consensus        23 L~~l~GK~VAIDas~~LYqfl~air~~~~~~~l~n~~G~~tshl~gl~~r~~~ll~~gIkPVfVFDGkpp   92 (394)
T PTZ00217         23 LKNLFGRRIAIDASMALYQFIIAIRSSGQGSELTNESGEVTSHLNGLFNRTIRLLEEGIKPIYVFDGKPP   92 (394)
T ss_pred             HHHCCCCEEEEECHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCEEEEEECCCCC
T ss_conf             8990998899961499999999864442344210467870299999999999999879979999789994


No 83 
>TIGR01116 ATPase-IIA1_Ca calcium-translocating P-type ATPase, SERCA-type; InterPro: IPR005782    This model describes Ca2+ ATPases (3.6.3.8 from EC) from eukaryotes and includes significant representatives from plants. Plasma membrane Ca2+ ATPases are primarily responsible for extrusion of calcium from cytoplasmic milieu to outside, coupled to the hydrolyis of ATP. Besides Ca2+ ATPases, Na+/Ca2+ exchangers are also involved in the maintenance of Ca2+ homestasis in the cell. ; GO: 0005388 calcium-transporting ATPase activity, 0006816 calcium ion transport, 0016021 integral to membrane.
Probab=63.53  E-value=5.9  Score=20.75  Aligned_cols=95  Identities=15%  Similarity=0.273  Sum_probs=64.9

Q ss_pred             HHHHHCCCHHHHHHHHHHHH------------CCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHH
Q ss_conf             02322062289999999987------------598899986756724746899999999998518971799845724578
Q 537021.9.peg.3    4 KIYQFQAEILEQVLPFVQFY------------ENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIG   71 (150)
Q Consensus         4 ~~~~~~a~~l~ea~pYi~~~------------r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~   71 (150)
                      .+.|.+.+.=+..+--|+++            |.++=.+=+|=-.|.||. ++...+.|...|..|||+++|-|=.+.--
T Consensus       418 ~~VPL~~~~k~~il~~~~~~~~~aLRCLA~A~~~Es~Ltf~G~vGl~DPP-R~EV~~ai~~CR~AGIrVImITGD~KeTA  496 (800)
T TIGR01116       418 SAVPLTEKMKNTILASIKEMSSEALRCLALAFKEESKLTFIGVVGLLDPP-RPEVADAIEKCREAGIRVIMITGDNKETA  496 (800)
T ss_pred             CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHEECCCCCCEEEEEEECCCCC-CHHHHHHHHHHHHCCCEEEEEECCCCHHH
T ss_conf             01347846789999999987641324110012146786788885068948-16899999988737978999847982468


Q ss_pred             HHHH-HCCCCCEEE----------CCC---CCCCHHHHHHHH
Q ss_conf             8999-708981155----------783---466124899999
Q 537021.9.peg.3   72 AVLE-KMGIKSKFE----------NGL---RITDQQTAEVVE   99 (150)
Q Consensus        72 ~~l~-~~gi~~~~v----------~G~---RVTd~~tl~iv~   99 (150)
                      +.-- +-|+=+.|-          .|+   |+=+..-+++|+
T Consensus       497 ~AicR~IG~lsSfTG~EF~~m~~~~~~~FSR~ePsHK~~lV~  538 (800)
T TIGR01116       497 EAICRRIGILSSFTGREFDEMGPKKALVFSRVEPSHKLRLVE  538 (800)
T ss_pred             HHHHHHHHHCCCCCCCHHHHCCCCCCEEEEECCCHHHHHHHH
T ss_conf             999767501024200036500601466876058255787653


No 84 
>cd04245 AAK_AKiii-YclM-BS AAK_AKiii-YclM-BS: Amino Acid Kinase Superfamily (AAK), AKiii-YclM-BS; this CD includes the N-terminal catalytic aspartokinase (AK) domain of the lysine plus threonine-sensitive aspartokinase isoenzyme AKIII, a monofunctional class enzyme found in Bacilli (Bacillus subtilis YclM) and Clostridia species. Aspartokinase is the first enzyme in the aspartate metabolic pathway and catalyzes the conversion of aspartate and ATP to aspartylphosphate and ADP. In Bacillus subtilis (BS), YclM is reported to be a single polypeptide of 50 kD. The Bacillus subtilis 168 AKIII is induced by lysine and repressed by threonine, and it is synergistically inhibited by lysine and threonine.
Probab=63.09  E-value=13  Score=18.61  Aligned_cols=35  Identities=23%  Similarity=0.325  Sum_probs=23.2

Q ss_pred             EEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEE
Q ss_conf             9998675672474689999999999851897179984
Q 537021.9.peg.3   28 IVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVH   64 (150)
Q Consensus        28 iVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVH   64 (150)
                      -|.||||+-+.|.+-++++++ |..- ....++|+|-
T Consensus         2 kV~KFGGtSv~~~~~i~~v~~-Ii~~-~~~~~vVVVS   36 (288)
T cd04245           2 KVVKFGGSSLASAEQFQKVKA-IVKA-DPERKIVVVS   36 (288)
T ss_pred             EEEEECCCCCCCHHHHHHHHH-HHHC-CCCCEEEEEC
T ss_conf             899888113589999999999-9971-7898099990


No 85 
>PRK00009 phosphoenolpyruvate carboxylase; Reviewed
Probab=63.06  E-value=14  Score=18.41  Aligned_cols=65  Identities=25%  Similarity=0.451  Sum_probs=42.8

Q ss_pred             CEEEEEEC------HHHHCCHHHHHHHHHHHHHHHHCCCCEEEEE--------CCCHHHHHHHHHCCCCCEEECC-CCCC
Q ss_conf             88999867------5672474689999999999851897179984--------5724578899970898115578-3466
Q 537021.9.peg.3   26 ETIVVKYG------GHVMNCTDLSKDFVNDIALLKKSNITPVIVH--------GGGPQIGAVLEKMGIKSKFENG-LRIT   90 (150)
Q Consensus        26 ktiVIK~G------G~~l~d~~l~~~~~~dI~ll~~~G~k~ViVH--------Ggg~qI~~~l~~~gi~~~~v~G-~RVT   90 (150)
                      ..+-+-|.      |-...+..+.++--+=.......|+++.+-|        ||||.-.+.+   ..|+.-++| +|+|
T Consensus       568 QeVMlGYSDS~KDgG~l~s~W~ly~Aq~~L~~v~~~~gv~l~~FHGRGGsvgRGGGP~~~ai~---aqP~gsv~g~ir~T  644 (916)
T PRK00009        568 QEVMLGYSDSNKDGGFLASNWALYRAQEALVELAEKHGVRLTLFHGRGGTVGRGGGPAYAAIL---SQPPGSVKGRIRVT  644 (916)
T ss_pred             EEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCHHHHHH---CCCCCCCCCCEEEE
T ss_conf             999732566654233425568999999999999997699379850798766778860899986---39986606843665


Q ss_pred             CHH
Q ss_conf             124
Q 537021.9.peg.3   91 DQQ   93 (150)
Q Consensus        91 d~~   93 (150)
                      -+-
T Consensus       645 EQG  647 (916)
T PRK00009        645 EQG  647 (916)
T ss_pred             ECC
T ss_conf             346


No 86 
>KOG2518 consensus
Probab=60.00  E-value=9.7  Score=19.34  Aligned_cols=49  Identities=24%  Similarity=0.440  Sum_probs=34.5

Q ss_pred             HHHHHCCCEEEEEECHHHHCC----------------HHHHHHHHHHHHHHHHCCCCEEEEECCCHH
Q ss_conf             999875988999867567247----------------468999999999985189717998457245
Q 537021.9.peg.3   19 FVQFYENETIVVKYGGHVMNC----------------TDLSKDFVNDIALLKKSNITPVIVHGGGPQ   69 (150)
Q Consensus        19 Yi~~~r~ktiVIK~GG~~l~d----------------~~l~~~~~~dI~ll~~~G~k~ViVHGgg~q   69 (150)
                      -+..|+|+|+-|  .|.++..                ..-.+=|++-+.+|...|++||||+-|++-
T Consensus        18 hi~~~~g~tvav--D~y~WLhrg~~~Ca~el~~~~pT~ryi~y~ik~v~lL~~~gikPilVFDG~~L   82 (556)
T KOG2518          18 HISEYKGKTVAV--DGYCWLHRGALACAEKLAKGKPTDRYIQFFIKRVKLLLSYGIKPILVFDGDPL   82 (556)
T ss_pred             HHHHHCCCEEEE--EHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCEEEEEECCCCC
T ss_conf             688755836887--32558864577679998628973899999999999998569748999669976


No 87 
>PRK03868 glucose-6-phosphate isomerase; Provisional
Probab=59.98  E-value=15  Score=18.07  Aligned_cols=89  Identities=15%  Similarity=0.250  Sum_probs=57.7

Q ss_pred             CCHHHHHHHHHHHHC--CCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCH-HHHHHHHHCCCCCEEE--
Q ss_conf             622899999999875--98899986756724746899999999998518971799845724-5788999708981155--
Q 537021.9.peg.3   10 AEILEQVLPFVQFYE--NETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGP-QIGAVLEKMGIKSKFE--   84 (150)
Q Consensus        10 a~~l~ea~pYi~~~r--~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~-qI~~~l~~~gi~~~~v--   84 (150)
                      ...+.+...+++.-+  ..++||-+|||.+- +.+...++   .-....+.++..|-.--| .+++.++.+..+....  
T Consensus        42 ~~~~~~~~~~~~~~~~~~~vV~IGIGGS~LG-~~ai~~aL---~~~~~~~~~i~F~~N~Dp~~~~~~l~~l~~~~TlfiV  117 (409)
T PRK03868         42 TNLIEESLKYLNDKQAIKNIVVIGIGGSSLG-VKAIYSML---KNKKQLKRELHFLDNTDPISINKTLKKINLEETLFIV  117 (409)
T ss_pred             HHHHHHHHHHHHHHCCCCEEEEEEECCHHHH-HHHHHHHH---CCCCCCCCEEEEEECCCHHHHHHHHHHCCCCCEEEEE
T ss_conf             5799999999864524987999825528899-99999984---1204589749998488989999999608935179999


Q ss_pred             CCCCCCCHHHHHHHHHHH
Q ss_conf             783466124899999997
Q 537021.9.peg.3   85 NGLRITDQQTAEVVEMVL  102 (150)
Q Consensus        85 ~G~RVTd~~tl~iv~~vL  102 (150)
                      --==-|+.||+...+.++
T Consensus       118 iSKSgtT~ET~~~~~~~~  135 (409)
T PRK03868        118 ISKSGTTIETISIFKYIL  135 (409)
T ss_pred             ECCCCCCHHHHHHHHHHH
T ss_conf             859998389999999999


No 88 
>PRK10444 UMP phosphatase; Provisional
Probab=58.40  E-value=15  Score=18.13  Aligned_cols=55  Identities=7%  Similarity=0.149  Sum_probs=28.0

Q ss_pred             CEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCC----HHHHHHHHHCCCC
Q ss_conf             889998675672474689999999999851897179984572----4578899970898
Q 537021.9.peg.3   26 ETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGG----PQIGAVLEKMGIK   80 (150)
Q Consensus        26 ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg----~qI~~~l~~~gi~   80 (150)
                      |++++=+.|.+....++++.-.+-|..|+..|+++++|-=..    .++-+.|.++|++
T Consensus         2 k~il~DlDGvl~~g~~~ipga~e~l~~L~~~g~~~~fvTNnss~s~~~~~~kL~~~Gl~   60 (248)
T PRK10444          2 KNVICDIDGVLMHDNVAVPGAAEFLHRILEKGLPLVLLTNYPSQTGQDLANRFATAGVD   60 (248)
T ss_pred             CEEEECCCCEEEECCEECCCHHHHHHHHHHCCCEEEEEECCCCCCHHHHHHHHHHCCCC
T ss_conf             89999075613889984869999999999879929999389999999999999977999


No 89 
>cd01979 Pchlide_reductase_N Pchlide_reductase_N: N protein of the NB protein complex of Protochlorophyllide (Pchlide)_reductase. Pchlide reductase catalyzes the reductive formation of chlorophyllide (chlide) from protochlorophyllide (pchlide) during biosynthesis of chlorophylls and bacteriochlorophylls. This group contains both the light-independent Pchlide reductase (DPOR) and light-dependent Pchlide reductase (LPOR).  Angiosperms contain only LPOR, cyanobacteria, algae and gymnosperms contain both DPOR and LPOR, primitive anoxygenic photosynthetic bacteria contain only DPOR. NB is structurally similar to the FeMo protein of nitrogenase, forming an N2B2 heterotetramer. N and B are homologous to the FeMo alpha and beta subunits respectively. Also in common with nitrogenase in vitro DPOR activity requires ATP hydrolysis and dithoionite or ferredoxin as electron donor. The NB protein complex may serve as a catalytic site for Pchlide reduction similar to MoFe for nitrogen reduction.
Probab=58.34  E-value=16  Score=17.89  Aligned_cols=77  Identities=13%  Similarity=0.315  Sum_probs=42.4

Q ss_pred             HHCCCHHHHHHHHHHHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECC
Q ss_conf             22062289999999987598899986756724746899999999998518971799845724578899970898115578
Q 537021.9.peg.3    7 QFQAEILEQVLPFVQFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENG   86 (150)
Q Consensus         7 ~~~a~~l~ea~pYi~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G   86 (150)
                      +-.+..-+...+|....+||. |+-+|++-++=|     +++   +|+.+|+.+|-|  |-|++++.+...-++.- -.|
T Consensus       258 ~~~~ra~~~le~yr~~L~GK~-vfF~pDsqLEIp-----LAR---FL~rcGm~~vEV--Gtpy~~~~~~~~el~ll-~~~  325 (396)
T cd01979         258 EREARAWRALEPYLDLLRGKS-IFFMGDNLLEIP-----LAR---FLTRCGMIVVEV--GTPYLDKRFQAAELELL-PPM  325 (396)
T ss_pred             HHHHHHHHHHHHHHHHHCCCE-EEECCCCCHHHH-----HHH---HHHHCCCEEEEE--CCCCCCHHHHHHHHHHC-CCC
T ss_conf             899999999999999838974-998188602404-----899---999879878995--56654876678899746-999


Q ss_pred             CCCCCHHHH
Q ss_conf             346612489
Q 537021.9.peg.3   87 LRITDQQTA   95 (150)
Q Consensus        87 ~RVTd~~tl   95 (150)
                      .|+......
T Consensus       326 ~~i~E~~d~  334 (396)
T cd01979         326 VRIVEKPDN  334 (396)
T ss_pred             CCCCCCCCH
T ss_conf             844307977


No 90 
>TIGR00600 rad2 DNA excision repair protein (rad2); InterPro: IPR001044   Xeroderma pigmentosum (XP)  is a human autosomal recessive disease, characterised by a high incidence of sunlight-induced skin cancer. People's skin cells with this condition are hypersensitive to ultraviolet light, due to defects in the incision step of DNA excision repair. There are a minimum of seven genetic complementation groups involved in this pathway: XP-A to XP-G. XP-G is one of the most rare and phenotypically heterogeneous of XP, showing anything from slight to extreme dysfunction in DNA excision repair , . XP-G can be corrected by a 133 Kd nuclear protein, XPGC . XPGC is an acidic protein that confers normal UV resistance in expressing cells . It is a magnesium-dependent, single-strand DNA endonuclease that makes structure-specific endonucleolytic incisions in a DNA substrate containing a duplex region and single-stranded arms , . XPGC cleaves one strand of the duplex at the border with the single-stranded region .   XPG belongs to a family of proteins that includes RAD2 from Saccharomyces cerevisiae (Baker's yeast) and rad13 from Schizosaccharomyces pombe (Fission yeast), which are single-stranded DNA endonucleases , ; mouse and human FEN-1, a structure-specific endonuclease; RAD2 from fission yeast and RAD27 from budding yeast; fission yeast exo1, a 5'-3' double-stranded DNA exonuclease that may act in a pathway that corrects mismatched base pairs; yeast DHS1, and yeast DIN7. Sequence alignment of this family of proteins reveals that similarities are largely confined to two regions. The first is located at the N-terminal extremity (N-region) and corresponds to the first 95 to 105 amino acids. The second region is internal (I-region) and found towards the C-terminus; it spans about 140 residues and contains a highly conserved core of 27 amino acids that includes a conserved pentapeptide (E-A-[DE]-A-[QS]). It is possible that the conserved acidic residues are involved in the catalytic mechanism of DNA excision repair in XPG. The amino acids linking the N- and I-regions are not conserved.   This entry represents XPGC, an acidic protein that confers normal UV resistance in expressing cells, can correct XP-G. It is a magnesium-dependent, single-strand DNA endonuclease that makes structure-specific endonucleolytic incisions in a DNA substrate containing a duplex region and single-stranded arms. XPGC cleaves one strand of the duplex at the border with the single-stranded region , .; GO: 0003697 single-stranded DNA binding, 0004519 endonuclease activity, 0006289 nucleotide-excision repair, 0005634 nucleus.
Probab=57.82  E-value=11  Score=19.07  Aligned_cols=62  Identities=21%  Similarity=0.346  Sum_probs=42.1

Q ss_pred             CHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCC------CEEECCCCCCCHHHHHHH
Q ss_conf             756724746899999999998518971799845724578899970898------115578346612489999
Q 537021.9.peg.3   33 GGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIK------SKFENGLRITDQQTAEVV   98 (150)
Q Consensus        33 GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~------~~~v~G~RVTd~~tl~iv   98 (150)
                      -|+++.+..+. .|++-|..|=-.|||||-|+-||.=+   |++.-|.      -.....=|.|...-|...
T Consensus        46 ~GN~i~n~Hl~-tlF~RlCKLLFFrIrPvFVFDG~aP~---LKrqTl~kR~~R~~~~~~~a~~T~~KLL~~f  113 (1127)
T TIGR00600        46 EGNAIKNSHLL-TLFRRLCKLLFFRIRPVFVFDGGAPL---LKRQTLAKRRQRRDKASEDARKTAEKLLALF  113 (1127)
T ss_pred             CCCEEECCHHH-HHHHHHHHHHHCCCCEEEEECCCCCC---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_conf             68723050667-78999998861687316884088872---4689999998765323320488999999999


No 91 
>pfam06953 ArsD Arsenical resistance operon trans-acting repressor ArsD. This family consists of several bacterial arsenical resistance operon trans-acting repressor ArsD proteins. ArsD is a trans-acting repressor of the arsRDABC operon that confers resistance to arsenicals and antimonials in Escherichia coli. It possesses two-pairs of vicinal cysteine residues, Cys(12)-Cys(13) and Cys(112)-Cys(113), that potentially form separate binding sites for the metalloids that trigger dissociation of ArsD from the operon. However, as a homodimer it has four vicinal cysteine pairs.
Probab=57.42  E-value=15  Score=18.16  Aligned_cols=67  Identities=21%  Similarity=0.341  Sum_probs=44.0

Q ss_pred             CCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHC
Q ss_conf             47468999999999985189717998457245788999708981155783466124899999997416689999999828
Q 537021.9.peg.3   38 NCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQTAEVVEMVLAGSINKKIVSLINQT  117 (150)
Q Consensus        38 ~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~tl~iv~~vL~G~vn~~lv~~L~~~  117 (150)
                      .|++|. .|..|+..|+.                     .|++.+..|   .|      ---++++  -|+.+...|...
T Consensus        22 VD~~L~-~~aa~~~~lk~---------------------~gv~v~R~N---L~------~~P~~F~--~N~~V~~~L~~~   68 (119)
T pfam06953        22 VDPELV-RFAADLDWLKQ---------------------NGVEVERYN---LA------QQPMAFA--ENAVVKAFLETS   68 (119)
T ss_pred             CCHHHH-HHHHHHHHHHH---------------------CCCEEEECC---CC------CCHHHHH--HCHHHHHHHHHC
T ss_conf             798999-99999999996---------------------794799716---63------2879997--279999999974


Q ss_pred             CCCEEEEEEEECCEEEEEEC
Q ss_conf             99768778655898999974
Q 537021.9.peg.3  118 GTQAIGICGKDGNMVFAEKA  137 (150)
Q Consensus       118 g~~avglsg~dg~li~a~~~  137 (150)
                      |..+..++-+||.++.+.+.
T Consensus        69 G~e~LPitlVDGeIv~~G~Y   88 (119)
T pfam06953        69 GAEGLPLTLVDGEIVKTGRY   88 (119)
T ss_pred             CCCCCCEEEECCEEEEECCC
T ss_conf             96548889998999881569


No 92 
>PRK10530 phosphotransferase; Provisional
Probab=57.36  E-value=17  Score=17.79  Aligned_cols=97  Identities=14%  Similarity=0.251  Sum_probs=57.8

Q ss_pred             CEEEEEECHHHHCCHH-HHHHHHHHHHHHHHCCCCEEEEECCCH-HHHHHHHHCCCCCEEE--CCCCCCCHHHHHHH-HH
Q ss_conf             8899986756724746-899999999998518971799845724-5788999708981155--78346612489999-99
Q 537021.9.peg.3   26 ETIVVKYGGHVMNCTD-LSKDFVNDIALLKKSNITPVIVHGGGP-QIGAVLEKMGIKSKFE--NGLRITDQQTAEVV-EM  100 (150)
Q Consensus        26 ktiVIK~GG~~l~d~~-l~~~~~~dI~ll~~~G~k~ViVHGgg~-qI~~~l~~~gi~~~~v--~G~RVTd~~tl~iv-~~  100 (150)
                      |.|+.=+.|.++.+.. +.+...+.|..++..|+++++.-|=.+ .+...++++++...++  ||-.+-|...=+++ ..
T Consensus         4 KlIa~DlDGTLl~~~~~i~~~~~~ai~~l~~~Gi~~~iaTGR~~~~~~~~~~~l~~~~~~I~~NGa~i~d~~~~~~l~~~   83 (272)
T PRK10530          4 RVIALDLDGTLLTPKKTILPSSLEALARAREAGYQLIIVTGRHHVAIHPFYQALALDTPAICCNGTYLYDYQAKTVLEAD   83 (272)
T ss_pred             CEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHHCCCCCEEEECCEEEEECCCCEEEEEC
T ss_conf             29999087363589895999999999999978999999959987888999998099986897688599977998799954


Q ss_pred             HHHHHHHHHHHHHHHHCCCCEE
Q ss_conf             9741668999999982899768
Q 537021.9.peg.3  101 VLAGSINKKIVSLINQTGTQAI  122 (150)
Q Consensus       101 vL~G~vn~~lv~~L~~~g~~av  122 (150)
                      .+.-+.-..+...+.+.+++..
T Consensus        84 ~l~~~~~~~i~~~~~~~~~~~~  105 (272)
T PRK10530         84 PMPVQQALQLIELLNEHQIHGL  105 (272)
T ss_pred             CCCHHHHHHHHHHHHHCCCCEE
T ss_conf             7999999999999998499579


No 93 
>cd00128 XPG Xeroderma pigmentosum G N- and I-regions (XPGN, XPGI); contains the HhH2 motif; domain in nucleases. XPG is a eukaryotic enzyme that functions in nucleotide-excision repair and transcription-coupled repair of oxidative DNA damage. Functionally/structurally related to FEN-1;  divalent metal ion-dependent exo- and endonuclease, and bacterial and bacteriophage 5'3' exonucleases.
Probab=57.12  E-value=17  Score=17.76  Aligned_cols=50  Identities=18%  Similarity=0.252  Sum_probs=33.5

Q ss_pred             HHHHCCCEEEEEECHHH--------------HCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHH
Q ss_conf             99875988999867567--------------247468999999999985189717998457245
Q 537021.9.peg.3   20 VQFYENETIVVKYGGHV--------------MNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQ   69 (150)
Q Consensus        20 i~~~r~ktiVIK~GG~~--------------l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~q   69 (150)
                      +..|+||++.|=-.--+              -........|+.-+..|...|++||.|+-|.|-
T Consensus        19 l~~l~gk~iAIDas~~Lyq~l~~~~~~~~~~~~~~~~l~g~~~ri~~Ll~~~I~PifVFDG~~p   82 (316)
T cd00128          19 LEELRGKKVAIDASIWLYQFLKACRQELGSGGETTSHLQGFFYRTCRLLELGIKPVFVFDGKPP   82 (316)
T ss_pred             HHHCCCCEEEEECHHHHHHHHHHCCHHHCCCCCCCHHHHHHHHHHHHHHHCCCEEEEEECCCCC
T ss_conf             8990999899971899999998537233148986679999999999999779959999899898


No 94 
>PRK00973 glucose-6-phosphate isomerase; Provisional
Probab=56.96  E-value=17  Score=17.75  Aligned_cols=87  Identities=13%  Similarity=0.232  Sum_probs=52.8

Q ss_pred             HHHHHHHHHHHHCCCEEEEEECHHHHCCHHHHHHHHH----HHHHHHHCCCCEEEEECCCH-HHHHHHHHCCCCCEEEC-
Q ss_conf             2899999999875988999867567247468999999----99998518971799845724-57889997089811557-
Q 537021.9.peg.3   12 ILEQVLPFVQFYENETIVVKYGGHVMNCTDLSKDFVN----DIALLKKSNITPVIVHGGGP-QIGAVLEKMGIKSKFEN-   85 (150)
Q Consensus        12 ~l~ea~pYi~~~r~ktiVIK~GG~~l~d~~l~~~~~~----dI~ll~~~G~k~ViVHGgg~-qI~~~l~~~gi~~~~v~-   85 (150)
                      -+.++..+++.|.+ .+||-+|||.+--..+.+.+..    ....-..-+.++..|..-.| .+.+.|+.+..+...+. 
T Consensus        61 ~~~~~~~~~k~~~~-vV~IGIGGS~LG~~a~~~aL~~~~~n~~~~~~~~~~~i~f~~NvDp~~l~~~l~~ld~~~TlfiV  139 (454)
T PRK00973         61 SYLELKEWIKNFEN-VVVLGIGGSALGNLALHTALLPLNYNELSKEERNGYRIFVLDNVDPEKTASILDVIDPKKTLFNV  139 (454)
T ss_pred             HHHHHHHHHCCCCE-EEEECCCHHHHHHHHHHHHHHHHCCCCCCHHCCCCCEEEEEECCCHHHHHHHHHHCCCCCEEEEE
T ss_conf             99999997478987-99954353678999999995001123100111689759998089989999999747933079999


Q ss_pred             -CCCCCCHHHHHHHH
Q ss_conf             -83466124899999
Q 537021.9.peg.3   86 -GLRITDQQTAEVVE   99 (150)
Q Consensus        86 -G~RVTd~~tl~iv~   99 (150)
                       -=--|+.|||.-..
T Consensus       140 iSKSgtT~ETl~n~~  154 (454)
T PRK00973        140 ISKSGNTAETLANYL  154 (454)
T ss_pred             EECCCCCHHHHHHHH
T ss_conf             848979689999999


No 95 
>TIGR01292 TRX_reduct thioredoxin-disulfide reductase; InterPro: IPR005982    Reactive oxygen species (ROS) are known mediators of intracellular signaling cascades. Excessive production of ROS may, however, lead to oxidative stress, loss of cell function, and ultimately apoptosis or necrosis. A balance between oxidant and antioxidant intracellular systems is hence vital for cell function, regulation, and adaptation to diverse growth conditions. Thioredoxin reductase in conjunction with thioredoxin is a ubiquitous oxidoreductase system with antioxidant and redox regulatory roles. Thioredoxin reductase (1.8.1.9 from EC) reduces oxidised thioredoxin in the presence of NADPH. Reduced thioredoxin serves as an electron donor for thioredoxin peroxidase which consequently reduces H_2O_2 to H_2O. In mammals, extracellular forms of Trx also have cytokine-like effects. Mammalian TrxR has a highly reactive active site selenocysteine residue resulting in a profound reductive capacity, reducing several substrates in addition to Trx .; GO: 0004791 thioredoxin-disulfide reductase activity, 0019430 removal of superoxide radicals, 0005737 cytoplasm.
Probab=56.33  E-value=18  Score=17.68  Aligned_cols=36  Identities=22%  Similarity=0.426  Sum_probs=23.5

Q ss_pred             HHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEEC
Q ss_conf             987598899986756724746899999999998518971799845
Q 537021.9.peg.3   21 QFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHG   65 (150)
Q Consensus        21 ~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHG   65 (150)
                      ..||||.++ -+||.   |..+     ++--.|+..+-+|.|||=
T Consensus       147 ~ffk~K~V~-VvGGG---dsA~-----eEA~yL~~~a~kV~lvHR  182 (321)
T TIGR01292       147 PFFKNKEVA-VVGGG---DSAL-----EEALYLTRIAKKVTLVHR  182 (321)
T ss_pred             HHHCCCEEE-EECCC---CHHH-----HHHHHHHHCCCEEEEEEC
T ss_conf             120598899-98798---2488-----888999853876799977


No 96 
>cd03141 GATase1_Hsp31_like Type 1 glutamine amidotransferase (GATase1)-like domain found in proteins similar to Escherichia coli Hsp31 protein. Type 1 glutamine amidotransferase (GATase1)-like domain found in proteins similar to Escherichia coli Hsp31 protein (EcHsp31).  This group includes EcHsp31 and Saccharomyces cerevisiae Ydr533c protein.  EcHsp31 has chaperone activity.  Ydr533c is upregulated in response to various stress conditions along with the heat shock family.  EcHsp31 coordinates a metal ion using a 2-His-1-carboxylate motif present in various ions that use iron as a cofactor such as Carboxypeptidase A.   The catalytic triad typical of GATase1 domains is not conserved in this GATase1-like domain. However, in common with a typical GATase1 domain, a reactive Cys residue is found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow. For EcHsp31, this Cys together with a different His and, an Asp (rather than a Glu) residue form a different 
Probab=56.22  E-value=18  Score=17.67  Aligned_cols=54  Identities=13%  Similarity=0.163  Sum_probs=39.8

Q ss_pred             EEECCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEE------EEEEEECCEEEEE
Q ss_conf             15578346612489999999741668999999982899768------7786558989999
Q 537021.9.peg.3   82 KFENGLRITDQQTAEVVEMVLAGSINKKIVSLINQTGTQAI------GICGKDGNMVFAE  135 (150)
Q Consensus        82 ~~v~G~RVTd~~tl~iv~~vL~G~vn~~lv~~L~~~g~~av------glsg~dg~li~a~  135 (150)
                      -.+.|+|||.-..-+-...-+.+.+-..|...|.+.|.+=.      .-..+||+||+.+
T Consensus       146 ~l~~Gk~vT~fs~~EE~~~~~~~~~Pf~le~~l~~~Ga~~~~~~~~~~~vvvDg~LITGq  205 (221)
T cd03141         146 SLVAGKTVTGFTNEEEEAAGLKKVVPFLLEDELKELGANYVKAEPWAEFVVVDGRLITGQ  205 (221)
T ss_pred             EEECCCEEEECCCHHHHHCCCCCCCCCCHHHHHHHCCCEEEECCCCCCCEEEECCEEECC
T ss_conf             155673785159799874154566874789999986998860689987389829977687


No 97 
>TIGR00014 arsC arsenate reductase; InterPro: IPR006659   This entry describes a distinct clade, including ArsC itself, of the broader family of ArsC and related proteins. This clade is almost completely restricted to the proteobacteria. An anion-translocating ATPase has been identified as the product of the arsenical resistance operon of resistance plasmid R773 . When expressed in Escherichia coli this ATP-driven oxyanion pump catalyses extrusion of the oxyanions arsenite, antimonite and arsenate. The pump is composed of two polypeptides, the products of the arsA and arsB genes. The pump alone produces resistance to arsenite and antimonite. This protein, ArsC, catalyzes the reduction of arsenate to arsenite, and thus extends resistance to include arsenate. ; GO: 0008794 arsenate reductase (glutaredoxin) activity, 0006118 electron transport.
Probab=55.49  E-value=6.5  Score=20.44  Aligned_cols=10  Identities=30%  Similarity=0.737  Sum_probs=4.0

Q ss_pred             CEEEEECCCH
Q ss_conf             1799845724
Q 537021.9.peg.3   59 TPVIVHGGGP   68 (150)
Q Consensus        59 k~ViVHGgg~   68 (150)
                      |||+|||.+.
T Consensus        92 RPiVv~~~~A  101 (114)
T TIGR00014        92 RPIVVAGDKA  101 (114)
T ss_pred             CCEEECCCCC
T ss_conf             9857659964


No 98 
>TIGR01465 cobM_cbiF precorrin-4 C11-methyltransferase; InterPro: IPR006362   Cobalamin (vitamin B12) is a structurally complex cofactor, consisting of a modified tetrapyrrole with a centrally chelated cobalt. Cobalamin is usually found in one of two biologically active forms: methylcobalamin and adocobalamin. Most prokaryotes, as well as animals, have cobalamin-dependent enzymes, whereas plants and fungi do not appear to use it. In bacteria and archaea, these include methionine synthase, ribonucleotide reductase, glutamate and methylmalonyl-CoA mutases, ethanolamine ammonia lyase, and diol dehydratase . In mammals, cobalamin is obtained through the diet, and is required for methionine synthase and methylmalonyl-CoA mutase .    There are at least two distinct cobalamin biosynthetic pathways in bacteria :  Aerobic pathway that requires oxygen and in which cobalt is inserted late in the pathway ; found in Pseudomonas denitrificans and Rhodobacter capsulatus. Anaerobic pathway in which cobalt insertion is the first committed step towards cobalamin synthesis ; found in Salmonella typhimurium, Bacillus megaterium, and Propionibacterium freudenreichii shermanii.     Either pathway can be divided into two parts: (1) corrin ring synthesis (differs in aerobic and anaerobic pathways) and (2) adenosylation of corrin ring, attachment of aminopropanol arm, and assembly of the nucleotide loop (common to both pathways) . There are about 30 enzymes involved in either pathway, where those involved in the aerobic pathway are prefixed Cob and those of the anaerobic pathway Cbi. Several of these enzymes are pathway-specific: CbiD, CbiG, and CbiK are specific to the anaerobic route of S. typhimurium, whereas CobE, CobF, CobG, CobN, CobS, CobT, and CobW are unique to the aerobic pathway of P. denitrificans.   This entry represents CobM and CibF precorrin-4 C11-methyltransferase (2.1.1.133 from EC), both as stand-alone enzymes and when CobJ forms part of a bifunctional enzyme. In the aerobic pathway, CobM catalyses the methylation of precorrin-4 at C-11 to yield precorrin-5. The extruded acyl group is then removed in the subsequent step catalysed by CobF. In the anaerobic pathway, CibF catalyses the methylation of cobalt-precorrin-4 to cobalt-precoriin-5 .    Nomenclature note: occasionally precorrin-4 C11-methyltransferase is one of two methyltransferases often referred to as precorrin-3 methylase (the other is precorrin-3 methylase (the other is precorrin-4 C11-methyltransferase, 2.1.1.131 from EC). ; GO: 0046026 precorrin-4 C11-methyltransferase activity, 0009236 cobalamin biosynthetic process.
Probab=55.12  E-value=18  Score=17.55  Aligned_cols=126  Identities=17%  Similarity=0.223  Sum_probs=86.2

Q ss_pred             HHHHCCCHHHHHHHHHHHH--CCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCC--
Q ss_conf             2322062289999999987--598899986756724746899999999998518971799845724578899970898--
Q 537021.9.peg.3    5 IYQFQAEILEQVLPFVQFY--ENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIK--   80 (150)
Q Consensus         5 ~~~~~a~~l~ea~pYi~~~--r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~--   80 (150)
                      +.-..+..|.|-.|.+.++  +||.+|=-=.|    ||.+.-++.++...|..+|+..-+|.|-.-..- ....++.|  
T Consensus        49 ~~~sA~m~L~ei~~~m~~a~~~GK~VvRLHsG----DPsIYGA~~EQ~~~L~~~gI~~e~vPGvSsf~A-AAA~l~~ELT  123 (252)
T TIGR01465        49 VVNSAAMSLEEIVDIMVDAVREGKLVVRLHSG----DPSIYGAIAEQMQLLEALGIPYEVVPGVSSFFA-AAAALGAELT  123 (252)
T ss_pred             EEECCCCCHHHHHHHHHHHHHCCCEEEEEECC----CHHHHHHHHHHHHHHHHCCCCEEEECCCHHHHH-HHHHHHCCCC
T ss_conf             86050269889999999998669849998508----755776699999999867897798688738989-9997300147


Q ss_pred             ----CE-----EECCCCCCCHHHHHHHHHH---------HHHHHHHHHHHHHHHCCCCE---EE----EEEEECCEEEEE
Q ss_conf             ----11-----5578346612489999999---------74166899999998289976---87----786558989999
Q 537021.9.peg.3   81 ----SK-----FENGLRITDQQTAEVVEMV---------LAGSINKKIVSLINQTGTQA---IG----ICGKDGNMVFAE  135 (150)
Q Consensus        81 ----~~-----~v~G~RVTd~~tl~iv~~v---------L~G~vn~~lv~~L~~~g~~a---vg----lsg~dg~li~a~  135 (150)
                          ++     ...|.|.-=||-=++...+         |+..++.+++..|-.+|=..   +.    .|.=|-.++.++
T Consensus       124 ~P~vsQtvilTR~eG~RtPmPe~E~l~~lA~hgaTm~IfLs~~~~~~vv~~L~~~GY~~DTPV~vVyratWPDE~I~Rgt  203 (252)
T TIGR01465       124 VPEVSQTVILTRAEGRRTPMPEGEKLADLAKHGATMAIFLSAHIIDKVVKELIEGGYSEDTPVAVVYRATWPDEKIVRGT  203 (252)
T ss_pred             CCCCEEEEEEECCCCCCCCCCCHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEECCCCCEEEECC
T ss_conf             88403424467543435457766789988741231356778999999999998377688887898985148756589715


No 99 
>pfam08282 Hydrolase_3 haloacid dehalogenase-like hydrolase. This family contains haloacid dehalogenase-like hydrolase enzymes.
Probab=54.49  E-value=19  Score=17.49  Aligned_cols=26  Identities=15%  Similarity=0.247  Sum_probs=10.9

Q ss_pred             CCHHHHHHHHHHHHCCCEEEEEECHH
Q ss_conf             62289999999987598899986756
Q 537021.9.peg.3   10 AEILEQVLPFVQFYENETIVVKYGGH   35 (150)
Q Consensus        10 a~~l~ea~pYi~~~r~ktiVIK~GG~   35 (150)
                      +..+.++.|+++++.-...+|-..|.
T Consensus        39 GR~~~~~~~~~~~l~~~~~~I~~NGa   64 (254)
T pfam08282        39 GRPYRGALPVLEELGLDLPVICFNGA   64 (254)
T ss_pred             CCCHHHHHHHHHHCCCCCCEEECCCC
T ss_conf             98799999999980999759977971


No 100
>PRK13385 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase; Provisional
Probab=53.91  E-value=18  Score=17.64  Aligned_cols=59  Identities=17%  Similarity=0.300  Sum_probs=27.3

Q ss_pred             CCCHHHHHHHHHHHHC--CCEEEEEECHHHHCCHHHHHHHHHHHHHHHHC----CCCEEEEE-CCCHHHHH
Q ss_conf             0622899999999875--98899986756724746899999999998518----97179984-57245788
Q 537021.9.peg.3    9 QAEILEQVLPFVQFYE--NETIVVKYGGHVMNCTDLSKDFVNDIALLKKS----NITPVIVH-GGGPQIGA   72 (150)
Q Consensus         9 ~a~~l~ea~pYi~~~r--~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~----G~k~ViVH-Ggg~qI~~   72 (150)
                      ...++.+...++.+|.  ++.+-+-.||..    . .+++..-+..+..+    .-..|+|| +++|.++.
T Consensus        54 ~~~~~~~~~~~~~~~~~~~~~v~iv~GG~t----R-~~SV~ngl~~l~~~~~~~~~~~VlIHDaARP~vs~  119 (238)
T PRK13385         54 PAQWISHTQDILKKYNITDQRVKVVAGGTD----R-NETIMNIIDHIRNVNGINNDDVIVTHDAVRPFLTQ  119 (238)
T ss_pred             CHHHHHHHHHHHHHCCCCCCCEEEECCCCC----H-HHHHHHHHHHHHHCCCCCCCCEEEEECCCCCCCCH
T ss_conf             788899999999861544476899328986----4-89999999998741147876889995466778999


No 101
>cd01976 Nitrogenase_MoFe_alpha Nitrogenase_MoFe_alpha_II: Nitrogenase MoFe protein, beta subunit. A group of proteins similar to the alpha subunit of the MoFe protein of the molybdenum (Mo-) nitrogenase. The nitrogenase enzyme catalyzes the ATP-dependent reduction of dinitrogen to ammonia. The Mo-nitrogenase is the most widespread and best characterized of these systems.  Mo-nitrogenase consists of the MoFe protein (component 1) and the Fe protein (component 2).  MoFe is an alpha2beta2 tetramer. Each alphabeta pair of MoFe contains one P-cluster (at the alphabeta interface) and, one molecule of iron molybdenum cofactor (FeMoco) contained within the alpha subunit. The Fe protein contains a single [4Fe-4S] cluster.  Electrons are transferred from the [4Fe-4S] cluster of the Fe protein to the P-cluster of the MoFe and in turn to FeMoCo, the site of substrate reduction.
Probab=53.70  E-value=19  Score=17.41  Aligned_cols=67  Identities=9%  Similarity=0.096  Sum_probs=40.1

Q ss_pred             HHHHCCCHHHHHHHHHHHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEE---ECCCHHHHHHHHHCCCC
Q ss_conf             23220622899999999875988999867567247468999999999985189717998---45724578899970898
Q 537021.9.peg.3    5 IYQFQAEILEQVLPFVQFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIV---HGGGPQIGAVLEKMGIK   80 (150)
Q Consensus         5 ~~~~~a~~l~ea~pYi~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViV---HGgg~qI~~~l~~~gi~   80 (150)
                      |.+..+.+..+..+|-.+++||...|-.||.   ...   .++   .+++.+|+++|.+   .|-..-..+..+..+..
T Consensus       280 I~~e~~~~~~~l~~yr~~L~GKrv~i~~Gg~---~~~---~~i---~~~~eLGmevV~~g~~~~~~~Dye~~~~~~~~~  349 (421)
T cd01976         280 IAEYKPAMEAVIAKYRPRLEGKTVMLYVGGL---RPR---HYI---GAYEDLGMEVVGTGYEFAHRDDYERTEVIPKEG  349 (421)
T ss_pred             HHHHHHHHHHHHHHHHHHCCCCEEEEECCCC---CHH---HHH---HHHHHCCCEEEEEEEECCCHHHHHHHHHHCCCC
T ss_conf             9999999999999999870898899989987---468---999---999987988999731227476799998508988


No 102
>PRK09444 pntB pyridine nucleotide transhydrogenase; Provisional
Probab=50.24  E-value=22  Score=17.06  Aligned_cols=39  Identities=26%  Similarity=0.390  Sum_probs=30.2

Q ss_pred             HHHHHHHHHHHCCCCEEEEECCCH----------HHHHHHHHCCCCCEE
Q ss_conf             999999998518971799845724----------578899970898115
Q 537021.9.peg.3   45 DFVNDIALLKKSNITPVIVHGGGP----------QIGAVLEKMGIKSKF   83 (150)
Q Consensus        45 ~~~~dI~ll~~~G~k~ViVHGgg~----------qI~~~l~~~gi~~~~   83 (150)
                      .=.+|.+.+-.--.++|||.|-|=          ++.+.|++.|++.+|
T Consensus       291 ~~~~d~a~~l~~A~~ViIVPGYGmAVAqAQh~v~el~~~L~~~g~~V~f  339 (459)
T PRK09444        291 TTAEEVAEMLKNSHSVIITPGYGMAVAQAQYPVAEITEKLRARGINVRF  339 (459)
T ss_pred             CCHHHHHHHHHCCCEEEEECCHHHHHHHHHHHHHHHHHHHHHCCCEEEE
T ss_conf             3789999999709949996680899998889999999999977985899


No 103
>COG2054 Uncharacterized archaeal kinase related to aspartokinases, uridylate kinases [General function prediction only]
Probab=49.66  E-value=21  Score=17.21  Aligned_cols=73  Identities=23%  Similarity=0.239  Sum_probs=38.0

Q ss_pred             EEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHH--HHHHHHHHHHHHH
Q ss_conf             99867567247468999999999985189717998457245788999708981155783466124--8999999974166
Q 537021.9.peg.3   29 VVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQ--TAEVVEMVLAGSI  106 (150)
Q Consensus        29 VIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~--tl~iv~~vL~G~v  106 (150)
                      |||.||+...+ . ...+.+   .+..++.++++|.|||-..|---+   +.-    -.+..|..  -|.|..|-..|..
T Consensus         3 vVk~~Gs~~~~-~-~~~~~~---ale~~~~~i~iVpGGg~FAd~VR~---id~----~~~lSdsasHwmAI~~Md~~G~~   70 (212)
T COG2054           3 VVKKGGSGVAE-R-AAAVKE---ALENLQRSILIVPGGGIFADLVRK---IDE----EFGLSDSASHWMAITAMDQYGFY   70 (212)
T ss_pred             EEEECCCCHHH-H-HHHHHH---HHHHHCCEEEEECCCHHHHHHHHH---HHH----HCCCCCCHHHHHHHHHHHHHHHH
T ss_conf             68862787088-8-899999---998414608995184289999999---987----45987507889999999998899


Q ss_pred             HHHHHHH
Q ss_conf             8999999
Q 537021.9.peg.3  107 NKKIVSL  113 (150)
Q Consensus       107 n~~lv~~  113 (150)
                      -.++...
T Consensus        71 lad~~~~   77 (212)
T COG2054          71 LADLASR   77 (212)
T ss_pred             HHHHHCC
T ss_conf             9864300


No 104
>TIGR00453 ispD 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase; InterPro: IPR001228 The bacterial ispD protein catalyzes the third step of the deoxyxylulose-5-phosphate pathway (DXP) of isoprenoid biosynthesis; the formation of 4-diphosphocytidyl-2C-methyl-D-erythritol from CTP and 2C-methyl-D-erythritol 4-phosphate.; GO: 0008299 isoprenoid biosynthetic process.
Probab=48.60  E-value=19  Score=17.46  Aligned_cols=34  Identities=21%  Similarity=0.378  Sum_probs=16.2

Q ss_pred             EEEEEECHHHHCCHHHHHHHHHHHHHHHH-CCCCEEEEEC
Q ss_conf             89998675672474689999999999851-8971799845
Q 537021.9.peg.3   27 TIVVKYGGHVMNCTDLSKDFVNDIALLKK-SNITPVIVHG   65 (150)
Q Consensus        27 tiVIK~GG~~l~d~~l~~~~~~dI~ll~~-~G~k~ViVHG   65 (150)
                      -+.+.-||+.    . .+++..=+..+.. -+.+.||||-
T Consensus        70 ~~~~v~GG~~----R-q~SV~~GL~a~~~~~~~~~VlvHD  104 (226)
T TIGR00453        70 VVKIVAGGDT----R-QDSVRNGLKALPERADAEIVLVHD  104 (226)
T ss_pred             CCEEECCCCC----H-HHHHHHHHHHHHHCCCCCEEEEEC
T ss_conf             1158469874----6-899999999876358988289847


No 105
>PTZ00114 Heat shock protein 60; Provisional
Probab=47.93  E-value=24  Score=16.83  Aligned_cols=113  Identities=16%  Similarity=0.174  Sum_probs=54.0

Q ss_pred             HHHHHHCCCEEEEEECHHHH--CCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECC-----CCCC
Q ss_conf             99998759889998675672--4746899999999998518971799845724578899970898115578-----3466
Q 537021.9.peg.3   18 PFVQFYENETIVVKYGGHVM--NCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENG-----LRIT   90 (150)
Q Consensus        18 pYi~~~r~ktiVIK~GG~~l--~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G-----~RVT   90 (150)
                      .++.++.+++.+|++||+.=  .++. .+++-..+...+..=..- +|.|||...-+....+.- .+-.++     .++-
T Consensus       397 erla~lsg~va~I~vGG~te~el~E~-errl~DAL~avraAveeG-IVpGGGaall~as~~L~~-~~~~~~~eq~gi~~~  473 (580)
T PTZ00114        397 ERLAKITGGVALIKVGGISEVEVNEI-KDRIQDALCATKAAVEEG-IVPGGGSALLFASKELDS-VQTDNYDQRVGVNII  473 (580)
T ss_pred             HHHHCCCCCEEEEEECCCCHHHHHHH-HHHHHHHHHHHHHHHHCC-EECCCCHHHHHHHHHHHH-HHCCCHHHHHHHHHH
T ss_conf             77511477579999688728889999-999987899999997658-216860999999999876-225888799999999


Q ss_pred             CHHHHHHHHHHH---HHHHHHHHHHHHHHCCCCEEEEEEEECCEEEE
Q ss_conf             124899999997---41668999999982899768778655898999
Q 537021.9.peg.3   91 DQQTAEVVEMVL---AGSINKKIVSLINQTGTQAIGICGKDGNMVFA  134 (150)
Q Consensus        91 d~~tl~iv~~vL---~G~vn~~lv~~L~~~g~~avglsg~dg~li~a  134 (150)
                       .++|++.-..|   +|.=-..++..|.+.+-...|+....|.+.-.
T Consensus       474 -a~ALe~ppr~IaeNAG~d~~~vl~~l~~~~~~~~G~D~~tG~~~Dm  519 (580)
T PTZ00114        474 -KDACKAPIKQIAENAGHEGSVVAGNILKEKNKNIGFNAQEGKYVDM  519 (580)
T ss_pred             -HHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCEEEECCCCEEEEH
T ss_conf             -9999999999999769978999999996069998486789907356


No 106
>PRK00694 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase; Validated
Probab=47.57  E-value=23  Score=16.92  Aligned_cols=53  Identities=21%  Similarity=0.202  Sum_probs=20.8

Q ss_pred             HHHHHHHHHHHHCCCCEEEEECCC-------HHHHHHHHHCCCCCEEECCCCCCCHHHHH
Q ss_conf             999999999851897179984572-------45788999708981155783466124899
Q 537021.9.peg.3   44 KDFVNDIALLKKSNITPVIVHGGG-------PQIGAVLEKMGIKSKFENGLRITDQQTAE   96 (150)
Q Consensus        44 ~~~~~dI~ll~~~G~k~ViVHGgg-------~qI~~~l~~~gi~~~~v~G~RVTd~~tl~   96 (150)
                      ++-+++|..|...|..+|=|-=-+       +.|.+.|.+.|++...+..+-+....++.
T Consensus        45 ~atv~Qi~~l~~aGceiVRvtvp~~~~A~al~~I~~~l~~~~~~iPlvADIHF~~~~a~~  104 (606)
T PRK00694         45 DGTVRQICALQEHGCDIVRVTVQGIKEAQACEHIKERLIQQGISIPLVADIHFFPQAAMH  104 (606)
T ss_pred             HHHHHHHHHHHHCCCCEEEECCCCHHHHHHHHHHHHHHHHCCCCCCEEEECCCCHHHHHH
T ss_conf             999999999998599989987799999986999999986369999988666888899999


No 107
>pfam02233 PNTB NAD(P) transhydrogenase beta subunit. This family corresponds to the beta subunit of NADP transhydrogenase in prokaryotes, and either the protein N- or C terminal in eukaryotes. The domain is often found in conjunction with pfam01262. Pyridine nucleotide transhydrogenase catalyses the reduction of NAD+ to NADPH. A complete loss of activity occurs upon mutation of Gly314 in E. coli.
Probab=47.27  E-value=24  Score=16.77  Aligned_cols=43  Identities=26%  Similarity=0.342  Sum_probs=32.8

Q ss_pred             HHHHHHHHHHHCCCCEEEEECCCH----------HHHHHHHHCCCCCEE----ECCC
Q ss_conf             999999998518971799845724----------578899970898115----5783
Q 537021.9.peg.3   45 DFVNDIALLKKSNITPVIVHGGGP----------QIGAVLEKMGIKSKF----ENGL   87 (150)
Q Consensus        45 ~~~~dI~ll~~~G~k~ViVHGgg~----------qI~~~l~~~gi~~~~----v~G~   87 (150)
                      .=.+|.+.+-.--.++|||.|-|=          ++.+.|+++|++.+|    +.||
T Consensus       296 ~~~~d~a~~l~~A~~ViIVPGYGmAvAqAQ~~v~el~~~L~~~g~~V~faIHPVAGR  352 (464)
T pfam02233       296 ITAEEAADLLLNASSVIIVPGYGMAVAQAQHPVADLAKLLEERGVNVRFAIHPVAGR  352 (464)
T ss_pred             CCHHHHHHHHHCCCEEEEECCCHHHHHHHHHHHHHHHHHHHHCCCEEEEEECCCCCC
T ss_conf             489999999971993999778089999878999999999997898699985766888


No 108
>pfam01973 MAF_flag10 Protein of unknown function DUF115. This family of archaebacterial proteins has no known function.
Probab=46.58  E-value=25  Score=16.70  Aligned_cols=27  Identities=11%  Similarity=0.362  Sum_probs=16.9

Q ss_pred             CCHHHHHHHHHHHHCCCEEEEEECHHH
Q ss_conf             622899999999875988999867567
Q 537021.9.peg.3   10 AEILEQVLPFVQFYENETIVVKYGGHV   36 (150)
Q Consensus        10 a~~l~ea~pYi~~~r~ktiVIK~GG~~   36 (150)
                      +|.|.+..+||++++++-++|-.+.+.
T Consensus        33 GPSL~~~i~~Lk~~~~~~~iia~~~a~   59 (169)
T pfam01973        33 GPSLDKHLPLLKKNRDKAVIIAADSAL   59 (169)
T ss_pred             CCCHHHHHHHHHHHCCCEEEEEECHHH
T ss_conf             899899999999716983999966799


No 109
>PTZ00063 histone deacetylase; Provisional
Probab=46.36  E-value=25  Score=16.68  Aligned_cols=118  Identities=16%  Similarity=0.290  Sum_probs=69.8

Q ss_pred             CHHHHH-HHHHHHHCCCEEEEEECHHHHCCHH------HHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHH--------
Q ss_conf             228999-9999987598899986756724746------8999999999985189717998457245788999--------
Q 537021.9.peg.3   11 EILEQV-LPFVQFYENETIVVKYGGHVMNCTD------LSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLE--------   75 (150)
Q Consensus        11 ~~l~ea-~pYi~~~r~ktiVIK~GG~~l~d~~------l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~--------   75 (150)
                      .+|+.+ .|-+..|+=+.||+-+|.+.+..+.      ....-.+-+..+++.++.++++=|||--+.....        
T Consensus       237 ~iF~~vi~~v~~~f~P~~IVlQcGaDsl~gD~Lg~fnls~~gh~~cv~~~~~~~~p~l~lGGGGY~~~nvarcW~yeT~~  316 (439)
T PTZ00063        237 DLFKVVIGKCVEVYRPGAIVLQCGADSLTGDRLGRFNLTIKGHARCVEFVRSLNIPLLVLGGGGYTIRNVARCWAYETGV  316 (439)
T ss_pred             HHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCEEEECCCCCCCCHHHHHHHHHHHH
T ss_conf             99999999999986999799946765446784565324658799999999960998899767887851299999999999


Q ss_pred             HCC----C----C---------CEEE---CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH-CCCCEEEEEEEECCEEE
Q ss_conf             708----9----8---------1155---78346612489999999741668999999982-89976877865589899
Q 537021.9.peg.3   76 KMG----I----K---------SKFE---NGLRITDQQTAEVVEMVLAGSINKKIVSLINQ-TGTQAIGICGKDGNMVF  133 (150)
Q Consensus        76 ~~g----i----~---------~~~v---~G~RVTd~~tl~iv~~vL~G~vn~~lv~~L~~-~g~~avglsg~dg~li~  133 (150)
                      .+|    |    +         |.|.   .--...+..+-+..+     ++...+...|+. .+.++|.+.-+--.++.
T Consensus       317 ~l~~~~~i~~~iP~~~y~~~~~pd~~L~~~~~~~~n~Nt~~~l~-----~i~~~~~e~l~~~~~apsvq~~~~p~d~~~  390 (439)
T PTZ00063        317 ILNKHHEMSDQISLNDYYDYYAPDFQLHLQPSNMPNYNSPEHLD-----KIKMKIIDNLRYVEHAPGVQFAYVPPDFFD  390 (439)
T ss_pred             HHCCCCCCCCCCCCCCCHHHHCCCEEEECCCCCCCCCCCHHHHH-----HHHHHHHHHHHCCCCCCCCCCCCCCCCCCC
T ss_conf             94886668898998735355289736306987776789899999-----999999999716899897663548897567


No 110
>TIGR00656 asp_kin_monofn asparate kinase, monofunctional class; InterPro: IPR005260   Aspartate kinase catalyzes the first step in the biosynthesis of Lys (via its precursor diaminopimelate), Met, and Thr. In E. coli, a distinct isozyme is inhibited by each of these three amino acid end products . The Met-sensitive (I) and Thr-sensitive (II) forms are bifunctional enzymes fused to homoserine dehydrogenase, which catalyses the next step in the biosynthesis of these amino acids. Inn contrast, the Lys-sensitive form (III) is a monofunctional enzyme; homoserine dehydrogenase is not part of the Lys biosynthetic pathway.    This entry represents a subclass of aspartate kinases that are mostly Lys-sensitive and are not fused to homoserine dehydrogenase. The E. coli enzyme is a homodimer, while the Bacillus and Corynebacterium enzymes are alpha 2/beta 2 heterotetramers, where the beta subunit is translated from an in-phase alternative initiator at Met-246 , . The protein slr0657 from Synechocystis PCC6803 is extended by a duplication of the C-terminal region corresponding to the beta chain. Incorporation of a second copy of the C-terminal domain may be quite common in this subgroup of aspartokinases. ; GO: 0004072 aspartate kinase activity, 0009058 biosynthetic process.
Probab=45.35  E-value=26  Score=16.58  Aligned_cols=54  Identities=20%  Similarity=0.204  Sum_probs=40.7

Q ss_pred             CCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCC
Q ss_conf             988999867567247468999999999985189717998457245788999708
Q 537021.9.peg.3   25 NETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMG   78 (150)
Q Consensus        25 ~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~g   78 (150)
                      .+++|.|+||..+.+.+-+.+.++-+......|.++|+|--+-...+..+-...
T Consensus         1 ~~~~v~kfGGt~~~~~~~~~~~~~~~~~~~~~g~~~~vv~sa~~~~td~l~~~~   54 (480)
T TIGR00656         1 MELIVQKFGGTSVGSGERIKNAARIVLKEKEEGHKVVVVVSAMGGVTDALLSLA   54 (480)
T ss_pred             CCEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEECCCCCCHHHHHHH
T ss_conf             944787426721254267888887888776407714899861565202466554


No 111
>cd01974 Nitrogenase_MoFe_beta Nitrogenase_MoFe_beta: Nitrogenase MoFe protein, beta subunit. The nitrogenase enzyme catalyzes the ATP-dependent reduction of dinitrogen to ammonia. The Molybdenum (Mo-) nitrogenase is the most widespread and best characterized of these systems.  Mo-nitrogenase consists of the MoFe protein (component 1) and the Fe protein (component 2).  MoFe is an alpha2beta2 tetramer. This group contains the beta subunit of the MoFe protein. Each alphabeta pair of MoFe contains one P-cluster (at the alphabeta interface) and, one molecule of iron molybdenum cofactor (FeMoco) contained within the alpha subunit. The Fe protein contains a single [4Fe-4S] cluster.  Electrons are transferred from the [4Fe-4S] cluster of the Fe protein to the P-cluster of the MoFe and in turn to FeMoCo, the site of substrate reduction.
Probab=45.21  E-value=26  Score=16.57  Aligned_cols=65  Identities=15%  Similarity=0.320  Sum_probs=38.1

Q ss_pred             CHHHHHCCCHHHHHHHHHHHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEE--ECCCHHHHHHHHH
Q ss_conf             0023220622899999999875988999867567247468999999999985189717998--4572457889997
Q 537021.9.peg.3    3 EKIYQFQAEILEQVLPFVQFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIV--HGGGPQIGAVLEK   76 (150)
Q Consensus         3 ~~~~~~~a~~l~ea~pYi~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViV--HGgg~qI~~~l~~   76 (150)
                      +++.++++.+......|-..+.||.+.|- |+.     ...-.+.   .+|..+|+.++.|  ..+.+...+.+++
T Consensus       281 ~~i~~er~r~~d~~~d~~~~l~gkrvai~-g~~-----~~~~~l~---~~l~elG~~~~~vv~~~~~~~~~~~i~~  347 (435)
T cd01974         281 EELEEERGRLVDAMTDSHQYLHGKKFALY-GDP-----DFLIGLT---SFLLELGMEPVHVLTGNGGKRFEKEMQA  347 (435)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHCCCEEEEE-CCH-----HHHHHHH---HHHHHCCCEEEEEEECCCCHHHHHHHHH
T ss_conf             99999999999999999887429679998-882-----8999999---9999889978999979997789999999


No 112
>TIGR00705 SppA_67K signal peptide peptidase SppA, 67K type; InterPro: IPR004634   Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes . They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence . Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases .   Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base . The geometric orientations of the catalytic residues are similar between families, despite different protein folds . The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) , .   Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins.   Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule.    This group of serine peptidases belong to MEROPS peptidase family S49 (protease IV family, clan S-). The predicted active site serine for members of this family occurs in a transmembrane domain.   Signal peptides of secretory proteins seem to serve at least two important biological functions. First, they are required for protein targeting to and translocation across membranes, such as the eubacterial plasma membrane and the endoplasmic reticular membrane of eukaryotes. Second, in addition to their role as determinants for protein targeting and translocation, certain signal peptides have a signalling function.  During or shortly after pre-protein translocation, the signal peptide is removed by signal peptidases. The integral membrane protein, SppA (protease IV), of Escherichia coli was shown experimentally to degrade signal peptides. The member of this family from Bacillus subtilis has only been shown to be required for efficient processing of pre-proteins under conditions of hyper-secretion . These enzymes have a molecular mass around 67 kDa and a duplication such that the N-terminal half shares extensive homology with the C-terminal half and was shown in E. coli to form homotetramers. E. coli SohB, which is most closely homologous to the C-terminal duplication of SppA, is predicted to perform a similar function of small peptide degradation, but in the periplasm. Many prokaryotes have a single SppA/SohB homolog that may perform the function of either or both. ; GO: 0009003 signal peptidase activity, 0006465 signal peptide processing, 0016021 integral to membrane.
Probab=45.17  E-value=15  Score=18.17  Aligned_cols=75  Identities=23%  Similarity=0.316  Sum_probs=37.2

Q ss_pred             CHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCC--------CHHHHHHHHHHHHH
Q ss_conf             7567247468999999999985189717998457245788999708981155783466--------12489999999741
Q 537021.9.peg.3   33 GGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRIT--------DQQTAEVVEMVLAG  104 (150)
Q Consensus        33 GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVT--------d~~tl~iv~~vL~G  104 (150)
                      ||++..++-    +=+.++.++..|.|||||==|+=..+             .|+.|.        +|.|+-=-.-++  
T Consensus       371 GGsv~Ase~----IR~e~~~~~~~GkKPVivSMG~~AAS-------------GgYWiasaA~yIvA~p~TiTGSIGvf--  431 (614)
T TIGR00705       371 GGSVFASEI----IRRELERLQARGKKPVIVSMGAMAAS-------------GGYWIASAADYIVADPNTITGSIGVF--  431 (614)
T ss_pred             CCCEEHHHH----HHHHHHHHHHCCCCCEEEECCHHHHC-------------CCCHHCCCCCEEEECCCCCCCCHHHH--
T ss_conf             863428789----99999999826899789843502320-------------53002045571334787431001445--


Q ss_pred             HHHHHHHHHHHH-CCCCEEEEEE
Q ss_conf             668999999982-8997687786
Q 537021.9.peg.3  105 SINKKIVSLINQ-TGTQAIGICG  126 (150)
Q Consensus       105 ~vn~~lv~~L~~-~g~~avglsg  126 (150)
                      .|=+..+..+.+ .|++.=+++.
T Consensus       432 svl~t~En~~~~y~Gv~~D~V~t  454 (614)
T TIGR00705       432 SVLPTVENSLDRYIGVHVDGVST  454 (614)
T ss_pred             HHHHHHHHHHHCEECEEECCEEE
T ss_conf             32522554231000223163342


No 113
>PRK08605 D-lactate dehydrogenase; Validated
Probab=44.96  E-value=27  Score=16.54  Aligned_cols=36  Identities=17%  Similarity=0.158  Sum_probs=24.5

Q ss_pred             HHHHHCCCCEEEEECCCHH-HH-HHHHHCCCCCEEECC
Q ss_conf             9985189717998457245-78-899970898115578
Q 537021.9.peg.3   51 ALLKKSNITPVIVHGGGPQ-IG-AVLEKMGIKSKFENG   86 (150)
Q Consensus        51 ~ll~~~G~k~ViVHGgg~q-I~-~~l~~~gi~~~~v~G   86 (150)
                      ..+..+|+|+|--+|.|-- || ..++++||.....-|
T Consensus        63 ~~~~~l~iK~I~r~gvG~DnIDl~~a~~~gI~V~n~P~  100 (332)
T PRK08605         63 KLLNELGIKQIAQRSAGFDTYDLELANKYNIIISNVPS  100 (332)
T ss_pred             HHHHHCCCEEEEECCCCCCHHHHHHHHHCCCEEEECCC
T ss_conf             96663697799977721745469999979998995998


No 114
>TIGR02078 AspKin_pair Pyrococcus aspartate kinase subunit, putative; InterPro: IPR011819    This family consists of proteins restricted to and found as paralogous pairs (typically close together) in species of Pyrococcus, a hyperthermophilic archaeal genus. Members are always found close to other genes of threonine biosynthesis and appear to represent the Pyrococcal form of aspartate kinase. Alignment to aspartokinase III from Escherichia coli shows that 300 N-terminal and 20 C-terminal amino acids are homologous, but the form in Pyrococcus lacks ~100 amino acids in between..
Probab=43.97  E-value=27  Score=16.45  Aligned_cols=46  Identities=30%  Similarity=0.398  Sum_probs=27.4

Q ss_pred             EEEEEECHHHHCCHHHHHHHHHHHHHHHHC--CCCEEEEECCCHHHHHHHHHC
Q ss_conf             899986756724746899999999998518--971799845724578899970
Q 537021.9.peg.3   27 TIVVKYGGHVMNCTDLSKDFVNDIALLKKS--NITPVIVHGGGPQIGAVLEKM   77 (150)
Q Consensus        27 tiVIK~GG~~l~d~~l~~~~~~dI~ll~~~--G~k~ViVHGgg~qI~~~l~~~   77 (150)
                      .+|+|+||+-+.     .+|-+.+.|...+  ..++|+|--+=.=|+-.|-+.
T Consensus         1 ~iV~KFGGsSvr-----~~F~eA~~L~~~L~E~~~v~VVVSALkGiTD~L~~y   48 (341)
T TIGR02078         1 MIVVKFGGSSVR-----YDFEEALELVKSLSEEKEVVVVVSALKGITDELIRY   48 (341)
T ss_pred             CEEEEECCCCHH-----HHHHHHHHHHHHHCCCCCEEEEEECCCCHHHHHHHH
T ss_conf             968985684136-----668999999998516986899985344425799987


No 115
>PRK10637 cysG siroheme synthase; Provisional
Probab=42.73  E-value=29  Score=16.33  Aligned_cols=69  Identities=19%  Similarity=0.172  Sum_probs=29.4

Q ss_pred             HHHHHHHHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHH-------HHHHHCCCCCEEECC
Q ss_conf             999999987598899986756724746899999999998518971799845724578-------899970898115578
Q 537021.9.peg.3   15 QVLPFVQFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIG-------AVLEKMGIKSKFENG   86 (150)
Q Consensus        15 ea~pYi~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~-------~~l~~~gi~~~~v~G   86 (150)
                      +.+.|+.+...++.|=|-.|..-..   .+.+.+-+......|.++|=.|||-|.|=       +.|.+.||+.+.+-|
T Consensus       254 ~il~~~~~~ae~i~vgK~~g~~~~~---q~~i~~llv~~a~~G~~VvRLk~GDP~ifGr~~EE~~~l~~~Gi~~eVvPG  329 (457)
T PRK10637        254 DIMNLVRRDADRVFVGKRAGYHCVP---QEEINQILLREAQKGKRVVRLKGGDPFIFGRGGEELETLCNAGIPFSVVPG  329 (457)
T ss_pred             HHHHHCCCCCEEEEEECCCCCCCCC---HHHHHHHHHHHHHCCCEEEEEECCCCCCCCCHHHHHHHHHHCCCCEEEECC
T ss_conf             9996447898899960678888878---999999999998589979998489986434389999999978998899899


No 116
>TIGR02472 sucr_P_syn_N sucrose-phosphate synthase, putative, glycosyltransferase domain; InterPro: IPR012822    This family consists of the N-terminal regions, or in some cases the entirety, of bacterial proteins closely related to plant sucrose-phosphate synthases (SPS). The C-terminal domain (IPR012821 from INTERPRO), found with most members of this family, resembles both bona fide plant sucrose-phosphate phosphatases (SPP) and the SPP-like domain of plant sucrose phosphate synthase (SPS). At least two members of this family lack the SPP-like domain, which may have binding or regulatory rather than enzymatic activity by analogy to plant SPS. This enzyme produces sucrose 6-phosphate and UDP from UDP-glucose and D-fructose 6-phosphate, and may be encoded near the gene for fructokinase..
Probab=42.43  E-value=15  Score=18.05  Aligned_cols=78  Identities=17%  Similarity=0.217  Sum_probs=63.2

Q ss_pred             HHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCC--CCCCHHHHHHHHHHHHHHHHHHHHH
Q ss_conf             67247468999999999985189717998457245788999708981155783--4661248999999974166899999
Q 537021.9.peg.3   35 HVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGL--RITDQQTAEVVEMVLAGSINKKIVS  112 (150)
Q Consensus        35 ~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~--RVTd~~tl~iv~~vL~G~vn~~lv~  112 (150)
                      .+..+|.|.+.|==.+-.-.+.|..+|=---|||+        .|-.+..||+  =+||.|.++.+-|- +. -+.+=..
T Consensus       348 GiFVNPALTEPFGLTLlEAAAcGLPivAT~DGGP~--------dI~~~C~NGLLvd~ld~e~i~~AL~~-al-sd~~QW~  417 (445)
T TIGR02472       348 GIFVNPALTEPFGLTLLEAAACGLPIVATEDGGPR--------DIIANCRNGLLVDVLDLEAIASALEQ-AL-SDSSQWQ  417 (445)
T ss_pred             CEEECCCCCCCCHHHHHHHHHCCCCEEECCCCCCH--------HHHHHCCCCCEECCCCHHHHHHHHHH-HC-CCHHHHH
T ss_conf             86762721253016899999769972107864866--------88842888750057898999999997-33-8906678


Q ss_pred             HHHHCCCCEE
Q ss_conf             9982899768
Q 537021.9.peg.3  113 LINQTGTQAI  122 (150)
Q Consensus       113 ~L~~~g~~av  122 (150)
                      .-+++|+.+|
T Consensus       418 ~Ws~NGi~gV  427 (445)
T TIGR02472       418 TWSDNGIEGV  427 (445)
T ss_pred             HHHHHHHHHH
T ss_conf             9998557764


No 117
>COG1211 IspD 4-diphosphocytidyl-2-methyl-D-erithritol synthase [Lipid metabolism]
Probab=42.08  E-value=29  Score=16.26  Aligned_cols=20  Identities=20%  Similarity=0.361  Sum_probs=8.8

Q ss_pred             HCCCCEEEEECCCHHHHHHH
Q ss_conf             18971799845724578899
Q 537021.9.peg.3   55 KSNITPVIVHGGGPQIGAVL   74 (150)
Q Consensus        55 ~~G~k~ViVHGgg~qI~~~l   74 (150)
                      ..+.++.+|+||....+..+
T Consensus        70 ~~~~~v~~v~GG~~R~~SV~   89 (230)
T COG1211          70 SADKRVEVVKGGATRQESVY   89 (230)
T ss_pred             CCCCEEEEECCCCCHHHHHH
T ss_conf             24870799348751799999


No 118
>cd03134 GATase1_PfpI_like A type 1 glutamine amidotransferase (GATase1)-like domain found in PfpI from Pyrococcus furiosus. A type 1 glutamine amidotransferase (GATase1)-like domain found in PfpI from Pyrococcus furiosus.   This group includes proteins similar to PfpI from P.  furiosus. and PH1704 from Pyrococcus horikoshii. These enzymes are ATP-independent intracellular proteases and may hydrolyze small peptides to provide a nutritional source.  Only Cys of the catalytic triad typical of GATase1 domains is conserved in this group. This Cys residue is found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow.  For PH1704, it is believed that this Cys together with a different His in one monomer and Glu (from an adjacent monomer) forms a different catalytic triad from the typical GATase1domain.  PfpI is homooligomeric. Protease activity is only found for oligomeric forms of PH1704.
Probab=41.88  E-value=30  Score=16.24  Aligned_cols=67  Identities=19%  Similarity=0.291  Sum_probs=37.0

Q ss_pred             HHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEE-EE
Q ss_conf             99999985189717998457245788999708981155783466124899999997416689999999828997687-78
Q 537021.9.peg.3   47 VNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQTAEVVEMVLAGSINKKIVSLINQTGTQAIG-IC  125 (150)
Q Consensus        47 ~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~tl~iv~~vL~G~vn~~lv~~L~~~g~~avg-ls  125 (150)
                      .+=+...+..| ++|---..||.+   |.+.|    ..+|+|+|.-..+               ...|...|.+-+. --
T Consensus        84 ~~~i~~~~~~~-k~iaaIC~gp~~---L~~~g----ll~g~k~T~~~~~---------------~~~l~~~G~~~~~~~V  140 (165)
T cd03134          84 VAFVRAFAEAG-KPVAAICHGPWV---LISAG----VVRGRKLTSYPSI---------------KDDLINAGANWVDEEV  140 (165)
T ss_pred             HHHHHHHHHHC-CCEEEECCHHHH---HHHCC----CCCCCEEEECCCH---------------HHHHHHCCCEEECCCE
T ss_conf             99999999719-929999875787---74578----2589889867748---------------9999987997946998


Q ss_pred             EEECCEEEEEE
Q ss_conf             65589899997
Q 537021.9.peg.3  126 GKDGNMVFAEK  136 (150)
Q Consensus       126 g~dg~li~a~~  136 (150)
                      .+|||+|+++=
T Consensus       141 vvDgnlIT~~g  151 (165)
T cd03134         141 VVDGNLITSRN  151 (165)
T ss_pred             EEECCEEECCC
T ss_conf             99699997958


No 119
>COG2378 Predicted transcriptional regulator [Transcription]
Probab=41.01  E-value=25  Score=16.66  Aligned_cols=80  Identities=19%  Similarity=0.253  Sum_probs=53.9

Q ss_pred             HCCCHHHHHHHHHHHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEEC--CCHHHHHHHHHCCCCCEEE-
Q ss_conf             2062289999999987598899986756724746899999999998518971799845--7245788999708981155-
Q 537021.9.peg.3    8 FQAEILEQVLPFVQFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHG--GGPQIGAVLEKMGIKSKFE-   84 (150)
Q Consensus         8 ~~a~~l~ea~pYi~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHG--gg~qI~~~l~~~gi~~~~v-   84 (150)
                      .+|.-|-+..+|++.++. +-+=.+-...=   -..+.+.+||+.|+..|+.+.--.|  ||..+-         +.|. 
T Consensus         5 ~ka~RL~~ii~~L~~~~~-vta~~lA~~~~---VS~RTi~RDi~~L~~~gvPI~~e~G~~~gy~~~---------~~~~L   71 (311)
T COG2378           5 RKAERLLQIIQILRAKET-VTAAELADEFE---VSVRTIYRDIATLRAAGVPIEGERGKGGGYRLR---------PGFKL   71 (311)
T ss_pred             HHHHHHHHHHHHHHHCCC-CHHHHHHHHHC---CCHHHHHHHHHHHHHCCCCEEECCCCCCCEEEC---------CCCCC
T ss_conf             489999999999985760-45999998729---889999999999997699765325886318960---------57778


Q ss_pred             CCCCCCCHHHHHHHHH
Q ss_conf             7834661248999999
Q 537021.9.peg.3   85 NGLRITDQQTAEVVEM  100 (150)
Q Consensus        85 ~G~RVTd~~tl~iv~~  100 (150)
                      ..+-+|.+|...++-.
T Consensus        72 ~pl~ft~~E~~Al~~~   87 (311)
T COG2378          72 PPLMFTEEEAEALLLA   87 (311)
T ss_pred             CCCCCCHHHHHHHHHH
T ss_conf             8666898799999999


No 120
>PRK10017 putative pyruvyl transferase; Provisional
Probab=40.89  E-value=30  Score=16.19  Aligned_cols=51  Identities=22%  Similarity=0.308  Sum_probs=33.2

Q ss_pred             HHHHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEE--ECCCHHHH
Q ss_conf             999875988999867567247468999999999985189717998--45724578
Q 537021.9.peg.3   19 FVQFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIV--HGGGPQIG   71 (150)
Q Consensus        19 Yi~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViV--HGgg~qI~   71 (150)
                      +++..+..-.||-.||+.+.|......|.-  +++..+..|+|+.  +|-||.-+
T Consensus       111 ~~~~L~~~D~vIs~GGs~~~D~yg~~~~~~--~L~a~l~kKpv~~~aQgIGP~~~  163 (426)
T PRK10017        111 FVRLLSGYDAIIQVGGSFFVDLYGVPQFEH--ALCTFMAKKPLFMIGHSVGPFQD  163 (426)
T ss_pred             HHHHHHHCCEEEEECCCCCCCCCCCCHHHH--HHHHHHCCCCEEEEECCCCCCCC
T ss_conf             999986547899717762014768521689--99999739968999044688087


No 121
>pfam00850 Hist_deacetyl Histone deacetylase domain. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related to other proteins.
Probab=40.37  E-value=31  Score=16.09  Aligned_cols=62  Identities=19%  Similarity=0.239  Sum_probs=37.0

Q ss_pred             HHHHH-HHHHHHHCCCEEEEEECHHHHCCH---------HHHHHHHHHHHHH-HHCCCCEEEEECCCHHHHHH
Q ss_conf             28999-999998759889998675672474---------6899999999998-51897179984572457889
Q 537021.9.peg.3   12 ILEQV-LPFVQFYENETIVVKYGGHVMNCT---------DLSKDFVNDIALL-KKSNITPVIVHGGGPQIGAV   73 (150)
Q Consensus        12 ~l~ea-~pYi~~~r~ktiVIK~GG~~l~d~---------~l~~~~~~dI~ll-~~~G~k~ViVHGgg~qI~~~   73 (150)
                      +|+++ .|.+++|+-+.+||-.|-++..++         +-+..+.+.+..+ ...+.++++|-|||-..+..
T Consensus       218 ~~~~~l~~~~~~f~Pd~iv~saG~D~~~~Dplg~~~ls~~~~~~~~~~v~~~a~~~~~pv~~vleGGY~~~~l  290 (302)
T pfam00850       218 AFEEILLPLLREFQPDLILVSAGFDAHAGDPLGGLNLTTEGYAERTRLLLELADEYGGPVVSVLEGGYNLDAL  290 (302)
T ss_pred             HHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCEEECHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCHHHH
T ss_conf             9999999999720988899945634557887767206899999999999999987599989997787883689


No 122
>PRK10513 sugar phosphatase; Provisional
Probab=39.96  E-value=32  Score=16.05  Aligned_cols=65  Identities=17%  Similarity=0.290  Sum_probs=47.1

Q ss_pred             CEEEEEECHHHHCCH-HHHHHHHHHHHHHHHCCCCEEEEECCCH-HHHHHHHHCCCCCE--E---ECCCCCC
Q ss_conf             889998675672474-6899999999998518971799845724-57889997089811--5---5783466
Q 537021.9.peg.3   26 ETIVVKYGGHVMNCT-DLSKDFVNDIALLKKSNITPVIVHGGGP-QIGAVLEKMGIKSK--F---ENGLRIT   90 (150)
Q Consensus        26 ktiVIK~GG~~l~d~-~l~~~~~~dI~ll~~~G~k~ViVHGgg~-qI~~~l~~~gi~~~--~---v~G~RVT   90 (150)
                      |.++.=+.|..+.+. .+.+...+.|..|+..|+++++.-|=.+ .+...+++++++..  |   .||-.|.
T Consensus         4 Kli~~DlDGTLl~~~~~i~~~~~~ai~~l~~~Gi~~~iaTGR~~~~~~~~~~~l~~~~~~~~~I~~NGa~i~   75 (270)
T PRK10513          4 KLIAIDMDGTLLLPDHTISPAVKNAIAAARAKGVNVVLTTGRPYAGVHRYLKELHMEQPGDYCITYNGALVQ   75 (270)
T ss_pred             CEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHHCCCCCCCEEEECCCEEEE
T ss_conf             299993662525897968999999999999789999999799868789999983888899889985985999


No 123
>PRK09034 aspartate kinase; Reviewed
Probab=39.92  E-value=32  Score=16.05  Aligned_cols=34  Identities=29%  Similarity=0.419  Sum_probs=22.1

Q ss_pred             EEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEE
Q ss_conf             999867567247468999999999985189717998
Q 537021.9.peg.3   28 IVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIV   63 (150)
Q Consensus        28 iVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViV   63 (150)
                      .|.||||+-+.+.+.++.+++ |..- ....++|+|
T Consensus         2 ~V~KFGGtSv~~~~~i~~v~~-ii~~-~~~~~vVVV   35 (450)
T PRK09034          2 KVVKFGGSSLASAEQFKKVLN-IVKS-DPERKIVVV   35 (450)
T ss_pred             EEEEECCCCCCCHHHHHHHHH-HHHC-CCCCEEEEE
T ss_conf             899878313899999999999-9962-789839999


No 124
>cd03147 GATase1_Ydr533c_like Type 1 glutamine amidotransferase (GATase1)-like domain found in Saccharomyces cerevisiae Ydr533c protein. Type 1 glutamine amidotransferase (GATase1)-like domain found in Saccharomyces cerevisiae Ydr533c protein.  This group includes proteins similar to S. cerevisiae Ydr533c.  Ydr533c is upregulated in response to various stress conditions along with the heat shock family.  The catalytic triad typical of GATase1domains is not conserved in this GATase1-like domain. However, in common with a typical GATase1domain, a reactive Cys residue is found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow. This Cys together with a different His and Glu residue form a different catalytic triad from the typical GATase1domain.  Ydr533c protein is a homodimer.
Probab=37.93  E-value=34  Score=15.85  Aligned_cols=102  Identities=19%  Similarity=0.206  Sum_probs=49.8

Q ss_pred             EEEECHHH-HC----CHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHHHHHHHHHHH
Q ss_conf             99867567-24----74689999999999851897179984572457889997089811557834661248999999974
Q 537021.9.peg.3   29 VVKYGGHV-MN----CTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQTAEVVEMVLA  103 (150)
Q Consensus        29 VIK~GG~~-l~----d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~tl~iv~~vL~  103 (150)
                      |.--||+. |-    |++ ...++++   .+..| ++|-.-.=||..=..++...=....+.|+|+|.-..-+-...-+.
T Consensus        98 Vf~pGGhG~m~Dl~~~~~-l~~ll~~---~~~~g-k~vaaVCHGpaaLl~~~~~~dG~~lv~G~~vTgFsn~EE~~~~~~  172 (231)
T cd03147          98 FFVAGGHGTLFDFPHATN-LQKIAQQ---IYANG-GVVAAVCHGPAILANLKDPKTGKPLIKGKTVTGFTDKGEEIMGVM  172 (231)
T ss_pred             EEECCCCCCCCCCCCCHH-HHHHHHH---HHHCC-CEEEEECCHHHHHHHCCCCCCCCEEECCCEEEECCCHHHHHHHHH
T ss_conf             998898853032423999-9999999---99759-979998068899761636789963556867872185898875256


Q ss_pred             HHHH----HHHHHHHHHCCCCEE-------EEEEEECCEEEEE
Q ss_conf             1668----999999982899768-------7786558989999
Q 537021.9.peg.3  104 GSIN----KKIVSLINQTGTQAI-------GICGKDGNMVFAE  135 (150)
Q Consensus       104 G~vn----~~lv~~L~~~g~~av-------glsg~dg~li~a~  135 (150)
                      ..+.    ..+...|.+.|..=.       +-..+|++||+.+
T Consensus       173 ~~~~~~~~~~le~~L~~~Ga~~~~~~~~~~~~vv~Dg~LITGQ  215 (231)
T cd03147         173 EILKKRNLESIEDIAERAGANFIRPPGPWDDFTVVDGRIVTGS  215 (231)
T ss_pred             HHCCCCCCCCHHHHHHHCCCEEECCCCCCCCCEEEECCEEECC
T ss_conf             5436656658899999759989736889988689709867688


No 125
>cd01427 HAD_like Haloacid dehalogenase-like hydrolases. The haloacid dehalogenase-like (HAD) superfamily includes L-2-haloacid dehalogenase, epoxide hydrolase, phosphoserine phosphatase, phosphomannomutase, phosphoglycolate phosphatase, P-type ATPase, and many others, all of which use a nucleophilic aspartate in their phosphoryl transfer reaction. All members possess a highly conserved alpha/beta core domain, and many also possess a small cap domain, the fold and function of which is variable. Members of this superfamily are sometimes referred to as belonging to the DDDD superfamily of phosphohydrolases.
Probab=36.93  E-value=33  Score=15.97  Aligned_cols=43  Identities=19%  Similarity=0.333  Sum_probs=31.7

Q ss_pred             HHHHHHHHHHHHHHHCCCCEEEEECC-CHHHHHHHHHCCCCCEE
Q ss_conf             68999999999985189717998457-24578899970898115
Q 537021.9.peg.3   41 DLSKDFVNDIALLKKSNITPVIVHGG-GPQIGAVLEKMGIKSKF   83 (150)
Q Consensus        41 ~l~~~~~~dI~ll~~~G~k~ViVHGg-g~qI~~~l~~~gi~~~~   83 (150)
                      .+.+...+-|..|+..|+++++|-|. ...+...++..++...|
T Consensus        24 ~~~pg~~e~l~~L~~~G~~v~ivTn~~~~~~~~~~~~~~~~~~~   67 (139)
T cd01427          24 ELYPGVKEALKELKEKGIKLALATNKSRREVLELLEELGLDDYF   67 (139)
T ss_pred             EECCCHHHHHHHHHHCCCEEEEEECCCHHHHHHHHHHCCCHHHC
T ss_conf             79858999999999879979999298899999999985997775


No 126
>cd05015 SIS_PGI_1 Phosphoglucose isomerase (PGI) contains two SIS (Sugar ISomerase) domains. This classification is based on the alignment of the first SIS domain. PGI is a multifunctional enzyme which as an intracellular dimer catalyzes the reversible isomerization of glucose 6-phosphate to fructose 6-phosphate. As an extracellular protein, PGI also has functions equivalent to neuroleukin (NLK), autocrine motility factor (AMF), and maturation factor (MF). Evidence suggests that PGI, NLK, AMF, and MF are closely related or identical. NLK is a neurotrophic growth factor that promotes regeneration and survival of neurons. The dimeric form of NLK has isomerase function, whereas its monomeric form carries out neurotrophic activity. AMF is a cytokine that stimulates cell migration and metastasis. MF mediates the differentiation of human myeloid leukemic HL-60 cells to terminal monocytic cells.
Probab=36.10  E-value=36  Score=15.67  Aligned_cols=111  Identities=13%  Similarity=0.168  Sum_probs=63.5

Q ss_pred             HHHHHHHHHHHCC-----CEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHH-HHHHHHHCCCCCEEE--
Q ss_conf             8999999998759-----88999867567247468999999999985189717998457245-788999708981155--
Q 537021.9.peg.3   13 LEQVLPFVQFYEN-----ETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQ-IGAVLEKMGIKSKFE--   84 (150)
Q Consensus        13 l~ea~pYi~~~r~-----ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~q-I~~~l~~~gi~~~~v--   84 (150)
                      +.+...+.++.|.     .+++|-+|||.+-...+. ..+   .-....+.++-.+-...|. +++.+++++.+....  
T Consensus         4 ~~~i~~f~~~i~~g~~~~~iv~iGiGGS~LG~k~l~-~al---~~~~~~~~~i~f~dn~Dp~~~~~~l~~l~~~~Tlfiv   79 (158)
T cd05015           4 LERIKEFAEKVRSGKKITDVVVIGIGGSDLGPRAVY-EAL---KPYFKGGLRLHFVSNVDPDDLAELLKKLDPETTLFIV   79 (158)
T ss_pred             HHHHHHHHHHHHCCCCCCEEEEECCCHHHHHHHHHH-HHH---HHHCCCCCEEEEECCCCHHHHHHHHHHCCCHHEEEEE
T ss_conf             999999999997599888699981433389999999-997---6245489569995799989999999727930149999


Q ss_pred             CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEEECCE
Q ss_conf             78346612489999999741668999999982899768778655898
Q 537021.9.peg.3   85 NGLRITDQQTAEVVEMVLAGSINKKIVSLINQTGTQAIGICGKDGNM  131 (150)
Q Consensus        85 ~G~RVTd~~tl~iv~~vL~G~vn~~lv~~L~~~g~~avglsg~dg~l  131 (150)
                      -----|+.|||.....+. ..........+   +-+-+.++..+.++
T Consensus        80 iSKSg~T~ETl~~~~~~~-~~l~~~~~~~~---~~~~vaIT~~~s~l  122 (158)
T cd05015          80 ISKSGTTLETLANARLAR-EWLEEAGGDDL---AKHFVAITDNGSGL  122 (158)
T ss_pred             ECCCCCCHHHHHHHHHHH-HHHHHHCCHHH---HHHEEEEECCCHHH
T ss_conf             759998689999999999-99998320355---66078980596699


No 127
>PRK11440 hypothetical protein; Provisional
Probab=35.62  E-value=17  Score=17.81  Aligned_cols=93  Identities=17%  Similarity=0.249  Sum_probs=53.3

Q ss_pred             CCHHHHHHHHHHHHHHHHCCCCEEEEE-CCCHHHHHHHHHC--------CCCC---EEECCCCCCCHHHHHHHHHHHHHH
Q ss_conf             474689999999999851897179984-5724578899970--------8981---155783466124899999997416
Q 537021.9.peg.3   38 NCTDLSKDFVNDIALLKKSNITPVIVH-GGGPQIGAVLEKM--------GIKS---KFENGLRITDQQTAEVVEMVLAGS  105 (150)
Q Consensus        38 ~d~~l~~~~~~dI~ll~~~G~k~ViVH-Ggg~qI~~~l~~~--------gi~~---~~v~G~RVTd~~tl~iv~~vL~G~  105 (150)
                      ..++...+..+=+...+..|+.+|.|+ +..|.-...+.+.        .++.   ++...+...+.+ .-+.|.-.++.
T Consensus        29 ~~~~~i~~~~~L~~~aR~~g~pVi~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~d-~vv~K~~~saF  107 (188)
T PRK11440         29 TADEVVARAARLAAKFRASGSPVVLVRVGWSADYAEALKQPVDAPSPAKALPENWWQHPAALGKTDSD-IEVTKRQWGAF  107 (188)
T ss_pred             CHHHHHHHHHHHHHHHHHCCCEEEEEEEECCCCCHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC-EEEECCCCCCC
T ss_conf             79999999999999999879909999863178710100286668886556787311356032789999-89867845877


Q ss_pred             HHHHHHHHHHHCCCCEEEEEEEECCE
Q ss_conf             68999999982899768778655898
Q 537021.9.peg.3  106 INKKIVSLINQTGTQAIGICGKDGNM  131 (150)
Q Consensus       106 vn~~lv~~L~~~g~~avglsg~dg~l  131 (150)
                      .+++|...|.+.|+..+=++|...|.
T Consensus       108 ~~T~L~~~Lr~~gi~~lvi~G~~T~~  133 (188)
T PRK11440        108 YGTDLELQLRRRGIDTIVLCGISTNI  133 (188)
T ss_pred             CCCCHHHHHHHCCCCEEEEEEECCCH
T ss_conf             78988999987799989993221165


No 128
>TIGR03674 fen_arch flap structure-specific endonuclease. Endonuclease that cleaves the 5'-overhanging flap structure that is generated by displacement synthesis when DNA polymerase encounters the 5'-end of a downstream Okazaki fragment. Has 5'-endo-/exonuclease and 5'-pseudo-Y-endonuclease activities. Cleaves the junction between single and double-stranded regions of flap DNA
Probab=35.55  E-value=37  Score=15.61  Aligned_cols=28  Identities=43%  Similarity=0.627  Sum_probs=19.8

Q ss_pred             HHHHHHHHHHHHHHHCCCCEEEEECCCHH
Q ss_conf             68999999999985189717998457245
Q 537021.9.peg.3   41 DLSKDFVNDIALLKKSNITPVIVHGGGPQ   69 (150)
Q Consensus        41 ~l~~~~~~dI~ll~~~G~k~ViVHGgg~q   69 (150)
                      .+..-|.+-+.++. .|++||.|+-|.|-
T Consensus        57 hl~g~f~R~~~Ll~-~gI~PIfVFDGkpp   84 (338)
T TIGR03674        57 HLSGLFYRTINLLE-NGIKPVYVFDGKPP   84 (338)
T ss_pred             HHHHHHHHHHHHHH-CCCEEEEEECCCCC
T ss_conf             99999999999997-79989999799985


No 129
>KOG0339 consensus
Probab=34.81  E-value=38  Score=15.54  Aligned_cols=73  Identities=18%  Similarity=0.138  Sum_probs=45.2

Q ss_pred             HHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHHHHHHHH
Q ss_conf             98759889998675672474689999999999851897179984572457889997089811557834661248999999
Q 537021.9.peg.3   21 QFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQTAEVVEM  100 (150)
Q Consensus        21 ~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~tl~iv~~  100 (150)
                      +.=.|-+-||.++---+.    .+-|.+.=++-+..|+++|-|||||+--.+. +.+...++++=   -|+...+++|+|
T Consensus       292 ~~g~gPi~vilvPTrela----~Qi~~eaKkf~K~ygl~~v~~ygGgsk~eQ~-k~Lk~g~EivV---aTPgRlid~Vkm  363 (731)
T KOG0339         292 KPGEGPIGVILVPTRELA----SQIFSEAKKFGKAYGLRVVAVYGGGSKWEQS-KELKEGAEIVV---ATPGRLIDMVKM  363 (731)
T ss_pred             CCCCCCEEEEEECCHHHH----HHHHHHHHHHHHHCCCEEEEEECCCCHHHHH-HHHHCCCEEEE---ECHHHHHHHHHH
T ss_conf             689997699980638999----9999999986311264278863687488877-76502772899---662888999886


Q ss_pred             H
Q ss_conf             9
Q 537021.9.peg.3  101 V  101 (150)
Q Consensus       101 v  101 (150)
                      -
T Consensus       364 K  364 (731)
T KOG0339         364 K  364 (731)
T ss_pred             H
T ss_conf             0


No 130
>PRK00155 ispD 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase; Reviewed
Probab=34.57  E-value=38  Score=15.51  Aligned_cols=14  Identities=29%  Similarity=0.420  Sum_probs=6.3

Q ss_pred             CEEEEE-CCCHHHHH
Q ss_conf             179984-57245788
Q 537021.9.peg.3   59 TPVIVH-GGGPQIGA   72 (150)
Q Consensus        59 k~ViVH-Ggg~qI~~   72 (150)
                      ..|+|| +++|.+++
T Consensus        98 ~~VlIHDaARP~is~  112 (228)
T PRK00155         98 DWVLVHDAARPFLTP  112 (228)
T ss_pred             CEEEEECCCCCCCCH
T ss_conf             979997066768999


No 131
>TIGR01508 rib_reduct_arch diaminohydroxyphosphoribosylaminopyrimidine reductase; InterPro: IPR006401   These sequences represent a specific reductase of riboflavin biosynthesis in the archaea, diaminohydroxyphosphoribosylaminopyrimidine reductase. It should not be confused with bacterial 5-amino-6-(5-phosphoribosylamino)uracil reductase. The intermediate 2,5-diamino-6-hydroxy-4-(5-phosphoribosylamino)pyrimidine in riboflavin biosynthesis is reduced first and then deaminated in both archaea and fungi, opposite to the order in bacteria. The subsequent deaminase is not presently known and is not closely homologous to the deaminase domain fused to the reductase domain, which is similar to this protein but found in most bacteria. .
Probab=34.35  E-value=39  Score=15.49  Aligned_cols=63  Identities=25%  Similarity=0.411  Sum_probs=42.8

Q ss_pred             HHHHHHCC-CEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHC-------CCCCEEECCC
Q ss_conf             99998759-8899986756724746899999999998518971799845724578899970-------8981155783
Q 537021.9.peg.3   18 PFVQFYEN-ETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKM-------GIKSKFENGL   87 (150)
Q Consensus        18 pYi~~~r~-ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~-------gi~~~~v~G~   87 (150)
                      .-++.+.. -..|.|+|-.-   -+ +..+++   .|+.-|++=++|=|||.-|-.+.++.       =+-|++..|+
T Consensus       116 kk~~~l~~~gv~V~~~G~~~---v~-L~~lld---~L~~kgv~rLmvEGGG~Li~~l~~~~LvDEi~vy~aP~~~GGr  186 (224)
T TIGR01508       116 KKVEELEKKGVEVVKFGEER---VD-LEKLLD---ILEDKGVRRLMVEGGGTLIWSLIKENLVDEIRVYIAPKIIGGR  186 (224)
T ss_pred             HHHHHHHHCCCEEEEECCCC---CC-HHHHHH---HHHHCCCEEEEEECCCEEEHHHHHCCCEEEEEEEECCEEECCC
T ss_conf             78887774974899924785---34-689999---9874797099982485101022302843589998814676277


No 132
>PRK05301 pyrroloquinoline quinone biosynthesis protein PqqE; Provisional
Probab=33.99  E-value=39  Score=15.45  Aligned_cols=67  Identities=12%  Similarity=0.071  Sum_probs=33.4

Q ss_pred             HHHHHHHHHHCCCEEEEEECHH-H-H----CC-HHHHHHHHHHHHHHHHCCCCEEEEEC-------CCHHHHHHHHHCCC
Q ss_conf             9999999987598899986756-7-2----47-46899999999998518971799845-------72457889997089
Q 537021.9.peg.3   14 EQVLPFVQFYENETIVVKYGGH-V-M----NC-TDLSKDFVNDIALLKKSNITPVIVHG-------GGPQIGAVLEKMGI   79 (150)
Q Consensus        14 ~ea~pYi~~~r~ktiVIK~GG~-~-l----~d-~~l~~~~~~dI~ll~~~G~k~ViVHG-------gg~qI~~~l~~~gi   79 (150)
                      .+.+..++.+.=..+-|-+.|. . .    .. +..|+..++-|..+...|+++.+-.=       -=+.|-++++++|+
T Consensus       105 ~~~a~~L~~~gl~~v~vSlDg~~~e~hD~~rG~~G~f~~~~~~i~~l~~~Gi~v~i~~ti~r~N~~~l~~i~~la~~lGv  184 (375)
T PRK05301        105 EARLAALKAAGLDHIQLSFQDSDPELADRIAGTRGAFAQKLEVARLVKAHGYPLTLNAVIHRHNIDQIPRIIELAVELGA  184 (375)
T ss_pred             HHHHHHHHHHCCCEEEEECCCCCHHHHHHHCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECCCCHHHHHHHHHHHHHCCC
T ss_conf             99999998509988999567798778777637886299999999999974981699987230568889999999997299


Q ss_pred             C
Q ss_conf             8
Q 537021.9.peg.3   80 K   80 (150)
Q Consensus        80 ~   80 (150)
                      +
T Consensus       185 ~  185 (375)
T PRK05301        185 D  185 (375)
T ss_pred             C
T ss_conf             8


No 133
>TIGR01245 trpD anthranilate phosphoribosyltransferase; InterPro: IPR005940    In many widely different species, including Escherichia coli, Thermotoga maritima, and Archaeoglobus fulgidus, this enzyme is a C-terminal part of a multifunctional protein together with glutamine amidotransferase; the fusion protein is designated anthranilate synthase component II (4.1.3.27 from EC).; GO: 0004048 anthranilate phosphoribosyltransferase activity, 0000162 tryptophan biosynthetic process.
Probab=33.91  E-value=39  Score=15.45  Aligned_cols=63  Identities=27%  Similarity=0.429  Sum_probs=38.9

Q ss_pred             CHHHHHHHHHHHHHHHHCCCC-EEEEEC-CCHHHHHH----------------------HHHCCCCCEEECCCCCCCH-H
Q ss_conf             746899999999998518971-799845-72457889----------------------9970898115578346612-4
Q 537021.9.peg.3   39 CTDLSKDFVNDIALLKKSNIT-PVIVHG-GGPQIGAV----------------------LEKMGIKSKFENGLRITDQ-Q   93 (150)
Q Consensus        39 d~~l~~~~~~dI~ll~~~G~k-~ViVHG-gg~qI~~~----------------------l~~~gi~~~~v~G~RVTd~-~   93 (150)
                      |+++..-+++   .|+.+|.+ ..+||| +|  +|+.                      =+..|+++...+-+|+-|+ |
T Consensus       192 ~~~L~~~~A~---aL~~LG~krAlVVhG~~G--lDE~s~~g~t~v~el~~g~I~~~~~~PedFG~~~~~l~~l~g~~p~E  266 (331)
T TIGR01245       192 DPDLVEVMAE---ALKNLGVKRALVVHGEDG--LDEISLTGPTKVAELKDGEIREYTLDPEDFGLKRAPLEELRGGSPEE  266 (331)
T ss_pred             CHHHHHHHHH---HHHHHCCCEEEEEECCCC--CCEEECCCCCEEEEEECCCEEEEEECCHHCCCCCCCCHHCCCCCHHH
T ss_conf             6888999999---998617670899857889--61453167716998748931466641110479755301104689899


Q ss_pred             HHHHHHHHHHHHH
Q ss_conf             8999999974166
Q 537021.9.peg.3   94 TAEVVEMVLAGSI  106 (150)
Q Consensus        94 tl~iv~~vL~G~v  106 (150)
                      -.+++..+|.|+-
T Consensus       267 na~~~~~~l~G~~  279 (331)
T TIGR01245       267 NAEILREILEGKG  279 (331)
T ss_pred             HHHHHHHHHCCCC
T ss_conf             9999999971888


No 134
>PRK05991 precorrin-3B C17-methyltransferase; Provisional
Probab=33.91  E-value=39  Score=15.45  Aligned_cols=71  Identities=13%  Similarity=0.123  Sum_probs=39.8

Q ss_pred             CCCHHHHHHHHHHHH---CCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHH-------HHHHHC-
Q ss_conf             062289999999987---598899986756724746899999999998518971799845724578-------899970-
Q 537021.9.peg.3    9 QAEILEQVLPFVQFY---ENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIG-------AVLEKM-   77 (150)
Q Consensus         9 ~a~~l~ea~pYi~~~---r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~-------~~l~~~-   77 (150)
                      .|.++....||+...   .++..   ++.. +.++.   .-++....+...|.++++||||-|.|=       +.+++. 
T Consensus        28 ~Advvvgy~~yld~l~~~~~~~~---~~~~-~~~e~---~r~~~al~~A~~Gk~Va~l~sGDP~IyGmag~~~E~l~~~~  100 (254)
T PRK05991         28 AATDFFGYGPYLDRLQLRPDQLR---HASD-NREEL---DRAGAALAMAAAGANVCVVSGGDPGVFAMAAAVCEAIENGP  100 (254)
T ss_pred             CCCEEEECHHHHHHHHCCCCCEE---ECCC-CHHHH---HHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHHHCC
T ss_conf             29999965266665513899579---7588-87588---99999999998799579986788046653499999998350


Q ss_pred             ----CCCCEEECC
Q ss_conf             ----898115578
Q 537021.9.peg.3   78 ----GIKSKFENG   86 (150)
Q Consensus        78 ----gi~~~~v~G   86 (150)
                          +++.+.+-|
T Consensus       101 ~~~~~i~~eVVPG  113 (254)
T PRK05991        101 AAWRAVDLTVVPG  113 (254)
T ss_pred             CCCCCCCEEEECC
T ss_conf             0027976899588


No 135
>PRK00339 minC septum formation inhibitor; Reviewed
Probab=33.27  E-value=40  Score=15.38  Aligned_cols=61  Identities=8%  Similarity=0.090  Sum_probs=45.9

Q ss_pred             HHHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCE
Q ss_conf             998759889998675672474689999999999851897179984572457889997089811
Q 537021.9.peg.3   20 VQFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSK   82 (150)
Q Consensus        20 i~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~   82 (150)
                      =..|++.-+||-+..-  .+.....+|..=+.+|+..|+.+|=|.|+.++..+.....|+..-
T Consensus        47 P~FF~~~PvVldl~~l--~~~~~~~dl~~L~~~l~~~gl~~vGv~g~~~~~~~~a~~~gL~il  107 (249)
T PRK00339         47 PNFFSNTPLVLALDKL--PEGEGELDLPGLMRICRRHGLRTLAIRASRIEDIAAAIAADLPVL  107 (249)
T ss_pred             HHHHCCCCEEEECHHC--CCCCCCCCHHHHHHHHHHCCCEEEEEECCCHHHHHHHHHCCCCCC
T ss_conf             5765698599983342--676554579999999998899899996699899999997799704


No 136
>PRK13639 cbiO cobalt transporter ATP-binding subunit; Provisional
Probab=33.19  E-value=40  Score=15.37  Aligned_cols=13  Identities=15%  Similarity=0.434  Sum_probs=4.4

Q ss_pred             HHHHHHHHHHCCC
Q ss_conf             9999999851897
Q 537021.9.peg.3   46 FVNDIALLKKSNI   58 (150)
Q Consensus        46 ~~~dI~ll~~~G~   58 (150)
                      +++|--.|+..|+
T Consensus       224 vf~~~~~l~~~~l  236 (275)
T PRK13639        224 IFSDSETIRSANL  236 (275)
T ss_pred             HHCCHHHHHHCCC
T ss_conf             8749999997799


No 137
>TIGR02090 LEU1_arch isopropylmalate/citramalate/homocitrate synthases; InterPro: IPR011830   Methanogenic archaea contain three closely related homologs of the 2-isopropylmalate synthases (LeuA) represented by IPR005671 from INTERPRO. Two of these in Methanococcus janaschii (MJ1392 - CimA ; MJ0503 - AksA ) have been characterised as catalyzing alternative reactions leaving the third (MJ1195) as the presumptive LeuA enzyme. CimA is citramalate (2-methylmalate) synthase, which condenses acetyl-CoA with pyruvate. This enzyme is believed to be involved in the biosynthesis of isoleucine in methanogens and possibly other species lacking threonine dehydratase. AksA is a homocitrate synthase which also produces (homo)2-citrate and (homo)3-citrate in the biosynthesis of Coenzyme B which is restricted solely to methanogenic archaea. Methanogens, then should and apparently do contain all three of these enzymes. Unfortunately, phylogenetic trees do not resolve into three unambiguous clades, making assignment of function to particular genes problematic. Other archaea, which lack a threonine dehydratase (mainly Euryarchaeota), should contain both CimA and LeuA genes. This is true for archaeoglobus fulgidis, but not for the Pyrococci which have none in this clade, but one in IPR005671 from INTERPRO and one in IPR005675 from INTERPRO which may fulfil these roles. Proteins from other species, which have only one hit to this entry and lack threonine dehydratase, are very likely to be LeuA enzymes.; GO: 0046912 transferase activity transferring acyl groups acyl groups converted into alkyl on transfer, 0019752 carboxylic acid metabolic process.
Probab=32.60  E-value=17  Score=17.82  Aligned_cols=14  Identities=36%  Similarity=0.639  Sum_probs=4.8

Q ss_pred             CCHHHHHHHHHCCC
Q ss_conf             72457889997089
Q 537021.9.peg.3   66 GGPQIGAVLEKMGI   79 (150)
Q Consensus        66 gg~qI~~~l~~~gi   79 (150)
                      |...+...|+.+||
T Consensus       321 G~~a~~~kL~e~GI  334 (371)
T TIGR02090       321 GRHAVKAKLKELGI  334 (371)
T ss_pred             HHHHHHHHHHHCCC
T ss_conf             79999999987392


No 138
>PRK06136 uroporphyrin-III C-methyltransferase; Reviewed
Probab=32.34  E-value=42  Score=15.28  Aligned_cols=92  Identities=15%  Similarity=0.117  Sum_probs=54.7

Q ss_pred             CCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCE---------EECCCCCCCHH-
Q ss_conf             59889998675672474689999999999851897179984572457889997089811---------55783466124-
Q 537021.9.peg.3   24 ENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSK---------FENGLRITDQQ-   93 (150)
Q Consensus        24 r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~---------~v~G~RVTd~~-   93 (150)
                      +|+.+|.--+|    ||-++....+.+..+...|+.+-+|.|-. .........|++..         |+.|...+++. 
T Consensus        79 ~G~~VvrL~~G----DP~ifg~~~eei~~l~~~gi~~evvPGVs-a~~Aaaa~~GiPLT~r~~s~sv~~it~~~~~~~~~  153 (253)
T PRK06136         79 KGKVVVRLKGG----DPFVFGRGGEELEALEAAGIPVEVVPGIT-AAIAAAAAAGIPLTHRGVARSVTFVTGHEAAGKLE  153 (253)
T ss_pred             CCCEEEEEECC----CCCCCCCHHHHHHHHHHCCCCEEEECCHH-HHHHHHHHCCCCCCCCCCCCEEEEEECCCCCCCCC
T ss_conf             79979999589----98766405999999997899679956805-99999998398852489741699982677799875


Q ss_pred             -HHHHHHHH---------HHHHHHHHHHHHHHHCCCC
Q ss_conf             -89999999---------7416689999999828997
Q 537021.9.peg.3   94 -TAEVVEMV---------LAGSINKKIVSLINQTGTQ  120 (150)
Q Consensus        94 -tl~iv~~v---------L~G~vn~~lv~~L~~~g~~  120 (150)
                       ..+....+         ++..--..++..|..+|..
T Consensus       154 ~~~~~~~la~~~~TlViym~~~~l~~i~~~L~~~G~~  190 (253)
T PRK06136        154 PEVNWAALADGADTLVIYMGVRNLPYIAEQLIAAGRS  190 (253)
T ss_pred             CHHHHHHHHCCCCEEEEECCHHHHHHHHHHHHHCCCC
T ss_conf             1455998736697599986541299999999975999


No 139
>PRK09084 aspartate kinase III; Validated
Probab=32.27  E-value=42  Score=15.28  Aligned_cols=37  Identities=24%  Similarity=0.341  Sum_probs=23.5

Q ss_pred             EEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECC
Q ss_conf             8999867567247468999999999985189717998457
Q 537021.9.peg.3   27 TIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGG   66 (150)
Q Consensus        27 tiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGg   66 (150)
                      .+|.||||+-+.+.+.++++++ |.. .. ..++|+|--+
T Consensus         1 mvV~KFGGtSv~~~~~i~~v~~-ii~-~~-~~~~vVVVSA   37 (447)
T PRK09084          1 LVVAKFGGTSVADFDAMNRSAD-IVL-SN-PNTRLVVLSA   37 (447)
T ss_pred             CEEEEECCCCCCCHHHHHHHHH-HHH-CC-CCCEEEEEEC
T ss_conf             9899478556798999999999-997-27-9982999918


No 140
>KOG4701 consensus
Probab=32.18  E-value=42  Score=15.27  Aligned_cols=51  Identities=20%  Similarity=0.338  Sum_probs=38.2

Q ss_pred             HHHHHHHC-CCEEEEEECHHHHCCHHH-----HHHHHHHHHHHHHCCCCEEEEECCCH
Q ss_conf             99999875-988999867567247468-----99999999998518971799845724
Q 537021.9.peg.3   17 LPFVQFYE-NETIVVKYGGHVMNCTDL-----SKDFVNDIALLKKSNITPVIVHGGGP   68 (150)
Q Consensus        17 ~pYi~~~r-~ktiVIK~GG~~l~d~~l-----~~~~~~dI~ll~~~G~k~ViVHGgg~   68 (150)
                      +.|+.+|. |.|-+..+-|.. .|...     -..|.+||...+..|+|+.|-.||+-
T Consensus        58 lsFL~~F~~~~Tp~LNfAn~C-sd~~~~~l~~CTqi~~di~~CQS~GiKVlLSLGG~~  114 (568)
T KOG4701          58 LSFLIDFNVDGTPVLNFANLC-SDSDTFSLKKCTQIETDIQVCQSNGIKVLLSLGGYN  114 (568)
T ss_pred             HHHHHHCCCCCCCCEEHHCCC-CCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCCC
T ss_conf             324664478897421000126-852234202222256478888736808998516755


No 141
>PRK10517 magnesium-transporting ATPase MgtA; Provisional
Probab=31.12  E-value=44  Score=15.15  Aligned_cols=89  Identities=22%  Similarity=0.360  Sum_probs=56.3

Q ss_pred             HCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHH-HHCCCCCEE-ECCC---CCCCHHHHHHHHHH--HH---HHH
Q ss_conf             24746899999999998518971799845724578899-970898115-5783---46612489999999--74---166
Q 537021.9.peg.3   37 MNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVL-EKMGIKSKF-ENGL---RITDQQTAEVVEMV--LA---GSI  106 (150)
Q Consensus        37 l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l-~~~gi~~~~-v~G~---RVTd~~tl~iv~~v--L~---G~v  106 (150)
                      +.||- ++...+.|..++..|++++++-|-.+..-+.. ++.||+... +.|-   ..+|++--+.++..  ++   =+=
T Consensus       545 ~~DPp-R~e~~~aI~~l~~aGI~V~MITGD~~~TA~aIA~~lGI~~~~v~tG~el~~lsd~el~~~~~~~~VfAR~sPe~  623 (900)
T PRK10517        545 FLDPP-KETTAPALKALKASGITVKILTGDSELVAAKVCHEVGLDAGEVVIGSDIETLSDDELANLAERTTLFARLTPMH  623 (900)
T ss_pred             ECCCC-CCHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHHCCCCCCCCCCHHHHHCCHHHHHHHHHHCCEEEEECHHH
T ss_conf             30789-71099999999977993899899998999999998199954440225344099999999986477897518999


Q ss_pred             HHHHHHHHHHCCCCEEEEEEE
Q ss_conf             899999998289976877865
Q 537021.9.peg.3  107 NKKIVSLINQTGTQAIGICGK  127 (150)
Q Consensus       107 n~~lv~~L~~~g~~avglsg~  127 (150)
                      ...|+..|++.| +-++++|-
T Consensus       624 K~rIV~aLq~~G-~vVAmtGD  643 (900)
T PRK10517        624 KERIVTLLKREG-HVVGFMGD  643 (900)
T ss_pred             HHHHHHHHHHCC-CEEEEECC
T ss_conf             999999999779-97999738


No 142
>COG0547 TrpD Anthranilate phosphoribosyltransferase [Amino acid transport and metabolism]
Probab=30.88  E-value=44  Score=15.13  Aligned_cols=65  Identities=22%  Similarity=0.276  Sum_probs=32.4

Q ss_pred             CCHHHHHHHHHHHHHHHHCC-CCEEEEECC--CHHHHH-------------------HHHHCCCCCEEECCCCCC-CHHH
Q ss_conf             47468999999999985189-717998457--245788-------------------999708981155783466-1248
Q 537021.9.peg.3   38 NCTDLSKDFVNDIALLKKSN-ITPVIVHGG--GPQIGA-------------------VLEKMGIKSKFENGLRIT-DQQT   94 (150)
Q Consensus        38 ~d~~l~~~~~~dI~ll~~~G-~k~ViVHGg--g~qI~~-------------------~l~~~gi~~~~v~G~RVT-d~~t   94 (150)
                      .+|++.+.+++   .|+.+| .+.++|||.  -.+++-                   .-+..|+++...+.++.. .++.
T Consensus       197 ~~p~~~~~~A~---~l~~LG~~ralvV~G~~GlDE~~~~~~t~v~~l~~g~i~~~~l~pe~~Gl~~~~~~~l~~~~~~en  273 (338)
T COG0547         197 YHPELVELLAE---ALRLLGVERALVVHGLEGLDEVTPTGTTLVAELKDGEIREYTLTPEDFGLERAPLEDLPGGDPEEN  273 (338)
T ss_pred             ECHHHHHHHHH---HHHHHCCCEEEEEECCCCCCCCCCCCCCEEEEECCCCEEEEEECHHHCCCCCCCHHHCCCCCHHHH
T ss_conf             57888999999---999828661899977899631347887448997589568999677762898785432578999999


Q ss_pred             HHHHHHHHHHH
Q ss_conf             99999997416
Q 537021.9.peg.3   95 AEVVEMVLAGS  105 (150)
Q Consensus        95 l~iv~~vL~G~  105 (150)
                      .++.+.+|.|+
T Consensus       274 a~~~~~vL~G~  284 (338)
T COG0547         274 AEILRAVLAGE  284 (338)
T ss_pred             HHHHHHHHCCC
T ss_conf             99999997799


No 143
>cd01987 USP_OKCHK USP domain is located between the N-terminal sensor domain and C-terminal catalytic domain of this Osmosensitive K+ channel histidine kinase family. The family of KdpD sensor kinase proteins regulates the kdpFABC operon responsible for potassium transport. The USP domain is homologous to the universal stress protein Usp Usp is a small cytoplasmic bacterial protein whose expression is enhanced when the cell is exposed to stress agents. Usp enhances the rate of cell survival during prolonged exposure to such conditions, and may provide a general "stress endurance" activity.
Probab=30.85  E-value=44  Score=15.13  Aligned_cols=103  Identities=9%  Similarity=0.059  Sum_probs=66.2

Q ss_pred             HHHCCCHHHHHHHHHHHHCCCEEEEEECHH--HHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCH--HHHHHHHHCCCCC
Q ss_conf             322062289999999987598899986756--724746899999999998518971799845724--5788999708981
Q 537021.9.peg.3    6 YQFQAEILEQVLPFVQFYENETIVVKYGGH--VMNCTDLSKDFVNDIALLKKSNITPVIVHGGGP--QIGAVLEKMGIKS   81 (150)
Q Consensus         6 ~~~~a~~l~ea~pYi~~~r~ktiVIK~GG~--~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~--qI~~~l~~~gi~~   81 (150)
                      .+.-...+|.+..--++++.+.+|+.+--.  .-.++.....+-+...+..++|.+++.+.|+.+  +|-+..++.++.-
T Consensus         9 s~~s~~lir~a~rlA~~~~a~l~vl~V~~~~~~~~~~~~~~~l~~~~~la~~lga~~~~~~~~d~~~~I~~~A~~~~~t~   88 (124)
T cd01987           9 GPNAERLIRRAARLADRLKAPWYVVYVETPRLNRLSEAERRRLAEALRLAEELGAEVVTLPGDDVAEAIVEFAREHNVTQ   88 (124)
T ss_pred             CHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCHHHHHHHHHHHCCCCE
T ss_conf             85079999999999996499899999955975658979999999999999985998999947998999999999849989


Q ss_pred             EEECCCCCCCHHHHHHHHHHHHHHHHHHHHHHH
Q ss_conf             155783466124899999997416689999999
Q 537021.9.peg.3   82 KFENGLRITDQQTAEVVEMVLAGSINKKIVSLI  114 (150)
Q Consensus        82 ~~v~G~RVTd~~tl~iv~~vL~G~vn~~lv~~L  114 (150)
                       .+=|.+--+.  +   +..+.|.+-..+....
T Consensus        89 -IVlG~~~~~~--~---~~~~~gSv~~~vl~~a  115 (124)
T cd01987          89 -IVVGKSRRSR--W---RELFRGSLVDRLLRRA  115 (124)
T ss_pred             -EEECCCCCCH--H---HHHHCCCHHHHHHHHC
T ss_conf             -9976898854--6---8872388999999749


No 144
>PRK09436 thrA bifunctional aspartokinase I/homeserine dehydrogenase I; Provisional
Probab=30.09  E-value=45  Score=15.04  Aligned_cols=47  Identities=15%  Similarity=0.191  Sum_probs=27.3

Q ss_pred             EEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHH
Q ss_conf             999867567247468999999999985189717998457245788999
Q 537021.9.peg.3   28 IVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLE   75 (150)
Q Consensus        28 iVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~   75 (150)
                      -|.||||+-+.+++.++.+++=|..-..-+ ++++|.-+---++..|-
T Consensus         2 kV~KFGGTSva~ae~i~~Va~II~~~~~~~-~vvVVVSA~ggvTd~Ll   48 (817)
T PRK09436          2 KVLKFGGTSLANAERFLRVADIIESNARQE-QVAVVLSAPAKVTNHLV   48 (817)
T ss_pred             EEEEECCCCCCCHHHHHHHHHHHHHHCCCC-CEEEEECCCCCCHHHHH
T ss_conf             788757156789999999999997534389-98999839998689999


No 145
>PRK10976 putative sugar phosphatase; Provisional
Probab=29.88  E-value=46  Score=15.02  Aligned_cols=66  Identities=24%  Similarity=0.381  Sum_probs=43.8

Q ss_pred             CEEEEEECHHHHCCHH-HHHHHHHHHHHHHHCCCCEEEEECCCH-HHHHHHHHCCCCCEEE--CCCCCCC
Q ss_conf             8899986756724746-899999999998518971799845724-5788999708981155--7834661
Q 537021.9.peg.3   26 ETIVVKYGGHVMNCTD-LSKDFVNDIALLKKSNITPVIVHGGGP-QIGAVLEKMGIKSKFE--NGLRITD   91 (150)
Q Consensus        26 ktiVIK~GG~~l~d~~-l~~~~~~dI~ll~~~G~k~ViVHGgg~-qI~~~l~~~gi~~~~v--~G~RVTd   91 (150)
                      |.|+.=+.|..+.+.. +.+...+.|..++..|+++++.-|=.+ .+....+.+++...++  ||-.|-|
T Consensus         3 KlIa~DlDGTLl~~~~~is~~~~~ai~~l~~~Gi~~viATGR~~~~~~~~~~~l~~~~~~I~~NGa~i~d   72 (266)
T PRK10976          3 QVVASDLDGTLLSPDHTLSPYAKETLKLLTARGIHFVFATGRHHVDVGQIRDNLEIKSYMITSNGARVHD   72 (266)
T ss_pred             EEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHCCCCCEEEECCCEEEEE
T ss_conf             9999937646558879719999999999997899999997999799999999739974189628779991


No 146
>pfam11869 DUF3389 Protein of unknown function (DUF3389). This family of proteins are functionally uncharacterized. This protein is found in bacteria. Proteins in this family are about 80 amino acids in length.
Probab=29.88  E-value=46  Score=15.02  Aligned_cols=35  Identities=34%  Similarity=0.562  Sum_probs=25.3

Q ss_pred             EEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCH
Q ss_conf             99986756724746899999999998518971799845724
Q 537021.9.peg.3   28 IVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGP   68 (150)
Q Consensus        28 iVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~   68 (150)
                      +||++.++.+.   +.. -.+||.++.  |-++++-+||+.
T Consensus        17 v~vRL~~~~~~---l~a-~~D~I~l~~--~anvi~A~g~~v   51 (75)
T pfam11869        17 VVVRLNGANVV---LLA-AADDISLLG--GANVIIANGGGV   51 (75)
T ss_pred             EEEEECCCCEE---HHH-HHHHHHHCC--CCCEEEECCCCE
T ss_conf             99998696221---898-775665217--873999717621


No 147
>KOG0456 consensus
Probab=29.74  E-value=46  Score=15.01  Aligned_cols=47  Identities=19%  Similarity=0.265  Sum_probs=30.2

Q ss_pred             CCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHH
Q ss_conf             9889998675672474689999999999851897179984572457889
Q 537021.9.peg.3   25 NETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAV   73 (150)
Q Consensus        25 ~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~   73 (150)
                      +-+.|-|+||+-..+-+.  .+.-|+..+..-..+|++|.-+..--++.
T Consensus        78 ~~~~V~KFGGsSV~s~~~--~i~v~~l~~~~~~e~~~vV~SA~sk~Tna  124 (559)
T KOG0456          78 GLTCVMKFGGSSVGSAER--MIEVAVLILYFPDERPVVVLSAMSKTTNA  124 (559)
T ss_pred             CEEEEEECCCCCCCCCCH--HHHHHHHHHHCCCCCEEEEEECCCCCHHH
T ss_conf             418999428841133400--13356788746877738999713441254


No 148
>TIGR01369 CPSaseII_lrg carbamoyl-phosphate synthase, large subunit; InterPro: IPR006275   Carbamoyl phosphate synthase (CPSase) is a heterodimeric enzyme composed of a small and a large subunit (with the exception of CPSase III, see below). CPSase catalyses the synthesis of carbamoyl phosphate from biocarbonate, ATP and glutamine (6.3.5.5 from EC) or ammonia (6.3.4.16 from EC), and represents the first committed step in pyrimidine and arginine biosynthesis in prokaryotes and eukaryotes, and in the urea cycle in most terrestrial vertebrates , . CPSase has three active sites, one in the small subunit and two in the large subunit. The small subunit contains the glutamine binding site and catalyses the hydrolysis of glutamine to glutamate and ammonia. The large subunit has two homologous carboxy phosphate domains, both of which have ATP-binding sites; however, the N-terminal carboxy phosphate domain catalyses the phosphorylation of biocarbonate, while the C-terminal domain catalyses the phosphorylation of the carbamate intermediate . The carboxy phosphate domain found duplicated in the large subunit of CPSase is also present as a single copy in the biotin-dependent enzymes acetyl-CoA carboxylase (6.4.1.2 from EC) (ACC), propionyl-CoA carboxylase (6.4.1.3 from EC) (PCCase), pyruvate carboxylase (6.4.1.1 from EC) (PC) and urea carboxylase (6.3.4.6 from EC).   Most prokaryotes carry one form of CPSase that participates in both arginine and pyrimidine biosynthesis, however certain bacteria can have separate forms. The large subunit in bacterial CPSase has four structural domains: the carboxy phosphate domain 1, the oligomerisation domain, the carbamoyl phosphate domain 2 and the allosteric domain . CPSase heterodimers from Escherichia coli contain two molecular tunnels: an ammonia tunnel and a carbamate tunnel. These inter-domain tunnels connect the three distinct active sites, and function as conduits for the transport of unstable reaction intermediates (ammonia and carbamate) between successive active sites . The catalytic mechanism of CPSase involves the diffusion of carbamate through the interior of the enzyme from the site of synthesis within the N-terminal domain of the large subunit to the site of phosphorylation within the C-terminal domain.   Eukaryotes have two distinct forms of CPSase: a mitochondrial enzyme (CPSase I) that participates in both arginine biosynthesis and the urea cycle; and a cytosolic enzyme (CPSase II) involved in pyrimidine biosynthesis. CPSase II occurs as part of a multi-enzyme complex along with aspartate transcarbamoylase and dihydroorotase; this complex is referred to as the CAD protein . The hepatic expression of CPSase is transcriptionally regulated by glucocorticoids and/or cAMP . There is a third form of the enzyme, CPSase III, found in fish, which uses glutamine as a nitrogen source instead of ammonia . CPSase III is closely related to CPSase I, and is composed of a single polypeptide that may have arisen from gene fusion of the glutaminase and synthetase domains .    This entry represents glutamine-dependent CPSase (6.3.5.5 from EC) from prokaryotes and eukaryotes (CPSase II). ; GO: 0004086 carbamoyl-phosphate synthase activity, 0006807 nitrogen compound metabolic process.
Probab=29.61  E-value=46  Score=14.99  Aligned_cols=63  Identities=17%  Similarity=0.188  Sum_probs=40.6

Q ss_pred             HHHHHCCCEEEEEECHHHHC---CHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECC
Q ss_conf             99987598899986756724---746899999999998518971799845724578899970898115578
Q 537021.9.peg.3   19 FVQFYENETIVVKYGGHVMN---CTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENG   86 (150)
Q Consensus        19 Yi~~~r~ktiVIK~GG~~l~---d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G   86 (150)
                      |.+.+...-++++..|+++.   |.+ .+.+...+..|...|.+++=-+|-..+    |.+.||+++.+.-
T Consensus       948 ~~Ka~~~~~~~~p~kg~~~~~~~d~d-k~~~~~~a~~L~~~G~~~yat~gT~~~----L~e~gi~~~~v~~ 1013 (1089)
T TIGR01369       948 FLKAMLSSGNRIPKKGSVLLSVRDKD-KEELLDLAKKLAEKGYKIYATEGTAKF----LEEAGIKAEVVLK 1013 (1089)
T ss_pred             HHHHHHHCCCCCCCCCEEEEEECCCC-HHHHHHHHHHHHHCCCEEEEECCHHHH----HHHCCCCHHHEEE
T ss_conf             99999981870578877999803443-457999999998637378850202676----7651876332000


No 149
>pfam01872 RibD_C RibD C-terminal domain. The function of this domain is not known, but it is thought to be involved in riboflavin biosynthesis. This domain is found in the C terminus of RibD/RibG, in combination with pfam00383, as well as in isolation in some archaebacterial proteins. This family appears to be related to pfam00186.
Probab=29.26  E-value=47  Score=14.95  Aligned_cols=31  Identities=26%  Similarity=0.460  Sum_probs=18.5

Q ss_pred             HHHHHHHHHHCCCCEEEEECCCHHHHHHHHH
Q ss_conf             9999999851897179984572457889997
Q 537021.9.peg.3   46 FVNDIALLKKSNITPVIVHGGGPQIGAVLEK   76 (150)
Q Consensus        46 ~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~   76 (150)
                      +..-+..|...|.+-|+|.||+.-+.+.+++
T Consensus       119 ~~~~l~~L~~~g~~~ilveGG~~l~~~fl~~  149 (196)
T pfam01872       119 LKELLRELKERGIRSLLVEGGATLAGSLLRA  149 (196)
T ss_pred             HHHHHHHHHHCCCCEEEEECHHHHHHHHHHC
T ss_conf             9999999986799879998579999999768


No 150
>TIGR00329 gcp metalloendopeptidase, putative, glycoprotease family; InterPro: IPR000905   Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site . The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases .   Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins.   Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule.    This group of metallopeptidases belong to MEROPS peptidase family M22 (clan MK). The type example being O-sialoglycoprotein endopeptidase from Pasteurella haemolytica (Mannheimia haemolytica).   O-Sialoglycoprotein endopeptidase is secreted by the bacterium P. haemolytica, and digests only proteins that are heavily sialylated, in particular those with sialylated serine and threonine residues . Substrate proteins include glycophorin A and leukocyte surface antigens CD34, CD43, CD44 and CD45 , . Removal of glycosylation, by treatment with neuraminidase, completely negates susceptibility to O-sialoglycoprotein endopeptidase digestion , .   Sequence similarity searches have revealed other members of the M22 family, from yeast, Mycobacterium, Haemophilus influenzae and the cyanobacterium Synechocystis . The zinc-binding and catalytic residues of this family have not been determined, although the motif HMEGH may be a zinc-binding region .; GO: 0008450 O-sialoglycoprotein endopeptidase activity, 0006508 proteolysis.
Probab=28.82  E-value=48  Score=14.91  Aligned_cols=62  Identities=21%  Similarity=0.305  Sum_probs=37.5

Q ss_pred             HHHHCCHHHHHHHHHHHHHHHHCCCC-EEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHH----HHHHH
Q ss_conf             56724746899999999998518971-79984572457889997089811557834661248----99999
Q 537021.9.peg.3   34 GHVMNCTDLSKDFVNDIALLKKSNIT-PVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQT----AEVVE   99 (150)
Q Consensus        34 G~~l~d~~l~~~~~~dI~ll~~~G~k-~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~t----l~iv~   99 (150)
                      |+.++|.  .-+.++.++  +.+|+. ..+-|=|||.|++.|.+.|-+-+|.==+-.|+...    ++.--
T Consensus       159 geT~DdA--~Ge~fDK~a--r~~gl~Gt~iayPGGp~i~~~l~~~g~~~~f~~p~P~~~~~~~~~~lDfSF  225 (337)
T TIGR00329       159 GETLDDA--VGEAFDKVA--RLLGLGGTMIAYPGGPAIEEELAKKGDEKTFYFPLPLTKKPMKRNMLDFSF  225 (337)
T ss_pred             EEEHHHH--HHHHHHHHH--HHHCCCCCEEECCCCCHHHHHHHCCCCCCCEECCCCCCCCCCCCCCCHHHH
T ss_conf             5202245--424467777--761889626506897426899854788775542666213688763403337


No 151
>PRK06859 consensus
Probab=28.23  E-value=49  Score=14.84  Aligned_cols=47  Identities=19%  Similarity=0.321  Sum_probs=18.9

Q ss_pred             HHHHHHHCCCCCEEECCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCC
Q ss_conf             78899970898115578346612489999999741668999999982899
Q 537021.9.peg.3   70 IGAVLEKMGIKSKFENGLRITDQQTAEVVEMVLAGSINKKIVSLINQTGT  119 (150)
Q Consensus        70 I~~~l~~~gi~~~~v~G~RVTd~~tl~iv~~vL~G~vn~~lv~~L~~~g~  119 (150)
                      ++..|+++|.++.-.+--+|+|  ||.+++.-.-|+-| .|-+...+.++
T Consensus       106 Ln~El~r~g~~~~~~~~~~viD--TL~lAR~~~Pg~~~-sLdaLc~r~~i  152 (253)
T PRK06859        106 MDYEFRKLNLNVKTDDICLVTD--TLQMARQMYPGKRN-NLDALCDRLGI  152 (253)
T ss_pred             HHHHHHHCCCCCCCCCCCCEEE--HHHHHHHHCCCCCC-CHHHHHHHCCC
T ss_conf             9999997499965322001133--99999987879977-99999997399


No 152
>KOG3517 consensus
Probab=27.83  E-value=50  Score=14.80  Aligned_cols=71  Identities=24%  Similarity=0.327  Sum_probs=43.3

Q ss_pred             EECHHHHCCHHHHHHHHHHHHHHHHCCCCEE-------EEECCCHHHHHHHHHCC-CCCEEECCC--CCCCHHHHHHHHH
Q ss_conf             8675672474689999999999851897179-------98457245788999708-981155783--4661248999999
Q 537021.9.peg.3   31 KYGGHVMNCTDLSKDFVNDIALLKKSNITPV-------IVHGGGPQIGAVLEKMG-IKSKFENGL--RITDQQTAEVVEM  100 (150)
Q Consensus        31 K~GG~~l~d~~l~~~~~~dI~ll~~~G~k~V-------iVHGgg~qI~~~l~~~g-i~~~~v~G~--RVTd~~tl~iv~~  100 (150)
                      -+||..+....|-..+--.|..|..+|+++-       +-||.=..|-++..+.| |-|--+.|-  |||++...+.++.
T Consensus        10 QLGGVFVNGRPLPna~RlrIVELarlGiRPCDISRQLrvSHGCVSKILaRy~EtGsIlPGaIGGSkPRVTTP~VV~~IR~   89 (334)
T KOG3517          10 QLGGVFVNGRPLPNAIRLRIVELARLGIRPCDISRQLRVSHGCVSKILARYNETGSILPGAIGGSKPRVTTPKVVKYIRS   89 (334)
T ss_pred             HCCCEEECCCCCCCHHHHHHHHHHHCCCCCCCHHHHHHHCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHHHH
T ss_conf             12515676854861255668889871887553344542310259999987625775456644787886677359999987


Q ss_pred             H
Q ss_conf             9
Q 537021.9.peg.3  101 V  101 (150)
Q Consensus       101 v  101 (150)
                      -
T Consensus        90 ~   90 (334)
T KOG3517          90 L   90 (334)
T ss_pred             H
T ss_conf             6


No 153
>PRK06278 cobyrinic acid a,c-diamide synthase; Validated
Probab=27.83  E-value=34  Score=15.86  Aligned_cols=17  Identities=35%  Similarity=0.384  Sum_probs=14.0

Q ss_pred             HHHHCCCEEEEEECHHH
Q ss_conf             99875988999867567
Q 537021.9.peg.3   20 VQFYENETIVVKYGGHV   36 (150)
Q Consensus        20 i~~~r~ktiVIK~GG~~   36 (150)
                      .-+.|+++||.|+|+++
T Consensus       266 ~~kL~~kvfv~KiGpDv  282 (482)
T PRK06278        266 VSKLRGKTFVSKIGPDV  282 (482)
T ss_pred             HHHHCCCEEEEECCCCH
T ss_conf             88750648998608866


No 154
>pfam07521 RMMBL RNA-metabolising metallo-beta-lactamase. The metallo-beta-lactamase fold contains five sequence motifs. The first four motifs are found in pfam00753 and are common to all metallo-beta-lactamases. The fifth motif appears to be specific to function. This entry represents the fifth motif from metallo-beta-lactamases involved in RNA metabolism.
Probab=27.62  E-value=50  Score=14.77  Aligned_cols=33  Identities=18%  Similarity=0.369  Sum_probs=20.5

Q ss_pred             EEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECC
Q ss_conf             8999867567247468999999999985189717998457
Q 537021.9.peg.3   27 TIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGG   66 (150)
Q Consensus        27 tiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGg   66 (150)
                      +.-+.+.|.+  |.+-+..|++.+   ..  .++++|||-
T Consensus         9 v~~~~fSgHA--d~~eL~~~i~~~---~p--~~v~~vHGe   41 (43)
T pfam07521         9 VETLHFSGHA--DQEELLELLKLL---KP--KKVILVHGE   41 (43)
T ss_pred             EEEEEEECCC--CHHHHHHHHHHC---CC--CEEEEECCC
T ss_conf             9999887368--989999999966---99--989994399


No 155
>cd02871 GH18_chitinase_D-like GH18 domain of Chitinase D (ChiD).  ChiD, a chitinase found in Bacillus circulans, hydrolyzes the 1,4-beta-linkages of N-acetylglucosamine in chitin and chitodextrins.  The domain architecture of ChiD includes a catalytic glycosyl hydrolase family 18 (GH18) domain, a chitin-binding domain, and a fibronectin type III domain. The chitin-binding and fibronectin type III domains are located either N-terminal or C-terminal to the catalytic domain.  This family includes exochitinase Chi36 from Bacillus cereus.
Probab=27.43  E-value=51  Score=14.75  Aligned_cols=24  Identities=29%  Similarity=0.452  Sum_probs=14.0

Q ss_pred             HHHHHHHHHHHHCCCCEEEEECCC
Q ss_conf             999999999851897179984572
Q 537021.9.peg.3   44 KDFVNDIALLKKSNITPVIVHGGG   67 (150)
Q Consensus        44 ~~~~~dI~ll~~~G~k~ViVHGgg   67 (150)
                      ..|.+||..+++.|.+++|=-||+
T Consensus        60 ~~f~~dI~~~qs~G~kVllSiGGa   83 (312)
T cd02871          60 AEFKADIKALQAKGKKVLISIGGA   83 (312)
T ss_pred             HHHHHHHHHHHHCCCEEEEEECCC
T ss_conf             999999999997899899980787


No 156
>PRK12852 groEL chaperonin GroEL; Reviewed
Probab=27.09  E-value=51  Score=14.71  Aligned_cols=109  Identities=16%  Similarity=0.188  Sum_probs=53.2

Q ss_pred             HCCCEEEEEECHHHHC-CHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCC----CCCCHHHHHH
Q ss_conf             7598899986756724-7468999999999985189717998457245788999708981155783----4661248999
Q 537021.9.peg.3   23 YENETIVVKYGGHVMN-CTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGL----RITDQQTAEV   97 (150)
Q Consensus        23 ~r~ktiVIK~GG~~l~-d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~----RVTd~~tl~i   97 (150)
                      ..+.+.+|.+||+.-. =.+..+++-.-+...+..=..- +|.|||...-+....+.--.....+.    +.- .++|++
T Consensus       372 ~~g~~~tI~vgG~t~~~~~E~er~l~DAl~avr~a~~~g-vvpGGGa~e~~~~~~l~~~~~~~~~~q~~i~a~-a~ALe~  449 (542)
T PRK12852        372 LAGGVAVIRVGGATEVEVKEKKDRVEDALNATRAAVQEG-IVPGGGVALLRAKKAVGRINNDNADVQAGINIV-LKALEA  449 (542)
T ss_pred             CCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHCC-EEECCHHHHHHHHHHHHHHHCCCHHHHHHHHHH-HHHHHH
T ss_conf             378627999667744589999999999999999872279-561557999999999998735586289999999-999999


Q ss_pred             HHHHHH---HHHHHHHHHHHHHCCCCEEEEEEEECCEEE
Q ss_conf             999974---166899999998289976877865589899
Q 537021.9.peg.3   98 VEMVLA---GSINKKIVSLINQTGTQAIGICGKDGNMVF  133 (150)
Q Consensus        98 v~~vL~---G~vn~~lv~~L~~~g~~avglsg~dg~li~  133 (150)
                      +-..|+   |.=-.+++..|.+...+..|+....|.+.-
T Consensus       450 ip~tLaeNAG~d~~~vv~~l~~~~~~~~GiD~~~G~~~D  488 (542)
T PRK12852        450 PIRQIAENAGVEGSIVVGKILENKSETFGFDAQTEEYVD  488 (542)
T ss_pred             HHHHHHHHCCCCHHHHHHHHHHCCCCCEEEECCCCCEEE
T ss_conf             999999877998899999998448998848778992703


No 157
>KOG0202 consensus
Probab=27.05  E-value=51  Score=14.71  Aligned_cols=112  Identities=18%  Similarity=0.257  Sum_probs=70.8

Q ss_pred             CCHHHHHHHHHHHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHH-HHHCCCCCEE--ECC
Q ss_conf             6228999999998759889998675672474689999999999851897179984572457889-9970898115--578
Q 537021.9.peg.3   10 AEILEQVLPFVQFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAV-LEKMGIKSKF--ENG   86 (150)
Q Consensus        10 a~~l~ea~pYi~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~-l~~~gi~~~~--v~G   86 (150)
                      -+-+.+.++|.+.-.+=|||=-+|   |.||. +......|..++..|++++++-|-.+.--.. ..+.|+.+.-  +.+
T Consensus       557 ~~~l~~~s~~~~~E~~LtFvGlVG---i~DPP-R~ev~~ai~~c~~aGIrV~mITGD~~~TA~AI~r~iGi~~~~ed~~~  632 (972)
T KOG0202         557 DQDLNDTSNRATAESDLTFVGLVG---ILDPP-RPEVADAIELCRQAGIRVIMITGDNKETAEAIAREIGIFSEDEDVSS  632 (972)
T ss_pred             HHHHCCCCCCCCCCCCEEEEEEEE---CCCCC-CHHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHHCCCCCCCCCCC
T ss_conf             343125432111125438998742---46899-65689999999973977999838987899999998398767742112


Q ss_pred             CCCCCHHHHHH--------HHHHH-----HHHHHHHHHHHHHHCCCCEEEEEE
Q ss_conf             34661248999--------99997-----416689999999828997687786
Q 537021.9.peg.3   87 LRITDQQTAEV--------VEMVL-----AGSINKKIVSLINQTGTQAIGICG  126 (150)
Q Consensus        87 ~RVTd~~tl~i--------v~~vL-----~G~vn~~lv~~L~~~g~~avglsg  126 (150)
                      .-.|-.|-.+.        +..++     .-+--.+|+.+|++.|- -+.++|
T Consensus       633 ~~~TG~efD~ls~~~~~~~~~~~~vFaR~~P~HK~kIVeaLq~~ge-ivAMTG  684 (972)
T KOG0202         633 MALTGSEFDDLSDEELDDAVRRVLVFARAEPQHKLKIVEALQSRGE-VVAMTG  684 (972)
T ss_pred             CCCCHHHHCCCCHHHHHHHHHCCEEEEECCCHHHHHHHHHHHHCCC-EEEECC
T ss_conf             3223444404998999877642359996383658999999986498-898327


No 158
>PRK09451 glmU bifunctional N-acetylglucosamine-1-phosphate uridyltransferase/glucosamine-1-phosphate acetyltransferase; Provisional
Probab=26.84  E-value=52  Score=14.68  Aligned_cols=38  Identities=18%  Similarity=0.359  Sum_probs=29.3

Q ss_pred             CCCHHHHHHHHHHHHCCCEEEEEECHHHHCCHHHHHHHHH
Q ss_conf             0622899999999875988999867567247468999999
Q 537021.9.peg.3    9 QAEILEQVLPFVQFYENETIVVKYGGHVMNCTDLSKDFVN   48 (150)
Q Consensus         9 ~a~~l~ea~pYi~~~r~ktiVIK~GG~~l~d~~l~~~~~~   48 (150)
                      .|..++.|+||++.  ++.|+|-+|..-+.+++.+.+|++
T Consensus        82 TghAV~~A~~~l~~--~~~vLVl~GD~PLi~~~tl~~l~~  119 (456)
T PRK09451         82 TGHAMQQAAPFFAD--DEDILMLYGDVPLISVETLQRLRD  119 (456)
T ss_pred             CHHHHHHHHHHCCC--CCCEEEEECCCCCCCHHHHHHHHH
T ss_conf             29999998886066--885899959801569999999986


No 159
>PRK11391 etp phosphotyrosine-protein phosphatase; Provisional
Probab=26.72  E-value=52  Score=14.67  Aligned_cols=69  Identities=13%  Similarity=0.216  Sum_probs=46.4

Q ss_pred             CEEEEEECHHHHCCHHHHHHHHHHHH---HHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHHH
Q ss_conf             88999867567247468999999999---98518971799845724578899970898115578346612489
Q 537021.9.peg.3   26 ETIVVKYGGHVMNCTDLSKDFVNDIA---LLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQTA   95 (150)
Q Consensus        26 ktiVIK~GG~~l~d~~l~~~~~~dI~---ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~tl   95 (150)
                      +.|..-+-|++..+|-. +.++++..   ...+.|.....=++.-|..-+.+++.|+...-+...++|.+..-
T Consensus         3 ~~ILfVC~GNiCRSPmA-Eai~r~~~~~i~v~SaG~~~~~g~~~d~~a~~~l~~~gid~~~h~ar~i~~~~~~   74 (144)
T PRK11391          3 NSILVVCTGNICRSPIG-ERLLRKRLPGVKVKSAGVHGLVKHPADATAADVAANHGVSLEGHAGRKLTAEMAR   74 (144)
T ss_pred             CEEEEEECCHHHHHHHH-HHHHHHHCCCCCEEEEEEECCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCHHHHH
T ss_conf             87999959725288999-9999950778815766653578989799999999981999300002247887986


No 160
>TIGR01737 FGAM_synth_I phosphoribosylformylglycinamidine synthase I; InterPro: IPR010075   In some species, phosphoribosylformylglycinamidine synthase is composed of a single polypeptide chain. This entry describes the PurQ protein of Bacillus subtilis (where PurL, PurQ, and PurS are required for phosphoribosylformylglycinamidine synthase activity) and functionally equivalent proteins from other bacteria and archaea.; GO: 0004642 phosphoribosylformylglycinamidine synthase activity, 0006189 'de novo' IMP biosynthetic process, 0005737 cytoplasm.
Probab=26.64  E-value=52  Score=14.66  Aligned_cols=84  Identities=17%  Similarity=0.331  Sum_probs=49.8

Q ss_pred             CEEEEEECHHHHCCHHHHHHHHHHHHHHH-HCCCCEEEEECCCHHHHHHHHHCC-CCCE-----E--E---CCCCCCCHH
Q ss_conf             88999867567247468999999999985-189717998457245788999708-9811-----5--5---783466124
Q 537021.9.peg.3   26 ETIVVKYGGHVMNCTDLSKDFVNDIALLK-KSNITPVIVHGGGPQIGAVLEKMG-IKSK-----F--E---NGLRITDQQ   93 (150)
Q Consensus        26 ktiVIK~GG~~l~d~~l~~~~~~dI~ll~-~~G~k~ViVHGgg~qI~~~l~~~g-i~~~-----~--v---~G~RVTd~~   93 (150)
                      |.-||+++|+-.+.+..        ..+. .+|.++++||     |.+|-+... ++-+     |  |   .|  ++=-+
T Consensus         2 kvAV~~FPGtNCd~D~~--------~A~~k~~Gv~~~~Vh-----ikqw~~D~~r~~~~rsvfD~D~VvlPGG--FSyGD   66 (264)
T TIGR01737         2 KVAVIRFPGTNCDRDTV--------YALEKLLGVDAEIVH-----IKQWYEDKSRLSDQRSVFDYDAVVLPGG--FSYGD   66 (264)
T ss_pred             EEEEEECCCCCCHHHHH--------HHHHHHCCCCEEEEE-----EEECHHHHCCCCCCCEEEEEEEEEECCC--CCHHH
T ss_conf             68999779998408999--------999971499837998-----6521000047876641787334886687--77235


Q ss_pred             HHHHHHHHHHHHHHHHHH---HHHHHCCCCEEEEE
Q ss_conf             899999997416689999---99982899768778
Q 537021.9.peg.3   94 TAEVVEMVLAGSINKKIV---SLINQTGTQAIGIC  125 (150)
Q Consensus        94 tl~iv~~vL~G~vn~~lv---~~L~~~g~~avgls  125 (150)
                      =|..-.++-. +|...+.   ..+.+.|.+-.|+|
T Consensus        67 YLRaGAIAa~-rvk~p~~~~V~~~A~~G~pVLGiC  100 (264)
T TIGR01737        67 YLRAGAIAAL-RVKSPIMQEVREFAEKGKPVLGIC  100 (264)
T ss_pred             HHHHHHHHHH-HCCHHHHHHHHHHHHCCCEEEEEE
T ss_conf             7779999987-503088999999995598289976


No 161
>TIGR00099 Cof-subfamily Cof-like hydrolase; InterPro: IPR000150   The Haloacid Dehydrogenase (HAD) superfamily includes phosphatases, phosphonatases, P-type ATPases, beta-phosphoglucomutases, phosphomannomutases, and dehalogenases, which are involved in a variety of cellular processes ranging from amino acid biosynthesis to detoxification . Proteins in this entry are mostly uncharacterised, though they form a distinct subgroup within the HAD superfamily. Members are found almost exclusively in bacteria and many species contain several paralogs, for example Escherichia coli contains a total of six proteins from this entry. Sequence similarities suggest that these enzymes are phosphatases which work on phosphorylated sugars.   The YbiV (P75792 from SWISSPROT) protein from E. coli has been experimentally characterised . This enzyme catalyzes the hydrolysis of sugar phosphate to sugar and inorganic phosphate. It has a wide substrate specificity, catalyzing the hydrolysis of ribose-5-phosphate and glucose-6-phosphate most efficiently, but it is not known if these are the real substrates in vivo. The protein appears to be a monomer that contains two domains, an alpha-beta hydrolase domain that forms a Rossman fold, and an alpha-beta domain. The active site is found in a negatively charged cavity found at the interface between the two domains. ; GO: 0016787 hydrolase activity, 0008152 metabolic process.
Probab=26.39  E-value=53  Score=14.63  Aligned_cols=112  Identities=13%  Similarity=0.303  Sum_probs=76.0

Q ss_pred             EEEEECHHHHC-CH-HHHHHHHHHHHHHHHCCCCEEEEECCC-----HHHHHHHHHCCCCC-EEE---CCCCCCCHHH-H
Q ss_conf             99986756724-74-689999999999851897179984572-----45788999708981-155---7834661248-9
Q 537021.9.peg.3   28 IVVKYGGHVMN-CT-DLSKDFVNDIALLKKSNITPVIVHGGG-----PQIGAVLEKMGIKS-KFE---NGLRITDQQT-A   95 (150)
Q Consensus        28 iVIK~GG~~l~-d~-~l~~~~~~dI~ll~~~G~k~ViVHGgg-----~qI~~~l~~~gi~~-~~v---~G~RVTd~~t-l   95 (150)
                      +++=++|..+. +. ...+...+-|..++.-|+++||.=|=.     .++.....+.+++. .++   +|..+.+... =
T Consensus         2 i~~D~DGTLL~~~~~~i~~~~~~al~~l~~~G~~~~laTGR~~~g~~~~~~~~~~~~~~~~~~~i~~nGg~~~~~~~~~~   81 (270)
T TIGR00099         2 IFIDLDGTLLNKDDKEISPSTKEALAKLREKGIKVVLATGRPLAGVYKELKEILKELGLDTDPYISANGGAVIDEGGGNE   81 (270)
T ss_pred             EEEECCCEEECCCCCCCCHHHHHHHHHHHHCCCEEEEEECCCCCCHHHHHHHHHHHHCCCCCEEEEECCCEEEEECCCCE
T ss_conf             88963850355998751888999999999669889998188643478999998885067855687617848998468700


Q ss_pred             HHHHHHHHHHHHHHHHHHHHHCCC-CEEEEEEEECCEEEEEECCC
Q ss_conf             999999741668999999982899-76877865589899997475
Q 537021.9.peg.3   96 EVVEMVLAGSINKKIVSLINQTGT-QAIGICGKDGNMVFAEKARH  139 (150)
Q Consensus        96 ~iv~~vL~G~vn~~lv~~L~~~g~-~avglsg~dg~li~a~~~~~  139 (150)
                      .+.+..|...+-..|...+++++. -.+-+.+....+...++.++
T Consensus        82 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  126 (270)
T TIGR00099        82 ILYKKPLDLDLVEELLNFLKKHGQELDVILYSDSDSIYALKNDPE  126 (270)
T ss_pred             EEEECCCCHHHHHHHHHHHHHCCCCEEEEEECCCCEEEEECCCCC
T ss_conf             563024788899999999973489257999817762565237875


No 162
>TIGR03590 PseG pseudaminic acid biosynthesis-associated protein PseG. This protein is found in association with enzymes involved in the biosynthesis of pseudaminic acid, a component of polysaccharide in certain Pseudomonas strains as well as a modification of flagellin in Campylobacter and Hellicobacter. The role of this protein is unclear, although it may participate in N-acetylation in conjunction with, or in the absence of PseH (TIGR03585) as it often scores above the trusted cutoff to pfam00583 representing a family of acetyltransferases.
Probab=26.34  E-value=53  Score=14.63  Aligned_cols=99  Identities=18%  Similarity=0.225  Sum_probs=50.3

Q ss_pred             CCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHC-CCCCEEECCCCCCC-HHHHHHHHHH-
Q ss_conf             98899986756724746899999999998518971799845724578899970-89811557834661-2489999999-
Q 537021.9.peg.3   25 NETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKM-GIKSKFENGLRITD-QQTAEVVEMV-  101 (150)
Q Consensus        25 ~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~-gi~~~~v~G~RVTd-~~tl~iv~~v-  101 (150)
                      -+.++|-+||+  +..++...+++-+... ....++.+|.|.+..-.+.+.+. .-.+.+.   -..+ ++..+....+ 
T Consensus       171 ~~~Ili~~GGs--D~~~lt~~il~~l~~~-~~~~~i~vvig~~~~~~~~i~~~~~~~~~~~---~~~~~~~m~~~m~~aD  244 (280)
T TIGR03590       171 LRRVLVSFGGA--DPDNLTLKLLSALAES-QINISITLVTGSSNPNLDELKKFAKEYPNII---LFIDVENMAELMNEAD  244 (280)
T ss_pred             CCEEEEEECCC--CHHHHHHHHHHHHHHH-CCCCCEEEEECCCCCCHHHHHHHHHHCCCEE---EECCHHHHHHHHHHCC
T ss_conf             53289997787--7000899999999851-6685679998679876699999997289969---9659889999999779


Q ss_pred             H-HHHHHHHHHHHHHHCCCCEEEEEEEECC
Q ss_conf             7-4166899999998289976877865589
Q 537021.9.peg.3  102 L-AGSINKKIVSLINQTGTQAIGICGKDGN  130 (150)
Q Consensus       102 L-~G~vn~~lv~~L~~~g~~avglsg~dg~  130 (150)
                      + .+.-.. -.-.+...|++++.++-.|..
T Consensus       245 laI~agG~-t~~El~~~gvP~i~i~~adNQ  273 (280)
T TIGR03590       245 LAIGAAGS-TSWERCCLGLPSLSIVLAENQ  273 (280)
T ss_pred             EEEECCCH-HHHHHHHHCCCEEEEEECCCH
T ss_conf             99985965-899999949998999970256


No 163
>COG3956 Protein containing tetrapyrrole methyltransferase domain and MazG-like (predicted pyrophosphatase) domain [General function prediction only]
Probab=25.65  E-value=54  Score=14.55  Aligned_cols=72  Identities=21%  Similarity=0.253  Sum_probs=52.2

Q ss_pred             HHHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHHHHH
Q ss_conf             998759889998675672474689999999999851897179984572457889997089811557834661248999
Q 537021.9.peg.3   20 VQFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQTAEV   97 (150)
Q Consensus        20 i~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~tl~i   97 (150)
                      +.+-+.|-+|..++|+-+..+...    +=+.....-+---|=|-||-..||..++.+.|.|  ++|..+-|.-+|.-
T Consensus        77 veaAkekdIvYAVPGHP~VAEktV----qlL~e~~ek~ni~Vkilgg~SFiD~~fealkiDP--veG~q~vDa~~l~~  148 (488)
T COG3956          77 VEAAKEKDIVYAVPGHPLVAEKTV----QLLIEACEKENIKVKILGGQSFIDALFEALKIDP--VEGFQIVDATDLSN  148 (488)
T ss_pred             HHHHCCCCEEEECCCCCHHHHHHH----HHHHHHHHCCCCEEEEECCCHHHHHHHHHHCCCC--CCCCEEECCCHHHH
T ss_conf             986223563896699951688899----9999987625852797575026889998845785--56736741000157


No 164
>TIGR01249 pro_imino_pep_1 proline iminopeptidase; InterPro: IPR005944   Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes . They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence . Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases .   Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base . The geometric orientations of the catalytic residues are similar between families, despite different protein folds . The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) , .   Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins.   Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule.    This group of serine peptidase belong to MEROPS peptidase family S33 (clan SC). They are proline iminopeptidase (Prolyl aminopeptidase, 3.4.11.5 from EC), which catalyzes the removal of the N-terminal proline from peptides. This family represents one of two related families of proline iminopeptidase containing the alpha/beta fold. The fine specificities of the various members, including both the range of short peptides from which proline can be removed and whether other amino acids such as alanine can be also removed, may vary among members.  ; GO: 0016804 prolyl aminopeptidase activity, 0005737 cytoplasm.
Probab=25.63  E-value=54  Score=14.54  Aligned_cols=55  Identities=22%  Similarity=0.344  Sum_probs=31.3

Q ss_pred             HHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEE---------ECCCCCCCHHHHHHHHHH
Q ss_conf             999999998518971799845724578899970898115---------578346612489999999
Q 537021.9.peg.3   45 DFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKF---------ENGLRITDQQTAEVVEMV  101 (150)
Q Consensus        45 ~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~---------v~G~RVTd~~tl~iv~~v  101 (150)
                      .+++||..++  .++.+||||==.-|.=+++...+.--|         -.|-|=.|+..++-...+
T Consensus       242 ~lL~ni~~i~--~i~~~iv~GRyDl~cPl~~awaL~kafPea~L~v~~~AGHsa~dp~~~a~lv~a  305 (310)
T TIGR01249       242 FLLDNISKIR--NIPTVIVHGRYDLICPLQSAWALHKAFPEAELKVVNNAGHSAFDPNNLAALVVA  305 (310)
T ss_pred             HHHHHHHHHC--CCCEEEEECCEEHCCHHCCHHHHHHCCCCCEEEEECCCCCCCCCHHHHHHHHHH
T ss_conf             8886677640--687379844600002003544675218551466745788665765479999999


No 165
>cd07022 S49_Sppa_36K_type Signal peptide peptidase A (SppA) 36K type, a serine protease, has catalytic Ser-Lys dyad. Signal peptide peptidase A (SppA; Peptidase S49; Protease IV) 36K type: SppA is found in all three domains of life and is involved in the cleavage of signal peptides after their removal from the precursor proteins by signal peptidases. Members in this subfamily are all bacterial and include sohB peptidase and protein C. These are sometimes referred to as 36K type since they contain only one domain, unlike E. coli SppA that also contains an amino-terminal domain. Site-directed mutagenesis and sequence analysis have shown these SppAs to be serine proteases. The predicted active site serine for members in this family occurs in a transmembrane domain. Mutagenesis studies also suggest that the catalytic center comprises a Ser-Lys dyad and not the usual Ser-His-Asp catalytic triad found in the majority of serine proteases.
Probab=25.56  E-value=55  Score=14.54  Aligned_cols=78  Identities=14%  Similarity=0.215  Sum_probs=45.3

Q ss_pred             CHHHHHHHHHCCCCCEEE-CC--------CCCCCHHHHHHHHHHHHHHHHHHHHHHHHH-CCCCEEEEEEEECCEEEEEE
Q ss_conf             245788999708981155-78--------346612489999999741668999999982-89976877865589899997
Q 537021.9.peg.3   67 GPQIGAVLEKMGIKSKFE-NG--------LRITDQQTAEVVEMVLAGSINKKIVSLINQ-TGTQAIGICGKDGNMVFAEK  136 (150)
Q Consensus        67 g~qI~~~l~~~gi~~~~v-~G--------~RVTd~~tl~iv~~vL~G~vn~~lv~~L~~-~g~~avglsg~dg~li~a~~  136 (150)
                      .+.+..+|++.|++.+.+ .|        .|=.+++..+..+..+ ..+.........+ ++...--+....|.++.++.
T Consensus       115 ~~~~~~ll~k~Gi~~~~~~sg~~K~~~~p~~~~s~e~~~~~q~~l-d~~~~~F~~~Va~~R~~~~~~~~~~~G~v~~g~~  193 (214)
T cd07022         115 HVDQSKALEKAGLKVTLIFAGAHKVDGNPDEPLSDEARARLQAEV-DALYAMFVAAVARNRGLSAAAVRATEGGVFRGQE  193 (214)
T ss_pred             CCCHHHHHHHCCCEEEEEECCCCCCCCCCCCCCCHHHHHHHHHHH-HHHHHHHHHHHHHHCCCCHHHHHHHCCCEEEHHH
T ss_conf             177688898679547874057633236875656889999999999-9999999999998579997788871698988999


Q ss_pred             CCCCCCCCCC
Q ss_conf             4752111799
Q 537021.9.peg.3  137 ARHSLRLSPN  146 (150)
Q Consensus       137 ~~~~lg~vg~  146 (150)
                       -.++|++-+
T Consensus       194 -A~~~GLVD~  202 (214)
T cd07022         194 -AVAAGLADA  202 (214)
T ss_pred             -HHHCCCCCC
T ss_conf             -998599754


No 166
>KOG1533 consensus
Probab=25.06  E-value=56  Score=14.48  Aligned_cols=107  Identities=19%  Similarity=0.195  Sum_probs=61.3

Q ss_pred             CHHHHHHHHHHHHCCCEEEEEECHHH--HCCHHHHHHHHHHHHHHHHCCCCEEEEECC--------CHHHHHHHHHC---
Q ss_conf             22899999999875988999867567--247468999999999985189717998457--------24578899970---
Q 537021.9.peg.3   11 EILEQVLPFVQFYENETIVVKYGGHV--MNCTDLSKDFVNDIALLKKSNITPVIVHGG--------GPQIGAVLEKM---   77 (150)
Q Consensus        11 ~~l~ea~pYi~~~r~ktiVIK~GG~~--l~d~~l~~~~~~dI~ll~~~G~k~ViVHGg--------g~qI~~~l~~~---   77 (150)
                      .||++.   |+.++++.++.-++|-+  ..+.+...++++   .|...+.+++.||=.        +..|+.+|-.+   
T Consensus        86 dwl~~~---l~~~~~~Y~lFDcPGQVELft~h~~l~~I~~---~Lek~~~rl~~V~LiDs~ycs~p~~~iS~lL~sl~tM  159 (290)
T KOG1533          86 DWLLEK---LKPLTDHYVLFDCPGQVELFTHHDSLNKIFR---KLEKLDYRLVAVNLIDSHYCSDPSKFISSLLVSLATM  159 (290)
T ss_pred             HHHHHH---HHHCCCCEEEEECCCCEEEEECCCHHHHHHH---HHHHCCCEEEEEEEEECEEECCHHHHHHHHHHHHHHH
T ss_conf             999997---4523474899957982798742560999999---9997695379997330214378489999999999999


Q ss_pred             -CCCCEEE-----------CC-----C-CCCCHHHHHHHHHHH--------HHHHHHHHHHHHHHCCCCEEE
Q ss_conf             -8981155-----------78-----3-466124899999997--------416689999999828997687
Q 537021.9.peg.3   78 -GIKSKFE-----------NG-----L-RITDQQTAEVVEMVL--------AGSINKKIVSLINQTGTQAIG  123 (150)
Q Consensus        78 -gi~~~~v-----------~G-----~-RVTd~~tl~iv~~vL--------~G~vn~~lv~~L~~~g~~avg  123 (150)
                       .++..-+           .|     + =.|+-+.+...+.-|        -++.|..|++.++..+.-+..
T Consensus       160 l~melphVNvlSK~Dl~~~ygkl~f~ld~yt~v~Dl~yL~~~ld~dp~~~kYrkLne~ic~~IeD~~LVSF~  231 (290)
T KOG1533         160 LHMELPHVNVLSKADLLKKYGKLPFNLDFYTEVQDLSYLEDLLDVDPRLRKYRKLNEAICELIEDFNLVSFE  231 (290)
T ss_pred             HHHCCCCHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCCCEEEE
T ss_conf             862365001425767787616665453123206667777887502735667888899999997605730468


No 167
>TIGR02824 quinone_pig3 putative NAD(P)H quinone oxidoreductase, PIG3 family; InterPro: IPR014189   Members of this family are putative quinone oxidoreductases that belong to the broader superfamily of zinc-dependent alcohol (of medium chain length) dehydrogenases and quinone oxidoreductases. An alignment shows no motif of conserved Cys residues as are found in zinc-binding members of the superfamily, consequently members of this entry are likely to be quinone oxidoreductases. A member of this family in Homo sapiens (Human), PIG3, is induced by p53 but is otherwise uncharacterised..
Probab=24.94  E-value=43  Score=15.23  Aligned_cols=28  Identities=18%  Similarity=0.219  Sum_probs=11.1

Q ss_pred             HHHHHCCCCEEEEECCCHHHHHHHHHCC
Q ss_conf             9985189717998457245788999708
Q 537021.9.peg.3   51 ALLKKSNITPVIVHGGGPQIGAVLEKMG   78 (150)
Q Consensus        51 ~ll~~~G~k~ViVHGgg~qI~~~l~~~g   78 (150)
                      .+-+..|.++..-=|.--......+++|
T Consensus       163 qLAKA~Ga~V~~TaGS~eK~~~a~~~LG  190 (334)
T TIGR02824       163 QLAKAFGARVFTTAGSDEKCATACEALG  190 (334)
T ss_pred             HHHHHCCCEEEEEECCHHHHHHHHHHCC
T ss_conf             9998569759998289899999998609


No 168
>TIGR01326 OAH_OAS_sulfhy O-acetylhomoserine aminocarboxypropyltransferase/cysteine synthase; InterPro: IPR006235   This family of sequences is a distinct clade of the Cys/Met metabolism pyridoxal phosphate-dependent enzyme superfamily. Members include examples of OAH/OAS sulfhydrylase, an enzyme with activity both as O-acetylhomoserine sulfhydrylase (OAH, 2.5.1.49 from EC) and O-acetylserine sulfhydrylase (OAS, 2.5.1.47 from EC). An alternate name for OAH sulfhydrylase is homocysteine synthase.; GO: 0016765 transferase activity transferring alkyl or aryl (other than methyl) groups, 0006520 amino acid metabolic process.
Probab=24.87  E-value=42  Score=15.27  Aligned_cols=51  Identities=22%  Similarity=0.266  Sum_probs=35.1

Q ss_pred             HHHHHHHHHHHHCCCCEEEE---E-CCCHHHHHHHHHCCCCCEEECCCCCCC-HHHHHH
Q ss_conf             99999999985189717998---4-572457889997089811557834661-248999
Q 537021.9.peg.3   44 KDFVNDIALLKKSNITPVIV---H-GGGPQIGAVLEKMGIKSKFENGLRITD-QQTAEV   97 (150)
Q Consensus        44 ~~~~~dI~ll~~~G~k~ViV---H-Ggg~qI~~~l~~~gi~~~~v~G~RVTd-~~tl~i   97 (150)
                      .+..-.|.-|-..|-++|=.   . |.--++.-.|+++|||.+|+|+   -| +|.++-
T Consensus        83 AA~~~Ailnla~AGDnIVsS~~LYGGTynLF~~TlkrlGIevrFvd~---dd~pe~~~k  138 (434)
T TIGR01326        83 AAITYAILNLAQAGDNIVSSSYLYGGTYNLFKHTLKRLGIEVRFVDG---DDDPEELEK  138 (434)
T ss_pred             HHHHHHHHHHHHCCCCEEECCCCCCHHHHHHHHHHHHCCEEEEEECC---CCCHHHHHH
T ss_conf             99999999997269826980644422578999955544814887278---888789997


No 169
>cd01013 isochorismatase Isochorismatase, also known as 2,3 dihydro-2,3 dihydroxybenzoate synthase, catalyses the conversion of isochorismate, in the presence of water, to 2,3-dihydroxybenzoate and pyruvate, via the hydrolysis of a vinyl ether, an uncommon reaction in biological systems. Isochorismatase is part of the phenazine biosynthesis pathway. Phenazines are antimicrobial compounds that provide the competitive advantage for certain bacteria.
Probab=24.80  E-value=56  Score=14.45  Aligned_cols=91  Identities=14%  Similarity=0.233  Sum_probs=53.9

Q ss_pred             HHHHHHHHHHHHHHHHCCCCEEEE-E--CCCHHHHHHHHHC---CC-----CCEEECCCCCCCHHHHHHHHHHHHHHHHH
Q ss_conf             468999999999985189717998-4--5724578899970---89-----81155783466124899999997416689
Q 537021.9.peg.3   40 TDLSKDFVNDIALLKKSNITPVIV-H--GGGPQIGAVLEKM---GI-----KSKFENGLRITDQQTAEVVEMVLAGSINK  108 (150)
Q Consensus        40 ~~l~~~~~~dI~ll~~~G~k~ViV-H--Ggg~qI~~~l~~~---gi-----~~~~v~G~RVTd~~tl~iv~~vL~G~vn~  108 (150)
                      +++..++.+=+...+..|+.++-+ |  ...+.-...+...   |.     +.+.++.+...+.+. -+.+.-.++..++
T Consensus        53 ~~vv~ni~rL~~aaR~~GipVi~t~~~~~~~~~~r~~~~~~~g~~~~~g~~g~eiv~eL~P~~~d~-vi~K~~~saF~~T  131 (203)
T cd01013          53 PQLIANIARLRDWCRQAGIPVVYTAQPGNQTPEQRALLNDFWGPGLTASPEETKIVTELAPQPDDT-VLTKWRYSAFKRS  131 (203)
T ss_pred             HHHHHHHHHHHHHHHHCCCEEEEEECCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCE-EEECCCCCCCCCC
T ss_conf             889999999999999879969998347879966730011023788768986453371438999998-9854867886799


Q ss_pred             HHHHHHHHCCCCEEEEEEEECCE
Q ss_conf             99999982899768778655898
Q 537021.9.peg.3  109 KIVSLINQTGTQAIGICGKDGNM  131 (150)
Q Consensus       109 ~lv~~L~~~g~~avglsg~dg~l  131 (150)
                      +|...|++.|+..+-++|...|.
T Consensus       132 ~L~~~Lr~~gi~~Lvv~Gv~T~~  154 (203)
T cd01013         132 PLLERLKESGRDQLIITGVYAHI  154 (203)
T ss_pred             CHHHHHHHCCCCEEEEEEEECCC
T ss_conf             98999997699989998654251


No 170
>PRK12851 groEL chaperonin GroEL; Reviewed
Probab=24.55  E-value=57  Score=14.42  Aligned_cols=109  Identities=14%  Similarity=0.153  Sum_probs=55.5

Q ss_pred             HHHCCCEEEEEECHHHH--CCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCC----CCHHH
Q ss_conf             98759889998675672--4746899999999998518971799845724578899970898115578346----61248
Q 537021.9.peg.3   21 QFYENETIVVKYGGHVM--NCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRI----TDQQT   94 (150)
Q Consensus        21 ~~~r~ktiVIK~GG~~l--~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RV----Td~~t   94 (150)
                      .+..+.+.+|.+||+.-  .+ +..+++-.-+...+.+=... +|.|||..--+....+.- .+-+.|..-    .=.++
T Consensus       370 ~~l~g~~~~I~vgG~t~~~l~-E~er~l~DAl~~vr~~i~~g-vVpGGGa~e~~~a~~l~~-~~~~~g~eqlav~afa~A  446 (541)
T PRK12851        370 AKLAGGVAVIRVGASTEVEVK-EKKDRVDDALHATRAAVEEG-IVPGGGVALLRAVKALDK-LTTANGDQRTGVEIVRRA  446 (541)
T ss_pred             CCCCCCEEEEEEECCCHHHHH-HHHHHHHHHHHHHHHHHHCC-EECCCHHHHHHHHHHHHH-HCCCCHHHHHHHHHHHHH
T ss_conf             014786389996068688999-99999998899999974388-760777999999999986-136876899999999999


Q ss_pred             HHHHHHHHH---HHHHHHHHHHHHHCCCCEEEEEEEECCEEE
Q ss_conf             999999974---166899999998289976877865589899
Q 537021.9.peg.3   95 AEVVEMVLA---GSINKKIVSLINQTGTQAIGICGKDGNMVF  133 (150)
Q Consensus        95 l~iv~~vL~---G~vn~~lv~~L~~~g~~avglsg~dg~li~  133 (150)
                      |+++-.+|+   |.=-.+++..|.. +-...|+....|.+.-
T Consensus       447 Le~ip~tLaeNaG~d~~~vl~~L~~-~~~~~G~d~~~g~~~d  487 (541)
T PRK12851        447 LEAPARQIAENAGAEGSVVVGKVRE-KPESYGFNAATGEYGD  487 (541)
T ss_pred             HHHHHHHHHHHCCCCHHHHHHHHHH-CCCCEEEECCCCCEEE
T ss_conf             9999999999769998999999982-7999848678995711


No 171
>cd02516 CDP-ME_synthetase CDP-ME synthetase is involved in mevalonate-independent isoprenoid production. 4-diphosphocytidyl-2-methyl-D-erythritol synthase (CDP-ME), also called  2C-methyl-d-erythritol 4-phosphate cytidylyltransferase catalyzes the third step in the alternative (non-mevalonate) pathway of Isopentenyl diphosphate (IPP) biosynthesis: the formation of 4-diphosphocytidyl-2C-methyl-D-erythritol from CTP and 2C-methyl-D-erythritol 4-phosphate. This mevalonate independent pathway that utilizes pyruvate and glyceraldehydes 3-phosphate as starting materials for production of IPP occurs in a variety of bacteria, archaea and plant cells, but is absent in mammals. Thus, CDP-ME synthetase is  an attractive targets for the structure-based design of selective antibacterial, herbicidal and antimalarial drugs.
Probab=24.34  E-value=58  Score=14.39  Aligned_cols=48  Identities=15%  Similarity=0.341  Sum_probs=21.7

Q ss_pred             HHHHHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECC-CHHH
Q ss_conf             9999875988999867567247468999999999985189717998457-2457
Q 537021.9.peg.3   18 PFVQFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGG-GPQI   70 (150)
Q Consensus        18 pYi~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGg-g~qI   70 (150)
                      .+.+.+..+.+.+-.||..    . .+++..-+..+....-..|+||-+ +|.+
T Consensus        61 ~~~~~~~~~~i~~v~GG~t----R-~~SV~ngl~~l~~~~~~~VlIHDaaRP~i  109 (218)
T cd02516          61 ELAKYGLSKVVKIVEGGAT----R-QDSVLNGLKALPDADPDIVLIHDAARPFV  109 (218)
T ss_pred             HHHHHCCCCCEEEECCCCC----H-HHHHHHHHHHHCCCCCCEEEEECCCCCCC
T ss_conf             9886447987699899840----9-99999898740126999899925767789


No 172
>PRK05802 hypothetical protein; Provisional
Probab=24.21  E-value=58  Score=14.37  Aligned_cols=73  Identities=21%  Similarity=0.113  Sum_probs=50.2

Q ss_pred             CCCHHHH--HHHHHHHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCH----HHHHHHHHCCCCCE
Q ss_conf             0622899--99999987598899986756724746899999999998518971799845724----57889997089811
Q 537021.9.peg.3    9 QAEILEQ--VLPFVQFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGP----QIGAVLEKMGIKSK   82 (150)
Q Consensus         9 ~a~~l~e--a~pYi~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~----qI~~~l~~~gi~~~   82 (150)
                      |+|....  -++||..-++...+|-..|=.+. |     .+.-|..|.+-|-++.++-..||    .|++.|++.+++.-
T Consensus       155 RGPywNGifGlk~I~~~kn~k~LvIaRGIgqA-P-----~vpVikkL~~n~NkV~vIid~~~~~~~fI~eyl~~~~~~~i  228 (328)
T PRK05802        155 RGPYWNGVFGLKNIKSTKNGKSLVIARGIGQA-P-----AVPVIKKLYSNGNKVIVILDKGPFENNFIKEYLEEYNIEVI  228 (328)
T ss_pred             ECCCCCCCCCHHHHHHCCCCEEEEEECCCCCC-C-----CHHHHHHHHHCCCEEEEEECCCCCCCHHHHHHHHHCCCEEE
T ss_conf             67742440370567505797399993562325-5-----55999999868997999985885531659999985783699


Q ss_pred             EECCC
Q ss_conf             55783
Q 537021.9.peg.3   83 FENGL   87 (150)
Q Consensus        83 ~v~G~   87 (150)
                      -.+=+
T Consensus       229 ~~~~l  233 (328)
T PRK05802        229 ECNLL  233 (328)
T ss_pred             EECCC
T ss_conf             83142


No 173
>PRK09181 aspartate kinase; Validated
Probab=23.91  E-value=59  Score=14.34  Aligned_cols=39  Identities=26%  Similarity=0.356  Sum_probs=21.4

Q ss_pred             CEEEEEECHHHHCCHHHHHHHHHHHHHHHHCC---CC-EEEEECCC
Q ss_conf             88999867567247468999999999985189---71-79984572
Q 537021.9.peg.3   26 ETIVVKYGGHVMNCTDLSKDFVNDIALLKKSN---IT-PVIVHGGG   67 (150)
Q Consensus        26 ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G---~k-~ViVHGgg   67 (150)
                      ...|-||||+-+.|.+   .+.+.|..-...+   .+ +|+|=--|
T Consensus         3 m~~V~KFGGTSVa~~e---~i~~~iii~~~~~~~~~~~vVVVSAmg   45 (476)
T PRK09181          3 MHTVEKIGGTSMSAFD---AVLDNIILRPRKGKDLYNRIFVVSAYG   45 (476)
T ss_pred             CEEEEEECCCCCCCHH---HHHHHHHHHHCCCCCCCCEEEEECCCC
T ss_conf             5199954866302299---999998654201356898899988999


No 174
>pfam02579 Nitro_FeMo-Co Dinitrogenase iron-molybdenum cofactor. This family contains several NIF (B, Y and X) proteins which are iron-molybdenum cofactors (FeMo-co) in the dinitrogenase enzyme which catalyses the reduction of dinitrogen to ammonium. Dinitrogenase is a hetero-tetrameric (alpha(2)beta(2)) enzyme which contains the iron-molybdenum cofactor (FeMo-co) at its active site.
Probab=23.77  E-value=59  Score=14.32  Aligned_cols=32  Identities=44%  Similarity=0.473  Sum_probs=27.3

Q ss_pred             HHHHCCCCEEEEECCCHHHHHHHHHCCCCCEE
Q ss_conf             98518971799845724578899970898115
Q 537021.9.peg.3   52 LLKKSNITPVIVHGGGPQIGAVLEKMGIKSKF   83 (150)
Q Consensus        52 ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~   83 (150)
                      .|...|..+|++.+-|+..-+.|++.||++-.
T Consensus        48 ~l~~~~~~~vi~~~iG~~a~~~L~~~gI~v~~   79 (94)
T pfam02579        48 LLALEGVDAVIVGGIGPNALAKLEAAGIKVYK   79 (94)
T ss_pred             HHHHCCCCEEEECCCCHHHHHHHHHCCCEEEE
T ss_conf             99976998999687598899999988999999


No 175
>KOG0356 consensus
Probab=23.71  E-value=59  Score=14.31  Aligned_cols=82  Identities=23%  Similarity=0.253  Sum_probs=45.1

Q ss_pred             HHHHHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHH-HH
Q ss_conf             99998759889998675672474689999999999851897179984572457889997089811557834661248-99
Q 537021.9.peg.3   18 PFVQFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQT-AE   96 (150)
Q Consensus        18 pYi~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~t-l~   96 (150)
                      ..+.++.+-+.||++||+--..-+-..+=++|-----..++..=+|.|||...=.....+.--+.    .+-++... .+
T Consensus       381 er~a~ls~gvavi~vGg~se~Ev~ekk~rv~dalnat~aaveeGivpGGG~all~~~~~l~~lk~----~~~~~~~~G~~  456 (550)
T KOG0356         381 ERLAKLSGGVAVIKVGGHSEVEVNEKKDRVEDALNATRAAVEEGIVPGGGTALLRAIPVLDELKA----TNNFDQKVGVE  456 (550)
T ss_pred             HHHHHHCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHCCC----CCCHHHHHHHH
T ss_conf             88987527817999547541025443325889888888775158556760788987655510466----65067665099


Q ss_pred             HHHHHHH
Q ss_conf             9999974
Q 537021.9.peg.3   97 VVEMVLA  103 (150)
Q Consensus        97 iv~~vL~  103 (150)
                      +++.+|.
T Consensus       457 iv~~Al~  463 (550)
T KOG0356         457 IVKKALR  463 (550)
T ss_pred             HHHHHHH
T ss_conf             9999986


No 176
>PRK13561 putative phosphodiesterase; Provisional
Probab=23.42  E-value=60  Score=14.28  Aligned_cols=63  Identities=13%  Similarity=0.128  Sum_probs=29.0

Q ss_pred             CCHHHHHHHHHHHHC--CCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEE-CCCHHHHHHH
Q ss_conf             622899999999875--9889998675672474689999999999851897179984-5724578899
Q 537021.9.peg.3   10 AEILEQVLPFVQFYE--NETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVH-GGGPQIGAVL   74 (150)
Q Consensus        10 a~~l~ea~pYi~~~r--~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVH-Ggg~qI~~~l   74 (150)
                      ..++......+++|+  -..+++-+.-+++.+.  .+....-+..|+.+|+++-|=+ |.|..-=..|
T Consensus       500 ~~f~~~l~~~l~~~~~~p~~L~lEItEs~~~~~--~~~~~~~l~~Lr~~Gv~ialDDFGtGySSL~yL  565 (651)
T PRK13561        500 PNMVADMLELLTRYRIQPGTLILEVTESRRIDD--PHAAVAILRPLRNAGVRIALDDFGMGYAGLRQL  565 (651)
T ss_pred             CCHHHHHHHHHHHHCCCHHHEEEEECCCHHHCC--HHHHHHHHHHHHHCCCEEEEECCCCCHHHHHHH
T ss_conf             719999999999849684539999779324349--999999999999789999998999964209998


No 177
>TIGR03343 biphenyl_bphD 2-hydroxy-6-oxo-6-phenylhexa-2,4-dienoate hydrolase. Members of this family are 2-hydroxy-6-oxo-6-phenylhexa-2,4-dienoate hydrolase, or HOPD hydrolase, the BphD protein of biphenyl degradation. BphD acts on the product of ring meta-cleavage by BphC. Many species carrying bphC and bphD are capable of degrading polychlorinated biphenyls as well as biphenyl itself.
Probab=23.29  E-value=30  Score=16.16  Aligned_cols=65  Identities=12%  Similarity=0.106  Sum_probs=31.4

Q ss_pred             CHHHHHHHHHHHHCCCEEEEEECHHHHCCH-----HHHHHHHHHHH-HHHHCCCCEEEE--ECCCHHHHHHHHH
Q ss_conf             228999999998759889998675672474-----68999999999-985189717998--4572457889997
Q 537021.9.peg.3   11 EILEQVLPFVQFYENETIVVKYGGHVMNCT-----DLSKDFVNDIA-LLKKSNITPVIV--HGGGPQIGAVLEK   76 (150)
Q Consensus        11 ~~l~ea~pYi~~~r~ktiVIK~GG~~l~d~-----~l~~~~~~dI~-ll~~~G~k~ViV--HGgg~qI~~~l~~   76 (150)
                      .++++..|+..+ +-..+++=+.|.=-.+.     ......++|+. ++..+|+.-+.+  |--|-.|-..+..
T Consensus        48 ~~~~~l~~~~~~-~yrVIa~Dl~G~G~S~~~~~~~~~~~~~a~~~~~lld~L~i~~~~lvG~S~GG~iA~~~A~  120 (282)
T TIGR03343        48 NYYRNIGPFVDA-GYRVILKDSPGFNKSDAVVMDEQRGLVNARAVKGLMDALDIEKAHLVGNSMGGATALNFAL  120 (282)
T ss_pred             HHHHHHHHHHHC-CCEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCEEEEEECCHHHHHHHHHH
T ss_conf             999977999978-9859997189988899876555324799999999999749983799973737999999999


No 178
>PTZ00286 6-phospho-1-fructokinase; Provisional
Probab=23.17  E-value=61  Score=14.25  Aligned_cols=82  Identities=16%  Similarity=0.273  Sum_probs=47.2

Q ss_pred             HHHHHHCCCCEEEEECCCH------HHHHHHHHCCCC------CEEE-CCCCCCCH-----HHHHHHHHHHHHHHHHHHH
Q ss_conf             9998518971799845724------578899970898------1155-78346612-----4899999997416689999
Q 537021.9.peg.3   50 IALLKKSNITPVIVHGGGP------QIGAVLEKMGIK------SKFE-NGLRITDQ-----QTAEVVEMVLAGSINKKIV  111 (150)
Q Consensus        50 I~ll~~~G~k~ViVHGgg~------qI~~~l~~~gi~------~~~v-~G~RVTd~-----~tl~iv~~vL~G~vn~~lv  111 (150)
                      +..|...|+..+++-||--      .|.+..++.|+.      |+.+ |.+-+||.     .+.+.+..+    ++..=+
T Consensus       180 Vd~L~~~gId~LfvIGGDGTlrgA~~I~~e~~~Rgl~I~VVGIPKTIDNDI~~~D~TFGFdTAVe~AteA----I~~ah~  255 (486)
T PTZ00286        180 VDTLVRMKINILFTVGGDGTQRGALKIYEEAKRRGENIAVFGVPKTIDNDLAFSHRTFGFQTAVEQAVNA----VRAAYA  255 (486)
T ss_pred             HHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCHHCCCCHHHHHHHHHHH----HHHHHH
T ss_conf             9999985999899978846688999999999971788506646854357641001002402399999999----999988


Q ss_pred             HHHH-HCCCCEEEEEEEECCEEEEE
Q ss_conf             9998-28997687786558989999
Q 537021.9.peg.3  112 SLIN-QTGTQAIGICGKDGNMVFAE  135 (150)
Q Consensus       112 ~~L~-~~g~~avglsg~dg~li~a~  135 (150)
                      ++-+ .+|+--+-+-|.+.+||.+.
T Consensus       256 EA~sa~nGIGlVkLMGR~SGfIA~~  280 (486)
T PTZ00286        256 EAVSLNYGVGIVKLMGRESGFIAAQ  280 (486)
T ss_pred             HHHCCCCCEEEEEECCCCCCHHHHH
T ss_conf             7523568518999725551289988


No 179
>pfam01075 Glyco_transf_9 Glycosyltransferase family 9 (heptosyltransferase). Members of this family belong to glycosyltransferase family 9. Lipopolysaccharide is a major component of the outer leaflet of the outer membrane in Gram-negative bacteria. It is composed of three domains; lipid A, Core oligosaccharide and the O-antigen. All of these enzymes transfer heptose to the lipopolysaccharide core.
Probab=22.96  E-value=61  Score=14.22  Aligned_cols=114  Identities=17%  Similarity=0.201  Sum_probs=59.1

Q ss_pred             HHHHHHHHHHHHCCCEEEEEECHHHHCCHH-HHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHH-C--CCCCEEECCC
Q ss_conf             289999999987598899986756724746-89999999999851897179984572457889997-0--8981155783
Q 537021.9.peg.3   12 ILEQVLPFVQFYENETIVVKYGGHVMNCTD-LSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEK-M--GIKSKFENGL   87 (150)
Q Consensus        12 ~l~ea~pYi~~~r~ktiVIK~GG~~l~d~~-l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~-~--gi~~~~v~G~   87 (150)
                      ....+..+......+.+||..|++- .... -.+.|++=+..|..-|.++|+ .||.++.++...+ .  +.+.+.++=.
T Consensus        94 ~~~~~~~~~~~~~~~~i~i~pga~~-~~K~Wp~e~f~~L~~~l~~~~~~vvl-~gg~~~~e~~~~~~i~~~~~~~~~~l~  171 (249)
T pfam01075        94 ERAAALDKAGNGGRPYIAFCPGASR-PAKRWPAEHFAELAQALQERGWQVVL-FGGAEDREEEIAKRLAAGLRTPCVNLL  171 (249)
T ss_pred             HHHHHHHHHHCCCCCEEEEECCCCC-CCCCCCHHHHHHHHHHHHHCCCCEEE-EECCCHHHHHHHHHHHHHCCCCEEECC
T ss_conf             9999998620259998999738788-56779999999999999966995699-738678999999999863899868626


Q ss_pred             -CCCCHHHHHHHHHH--HHHHHHHHHHHHHHHCCCCEEEEEEEE
Q ss_conf             -46612489999999--741668999999982899768778655
Q 537021.9.peg.3   88 -RITDQQTAEVVEMV--LAGSINKKIVSLINQTGTQAIGICGKD  128 (150)
Q Consensus        88 -RVTd~~tl~iv~~v--L~G~vn~~lv~~L~~~g~~avglsg~d  128 (150)
                       +.+=.|...++..+  +.| .-......-...|.+.++|=|..
T Consensus       172 g~~sL~el~ali~~a~l~I~-nDSGp~HiAaA~g~Pti~ifGpT  214 (249)
T pfam01075       172 GKTSLEQAAALLAGADLVVG-NDSGLMHLAAALDRPVIGLYGPT  214 (249)
T ss_pred             CCCCHHHHHHHHHHCCEEEE-CCCHHHHHHHHCCCCEEEEECCC
T ss_conf             99999999999985066885-79859999998399889997889


No 180
>PRK05569 flavodoxin; Provisional
Probab=22.93  E-value=61  Score=14.22  Aligned_cols=70  Identities=14%  Similarity=0.332  Sum_probs=42.6

Q ss_pred             CEEEEEECHHHHCCH----HHHHHHHHHHHHHHHCCCCEEEEEC-----CCH---HHHHHHHHCCCCCEEECCCCC---C
Q ss_conf             889998675672474----6899999999998518971799845-----724---578899970898115578346---6
Q 537021.9.peg.3   26 ETIVVKYGGHVMNCT----DLSKDFVNDIALLKKSNITPVIVHG-----GGP---QIGAVLEKMGIKSKFENGLRI---T   90 (150)
Q Consensus        26 ktiVIK~GG~~l~d~----~l~~~~~~dI~ll~~~G~k~ViVHG-----gg~---qI~~~l~~~gi~~~~v~G~RV---T   90 (150)
                      +.++  +|-.++.+.    .-+..|++++..+..-| |++.++|     +|.   ++.+.|+.+|++.  ++.+++   -
T Consensus        50 d~i~--~GspT~~~~~~~~~~~~~~l~~l~~~~~~~-K~~a~FGSyGW~~Ge~~~~~~~~l~~~g~~v--v~~~~~~~~P  124 (141)
T PRK05569         50 DAVA--FGSPSMDNNNIEQEEMAPFLDQFKLTPNEN-KKCILFGSYGWDNGEFMKLWKDRMKDYGFNV--IGDLAVNESP  124 (141)
T ss_pred             CEEE--EECCCCCCCCCCHHHHHHHHHHHHCCCCCC-CEEEEEECCCCCCCHHHHHHHHHHHHCCCEE--ECEEEEEECC
T ss_conf             9059--905455888777176999999985345589-8899996677886469999999999789848--2405896188


Q ss_pred             CHHHHHHHHH
Q ss_conf             1248999999
Q 537021.9.peg.3   91 DQQTAEVVEM  100 (150)
Q Consensus        91 d~~tl~iv~~  100 (150)
                      |++.++-.+.
T Consensus       125 ~~e~l~~c~e  134 (141)
T PRK05569        125 NKEELNSAKE  134 (141)
T ss_pred             CHHHHHHHHH
T ss_conf             9899999999


No 181
>TIGR01250 pro_imino_pep_2 proline-specific peptidases; InterPro: IPR005945   Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes . They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence . Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases .   Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base . The geometric orientations of the catalytic residues are similar between families, despite different protein folds . The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) , .   Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins.   Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule.    This group of serine peptidase belong to MEROPS peptidase family S33 (clan SC). They are proline iminopeptidase (Prolyl aminopeptidase, 3.4.11.5 from EC), which catalyzes the removal of the N-terminal proline from peptides. This family represents one of two related families of proline iminopeptidase containing the alpha/beta fold. The fine specificities of the various members, including both the range of short peptides from which proline can be removed and whether other amino acids such as alanine can be also removed, may vary among members.    One of the members of this family is the tricorn protease (TRI) interacting factor 1 from Thermoplasma acidophilum. Factor 1 (F1) is a 33.5 kDa serine peptidase of the alpha/beta-hydrolase family. Tricorn generates small peptides, which are cleaved by F1 to yield single amino acids , . ; GO: 0016804 prolyl aminopeptidase activity, 0005737 cytoplasm.
Probab=22.71  E-value=44  Score=15.11  Aligned_cols=30  Identities=27%  Similarity=0.603  Sum_probs=19.3

Q ss_pred             CCCEEEEECCCHH--------HHHHHHH-CCCCCEEECCC
Q ss_conf             9717998457245--------7889997-08981155783
Q 537021.9.peg.3   57 NITPVIVHGGGPQ--------IGAVLEK-MGIKSKFENGL   87 (150)
Q Consensus        57 G~k~ViVHGgg~q--------I~~~l~~-~gi~~~~v~G~   87 (150)
                      +++++++||| |-        ..+.+++ .|++.-+.|-+
T Consensus        25 ~~~Ll~lHGG-PG~~H~Yl~~l~~~~~~~~G~~V~~YDQL   63 (302)
T TIGR01250        25 KIKLLVLHGG-PGSSHEYLENLKELLKEDLGREVIMYDQL   63 (302)
T ss_pred             CCEEEEEECC-CCCCHHHHHHHHHHHHHCCCCEEEEEECC
T ss_conf             9218997088-98874237999999764078279998324


No 182
>TIGR00963 secA preprotein translocase, SecA subunit; InterPro: IPR000185   Secretion across the inner membrane in some Gram-negative bacteria occurs via the preprotein translocase pathway. Proteins are produced in the cytoplasm as precursors, and require a chaperone subunit to direct them to the translocase component. . From there, the mature proteins are either targeted to the outer membrane, or remain as periplasmic proteins. The translocase protein subunits are encoded on the bacterial chromosome.     The translocase itself comprises 7 proteins, including a chaperone protein (SecB), an ATPase (SecA), an integral membrane complex (SecCY, SecE and SecG), and two additional membrane proteins that promote the release of the mature peptide into the periplasm (SecD and SecF) . The chaperone protein SecB  is a highly acidic homotetrameric protein that exists as a "dimer of dimers" in the bacterial cytoplasm. SecB maintains preproteins in an unfolded state after translation, and targets these to the peripheral membrane protein ATPase SecA for secretion .   SecA is a cytoplasmic protein of 800 to 960 amino acid residues. Homologs of secA are also encoded in the chloroplast genome of some algae  as well as in the nuclear genome of plants . It could be involved in the intraorganellar protein transport into thylakoids.; GO: 0005524 ATP binding, 0006605 protein targeting, 0006886 intracellular protein transport.
Probab=22.62  E-value=62  Score=14.18  Aligned_cols=45  Identities=22%  Similarity=0.330  Sum_probs=34.1

Q ss_pred             HHHHHHHHHHHHHHHHCCCCEEEEECCCH----HHHHHHHHCCCCCEEEC
Q ss_conf             46899999999998518971799845724----57889997089811557
Q 537021.9.peg.3   40 TDLSKDFVNDIALLKKSNITPVIVHGGGP----QIGAVLEKMGIKSKFEN   85 (150)
Q Consensus        40 ~~l~~~~~~dI~ll~~~G~k~ViVHGgg~----qI~~~l~~~gi~~~~v~   85 (150)
                      ..-+.+++.||..+|..| +||||==..=    -++++|++.||+..-.|
T Consensus       418 ~~Kw~Av~~e~~~~h~~G-qPvLvGT~svE~SE~LS~lL~~~~I~H~VLN  466 (904)
T TIGR00963       418 EEKWKAVVDEIKEIHAKG-QPVLVGTTSVEKSELLSKLLKERGIPHNVLN  466 (904)
T ss_pred             HHHHHHHHHHHHHHHHCC-CCEEEEECCHHHHHHHHHHHHHCCCCCCCCC
T ss_conf             889999999999987468-9877752217778999999985789812247


No 183
>cd03466 Nitrogenase_NifN_2 Nitrogenase_nifN_2: A subgroup of the NifN subunit of the NifEN complex: NifN forms an alpha2beta2 tetramer with NifE.  NifN and nifE are structurally homologous to nitrogenase MoFe protein beta and alpha subunits respectively.  NifEN participates in the synthesis of the iron-molybdenum cofactor (FeMoco) of the MoFe protein.  NifB-co (an iron and sulfur containing precursor of the FeMoco) from NifB is transferred to the NifEN complex where it is further processed to FeMoco. The nifEN bound precursor of FeMoco has been identified as a molybdenum-free, iron- and sulfur- containing analog of FeMoco. It has been suggested that this nifEN bound precursor also acts as a cofactor precursor in nitrogenase systems which require a cofactor other than FeMoco: i.e. iron-vanadium cofactor (FeVco) or iron only cofactor (FeFeco). This group also contains the Clostidium fused NifN-NifB protein.
Probab=22.55  E-value=62  Score=14.17  Aligned_cols=105  Identities=14%  Similarity=0.233  Sum_probs=49.3

Q ss_pred             CHHHHHCCCHHHHHHHHHHHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEE-ECCC-----HHHHHHHHH
Q ss_conf             0023220622899999999875988999867567247468999999999985189717998-4572-----457889997
Q 537021.9.peg.3    3 EKIYQFQAEILEQVLPFVQFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIV-HGGG-----PQIGAVLEK   76 (150)
Q Consensus         3 ~~~~~~~a~~l~ea~pYi~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViV-HGgg-----~qI~~~l~~   76 (150)
                      +++.++++.++...++|-+...||.++|--+++.      .-.+.   .+|..+|+.|++| -|..     ..+.+.++.
T Consensus       278 e~i~~er~r~~d~~~d~~~~~~gkr~ai~~~~~~------~~~~~---~~l~ElG~~~~~v~t~~~~~~~~~~~~~~~~~  348 (429)
T cd03466         278 EKYTRERGRLLDAMIDAHKYNFGRKAAIYGEPDF------VVAIT---RFVLENGMVPVLIATGSESKKLKEKLEEDLKE  348 (429)
T ss_pred             HHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCHH------HHHHH---HHHHHCCCEEEEEEECCCCHHHHHHHHHHHHH
T ss_conf             9999999999999999865035867999957366------99999---99998799789998389867789999999974


Q ss_pred             CCCCCEEECCCCCCCHHHHH-HHHHH-----HHHHHHHHHHHHHHHCCCCEE
Q ss_conf             08981155783466124899-99999-----741668999999982899768
Q 537021.9.peg.3   77 MGIKSKFENGLRITDQQTAE-VVEMV-----LAGSINKKIVSLINQTGTQAI  122 (150)
Q Consensus        77 ~gi~~~~v~G~RVTd~~tl~-iv~~v-----L~G~vn~~lv~~L~~~g~~av  122 (150)
                      .+.++.-+++   +|...++ .++..     +.+.-.+.+...+   |++-+
T Consensus       349 ~~~~~~v~~~---~d~~el~~~i~~~~~dliig~s~~k~iA~kl---giP~l  394 (429)
T cd03466         349 YVEKCVILDG---ADFFDIESYAKELKIDVLIGNSYGRRIAEKL---GIPLI  394 (429)
T ss_pred             CCCCCEEEEC---CCHHHHHHHHHHCCCCEEEECCCHHHHHHHH---CCCEE
T ss_conf             5898489828---9999999999765999999785268999980---99979


No 184
>KOG1342 consensus
Probab=22.48  E-value=62  Score=14.16  Aligned_cols=57  Identities=19%  Similarity=0.385  Sum_probs=43.7

Q ss_pred             HHHHHHHHCCCEEEEEECHHHHCCHHH------HHHHHHHHHHHHHCCCCEEEEECCCHHHHH
Q ss_conf             999999875988999867567247468------999999999985189717998457245788
Q 537021.9.peg.3   16 VLPFVQFYENETIVVKYGGHVMNCTDL------SKDFVNDIALLKKSNITPVIVHGGGPQIGA   72 (150)
Q Consensus        16 a~pYi~~~r~ktiVIK~GG~~l~d~~l------~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~   72 (150)
                      ..|-+..|+-+-+|+-+|++-+..+.+      .....+-+...++.++.+.+|=|||--+-.
T Consensus       244 i~~v~e~f~P~AiVLQCGaDSL~gDRlgcFnLsi~Gh~~Cv~fvksfn~pllvlGGGGYT~rN  306 (425)
T KOG1342         244 ISKVMERFQPEAIVLQCGADSLAGDRLGCFNLSIKGHAECVKFVKSFNLPLLVLGGGGYTLRN  306 (425)
T ss_pred             HHHHHHHHCCCEEEEECCCCCCCCCCCCCEEECCHHHHHHHHHHHHCCCCEEEECCCCCCCHH
T ss_conf             999998749764999737765567755500303164899999999739867997478655134


No 185
>cd03148 GATase1_EcHsp31_like Type 1 glutamine amidotransferase (GATase1)-like domain found in Escherichia coli Hsp31 protein (EcHsp31). Type 1 glutamine amidotransferase (GATase1)-like domain found in Escherichia coli Hsp31 protein (EcHsp31).  This group includes proteins similar to EcHsp31.  EcHsp31 has chaperone activity.  EcHsp31 coordinates a metal ion using a 2-His-1-carboxylate motif present in various ions that use iron as a cofactor such as Carboxypeptidase A.   The catalytic triad typical of GATase1 domains is not conserved in this GATase1-like domain. However, in common with a typical GATase1domain, a reactive Cys residue is found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow. This Cys together with a different His and, an Asp (rather than a Glu) residue form a different catalytic triad from the typical GATase1 domain.  EcHsp31 is a homodimer.
Probab=22.29  E-value=63  Score=14.14  Aligned_cols=100  Identities=18%  Similarity=0.196  Sum_probs=53.5

Q ss_pred             EEEECHHH-HC----CHHHHHHHHHHHHHHHHCCCC-EEEEECCCHHHHHHHHHCCCCCEEECCCCCCCH-----HHHHH
Q ss_conf             99867567-24----746899999999998518971-799845724578899970898115578346612-----48999
Q 537021.9.peg.3   29 VVKYGGHV-MN----CTDLSKDFVNDIALLKKSNIT-PVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQ-----QTAEV   97 (150)
Q Consensus        29 VIK~GG~~-l~----d~~l~~~~~~dI~ll~~~G~k-~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~-----~tl~i   97 (150)
                      |.--||+. |-    |+++ ..++++   .+..|.- --+-||-.--++..+  .+ -...++|.|+|-=     +...+
T Consensus       100 vf~pGGHG~m~Dl~~~~~l-~~ll~~---~~~~gk~vaaVCHGpaaLl~~~~--~~-G~~l~~Gk~vTgFtd~eE~~~~~  172 (232)
T cd03148         100 VFIPGGHGALIGIPESQDV-AAALQW---AIKNDRFVITLCHGPAAFLAARH--GG-GKNPLEGYSVCVFPDSLDEGANI  172 (232)
T ss_pred             EEECCCCCCCCCCCCCHHH-HHHHHH---HHHCCCEEEEECCHHHHHHCCCC--CC-CCCCCCCCEEEECCCHHHHHHHC
T ss_conf             9967997530577579999-999999---99859979998674798740206--79-97121796775028889875301


Q ss_pred             HHHHHHHHHHHHHHHHHHHCCCCEE-----EEEEEECCEEEEE
Q ss_conf             9999741668999999982899768-----7786558989999
Q 537021.9.peg.3   98 VEMVLAGSINKKIVSLINQTGTQAI-----GICGKDGNMVFAE  135 (150)
Q Consensus        98 v~~vL~G~vn~~lv~~L~~~g~~av-----glsg~dg~li~a~  135 (150)
                      -...+.+++-..+...|.+.|.+-.     +-..+|++||+.+
T Consensus       173 ~~~~~~~~~p~~le~~l~~~Ga~~~~~~~~~~vv~Dg~LiTGQ  215 (232)
T cd03148         173 EIGYMPGHLTWLVGEELKKMGMNIINDDITGRVHKDRKLLTGD  215 (232)
T ss_pred             CCCCCCCCCCHHHHHHHHHCCCCEECCCCCCCEEECCCEEECC
T ss_conf             1033753230469999998798162588987668769887798


No 186
>pfam02593 DUF166 Uncharacterized ArCR, COG1810.
Probab=22.27  E-value=63  Score=14.13  Aligned_cols=106  Identities=20%  Similarity=0.326  Sum_probs=64.3

Q ss_pred             CHHHHHHHHHHHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCH-----HHHHHHHHCCCCCEEEC
Q ss_conf             2289999999987598899986756724746899999999998518971799845724-----57889997089811557
Q 537021.9.peg.3   11 EILEQVLPFVQFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGP-----QIGAVLEKMGIKSKFEN   85 (150)
Q Consensus        11 ~~l~ea~pYi~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~-----qI~~~l~~~gi~~~~v~   85 (150)
                      .++.+...|+....+-=++|-|+    .+|++...+++   .++..|.+.|||-+..|     |+.+.+++.|++-.+-.
T Consensus        35 e~Ie~pee~Lp~~~~~Dl~i~y~----lhPDl~~~l~~---~~~e~g~kalIvp~~~~~g~~~~lk~~~e~~g~~~~~P~  107 (215)
T pfam02593        35 EFIEDPEEYLPEIPEADLVIAYG----LHPDLTLELAE---IAAETGIKALIVPAEAPKGLRKGLKEQLEEFGVEVEFPE  107 (215)
T ss_pred             CCCCCHHHHCCCCCCCCEEEEEC----CCCCHHHHHHH---HHHHCCCCEEEEECCCCCCHHHHHHHHHHHCCCEEECCC
T ss_conf             32368788656677876899943----59507999999---998649988997468862128999999997597387787


Q ss_pred             CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEEECCEEEEE
Q ss_conf             83466124899999997416689999999828997687786558989999
Q 537021.9.peg.3   86 GLRITDQQTAEVVEMVLAGSINKKIVSLINQTGTQAIGICGKDGNMVFAE  135 (150)
Q Consensus        86 G~RVTd~~tl~iv~~vL~G~vn~~lv~~L~~~g~~avglsg~dg~li~a~  135 (150)
                      =        |=    .|--.=|+.+-+-+...|-+-+-+...++.+..++
T Consensus       108 ~--------~C----sL~~~~~p~i~~F~~~FG~P~~ev~v~~~~I~~V~  145 (215)
T pfam02593       108 P--------FC----SLEPVGNPVIDEFAERFGRPELEVEVEDGKIKDVR  145 (215)
T ss_pred             C--------CC----CCCCCCCHHHHHHHHHHCCCEEEEEECCCEEEEEE
T ss_conf             6--------55----78898886699999981996699997499499998


No 187
>cd01425 RPS2 Ribosomal protein S2 (RPS2), involved in formation of the translation initiation complex, where it might contact the messenger RNA and several components of the ribosome. It has been shown that in Escherichia coli RPS2 is essential for the binding of ribosomal protein S1 to the 30s ribosomal subunit. In humans, most likely in all vertebrates, and perhaps in all metazoans, the protein also functions as the 67 kDa laminin receptor (LAMR1 or 67LR), which is formed from a 37 kDa precursor, and is overexpressed in many tumors. 67LR is a cell surface receptor which interacts with a variety of ligands, laminin-1 and others. It is assumed that the ligand interactions are mediated via the conserved C-terminus, which becomes extracellular as the protein undergoes conformational changes which are not well understood. Specifically, a conserved palindromic motif, LMWWML, may participate in the interactions. 67LR plays essential roles in the adhesion of cells to the basement membrane an
Probab=22.13  E-value=63  Score=14.12  Aligned_cols=116  Identities=16%  Similarity=0.182  Sum_probs=69.8

Q ss_pred             CCHHHHHHHHHHHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCC--CCEEECCC
Q ss_conf             6228999999998759889998675672474689999999999851897179984572457889997089--81155783
Q 537021.9.peg.3   10 AEILEQVLPFVQFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGI--KSKFENGL   87 (150)
Q Consensus        10 a~~l~ea~pYi~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi--~~~~v~G~   87 (150)
                      ..|=-...|||-..|+.+-+|-+.=..    ..+...+.=+..+..-| +-|+.-|..++.....++...  ...|++++
T Consensus        13 ~~wnp~m~~yI~g~r~g~~IidL~~T~----~~L~~A~~~i~~~~~~~-g~ILfVgtk~~~~~~v~~~a~~~~~~yv~~r   87 (193)
T cd01425          13 RRWNPKMKPYIYGERNGIHIIDLEKTL----EKLRLALNFIANIAAKG-GKILFVGTKPQAQRAVKKFAERTGSFYVNGR   87 (193)
T ss_pred             CCCCCCCCCHHHCCCCCCEEEEHHHHH----HHHHHHHHHHHHHHHCC-CEEEEEECCHHHHHHHHHHHHHCCCEEECCE
T ss_conf             757987200222451981997799999----99999999999999759-9599997578999999999998399454485


Q ss_pred             C----CCCHHHHHHHHH----------------------------HHHHHHHHHHHHHHHHCCCCEEEEEEEECC
Q ss_conf             4----661248999999----------------------------974166899999998289976877865589
Q 537021.9.peg.3   88 R----ITDQQTAEVVEM----------------------------VLAGSINKKIVSLINQTGTQAIGICGKDGN  130 (150)
Q Consensus        88 R----VTd~~tl~iv~~----------------------------vL~G~vn~~lv~~L~~~g~~avglsg~dg~  130 (150)
                      +    .|...++.--..                            ++.-..++..+...+..|++.+|+...|.+
T Consensus        88 WigG~LTN~~~~~~~i~~~~~~~~~k~~~~~~g~~~~~~~Pdliiv~d~~~~~~ai~Ea~~l~IPvI~i~Dtn~~  162 (193)
T cd01425          88 WLGGTLTNWKTIRKSIKRLKKLEKEKLEKNLGGIKDMFRLPDLVIVLDPRKEHQAIREASKLGIPVIAIVDTNCD  162 (193)
T ss_pred             ECCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCHHHCCCCCCEEEEECCCCCHHHHHHHHHHCCCEEEEECCCCC
T ss_conf             758772488888668998887542256652034010166999899988984168999998618755788508999


No 188
>cd04248 AAK_AK-Ectoine AAK_AK-Ectoine: Amino Acid Kinase Superfamily (AAK), AK-Ectoine; this CD includes the N-terminal catalytic domain of the aspartokinase of the ectoine (1,4,5,6-tetrahydro-2-methyl pyrimidine-4-carboxylate) biosynthetic pathway found in Methylomicrobium alcaliphilum, Vibrio cholerae, and other various halotolerant or halophilic bacteria. Bacteria exposed to hyperosmotic stress accumulate organic solutes called 'compatible solutes'  of which ectoine, a heterocyclic amino acid, is one. Apart from its osmotic function, ectoine also exhibits a protective effect on proteins, nucleic acids and membranes against a variety of stress factors. de novo synthesis of ectoine starts with the phosphorylation of L-aspartate and shares its first two enzymatic steps with the biosynthesis of amino acids of the aspartate family: aspartokinase and L-aspartate-semialdehyde dehydrogenase. The M. alcaliphilum and the V. cholerae aspartokinases are encoded on the ectABCask operon.
Probab=22.11  E-value=64  Score=14.11  Aligned_cols=15  Identities=33%  Similarity=0.395  Sum_probs=10.5

Q ss_pred             EEEEECHHHHCCHHH
Q ss_conf             999867567247468
Q 537021.9.peg.3   28 IVVKYGGHVMNCTDL   42 (150)
Q Consensus        28 iVIK~GG~~l~d~~l   42 (150)
                      -|.||||+-+.|.+.
T Consensus         2 kV~KFGGtSva~~~~   16 (304)
T cd04248           2 TVEKIGGTSMSAFGA   16 (304)
T ss_pred             EEEEECCCCCCCHHH
T ss_conf             798677333258999


No 189
>COG2217 ZntA Cation transport ATPase [Inorganic ion transport and metabolism]
Probab=22.11  E-value=64  Score=14.11  Aligned_cols=80  Identities=23%  Similarity=0.358  Sum_probs=52.4

Q ss_pred             HHHHHH--CCCE-EEEEECHHH----HCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHH-HHHHCCCCCEEECCCCC
Q ss_conf             999987--5988-999867567----247468999999999985189717998457245788-99970898115578346
Q 537021.9.peg.3   18 PFVQFY--ENET-IVVKYGGHV----MNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGA-VLEKMGIKSKFENGLRI   89 (150)
Q Consensus        18 pYi~~~--r~kt-iVIK~GG~~----l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~-~l~~~gi~~~~v~G~RV   89 (150)
                      +.+..+  .|+| +.+-++|..    -..+.+++.-.+-|..|+..|+++++.-|=.+..-+ ..+++|| .+++.+  +
T Consensus       507 ~~~~~~~~~G~t~v~va~dg~~~g~i~~~D~~R~~a~~aI~~L~~~Gi~~~mLTGDn~~~A~~iA~~lGI-d~v~A~--l  583 (713)
T COG2217         507 ERIEALESEGKTVVFVAVDGKLVGVIALADELRPDAKEAIAALKALGIKVVMLTGDNRRTAEAIAKELGI-DEVRAE--L  583 (713)
T ss_pred             HHHHHHHHCCCEEEEEEECCEEEEEEEECCCCCHHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHCCH-HHEECC--C
T ss_conf             4578988549769999999999999997078883499999999987993999859998999999997392-623507--9


Q ss_pred             CCHHHHHHHHH
Q ss_conf             61248999999
Q 537021.9.peg.3   90 TDQQTAEVVEM  100 (150)
Q Consensus        90 Td~~tl~iv~~  100 (150)
                      .+++=.++++.
T Consensus       584 lPedK~~~V~~  594 (713)
T COG2217         584 LPEDKAEIVRE  594 (713)
T ss_pred             CCHHHHHHHHH
T ss_conf             91889999999


No 190
>PRK01158 phosphoglycolate phosphatase; Provisional
Probab=22.10  E-value=64  Score=14.11  Aligned_cols=67  Identities=22%  Similarity=0.219  Sum_probs=44.6

Q ss_pred             CEEEEEECHHHHCCHH-HHHHHHHHHHHHHHCCCCEEEEECCCH-HHHHHHHHCCCCCEEE--CCCCCCCH
Q ss_conf             8899986756724746-899999999998518971799845724-5788999708981155--78346612
Q 537021.9.peg.3   26 ETIVVKYGGHVMNCTD-LSKDFVNDIALLKKSNITPVIVHGGGP-QIGAVLEKMGIKSKFE--NGLRITDQ   92 (150)
Q Consensus        26 ktiVIK~GG~~l~d~~-l~~~~~~dI~ll~~~G~k~ViVHGgg~-qI~~~l~~~gi~~~~v--~G~RVTd~   92 (150)
                      |.++.=+.|..+.+.. ..+...+.|..++..|+++++.-|-.. .+.......+....++  ||-.+.+.
T Consensus         4 Kli~~DlDGTLl~~~~~i~~~~~~ai~~l~~~Gi~v~iaTGR~~~~~~~~~~~~~~~~~~I~~nG~~i~~~   74 (226)
T PRK01158          4 KAIVIDIDGTITDPDRRLHLKAVEAIRKAEKLGVPVILVTGNILCFARAAAILIGISGPVIAENGGVVTFG   74 (226)
T ss_pred             CEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHCCCCCEEEEECCEEEEEC
T ss_conf             38999457776469197199999999999988998999899977889999997099972998388599977


No 191
>cd07042 STAS_SulP_like_sulfate_transporter Sulphate Transporter and Anti-Sigma factor antagonist domain of SulP-like sulfate transporters, plays a role in the function and regulation of the transport activity, proposed general NTP binding function. The SulP family is a large and diverse family of anion transporters, with members from eubacteria, plants, fungi, and mammals. They contain 10 to 14 transmembrane helices which form the catalytic core of the protein and a C-terminal extension, the STAS (Sulphate Transporter and AntiSigma factor antagonist) domain which plays a role in the function and regulation of the transport activity. The STAS domain is found in the C-terminal region of sulphate transporters and bacterial anti-sigma factor antagonists. It has been suggested that this domain may have a general NTP binding function.
Probab=21.05  E-value=67  Score=13.98  Aligned_cols=74  Identities=15%  Similarity=0.229  Sum_probs=52.2

Q ss_pred             CCCHHHHHHHHHHHHCC--CEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEE
Q ss_conf             06228999999998759--8899986756724746899999999998518971799845724578899970898115
Q 537021.9.peg.3    9 QAEILEQVLPFVQFYEN--ETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKF   83 (150)
Q Consensus         9 ~a~~l~ea~pYi~~~r~--ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~   83 (150)
                      -|..|++...-......  +.+|+-+.+-...|....+.|.+=+..++..|+++++. |-.+++-+.|++.|+....
T Consensus        23 ~~~~~~~~i~~~~~~~~~~~~vvld~~~v~~iD~s~~~~L~~~~~~~~~~g~~l~~~-~~~~~~~~~l~~~g~~~~~   98 (107)
T cd07042          23 NAEYFKDRLLRLVDEDPPLKVVILDLSAVNFIDSTAAEALEELVKDLRKRGVELYLA-GLNPQVRELLERAGLLDEI   98 (107)
T ss_pred             EHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEE-CCCHHHHHHHHHCCCHHHC
T ss_conf             799999999998715899729999984998547599999999999999779999998-0987999999986997564


No 192
>COG0648 Nfo Endonuclease IV [DNA replication, recombination, and repair]
Probab=20.98  E-value=67  Score=13.97  Aligned_cols=28  Identities=11%  Similarity=0.226  Sum_probs=23.3

Q ss_pred             HHHHHHHHHHHHHHHCCCCEEEEECCCH
Q ss_conf             6899999999998518971799845724
Q 537021.9.peg.3   41 DLSKDFVNDIALLKKSNITPVIVHGGGP   68 (150)
Q Consensus        41 ~l~~~~~~dI~ll~~~G~k~ViVHGgg~   68 (150)
                      ..++.|.+++.....+|.+.+++|.|-.
T Consensus        84 kS~~~l~~e~~r~~~lG~~~lv~HpG~~  111 (280)
T COG0648          84 KSIERLIDEIDRCEQLGAKLLVFHPGSY  111 (280)
T ss_pred             HHHHHHHHHHHHHHHCCCCEEEECCCCC
T ss_conf             9999999999999974996899777501


No 193
>cd04895 ACT_ACR_1 ACT domain-containing protein which is composed almost entirely of four ACT domain repeats (the "ACR" protein). This CD includes the N-terminal ACT domain, of a novel type of ACT domain-containing protein which is composed almost entirely of four ACT domain repeats (the "ACR" protein). ACR proteins, found only in Arabidopsis and Oryza, as yet, are proposed to function as novel regulatory or sensor proteins in plants. Nine ACR gene products have been described (ACR1-8 in Arabidopsis and OsARC1-9 in Oryza) and are represented in this CD. Members of this CD belong to the superfamily of ACT regulatory domains.
Probab=20.93  E-value=67  Score=13.96  Aligned_cols=19  Identities=26%  Similarity=0.352  Sum_probs=17.1

Q ss_pred             CCCCCCCHHHHHHHHHHHH
Q ss_conf             7834661248999999974
Q 537021.9.peg.3   85 NGLRITDQQTAEVVEMVLA  103 (150)
Q Consensus        85 ~G~RVTd~~tl~iv~~vL~  103 (150)
                      .|..+||+..++.++..|.
T Consensus        51 ~G~Ki~d~~~i~~I~~~L~   69 (72)
T cd04895          51 LGNKLTDDSLIAYIEKSLG   69 (72)
T ss_pred             CCCCCCCHHHHHHHHHHHC
T ss_conf             9999558899999999866


No 194
>PRK01096 deoxyguanosinetriphosphate triphosphohydrolase-like protein; Provisional
Probab=20.58  E-value=68  Score=13.91  Aligned_cols=35  Identities=17%  Similarity=0.203  Sum_probs=30.2

Q ss_pred             HHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCC
Q ss_conf             999999999851897179984572457889997089
Q 537021.9.peg.3   44 KDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGI   79 (150)
Q Consensus        44 ~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi   79 (150)
                      ..+++.++|.|.+| +|=-=|+|-..|++++++.|-
T Consensus       100 ~~lvea~aLaHDlG-HPPFGH~GE~aL~~~~~~~~g  134 (442)
T PRK01096        100 GAIVQAACLAHDIG-NPPFGHAGEEAIREWFQDAAG  134 (442)
T ss_pred             HHHHHHHHHHHHCC-CCCCCCHHHHHHHHHHHHCCC
T ss_conf             78999999886158-997762289999999997257


No 195
>COG4148 ModC ABC-type molybdate transport system, ATPase component [Inorganic ion transport and metabolism]
Probab=20.50  E-value=61  Score=14.25  Aligned_cols=50  Identities=12%  Similarity=0.266  Sum_probs=29.8

Q ss_pred             CCCHHHHHCCCHHHHHHHHHHHHCCCEE--EEEECHHHHCCHHHHHHHHHHHHHHH
Q ss_conf             9600232206228999999998759889--99867567247468999999999985
Q 537021.9.peg.3    1 MTEKIYQFQAEILEQVLPFVQFYENETI--VVKYGGHVMNCTDLSKDFVNDIALLK   54 (150)
Q Consensus         1 ~~~~~~~~~a~~l~ea~pYi~~~r~kti--VIK~GG~~l~d~~l~~~~~~dI~ll~   54 (150)
                      |+|-++-.+-+-=+|.+||+.+.++++=  ++-+.-|.  |+  ...+++.+.+|.
T Consensus       152 mDEPLaSLD~~RK~EilpylERL~~e~~IPIlYVSHS~--~E--v~RLAd~vV~le  203 (352)
T COG4148         152 MDEPLASLDLPRKREILPYLERLRDEINIPILYVSHSL--DE--VLRLADRVVVLE  203 (352)
T ss_pred             ECCCHHHCCCCHHHHHHHHHHHHHHHCCCCEEEEECCH--HH--HHHHHHEEEEEC
T ss_conf             06840323651046778999998762188789993689--99--996542389820


No 196
>PRK10528 multifunctional acyl-CoA thioesterase I and protease I and lysophospholipase L1; Provisional
Probab=20.11  E-value=70  Score=13.85  Aligned_cols=54  Identities=15%  Similarity=0.202  Sum_probs=36.1

Q ss_pred             CCHHHHHHHHHHHHCCCEEEEEECHHHHC----CHHHHHHHHHHHHHHHHCCCCEEEE
Q ss_conf             62289999999987598899986756724----7468999999999985189717998
Q 537021.9.peg.3   10 AEILEQVLPFVQFYENETIVVKYGGHVMN----CTDLSKDFVNDIALLKKSNITPVIV   63 (150)
Q Consensus        10 a~~l~ea~pYi~~~r~ktiVIK~GG~~l~----d~~l~~~~~~dI~ll~~~G~k~ViV   63 (150)
                      +.-|+.--.-+.+|+-..++|.+||+=+.    -+...+|+-+=|..++..|++++|+
T Consensus        57 ~~gL~Rl~~~L~~~~P~~Vii~lGgND~LRg~~~~~~~~NL~~ii~~~~~~g~~vlL~  114 (191)
T PRK10528         57 QQGLARLPALLKQHQPRWVLVELGGNDGLRGFPPQQTEQTLRQIIQDVKAANAQPLLM  114 (191)
T ss_pred             HHHHHHHHHHHHCCCCCEEEEEECCCCHHCCCCHHHHHHHHHHHHHHHHHCCCCEEEE
T ss_conf             4699998988740799989998468434308998999999999999999889978997


Done!